@AbouBenAdhem

AbouBenAdhem@lemmy.world · 1 day ago

“Up until the semi finals, it seemed like nothing would be able to stop Grok 4 on its way to winning the event,” Pedro Pinhata, a writer for Chess.com, said in its coverage. “Despite a few moments of weakness, X’s AI seemed to be by far the strongest chess player… But the illusion fell through on the last day of the tournament.” He said Grok’s “unrecognizable” and “blundering” play enabled o3 to claim a succession of “convincing wins”.

I think the main takeaway is that these models are fundamentally inconsistent, and you can never assume they’re going to be reliable based on past performance.

AbouBenAdhem@lemmy.world · edit-2 4 days ago

The typical pattern for leaders is to get “second opinions” from advisors who tell them whatever they want to hear, so… maybe asking the equivalent of a magic 8 ball is a marginal improvement?

AbouBenAdhem@lemmy.world · edit-2 12 days ago

“Researchers in the field sometimes describe our goal as to pass the ‘Visual Turing Test,’” said Suyeon Choi […] “A visual Turing Test then means, ideally, one cannot distinguish between a physical, real thing as seen through the glasses and a digitally created image being projected on the display surface,” Choi said.

So they just came up with a needlessly opaque synonym of “verisimilitude”.

AbouBenAdhem@lemmy.world · 16 days ago

Doom Quixote.

AbouBenAdhem@lemmy.world · 16 days ago

As a 50-something, I can see the case for putting the “golden age” of the internet between the birth of Wikipedia in 2001 and Facebook in 2006.

AbouBenAdhem@lemmy.world · edit-2 1 month ago

I think it does accurately model the part of the brain that forms predictions from observations—including predictions about what a speaker is going to say next, which lets human listeners focus on the surprising/informative parts. But with LLMs they just keep feeding it its own output as if it were a third party whose next words it’s trying to predict.

It’s like a child describing an imaginary friend, if you keep repeating “And what would your friend say after that?”

AbouBenAdhem@lemmy.world · 2 months ago

IMO the focus should have always been on the potential for AI to produce copyright-violating output, not on the method of training.

AbouBenAdhem@lemmy.world · 2 months ago

Why would the article’s credited authors pass up the chance to improve their own health status and health satisfaction?

AbouBenAdhem@lemmy.world · 2 months ago

Critical paragraph:

Our research highlights the importance of Germany’s unique institutional context, characterized by strong labor protections, extensive union representation, and comprehensive employment legislation. These factors, combined with Germany’s gradual adoption of AI technologies, create an environment where AI is more likely to complement rather than displace worker skills, mitigating some of the negative labor market effects observed in countries like the US.

AbouBenAdhem@lemmy.world · 2 months ago

That makes sense—being raised by ChatGPT might be marginally better than being raised by Sam Altman.

AbouBenAdhem@lemmy.world · edit-2 2 months ago

How does that compare to the growth in size of the overall code base?

AbouBenAdhem@lemmy.world · 2 months ago

I assume it’s because it reduces the possibility of other processes outside of the linked containers accessing the files (so security and stability).

AbouBenAdhem@lemmy.world · 3 months ago

Here’s a list of WP’s templates for adding social media links to articles—looks like they have one for Mastodon.

https://en.wikipedia.org/wiki/Category:Social_media_external_link_templates

AbouBenAdhem@lemmy.world · 3 months ago

CasaOS is not an operating system and more like a GUI for Docker

So it’s more like Portainer?

AbouBenAdhem@lemmy.world · 4 months ago

The current version of Affinity is great and will continue to work forever—there’s no need to switch to an alternative if you’re already using it. I just don’t have much hope for its future development.

AbouBenAdhem@lemmy.world · 4 months ago

Just over a year ago.

AbouBenAdhem@lemmy.world · 4 months ago

I guess technically, Raspbian.

AbouBenAdhem@lemmy.world · edit-2 4 months ago

The Affinity Suite is great, but I’m suspicious of its acquisition by Canva—I’m afraid their solution to “bringing the suite to Linux” will be turning it into a web service.

AbouBenAdhem@lemmy.world · edit-2 4 months ago

One metric you might want to add is the network effect: how much of a difference does it make to the user experience to join a large instance (or the same instance most of your friends are on) compared to a small or self-hosted one? (Or in other words—does the nature of the platform software potentially incentivize consolidation?)

AbouBenAdhem@lemmy.world · 5 months ago

Ok—to the extent that SVG is HTML, the variant of HTML that it is is a flavor of XML.