• Asafum@feddit.nl
    link
    fedilink
    English
    arrow-up
    57
    ·
    1 day ago

    “Grok then generated an image of a chess board being flipped over and complained “I only lost because the JEWS own chess!” Elon Musk could not be reached for comment as he’s currently lost in a K hole.”

  • AbouBenAdhem@lemmy.world
    link
    fedilink
    English
    arrow-up
    32
    ·
    1 day ago

    “Up until the semi finals, it seemed like nothing would be able to stop Grok 4 on its way to winning the event,” Pedro Pinhata, a writer for Chess.com, said in its coverage. “Despite a few moments of weakness, X’s AI seemed to be by far the strongest chess player… But the illusion fell through on the last day of the tournament.” He said Grok’s “unrecognizable” and “blundering” play enabled o3 to claim a succession of “convincing wins”.

    I think the main takeaway is that these models are fundamentally inconsistent, and you can never assume they’re going to be reliable based on past performance.

  • Repple (she/her)@lemmy.world
    link
    fedilink
    English
    arrow-up
    11
    ·
    1 day ago

    I haven’t tried in a while, but shortly after gpt4 came out I tried to play chess against it. It just completely changed the board position nearly every move making illegal moves, adding pieces etc. do current models keep track of the board and make legal moves without special prompting to help? Were these assisted by agentic tools handling state?