• TheAlbatross@lemmy.blahaj.zone
    link
    fedilink
    arrow-up
    52
    ·
    1 month ago

    Gotta remember that AI isn’t occasionally hallucinating and often recalling information, it’s never recalling information, it’s always hallucinating, just that we say we like some of the hallucinations, so it does those more often.

    • Takapapatapaka
      link
      fedilink
      arrow-up
      8
      ·
      1 month ago

      Well, not sure what you mean by hallucinating, but there are some LLMs (chatgpt3.5-turbo-instruct) that could play chess at a good amateur level (1750 elo), and by studying them, some people found parameters in the model that seemed to represent the current state of the chess board (as in, they corresponded to the game, and changing them artificially made the LLM play as if the game as always been that way).

  • FiskFisk33@startrek.website
    link
    fedilink
    arrow-up
    9
    ·
    edit-2
    1 month ago

    A screwdriver beat a hammer in a screw driving competition. well done.

    Maybe it sounds impressive when hammers are super hyped and everyone and their mother is driving their screws with them, but it honestly really isn’t.

  • teft
    link
    fedilink
    arrow-up
    8
    ·
    1 month ago

    TIL that I play chess like an AI model.

    • The Picard ManeuverOP
      link
      fedilink
      arrow-up
      7
      arrow-down
      2
      ·
      1 month ago

      I wouldn’t be surprised if it’s literally zero. I’ve tried with a few LLMs, and they’re all very confident that they know how to play chess, but they just start hallucinating illegal moves immediately.

      • la_scriba@sopuli.xyz
        link
        fedilink
        arrow-up
        2
        ·
        1 month ago

        Immediately? When was the last time you tried? The newer models can hold a game well for 10-20 moves.

        • The Picard ManeuverOP
          link
          fedilink
          arrow-up
          2
          ·
          1 month ago

          A few weeks ago, Gemini got confused when it tried to go first as black multiple times, so that’s the most immediate one I can remember. Last week, chatGPT offered to set up chess puzzles for me, but it made mistakes 3 out of 3 times.

          Maybe I’ll try again. Is there a certain one you’ve seen good performance out of?

          • la_scriba@sopuli.xyz
            link
            fedilink
            arrow-up
            3
            ·
            27 days ago

            DeepSeek is very consistent ime. ChatGPT is hit or miss–sometimes it’s excellent, sometimes it gets really confused and says random stuff. Though DeepSeek has a server reliability problem.

  • Fredselfish
    link
    fedilink
    arrow-up
    3
    arrow-down
    1
    ·
    1 month ago

    To be fair the Atari was built to play games. /s

    But for real wasn’t AI supposed to be bare minimum good at this game. Is this not how were to train them in order to know if they are intelligent or not?