You know how Google’s new feature called AI Overviews is prone to spitting out wildly incorrect answers to search queries? In one instance, AI Overviews told a user to put glue on pizza to make sure the cheese wouldn’t slide off (pssst… please don’t do this).

Well, according to an interview with Google CEO Sundar Pichai published at The Verge earlier this week, just before criticism of the outputs really took off, these “hallucinations” are an “inherent feature” of AI large language models (LLMs), the technology that drives AI Overviews, and this feature “is still an unsolved problem.”

  • RBG (54 points, 4 months ago)

    I’ll let you in on a secret: scientific literature has its fair share of bullshit too. The issue is, it’s much harder to figure out when it’s bullshit. Unless it’s the most blatant horseshit you’ve ever seen scientifically. So while it absolutely makes sense to say “let’s just train these on good sources,” there is no source that is just that. Of course it is still better to do it that way than the way they do it now.

    • @givesomefucks (34 points, 4 months ago)

      The issue is, it’s much harder to figure out when it’s bullshit.

      Google AI suggested you put glue on your pizza because a troll said it on Reddit once…

      Not all scientific literature is perfect, which is one of the many factors that will still make my plan expensive and time-consuming.

      You can’t throw a toddler in a library and expect them to come out knowing everything in all the books.

      AI needs that guided teaching too.

      • @RidcullyTheBrown (-2 points, 4 months ago)

        Google AI suggested you put glue on your pizza because a troll said it on Reddit once…

        Genuine question: do you know that’s what happened? This type of implementation can suggest things like this without it having to be in the training data in that format.
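        Roughly speaking, if the system retrieves web results and feeds them to the model at answer time, the model will happily restate whatever it was handed. A minimal sketch of that idea (all names here are made up for illustration, not Google’s actual pipeline):

        ```python
        # Hypothetical sketch: "search" and "llm" are stand-in callables, not real APIs.
        def answer_with_retrieval(llm, search, question, top_k=3):
            # The search step returns whatever the web ranks highly,
            # which can include an old Reddit joke about glue.
            snippets = search(question, top_k=top_k)
            context = "\n\n".join(snippets)
            prompt = (
                "Answer the question using only the context below.\n\n"
                f"Context:\n{context}\n\n"
                f"Question: {question}\nAnswer:"
            )
            # The model paraphrases the context, so the joke's details
            # (amounts, wording) can show up in the answer even though
            # they were never in the training data in that exact form.
            return llm(prompt)
        ```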

          • @assassin_aragorn (4 points, 4 months ago)

            It’s going to be hilarious to see these companies eventually abandon Reddit because it’s giving them awful results, and then they’re completely fucked

            • @[email protected] (0 points, 3 months ago)

              You’re wrong. Anyone who has ever used Google knows Reddit is an absolute goldmine of valuable information. The problem is it’s also full of jokes and puns and bad information, and AI isn’t able to sort one from the other (yet).

          • @RidcullyTheBrown (-13 points, 4 months ago)

            This doesn’t mean that there are Reddit comments suggesting putting glue on pizza or even eating glue. It just means that the implementation of Google’s LLM is half-baked and built its model in a weird way.

            • @ozymandias117 (7 points, edited, 4 months ago)

              I literally linked you to the Reddit comment, and pointed out that Google’s response used the same measurements as the comment.

              Are you an LLM?

              • @RidcullyTheBrown (-2 points, 4 months ago)

                Oh, hah, sorry! Thanks, I didn’t realise that the Reddit link pointed to the glue thing.

        • @givesomefucks (0 points, 4 months ago)

          Genuine question: do you know that’s what happened?

          Yes

    • @[email protected] (0 points, 4 months ago)

      “Most published journal articles are horseshit, so I guess we should be okay with this too.”

      • Turun (1 point, 4 months ago)

        No, it’s simply contradicting the claim that just training on good sources is possible.

        We literally don’t know how to fix it. We can put on bandaids, like training on “better” data and fine-tuning it to say “I don’t know” half the time. But the fundamental problem is simply not solved yet.
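        As a rough illustration of what such a bandaid looks like (purely hypothetical code, not anything any vendor actually ships), you can make the model abstain whenever its own token probabilities look shaky:

        ```python
        # Hypothetical sketch: "generate" is a stand-in that returns the answer text
        # plus per-token log-probabilities; no real API is implied.
        def answer_or_abstain(generate, prompt, min_avg_logprob=-1.0):
            text, token_logprobs = generate(prompt)
            avg_logprob = sum(token_logprobs) / max(len(token_logprobs), 1)
            # Low confidence -> refuse. But models are often confidently wrong,
            # so this catches some hallucinations and misses plenty of others.
            if avg_logprob < min_avg_logprob:
                return "I don't know."
            return text
        ```

        And it still fails whenever the model is confidently wrong, which is exactly the part nobody has solved.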