Google rolled out AI overviews across the United States this month, exposing its flagship product to the hallucinations of large language models.

  • @adam_y@lemmy.world
    link
    fedilink
    English
    1751 month ago

    Can we swap out the word “hallucinations” for the word “bullshit”?

    I think all AI/LLM stuf should be prefaced as “someone down the pub said…”

    So, “someone down the pub said you can eat rocks” or, “someone down the pub said you should put glue on your pizza”.

    Hallucinations are cool, shit like this is worthless.

    • @Eheran@lemmy.world
      link
      fedilink
      English
      791 month ago

      No, hallucination is a really good term. It can be super confident and seemingly correct but still completely made up.

      • @Emmie@lemm.ee
        link
        fedilink
        English
        9
        edit-2
        1 month ago

        You just described entirety of reddit and last I checked we didn’t call that hallucinating

      • @yukijoou
        link
        English
        71 month ago

        for it to “hallucinate” things, it would have to believe in what it’s saying. ai is unable to think - so it cannot hallucinate

        • @Jrockwar@feddit.uk
          link
          fedilink
          English
          101 month ago

          Hallucination is a technical term. Nothing to do with thinking. The scientific community could have chosen another term to describe the issue but hallucination explains really well what’s happening.

          • @yukijoou
            link
            English
            101 month ago

            because it’s a text generation machine…? i mean, i wouldn’t say i can prove it, but i don’t think anyone can prove it’s capable of thinking, much less of reasoning

            like, it can string together a coherent sentence thanks to well crafted equations, sure, but i wouldn’t qualify that as “thinking”, though i guess the definition of “thinking” is debatable

            • DominusOfMegadeus
              link
              fedilink
              English
              11 month ago

              It’s an interesting question. I am inclined to believe that the faster it gets at running those equations, over and over and over, reanalysing is data and responses as it goes, that that ultimately leads to some kind of evolution. You know, Vger style.

            • @Eheran@lemmy.world
              link
              fedilink
              English
              11 month ago

              It can tell you how to stack things on top of each other the best way to get a high tower. Etc.

              Those are not random sentences. If you can not define thinking in a way this machine fails at, then stop saying it does not think.

              • @Aceticon@lemmy.world
                link
                fedilink
                English
                21 month ago

                A parrot can be trained to tell you how to stack things on top of each other the best way to get a high tower.

                This is just an electronic parrot, millions of times faster to train than the biological parrot, specialized in repetition alone (can’t really do anything else a parrot can) and which has been trained on billions of texts.

                You’re confusing one specific form in which humans externally express cogniscence with the actual cogniscence itself: just because intelligence can produce some forms of textual communication doesn’t mean that the relationship holds in the opposite direction and such forms of textual communication require intelligence, or if you will, just because you can photograph a real pizza to get a picture of a pizza doesn’t mean a picture of a pizza is actually of a real pizza and not something with glue to make it look like it has stringy melted cheese.

                • @Eheran@lemmy.world
                  link
                  fedilink
                  English
                  11 month ago

                  Again, it is absolutely capable to come up with it’s own logical stuff, hence my example. Stop saying it just copies existing stuff, that is simply wrong.

                  • @yukijoou
                    link
                    English
                    21 month ago

                    it is absolutely capable to come up with it’s own logical stuff

                    interesting, in my experience, it’s only been good at repeating things, and failing on unexpected inputs - it’s able to answer pretty accurately if a small number is even or odd, but not if it’s a large number, which indicates it’s not reasoning but parroting answers to me

                    do you have example prompts where it showed clear logical reasoning?

    • @Delphia@lemm.ee
      link
      fedilink
      English
      61 month ago

      I want an AI/LLM that has been trained exclusively on the technical documentation and a haynes manual for a make and model of car.

      “Hey AI, how do I change the fuel filter and what tools will I need?”

    • @atrielienz@lemmy.world
      link
      fedilink
      English
      41 month ago

      I don’t even think hallucinations is the right word for this. It’s got a source. It is giving you information from that source. The problem is it’s treating the words at that source as completely factual despite the fact that they are not. Hallucinations from what I’ve read actually is more like when it queries it’s data set, can’t find an answer, and then generates nonsense in order to provide an answer it doesn’t have. Don’t think that’s the same thing.

      • Balder
        link
        fedilink
        English
        3
        edit-2
        1 month ago

        I don’t even think it’s correct to say it’s querying anything, in the sense of a database. An LLM predicts the next token with no regard for the truth (there’s no sense of factual truth during training to penalize it, since that’s a very hard thing to measure).

        Keep in mind that the same characteristic that allows it to learn the language also allows it to sort of come up with facts, it’s just a statistical distribution based on the whole context, which needs a bit randomness so it can be “creative.” So the ability to come up with facts isn’t something LLMs were designed to do, it’s just something we noticed that happens as it learns the language.

        So it learned from a specific dataset, but the measure of whether it will learn any information depends on how well represented it is in that dataset. Information that appears repeatedly in the web is quite easy for it to answer as it was reinforced during training. Information that doesn’t show up much is just not gonna be learned consistently.[1]

        [1] https://youtu.be/dDUC-LqVrPU

        • @atrielienz@lemmy.world
          link
          fedilink
          English
          11 month ago

          I understand the gist but I don’t mean that it’s actively like looking up facts. I mean that it is using bad information to give a result (as in the information it was trained on says 1+1 =5 and so it is giving that result because that’s what the training data had as a result. The hallucinations as they are called by the people studying them aren’t that. They are when the training data doesn’t have an answer for 1+1 so then the LLM can’t do math to say that the next likely word is 2. So it doesn’t have a result at all but it is programmed to give a result so it gives nonsense.

          • Balder
            link
            fedilink
            English
            2
            edit-2
            1 month ago

            Yeah, I think the problem is really that language is ambiguous and the LLMs can get confused about certain features of it.

            For example, I often ask different models when was the Go programming language created just to compare them. Some say 2007 most of the time and some say 2009 — which isn’t all that wrong, as 2009 is when it was officially announced.

            This gives me a hint that LLMs can mix up things that are “close enough” to the concept we’re looking for.