• JoeKrogan@lemmy.world
    English
    arrow-up
    22
    ·
    edit-2
    8 months ago

    Says the android. He’s like that alien in Men in Black pretending to be a human.

    • funkless_eck@sh.itjust.works
      English
      arrow-up
      11
      ·
      8 months ago

      “ooh it’s more advanced but don’t worry- it’s not conscious”

      is as much a marketing tactic as “how it feels to chew 5 gum” or buzzfeedesque “top 10 celebrity mistakes - number 3 will blow your mind”

      it’s a tech product that runs a series of complicated loops against a large series of texts and returns the closest comparison, as it stands it’s never going to be dangerous in and of itself.

      • kromem@lemmy.world
        English
        arrow-up
        3
        ·
        8 months ago

        “it’s a tech product that runs a series of complicated loops against a large series of texts and returns the closest comparison, as it stands it’s never going to be dangerous in and of itself.”

        That’s not how it works. I really don’t get what’s with people these days being so willing to be confidently incorrect. It’s like after the pandemic people just decided that if everyone else was spewing BS from their “gut feelings,” well gosh darnit they could too!

        It uses gradient descent on a large series of texts to build a neural network capable of predicting those texts as accurately as possible.
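
        To make "gradient descent on text" concrete, here is a toy sketch. It is not how any production LLM is trained: the "network" is just a table of next-character scores rather than a transformer, and the names `train_bigram` and `predict_next` are made up for illustration. The point is only that training nudges numeric weights to lower prediction error, rather than looking up the closest stored text.

```python
import math

def train_bigram(text, steps=200, lr=1.0):
    """Fit next-character scores with plain gradient descent (toy example)."""
    vocab = sorted(set(text))
    idx = {ch: i for i, ch in enumerate(vocab)}
    n = len(vocab)
    # Training data: (current char, next char) index pairs.
    pairs = [(idx[a], idx[b]) for a, b in zip(text, text[1:])]
    # logits[i][j]: unnormalized score that char j follows char i.
    logits = [[0.0] * n for _ in range(n)]
    losses = []
    for _ in range(steps):
        grad = [[0.0] * n for _ in range(n)]
        loss = 0.0
        for i, j in pairs:
            row = logits[i]
            m = max(row)  # subtract the max for numerical stability
            exps = [math.exp(v - m) for v in row]
            z = sum(exps)
            probs = [e / z for e in exps]
            loss -= math.log(probs[j])  # cross-entropy for this pair
            # Gradient of cross-entropy w.r.t. logits: probs - one_hot(target).
            for k in range(n):
                grad[i][k] += probs[k] - (1.0 if k == j else 0.0)
        # Gradient descent step on the averaged gradient.
        for i in range(n):
            for k in range(n):
                logits[i][k] -= lr * grad[i][k] / len(pairs)
        losses.append(loss / len(pairs))
    return vocab, logits, losses

def predict_next(vocab, logits, ch):
    """Return the highest-scoring next character after ch."""
    row = logits[vocab.index(ch)]
    return vocab[row.index(max(row))]

vocab, logits, losses = train_bigram("abcabcabc")
print(losses[0] > losses[-1])            # average loss went down over training
print(predict_next(vocab, logits, "a"))  # most likely character after "a"
```

        Nothing here stores or searches the training text at prediction time. All that is left after training is the table of learned numbers, which is the (very small) analogue of a model's weights.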

        How that network actually operates ends up a black box, especially for larger models.

        But research over the past year and a half in simpler toy models has found that there’s a rather extensive degree of abstraction. For example, a small GPT trained only on legal Othello or Chess moves ends up building a virtual representation of the board and tracks “my pieces” and “opponent pieces” on it, despite never being fed anything that directly describes the board or the concept of ‘mine’ vs ‘other’. In fact, in the Chess model, the research found there was even a single vector in the neural network that could be flipped to have the model play well or play like shit regardless of the surrounding moves fed in.

        It’s fairly different from what you seem to think it is. Though I suspect that’s not going to matter to you in the least, as I’ve come to find that explaining transformers to people spouting misinformation about them online has about the same result as a few years ago explaining vaccine research to people spouting misinformation about that.

        • funkless_eck@sh.itjust.works
          English
          arrow-up
          3
          ·
          edit-2
          8 months ago

          I don’t know if saying “it’s not a loop! it’s an iterative process using a series of steps!” is that much of a burn.

          my dude, that’s a loop.

          • Chakravanti@sh.itjust.works
            English
            arrow-up
            2
            ·
            8 months ago

            Well He That Remains came by just to show that everything we experience is always part of a bigger loop. You can fucking kill him and even slam the break; crash to his design of the highest number of alternate dimensions and then some and it won’t stop the loop. 99.99% of the time he’ll be back. We only need to consciously accept the concept of no more than the notion to summon his return. Even if we were to successfully crack the time management mech and undo his manipulation, he’ll be back when we track him down to build another one.

            The Loop is more nature than matter to energy combined. When everything in all of reality would expand infinitely far apart, the whole shebang goes lateral mirror again with a whole new dimension. There is no end to any aspect of reality. Anywhere it would be, turns out it’s “just” “another” Loop Mirror.

    • kromem@lemmy.world
      English
      arrow-up
      1
      ·
      8 months ago

      Exactly. People try to scare us into regulatory capture by talking about paperclip maximizers, while it’s humans and our corporations that are literally making excess shit to the point of human extinction.

      To say nothing of how often theorizing around ‘superintelligence’ imagines the stupidest tendencies of humanity being passed on to it while denying our smartest tendencies as “uniquely human” despite existing models largely already rejecting the projected features and modeling the ‘unique’ ones like empathy.

    • Uriel238 [all pronouns]
      English
      arrow-up
      1
      ·
      8 months ago

      One day, he will be an android and we won’t notice.

      All this time Android-Zuck will tell us we’re totally on the verge of real AGI that can dominate the world, but not yet.

    • xcjs@programming.dev
      English
      arrow-up
      4
      ·
      edit-2
      8 months ago

      I was reflecting on this myself the other day. For all my criticisms of Zuckerberg/Meta (which are very valid), they really didn’t have to release anything concerning LLaMA. They’re practically the only reason we have viable open source weights/models and an engine.

  • kromem@lemmy.world
    English
    arrow-up
    3
    ·
    edit-2
    8 months ago

    It’s not as good as it seems on the surface.

    It is a model squarely in the “fancy autocomplete” category along with GPT-3 and fails miserably at variations of logic puzzles in ways other contemporary models do not.

    It seems that the larger training data set allows for better modeling around the fancy autocomplete parts, but when you scratch below the surface, even similarly sized models like Mistral appear to have developed better underlying critical thinking capacities that are absent here.

    I don’t think it’s a coincidence that Meta’s lead AI researcher is one of the loudest voices criticizing the views around emergent capabilities. There seems to be a degree of self-fulfilling prophecy going on. A lot of useful learnings in the creation of Llama 3, but once other models (e.g. Mistral) also start using extended training my guess is that any apparent advantages to Llama 3 right now are going to go out the window.

  • AutoTL;DR@lemmings.world
    English
    arrow-up
    2
    ·
    8 months ago

    This is the best summary I could come up with:


    Meta launched the latest iteration of its AI chatbot on Thursday with Llama 3, and CEO Mark Zuckerberg says it’s supposed to be really good.

    The new model boasts “state-of-the-art” performance on various industry-standard benchmarks and comes with “improved reasoning,” according to a company blog post.

    “In terms of all of the concerns around the more existential risks, I don’t think that anything at the level of what we or others in the field are working on in the next year is really in the ballpark of those types of risks,” he told the publication.

    It’s one reason Zuckerberg feels that the company can continue making Llama open-source or available for the public or researchers to tinker with.

    If Meta’s model achieves multimodality — meaning the ability to deliver results in various forms of media, including text, images, and video — then that may be a case when the company won’t want to make all aspects of its model open-source, Zuckerberg said.

    “For example, image generation is one that we’re looking at closely. Especially in an election year, is that a net positive thing to do?”


    The original article contains 314 words, the summary contains 186 words. Saved 41%. I’m a bot and I’m open source!