• Snot Flickerman
    link
    English
    98
    edit-2
    5 months ago

    OpenAI wants to eat everyone’s lunch. They don’t even respect robots.txt.

    I’m gonna be real, I don’t think an AI company stealing every bit of data I ever put online to ingest into their AI so they can use it to make money really falls under “fair use.”

    If it were individuals, who were doing it for shits and giggles, and not to make a profit, I’d have a different view. Because I kind of don’t care if you’re just some random asshat making an art project and aren’t using it to try make beaucoup bucks on the back of the whole planets intellectual labor.

    Fair Use should be for humans not corporations. Fuck this “corporations are people too” schtick, they aren’t and the fact that they’re taking this data and using it to profit really should be viewed no differently than the argument that people made about The Pirate Bay, that through piracy, they were making money, which is what made it wrong.

    If we keep allowing the rich to just steal from the public commons because of dumb shit like corporations having the same rights as people, we’re barrelling headlong into a Cyberpunk Dystopia, if we’re not there already.

    Humans who are creating art and culture should have rights to Fair Use, not giant fucking corporations using it to profit wildly at the public expense. Copyright is all fucked up and broken, and this is a great example of why, because we’re aiming provisions that were meant to protect the public are now aimed at protecting corporations profits.

    • @QuandaleDingle@lemmy.world
      link
      fedilink
      English
      20
      edit-2
      5 months ago

      This is all exactly what I’ve been thinking, and better parsed too. If “we” don’t guide AI with the law now, we’re screwed.

    • @Fisch@lemmy.ml
      link
      fedilink
      English
      45 months ago

      I don’t like corporations using the data I put online to train their AI either. I’d happily give it to people to train a FOSS AI tho. I’ve also contributed my voice to Mozillas Common Voice project, which is an open dataset that everyone is allowed to use. If it’s something that everyone can use and benefit from equally, I’m happy to help. I’m not happy to help some corporation make even more profits tho. At least not without getting payed.

    • @bionicjoey@lemmy.ca
      link
      fedilink
      English
      105 months ago

      Sort of like if there was a guy named Open Kevin, whose whole thing was walking into people’s homes and eating all of their food.

  • @AVincentInSpace@pawb.social
    link
    fedilink
    English
    24
    edit-2
    5 months ago

    Yeah, they’ve been wanting that for a while. Microsoft and Google both already have AI search products, and the only thing we’ve really learned from those are that AI search products don’t work.

    Also

    Altman is Google’s nightmare that it can’t wake up from.

    Holy mother of dickriding. This article mainly just seems like a list of statements for AI tech bros to parrot to each other.

    • amzd
      link
      fedilink
      75 months ago

      ai search products don’t work

      Idk how you got to that conclusion. I’m trying to change my habit of googling coding problems because phind.com will 9/10 times have an answer faster and more accurate than if I googled it.

  • @nucleative@lemmy.world
    link
    fedilink
    English
    145 months ago

    OpenAI’s existential problem is that they’ll eat their own lunch and then have nothing left. The reason people make useful content now and give it away for free is because they can get paid for the traffic.

    Take that traffic away and all the content goes behind paywalls and login screens where OpenAI can’t touch it.

    • @sunbeam60@lemmy.one
      link
      fedilink
      English
      65 months ago

      But the content has already been absorbed. I wouldn’t be surprised if they have all of it sucked up (many would argue illegally) and stored as a corpus for them to iterate onto. It’s not like they go out to touch all the web every time they train a new version of their model.

        • @QuaternionsRock@lemmy.world
          link
          fedilink
          English
          15 months ago

          One of the craziest facts about GPT (to me) is that it was trained on 570GB of text data. That’s obviously a lot of text, but it’s bewildering to me that I could theoretically store their entire training dataset on my laptop.

  • @jdeath@lemm.ee
    link
    fedilink
    English
    55 months ago

    awesome. google ruined the internet, would be great for them to face an existential threat

  • AutoTL;DRB
    link
    fedilink
    English
    45 months ago

    This is the best summary I could come up with:


    The AI search engine could be a new feature for ChatGPT, or a potentially separate app altogether.

    Microsoft Bing would allegedly power the service from Sam Altman, which could be the most serious threat Google Search has ever faced.

    The report says this new search product could be faster than ChatGPT, without sacrificing its powerful summarizing abilities.

    Now, OpenAI is coming directly for the crown, in an attempt to dethrone the most dominant internet service of all time.

    Perplexity, a small AI search startup with backing from Jeff Bezos, is also competing for Google’s crown.

    If such a small team can steal 10 million people from Google, imagine what OpenAI and Microsoft can do.


    The original article contains 405 words, the summary contains 114 words. Saved 72%. I’m a bot and I’m open source!