ChatGPT is full of sensitive private information and spits out verbatim text from CNN, Goodreads, WordPress blogs, fandom wikis, Terms of Service agreements, Stack Overflow source code, Wikipedia pages, news blogs, random internet comments, and much more.

  • regbin_@lemmy.world · English · 11 upvotes · 1 year ago

    “leak training data”? What? That’s not how LLMs work. I guess a sensational headline attracts more clicks than a factually accurate one.

    • wahming@monyet.cc · English · 1 upvote · 1 year ago

      Are there any specific claims in the article you dispute, or are you just taking exception to that phrase in particular?