• medgremlin@lemmy.sdf.org
    link
    fedilink
    English
    arrow-up
    8
    ·
    1 year ago

    On a related note, I would be very curious to see how something like ChatGPT trained exclusively on works in the public domain would turn out. It would likely have a very different diction and style based on the older source material, but I wonder what other differences there would be.

    • Fredselfish@lemmy.world
      link
      fedilink
      English
      arrow-up
      10
      ·
      edit-2
      1 year ago

      What do they mean train? If by reading then how can that be wrong. But if copying the text and using it as it’s own works that would be wrong.

      After reading the article the authors are fucking stupid. Makes me not want to support their books. If you get mad because AI read you book then they could sue if someone asked me about the authors books and I wrote a description of what I read.

      • ExpensiveConstant@kbin.social
        link
        fedilink
        arrow-up
        4
        ·
        1 year ago

        The problem I have with this view is that AI “reading” a book is not the same as you or I reading. It doesn’t actually learn it’s just predicting the most likely sequence of words to be a response to whatever prompt it receives. In that sense, the words are just data, not actual words. Given how valuable data is in this day and age, I think it makes perfect sense for OpenAI to have to either: only use public domain/authorized works, or pay the creators for their work.