Authors using a new tool to search a list of 183,000 books used to train AI are furious to find their works on the list.

  • just another dev
    link
    fedilink
    English
    69 months ago

    No, but the training data does contain a copy. And making a model is not criticising, commenting upon, or creating a parody of it.

    • FaceDeer
      link
      fedilink
      69 months ago

      That list is not exclusive, it’s just a list of examples of fair use.

      The training data is not distributed with the AI model.

      • just another dev
        link
        fedilink
        English
        6
        edit-2
        9 months ago

        it’s just a list of examples of fair use.

        Yes, it’s a list of quite similar ways of commenting upon a work. Please explain how training an LLM is like any of those things, and thus, how Fair use would apply.

        • FaceDeer
          link
          fedilink
          19 months ago

          I’m not saying that training an LLM is like any of those things. I’m saying it doesn’t have to be like those things in order for it to still be fair use.