• _tezz@lemmy.world
    8 months ago

    I was a little curious about this as well: how can you know whether this is dissuading the content scrapers? I’m familiar with robots.txt, but I’d imagine AI models don’t respect that type of thing the same way.
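
For context on the robots.txt comparison: the file is advisory, so it only has an effect when a crawler's author chooses to check it before fetching. Below is a minimal sketch of that check using Python's standard `urllib.robotparser`. The robots.txt body, the `ExampleAIBot` user agent, the `allowed` helper, and the URL are all made up for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt body; "ExampleAIBot" is a made-up user agent.
ROBOTS_TXT = """\
User-agent: ExampleAIBot
Disallow: /

User-agent: *
Allow: /
"""


def allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file as a list of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/post/123"  # placeholder URL
    print(allowed("ExampleAIBot", url))
    print(allowed("SomeOtherBot", url))
```

Nothing in this check is enforced by the server: a scraper that never calls it can still request the page, which is why compliance cannot be verified from the outside.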

    • Cosmic Cleric@lemmy.world
      8 months ago

      I was a little curious about this as well: how can you know whether this is dissuading the content scrapers?

      Ultimately? I don’t. You usually can’t tell whether other people are doing something legally or illegally; you just take it on good faith that they’re not doing anything illegal, just like with any other law on the books.

      I’m familiar with robots.txt, but I’d imagine AI models don’t respect that type of thing the same way.

      Well, it would be up to the owners of the robots that scrape to build the AI models to honor the licensing.

      If they don’t and it ever gets out, that would cause problems for them, so I’m assuming they will. The alternative is to try to scrub the text, and that would be a lot more time-consuming for them (extra steps).

      My hope is that the computer/Linux geeks who are programming those bots are open-source minded and will honor the licensing.

      Either way, I’m doing my part, and assuming the bot owners will do what they are supposed to do as well.

      Anti Commercial-AI license (CC BY-NC-SA 4.0)