The AI-focused COPIED Act would make removing digital watermarks illegal (as well as training any kind of AI on copyrighted content)

Grimy@lemmy.world · edit-2 10 months ago

The AI-focused COPIED Act would make removing digital watermarks illegal (as well as training any kind of AI on copyrighted content)

just another dev@lemmy.my-box.dev · 10 months ago

I’m the opposite, actually. I like generative AI. But as a creator who shares his work with the public for their (non-commercial) enjoyment, I am not okay with a billionaire industry training their models on my content without my permission, and then use those models as a money machine.

interdimensionalmeme@lemmy.ml · edit-2 10 months ago

Removed by mod

just another dev@lemmy.my-box.dev · 10 months ago

What are you basing that on?

Content owners, including broadcasters, artists, and newspapers, could sue companies they believe used their materials without permission or tampered with authentication markers.

Doesn’t say anything about the right just applying to giant tech companies, it specifically mentions artists as part of the protected content owners.

interdimensionalmeme@lemmy.ml · edit-2 10 months ago

Removed by mod

just another dev@lemmy.my-box.dev · 10 months ago

I respectfully disagree. I think small time AI (read: pretty much all the custom models on hugging face) will get a giant boost out of this, since they can get away with training on “custom” data sets - since they are too small to be held accountable.

However, those models will become worthless to enterprise level models, since they wouldn’t be able to account for the legality. In other words, once you make big bucks of of AI you’ll have to prove your models were sourced properly. But if you’re just creating a model for small time use, you can get away with a lot.

interdimensionalmeme@lemmy.ml · edit-2 10 months ago

Removed by mod

just another dev@lemmy.my-box.dev · 10 months ago

I don’t think so either, but to me that is the purpose.

Somewhere between small time personal-use ML and commercial exploitation, there should be ethical sourcing of input data, rather than the current method of “scrape all you can find, fuck copyright” that OpenAI & co are getting away with.

interdimensionalmeme@lemmy.ml · edit-2 10 months ago

Removed by mod

just another dev@lemmy.my-box.dev · 10 months ago

Why?

Once this passes, OpenAI can’t build ChatGPT on the same (“stolen”) dataset. How does that cement their position?

Taking someone’s creation (without their permission) and turning it into a commercial venture, without giving payment or even attribution is immoral.

If a creator (in the widest meaning of the word) is fine with their works being used as such - great, go ahead. But otherwise you’ll just have to wait before the work becomes public domain (which obviously does not mean publicly available).