I came across tools like Nightshade that can poison images. That way, if someone steals an artist’s work to train their AI, it learns the wrong things and may start spewing gibberish.
Is there something that I can use on PDFs? There are two scenarios for me:
- Content that I have already created and that is available as a PDF.
- I use LaTeX to make new documents, and I want to poison those from scratch if possible, rather than as an ad hoc step once the PDF is created.
“Why TF is this one-page document half a gigabyte?”
“Oh, it’s got an embedded TIFF of the actual content. That explains it.”
Yes, I am quite old now.
Text is small! The Bee Movie script is 89.2 KB.
Obviously you need some redundancy in case the script gets corrupted. 5000 repetitions seems reasonable for such a high-quality work.