cross-posted from: https://infosec.pub/post/8775123

Reddit said in a filing to the Securities and Exchange Commission that its users’ posts are “a valuable source of conversation data and knowledge” that has been and will continue to be an important mechanism for training AI and large language models. The filing also states that the company believes “we are in the early stages of monetizing our user base,” and proceeds to say that it will continue to sell users’ content to companies that want to train LLMs and that it will also begin “increased use of artificial intelligence in our advertising solutions.”

The long-awaited S-1 filing reveals much of what Reddit users knew and feared: that many of the changes the company has made over the last year in the lead-up to an IPO are focused on exerting control over the site, sanitizing parts of the platform, and monetizing user data.

Posting here because of the privacy implications of all this, but I wonder if at some point there should be an “Enshittification” community :-)

  • Kyre@kbin.social
    8 months ago

    I’m sure it’s like pissing into the ocean, but I went back and edited my most popular posts, replacing them with AI-generated nonsense that is supposed to be difficult for LLMs to classify. I doubt it will have an effect, but it would certainly be funny if enough people did it.

    I guess that’s because the sentences are grammatically correct but contain paradoxes and ambiguity, and are utter nonsense.
    Here are some samples:
    “Silent thunder vibrates noiselessly through the colorful darkness, illuminating unseen sights with invisible light in a transparent fog.”
    “The invisible painting, clear as day, vividly colors the transparent wall, telling untold stories in a language never spoken.”
    “The motionless wind, still yet turbulent, swiftly calms the turbulent stillness of a restless peace in a serene tempest.”