The fediverse is discussing if we should defederate from Meta’s new Threads app. Here’s why I probably won’t (for now).

(Federation between plume and my lemmy instance doesn’t work correctly at the moment, otherwise I would have made this a proper crosspost)

  • dfyx@lemmy.helios42.deOP
    link
    fedilink
    English
    arrow-up
    6
    ·
    1 year ago

    They can siphon your data no matter what you do. As I’ve said in other comments, everything on the internet has been crawled and scraped for literal decades. This post is already indexed by a bunch of different search engines and most likely by some other scrapers that harvest our data for AI or ad profiles. And you can do nothing about it without hurting your legitimate audience. Nothing at all. There’s robots.txt as a mechanism to tell a crawler what it should or shouldn’t index but that’s just asking nicely (mostly to prevent search engines from indexing pages that don’t contain actual content). You could in theory block certain IP ranges or user agents but those change faster than you can identify them. This dilemma is the whole reason why Twitter implemented rate limiting. They wanted to protect their stuff from scrapers. See where it got them.

    Most important rule of the internet: if you don’t want something archived forever, don’t post it!