cross-posted from: https://lemmy.ml/post/15471632

Codeberg was asking about this. The toot linked by a commenter points to:

SEqlite

These are CC-BY-SA 4.0 remixes of the Stack Exchange Creative Commons Data Dumps. 100% Unendorsed by Stack Exchange, Inc.

They are minimal. They provide the data you probably care about and the data you need to comply with the original license in SQLite format.

  • Miaou@jlai.lu
    7 months ago

    They already have access to SO’s CC content, so why would they get it from the fediverse?

    • lambalicious@lemmy.sdf.org
      7 months ago

      They already have it.

      I said an alternative to SO, as in a place to post new content (answers, comments). Nothing can really be done about the content OAI already got their hands on, other than firing off a few well-placed EMP bombs.

      • Miaou@jlai.lu
        7 months ago

        Yes, but you mentioned that importing old content is problematic, and I don’t see why.

        • lambalicious@lemmy.sdf.org
          7 months ago

          Because to import old content, you have to respect the old license (or get every contributor from back then to relicense). That would mean having a site whose content falls under different licenses depending on its date, which is something the corpos can use as an excuse to keep siphoning everything without consequence.

          I’m fine with a mirror / archive of SO. But it should very definitely be a separate thing from an active SO alternative, and its users and data storage should be separate too.