Codeberg was asking about this. The linked toot by a commenter points to :

SEqlite

These are CC-BY-SA 4.0 remixes of the Stack Exchange Creative Commons Data Dumps. 100% Unendorsed by Stack Exchange, Inc.

They are minimal. They provide the data you probably care about and the data you need to comply with the original license in SQLite format.

  • huginn@feddit.it
    link
    fedilink
    arrow-up
    2
    ·
    7 months ago

    Federated Stack Exchange isn’t harder for AI to eat. If anything it’s easier.