• trevor
    8 hours ago

    They did not release the final model without the data

    They literally did exactly that. Show me the training data. If it has been provided under an open source license, then I’ll revise my statement.

    You literally cannot create a useful LLM without the training data. That is a part of the framework used to create the model, and they kept that proprietary. It is a part of the source. This is such an obvious point that I should not have to state it.