DeepSeek Proves It: Open Source is the Secret to Dominating Tech Markets (and Wall Street has it wrong).

Cat@ponder.cat · 2 months ago

DeepSeek Proves It: Open Source is the Secret to Dominating Tech Markets (and Wall Street has it wrong).

vrighter@discuss.tchncs.de · 2 months ago

I view it as the source code of the model is the training data. The code supplied is a bespoke compiler for it, which emits a binary blob (the weights). A compiler is written in code too, just like any other program. So what they released is the equivalent of the compiler’s source code, and the binary blob that it output when fed the training data (source code) which they did NOT release.

pishadoot@sh.itjust.works · 2 months ago

This is probably the best explanation I’ve seen so far and really helped me actually understand what it means when we talk about “weights” for LLMs.