TIL That the entirety of Wikipedia is only ~100Gb and you can download it for offline use

retrospectology@lemmy.world · edit-2 6 months ago

TIL That the entirety of Wikipedia is only ~100Gb and you can download it for offline use

lolola · 6 months ago

So something akin to this joke image I saw the other day is actually feasible for Wikipedia?

Max@lemmy.world · 6 months ago

Chatgpt is also probably around 50-100GB at most

souperk@reddthat.com · 6 months ago

Probably a lot less, keep in mind that whenever it answers a question the whole model is traversed multiple times, going through multiple GBs is not possible in the matter of seconds the model answers.

Max@lemmy.world · 6 months ago

I’d be surprised if it was significantly less. A comparable 70 billion parameter model from llama requires about 120GB to store. Supposedly the largest current chatgpt goes up to 170 billion parameters, which would take a couple hundred GB to store. There are ways to tradeoff some accuracy in order to save a bunch of space, but you’re not going to get it under tens of GB.

These models really are going through that many Gb of parameters once for every word in the output. GPUs and tensor processors are crazy fast. For comparison, think about how much data a GPU generates for 4k60 video display. Its like 1GB per second. And the recommended memory speed required to generate that image is like 400GB per second. Crazy fast.

lolola · edit-2 6 months ago

Plus input data?

jose1324@lemmy.world · 6 months ago

No, but it’s the model after the input that you need.

anivia@lemmy.ml · 6 months ago

So it would fit on a Bluray disc

mctoasterson@reddthat.com · 6 months ago

I mean, you can self-host your own local LLMs using something like Ollama. The performance will be bound by the disk space you have (the complexity of the model you’re able to store), and the performance of the CPU or GPU you are using to run it, but it does work just fine. Probably as good results as ChatGPT for most use cases.

Nooodel@lemmy.world · 6 months ago

We do this at work (lots of sensitive data that we don’t want Openai to capitalize on) and it works pretty well. Hosted locally, setup by a data security and privacy sensitive admin, who specifically runs the settings to not save any queries even on the server. Bit slower than chatgpt but not by much

Slovene@feddit.nl · 6 months ago

https://m.youtube.com/watch?v=1lRI35gKSPA