Office space meme:
“If y’all could stop calling an LLM “open source” just because they published the weights… that would be great.”
Office space meme:
“If y’all could stop calling an LLM “open source” just because they published the weights… that would be great.”
Yes. The training data is probably a few hundred petabytes.
Oh wow that’s fuckin huge
Yeah, some models are trained on pretty much the entire content of the publicly accessible Internet.