Hello GPT-4o

@[email protected] · 10 months ago

@abhibeckert · edit-2 10 months ago

you can run locally some small models

Emphasis on “small” models. The large ones need over a terabyte of RAM, and it has to be high-bandwidth memory (ordinary DDR is not fast enough).
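To put a rough number on the bandwidth point, here is a back-of-the-envelope sketch. It assumes text generation is memory-bound, meaning every weight is read once per generated token, so bandwidth divided by model size gives an upper limit on tokens per second. The function name and the bandwidth figures are illustrative placeholders, not measurements of any particular machine.

```python
def max_tokens_per_second(model_gb: float, bandwidth_gb_s: float) -> float:
    """Rough upper bound on generation speed.

    Assumption: each generated token requires reading all the weights
    from memory once, and nothing else limits throughput.
    """
    return bandwidth_gb_s / model_gb


# Hypothetical example: a 35 GB model on two made-up memory systems.
for label, bandwidth in (("slower memory", 100.0), ("faster memory", 400.0)):
    rate = max_tokens_per_second(35.0, bandwidth)
    print(f"{label} ({bandwidth:.0f} GB/s): at most ~{rate:.1f} tokens/s")
```

Real throughput will be lower than this bound, but it shows why memory speed matters about as much as memory size.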

And for most tasks, smaller models hallucinate way too often. Even the largest models are only just barely good enough.

@[email protected] · 10 months ago

Llama 2 70B can run on a specced-out current-gen MacBook Pro. Not cheap hardware in any sense, but it isn’t a large data center cluster.
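For a sense of why that is plausible, here is a minimal sketch of the arithmetic. It assumes the memory needed is dominated by the weights (parameter count times bits per weight) and ignores the KV cache, activations, and runtime overhead, so real usage will be somewhat higher. The helper name is made up for illustration.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (10^9 bytes)."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


# A 70B-parameter model at a few common precisions.
for bits in (16, 8, 4):
    print(f"70B at {bits}-bit: ~{weight_memory_gb(70, bits):.0f} GB")
```

At 16-bit that comes to about 140 GB, but quantized to 4-bit it is about 35 GB, which is within reach of a laptop with 64 GB or more of unified memory.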