Using Mac M2 Ultra 192GB to Self-Host LLMs?

@[email protected] · edit-2 1 hour ago

Using Mac M2 Ultra 192GB to Self-Host LLMs?

@KoalaUnknown · edit-2 8 hours ago

There are some videos on youtube of people running local LLMs on the newer M4 chips which have pretty good AI performance. Obviously, a 5090 is going to destroy it in raw compute power, but the large unified memory on Apple Silicon is nice.

That being said, there are plenty of small ITX cases at about 13-15L that can fit a large nvidia GPU.

@[email protected] · edit-2 5 hours ago

Thanks! Hadn’t thought of YouTube at all but it’s super helpful. I guess that’ll help me decide if the extra Ram is worth it considering that inference will be much slower if I don’t go NVIDIA.