Any of you have a self-hosted AI "hub"? (e.g. for LLM, stable-diffusion, ...)

@[email protected] · 9 months ago

Any of you have a self-hosted AI "hub"? (e.g. for LLM, stable-diffusion, ...)

@[email protected] · 9 months ago

Thanks! Glad to see the 8x7B performing not too bad - I assume that’s a Mistral model? Also, does the CPU significantly affect inference speed in such a setup, do you know?

@Audalin · 9 months ago

If your CPU isn’t ancient, it’s mostly about memory speed. VRAM is very fast, DDR5 RAM is reasonably fast, swap is slow even on a modern SSD.

8x7B is mixtral, yeah.