I suspect this machine will be popular with hobbyists for running really large open weight LLMs.
Yeah.
It will probably spur a lot of development! I’ve seen a lot of bs=1 (batch size 1) speedup “hacks” get shelved because GPUs are fast enough and memory efficiency is the real bottleneck. But suddenly all these devs are going to have a 48GB-96GB memory pool that’s significantly slower than a 3090, so those speedups start to matter again. And multimodal becomes much more viable.
Not to mention better ROCm compatibility. AMD should have done this ages ago…
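
For a rough sense of why a big-but-slower pool changes the picture, here is a back-of-the-envelope sketch. It assumes bs=1 decoding is purely memory-bandwidth bound, so every weight is read once per generated token. The 936 GB/s figure is the RTX 3090's published memory bandwidth; `POOL_BW_GB_S`, the model sizes, and the function name are hypothetical placeholders, not specs of this machine.

```python
def max_decode_tokens_per_s(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Upper bound on bs=1 decode speed if all weights are read once per token."""
    return bandwidth_gb_s / model_size_gb


def fits(model_size_gb: float, memory_gb: float) -> bool:
    """Whether the weights fit in the memory pool (ignores KV cache and overhead)."""
    return model_size_gb <= memory_gb


RTX_3090_BW_GB_S = 936.0  # published spec for the RTX 3090
RTX_3090_MEM_GB = 24.0
POOL_BW_GB_S = 250.0      # hypothetical placeholder for the big, slower pool
POOL_MEM_GB = 96.0        # top of the 48GB-96GB range mentioned above

for model_gb in (20.0, 70.0):  # hypothetical quantized model sizes
    for name, bw, mem in (
        ("3090", RTX_3090_BW_GB_S, RTX_3090_MEM_GB),
        ("big pool", POOL_BW_GB_S, POOL_MEM_GB),
    ):
        if fits(model_gb, mem):
            tps = max_decode_tokens_per_s(bw, model_gb)
            print(f"{model_gb:.0f} GB model on {name}: at most ~{tps:.1f} tok/s")
        else:
            print(f"{model_gb:.0f} GB model on {name}: does not fit")
```

Real throughput will land below these ceilings, but the shape of the tradeoff is the point: the larger pool can hold models the 3090 can't, at a much lower tokens-per-second ceiling, which is where bs=1 tricks earn their keep.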