Using an AMD GPU for NN training/inference?

@leakybits · edit-2 2 years ago

Using an AMD GPU for NN training/inference?

ShittyKopper [they/them] · edit-2 2 years ago

Get something new enough and continue getting something new enough when AMD pushes them out. The drivers suck for anything older than an RX580, and things like Blender require even newer GPUs despite the hardware being more than capable.

Run Arch and use the ROCm’d PyTorch from the repos. Those packagers know what they’re doing.

Other than that, expect everything premade to be made for CUDA (and therefore unusable). There are some tools like https://github.com/ROCm-Developer-Tools/HIPIFY but they aren’t “there”.

Source: Been running Stable Diffusion on an RX580.

@leakybits · 2 years ago

Thanks! Sounds doable but definitely frustrating… I’m surprised this is the state of things at the moment. I mean, when you buy a CPU, you don’t really think about whether your choice limits you in some ways. But with a GPU, it’s a big consideration.

@[email protected] · 2 years ago

Yeah GPUs never got standardized like x86 did from the old IBM machine days. GPUs are still operating on the mindset of “specific hardware” rather than something generic. If GPUs could be programmed on as easily as CPUs we could target something like vulkan for ML.

Even ARM faces similar, but different problems of the lack of standard boot methods.