On my machine I’m running opensuse tumbleweed and has the amdgpu driver installed. I use it for gaming and recently I’ve become interested in running LLMs. So I would like to keep a balance of both without compromising too much on performance.

I know that there are proprietary drivers for AMD cards but I’m hesitant to install it as I’ve heard that it performs less efficiently in games when compared to the open source driver.

I’m mainly confused about this ROCM thing. Is it not included with the opensource amdgpu drivers ? Or is it available as a separate package?

So what driver to use ?

Or perhaps, is it possible to run oogabooga or stable diffusion within a distrobox container (with the proprietary drivers) and still keep using the open source gpu drivers for the Host operating system.

    • @madnificent
      link
      English
      55 months ago

      Has that changed recently? I’ve ran ROCm successfully on an RX6800. I seem to recall that was supported, the host OS (Arch) was not.

      • @[email protected]
        link
        fedilink
        English
        25 months ago

        When I tried it maybe a year or so ago there were four supported chipset in that version (5.4.2 I think) of rocm but I don’t remember which card models those were since they were only specified in that internal chip name. Mine wasn’t supported at the time (5700XT)

    • @turbodrooler
      link
      English
      35 months ago

      This link is misleading. For example, the Radeon RX6800 IS supported because it is the same chip as one of the Radeon Pros. GFX1030. Many others are too…though support does not go very far back.

    • exu
      link
      fedilink
      English
      25 months ago

      Llama.cpp supports OpenCL as well and performs better than rocm in my limited experience. That should work on basically any GPU.

  • @madnificent
    link
    English
    45 months ago

    Latest ollama has support for AMD GPUs. I had to compile from source to make it pick up the GPU on my system.

  • @turbodrooler
    link
    English
    25 months ago

    Look into llamafile. It makes things so easy.

  • @Falcon
    link
    English
    15 months ago

    Basically, RoCM and CUDA allows one to do math on the GPU. Most Linear Algebra operations (i.e. LLM or NNs and ML generally) can be parallelized over a GPU which is much more performant than CPU.

    To perform calculations on GPU, one needs some sort of interface to to their programming language of choice, NVIDIA has CUDA which is in CPP with bindings to python: (pytorch, Tensorflow etc. ), Julia: Flux etc.

    RoCM is AMDs solution, there bindings are young and not widely implemented.

    My advice, play around with Flux RoCM and PyTorch RoCM just to get an idea. Suffice it to say, when I started doing RL and LLMs more seriously I gave up my colab and sold my AMDs to fund a 3060.