Consumer GPUs to run LLMs

marauding_gibberish142@lemmy.dbzer0.com · edit-2 2 days ago

Consumer GPUs to run LLMs

umami_wasabi@lemmy.ml · edit-2 1 day ago

Using 7900XTX with LMS. Speed are everwhere, driver dependent. With QwQ-32B-Q4_K_M, I got about 20 tok/s, with all VRAM filled. Phi-4 runs at about 30-40 tok/s. I can give more numbers if you can wait for a bit.

If you don’t enjoy finding which driver works best, I strongly aginst running AMD for AI workload.

marauding_gibberish142@lemmy.dbzer0.com · 1 day ago

I didn’t know that. I thought just one ROCM binary to install, run Ollama and that’s it. Thanks for the explanation