@[email protected] to

[email protected]English • 1 year ago

Difference between GGML & GPTQ

Apologies for the basic question, but what’s the difference between GGML and GPTQ? Do these just refer to different compression methods? Which would you choose if you’re using a 3090 Ti GPU?

@markon
English
1•1 year ago
Also, llama.cpp runs GGML models much faster than transformers does, and is sometimes faster than ExLlama.
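
If you want to check a speed comparison like this on your own hardware, one rough approach is to time generation and compare tokens per second. Below is a minimal sketch, assuming a hypothetical `generate` callable that wraps whichever backend you are testing and returns the number of tokens it produced. `tokens_per_second` and `fake_backend` are made-up names for illustration, not part of llama.cpp, transformers, or ExLlama.

```python
import time
from typing import Callable


def tokens_per_second(generate: Callable[[str], int], prompt: str, runs: int = 3) -> float:
    """Average tokens/second over several calls to a backend's generate function.

    `generate` is a hypothetical wrapper you write yourself: it takes a prompt,
    runs the backend, and returns how many tokens were generated.
    """
    if runs < 1:
        raise ValueError("runs must be at least 1")
    total_tokens = 0
    total_seconds = 0.0
    for _ in range(runs):
        start = time.perf_counter()
        total_tokens += generate(prompt)
        total_seconds += time.perf_counter() - start
    # Guard against a zero elapsed time from a trivially fast callable.
    return total_tokens / total_seconds if total_seconds > 0 else float("inf")


def fake_backend(prompt: str) -> int:
    # Stand-in for a real backend: pretends to spend 10 ms producing 20 tokens.
    time.sleep(0.01)
    return 20


if __name__ == "__main__":
    rate = tokens_per_second(fake_backend, "Hello", runs=3)
    print(f"{rate:.1f} tokens/s")
```

To compare backends fairly, use the same prompt, the same number of generated tokens, and the same quantization level for each one, and throw away the first run since it usually includes model loading and warm-up.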