What should I use: big model-small quant or small model-no quant?

Smorty [she/her] · edit-2 4 months ago

What should I use: big model-small quant or small model-no quant?

@j4k3 · edit-2 4 months ago

deleted by creator

Smorty [she/her] · 4 months ago

Another user @[email protected] commented about there being a way to split it between GPU and CPU. Are you talking about this nvidia only and windows only thingy, which only works with the proprietary driver? If so, I’m really not gonna use that…

Have you tried some of the abliterated models? They work really nicely even for the spiciest of topics. They literally can’t refuse your instruction, so they just go ahead and do what you want. But maybe even these models are too narrow for your specific application…

@j4k3 · 4 months ago

deleted by creator