Free Open-Source AI LLM Guide (Summer 2023)

@Blaed · edit-2 1 year ago

Free Open-Source AI LLM Guide (Summer 2023)

@Rand0mA · 2 years ago

Just like to say, good info. It’s taken me months to work out all of this manually pretty much.

I’d like to say. 8gb is enough for most of theblokes gptq models. I’ve been running everything 7b parameters or less on a Nvidia 1080 8gb.

The text-generation-webui API is great. With a few tweaks to how things are displayed, having a terminal AI assistant in Linux is a game changer.

Also, stable diffusion should sit along side this in your exploratory trip through ML. Especially the text2video extension… Crazy sh!t!!

@Blaed · edit-2 2 years ago

Hey, thanks for commenting. You’re not alone. I started my Machine Learning journey ~6 months ago in early 2023 without any knowledge of the underlying tech. Granted, I have some experience with infrastructure - but it has taken me a few months to absorb certain concepts and get things working the manual way too. 100% worth it though. I’m glad some of the resources I’ve found along the way are helping you and anyone else who comes across our community. It’s an exciting time to be in this field and the perfect time to jump in.

Love to hear about your 1080 champing through inference. I have a 1080 TI I still hold onto for sentimental reasons… I have considered dusting it off as a standalone inference server. Glad to know it can reach 7B models. That’s awesome.

I had no idea Stable Diffusion had a text2video extension… I’ll admit, I’m a big fan of SD, but don’t have as much time to commit to it as I’d like. It’s definitely something I plan on making more resources on after I reach a few of my text-based LLM goals.

I foresee some very exciting ecosystems in our near future, ones that combine text2image2video workflows to create some really innovating applications. That being said, if you ever run into something cool, don’t hesitate to share it with us here!

Model	VRAM Used	Minimum Total VRAM	Card Examples	RAM/Swap to Load*
LLaMA-7B	9.2GB	10GB	3060 12GB, 3080 10GB	24 GB
LLaMA-13B	16.3GB	20GB	3090, 3090 Ti, 4090	32 GB
LLaMA-30B	36GB	40GB	A6000 48GB, A100 40GB	64 GB
LLaMA-65B	74GB	80GB	A100 80GB	128 GB

Model	Minimum Total VRAM	Card Examples	RAM/Swap to Load*
LLaMA-7B	6GB	GTX 1660, 2060, AMD 5700 XT, RTX 3050, 3060	6 GB
LLaMA-13B	10GB	AMD 6900 XT, RTX 2060 12GB, 3060 12GB, 3080, A2000	12 GB
LLaMA-30B	20GB	RTX 3080 20GB, A4500, A5000, 3090, 4090, 6000, Tesla V100	32 GB
LLaMA-65B	40GB	A100 40GB, 2x3090, 2x4090, A40, RTX A6000, 8000	64 GB

Free Open-Source AI LLM Guide (Summer 2023)

Free Open-Source AI LLM Guide (Summer 2023)

Getting Started With Free Open-Source AI

8-bit System Requirements

4-bit System Requirements

FOSAI Resources

Large Language Model Hub

oobabooga

Exllama

gpt4all

TavernAI

SillyTavern

Koboldcpp

KoboldAI-Client

h2oGPT

Models

The Bloke

70B

30B

13B

7B

More Models

More General AI/LLM Resources

LLM Leaderboards

Places to Find Models

Training & Datasets

GL, HF!