[email protected] • 12 hours ago

Can you self-host AI at parity with chatgpt?

5

9

Can you self-host AI at parity with chatgpt?

[email protected] • 12 hours ago

5

My office computer has a Ryzen 7 5700, RX 580x, and 32gb of ram. Running ollama with deepseekv2 or llama3 is much slower than chatgpt in the browser. Same with my newer, more powerful home computer.

What kind of hardware do you need to run with comparable responsiveness to chatgpt? How much does it cost? Presuming such hardware is commercial, where do you find it?

Chat

JoYo 🇺🇸
link
fedilink
English
4•
edit-2
8 hours ago
It’s all dependent on VRAM. If you can load the distilled models with your GPU without maxing out your VRAM it will run just as fast as any server farm.

RX 580x

It looks like your video card only has 8 GB of VRAM. That will be your bottleneck.

[email protected]

[email protected]

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

A loosely moderated place to ask open-ended questions

Search asklemmy 🔍

If your post meets the following criteria, it’s welcome here!

Open-ended question
Not offensive: at this point, we do not have the bandwidth to moderate overtly political discussions. Assume best intent and be excellent to each other.
Not regarding using or support for Lemmy: context, see the list of support communities and tools for finding communities below
Not ad nauseam inducing: please make sure it is a question that would be new to most members
An actual topic of discussion

Looking for support?

Looking for a community?

Lemmyverse: community search
sub.rehab: maps old subreddits to fediverse options, marks official as such
[email protected]: a community for finding communities

_Icon _by _{@[email protected]}

1.76K users / day
3.7K users / week
8.05K users / month
19.4K users / 6 months
44.7K subscribers
6.19K Posts
316K Comments
Modlog