Hello GPT-4o

@[email protected] · 9 months ago

Dran · edit-2 9 months ago

I have this running at home on a used R630 (CPU only): oobabooga and automatic1111 as the LLM and SD backends, vosk for STT and mimic3 for TTS, plus a little bit of custom Python to tie it all together. I certainly don’t have latency as low as theirs, but it’s definitely conversational when my sentences are short enough.
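
A minimal sketch of what that kind of glue code could look like. This is not the actual script: every name here is hypothetical, and the three stages are stand-in functions where wrappers around vosk, the LLM backend, and mimic3 would be plugged in.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# One conversation turn is stored as (role, text).
Message = Tuple[str, str]


@dataclass
class VoiceLoop:
    """Hypothetical glue object: speech in -> text -> LLM reply -> speech out."""
    transcribe: Callable[[bytes], str]          # STT stage (assumed wrapper, e.g. around vosk)
    generate: Callable[[List[Message]], str]    # LLM stage (assumed wrapper around the backend)
    synthesize: Callable[[str], bytes]          # TTS stage (assumed wrapper, e.g. around mimic3)
    history: List[Message] = field(default_factory=list)

    def turn(self, audio_in: bytes) -> bytes:
        """Run one conversational turn and return the audio to play back."""
        text = self.transcribe(audio_in).strip()
        if not text:
            # Nothing recognized: skip the LLM and TTS stages entirely.
            return b""
        self.history.append(("user", text))
        reply = self.generate(self.history)
        self.history.append(("assistant", reply))
        return self.synthesize(reply)


# Stand-in stages so the sketch runs without any models installed.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(history: List[Message]) -> str:
    return "You said: " + history[-1][1]


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


loop = VoiceLoop(fake_stt, fake_llm, fake_tts)
out = loop.turn(b"hello there")
```

Each stage runs in sequence, so the time per turn is roughly the sum of the three stages, which would fit with short sentences feeling more conversational.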

Sabata11792 · edit-2 9 months ago

Check out the vladmandic fork of auto1111. It seems to be much quicker to add support for new models.

Been wanting to try voice cloning and totally not cobble together a DIY AI waifu.

@[email protected] · 9 months ago

I can’t tell if you are for real or joking with those concatenations of letters. Have you tried the new Oongaboonga123? I hear it’s got great support for bpm°C

Dran · 9 months ago

I am not joking lol, but I do sometimes forget most people don’t live in this space the same way I do. I think people use these names because the programs themselves are forked often and the software names are very unspecific otherwise. I meant that I was using the main branches of these projects:

https://github.com/oobabooga/text-generation-webui

https://github.com/AUTOMATIC1111/stable-diffusion-webui