Which selfhosted TTS provider should I use with home assistant?

Scrubbles · 18 hours ago

Which selfhosted TTS provider should I use with home assistant?

ALERT · 12 hours ago

if you need English - right now it’s kokoro-fastapi https://github.com/remsky/Kokoro-FastAPI set this container up and use it as an openai TTS endpoint using this hacs integration https://github.com/sfortis/openai_tts

Scrubbles · 6 hours ago

Very nice! I’ll check this out!

@[email protected] · 12 hours ago

Piper works pretty well. I’m only using it because it was easier to find a custom glados voice.

Kokoro has good default voices. I also started trying out Speaches recently. It provides an open ai api wrapper around several options

@[email protected] · 9 hours ago

Any tips on getting speaches to work with Home assistant? Got speaches working but haven’t gone the next step yet.

@Vector · 18 hours ago

Don’t know much about the training side of things, but I have Piper set up with home assistant using the Wyoming protocol and it just goes. Some of the out-of-the-box voices are pretty decent too.

@just_another_person · 18 hours ago

Pretty much just personal preference at this point. XTTS is certainly not the most efficient though.

Scrubbles · 18 hours ago

any personal preferences you recommend?

@just_another_person · edit-2 18 hours ago

Pico, Piper, Mary, and Google all run locally and off of CPU only.

I think all the rest require cloud accounts or acceleration hardware to work quickly.

I’m personally fine with Mary or Piper, but I know some people like the fancier ones.

𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍 · 8 hours ago

Google? Have you verified that?

@just_another_person · 6 hours ago

Yes. Have a look at the docs: https://www.home-assistant.io/integrations/google_translate/

𝕽𝖚𝖆𝖎𝖉𝖍𝖗𝖎𝖌𝖍 · 12 minutes ago

The docs don’t say it’s completely offline. Can you turn off your LAN connection and it still works? Have you tried this? Or just firewall off out bound access to Google services?

This comment:

Contrary to what the name suggests, the integration only does text-to-speech and does not translate messages sent to it.

doesn’t say it doesn’t call out to Google services; it says only that it doesn’t use translation services. I didn’t see anything else that implies it doesn’t send data to Google.