I can see the that it means “Where” but how do you say when I break down the words I get the following:
g-d-ye t-ee
How do I pronounce it?
I can see the that it means “Where” but how do you say when I break down the words I get the following:
g-d-ye t-ee
How do I pronounce it?
Silero is an open source Text To Speech large language model that can run on your own hardware. It actually does quite well running on just a CPU, even an older one if you have the RAM to load the model in the first place. The documentation kinda sucks for doing more advanced stuff and there are some issues with generating extremely long stuff like reading books.
Silero is a Russian company. They do both TTS and STT, although STT requires advanced fine tuning and there are newer and more advanced options from others.
Hugging Face is like the GitHub of offline open source AI tools. They host example instances of many models. Unless you make an account with HF, you can only use them for something like 2 queries per day anonymously. This is an instance of Silero TTS with Russian speakers setup:
https://huggingface.co/spaces/NeuroSenko/tts-silero