Mostly the question in the title. I’ve played with XTTS-v2 and it worked pretty well, but I’m wondering whether folks are using anything else. I’d like to train a fine-tune of my own voice, as I did with XTTS-v2, and then use it with Home Assistant’s voice feature. All opinions welcome!

  • @[email protected]
    12 hours ago

    Piper works pretty well. I’m only using it because it was easier to find a custom GLaDOS voice for it.

    Kokoro has good default voices. I also started trying out Speaches recently. It provides an OpenAI-compatible API wrapper around several TTS options.
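As a rough illustration of what an OpenAI-style wrapper means in practice, here is a minimal sketch of a text-to-speech call against such a server. It assumes the server exposes the OpenAI-style `/v1/audio/speech` route; the base URL, model ID, and voice name are placeholders rather than values from this thread, so check your own server's documentation for the real ones.

```python
import json
import urllib.request

# Assumed values: replace with whatever your own server actually uses.
BASE_URL = "http://localhost:8000"  # hypothetical host and port
MODEL = "your-tts-model-id"         # placeholder model ID
VOICE = "your-voice-name"           # placeholder voice name


def build_speech_request(text, base_url=BASE_URL, model=MODEL, voice=VOICE):
    """Build a POST request for the OpenAI-style /v1/audio/speech route."""
    payload = {
        "model": model,
        "input": text,
        "voice": voice,
        "response_format": "wav",
    }
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/v1/audio/speech",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize_to_file(text, path):
    """Send the request and write the returned audio bytes to `path`."""
    with urllib.request.urlopen(build_speech_request(text), timeout=60) as resp:
        audio = resp.read()
    with open(path, "wb") as f:
        f.write(audio)
```

With a server running, `synthesize_to_file("The kitchen light is on.", "out.wav")` should leave a playable file, provided the placeholders have been filled in.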

    • @[email protected]
      9 hours ago

      Any tips on getting Speaches to work with Home Assistant? I’ve got Speaches working but haven’t taken the next step yet.

  • @Vector
    18 hours ago

    Don’t know much about the training side of things, but I have Piper set up with Home Assistant using the Wyoming protocol and it just goes. Some of the out-of-the-box voices are pretty decent too.
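For anyone curious what talking to Piper over Wyoming looks like outside Home Assistant, here is a minimal sketch. It assumes the usual Wyoming framing (one JSON header line, then optional extra data and a binary payload) and the event names `synthesize`, `audio-chunk`, and `audio-stop`; the host, the port, and the helper names are illustrative assumptions, so verify them against the Wyoming documentation and your own server.

```python
import json
import socket

# Assumed address of a Wyoming Piper server; adjust to your setup.
HOST, PORT = "localhost", 10200


def encode_event(event_type, data=None, payload=b""):
    """Encode one event: a JSON header line followed by an optional payload."""
    header = {"type": event_type, "data": data or {}}
    if payload:
        header["payload_length"] = len(payload)
    return json.dumps(header).encode("utf-8") + b"\n" + payload


def read_event(stream):
    """Read one event from a binary file-like object."""
    line = stream.readline()
    if not line:
        raise ConnectionError("stream closed before a full event arrived")
    header = json.loads(line)
    data = header.get("data") or {}
    # Some servers send the data as a separate JSON blob after the header.
    if header.get("data_length"):
        data.update(json.loads(stream.read(header["data_length"])))
    payload = stream.read(header.get("payload_length") or 0)
    return header["type"], data, payload


def synthesize(text, host=HOST, port=PORT):
    """Send a synthesize event and collect raw PCM audio until audio-stop."""
    pcm = bytearray()
    with socket.create_connection((host, port), timeout=30) as sock:
        sock.sendall(encode_event("synthesize", {"text": text}))
        with sock.makefile("rb") as stream:
            while True:
                event_type, _data, payload = read_event(stream)
                if event_type == "audio-chunk":
                    pcm.extend(payload)
                elif event_type == "audio-stop":
                    break
    return bytes(pcm)
```

The returned bytes are raw PCM, so the sample rate and width reported in the audio events would still be needed to wrap them in a WAV file.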

  • @just_another_person
    18 hours ago

    Pretty much just personal preference at this point. XTTS is certainly not the most efficient though.

      • @just_another_person
        18 hours ago

        Pico, Piper, Mary, and Google all run locally on CPU alone.

        I think all the rest require cloud accounts or acceleration hardware to work quickly.

        I’m personally fine with Mary or Piper, but I know some people like the fancier ones.