I definitely do not want to support this practice, but there’s no way to filter these out 😠.

  • @[email protected]
    link
    fedilink
    English
    288 months ago

    One blogger cited in the report claimed converting an ebook to audio using the AI narration took just 52 minutes

    This does not inspire confidence. The technology is there to do this very well, but it takes skill and effort. The technology to automate it end to end with high quality does not yet exist.

    52 minutes. That’s maybe 1/10th the time it would take to listen to it. I wonder how much of these 40,000 books were even proof-listened once.

    • Admiral PatrickOP
      link
      fedilink
      English
      358 months ago

      Honestly, I don’t really care if the LLM can spit out a perfect replica of Stephen Fry with every inflection and intonation possible and in the correct spots.

      Tools like these can and will be used to take jobs from actual voice actors. I want no part of it.

      • @[email protected]
        link
        fedilink
        English
        198 months ago

        I get where you’re coming from, but it doesn’t sit quite right with me. The whole point of technology is to save human time and effort. That should be a good thing. The problem is the capitalist hellscape that is the status quo. I don’t think we should put the onus of propping up that capitalist hellscape onto book authors. I mean, maybe that’s the easiest way to maintain the status quo, but the status quo was never sustainable in the first place.

        I don’t know. This is not a fully fleshed out philosophy. At some level I’m sure it’s the same old idealism-vs-pragmatism debate.

        • Admiral PatrickOP
          link
          fedilink
          English
          138 months ago

          I mean, when that time and effort is someone’s chosen profession, then there’s only so many ways it can go.

          The authors are free to not release an audiobook with some soulless, robotic voice behind it and stick to print/ebook. Amazon is also free to use the AI voices as enhanced TTS for regular books, and I would be fine with that (no one expects those to sound human, and they’re not sold as audiobooks).

          For me, the narrator makes the audiobook experience. As an example, pretty much all of the Revelation Space series was narrated by John Lee. One of the later books was narrated by someone else (forget their name, but they were definitely forgettable), and it just didn’t do it for me. It was an actual person, but they read it so robotically I lost interest halfway through the prologue and just read it on e-reader.

      • boredsquirrel
        link
        fedilink
        58 months ago

        And most importantly it will never be creative, only recreate. Just that if you mix many many people it sounds new.

      • @[email protected]
        link
        fedilink
        48 months ago

        Yep, it’s already happening. I did freelance voice work for a client for awhile but was replaced by a voice model because it’s vastly cheaper, even if the output is also proportionally worse.

        • Admiral PatrickOP
          link
          fedilink
          English
          5
          edit-2
          8 months ago

          That sucks. Just know I’m doing my very tiny, infinitesimally small part to not support that practice.

          • @[email protected]
            link
            fedilink
            48 months ago

            It’s appreciated! Luckily it wasn’t my whole income, but seeing myself replaced with a really bad voice model was just gross. But hey, think of all the value it creates for shareholders!

      • @[email protected]
        link
        fedilink
        48 months ago

        That’s how automation works. It’s just a fact of technological advancement.

        If people will happily listen to it and it’s way cheaper it’s going to happen I’m afraid.

        • Admiral PatrickOP
          link
          fedilink
          English
          68 months ago

          I agree on the technological advancement, but automating human creativity and emotion (yes, there’s definitely a degree of creativity in voice acting) is a bridge too far for me. An AI-narrated audiobook automatically loses 3 stars on the review lol.

      • @[email protected]
        link
        fedilink
        28 months ago

        I agree with the job loss part, but it seems like a really weak argument. What about the increase deals for the author? Many steps in progress lead to job losses, because the world changes. What’s important is to do it in a responsible manner, and I think that’s where Amazon is failing.

      • shastaxc
        link
        fedilink
        -18 months ago

        This is the same thing that happened to scribes when the printing press was invented. It’s not going away.

  • paraphrand
    link
    English
    188 months ago

    If this moves forward they should cost 10% the average price of human narration.

  • @[email protected]
    link
    fedilink
    English
    148 months ago

    If they are as shitty as the obviously ai powered closed captioning we are seeing now, they will hopefully be easy to recognize.

    • Admiral PatrickOP
      link
      fedilink
      English
      98 months ago

      Yeah, and you can still look at the “Narrated by” to see if it’s a real voice actor. You just can’t filter by “not AI”.

  • @Reddfugee42
    link
    78 months ago

    Just ban it and have people report the violators.

  • @[email protected]
    link
    fedilink
    78 months ago

    I very much understand the misgivings about this, and certain parts make me uncomfortable with it, too. But this could be revolutionary for media accessibility, and in my mind could easily be worth it for the ability to make new media immediately accessible to folks with vision challenges, deaf and hard of hearing individuals, and a lot of other folks for whom most media is not easily interactive/accessible. For many people in this situation, you wait months after a traditional version of something is published before an accessible version is released, if it ever is. Often, it’s just not seen as worth a publisher’s time to make their content accessible to an audience they don’t see as significantly profitable.

    Like the printing press took jobs from scribes, but had far more significant impacts democratizing information and education, so might AI in the long run.

    • Admiral PatrickOP
      link
      fedilink
      English
      128 months ago

      But this could be revolutionary for media accessibility, and in my mind could easily be worth it for the ability to make new media immediately accessible to folks with vision challenges, deaf and hard of hearing individuals, and a lot of other folks for whom most media is not easily interactive/accessible

      As an accessibility add-on / upgrade to standard TTS, sure. Sounds great, even. But I will not accept soulless, robotic, AI-generated voices for something being sold as an audiobook. I just won’t.

      • The Pantser
        link
        28 months ago

        What about if we sweeten the deal and allow you to choose the voice actor on the fly? Want Star wars novels read by James Earl Jones, or Tolkien read by Arnold Schwarzenegger? You can have that with AI voices.

        • Admiral PatrickOP
          link
          fedilink
          English
          58 months ago

          Lol, there is no sweetening the deal. Every one of those is a lost job opportunity for an actual voice actor.

          • The Pantser
            link
            28 months ago

            Except many books get read by the author or never get read at all so it opens new opportunities for people who wouldn’t ever use audio books or books who never will get audio version now vision impaired people have the option to hear them.

            • Admiral PatrickOP
              link
              fedilink
              English
              68 months ago

              TTS already exists for the visually impaired. Amazon can use AI voices for that all they want (they’re better than the default TTS now). Just don’t sell me an “audiobook” that’s not read by a person.

            • Jackie's Fridge
              link
              68 months ago

              How about opening up new opportunities for real (if unknown) voice actors to read these books for people?

              • just another dev
                link
                fedilink
                English
                -18 months ago

                There’d be a market for that, just like there’s a market for handmade furniture.

      • @[email protected]
        link
        fedilink
        -58 months ago

        “I will not accept soulless robotic machine-generated typescript being sold as a book. I just won’t.” -Some hipster arguing against the printing press

        Not denying audiobook performers don’t have real valuable talent (and should be fairly paid for it), but when was the last time you paid a premium for a handwritten novel?

  • @[email protected]
    link
    fedilink
    38 months ago

    Have y’all ever tried listening to an audiobook but had to give up because you didn’t like the reader? Imagine being able to choose the voice, the accent, the rhythm, the speed.

    Imagine a future where you could train a model to read it in your own voice so that you could read to your loved ones even after you’ve passed.

    I’m ready for AI to take over all the jobs.

      • @[email protected]
        link
        fedilink
        48 months ago

        Yeah but I’m talkin’ any book. Not just the ones I could take the time to record myself reading. Imagine anyone being able to have any book read to them in any voice, including my deceased grandmother’s. We have some recordings of her voice that would be enough to train AI so I could have her read me any book. That’s what I’m talkin’ about.

      • @Psychodelic
        link
        28 months ago

        You listen to audiobooks of yourself narrating? That’s kinda fuckin weird. lol

        • The Assman
          link
          fedilink
          18 months ago

          Username checks out. Reread the thread when the shrooms wear off.

          • @Psychodelic
            link
            18 months ago

            Why would you even suggest that? So weird

    • @[email protected]
      link
      fedilink
      48 months ago

      Yes, I have. And I don’t want to choose every detail of a performance no matter how simple.

      I want to hear a performance independent of my own experience. I want to hear something that I didn’t know I wanted.

      AI can only show you an elaborate mixture of the data it was trained on and a set of instructions. It cannot make decisions, iterate on experiences, or have creatively beneficial mistakes like a human does.

      • @[email protected]
        link
        fedilink
        18 months ago

        Considering how good the technology is now and how it will continue to improve, I think we’ll soon have a hard time telling the difference. I can’t see the value in having a human spend hours reading and recording and editing when a program will be able to do it almost instantly in the near future.

        • @[email protected]
          link
          fedilink
          48 months ago

          You’re looking at an audiobook as if it’s only a method of distributing the contents of a book, but a human conveying a story is a performance. Those performances have artistic merit. Sometimes more, sometimes less, different take aways for different listeners, however it’s an inherent symptom of a human being sharing something.

          This is where it gets important to remember that Machine Learning isn’t actually AI. It cannot make decisions or have accidents outside of mistakes made by the developers. It is not intelligent, it is a computer program.

          • @[email protected]
            link
            fedilink
            28 months ago

            Fair point. I still think professional voice actors are an endangered species. The clock is ticking…

    • The Pantser
      link
      38 months ago

      I plan to make a “me” and integrate with Home Assistant so I can be the voice of my house.