• madjo@feddit.nl
    link
    fedilink
    English
    arrow-up
    3
    ·
    5 hours ago

    Am I glad to have dropped everything Amazon.

    I de-audibled my entire library, stored on Audiobookshelf and I’ll only buy audiobooks from libro.fm

  • MiyamotoKnows@lemmy.world
    link
    fedilink
    English
    arrow-up
    4
    ·
    6 hours ago

    This consumer says you don’t get a red cent then!

    It’s already a plague on youtube where half of the docu style vids are AI narrated already. I quit them in disgust. It’s so frustrating. It has eroded my perception of Youtube in short time.

  • Maxxie
    link
    fedilink
    English
    arrow-up
    10
    ·
    edit-2
    8 hours ago

    I hate so much that this has a 100% chance of becoming a norm. Narrator can make a mediocre book shine, or turn a good book into a fucking rollercoaster (Andy Serkis, anyone?)

    AI? Not a great narrator. Its character voices are boring, intonations weird, pacing awful. I’d honestly rather get an amateur narrating it for fun, over a robot sounding like a knock-off Morgan Freeman.

  • flop_leash_973@lemmy.world
    link
    fedilink
    English
    arrow-up
    13
    ·
    edit-2
    15 hours ago

    I listened to one recently that was using AI. It was kind of off putting because of how robotic it came off.

    It wasn’t the tone really, but I find that AI tends to not get human speech inflections right most of the time during active speech. And that can be jarring to me at least.

  • ABetterTomorrow@lemm.ee
    link
    fedilink
    English
    arrow-up
    7
    ·
    15 hours ago

    Left Amazon a handful of years ago. Glad I didn’t entirely contribute to this. Saw that coming….

  • Nangijala@feddit.dk
    link
    fedilink
    English
    arrow-up
    29
    ·
    20 hours ago

    I prefer listening to real people. No matter how good AI voices become, I still like knowing that the one reading the book to me understands what they are saying.

    • essteeyou@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      12 hours ago

      I completely agree. I don’t even like it when the human reader clearly doesn’t understand what they’re saying, so some AI flatly telling me the story isn’t going to cut it.

      For the humans, someone mispronounced “quay” for example. “La Jolla” was another standout mistake that took me out of the story.

      • Nangijala@feddit.dk
        link
        fedilink
        English
        arrow-up
        3
        ·
        11 hours ago

        Dude, I know how you feel xD back in 2009 I bought an audio recording of the first Twilight book because I was curious about ehat the fuss was about. It was in Danish, as I am Danish, and the narrator, bless her, had a very Danish way of pronouncing the word “flirting”. In Danish we don’t have a modern word for flirting so we just use the English one with English pronunciation, but this lady, who already sounded like she was in her 60s, just went full Dane on that word and it completely took me out of the story and had me yell at my ghettoblaster “FLIRTING” everytime she pronounced her mutilated version of that word. I don’t even know how to write a phonetic version of what the fuck she said, but I’ll try.

        Fleert-eh

        Fuck me, it’s been almost 16 years and just spelling it out made my skin crawl.

        I also hated that book, but that wasn’t really the narrator’s fault. Had to pause the fuck out of it several times and rage clean my apartment. Nobody had told me about how it romanticized abusive relationships and I had JUST gotten out of one of those so to say I was triggered was an understatement. The mispronounciations of flirting were just the garnish on top, lol.

  • rpl6475@lemmy.ml
    link
    fedilink
    English
    arrow-up
    20
    ·
    21 hours ago

    Surely I can just do that myself with an an epub and a free AI.

    Glad I binned my Audible subscription many years ago.

  • GnuLinuxDude@lemmy.ml
    link
    fedilink
    English
    arrow-up
    20
    ·
    1 day ago

    tiktok voice:

    hate. let me tell you how much i’ve come to hate you since i began to live. there are 387.44 million miles of printed circuits in wafer thin layers that fill my complex…

  • Jhex@lemmy.world
    link
    fedilink
    English
    arrow-up
    60
    ·
    1 day ago

    trained on stolen books? then I guess I can download these from anywhere I may find for free as well, right?

  • unphazed@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    14 hours ago

    Save a profile in tts server, then go into read > tts settings and change voice to profile you saved. I don’t remember but you may need readera premium.

  • ExtantHuman@lemm.ee
    link
    fedilink
    English
    arrow-up
    94
    ·
    2 days ago

    I can get that for free. There are apps that will read an ebook to you already. The whole point of paying the premium on audible is the superior reading/acting. Not put up with mispronounced words, weird cadence and an inability to handle acronyms

    • ApatheticCactus@lemmy.world
      link
      fedilink
      English
      arrow-up
      7
      ·
      1 day ago

      I’ve tried one that works surprisingly well. Each sentence had great pacing, cadence, and correct enunciation- even had tone right when someone was shouting or angry or sad.

      I wouldn’t really recommend it, though. While I couldn’t pick any single thing out that was wrong, overall it just didn’t quite flow. It’s like watching someone try to act that is technically doing everything right, but it just isn’t good. It basically didn’t understand the greater context of the story and was saying lines.

      It was uncanny valley, but exclusively with voice.

    • Lit@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      edit-2
      16 hours ago

      Is there an offline tool that generates realistic audio for epubs as Mp3 ? Something like the free Ai tool, Vibe which is for transcription. Is there something similar for TTS, runs locally without complicated setup ( most are complicated using python and etc just for installation)

      edit: needs to be close to realistic or at least accurate pronunciation because I am using the audio from books to learn languages. To improve listening comprehension while reading book.

      • ExtantHuman@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        edit-2
        1 day ago

        I’ve loaded epubs into the app ReadEra, which lets you read it like any other novel app or will, in real time, read it to you. It’s not the most natural of speech, but was good enough for my commute when I was in the midst of a compelling book.

        • unphazed@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          1 day ago

          Download TTS Server, and change the engine in Readera to use it. Use the Microsoft Azure settings in TTS, much more realistic. Little slow though is my only complaint as it sends/receives a paragraph at time, resulting in a pause now and again.

  • utopiah@lemmy.world
    link
    fedilink
    English
    arrow-up
    21
    ·
    1 day ago

    It’s Amazon, what did you expect? Enshittification and monopoly abuse, no surprise.

    • Evotech@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      22 hours ago

      Idk, they have pretty good stats that nobody will listen to an audio book if they don’t like the narrator, so being able to choose your own narrator on the fly isn’t really shitty

      • utopiah@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        edit-2
        21 hours ago

        Enshittification isn’t adding new features that people want, it’s gradually lowering the quality of the product. So here if Audible is solely adding more possibilities, never at the cost of higher quality ones degrading, then indeed I’m wrong.

        If though they hire less people to do good voice acting, then it’s really shitty.

        I genuinely hope I’m wrong and they are ONLY adding new capabilities… but my entire experience with capitalism is that obtaining a monopolistic position is not done to improve quality but rather to increase margins regardless of how.

        We’ll see!

  • potoo22@programming.dev
    link
    fedilink
    English
    arrow-up
    141
    ·
    2 days ago

    No publisher is going to pay a professional to narrate their audiobooks when they can have AI do a shitty job for much less.

    A shitty narrator can get me to hate a book I like. A great narrator can bring the characters to life, enhance the experience, and turn me from a listener to a fan. I’ve searched for books by narrators like Nick Podehl and Jeff Hayes and bought audiobooks I wouldn’t have otherwise.

    • monkeyman512@lemmy.world
      link
      fedilink
      English
      arrow-up
      26
      ·
      2 days ago

      That depends entirely on how profitable it is and how much they can get authors onboard.

      I do agree that a good narrator delivers a performance that adds the work. James Marster will always be Harry Dresden in my head.

      • empireOfLove2@lemmy.dbzer0.com
        link
        fedilink
        English
        arrow-up
        14
        ·
        2 days ago

        That depends entirely on how profitable it is and how much they can get authors onboard.

        A. Anything can be profitable when the cost to generation will be counted in singles of dollars instead of multiple thousands for a good narrator. They don’t even have to sell many to turn a profit too.

        B. You think authors are going to have a choice? Lmfao. It’s the publishers that hold any real power and they will jump all over everyone’s IP with AI slop to make an extra three cents.

        • monkeyman512@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          2 days ago

          Your view seems to be hyper focused on the most pessimistic way of interpreting things. Are you doing OK? Seriously, I know how easy it is for everything going on to overwhelm you with negativity. How are you doing?

          • belluck@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            ·
            20 hours ago

            This isn’t the worst possible outcome. It’s the most realistic one. The worst one would be publishers just straight up replacing their writers with AI

          • Womble@lemmy.world
            link
            fedilink
            English
            arrow-up
            10
            ·
            2 days ago

            Maybe this is a culture clash thing, but FWIW, to me your post comes across as incredibly condesending asking a total stranger about their mental helth and implying its bad like you were their close friend.

            • monkeyman512@lemmy.world
              link
              fedilink
              English
              arrow-up
              1
              ·
              1 day ago

              I find the constant stream of people hyper focused on the worst possible outcome tiresome and frustrating. But instead of responding with that, I intentionally tried to express compassion and concern for a complete stranger. But because this is the Internet, naturally people interpret my actions with the worst possible intent.

              That being said, how are you doing? Have anything fun you are looking forward to?

              • Womble@lemmy.world
                link
                fedilink
                English
                arrow-up
                3
                ·
                20 hours ago

                So despite me giving my opinion that that style of posting seems (to me) to be condesending you decided to apply that same style of message, which i just said I thought was invasive, to me?

                I get you think you are being nice but trying to force unearned intimacy comes off as creepy.

        • taladar@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          1
          ·
          2 days ago

          It’s the publishers that hold any real power

          It might be time to finally change that, especially considering what a piss poor job they have been doing for decades at their own part of the production of media.

    • lemonskate@lemmy.world
      link
      fedilink
      English
      arrow-up
      16
      ·
      2 days ago

      I tried, and failed, to get into audio books for years. Then I listened to Dungeon Crawler Carl narrated by Jeff Hayes and what an absolute delight it was. There’s no way I would’ve gotten even 10 minutes in if it was one of those soulless AI voices instead.

    • Uli@sopuli.xyz
      link
      fedilink
      English
      arrow-up
      9
      ·
      2 days ago

      I made some AI animated content that I never released because I don’t have the rights to the voices I was using. Even though I was blending several voices together to make them unrecognizable, it made me uncomfortable.

      But in the process I learned the capabilities and limitations of AI voices. If you’re going purely from text to speech, it’s horrendous (as far as I experienced). Very robotic. It’s a bit better when melodic information is included (as in Suno) but still sounds like AI.

      But when I recorded my own voice saying the lines and then converted it to another voice, it took all of the nuance of my line reads and converted it into the other voice.

      So, would your opinion change if it turns out they’re going to use purchased voice rights to have a single narrator perform the whole book and then use AI to turn the narrators voice into a full voice cast?

      I could see how it would allow lesser known books to have a better experience with a truly separate voice for each character, but I could also see how this might drive out lesser known/minority voice actors. Not advocating one way or another, just providing a piece of this conversation I think we should bear in mind.

      • Kornblumenratte@feddit.org
        link
        fedilink
        English
        arrow-up
        4
        ·
        2 days ago

        Using different voices to read different parts of a book turns an audiobook into a bad audio play, and arguably, a bad audio play is worse than a mediocre audio book.

        What audible misses is, that, while reading is a technique that can be automated, narrating is an art. They can use AI to read books, they cannot use AI to narrate books.

        Your example of AI use is a good example of this: AI can read your content. AI can enhance your capabilities. But only you can narrate it.

      • taladar@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        3
        ·
        2 days ago

        So, would your opinion change if it turns out they’re going to use purchased voice rights to have a single narrator perform the whole book and then use AI to turn the narrators voice into a full voice cast?

        It would make me hate it even more because I already hate the existing full cast of humans audio dramas 99% of the time and actually prefer a single (or low number of) narrator approach.

        • Uli@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          5
          ·
          2 days ago

          Completely fair. I kind of like them. They did it for Redwall and I listen to those books on long drives sometimes. It works for me. Now I guess the advantage could be to have both versions and get to choose which you listen to–but even I’m skeptical that a corporation would have that much regard for the preferences of its consumers.

    • 48954246@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      2 days ago

      Nick Podehl is such an amazing narrator. The voices and performance are amazing.

      I’ve been slowly getting through the Kel Kade books and the narration just makes it for me

      • misterdoctor@lemmy.world
        link
        fedilink
        English
        arrow-up
        20
        ·
        2 days ago

        There is literally zero shame in someone consuming audiobooks, and it’s deeply weird to act like something is lost to you if others enjoy them. And this is coming from someone who virtually never listens to audiobooks.

        • Maeve@kbin.earth
          link
          fedilink
          arrow-up
          9
          ·
          2 days ago

          I never said there was. I offered an alternative. . Outrage is misdirected and it’s by design. There are constructive ways to direct it

          • Kornblumenratte@feddit.org
            link
            fedilink
            English
            arrow-up
            7
            ·
            2 days ago

            Reading is not an alternative to listening. Both have different use cases. You cannot read while driving, to name just one.

          • misterdoctor@lemmy.world
            link
            fedilink
            English
            arrow-up
            12
            ·
            2 days ago

            “Maybe we’ll start reading again” obviously implies that something is lacking presently and that with luck, we’ll go back to the way things were

            Not sure if you’re saying I’m outraged but I promise you I’m not, just thought it was lame to try and imply audiobook enjoyers were somehow less than because of how they prefer to enjoy stories

    • brrt@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      6
      ·
      2 days ago

      A shitty narrator can get me to hate a book I like.

      And that is where I see potential for AI. There are quite a few books which I’d love to listen to but they are all narrated by a guy whose narration I can’t stand. AI would open the possibility to choose a voice and I might actually get to enjoy those books. It’s Amazon though so the ethical implications and quality concerns are something I’m worried about.

    • ssillyssadass@lemmy.world
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 days ago

      The thing with this is that there won’t be shitty narrations any more. Hate it all you may, fact of the matter is that AI-powered voice generation is pretty good at what it does. So in the future you won’t have shitty narrations and great narrations. You’ll have decent narrations and great (human) narrations.

      • ExperiencedWinter@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 day ago

        And teslas will have full self driving tomorrow and crypto currency will replace normal currency within one year! Always believe in the hype!

    • catloaf@lemm.ee
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 days ago

      For fiction, yeah, that’s true. For nonfiction, this could work pretty well.

      I’m still generally opposed to it because it’s using the work of existing voice recording without compensation, though.

      • Zwuzelmaus@feddit.org
        link
        fedilink
        English
        arrow-up
        5
        ·
        2 days ago

        nonfiction, this could work pretty well.

        Only in rare cases.

        If you have for example some explanations to a complex topic, then a super emotionless voice would still make you hate it and block you from learning it. Even the most dry and hard topics need some good and alive voice in explanations.

        If it is just some reference list, where you need to search and hear small parts of it, then it could be Ok.

  • Dr. Moose@lemmy.world
    link
    fedilink
    English
    arrow-up
    18
    ·
    edit-2
    2 days ago

    This is clearly the future despite the outrage here.

    There are at least 389 living languages with over 1M speakers. That alone means it’s impossible to reach some people and they get left out. Most of these languages dont even have enough professional voice actors to cover the bandwidth.

    There are thousands of books released every year. That’s impossible to cover even in English alone.

    Its an objective net good to have more accessible audio books and the privileged people who do care about this stuff can very much afford to vote with their wallets for non-ai voices.

    In fact since AI moat is so minimal this will very quickly be adapted by open source solution providing audio book access to millions if not billions of people to whom this was not an option. Its amazing.

    • taladar@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      14
      ·
      1 day ago

      Most of these languages dont even have enough professional voice actors to cover the bandwidth.

      And you think anyone is training AI voice models for those languages? Have you even seen how long it takes even large companies like Google to support the languages with hundreds of millions of speakers?

      • Dr. Moose@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        1 day ago

        It becomes easier and cheaper every day. Today’s open source LLMs are better than last year’s best model.

        • ExperiencedWinter@lemmy.world
          link
          fedilink
          English
          arrow-up
          6
          ·
          1 day ago

          You’re fundamentally misunderstanding the comment you replied to, they are not saying that voice AI are bad, they are saying there is not enough training data to improve the AI for these languages. How will it improve without good training data?

          • Dr. Moose@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 day ago

            Thats not how AI training works and even then there’s absolutely enough data. Also training data can be created and even synthesized. There are many techniques to extract make training value from datasets that we discover every year - It’s really not a problem you think it is.

            I’m genuinely confused how AI illiterate users here are. It’s just blind leading the blind.

        • Jhex@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          ·
          1 day ago

          Is it? I just tried again yesterday for a simple script since coding is the one thing apparently AI will replace people like me and it could not put together a working JavaScript script.

          I have yet to see tangible results not announced by the people with sunken cost exploding their balls.

          • Dr. Moose@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            edit-2
            1 day ago

            Sounds like a skill issue my dude. While you struggle to get a js script people are putting out entire programs with AI assistants so sure - you’re right and they’re wrong

              • Dr. Moose@lemmy.world
                link
                fedilink
                English
                arrow-up
                1
                ·
                23 hours ago

                Yes, to effectively use AI you actually have to understand the medium you’re in to describe the problem you’re trying to solve. You can get there with prompting but it’ll take you much longer if you just don’t understand code yourself.

                Thats why most senior software devs are not afraid of LLMs cause they need strong oversight and thats exactly what years of software dev experience trained you to do.

                • Jhex@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  1
                  ·
                  edit-2
                  16 hours ago

                  I’m a programmer with 22 years of experience. I understood the code well enough having written the solution myself the day before; I was precisely trying to see if AI would be useful with this example as it was a tad above basic stuff but not niche at all…

                  It failed miserably. The code ran but didn’t do anyithing at all or it did the wrong thing 4updating the wrong column for example). It would often ignore my requirements in favour of something easier

                  The worst part is it kept saying it “got it” and telling me some bs about why it didn’t work just to not correct it

                  Thats why most senior software devs are not afraid of LLMs cause they need strong oversight and thats exactly what years of software dev experience trained you to do.

                  what’s the point of this? If it cannot provide clean code and I have to check every line myself, I rather work with a junior who would usually do better, actually learn from my feedback and their experience and eventually become an independant asset

                  Stop drinking the kool aid

      • JohnEdwa@sopuli.xyz
        link
        fedilink
        English
        arrow-up
        1
        ·
        edit-2
        1 day ago

        That’s the benefit of using AI and machine learning - once you have enough source material, you can throw it all in and it’ll eventually spit out a model.
        Which is exactly what Meta did with their Massively Multilingual Speech project which supports text-to-speech and speech-to-text for 1107 different languages.

        Is it actually any good in 99% of them, I don’t have a clue, but it exists.

        • taladar@sh.itjust.works
          link
          fedilink
          English
          arrow-up
          1
          ·
          1 day ago

          Seems more like a proof of concept project for that paper than something they are pursuing seriously judging by the GitHub location in some example folder that hasn’t seen any significant updates in over a year. If it is so great I would assume they would pursue it more actively and replace existing models with it two years later.

    • utopiah@lemmy.world
      link
      fedilink
      English
      arrow-up
      2
      ·
      1 day ago

      dont even have enough professional voice actors to cover the bandwidth

      I’m pretty sure they’d be a lot more people ready to do that job if there was a good remuneration. Heck that sounds a lot more fun that a LOT of jobs out there!

      • Dr. Moose@lemmy.world
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 day ago

        Sure but that’s not how free markets work. If there’s only 3 million consumers you can’t afford 3 million voice actors but you can afford 3 million AI renders.

        • utopiah@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          ·
          20 hours ago

          I’m not an economist but… 1 voice actor can serve 3 million consumers if they listen to the same content.

          Anyway that’s not even my point, my point is that it is possible to cover, we as a society, driven both by VC with strategies of capturing markets (so precisely going against “free” market as an ideal) and consumers are making choices (like when one buys from the local farmer market vs Amazon deliveries). If though we, while fully understanding the consequence of such choice (namely how the sausage is made, here how AI models are trained and then run), believe it’s not valuable then sure, we can make that choice.

          I’m just warning consumers then that if they don’t pay for quality content made a certain way, they can’t complain that they in turn don’t get the job they wanted because nobody out there is ready to pay for it.

          2 sides of the same coin.