ElevenReader

305 points by mfiguiere 3 months ago

56 comments

xnx 3 months ago

Zonos is a new open weights text-to-speech model that has quality at least as good as ElevenLabs: https://www.zyphra.com/post/beta-release-of-zonos-v0-1

csantini 3 months ago

You can get pretty close with open source software: https://claudio.uk/posts/audiblez-v4.html

emptysongglass 3 months ago

I would never trust the company that acquired Omnivore only to sunset it with 2 weeks' notice to retrieve data. Companies won't stop pulling this garbage unless we stop supporting them.

rickcarlino 3 months ago

I wish there was a reader app that was serious about text to speech. This is not it, unfortunately. Reader apps need to focus on a text to speech experience that is identical to a music player so that you can use the app in hands-free situations. The app is also hard to use as a “read it later” tool on iOS.

I was really hoping they would fix these issues by now because it was promising. This app truly does feel like a portfolio demo app for a text to speech engine company rather than an actual reader app.

UPDATE: yes, I have actually used the app; no, it does not work well. See replies for details.

woadwarrior01 3 months ago

Hasn't this been around for ~4 months? Interesting to see this here, since their competitor, Zyphra, just released two Apache 2.0 licensed open weights TTS models yesterday [1].

[1]: https://news.ycombinator.com/item?id=43004589

BeetleB 3 months ago

If you want free/ultracheap, the Google Cloud TTS is good enough for simple use cases. You get enough free minutes that it may end up being free (I think I've paid a cent so far).

Some of their voices sound very artificial, some very real. I've been slowly making a list of the good ones.

I use it to convert long articles into audio, and have a script to add it to my podcast feed to listen to while driving: https://blog.nawaz.org/posts/2024/Apr/reading-articles-via-podcast-software/
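
For readers curious what the "add it to my podcast feed" step might involve, here is a minimal sketch using only the Python standard library. It is not BeetleB's script: the function names, feed title, and URLs are hypothetical, and it assumes the audio file has already been produced by a TTS service and uploaded somewhere reachable. It only shows how an RSS 2.0 item with an audio enclosure can be assembled.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from email.utils import format_datetime


def new_feed(title, link, description):
    """Create a bare RSS 2.0 document and return (rss, channel)."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    return rss, channel


def add_episode(channel, title, audio_url, size_bytes, published):
    """Append one <item> whose <enclosure> points at the audio file."""
    item = ET.SubElement(channel, "item")
    ET.SubElement(item, "title").text = title
    ET.SubElement(item, "guid").text = audio_url
    # RSS expects RFC 822 style dates; format_datetime produces them.
    ET.SubElement(item, "pubDate").text = format_datetime(published)
    ET.SubElement(
        item,
        "enclosure",
        url=audio_url,
        length=str(size_bytes),
        type="audio/mpeg",
    )
    return item


if __name__ == "__main__":
    # Hypothetical feed and episode values, for illustration only.
    rss, channel = new_feed(
        "Articles to listen to",
        "https://example.com/feed.xml",
        "Long articles converted to audio",
    )
    add_episode(
        channel,
        "Some long article",
        "https://example.com/audio/some-long-article.mp3",
        1234567,
        datetime.now(timezone.utc),
    )
    print(ET.tostring(rss, encoding="unicode"))
```

A podcast app subscribed to the resulting feed URL would then pick up each new item; hosting the XML and the audio files is left out of the sketch.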

hiAndrewQuinn 3 months ago

This is excellent. I just tested the Finnish voices on my simple news archive [1], and the pronunciation was quite good and clear.

It's unfortunate that I can't export audio clips locally; otherwise I would immediately look into using this for generating my Finnish flashcard decks from the same material [2]. I've thought about doing the same with the audio and video feeds included with this news broadcast, but getting Whisper to sync up properly with what's written down and cutting up the raw audio in that way still seems like more effort than I'm willing to invest right now.

[1]: https://hiandrewquinn.github.io/selkouutiset-archive/

[2]: https://github.com/Selkouutiset-Archive/selkokortti

Kabukks 3 months ago

Last time I tried ElevenLabs for German text, it got a lot of numbers and dates wrong, e.g. saying "1963" when the actual year in the text was 1967. Yeah, the voices sound very realistic. But I'm not sure how useful that is if you can't trust the spoken words.

Does anyone know if it got better in the last few weeks?

bjackman 3 months ago

Really glad these products are appearing!

So much of my time for "reading" is in a context where I can't physically read, so audiobooks are incredibly useful. But being limited to the set of books that gets recorded by the publisher is a real shame.

Haven't tried it yet, but AI TTS seems basically perfect now, so I'm very optimistic this will work great.

wedn3sday 3 months ago

I immediately copy/pasted in some smut to check if it was going to lecture me on my moral failings and was pleasantly shocked to find a corporate AI model that did what I asked without pushing puritanical nonsense on me.

barrell 3 months ago

Been using ElevenLabs for several years now. I was really impressed with their multilingual model a few years ago.

Since then, they’ve released a few cheaper models, but the quality suffers greatly (they still have the old models though, so it’s not an issue). They’ve also been releasing a ton of different products around TTS.

I don’t mean this as a criticism; I'm just curious why SOTA TTS has not improved beyond one model released by one company several years ago, and why even that company isn’t able to improve on that model.

sky2224 3 months ago

The video shows scenarios of people listening to PDFs of pretty dense material (e.g., computer science, biomechanics).

Does anyone here actually have positive results doing this? It seems to me that listening to anything even remotely complex with the intent of learning it just isn't feasible.

theothertimcook 3 months ago

This is so impressive. No audiobook exists? Drop an EPUB into ElevenReader and have Burt Reynolds read it to you. Honestly better than some human narrators.

benrutter 3 months ago

I've been looking for a good and convenient way to read papers that are published as PDFs for a while.

Ideally, I'd be able to strip out the text content and send it to my Kindle in readable form. Since apparently that's science fiction, this looks like a really good plan B! Will definitely give it a go.

darkwater 3 months ago

I know I'm growing old, but this is the kind of tech application that I don't like. Art should be the last thing to be done 100% by a program. Enhancing artists' capabilities? Hell yeah. Completely replacing voice actors? No, thanks.

randysalami 3 months ago

I’ve actually used this extensively for months now since it’s free and works with PDFs I’ve downloaded off the internet. I was so frustrated with ridiculously overpriced TTS (must pay for annual sub! no monthly) when I found this gem.

My main use case is comp. sci and philosophy books. I download PDFs of varying quality off the internet onto my phone and import them into this app. The text translation is always solid, but for the former, graphs and diagrams really break it. It’s a tricky problem because these are often important to the text, so skipping them (for the app) isn’t ideal, but the current solution just makes the reader goof up. I think it would be cool if the model could identify these objects, generate some text describing each one, and TTS that. It's a minor gripe, and for the latter, it’s perfect.

I’ve probably used this app for 70 reading hours at 1.5x speed across long road trips and walking my dog at the park. I’ve gotten through numerous books I wouldn’t have otherwise, and for free. I’m happy!

(Annoying bug I find often: it seems certain characters or tokens just break it and it freezes. I need to manually skip ahead, hoping it doesn’t get stuck again. It really detracts from the hands-free nature and is difficult to manage while driving.)

_qua 3 months ago

I recognize and appreciate that this is free right now. But surely it won't always be. And I can't keep paying $10-20/mo for every individual AI tool.

cube2222 3 months ago

So, I wanted to like this, but frankly the quality isn't fantastic.

The text to speech is alright, but it lacks almost any emotion, and it reads everything literally, which doesn't sound natural when the article/PDF has a weird layout or has figures. Though I expect they're just not using their top-of-the-line models for this. I've had much more luck pushing a PDF through Claude to generate the "verbal version" (which is mostly literal, but also describes the layout and figures) and then the result through the top-of-the-line ElevenLabs model.

Now, I've also checked out the podcast feature, and it's pretty clear they first do a textual generation, and then a simple text to speech. Again, lack of emotion, very mechanical flow.

I made a podcast of a technical article [0] in both ElevenLabs reader and Google's NotebookLM, and the NotebookLM podcast is a night-and-day improvement. Maybe they use a better model, maybe they use straight "article to podcast" end-to-end multimodal generation, I don't know, but the quality, flow, and emotion are just on a completely different level. I had to quickly turn off the ElevenLabs-generated podcast because I couldn't keep listening to it, while NotebookLM's one is legitimately enjoyable.

Now to finish on a more positive note, fingers crossed for the ElevenLabs team improving this, and us getting some competition in the area of article-to-audio, both podcast-style and direct! I think, in general, it's a very promising product direction. Feature-wise, I would also love to get a daily overview podcast based on all my RSS feed articles for a given day.

[0]: https://huggingface.co/blog/modernbert

nmca 3 months ago

I’ve listened to a few audiobooks on long drives, and have been surprised how hard it is to find good voices on Audible. Often a book that might otherwise be good has a prohibitively annoying tone. So honestly the exciting thing here is the customisation.

That said, even in their cherry-picked samples the emphasis still isn’t quite right in the Tolkien example.

smoothbenny 3 months ago

Tried this app last week w/ an EPUB. It read all of the drop caps as individual letters, before moving on to the remaining portion of the word. It said “tilde” before each item in an unordered list. Too distracting to be of any practical use for me, unless there’s a setting I missed.
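
One possible workaround for the two symptoms described above is to clean the extracted text before it reaches a TTS engine. The sketch below is an assumption-heavy illustration, not anything ElevenReader is known to do: it assumes list items arrive as lines starting with a marker such as `~`, `•`, `*`, or `-`, and that a drop cap arrives as a lone capital letter separated by a space from the rest of its word at the start of a line. The name `clean_for_tts` is hypothetical.

```python
import re

# Leading list markers that an engine might read aloud ("tilde", "asterisk").
BULLET = re.compile(r"^[ \t]*[~•*\-][ \t]+", re.MULTILINE)

# A lone capital at the start of a line followed by lowercase text,
# e.g. "T he night". "A" and "I" are skipped because they are real words.
DROP_CAP = re.compile(r"^([B-HJ-Z])[ \t]+(?=[a-z])", re.MULTILINE)


def clean_for_tts(text):
    """Strip list markers and rejoin drop caps with their words."""
    text = BULLET.sub("", text)
    return DROP_CAP.sub(r"\1", text)


if __name__ == "__main__":
    sample = "T he night was dark.\n~ first item\n~ second item"
    print(clean_for_tts(sample))
```

Drop caps whose letter is "A" or "I" are left alone by design, so a heuristic like this would still miss some cases.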

jnsaff2 3 months ago

It seems that this is using one of the less refined models. In English it sounds like a 4th grader reading in front of a class. Kinda stilted word by word voicing with static pauses between words and no variation in intonation. Tried with two voices and both are the same.

codybontecou 3 months ago

I just wish this had a Chrome extension so I could listen to articles while on my computer.

zeroq 3 months ago

A friend of mine is an actor (cinema, TV, theater) who makes a significant amount of his income as a voice actor.

For a long time I've wanted to make a game (think The Stanley Parable or Thomas Was Alone) that would be narrated by the voice of either David Attenborough or Morgan Freeman. You know, it's low-hanging fruit: you can have two hours of footage of zebras running around narrated by either of them and it's suddenly eerily fascinating.

So far I've been an AI skeptic, but this voice thing really makes me think about an actual shift in how certain jobs can become irrelevant in the foreseeable future.

reustle 3 months ago

I’ve been using this for a few weeks, and it works great. Can’t wait until this is built natively into browsers or even the OS (the iOS voice is currently terrible).

andrewstuart 3 months ago

TTS seemed to take a great leap forward a few years ago and seems to have stalled again.

Services are expensive and in most cases the voices are easily detectable as not human. I would find it very hard to listen to such voices for a long period of time.

Even ElevenLabs voices, which seem to be known as the best, have only a few that are really good quality, and even then they're very, very far from the capabilities of a human.

saeedesmaili 3 months ago

It's a nice idea, but pretty much useless without a Pocket integration or an API to programmatically import content.

milofeynman 3 months ago

This raises an interesting question around the rights of the author/publisher and who they sold their ebook rights to. If in 3 years we have a perfect AI voice that can read any book as well as or better than mid-level narrators, why would you ever buy an audiobook when you could just buy the ebook and pick your voice(s)? What a time to be alive.

mkmk3 3 months ago

Damn, tried a unicornriot article [1] and it just skipped several paragraphs past the grisly stuff.

Can anyone else confirm?

[1]: https://unicornriot.ninja/2024/sextortion-coms-inside-a-vile-child-exploitation-cult-run-by-nazi-linked-teens/

macco 3 months ago

How is the quality compared to Speechify?

I use it to listen to PDFs. It works, but has plenty of hiccups with headers, footers and colons.

jacek 3 months ago

I love the idea as I listen to a lot of podcasts and an occasional audiobook.

The first impression is not that great. There's nothing natural about the voice. While individual words and phrases sound good, there's still no decent cadence and intonation. Feels flat and robotic.

However, I will definitely experiment some more.

davidanekstein 3 months ago

I use ElevenLabs to narrate tutorials for my app and I’m a happy customer thus far.

Here is an example: https://youtube.com/shorts/UKjqrydITLA?si=iC7ehp6LmlLH0M-U

zoba 3 months ago

I've been enjoying this app except I could not find a way to export the content to an audio file. I want to send the content to others - I'd even take a link to a website with a Play button (just not one that forces an app download)

crakhamster01 3 months ago

The generative podcasts feature feels so dystopian. I didn't realize this SNL skit was based off of a real product lol

https://www.youtube.com/watch?v=ua4rYsMdC4U

wink 3 months ago

> Application error: a client-side exception has occurred (see the browser console for more information).

Probably because I have WebGL disabled in this browser. Not exactly sure what they're doing with it on the landing page, maybe the fluffy effects.

berbec 3 months ago

I have used Moon+ Reader [1] for years with the built-in Android TTS service. It works very well, is free, and sounds good enough for me.

[1]: https://moondownload.com/

t0lo 3 months ago

This is definitely the future. I'm worried about the electric slip and slide world we're heading into though, where everything is completely spoonfed and consumptive. I can't help but think we're heading back into animalism.

mozzieman 3 months ago

The best I've heard, but still too monotone over time compared to real productions. I felt blown away at first, but listening to a chapter or two gets difficult. Most likely it's just a matter of time until it becomes as good as or better than the real thing.

juliendorra 3 months ago

You should try it with your own voice! (By first creating a custom voice on the web interface. The quick basic clone should be enough.)

I found that it’s my preferred way to use their reader, as it makes the reading more neutral and transparent for my brain.

sys32768 3 months ago

I was briefly excited to try this on out-of-print books I find on Google Books, but alas the OCR in Acrobat Pro is super glitchy.

I need to find some AI-assisted OCR to fix tons of mistakes like "186o" for 1860 or "gla)" for glad.
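
As a rough illustration of how the "186o" class of mistake could be caught without any AI, here is a small rule-based sketch. The lookalike table is an assumed, non-exhaustive list, and `fix_years` is a hypothetical helper; it only touches four-character tokens that already contain at least two real digits, so ordinary words are left alone. Errors like "gla)" for glad are out of its reach and would need a dictionary or language-model pass.

```python
import re

# Letters that OCR commonly confuses with digits (assumed, non-exhaustive).
LOOKALIKES = str.maketrans(
    {"o": "0", "O": "0", "l": "1", "I": "1", "S": "5", "B": "8"}
)

# Four-character tokens made only of digits and lookalike letters.
TOKEN = re.compile(r"\b[0-9oOlISB]{4}\b")


def fix_years(text):
    """Repair year-like tokens such as '186o' while leaving words alone."""

    def repl(match):
        token = match.group(0)
        # Require at least two genuine digits so that a word like "Solo",
        # which consists entirely of lookalike letters, is not rewritten.
        if sum(ch.isdigit() for ch in token) >= 2:
            return token.translate(LOOKALIKES)
        return token

    return TOKEN.sub(repl, text)


if __name__ == "__main__":
    print(fix_years("First printed in 186o, reissued in l9o5."))
```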

dazzaji 3 months ago

I rely on ElevenReader several times a week for quick text to voice on snippets of text I’m working on, or sometimes on full web pages when I hand it a URL. It’s quick and easy to use, and the performance and quality are high.

jdlyga 3 months ago

The voices are excellent, but the app needs work. It lost my place in a book a few times, so I switched back to VoiceDream (don't use VoiceDream, it stinks unless you're a legacy purchaser).

unbecoming 3 months ago

As a first impression, French-sounding names should be read as French-sounding, even in English text. The voice per se is OK, but as delivery goes (pausing, title vs content), it could be better.

mindwork 3 months ago

Downloaded the app and inserted 3 publicly visible URLs. Only the last one could be downloaded and listened to. Not sure whether it's their User-Agent string or what.

cooper_ganglia 3 months ago

The company I work for has been using ElevenLabs to translate hour-long programs into Spanish, French, Portuguese, Greek, German, and Chinese. We have a large international audience, so it's worked great for this purpose!

Before, we were hiring people to translate, and then hiring others to dub the audio. Now, our files are automatically translated and spoken in the voice of the actual speaker, and we just have a small Quality Control team of native speakers quickly verify the results are accurate. We've reduced costs and increased the quality of our translated media.

__rito__ 3 months ago

Is there a pricing page? I am not seeing any.

whazor 3 months ago

Is there any technology that can do separate voices for each individual person speaking in an audiobook?

Kerbiter 3 months ago

Would've been great as a TTS component that could be installed and used in existing e-readers.

b33f 3 months ago

Is this streaming server-side audio, or is the TTS running locally on device? Can it work offline?

flakiness 3 months ago

Are there any good papers from which I can learn about recent developments in TTS tech?

tarponjargon 3 months ago

Try https://clipcast.it

It ingests URLs in a variety of ways, converts them to natural language audio, and puts the result in your podcast feed.

Free to use.

gtirloni 3 months ago

The ToS are a nightmare, as usual for these services.

leumon 3 months ago

Unfortunately the app is not compatible with Android 15.

layer8 3 months ago

Does this work for reading articles on websites?

jeswin 3 months ago

The ad shows someone listening to an article or a story while driving a large vehicle - this is unsafe (depending on the individual). It's not like listening to music.

ratedgene 3 months ago

Honestly, why isn't this same service baked into my OS? The reader there is really atrocious, but I imagine that even for a single voice, a pretty small model could be downloaded and made available as a plugin for the reader app.

yapyap 3 months ago

yeah, no thanks.

if you are reading for information, I guess if this helps, sure go ahead.

when reading for pleasure, this is not it though.