Show HN: An open-source, self-hostable synced narration platform for ebooks

338 points by smoores over 1 year ago

Hi, I made a thing! This is by far the most work I've ever sunk into a side project; I've been working on this thing for over two years, and I'm super proud of it, even though there's still a lot more to do!

Storyteller is a self-hosted platform for ebooks with synced narration. This is basically self-hosted WhisperSync, for anyone familiar with that Amazon product.

It's currently made up of two self-hostable backend systems and a mobile app for reading and listening to the books it produces. Technically it uses an open spec, EPUB 3's "Media Overlay", for syncing the narration, but very few ebook apps actually support Media Overlays, and even fewer work well and have nice interfaces.

The mobile app is available on the Apple App Store as "Storyteller Reader", and I plan to release it for Android as well early next year.

Anyway, I hope someone finds this interesting or useful!
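For readers unfamiliar with Media Overlays: the spec pairs each text fragment in the book with a clip of the audio file through a small SMIL document. The sketch below is only an illustration of that pairing, not Storyteller's actual code; the function name `build_overlay`, the file names, and the timestamps are all made up for the example.

```python
import xml.etree.ElementTree as ET

# Namespace used by SMIL documents in EPUB 3 Media Overlays.
SMIL_NS = "http://www.w3.org/ns/SMIL"


def build_overlay(text_file, audio_file, clips):
    """Build a minimal Media Overlay document as a string.

    `clips` is a list of (fragment_id, start_seconds, end_seconds) tuples.
    Hypothetical helper for illustration only.
    """
    ET.register_namespace("", SMIL_NS)
    smil = ET.Element(f"{{{SMIL_NS}}}smil", {"version": "3.0"})
    body = ET.SubElement(smil, f"{{{SMIL_NS}}}body")
    for index, (fragment_id, start, end) in enumerate(clips, start=1):
        # Each <par> ties one text fragment to one audio clip.
        par = ET.SubElement(body, f"{{{SMIL_NS}}}par", {"id": f"par{index}"})
        ET.SubElement(
            par, f"{{{SMIL_NS}}}text", {"src": f"{text_file}#{fragment_id}"}
        )
        ET.SubElement(
            par,
            f"{{{SMIL_NS}}}audio",
            {
                "src": audio_file,
                "clipBegin": f"{start:.3f}s",
                "clipEnd": f"{end:.3f}s",
            },
        )
    return ET.tostring(smil, encoding="unicode")


if __name__ == "__main__":
    # Two sentences from a hypothetical chapter, with invented timings.
    print(
        build_overlay(
            "chapter1.xhtml",
            "chapter1.mp3",
            [("sentence1", 0.0, 2.5), ("sentence2", 2.5, 6.1)],
        )
    )
```

A reading app that supports Media Overlays can use these pairs to highlight a sentence while its clip plays, or to jump to the matching audio position from a point in the text.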

26 comments

r4victor over 1 year ago

Amazing! I made a similar ebook-audiobook aligner years ago: <a href="https://github.com/r4victor/syncabook">https://github.com/r4victor/syncabook</a>. At that time, I chose to synthesize the text and align two audio sequences because I found text-alignment approaches (including ML-based ones) too compute-intensive and inadequate for long texts. I see Storyteller works by aligning the texts. Could you give some idea of how long it takes to sync a book?

Also, my experience was that audio and text versions are often very different (e.g. the audio having an intro missing from the text). It'd be very interesting to know how well Storyteller handles such cases. Does it require manual audio/text editing or handle the differences automatically?

sphars over 1 year ago

This is really neat, it's something I hadn't thought about before. I've started listening to audiobooks on my commute, but I read at night. I currently use audiobookshelf[0] to listen to my audiobooks, and it has support for ebooks as well. I've added a comment[1] on a discussion asking whether audiobookshelf could read the EPUBs your tool creates.

[0]: <a href="https://www.audiobookshelf.org/" rel="nofollow noreferrer">https://www.audiobookshelf.org/</a>

[1]: <a href="https://github.com/advplyr/audiobookshelf/issues/189#issuecomment-1868505882">https://github.com/advplyr/audiobookshelf/issues/189#issueco...</a>

rpxio over 1 year ago

I absolutely love this. However, my wife and kids all read EPUBs on Kobo e-readers, so I wish we could somehow sync the last page read from Kobo to Storyteller so that we could pick up in the audiobook later. I’m not opposed to installing KOReader on all of our Kobos either if that would be required for syncing… it does look like KOReader doesn’t support EPUB 3 Media Overlays, but it does have a sync feature.

smoores over 1 year ago

Wow, this really blew up while I wasn't looking! Thank you everyone who's popped in here to ask questions and give feedback. If anyone does spend some time trying to set this up, please don't hesitate to hop into our Gitter channel (<a href="https://smoores.gitlab.io/storyteller/docs/say-hi" rel="nofollow noreferrer">https://smoores.gitlab.io/storyteller/docs/say-hi</a>) and say hi or ask for support or give feedback.

0x073 over 1 year ago

More information would be nice: a link to the iOS app, screenshots, or a list of the features the project has.

Is it an ebook/book library like audiobookshelf with sync, or just sync? (<a href="https://www.audiobookshelf.org/" rel="nofollow noreferrer">https://www.audiobookshelf.org/</a>)

mike986 over 1 year ago

Super cool project!

> even though there's still a lot more to do

A few have asked on this thread already, but since you're already using AI to transcribe, it would be super cool if we could use AI to generate audio using TTS.

I quit Audible (after signing up a few times) because there are very few high-quality audiobooks; even those read by the authors are bad (most of them are not professional narrators).

A good AI would be amazing, as it never gets tired of speaking for hours while maintaining the same energetic voice, intonation, and pace.

sandreas over 1 year ago

This is pretty interesting...

I once wrote a similar thing for building a custom LJSpeech dataset out of ebook/audiobook combinations, to synthesize my favorite narrators' voices using coqui-tts and the VITS model and make them "publish" books that never came out as audiobooks.

It was able to synchronize the book contents to timestamps, split the spoken word into sentences, and create an LJSpeech dataset out of the combinations. I used aeneas[1]; it was a bit finicky to set up, but after a while it was even able to map non-English languages (in my case German) with more than 80% accuracy. It worked out pretty well and the LJSpeech datasets were good (I still have them here), but the TTS tech was not there yet :-) Maybe it's time to revive this project using newer modelling approaches like XTTS or something...

[1]: <a href="https://www.readbeyond.it/aeneas/" rel="nofollow noreferrer">https://www.readbeyond.it/aeneas/</a>

cyberax over 1 year ago

You didn't include the link: <a href="https://smoores.gitlab.io/storyteller/" rel="nofollow noreferrer">https://smoores.gitlab.io/storyteller/</a>

Looks super nice. The next step is to build a fully synced ecosystem for book management.

qwerty456127 over 1 year ago

What I really want from the new era of machine learning we are supposedly going through is human-quality, self-hosted text-to-speech and speech-to-text, so that I could listen to text ebooks and convert long podcasts and video/audio lecture courses to text, making it easy to search through them and quote phrases from them. Is this it? Everything I have found so far was either significantly worse than what a human could do or an expensive online service.

bberenberg over 1 year ago

Amazing, I’ve been wanting something like this for years. If only Libby would integrate this so it could be used with rented books.

It would be great if you could add a link to the app on the App Store.

timmb over 1 year ago

Looks great! Is there an e-ink e-reader it’s compatible with? Would love to abandon the Amazon castle but could not go back to reading on a screen.

jupiter909 over 1 year ago

Looks like an interesting project.

I'd strongly suggest adding a quick intro demo video and/or screenshots; for a tool like this, they would really benefit the project.

chrisweekly over 1 year ago

Awesome! Thanks for sharing and working on this! WhisperSync functionality is a game-changer; it's one of the main reasons I'm able to read so much (switching modalities several times per day). I'd love to see this featureset become ubiquitous instead of being so tightly coupled to proprietary, DRM'd Amazon / Audible.

majora2007 over 1 year ago

Looks really nice. I wanted to do exactly this with my project Kavita, but have been distracted with other things. I've heard Whisper has great potential and a few of my users have been doing something similar with it.

Looking forward to seeing how this project matures. We need more options in the book reading scene that are self-hosted and not Calibre.

zachlatta over 1 year ago

This looks absolutely incredible, and like something I’ve been trying to find for years! Thank you so much for building this!

roywashere over 1 year ago

How does the narration work? Is it automatically generated? For a year now I have had a long commute and have been listening to audiobooks. However, I find that the narration varies wildly in quality, and I think text-to-speech might often actually be better.

atmosx over 1 year ago

Good job! I'm probably going to use this. Would love to have my collection accessible from mobile. A small nit: it would be great to have non-Docker installation instructions readily available.

pseufaux over 1 year ago

This looks awesome. I might be missing it somewhere, but what's the minimum required hardware to run something like this locally?

causality0 over 1 year ago

Can this function as "Plex for audiobooks"? I don't really have a need for synced books but it would be nice to keep fewer audiobooks on my phone.

mosselman over 1 year ago

Is there a demo of the narration? I couldn’t find one.

joshstrange over 1 year ago

This is super cool, I love my audiobook app (Prologue) but this could tempt me away. Looking forward to setting this up and trying it out!

t0mk over 1 year ago

Is there a tool that would convert an ebook to a single MP3 (or a set of MP3s)?

ZunarJ5 over 1 year ago

Thank you for your hard work!!

grigio over 1 year ago

Does it sync the reading progress of the ebook among clients?

klakierr over 1 year ago

Does this work only for DRM-free ebooks and audiobooks?

snapplebobapple over 1 year ago

man.. if someone could hook the creation service into audiobookshelf this could be an extremely potent combination..