Nerd-dictation, hackable speech to text on Linux

202 points by ideasman42 over 3 years ago

11 comments

yjftsjthsd-h over 3 years ago
This is better than any other speech-to-text setup I've ever encountered, for one simple reason: I followed the dead-simple install steps in the readme, started the program, *and it worked.* Bonus points for the install being a git clone and pip install away. I don't know why this is a hard bar to clear, but bravo. (I *suspect* that it's because a lot of FOSS speech recognition is from academia where "follow the following 13 steps, including hand-crafting recognition parameters" is more normal and acceptable because everyone involved is already a domain expert, whereas I, as a user, just want "plug in a mic, run this thing, and get text on stdout".)
2Gkashmiri over 3 years ago
You know... I have an idea. How about we use vosk and this tech to integrate with ffmpeg somehow so that peertube videos can get subtitles while being transcoded. Once we get English SRT, we could use libretranslate to translate that English SRT to multiple languages.

This could be similar to what YouTube does with its automatic subtitles. What do you guys say?
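One piece of the pipeline proposed above, turning recognized words into SRT cues, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the input is assumed to be a list of dicts with `word`, `start` and `end` keys (times in seconds), modeled on the word-level results a recognizer such as Vosk can emit, and the names `format_timestamp` and `words_to_srt` are hypothetical. Extracting audio with ffmpeg and translating with libretranslate are left out.

```python
from typing import Dict, List


def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    millis = int(round(seconds * 1000))
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def words_to_srt(words: List[Dict], max_words: int = 7) -> str:
    """Group timed words into numbered SRT cues of at most max_words each.

    Assumed input shape (hypothetical, for illustration):
    [{"word": "hello", "start": 0.0, "end": 0.5}, ...]
    """
    cues = []
    for index, first in enumerate(range(0, len(words), max_words), start=1):
        chunk = words[first:first + max_words]
        text = " ".join(w["word"] for w in chunk)
        begin = format_timestamp(chunk[0]["start"])
        end = format_timestamp(chunk[-1]["end"])
        # An SRT cue is: sequence number, time range, text, blank line.
        cues.append(f"{index}\n{begin} --> {end}\n{text}\n")
    return "\n".join(cues)


if __name__ == "__main__":
    sample = [
        {"word": "hello", "start": 0.0, "end": 0.5},
        {"word": "world", "start": 0.6, "end": 1.25},
    ]
    print(words_to_srt(sample))
```

Splitting on a fixed word count is the simplest possible cue policy; a real subtitle generator would more likely break on pauses or sentence boundaries.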
abetusk over 3 years ago
I've never even heard of VOSK-API [0], the underlying offline speech to text engine that this project uses.

Does anyone have experience using it? Is it any good?

[0] https://github.com/alphacep/vosk-api
allanrbo over 3 years ago
Nice. Another notable mention in this space is Talon. Useful for automating all OS tasks with voice commands, as well as just dictation: https://talonvoice.com/
phantom_oracle over 3 years ago
This is such an amazing technology for the many tech people who have to deal with hand/finger/elbow issues after years of extensive keyboard use.

I was looking for this type of tech for at least 2 years and I am glad it now exists.

FOSS is amazing!
zelphirkalt over 3 years ago
Has anyone used this inside Emacs, or does anyone know how to make Emacs take its output and put it into a buffer?
sundarurfriend over 3 years ago
I was wondering how well it dealt with accents, then I saw that the Vosk API page specifically mentions "English, Indian English, German, French, ..." :D I don't know the story behind "Indian English" specifically being listed as a separate language, but I'm glad to see it's supported.
zoomablemind over 3 years ago
Vosk, is it "wax" in Russian ("воск")?

It makes me think of wax recording rolls, the CDs of the old days, aka phonograph cylinders:

https://en.m.wikipedia.org/wiki/Phonograph_cylinder
kristopolous over 3 years ago
I'm throwing another hat in the ring: this technology totally works most of the time. I used it to write this comment.

This should make my life a lot easier because I find myself going to my phone and using the dictation feature a lot recently. It's not as good as the one on my Android, but it's 95% of the way there.
deknos over 3 years ago
Is there a good offline program for text to speech for German, French, Spanish and English? And no, festival and espeak are not what I would consider good.

The AT&T website's text to speech, whose audio files were used in these anonymous publications, is good, but espeak is not. If I had something like this for European (and Russian and Arabic) languages as open source standalone software, I would be happy :(
suifbwish over 3 years ago
Very cool. Does it have an erotic voice? Asking for a friend.