TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Project Common Voice

205 pointsby mhr_onlinealmost 8 years ago

16 comments

albertzeyeralmost 8 years ago
The terminology is a bit confusing. They are saying that they want to build voice recognition but it seems like they actually might want to build a speech recognition engine. Speech recognition is about recognizing the speech, the spoken words. Voice recognition is about recognizing the speakers voice, i.e. identifying the speaker. Also, maybe they also want to build a text-to-speech (TTS) system but I&#x27;m not sure.<p>No matter what, the collected data might be useful for all of that, maybe except of voice recognition actually, because I guess the data will be collected anonymously?<p>Note that there are some other existing big open speech corpora such as LibriSpeech (<a href="http:&#x2F;&#x2F;www.openslr.org&#x2F;12&#x2F;" rel="nofollow">http:&#x2F;&#x2F;www.openslr.org&#x2F;12&#x2F;</a>) which could already be used right now to build a quite good speech recognition system.
评论 #14796617 未加载
评论 #14795647 未加载
评论 #14795312 未加载
评论 #14795224 未加载
apeddlealmost 8 years ago
This looks great! I use voice control to program on occasion due to an rsi injury. The standard stack for this is a mess due to closed source systems that aren&#x27;t designed for voice programmers. A good open solution could really save me from a lot of headaches.
评论 #14794921 未加载
评论 #14795365 未加载
cooper12almost 8 years ago
If they&#x27;re planning to make a voice recognition system, why are they using example statements that are clearly taken from novels? [0] That&#x27;s not how real people talk. They use a lot more slang, a lot more stopping and starting, filler words, etc. Instead you have people saying things like &quot;irresolute&quot;, &quot;rumbling&quot;, and other complex words. It would be useful for training a novel dictation system, but it&#x27;s not how people would speak to their browser for example.<p>[0]: An example sentence is &quot;a thin circle of bright metal showed between the top and the bottom of the body of the cylinder&quot;, which is from H. G. Wells&#x27; <i>War of the Worlds</i>.
评论 #14795147 未加载
glandiumalmost 8 years ago
Sadly, in Demographic Data, only native english accents can be selected.
评论 #14795462 未加载
pebersalmost 8 years ago
Is the data going to be freely available as well? It&#x27;s a little unclear whether they intend to make it separately available or not.
评论 #14794924 未加载
评论 #14794879 未加载
评论 #14796677 未加载
评论 #14794976 未加载
therealunrealalmost 8 years ago
Any plans for languages other than English?
评论 #14795459 未加载
jlduggeralmost 8 years ago
And... 503&#x27;d. I didn&#x27;t catch what the intended use case was before it died, but I&#x27;m guessing computer generated voice?<p>Most of the computer generated stuff I&#x27;ve seen uses trained actors. Which neatly avoids the problem of trying to reconcile a myriad of accents and dialects, which was immediately apparent from the first two samples I tried.<p>edit: back up, seems to be about voice recognition, which this could help with no problem.
评论 #14794774 未加载
评论 #14796689 未加载
eatbitseverydayalmost 8 years ago
It would be useful to collect data from non-native speakers of a language. More and more such individuals are appearing in all countries, and devices that accept spoken words should not break because of someone&#x27;s level of command of a spoken language. For example, a Swiss speaking German (Hochdeutsch), or more clearly, a Brit speaking French, etc. Some children who grow up in multi-lingual families also intermix words from multiple languages into their sentences. We can still understand them.
评论 #14797642 未加载
giancarlostoroalmost 8 years ago
I wonder if implementing a new type of Recaptcha with these type of projects in mind would make sense. The data wouldn&#x27;t be going to some data center in Google land, but instead to some open end that anyone should be able to get their hands on. Also a free and open source recaptcha alternative would be nice. Trick is keeping it complex enough that bots cannot just reuse the existing public data set. Maybe withhold on making some of the data public for a few years till deemed &#x27;retired&#x27;.
ZoomZoomZoomalmost 8 years ago
I hope this data will be used purely for voice recognition purposes and not for voice generation, or we&#x27;ll be stuck with robots talking with this horrible gurgling and clicking accent due to poor recording conditions of most participants!
评论 #14795599 未加载
tanguealmost 8 years ago
Cool project, really aligned with the mission of Mozilla, and with a pleasant UX. And if you&#x27;re a non-english speaker like me validating sentences is a nice way of improving your comprehension.
评论 #14795018 未加载
sexydefinesheralmost 8 years ago
Should i have any privacy concerns with contributing? I dont want just anyone to have the data to recreate my voice digitally.
olegkikinalmost 8 years ago
Man, most people have horrible microphones.
评论 #14798203 未加载
timwaaghalmost 8 years ago
this is an important development. voice control has good potential. would be cool if they used it as an alternative way to control firefox and&#x2F;or servo?
ibottyalmost 8 years ago
Any idea why the duplicate detection did not work for this link: <a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=14786881" rel="nofollow">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=14786881</a><p>Anyhow: these should be merged (even though there is no discussion on the other submission)
评论 #14795027 未加载
kgdineshalmost 8 years ago
Can&#x27;t believe it&#x27;s down already.