Wow, this is pretty intriguing and may actually be a solid differentiator in the browser space, since it probably requires a considerable stored database of speech samples coupled with a decent back-end server farm to do it effectively. Hard for the other players to replicate. Clever move, Google!

I wonder if they will add this as a standard feature for any text field at some point? It's probably not going to get much sunlight if it requires a Chrome-specific attribute on the field.
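For reference, the Chrome-specific hook looks roughly like this; a minimal sketch, assuming the vendor-prefixed x-webkit-speech attribute the demo appears to use (a plain "speech" attribute has also been proposed, but support is assumed to be Chrome-only):

    <!-- Adding the attribute puts a small microphone icon in the field;
         clicking it records speech and fills the value with the transcript. -->
    <input type="text" x-webkit-speech>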
I suspect this slide's covert purpose is to make me (yes, just me) sit here and say 'hello' to my computer like a moron for 3 minutes while seeing no effect whatsoever. In this it has succeeded brilliantly.

Running Chrome, mic is on, no one is home... sigh. I so wanted to be wowed.
It actually does a pretty good job with simple words and sentences. So, jumping in the deep end, I tried "The reflected binary code was originally designed to prevent spurious output from electromechanical switches". Can anyone get it to recognize that? I did manage to get it to respond correctly to every word by itself (sometimes only after a couple of tries), but not the whole thing.

(non-native speaker)
Varies from poor to amazing.

"I have met Jesus, he was a nice guy" -> "ice melt cheese"

"hacker news is amazing" -> "hacker news"

"are you afraid of santa claus?" 100% correct

"if a woodchuck could chuck wood how much wood would a woodchuck chuck" 100% correct
Did anybody check out the slide before this one, device orientation? http://slides.html5rocks.com/#slide23

That's pretty awesome too; I could see this being great for mobile web apps, especially games.
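If you haven't seen it, the gist is just a DOM event that reports the device's rotation angles; a minimal sketch (the deviceorientation event is the standard mechanism, but actual sensor availability depends on the device and browser):

    // Listen for orientation changes and read the three rotation angles:
    // alpha = compass heading (0-360), beta = front-back tilt, gamma = left-right tilt.
    window.addEventListener('deviceorientation', function (event) {
      console.log(event.alpha, event.beta, event.gamma);
    }, false);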
What does this have to do with HTML5? Isn't it up to the UA to determine how best to accept form input? Specifying in the form that a particular field is a "voice recognition" field seems to be encoding presentation details in what should be structure.

I can understand that it's important to mark a particular form field as more "important" than others (and thus more likely that a user would want to use their voice to input text into it), but wouldn't this be better served by semantic markup declaring the field as a "primary" field or some such?
I whipped up a Chrome extension for voice search if anyone is interested:

http://dl.dropbox.com/u/1047706/VoiceSearch.crx

https://github.com/raneath/chrome-voice-search
This is exactly like the speech recognition on Android. It works brilliantly with short phrases that also happen to be popular searches on Google (or Google Voice Search), but fails at longer or obscure sentences. It's all about the data, baby.

I use Voice Search heavily on my Desire, but I prefer to type out my communications because of this exact limitation.
That is awesome; it works even for German without a problem. I couldn't get it to recognize an English sentence properly (which probably only means that my English pronunciation is horrible). I'm wondering, however, how they manage to recognize the language in the three-word sentences I tried.
It was rather good, but not nearly good enough to rely on for anything practical. It felt a bit like this: http://www.youtube.com/watch?v=5FFRoYhTJQQ
What version of Chrome does this work on?
Either I'm missing something or I'm on an older version of Chromium: Chromium 5.0.375.127 (Developer Build 55887) on Ubuntu 10.04.
Two things I would want upon seeing this:

1. A Chrome extension to use speech recognition in every text box (a rough sketch of how that might look is below).

2. Speech recognition inside the Google apps: Gmail, etc.
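For the first item, a minimal content-script sketch, assuming Chrome's x-webkit-speech attribute and an extension that injects this script into every page (manifest wiring omitted, selector coverage deliberately naive):

    // Add the speech mic button to every plain text input on the page.
    var inputs = document.querySelectorAll('input[type="text"], input:not([type])');
    for (var i = 0; i < inputs.length; i++) {
      inputs[i].setAttribute('x-webkit-speech', '');
    }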