There is now a (very rudimentary) demo on the GitHub page: zzmp.github.io/juliusjs<p>Much thanks to @iffy for writing the first pass.<p>It uses voxforge's sample vocabulary, so you'll need to say things like "Dial 1 2 3" or "Call Kenneth McDougall" for it to understand you, but the vocabulary is easily swapped out for your own projects, as explained in the README.
Thanks for sharing, nice work!<p>Quick question the Julius website says there is no English acoustic model available [1], how did you solve this? Do you provide a default acoustic model?<p>[1] <a href="http://julius.sourceforge.jp/en_index.php?q=en_grammar.html" rel="nofollow">http://julius.sourceforge.jp/en_index.php?q=en_grammar.html</a>
This is sweet...to get an idea of how much fun this could be for web apps, check out the Annyang library (<a href="https://www.talater.com/annyang/" rel="nofollow">https://www.talater.com/annyang/</a>), which wraps around the Google Web voice recognition API...it works very well, but of course, is subject to Google's terms...so an open source system is very welcome
Pretty cool! When I did my project I had to use <a href="https://github.com/kn/speak.js" rel="nofollow">https://github.com/kn/speak.js</a> which is an amazing library. The library still works on Firefox 30, 31 by the time I finished my project (and the project itself hasn't change much for a year or two!).<p>I would definitely give this JuliusJS library a try. I am actually amazed that JuliusJS doesn't carry all the heavy data like speak.js does (multiple languages support though). I love the fact that you state 100% client side!
Nice work. Can it return confidence scores? Say I want to load 3 commands in my page:
1. Click blue button
2. Scroll down in the yellow text area
3. Expand image of man
I feed those to the engine, and when somebody speaks, I get a confidence score on each word so I can determine with a level of configurable certainty that the user is using the command:
{click: 0.9878 confidence,
blue: 0.8789 confidence,
button: 0.1889 confidence)<p>Something like that...
Genuinely not sure if the demo is a joke or not.<p>I said "hello" it said "DIAL OH OH".<p>I said "Apple" it said "GET KENT".<p>WTF?
This is the kind of technological challenge which must be fun to complete. And it must be quite satisfying for the author.
However, whenever I see a 'XYZ in pure javascript', I keep getting the impression we are only delaying the inevitable moment browsers have to step to a superior language. Kinda like instead of quickly ripping off a bandaid is better than slooowwwwllly removing it ....