In a similar vein: <a href="http://www.adultswim.com/etcetera/choir/" rel="nofollow">http://www.adultswim.com/etcetera/choir/</a><p>The most interesting thing about this one is the chord progressions it generates.
A little while back I used this combined with a physics simulator to make a toy where you throw polygons in the air and they scream: <a href="https://ohgodwhathaveidone.stackblitz.io/" rel="nofollow">https://ohgodwhathaveidone.stackblitz.io/</a><p>Code's here if anyone wants to play: <a href="https://stackblitz.com/edit/ohgodwhathaveidone" rel="nofollow">https://stackblitz.com/edit/ohgodwhathaveidone</a> – I did a fairly medium job of abstracting the synthesis engine away from the UI, but it might be a decent starting point if you're looking to make other Trombone-based web silliness.
This reminds me of the Sprechmaschine [1] ("speaking machine") built at the end of the 18th century by Wolfgang von Kempelen (the guy who build the original mechanical turk [2]). Here is a YouTube video showing it in action (for example, the machine says "Mama" around 1:14): <a href="https://www.youtube.com/watch?v=k_YUB_S6Gpo" rel="nofollow">https://www.youtube.com/watch?v=k_YUB_S6Gpo</a><p>[1] <a href="https://de.wikipedia.org/wiki/Wolfgang_von_Kempelen#Die_Sprechmaschine" rel="nofollow">https://de.wikipedia.org/wiki/Wolfgang_von_Kempelen#Die_Spre...</a><p>[2] <a href="https://en.wikipedia.org/wiki/The_Turk" rel="nofollow">https://en.wikipedia.org/wiki/The_Turk</a>
Lest we forget that speech synthesis is not just for grotesque but amusing semi-real vocal synths like this, here's a BBC Radio 4 history of speech synthesis as an assistive technology - Klatt's Last Tapes, by Stephen Hawking's daughter, Lucy:<p><a href="https://www.youtube.com/watch?v=097K1uMIPyQ" rel="nofollow">https://www.youtube.com/watch?v=097K1uMIPyQ</a>
Man this brings back memories. We used this program for my linguistics homework in college. In 1996. Although I think it was an app then not a web page.
This would be useful to demonstrate the difference between p/f and l/r for those brought up without those distinctions.<p>I'd also (as an English speaker) like to see/hear Dutch g and Xhosan clicks.
Reminds me of Xiph's Speex/CELP model of speech as a mix of noise and frequency to achieve high compression, requiring as little as 2.15 kilobits (275 bytes) per second. It sounds perceptibly similar to the original recording, even though the difference between the input and output sampled data may be high:<p><a href="https://www.speex.org/docs/manual/speex-manual/node9.html" rel="nofollow">https://www.speex.org/docs/manual/speex-manual/node9.html</a><p>Bitrate comparison:<p><a href="https://www.speex.org/comparison/" rel="nofollow">https://www.speex.org/comparison/</a><p>Samples:<p><a href="https://www.speex.org/samples/" rel="nofollow">https://www.speex.org/samples/</a><p>Maybe higher compression can be achieved with better prediction, aka machine learning.
I've actually been looking for the <i>opposite</i> of this (i.e. sound in, mouth representation out) for a while. Does anyone know of such a thing?
Wow! I was able to successfully recreate all sorts of letters and sounds just by imagining how my own mouth works, and then manipulating the different components on the pink trombone in the same way. I'm impressed!
This was also ported to C++ inside a modular synth:<p><a href="https://www.youtube.com/watch?v=PDn7ygnJUfI" rel="nofollow">https://www.youtube.com/watch?v=PDn7ygnJUfI</a><p><a href="https://www.youtube.com/watch?v=3jcqKnIa8T4" rel="nofollow">https://www.youtube.com/watch?v=3jcqKnIa8T4</a><p><a href="https://www.youtube.com/watch?v=bo5ZEgBEapk" rel="nofollow">https://www.youtube.com/watch?v=bo5ZEgBEapk</a><p><a href="https://github.com/giuliomoro/pink-trombone" rel="nofollow">https://github.com/giuliomoro/pink-trombone</a>
What someone needs to do is put sensors in people's mouths, record them saying known phrases, then stick the sensor data+phrases into some AI and see if we can't get that Trombone talkin'!
Previous post in Apr 2017: <a href="https://news.ycombinator.com/item?id=14135658" rel="nofollow">https://news.ycombinator.com/item?id=14135658</a>
The shape of the tongue control reminds me a lot of the rhombus in <a href="https://en.wikipedia.org/wiki/International_Phonetic_Alphabet#Vowels" rel="nofollow">https://en.wikipedia.org/wiki/International_Phonetic_Alphabe...</a><p>... which is because the latter was patterned after the shape of the mouth.
Thats nice and everything but until I have a hardware version where I can just switch a button on and do this and that, I'm not really going to be truly satisfied. Please hardware'ify.
Is the author aware of the other meaning of "pink trombone"?[1]<p>[1] <a href="https://www.urbandictionary.com/define.php?term=pink%20trombone" rel="nofollow">https://www.urbandictionary.com/define.php?term=pink%20tromb...</a>