This would of course be even more fun with ChatGPT, but it is a nice and funny demo of their whisper.cpp library. The second video is worth watching: <a href="https://user-images.githubusercontent.com/1991296/202914175-115793b1-d32e-4aaa-a45b-59e313707ff6.mp4" rel="nofollow">https://user-images.githubusercontent.com/1991296/202914175-...</a>
<p><pre><code> The total data that the page will have to load on startup (probably using Fetch API) is:
- 74 MB for the Whisper tiny.en model
- 240 MB for the GPT-2 small model
- Web Speech API is built-in in modern browsers
</code></pre>
cool but im now wondering what it would take to bring this down enough to put this in real apps? anyone talking about this?
Listening to that demo, it's incredible how far we've come!<p>Or, not.<p>Racter was <i>commercially</i> released for Mac in December 1985:<p><i>Racter strings together words according to "syntax directives", and the illusion of coherence is increased by repeated re-use of text variables. This gives the appearance that Racter can actually have a conversation with the user that makes some sense, unlike Eliza, which just spits back what you type at it. Of course, such a program has not been written to perfection yet, but Racter comes somewhat close.</i><p><i>Since some of the syntactical mistakes that Racter tends to make cannot be avoided, the decision was made to market the game in a humorous vein, which the marketing department at Mindscape dubbed "tongue-in-chip software" and "artificial insanity".</i><p><a href="https://www.mobygames.com/game/macintosh/racter" rel="nofollow">https://www.mobygames.com/game/macintosh/racter</a><p><a href="https://www.myabandonware.com/game/racter-4m/play-4m" rel="nofollow">https://www.myabandonware.com/game/racter-4m/play-4m</a><p>It's only amazing that chatGPT backed by GPT-3 is the <i>first thing since then</i> to do enough better that <i>everyone</i> is engaged.<p>I owned that in 1985, and having studied AI/ML previously I've been (and remain something of) an AGI skeptic. But now in 2022, I finally think <i>“this changes everything”</i> ... not because it's AI, but because it's making the application of matching probabilistic patterns across mass knowledge practical and useful for everyday work, particularly as a structured synthesis assistant.
I implemented whisper + chatgpt + pyttsx3 and it worked. But then suddenly the chatgpt wrapper that I found on github stopped working.<p>edit: whisper is awesome
Technically this seems to work, and mad props to the author for getting to this point. On my computer (MacBook Pro) it's very slow but there are enough visual hints that it's thinking to make the wait ok. I have plenty of complaints about the output but most of that is GPT-2's problem.
I've been thinking of doing something like this but hooked up with ChatGPT/GPT-3-daviinci003. Obviously model will not load in the browser but we cna call the API. Could be a neat way to interact with the bot.