Greetings!<p>YakGPT is a simple, frontend-only, ChatGPT UI you can use to either chat normally, or, more excitingly, use your mic + OpenAI's Whisper API to chat hands-free.<p>Some features:<p>* A few fun characters pre-installed<p>* No tracking or analytics, OpenAI is the only thing it calls out to<p>* Optimized for mobile use via hands-free mode and cross-platform compressed audio recording<p>* Your API key and chat history are stored in browser local storage only<p>* Open-source, you can either use the deployed version at Vercel, or run it locally<p>Planned features:<p>* Integrate Eleven Labs & other TTS services to enable full hands-free conversation<p>* Implement LangChain and/or plugins<p>* Integrate more ASR services that allow for streaming<p>Source code: <a href="https://github.com/yakGPT/yakGPT">https://github.com/yakGPT/yakGPT</a><p>I’d love for you to try it out and hear your feedback!
Nice. It took about a minute to clone it, run it, enter my API key, and get started. The speech-to-text worked flawlessly.<p>Most people can talk faster than they can type, but they can read faster than other people can talk. So an interface where I speak but read the response is an ideal way of interfacing with ChatGPT.<p>What would be nice is if I didn't have to press the mic button to speak -- if it could just tell when I was speaking (perhaps by saying "hey YakGPT"). But I see how that might be hard to implement.<p>Would love to hook this up to some smart glasses with a heads-up display where I could speak and read the response.
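On detecting speech without a button press: one common approach is energy-based voice activity detection, where a recording starts once the mic level crosses a threshold and stops after a short run of quiet frames. Here is a minimal sketch of that idea in Python. It is not YakGPT's code, and the threshold, the hangover length, and the function names are all illustrative assumptions. A real wake phrase like "hey YakGPT" would need a keyword-spotting model on top of this.

```python
import math
from typing import Iterable, List, Sequence, Tuple


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square level of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def detect_speech(
    frames: Iterable[Sequence[float]],
    threshold: float = 0.05,  # assumed level separating speech from background
    hangover: int = 2,        # assumed number of quiet frames tolerated mid-utterance
) -> List[Tuple[int, int]]:
    """Return (start, end) frame ranges judged to be speech, end exclusive.

    A segment opens on the first frame at or above `threshold` and closes
    once more than `hangover` consecutive quiet frames have been seen.
    """
    segments: List[Tuple[int, int]] = []
    start = None
    quiet = 0
    count = 0
    for i, frame in enumerate(frames):
        count = i + 1
        if rms(frame) >= threshold:
            if start is None:
                start = i
            quiet = 0
        elif start is not None:
            quiet += 1
            if quiet > hangover:
                # Close the segment at the first of the trailing quiet frames.
                segments.append((start, i - quiet + 1))
                start, quiet = None, 0
    if start is not None:
        # Audio ended mid-segment; drop any trailing quiet frames.
        segments.append((start, count - quiet))
    return segments
```

In a browser the frames would come from the microphone stream rather than a list, and the threshold would likely need calibrating against the room's background noise.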
I tried it, and it looks good! I had to modify the code to accept 8000 tokens for ChatGPT. It would be good if it saved the JSON payload of the responses as well.<p>It makes 2 external calls to a JavaScript CDN for the microphone package and something else. It would probably be best if it made localhost calls only, since it handles an API key.
I love the concept of this and other alternate ChatGPT UIs, but I hesitate to use them and pay for my calls when I could use chat.openai.com for free.<p>Any chance you could integrate the backend-api, and let me paste in my Bearer token from there?
> Run locally on browser – no need to install any applications<p>That's not what "run locally" means. This isn't any more "local" than talking to ChatGPT directly, which is never running locally.
Honestly, your "idea generator" blew my mind. Would love to see a section that includes a larger catalog of prefilled prompts.<p>I'm thinking: What would a GPT project manager do? What would a GPT money manager do? What would a GPT logistics manager do? A GPT data analyst? Etc.
> Run locally on browser – no need to install any applications<p>> Please enter your OpenAI key<p>...<p>Do people just not get it?<p>I would in fact rather give all my company secrets to this random dude than OpenAI.
Love the idea of prompt dictation. Taking that idea a step further, would it be possible to have a feature where ChatGPT responses are spoken back to the user?
This is fast. And talking to it is a nice touch. Consider adding text-to-speech too :)<p>One feature I am missing from all these front ends is the ability to edit your text and generate a new response from that point. The official ChatGPT UI is the only one that seems to do that.
Looks cool! Are you planning on adding more customization to be able to influence the AI? See <a href="https://bettergpt.chat/" rel="nofollow">https://bettergpt.chat/</a> (it's also open source and uses the API in the browser). Basically, with that frontend you can control the role of all messages (e.g. add system messages) and also edit them all to better influence the AI in some cases.
BRO. Your transcription is SO fast. I've hacked at a similar project passing audio to the Whisper API, and honestly I was already blown away by its speed and accuracy (as was anyone I showed it to), but your implementation is so much faster, both in the transcription and in the response from their API. I will absolutely use this.
Very cool. I use a custom local UI as well, based on a fork of a similar project called ChatPad (<a href="https://github.com/deiucanta/chatpad">https://github.com/deiucanta/chatpad</a>). That also uses Mantine UI, and lets you create and save prompts just like chats. Data is stored locally using IndexedDB. I embedded it in an Electron app, which lets me run it from my dock rather than a terminal. But what's missing is speech-to-text, so it's great to see this project has that.<p>There are a few drawbacks to local, I've discovered. For example, I doubt the new plugins can be extended beyond ChatGPT's web UI. Also, it doesn't stream response tokens as they're generated, which is a pain. I haven't looked into whether OpenAI lets you do that though.<p>Nice work!
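On streaming: the chat completions API does accept `stream: true`, in which case the response arrives as server-sent events, with each `data:` line carrying a JSON chunk whose `choices[0].delta` holds the next piece of text, and a final `data: [DONE]` line. Here is a minimal sketch of the client-side parsing in Python. The function name is made up for illustration, and the HTTP request itself is left out so the parsing logic stands alone.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from the lines of a streamed chat completion."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and anything that is not a data field
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            # The first chunk usually carries only the role, so content may be absent.
            text = choice.get("delta", {}).get("content")
            if text:
                yield text
```

A UI would append each yielded fragment to the message being displayed, so the reply appears as it is generated.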
Looks great. Super interesting to browse other people's code.
I'm working on a desktop app for ChatGPT.<p><a href="https://github.com/EzzatOmar/delegate">https://github.com/EzzatOmar/delegate</a>
Given that Vocode (realtime audio, llm, etc) came out a few days ago, could you speak to how yours compares to it?<p><a href="https://github.com/vocodedev/vocode-python">https://github.com/vocodedev/vocode-python</a>
Cool! I tried out the speech-to-text and it was instant and accurate. I had no idea Whisper was that good.<p>Do you know their privacy policy for our voice recordings? Do they train on them, listen to them, etc.?
I absolutely love this! The UI is nice and responsive, and this is the first ChatGPT UI that has voice recognition that works outside of Chrome!<p>I kind of want to throw this up on a server for my housemates to use. I am currently the only person with an OpenAI account, so I would like the ability to embed my API key. Minor feature request :-)
Hi ChatGPT! Let me register using my personal information, then tell you what my tasks are at work, what I'm interested in, what I'm struggling with in life and a bunch of other sensitive personal information. I trust you completely, and am sure a nice AI such as yourself would never use my personal data for anything.
The only thing I'd suggest adding is some sort of authentication. If I deploy this on a server so I can reach it from my phone on the go, and it has my API credentials, I wouldn't want anyone who stumbles upon the page to be able to use ChatGPT at my expense.<p>Otherwise, it really looks good.
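One lightweight way to gate a self-hosted deployment like this is HTTP Basic authentication in front of the app, served over HTTPS. Below is a minimal sketch of the server-side check in Python. It is not part of YakGPT, the function name is hypothetical, and in practice a reverse proxy or the hosting platform's own password protection would often do this job instead.

```python
import base64
import hmac
from typing import Optional


def is_authorized(header: Optional[str], expected_user: str, expected_password: str) -> bool:
    """Check an `Authorization: Basic ...` header value against expected credentials."""
    if not header or not header.startswith("Basic "):
        return False
    try:
        decoded = base64.b64decode(header[len("Basic "):], validate=True).decode("utf-8")
    except ValueError:
        # Covers both malformed base64 and bytes that are not valid UTF-8.
        return False
    user, sep, password = decoded.partition(":")
    if not sep:
        return False
    # compare_digest avoids leaking how many leading characters matched.
    user_ok = hmac.compare_digest(user.encode(), expected_user.encode())
    password_ok = hmac.compare_digest(password.encode(), expected_password.encode())
    return user_ok and password_ok
```

A request that fails the check would get a 401 response with a `WWW-Authenticate: Basic` header, which makes the browser prompt for credentials.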
I've been playing around with your Idea Generator persona for the last 15 minutes and have been absolutely blown away. Excellent prompt engineering.<p>As mentioned by others, it would be great to customize or write new personas/prompts.<p>Also, could you add a voice chatbot as well using Vocode? It could be an alternative UI for each of the personas.
So if you add audio output to it so I can talk to my computer like in Star Trek, I'll Venmo you $100. Then, I want a command-line module so I can ask it to write files to the local disk and run them, so I can deploy code it's just written to AWS. That's worth at least another $100.
It would be great if I could just press "space" in the app and have it let me talk to it. Keyboard shortcuts!<p>BTW, I have a lot of these ChatGPT UI apps installed, mostly free and open-source. Perhaps this is really the era of going back to just talking to a chat interface like in the old times.
This is very well made and designed. I will likely use this instead of the actual ChatGPT UI, since their API is a lot cheaper than the $20/month pricing for my usage amount.<p>Interesting note: I tried speaking Mandarin Chinese into the mic and it auto-translated what I said into English.
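To put rough numbers on the API-versus-subscription comparison, here is a small back-of-the-envelope sketch in Python. The $0.002 per 1K tokens figure is an assumption (it is what gpt-3.5-turbo was listed at around the time of this thread, so check the current pricing page), and the function names are made up for illustration.

```python
def monthly_api_cost(tokens_per_month: float, usd_per_1k_tokens: float) -> float:
    """Estimated pay-as-you-go cost in USD for a month's token usage."""
    return tokens_per_month / 1000 * usd_per_1k_tokens


def break_even_tokens(subscription_usd: float, usd_per_1k_tokens: float) -> float:
    """Tokens per month at which API usage costs as much as the subscription."""
    if usd_per_1k_tokens <= 0:
        raise ValueError("price per 1K tokens must be positive")
    return subscription_usd / usd_per_1k_tokens * 1000


ASSUMED_PRICE = 0.002  # USD per 1K tokens, an assumption; see the note above

# Half a million tokens a month comes to about one dollar at the assumed price.
light_usage_cost = monthly_api_cost(500_000, ASSUMED_PRICE)

# A $20 subscription breaks even at roughly ten million tokens a month.
break_even = break_even_tokens(20, ASSUMED_PRICE)
```

Keep in mind that each request is billed for the whole conversation sent as context plus the reply, so long chats use tokens faster than the visible text suggests.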
Just tried this in both English and Korean. Fumbled a bit with voice control but worked well once I got it going. Very nice. Korean prompts got translated to English so had to tell ChatGPT to respond in Korean to get full non-English UX.<p>Well done.
I installed it locally about an hour ago and have been running it through some paces. Nice work! (In addition to the predefined prompts, I like the API usage meter at the top.)<p>(Now I just need OpenAI to take me off the waitlist for GPT-4.)
I’m a bit confused. I tried to utter some queries in Esperanto and French, and it transcribed them as (fine) English translations. Can I disable this behavior so the text is transcribed in the language uttered?
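For context on transcription versus translation: OpenAI's audio API exposes two endpoints, `/v1/audio/transcriptions` (text in the spoken language, with an optional `language` hint given as an ISO-639-1 code) and `/v1/audio/translations` (always English output). I have not checked which endpoint or parameters YakGPT sends, so the sketch below only shows how a client could choose between the two. It is in Python, the helper name is hypothetical, and it builds the request description without making any network call.

```python
from typing import Dict, Optional, Tuple

API_BASE = "https://api.openai.com/v1/audio"

ENDPOINTS = {
    "transcribe": API_BASE + "/transcriptions",  # output stays in the spoken language
    "translate": API_BASE + "/translations",     # output is English
}


def build_whisper_request(mode: str, language: Optional[str] = None) -> Tuple[str, Dict[str, str]]:
    """Return (url, form_fields) for a Whisper request; the audio file is attached separately."""
    if mode not in ENDPOINTS:
        raise ValueError(f"unknown mode: {mode!r}")
    fields = {"model": "whisper-1"}
    if language is not None:
        if mode != "transcribe":
            # As far as I know, the language hint only applies to transcription.
            raise ValueError("language is only used with mode='transcribe'")
        fields["language"] = language
    return ENDPOINTS[mode], fields
```

For example, `build_whisper_request("transcribe", language="fr")` targets the transcription endpoint with a French hint, which is the setup that should keep French speech as French text.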
I might be missing it, but do we have an idea about the prompt that ChatGPT uses so we can replicate the experience?<p>I haven't played with the OpenAI API yet. Are there examples of good prompts to use to get good responses?
Love this. A few things we could add:
- Search Feature
- Way to import/export chats
- Star/Favourite replies by ChatGPT
- For GPT-4, provide 8k/32k model variations
- Prompt Dictionary
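On the import/export item in the list above: since chats are kept client-side, this could be as simple as a versioned JSON file. Here is a minimal sketch in Python of what the round trip might look like. The `Chat` and `Message` shapes and the `version` field are assumptions for illustration, not YakGPT's actual storage schema.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class Message:
    role: str      # "system", "user", or "assistant"
    content: str


@dataclass
class Chat:
    id: str
    title: str
    messages: List[Message] = field(default_factory=list)


EXPORT_VERSION = 1  # bump when the file layout changes


def export_chats(chats: List[Chat]) -> str:
    """Serialize chats to a JSON string suitable for saving as a file."""
    payload = {"version": EXPORT_VERSION, "chats": [asdict(c) for c in chats]}
    return json.dumps(payload, ensure_ascii=False, indent=2)


def import_chats(text: str) -> List[Chat]:
    """Parse a JSON export back into Chat objects, rejecting unknown versions."""
    data = json.loads(text)
    if data.get("version") != EXPORT_VERSION:
        raise ValueError("unsupported export version")
    return [
        Chat(
            id=c["id"],
            title=c["title"],
            messages=[Message(**m) for m in c["messages"]],
        )
        for c in data["chats"]
    ]
```

The explicit version field is there so a later release can change the layout and still recognize, or cleanly reject, older export files.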
I get a 404 error in the browser console for <a href="http://localhost:3000/encoderWorker.umd.js" rel="nofollow">http://localhost:3000/encoderWorker.umd.js</a>
This is exactly what I need, thank you for building this! We're using Azure Cognitive Services for API access to OpenAI models, though. With any luck, expect a PR today for basic Azure support :)
It would be so fun if you could fork a project on Vercel, i.e. this project has a button to fork that:
- forks its GitHub repo
- makes a new project on your Vercel account, since it's connected to your GitHub
- opens a new tab with your project running.
> Run locally on browser – no need to install any applications<p>This seems to be a contradiction. Am I running it locally, or is it running on someone else's server?