科技回声
Show HN: openai-realtime-embedded-SDK Build AI assistants on microcontrollers

63 points by Sean-Der, 5 months ago
Hi HN! This is an SDK for ESP32s (microcontrollers) that runs against OpenAI's new WebRTC service [0]. My hope is that people can easily add AI to lots of 'real' devices: wearables, speakers around the house, toys, etc. You don't have to write any code, just buy a device and set some env variables.

If you have any feedback/questions I would love to hear it! I hope this kicks off a generation of new, interesting devices. If you aren't familiar with WebRTC, it can do some magical things. Check out WebRTC for the Curious [1]; I would love to talk about all the cool things it does, too.

[0] https://platform.openai.com/docs/guides/realtime-webrtc

[1] https://webrtcforthecurious.com

4 comments

kaycebasques, 5 months ago
Took a bit of poking to figure out what the use case is. Doesn't seem to be mentioned in the README (usage section is empty) or the intro above. Looks like the main use case is speech-to-speech. Which makes sense since we're talking about embedded products, and text-to-speech (for example) wouldn't usually be relevant (because most embedded products don't have a keyboard interface). Congrats on the launch! Cool to see WebRTC applied to embedded space. Streaming speech-to-speech with WebRTC could make a lot of sense.
jonathan-adly, 5 months ago
Here is a nice use case. Put this in a pharmacy: have people hit a button and ask questions about over-the-counter medications.

Really, in any physical place where people are easily overwhelmed, having something like this would be really nice.

With some work, you can probably even run RAG on the questions and answer esoteric things like where the food court is in an airport or where the ATM is in a hotel.
roland35, 5 months ago
Favorited and starred! I wonder if the real power of this could be in integrating large, low-cost sensor networks? I think with things like video and audio it might make more sense to bump up to a single-board Linux computer, but maybe the AI could help parse or create notifications based on sensor readings, and push events back to the real world (lights, solenoids, etc.).

I think it would help to either have a FreeRTOS example or, if you want to go real crazy, create a Zephyr integration! It would be a lot of fun to work on an AI and microcontroller combination. What a cool niche!
johanam, 5 months ago
Love this! Excited to give it a try.