TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Ask HN: How do you get started with adding voice commands to a computer system?

2 点作者 calebjosue超过 1 年前
Let&#x27;s suppose you want to add support for voice commands to a Linux Distro.<p>For simplicity&#x27;s sake, let&#x27;s say you want to be able to tell the computer (The terminal is running): &quot;Create XY directory&quot; and as a response the directory XY is created on the current directory.<p>How do you implement such a feature?<p>Will a Software developer first need to train a system over lots of people pronouncing &quot;Create directory&quot; phrases. And then perform inference on production?<p>Are some corporations&#x2F;start-ups already providing trained models for natural language - computer interaction?<p>How do you get started these sort of tasks these days?<p>And of course, for accessibility purposes, text-based interaction remains unchanged.<p>Thanks!

2 条评论

smoldesu超过 1 年前
Use Whisper! It&#x27;s a fairly small AI speech-to-text model that&#x27;s great for getting your feet wet with AI libraries. It&#x27;s extremely precise and easy to get working, I recommend it over pretty much everything else.<p><a href="https:&#x2F;&#x2F;github.com&#x2F;openai&#x2F;whisper">https:&#x2F;&#x2F;github.com&#x2F;openai&#x2F;whisper</a>
评论 #38369100 未加载
daanzu超过 1 年前
<a href="https:&#x2F;&#x2F;github.com&#x2F;dictation-toolbox&#x2F;dragonfly">https:&#x2F;&#x2F;github.com&#x2F;dictation-toolbox&#x2F;dragonfly</a><p><a href="https:&#x2F;&#x2F;github.com&#x2F;daanzu&#x2F;kaldi-active-grammar">https:&#x2F;&#x2F;github.com&#x2F;daanzu&#x2F;kaldi-active-grammar</a>