TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Show HN: Quick and dirty speech transcription on an Intel NPU

4 pointsby ellenhp6 months ago
tldr: I made a global hotkey on my laptop that will record my voice while I hold it down, transcribe the result and type out what it thinks I said.<p>Background:<p>My laptop was stolen recently and the new one I got to replace it has an Intel NPU [0] in it. The promise of the NPU is running small machine learning models efficiently on mobile hardware. I thought a good application of this would be using whisper to transcribe speech into text. There’s not really much [1] out there on Linux that can do this right now which is kind of a bummer because it’s a big accessibility thing to be able to type with your voice. I use my Sway configuration [2] to map the right control key to run a wrapper program [3] and then the release of the right control key to send a SIGINT to that program. The wrapper catches the SIGINT, ends transcription, and types the transcribed text into the focused application with the `enigo` crate.<p>Repo link: <a href="https:&#x2F;&#x2F;github.com&#x2F;ellenhp&#x2F;whisper-npu-server">https:&#x2F;&#x2F;github.com&#x2F;ellenhp&#x2F;whisper-npu-server</a><p>This is not one of my high polish projects, but I did want to throw it out there into the world, especially because the OpenVINO project doesn&#x27;t have any containerized NPU examples, even for LLMs.<p>[0] <a href="https:&#x2F;&#x2F;intel.github.io&#x2F;intel-npu-acceleration-library&#x2F;npu.html" rel="nofollow">https:&#x2F;&#x2F;intel.github.io&#x2F;intel-npu-acceleration-library&#x2F;npu.h...</a><p>[1] I found this, and based some of my code on it: <a href="https:&#x2F;&#x2F;github.com&#x2F;oddlama&#x2F;whisper-overlay">https:&#x2F;&#x2F;github.com&#x2F;oddlama&#x2F;whisper-overlay</a><p>[2] See end of post for example.<p>[3] <a href="https:&#x2F;&#x2F;github.com&#x2F;ellenhp&#x2F;whisper-transcription-wayland&#x2F;">https:&#x2F;&#x2F;github.com&#x2F;ellenhp&#x2F;whisper-transcription-wayland&#x2F;</a><p>Sample Sway config:<p>bindsym --no-repeat Control_R exec &quot;whisper-transcription&quot;<p>bindsym --release Control_R exec killall -2 whisper-transcription

1 comment

basiskarten5 months ago
This is really cool, I wish I could use it on my Windows 11 X1 Carbon which also comes with a NPU.<p>I was quite disappointed that the dictation tool Lenovo praised on their website for this &quot;CoPilot+ AI PC&quot; turned out to be a shortcut to the Windows 11 transcription tool. My hope was that they were indeed putting the NPU to use for that. Other than that this is a great machine. In any case, Lenovo should fire whoever is responsible for that feature and hire you instead.<p>It would also be great to see a demo of how this works on your machine. Thank you for sharing it at all, though!