Ask HN: Using AI to visually detect musical instrument note played

4 points by udit99, almost 2 years ago

I'm looking for existing tools/libraries, or a guide to training your own model, to do the following:

From a live video stream, detect which keyboard keys the user pressed, which guitar fret/string the user played, etc.

I imagine that with the current state of AI, if I train a model with enough data consisting of videos of me playing a musical instrument along with the tab/sheet music equivalent, it should be able to transcribe this visually. Am I underestimating the complexity here?

If I were to build this from scratch, what tools would you recommend? TensorFlow comes to mind, but with the tsunami of AI developments in the past year, there's gotta be a better tool out there.

4 comments

kingkongjaffa, almost 2 years ago

If you know the tuning and scale length of the guitar, you can map the note played (audio) to string and fret position without trying to work that out from video?

I imagine working from the audio waveform is an order of magnitude easier than video recognition.

Video would be a pain because you would probably need to synchronise the fretting hand's position with the picking/strumming timing to know which video frame actually 'struck' each string or chord, right?

Imagine fingerstyle guitar or flamenco: the picking patterns are at least as complex as the fretting hand's position.
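
The audio-to-fretboard mapping suggested above can be sketched roughly as follows. This is only an illustration, not something from the thread: it assumes standard EADGBE tuning, 12-tone equal temperament with A4 = 440 Hz, and a 22-fret neck, and the function names (`freq_to_midi`, `fret_positions`, `fret_distance_from_nut`) are hypothetical. Pitch alone determines the fret number on each string; the scale length only matters when converting a fret number into a physical distance along the neck.

```python
import math

# Assumption: standard tuning as MIDI note numbers, lowest string first (E2 A2 D3 G3 B3 E4).
STANDARD_TUNING = [40, 45, 50, 55, 59, 64]


def freq_to_midi(freq_hz, a4_hz=440.0):
    """Round a frequency to the nearest equal-tempered MIDI note number."""
    return round(69 + 12 * math.log2(freq_hz / a4_hz))


def fret_positions(midi_note, tuning=STANDARD_TUNING, max_fret=22):
    """Return every (string_number, fret) pair that sounds the given note.

    String 6 is the lowest-pitched string and string 1 the highest.
    """
    positions = []
    for index, open_note in enumerate(tuning):
        fret = midi_note - open_note  # one fret per semitone above the open string
        if 0 <= fret <= max_fret:
            positions.append((len(tuning) - index, fret))
    return positions


def fret_distance_from_nut(scale_length, fret):
    """Physical distance from the nut to the given fret, in the units of scale_length."""
    return scale_length * (1 - 2 ** (-fret / 12))


if __name__ == "__main__":
    note = freq_to_midi(440.0)   # A4
    print(fret_positions(note))  # several candidate positions for the same pitch
```

One caveat the sketch makes visible: most pitches can be played at several string/fret positions, so audio alone narrows the candidates but does not always pick a single one.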
bjourne, almost 2 years ago

For both keyboard and guitar, finger movements can be very subtle and therefore difficult to discriminate. It is probably easier if your camera can be mounted to give a top-down view of your fingers while you play the keyboard. It is probably much harder on guitar, since getting a good view of how a fret is fingered is hard. Fingers may also hover over strings or lightly press strings to mute them.
gregmfoster, almost 2 years ago

I imagine this would be brutally hard for guitar. So often, my fingers are "touching" the correct strings but not pushing down hard enough to trigger the note. Piano might be more feasible because you can watch the motion of the keys themselves.

Question: what is your root goal? Are you convinced video processing is the best way to achieve this?
speedgoose, almost 2 years ago

Perhaps a fast Fourier transform (FFT) with a simple classifier would be enough.
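
A minimal sketch of that idea, under stated assumptions: a single monophonic note, a power-of-two window of samples, and a "classifier" that is nothing more than rounding the strongest FFT peak to the nearest equal-tempered note (A4 = 440 Hz). The helper names are hypothetical, and the FFT is a plain-Python radix-2 implementation chosen to keep the example self-contained; a real project would more likely use an optimized library. On real instruments the strongest peak can be a harmonic rather than the fundamental, so this is a starting point at best.

```python
import cmath
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def fft(samples):
    """Recursive radix-2 Cooley-Tukey FFT; len(samples) must be a power of two."""
    n = len(samples)
    if n == 1:
        return [complex(samples[0])]
    even = fft(samples[0::2])
    odd = fft(samples[1::2])
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def dominant_frequency(samples, sample_rate):
    """Return the frequency (Hz) of the strongest bin below Nyquist, ignoring DC."""
    spectrum = fft(samples)
    half = len(samples) // 2
    peak_bin = max(range(1, half), key=lambda k: abs(spectrum[k]))
    return peak_bin * sample_rate / len(samples)


def classify_note(freq_hz, a4_hz=440.0):
    """Nearest-note 'classifier': map a frequency to a name such as 'A4'."""
    midi = round(69 + 12 * math.log2(freq_hz / a4_hz))
    return f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"


def sine(freq_hz, sample_rate, count):
    """Synthetic test tone standing in for a recorded note."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(count)]


if __name__ == "__main__":
    rate = 8192  # assumed sample rate; a 4096-sample window gives 2 Hz bins
    tone = sine(440.0, rate, 4096)
    print(classify_note(dominant_frequency(tone, rate)))
```

The frequency resolution is `sample_rate / window_length`, so low notes, which sit only a few hertz apart, need longer windows than high ones.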