TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Ask HN: Machine learning resources for audio processing

252 pointsby samrohnabout 6 years ago
What are some good learning resources on audio processing, detection and anomaly detection using machine learning or deep learning? I am interested in machine predictive maintenance using audio anomaly detection

19 comments

citilifeabout 6 years ago
There&#x27;s a good class at UIUC regarding signal processing:<p><a href="https:&#x2F;&#x2F;courses.engr.illinois.edu&#x2F;cs598ps&#x2F;fa2018&#x2F;material.html" rel="nofollow">https:&#x2F;&#x2F;courses.engr.illinois.edu&#x2F;cs598ps&#x2F;fa2018&#x2F;material.ht...</a><p>Course is led by Paris Smaragdis, one of top researchers in the field of audio processing.
sdenton4about 6 years ago
The folks behind audio set have been working on general audio event detection for some years now, I believe.<p><a href="https:&#x2F;&#x2F;research.google.com&#x2F;audioset&#x2F;" rel="nofollow">https:&#x2F;&#x2F;research.google.com&#x2F;audioset&#x2F;</a><p>There&#x27;s a huge amount to discuss in the audio domain... But for a starting place, using ResNet on spectrograms to build a binary classifier is a good place to start.
enisberkabout 6 years ago
I am taking a course called &quot;Speech and Audio Understanding&quot; from Prof. Michael I Mandel, you can check course website[1] , he has a good collection of resources. Also his github stars are good collection of related projects[2]. In class we are using a book called &quot;Human and Machine Hearing: Extracting Meaning from Sound&quot; by Richard F. Lyon, authors shares it for free [3] For example one of the resources you will see on the course website is presentations from interspeech2018, you can check all tutorials from there[4].<p>[1] <a href="http:&#x2F;&#x2F;mr-pc.org&#x2F;t&#x2F;csc83060&#x2F;" rel="nofollow">http:&#x2F;&#x2F;mr-pc.org&#x2F;t&#x2F;csc83060&#x2F;</a><p>[2] <a href="https:&#x2F;&#x2F;github.com&#x2F;mim?tab=stars" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;mim?tab=stars</a><p>[3] <a href="http:&#x2F;&#x2F;dicklyon.com&#x2F;hmh&#x2F;Lyon_Hearing_book_01jan2018.pdf" rel="nofollow">http:&#x2F;&#x2F;dicklyon.com&#x2F;hmh&#x2F;Lyon_Hearing_book_01jan2018.pdf</a><p>[4] <a href="http:&#x2F;&#x2F;interspeech2018.org&#x2F;program-tutorials.html" rel="nofollow">http:&#x2F;&#x2F;interspeech2018.org&#x2F;program-tutorials.html</a>
am807about 6 years ago
Just found this thread on the fast.ai forum yesterday that may help: <a href="https:&#x2F;&#x2F;forums.fast.ai&#x2F;t&#x2F;deep-learning-with-audio-thread&#x2F;38123" rel="nofollow">https:&#x2F;&#x2F;forums.fast.ai&#x2F;t&#x2F;deep-learning-with-audio-thread&#x2F;381...</a>
评论 #19693063 未加载
Tangokatabout 6 years ago
I don&#x27;t know if this is off topic but would it be possible to remove the sound of mechanical keyboards with ML in realtime from a VOIP stream? Sell the technology to Discord and profit.
评论 #19694552 未加载
destabout 6 years ago
You may reuse some concepts I have described for an audio adblock: <a href="https:&#x2F;&#x2F;www.adblockradio.com&#x2F;blog&#x2F;2018&#x2F;11&#x2F;15&#x2F;designing-audio-ad-block-radio-podcast&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.adblockradio.com&#x2F;blog&#x2F;2018&#x2F;11&#x2F;15&#x2F;designing-audio...</a><p>More precisely, audio spectral preprocessing then neural network such as LSTM.
williamsmjabout 6 years ago
I think the slides&#x2F;recording of this excellent Spotify talk will be posted shortly: <a href="https:&#x2F;&#x2F;qcon.ai&#x2F;qconai2019&#x2F;presentation&#x2F;deep-learning-audio-signals-prepare-process-design-expect" rel="nofollow">https:&#x2F;&#x2F;qcon.ai&#x2F;qconai2019&#x2F;presentation&#x2F;deep-learning-audio-...</a>.
telesillaabout 6 years ago
aubio and librosa are two excellent MIR (music information retrieval) tools I can recommend from personal use. They can both be implemented for real-time audio using pyaudio or similar.<p><a href="https:&#x2F;&#x2F;aubio.org&#x2F;doc&#x2F;latest&#x2F;" rel="nofollow">https:&#x2F;&#x2F;aubio.org&#x2F;doc&#x2F;latest&#x2F;</a><p><a href="https:&#x2F;&#x2F;librosa.github.io&#x2F;librosa&#x2F;" rel="nofollow">https:&#x2F;&#x2F;librosa.github.io&#x2F;librosa&#x2F;</a>
评论 #19693203 未加载
konsoleXDabout 6 years ago
I am also curious about this topic! I have picked up a jetson nano and fully intend to put this device to use by projecting comic-book panel-style speech bubbles (plus, who knows... random panels?) on the wall leveraging pytorch + deepspeech.<p>That&#x27;s at least the idea kicking around in my head at the moment. <a href="https:&#x2F;&#x2F;github.com&#x2F;SeanNaren&#x2F;deepspeech.pytorch" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;SeanNaren&#x2F;deepspeech.pytorch</a><p>I&#x27;m no expert. Haven&#x27;t done it. Don&#x27;t really want to send every convo into the cloud or my tinfoil hat will start burning.<p>You do not need a jetson to get started investigating. Maybe just nvidia for that particular library. If you find something, maybe you can let me know somehow.<p>Peace
devinabout 6 years ago
<a href="https:&#x2F;&#x2F;github.com&#x2F;ybayle&#x2F;awesome-deep-learning-music" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;ybayle&#x2F;awesome-deep-learning-music</a> a &quot;Non-exhaustive list of scientific articles on deep learning for music&quot;
tixocloudabout 6 years ago
Here&#x27;s a resource that breaks down the various audio processing tasks and provides case studies: <a href="https:&#x2F;&#x2F;www.analyticsvidhya.com&#x2F;blog&#x2F;2018&#x2F;01&#x2F;10-audio-processing-projects-applications&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.analyticsvidhya.com&#x2F;blog&#x2F;2018&#x2F;01&#x2F;10-audio-proces...</a><p>It&#x27;s slightly academic so here&#x27;s a more practical resource: <a href="https:&#x2F;&#x2F;towardsdatascience.com&#x2F;audio-classification-using-fastai-and-on-the-fly-frequency-transforms-4dbe1b540f89" rel="nofollow">https:&#x2F;&#x2F;towardsdatascience.com&#x2F;audio-classification-using-fa...</a>
ransom1538about 6 years ago
I would get lunch with these guys:<p><a href="https:&#x2F;&#x2F;www.audiblemagic.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;www.audiblemagic.com&#x2F;</a><p>These sketch balls can use your phone&#x27;s mic to detect what is streaming in a living room.
contingenciesabout 6 years ago
Recently I started looking in to this as a backup method of anomaly detection while performing automated testing of our robotics. I concluded that it&#x27;s actually pretty easy. Depending upon how simplistic your requirements, you can even achieve this cheaply and effectively on a very tiny microprocessor with an attached surface mount MEMS microphone. Additional features like anomalous audio recording, timestamping and alert transmission are not that hard either. No need for a fully-fledged general purpose operating system, or complex algorithms.
bjourneabout 6 years ago
See this book and the sources it links to: <a href="https:&#x2F;&#x2F;musicinformationretrieval.com&#x2F;" rel="nofollow">https:&#x2F;&#x2F;musicinformationretrieval.com&#x2F;</a> Also google for pitch and onset detection. If you want more specific help, you have to ask a more specific question.
评论 #19691809 未加载
ml-engineerabout 6 years ago
There are many great resources to reference here:<p><a href="https:&#x2F;&#x2F;www.science.wiki&#x2F;search?keyword=audio+processing" rel="nofollow">https:&#x2F;&#x2F;www.science.wiki&#x2F;search?keyword=audio+processing</a>
iagooarabout 6 years ago
Contact the founder &#x2F; maker of Auphonic.com - he&#x27;s a super nice and clever guy who does this kind of stuff for a living. He&#x27;ll definitely point you into the right direction.
jamesb93about 6 years ago
This depends if you&#x27;re interested in creative applications or analytical (MIR) ones. The two fields share a lot of techniques, but the way they are used is wildly different.
preetiagarwalabout 6 years ago
thanks for sharing article <a href="https:&#x2F;&#x2F;www.exltech.in&#x2F;mechanical-design-training.html" rel="nofollow">https:&#x2F;&#x2F;www.exltech.in&#x2F;mechanical-design-training.html</a>
xylophoneabout 6 years ago
piston aircraft?