TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

Thousands of bird sounds visualized using Google machine learning

232 pointsby ptrptralmost 8 years ago

23 comments

sharp11almost 8 years ago
As a birder, this looks like a failed experiment to me. Or I don't understand what their goal was. The groupings make little sense in terms of what these species sound like. I'm guessing that's an artifact of the way they sampled the sounds, losing macro properties. Kind of like grouping the words 'paramour', 'enmity' and 'hamster' together bc they all contain /m/ sound.
评论 #14581514 未加载
jlg23almost 8 years ago
Unfortunately they only hint at the envisioned application in the video and don't provide any further links, but the idea is amazing: Use sounds to monitor bio-diversity. Imagine we'd not need cameras and lots of luck to "catch" proof of an animals existence but a grid of interconnected omnidirectional microphones. We'd get real time tracking of individual animals in 3D and could have smartphones literally point the way to yet uncatalogued or even undiscovered species.
评论 #14578867 未加载
评论 #14581483 未加载
daxfohlalmost 8 years ago
From the video, it appears this is an AI-plotted hugely multidimensional space t-sne&#x27;d onto two dimensions.<p>Would be interesting to do some kind of ML on how best to present hugely multidimensional spaces onto two _interactive_ dimensions. Where one AI is deciding how things are projected and how it can be manipulated, and another AI is limited to some virtual &quot;mouse, keyboard, 2D screen&quot; to make inferences. Such that it&#x27;s optimized for faster, more correct inferences.
评论 #14577784 未加载
anotheryoualmost 8 years ago
more like sorting bird sounds, not visualizing...<p>The tiny images are just spectograms&#x2F;fft as far as I can tell.<p>edit: it&#x27;s very fun though to click+drag, haha
bkastermalmost 8 years ago
There are some instances of the same bird in multiple locations (great horned owl). Presumably multiple recordings of the same bird. My initial reaction to them not being neighboring is to wonder about the quality of the result. Maybe better feature engineering needed to make this biologically relevant. Any other interpretations?
mortehualmost 8 years ago
If you haven&#x27;t already, try zooming all the way out and drawing things using the grid as a canvas.
评论 #14577763 未加载
jbogganalmost 8 years ago
I really want to map these to a MIDI controller
评论 #14578474 未加载
评论 #14577382 未加载
bonoetmaloalmost 8 years ago
Is there a better way to pan than click-dragging 100 simultaneous bird sounds?
glupalmost 8 years ago
It would be interesting to see how similarity in bird call behavior tracks (or doesn&#x27;t) the phylogenetic relationship between species. My hypothesis would be that bird calls are influenced by other birds in the same ecosystem (imitation or differentiation, and reflecting a high degree of cultural learning) rather than the null hypothesis of genetic transmission.
thinkMOARalmost 8 years ago
Interesting, spammed a few bird lovers i know with it. Though they almost all replied the recordings are not good enough.<p>Though personally (jk) i was slightly disappointed when i zoomed out i didn&#x27;t see a big bird (or other bird) likeness.
trenalmost 8 years ago
Someone should make a Shazam&#x2F;Soundhound app for bird calls, I&#x27;d definitely buy it if it could narrow it down to a subset of possibilities.
评论 #14581257 未加载
bravuraalmost 8 years ago
I poked around and also looked at a similar experiment, the Infinite Drum Machine: <a href="https:&#x2F;&#x2F;aiexperiments.withgoogle.com&#x2F;drum-machine" rel="nofollow">https:&#x2F;&#x2F;aiexperiments.withgoogle.com&#x2F;drum-machine</a><p>Does anyone know what they are doing t-SNE on? i.e. are they just doing t-SNE on the raw waveforms? Or the MFCC spectrogram? Or what?
pishpashalmost 8 years ago
Next: add animal languages to Google Translate?
banealmost 8 years ago
It&#x27;s interesting, at the very local level I think most humans wouldn&#x27;t think two adjacent bird sounds are all that similar. But if you drag along a long line and listen to a series of different birds you can &quot;hear&quot; a definite organized progression that seems to be organizing rhythm and major tones into groups.
评论 #14581673 未加载
KasianFranksalmost 8 years ago
Feature request: Play All
voidmainalmost 8 years ago
This is my cats&#x27; favorite machine learning application so far!
评论 #14581525 未加载
zo1almost 8 years ago
If you left-click and drag around on this (with short pauses), you can almost hear something that sounds very close to R2-D2.
jaimex2almost 8 years ago
What? No Kookaburra?<p>It has the most unique call of them all.
simplehumanalmost 8 years ago
I see &quot;Oops, sorry for the tech trouble. For the best experience, view in &quot; for chrome on wayland...
MurrayHill1980almost 8 years ago
What task is made easier by this visualization?
coldcodealmost 8 years ago
Wonder what it would do with a mockingbird?
评论 #14578768 未加载
评论 #14578216 未加载
mirimiralmost 8 years ago
It doesn&#x27;t work in Firefox. How rude.
评论 #14578341 未加载
评论 #14579302 未加载
fairpxalmost 8 years ago
It&#x27;s good to see Google in the last couple of weeks launching a bunch of [1]projects that are more in line with their mission of &#x27;organising the world&#x27;s information&#x27;.<p>[1] <a href="http:&#x2F;&#x2F;www.smithsonianmag.com&#x2F;smart-news&#x2F;google-digitizes-3000-years-fashion-history-180963633&#x2F;?no-ist" rel="nofollow">http:&#x2F;&#x2F;www.smithsonianmag.com&#x2F;smart-news&#x2F;google-digitizes-30...</a>