AI headphones let wearer listen to a single person in a crowd by looking at them

956 points by keploy 12 months ago

72 comments

serial_dev 12 months ago
If the size could shrink to the size of a small earplug, I'd love to use this as a person who is not hearing-impaired (at least they couldn't diagnose me with it, so now I'm not sure if their diagnostics suck, or I'm just a normal person and others are better at pretending that they hear everything well).

In groups and with friends, it's inevitable that you end up in a busy restaurant or a bar, and it always frustrates me that I don't hear something, I ask the person to repeat it only to not hear it again, usually because they repeat it at the same low level (considering the circumstances). Missing jokes and throwaway comments is even worse ("hey what are you all laughing about, I didn't hear it, could you repeat it for me like three times until I hear it").
KaiserPro 12 months ago
One thing that the HN crowd should appreciate is just how expensive and shit hearing aids are.

Go and look up the price: they are deeply expensive, even for basic "make it louder" type aids.

Worse still, because they interfere with your ear, you tend to lose the ability to "steer" your hearing. This means that you can't tune out other conversations, noises, and so on.

The one good side effect of Facebook spending billions on its (probably) futile search for practical and popular AR is https://www.projectaria.com/glasses/

Which is a (cheap) platform for experimenting with AR-type actions.

However, it has eye tracking, a microphone array and front-facing cameras, so it can be fairly easily modified into a steerable microphone.
CodeCompost 12 months ago
As somebody who is hearing impaired, a feature like this would be a Godsend for me! This feature should be integrated into hearing-aids ASAP! Shut up - no, actually - keep talking and take my money!
anonzzzies 12 months ago
This but more advanced would quite nicely help with my tinnitus. I hear fine when one person is speaking (even softly and at a distance), but multiple or with music, I hear nothing.
toomuchtodo 12 months ago
Code: https://github.com/vb000/LookOnceToHear
foreigner 12 months ago
I'll bet they achieve commercial success with the reverse application. Imagine being able to mute that one obnoxiously loud person with an annoying voice at a party!
cushychicken 12 months ago
I used to work at Sonos, long before their current app update debacle and headphone debut.

During the first aborted product effort to develop headphones, we were looking at a conceptual feature similar to this: selectively allowing people’s voices through the ANC chipset.

I don’t recall the exact approach the DSP folks were using (I was closer to the hardware for ANC) but they were really only able to figure out how to isolate the wearer’s voice by virtue of that signal having more power than all the others.

This is terribly cool. I wonder what other kinds of fun you could have with headphones. ANC chipsets are incredibly powerful and I’d wager their capabilities are not even close to fully tapped.
OkGoDoIt 12 months ago
The open source code is at https://github.com/vb000/LookOnceToHear and the research paper is at https://arxiv.org/abs/2405.06289

So perhaps this is not as out of reach as many pop-science articles. I’d love to hear if anyone is able to get this working independently.
chabad360 12 months ago
This could actually be really helpful to me, as I have trouble hearing someone speaking in a busy room because my mind is trying to pick up everything (I think this is because of my ADHD). Having a way to significantly quiet other noises aside from the voice of the person I'm speaking with would be amazing.
maxglute 12 months ago
A potential feature I didn't know I needed. I have headphones with ANC on around the home all the time, and it would be really useful if they automatically passed through my partner's voice.
gexla 12 months ago
They couldn't use this to listen to me. They would just get "I am just a large language model, I can't help you with that."

I use a lot of curse words. ;)
keploy 12 months ago
Imagine it helping people with autism and ADHD! People with ADHD have a hard time listening to one person because part of the brain tries to listen to all the other conversations going on around them.
i5heu 12 months ago
This reminds me of NVIDIA RTX Voice [0]. Although not made to isolate single persons, this is quite impressive. I hope that this single-person isolation will find its way to consumer noise-cancelling headphones.

[0] https://www.youtube.com/watch?v=uWUHkCgslNE
astatine 12 months ago
I used to think of building something related: letting a mic pick up a single person to handle questions from the audience during presentations. It would save the hassle of passing mics around.

This looks like it could do just that, with the headphones feeding directly into the mixer and behaving like a focused mic.
btbuildem 12 months ago
When I was a youngling, I dreamed of having headphones with the opposite power -- muting specific people. For me, it's not the hubbub of a crowd that's distracting, it's usually one or two offending specimens, like in the video example, the inconsiderate vermin using a speakerphone in public.

I wonder if the problem maps easily from "select this source" to "select everything but that source"
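One naive way to picture the mapping from "select this source" to "select everything but that source" is subtraction: if some extractor returns a time-aligned estimate of the target voice, the remainder of the mixture is everything else. The sketch below is only an illustration under that assumption; `everything_but` is a hypothetical helper, not something from the linked repository, and any error in the target estimate would leak into the result.

```python
def everything_but(mixture, target_estimate):
    """Return the mixture with the estimated target source subtracted out.

    Both arguments are equal-length lists of audio samples. The target
    estimate is assumed (hypothetically) to be sample-aligned with the mixture.
    """
    if len(mixture) != len(target_estimate):
        raise ValueError("mixture and target_estimate must have the same length")
    return [m - s for m, s in zip(mixture, target_estimate)]


# Toy example: a mixture and the part of it attributed to the unwanted voice.
residual = everything_but([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
```

In practice a learned extractor will not be sample-exact, so a real system would more likely be trained directly to output the complement rather than rely on subtraction.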
amusingimpala75 12 months ago
How much is the AI necessary for this? At least for the targeting of sounds in the line of sight, that should be fairly easy to do without AI, but I don’t know about the human voice identification.
29athrowaway 12 months ago
https://en.m.wikipedia.org/wiki/Cocktail_party_effect
adwi 12 months ago
The “cocktail party effect” externalized. Extremely cool.

https://en.m.wikipedia.org/wiki/Cocktail_party_effect
masfoobar 12 months ago
Over the last 60 years, with the exception of sound quality, comfort, and eventually going cordless, headphones have not changed much in terms of style and appearance.

However, I think in the next 50 years, headphones will disappear or, should I say, evolve into part of the human anatomy. Same thing for screen monitors, mouse/keyboard, smartphones, etc.

Think about it. The way things are going, along with "AI" (sure, a buzzword in a number of ways, but something that will change our way of living), many of the things we use will be replaced and, likely, be simple extensions or, dare I say, be implanted.

Hard drives will be a thing of the past. Everything will be (as we call it today) "cloud-based" and we will be more cybernetic than we think. Of course, someone today will fear such an idea. As we slowly accept the little changes, in 50 years we will look back and think "how did they cope without it", a bit like how someone today looks back and thinks "how did they cope without the internet".

Many fear what they don't understand. It is the unknown. AI is a fear factor for many. For me, I accept it for what it is and the changes it will bring to our lives and our careers.

All I will say is -- strap yourselves in, it will be a bumpy ride. I hope we make it through without destroying ourselves. Once we get past it, the world could (finally) be at peace and, to quote a famous TV show, "to boldly go where no man has gone before!"
andrewstuart2 12 months ago
This reminds me a lot of https://github.com/xiph/rnnoise and my use of it locally. It zeroes in on voice via RNN which seems to beat most other noise detection filters I've tried. Unfortunately, I mostly disable it these days since it's a bit harder to tune than I'm up for, but it's by far the most promising local noise reduction I've used.
bdw5204 12 months ago
In my experience, most people don't seem to understand the concept of noise cancelling headphones and will still try to talk to people who clearly can't hear them. I can't imagine it'd be any different for these AI headphones in practical use. Probably worse because the person you're actually trying to talk to might think you can't hear them.
briansm 12 months ago
Bit annoying that they added ambient music over the demo youtube video, spoiling the one thing you want to demonstrate.
ck_one 12 months ago
Pretty cool what they are working on. However, I wish there were more funding for restoring hair cells, whose loss is the root cause of hearing loss for most people.

Researchers are getting closer. Dr. Chen from Harvard was able to regenerate hair cells in mature mice last year.

The problem is also becoming more widespread. 30 million people in the US and 400 million people worldwide have disabling hearing loss. Regenerating hair cells and the synapses around them would also cure tinnitus. 30 million x $5k for a treatment = $150B market (probably way bigger with an aging population).

I think we probably need more rich tech billionaires to be affected in order to attract large funding.

Which billionaires do you know of who are affected, besides:

- Brad Jacobs

- Ryan from Flexport/Founders Fund
nox101 12 months ago
Sounds like a great way to spy on all people and extract all conversations. I can't wait for judges to declare that all conversation at your office must be recorded like some of them have for chat. This tech is a step to enable such a thing.
keploy 12 months ago
I hope it's not just a prototype press release; it would help people with hearing loss.
dboreham 12 months ago
Presumably the tv ad would feature Gene Hackman. Edit: an AI simulation of Gene Hackman.
chrisknyfe 12 months ago
Before getting all excited that your ML model runs on your brand new 2024 MacBook, before you run off to create earbuds / hearing aids with it, please try to run it on-target and see whether your model runs within your runtime budget / power budget / device size budget / battery life budget.

And if you're going to do Bluetooth + Wi-Fi, remember that both Bluetooth and Wi-Fi transmit on 2.4 GHz and need to coordinate in order to coexist in the same IoT device. There are interconnects and wire protocols to connect the Bluetooth and Wi-Fi chips together, or, preferably, you buy a chip that does both.
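The runtime-budget check described above can be sketched in a few lines: a streaming model has to finish each audio chunk in less time than the chunk itself lasts, with headroom left for everything else on the device. The chunk size, sample rate and 50% headroom below are illustrative assumptions, not figures from the paper, and `process_chunk` stands in for whatever model call you would actually time on the target hardware.

```python
import time


def chunk_budget_ms(chunk_samples, sample_rate_hz):
    """Duration of one audio chunk in milliseconds."""
    return 1000.0 * chunk_samples / sample_rate_hz


def worst_case_ms(process_chunk, chunk, repeats=50):
    """Slowest observed wall-clock time (ms) for processing one chunk."""
    worst = 0.0
    for _ in range(repeats):
        start = time.perf_counter()
        process_chunk(chunk)
        worst = max(worst, (time.perf_counter() - start) * 1000.0)
    return worst


def fits_budget(processing_ms, chunk_samples, sample_rate_hz, headroom=0.5):
    """True if processing uses at most `headroom` of the chunk duration."""
    return processing_ms <= headroom * chunk_budget_ms(chunk_samples, sample_rate_hz)


# Hypothetical numbers: 128-sample chunks at 16 kHz last 8 ms each.
budget = chunk_budget_ms(128, 16000)
```

Timing on a laptop only gives a lower bound; the same measurement has to be repeated on the embedded target, alongside power and memory measurements that this sketch does not cover.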
helsinkiandrew 12 months ago
Presumably this could be used to block out specific voices/sounds.

There's an episode of the sci-fi show Black Mirror (White Christmas), where a person is convicted of some hideous crime and permanently blocked/made invisible and inaudible from everyone (the entire population has embedded audio/video processing enhancements by then).

You can imagine future headphones where you could block out the guy in your office with the annoying laugh or download 'blocks' from the headphone appstore: no more Rick Astley or the politician you don't like, etc.
cush 12 months ago
Their paper is quoting an end-to-end latency of under 20ms… so impressive!
stubish 12 months ago
Curious what sort of processing power or chipsets the 'onboard embedded computer' needs. Could this be an iPhone app? Or is this going to require new, specialized hardware to commoditize?
tromp 12 months ago
> To use the system, a person wearing off-the-shelf headphones fitted with microphones taps a button while directing their head at someone talking. The sound waves from that speaker’s voice then should reach the microphones on both sides of the headset simultaneously; there’s a 16-degree margin of error.

Perhaps the accuracy of identifying the correct voice could be vastly increased by adding video input. The AI can then try to match the various voices with the lip movements in the center of the video, basically lip reading.
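To get a feel for what "simultaneously, with a 16-degree margin" means in timing terms, here is a rough far-field sketch of the arrival-time difference between the two ear microphones. The 0.18 m microphone spacing is an assumed, hypothetical value, the speed of sound is approximate, and whether the quoted 16 degrees is a total width or a per-side tolerance is not stated in the quote; the code treats it as per-side.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate value for air at room temperature


def arrival_time_difference(angle_deg, mic_spacing_m):
    """Far-field time difference of arrival (seconds) between two microphones.

    angle_deg is the talker's direction measured from straight ahead.
    """
    return mic_spacing_m * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND_M_S


def within_margin(delay_s, mic_spacing_m, margin_deg=16.0):
    """True if a measured delay is consistent with a talker inside the margin."""
    return abs(delay_s) <= arrival_time_difference(margin_deg, mic_spacing_m)


# Assumed (hypothetical) spacing between the two ear microphones: 0.18 m.
max_delay_s = arrival_time_difference(16.0, 0.18)
```

With these assumed numbers the tolerated delay comes out at well under a millisecond, which gives a sense of how tight the timing window is for a voice to count as "in front of" the wearer.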
p0w3n3d 12 months ago
No need to stand close to the Big Brother's telescreen anymore
seydor 12 months ago
I think someone made something similar in the 80s by using blind source separation techniques like ICA.

But this is very useful for people like me who don't hear well in the high frequencies.
glial 12 months ago
This is pretty amazing -- and a practical application of a solution to a notoriously tricky problem called the "cocktail party problem."[1] For a small subset of researchers, writing an algorithm to isolate a voice in a crowd is on par with e.g. writing an AI to play Go.

[1] https://en.wikipedia.org/wiki/Cocktail_party_effect
resource_waste 12 months ago
> A University of Washington team

Oh, so it barely works and it's a proof of concept.

What is the interesting thing here? We all know how sound waves work. Pretty sure this technology is old. Until there is a product here, it just sounds like you are rehashing noise cancellation.

Academia has dug this grave of skepticism. I just have 0 faith this will get to market through University of Washington. Maybe it will be patented and used even less!
MisterBastahrd 12 months ago
While not exactly the same, I came across an app called Tunity the other day. It allows you to use your phone camera to catch the live audio feed of the television that you are attempting to watch, whether the audio is muted or if it's in a loud, crowded location, like a sports bar or airport. I haven't used it, but it's an interesting concept.
jaustin 12 months ago
Next can we have them identify ambient noises that need amplification for safety reasons, like the nearly-silent electric car about to run me over, or the bike I'm about to accidentally step in front of? As someone who spends a bit too much of my time walking around on calls, I think selective amplification of ambient sounds for safety would be amazing!
andy 12 months ago
I opened an issue about this. Maybe someone here knows.

I see a Python script I can run on my computer. I haven't tried it yet, but I think I could connect a microphone, process real-time audio and output it in real time. What I don’t know is how to detect the user looking at someone. Could you tell me how that works?
bernardlunn 12 months ago
I am in the market but “The system is not commercially available”. This is a perfect opportunity for Apple.
algasami 12 months ago
When ANC headphones came out, my friends thought about something like filtering certain sounds away. I bet many people have also had this kind of idea, but nevertheless, haven't actually built it. This looks intriguing, and with open-source POC code, it seems promising.
jmugan 12 months ago
I want to filter out all non-nature sounds. I dream of walking through the airport or the park in peace. AI seems the way to go with that since you have to predict the sound to counteract it. Good to see we are finally making progress.
thrawn0r 12 months ago
This could easily hold a library of voices that you interact with (e.g. at a bigger table of friends and family) and let you toggle relevant voices in and out. Apple, please include this feature in your AirPods, thanks! :)
Wildgoose 12 months ago
My daughter has an auditory impairment which she describes as "brain deaf".

Basically, her hearing is perfect but her brain struggles to process sound in a noisy environment; she can't single out what she is listening to.

This sounds perfect for her!
surfingdino 12 months ago
How does it solve the problem of humans being able to detect that someone's looking at us? We tend to stop talking when we sense someone's staring at us.
23B1 12 months ago
A useful tool for when you need to surveil a shady multinational called Quantum while they discuss their evil plan during a performance of Tosca.
brap 12 months ago
I don't technically have a hearing problem, but sometimes when there are a lot of noises occurring at the same time I hear it all as one jumble.
fergie 12 months ago
This is an actual thing that could work: AI's ability to "stem" voices and instruments is really impressive.
anonu 12 months ago
By looking at them AND pressing a button. But they might be able to get rid of the button with some sensors and AI.
xracy 12 months ago
Feels like what AI should be used for... "filtering out the noise" rather than creating it.
nutanc 12 months ago
What about the privacy concerns? So basically I can just look at a couple of people talking and eavesdrop?
causi 12 months ago
I would like AI headphones that let me pinpoint the source of noises, such as inside a car engine.
slaymaker1907 12 months ago
Oh my god, these would be absolutely amazing as someone with auditory processing issues.
risfriend 12 months ago
This is stuff for spy movies.
tacocataco 12 months ago
You could even make a blacklist of people you don't want to hear!
muhammad-saalim 12 months ago
Curious if it will also help to find a missing person in the crowd.
hpen 12 months ago
Does this use a computer vision camera, or how does it work?
m3kw9 12 months ago
This is the equivalent of staring but for ears.
oakpond 12 months ago
Honestly AI speech recognition still sucks so bad I'm basically convinced it will fall on its face in many daily use cases.

I realize this is slightly tangential, but please don't replace customer support with chatbots or whatever you want to call them. It's a freaking horrible experience.
wuziyue1214 12 months ago
I think this is an awesome invention
yosito 12 months ago
This will appeal to eavesdroppers.
classified 12 months ago
Is it April 1st already?
m463 12 months ago
get two of these for e2e communication (peer -> ear)
entico 12 months ago
Can't wait to see people using earphones at parties
yesbut 12 months ago
This is not AI.
Ibreezy 12 months ago
Ok
VikingCoder 12 months ago
Fry_Shut_Up_And_Take_My_Money.gif
qup 12 months ago
Occasional bartender here. Okay!
a-dub 12 months ago
cue 2024 gene hackman!
eisbaw 12 months ago
aka beam-forming. No AI needed, just good mics.
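For readers unfamiliar with the term, classical beamforming in its simplest form is delay-and-sum: shift each microphone channel so the target's wavefront lines up, then average, which reinforces the target and partially cancels sound from other directions. The sketch below illustrates that textbook technique only; it is not the method used in the linked paper, and the integer-sample delays and toy signals are assumptions made for the example.

```python
def delay_and_sum(channels, delays):
    """Average microphone channels after advancing each by its steering delay.

    channels: list of equal-length sample lists, one per microphone.
    delays:   non-negative integer number of samples by which the target
              arrives late at each microphone.
    Samples that would be read past the end of a channel count as silence.
    """
    n = len(channels[0])
    out = []
    for t in range(n):
        total = 0.0
        for ch, d in zip(channels, delays):
            idx = t + d
            if idx < n:
                total += ch[idx]
        out.append(total / len(channels))
    return out


# Toy target that reaches the second microphone two samples later.
mic_a = [1.0, 2.0, 3.0, 0.0, 0.0]
mic_b = [0.0, 0.0, 1.0, 2.0, 3.0]
steered = delay_and_sum([mic_a, mic_b], [0, 2])    # aligned on the target
unsteered = delay_and_sum([mic_a, mic_b], [0, 0])  # no alignment, target smeared
```

Steering toward the talker recovers the toy signal, while the unsteered average smears it. Note that this selects by direction alone; it has no notion of whose voice it is, which is the part the comment leaves open.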
swayvil 12 months ago
Now do the same thing with video.

Turn anything into a mirror, or something like that.
JSDevOps 12 months ago
How is this AI? Not just some form of a parabolic microphone
albert_e 12 months ago
I love this.

I know this is just the beginning and the tech and UX will mature a lot, but being able to consciously choose what we allow into our sensory world would be a great superpower to have.

In the distant future this will all be embedded inside a cochlear (neural?) implant.

You can "save" known voices, prioritize them, identify various scenes/modes automatically like meetings/parties/concerts/driving/walking etc., and know when to allow external sounds in (alarms, honks, someone calling for your attention, etc.)

And with great power yada yada.

I can already imagine a few ways this can be misused / abused / create non-existent challenges and problems too. But I am (cautiously) optimistic that we as the human race will collectively figure out how to steer these new technology applications into net positive territory.

2040: iAudio and xSmell blamed for people losing their connection with nature's sounds (like bird chirps and flowing streams) and smells (petrichor), the things that inspire us, make us creative, make life worthwhile, and make us human.