TE
科技回声
首页24小时热榜最新最佳问答展示工作
GitHubTwitter
首页

科技回声

基于 Next.js 构建的科技新闻平台,提供全球科技新闻和讨论内容。

GitHubTwitter

首页

首页最新最佳问答展示工作

资源链接

HackerNews API原版 HackerNewsNext.js

© 2025 科技回声. 版权所有。

Show HN: I made a volumetric audio visualizer

90 点作者 rslice超过 2 年前
I&#x27;m developing Hyperstep[0], a spatial language for music production. I find using existing DAWs frustrating because they don&#x27;t allow me to navigate and operate intuitively on the latent spaces behind my musical ideas. This is why I&#x27;ve decided to build my own set of &quot;seeing tools&quot;.(Bret Victor)[1]. I&#x27;m also convinced that by framing music as processes and interactions in the 3D world, spatialization and mixing should become fairly pain-free.<p>I&#x27;m still early in development and I would love to build this into an actual product that can be integrated into existing DAWs or even turn it into a musical framework itself for AR and VR experiences.<p>If you&#x27;re interested in working on it or if you simply want to know more, feel free to contact me.<p>[0] <a href="https:&#x2F;&#x2F;github.com&#x2F;a-sumo&#x2F;hyperstep" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;a-sumo&#x2F;hyperstep</a>.<p>[1] <a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=klTjiXjqHrQ" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=klTjiXjqHrQ</a>

17 条评论

knaik94超过 2 年前
I am not sure sure what is causing it, but it takes a solid 2 to 3 minutes on my computer before it does anything. I load a file and it feels like it freezes and firefox gave me a warning banner that the tab was causing all of firefox to slow down. Same thing is Chrome and I have a i7-10750h. Some people might mistake that for it not working since there is not UI feedback of anything happening. Windows 10.<p>I got two different tracks to work, and it&#x27;s clear that one was a lot harder to process than the other. It took noticeably more time on the second one, to start and the CPU utilization was higher as well. They were both instrumental tracks in the same format and around the same length. The one simpler to process was the instrumental of Britney Spear&#x27;s Baby One More Time. The harder one was Porter Robinson&#x27;s Divinity.<p>Neither audio had an effect similar to the one from the demo video, but were interesting regardless. They both looked like how I imagine sound waves echo and bounce around if contained in a cube shape.<p>I appreciate the notebook writeup where you described the goals because the visualization wasn&#x27;t inherently intuitive with the sound. I chose much more complex tones than your demo. I imagine the feature extraction is much easier on isolated sounds. This reminds me a lot of project milkdrop and so I was expecting it to be closer to that but in 3d. That was probably a misunderstanding on my part of the goals for this.<p>I think exposing more parameters about how features get mapped and scaled would be really helpful in making it feel more intuitive. Zooming the cube in and out is nice but didn&#x27;t seem to help convey more information with the tracks I chose. If anything it got in the way because on my computer the zoom sensitivity was very very high.<p>I look forward to seeing where this goes.
评论 #33426812 未加载
pvg超过 2 年前
You might want to include some audio (or maybe it&#x27;s there and not loading in my browser) so people who try it can see what it is right away. People also don&#x27;t have to wonder what happens to their audio file to check it out.
评论 #33425518 未加载
f_devd超过 2 年前
For those wanting a textual description: It seems to take a FFT of the audio and put it into rings (something like frequency = ring size), and those rings are then duplicated&#x2F;projected to make spheres, the rings (above a certain size) are truncated to a cube shape. There is an option to have the rings move over the length of the sphere&#x2F;cube so you can see the FFT over time.
评论 #33431140 未加载
tartoran超过 2 年前
Looks neat, it sort of reminds me of winamp visualization plugins. It’s probably a good learning experience but as far as utility, in a recording studio, I fail to see how this would bring anything to the table besides it just being some cool visual thing.
评论 #33442346 未加载
dotancohen超过 2 年前
This is interesting. I wonder if it could be used in speech therapy, to help the speaker understand the difference between what a sound should sound like, and the sound coming out of their mouths.<p>I personally have problems saying two different letters differently, and I just recorded myself trying to say both letters. Sure enough, there is very little difference in the way they are both displayed. I will ask other people to say this letters, and see if I can spot the difference. Then I should be able to play around with differing lip, mouth, tongue, and larynx positions to see how close I can come.<p>Any tips for highlighting subtle differences in specific places would be appreciated!
评论 #33434124 未加载
nsxwolf超过 2 年前
I tried it on my iPhone with an mp3. It displayed what looked like a blue aerogel. I could zoom in and rotate but I couldn’t tell what kind of information it was trying to convey. It wasn’t playing the audio or changing with time in any way either.
评论 #33451835 未加载
EZ-Cheeze超过 2 年前
It&#x27;s really accurate to the music being played<p>I found a slightly-to-the side isometric closely-zoomed-in view to best show the finer waves and cadences being visualized<p>You should provide example track&#x2F;s<p>10&#x2F;10
stuntkite超过 2 年前
I really like this. Thank you for posting it and sharing code. It&#x27;s in line with my interests and I&#x27;m going to use the heck out of it.
dvh超过 2 年前
Works exactly once but only on short files, then I have to close browser (all windows) and restart. Chrome, Ubuntu.
评论 #33426259 未加载
bobsmooth超过 2 年前
Some visual feedback to show the file is loading would help but this is neat.
packetlost超过 2 年前
Broke for me on FF and chrome
评论 #33425696 未加载
knaekhoved超过 2 年前
What is the point of making this 3D when it seems to be circularly symmetric? Just replace the circle with a line and make it 2D, yeah?
评论 #33427264 未加载
评论 #33429906 未加载
techbro92超过 2 年前
I agree a 3d visual would be useful for spatialization and mixing but I don’t see how it would be useful for anything else.
评论 #33426033 未加载
评论 #33425982 未加载
remedan超过 2 年前
Went to pick a song from my music library to try, only to find that the visualizer doesn&#x27;t support flac.
评论 #33434730 未加载
eimrine超过 2 年前
I tried to open mp3 but nothing happened. Chrome outdated on Windows x32. Will try on a decent pc later.
davbryn超过 2 年前
Looks good - maybe supply a demo audio file for those of us who don&#x27;t have any locally?
naillo超过 2 年前
Worked for me :) Neat!
评论 #33428832 未加载