Listen to what gets lost when an MP3 is made (2015)

142 点作者 teleforce6 个月前

26 条评论

gwbas1c6 个月前

In 1999 when MP3 was getting attention, I tried to do this. I encoded a file, then inverted it, and mixed it back into the original.It didn't cancel anything out.The reason: Mp3 dramatically alters phase. Because all the phases are different, it's hard to naively determine how the signal is altered.Years later, I took the time to write a series of tools to investigate lossy audio: <a href="https://andrewrondeau.com/blog/2016/07/deconstructing-lossy-audio-the-case-for-lossless" rel="nofollow">https://andrewrondeau.com/blog/2016/07/deconstructing-lossy-...</a>

评论 #42183458 未加载

评论 #42221947 未加载

a-french-anon6 个月前

This article could at least have a paragraph explaining (in a dumbed-down way) why various kinds of psychoacoustic masking (temporal, frequency) make what's removed almost inaudible anyway. Reading the linked source (<a href="https://www.theghostinthemp3.com/theghostinthemp3.html" rel="nofollow">https://www.theghostinthemp3.com/theghostinthemp3.html</a>), he at least used LAME, but at a fixed 128 kbps bitrate, not in VBR mode =(EDIT: nerds should read about sfb21 (<a href="https://wiki.hydrogenaud.io/index.php?title=LAME_Y_switch" rel="nofollow">https://wiki.hydrogenaud.io/index.php?title=LAME_Y_switch</a>), AAC, Vorbis and Opus (CELT) aren't just theoretical improvements

评论 #42182268 未加载

cladopa6 个月前

There is a trick there. A sound can mask another sound. You will not be able to tell the difference with both sounds playing at the same time, but if you subtract them you can hear it because there is no masking.I always loved to test the ears of my "Audiophile" friends. They will tell you how different MP3s are. You make a bet they can not differentiate them in 20 trials better than chance. I won with most people but some professional musicians that can identify little differences.

评论 #42183011 未加载

评论 #42183955 未加载

评论 #42182950 未加载

评论 #42183169 未加载

评论 #42183008 未加载

评论 #42183167 未加载

评论 #42184278 未加载

BurpyDave6 个月前

Ironically, the 'diff' is compressed anyway, because it's on Vimeo, so that's not the actual diff either!

评论 #42183271 未加载

jonathanstrange6 个月前

So what. People listened to music on mechanical gramophones and enjoyed it. Too many audio engineers think it's all about the sound, when in the end it's about the music and the feelings it expresses.

评论 #42182542 未加载

评论 #42181865 未加载

评论 #42184247 未加载

评论 #42183766 未加载

Quarondeau6 个月前

Interesting approach. So are we only able to hear those sounds now because the rest of the music was removed, which would ordinarily mask the missing sounds?To say that the mp3-encoded version is not "what the artist recorded and wanted for us to hear" would imply that we can hear all sounds in the uncompressed recording.

评论 #42181991 未加载

reliablereason6 个月前

You can try it yourself:ffmpeg -i original.wav -codec:a libmp3lame -b:a 192k output.mp3 && \ffmpeg -i output.mp3 decoded.wav && \ffmpeg -i original.wav -i decoded.wav -filter_complex "[1:a]aresample=async=1,volume=-1.0[inverted];[0:a][inverted]amix=inputs=2:weights=1 1" difference.wav

评论 #42182406 未加载

评论 #42183211 未加载

jonnycomputer6 个月前

"What MaGuire has proved here is that the songs we listen to every single day are not the exact master copy that the artist recorded and wanted for us to hear. Instead, they are slightly stripped versions of their art run through a set of standards created by a bunch of engineers in 1993. For many people, that won’t matter. The songs sound almost the same, but the compression of music into an MP3 format is an important question to weigh when considering artistic intent and analyzing songs that aren’t exactly the original."I feel like this analysis isn't well grounded in what artists and sound engineers actually do, or how they think.

评论 #42187776 未加载

评论 #42183936 未加载

NoPicklez6 个月前

Fairly lackluster article.Not all .mp3's are created equally and can vary in how lossy they are based on the bitrate.If you care enough to want to hear exactly what the artist wants you to hear, you just listen to the lossless version.

评论 #42183823 未加载

评论 #42183320 未加载

kazinator6 个月前

> You can hear so many unnecessarily rejected sounds.That accusation requires evidence based in psychoacoustics. Just because you can hear it in isolation doesn't mean you can hear it if it is added back to the host audio.For instance when some quiet sound that is masked by immediately preceding loud sound is removed, of course you can hear that quiet sound in isolation! Your hearing has something like 120 decibel dynamic range, or better.You can hear differences in the compressed audio. Nobody can claims that there's no degradation in quality. Artifacts are obvious. Much more so at lower bit rates, though. MP3 starts to sound quite good around 192 kbps.The removal of those components is necessary. It is necessary to the algorithm so that it can achieve compression.Also there's this issue. If we take a signal and apply some modest EQ to it. Say we boost the bass and treble and cut me a little bit. Or any other EQ profile. If we then level match the two signals and subtract them from the other, there will be a difference: some aspects of the original material will be recognizably heard. For instance the difference between a slightly treble cut signal and the original will be the treble. But the trouble was not completely cut from the original. What you're hearing in the difference is not something that was entirely removed.

CGamesPlay6 个月前

Found the original author's page about the project (no longer on the internet): <a href="https://web.archive.org/web/20211011015410/http://ryanmaguiremusic.com/theghostinthemp3.html" rel="nofollow">https://web.archive.org/web/20211011015410/http://ryanmaguir...</a>One interesting thing to note: this is a composition, not an analysis. It's not fully documented exactly what modifications to the "raw data" were made.

评论 #42181851 未加载

Agraillo6 个月前

Why Tom's Dinner? Because it is a cappella. There's a book "How Music Got Free" by Stephen Witt [1] detailing the history of mp3 format and related events. It is a very good read and there's an explanationIncreases in processing power spurred progress. Within a year Brandenburg’s algorithm was handling a wide variety of recorded music... But one audio source was proving intractable: what Grill, with his imperfect command of English, called “the lonely voice.” (He meant “lone.”) Human speech could not, in isolation, be psychoacoustically masked. Nor could you use Huffman’s pattern recognition approach—the essence of speech was its dynamic nature, its plosives and sibilants and glottal stops. Brandenburg’s shrinking algorithm could handle symphonies, guitar solos, cannons, even “Oye Mi Canto,” but it still couldn’t handle a newscast. Stuck, Brandenburg isolated samples of “lonely” voices. The first was a recording of a difficult German dialect that had plagued audio engineers for years. The second was a snippet of Suzanne Vega singing the opening bars of “Tom’s Diner,” her 1987 radio hit.[1] <a href="https://en.wikipedia.org/wiki/How_Music_Got_Free" rel="nofollow">https://en.wikipedia.org/wiki/How_Music_Got_Free</a>

sdk776 个月前

Very interesting! The audio of Tom's Dinner rejected by the encoding sounds mesmerizing to me. I still find it to be musical - it reminds me of a record I bought a really long time ago, it was called modulation & transformation on mille plateaux, it's a collection of songs in the abstract and experimental genre.

pvillano6 个月前

Two instances where lossy compression failed for me are the movie Koyaanisqatsi and songs by the artist TOBACCO. Koyaanisqatsi has a lot of film grain and TOBACCO uses a lot of distortion. There is noise in there, but it's very deeply mixed into the signal.

0points6 个月前

This is why we dont encode mp3 in 96kbps or whatever.

moomin6 个月前

I think the thing that's really sticks out is that the breath noise are gone, which is one of the things that gives the track its character. Willing to bet the same kind of thing happens to fret noise as well.

评论 #42183712 未加载

no-such-address6 个月前

Funny article."The exact master copy that the artist recorded and wanted for us to hear" In the digital era, does that even, uniquely, exist?"a set of standards created by a bunch of engineers in 1993" Nice!Was hoping the article would mention double blind studies about the ability to perceive differences and the quality between various audio file format, available elsewhere. Interesting, though not as overwrought as the reporting in this article.

评论 #42184009 未加载

HPsquared6 个月前

This could also be done on visual compression with JPEGs.Or on video compression, for that matter.It just shows though that these diffs are invisible to a human - by design.

评论 #42182671 未加载

chrsgrrtt6 个月前

I developed a streaming service many years ago; Dolby wanted us to use their codec for the audio, and they used a track just like this as the primary basis of their sales pitch. Was quite impressive at the time.

ezconnect6 个月前

When I first experience CD audio it was too high pitch compared to tape versions. MP3 came along and each song sound different depending on the MP3 compression settings.

评论 #42182388 未加载

Timwi6 个月前

I would have liked a comparison with Ogg and perhaps other formats. I hear a lot about MP3 throwing away a lot more than Ogg but I'd love to see real data on it.

评论 #42183566 未加载

评论 #42182658 未加载

Traubenfuchs6 个月前

...I think this person just created a new genre of music. Something like: "What's lost noise."I immensely enjoyed listening to the "lost material" of Tom's Diner and would like to hear more of this!Maybe one could diff with a lower quality version, one where more has been cut away, more is lost/left over? There are so many possibilities!

评论 #42183006 未加载

ipunchghosts6 个月前

There's still audio motifs in there that can be further optized out.If the remaining audio was noise like, I would say we reached the compression limit.

Klaster_16 个月前

The article doesn't mention at what bit-rate the difference track was made, anyone knows? Seems disingenuous and pro-"authentic" otherwise.

评论 #42181761 未加载

评论 #42181775 未加载

评论 #42181758 未加载

kazinator6 个月前

The funny capitalization of moDernisT instantly gives away that it is an anagram of Tom's Diner.

grishka6 个月前

Now I want this comparison for Opus. It doesn't do that whole psychoacoustics thing, does it? But it also somehow manages to ~double the compression ratio compared to MP3 without any noticeable difference in the sound quality.

评论 #42183638 未加载

26 条评论

gwbas1c6 个月前

评论 #42183458 未加载

评论 #42221947 未加载

a-french-anon6 个月前

评论 #42182268 未加载

cladopa6 个月前

评论 #42183011 未加载

评论 #42183955 未加载

评论 #42182950 未加载

评论 #42183169 未加载

评论 #42183008 未加载

评论 #42183167 未加载

评论 #42184278 未加载

BurpyDave6 个月前

Ironically, the 'diff' is compressed anyway, because it's on Vimeo, so that's not the actual diff either!

评论 #42183271 未加载

jonathanstrange6 个月前

评论 #42182542 未加载

评论 #42181865 未加载

评论 #42184247 未加载

评论 #42183766 未加载

Quarondeau6 个月前

评论 #42181991 未加载

reliablereason6 个月前

评论 #42182406 未加载

评论 #42183211 未加载

jonnycomputer6 个月前

评论 #42187776 未加载

评论 #42183936 未加载

NoPicklez6 个月前

评论 #42183823 未加载

评论 #42183320 未加载

kazinator6 个月前

CGamesPlay6 个月前

评论 #42181851 未加载

Agraillo6 个月前

sdk776 个月前

pvillano6 个月前

0points6 个月前

This is why we dont encode mp3 in 96kbps or whatever.

moomin6 个月前

评论 #42183712 未加载

no-such-address6 个月前

评论 #42184009 未加载

HPsquared6 个月前

This could also be done on visual compression with JPEGs.Or on video compression, for that matter.It just shows though that these diffs are invisible to a human - by design.

评论 #42182671 未加载

chrsgrrtt6 个月前

ezconnect6 个月前

When I first experience CD audio it was too high pitch compared to tape versions. MP3 came along and each song sound different depending on the MP3 compression settings.

评论 #42182388 未加载

Timwi6 个月前

I would have liked a comparison with Ogg and perhaps other formats. I hear a lot about MP3 throwing away a lot more than Ogg but I'd love to see real data on it.

评论 #42183566 未加载

评论 #42182658 未加载

Traubenfuchs6 个月前

评论 #42183006 未加载

ipunchghosts6 个月前

There's still audio motifs in there that can be further optized out.If the remaining audio was noise like, I would say we reached the compression limit.

Klaster_16 个月前

The article doesn't mention at what bit-rate the difference track was made, anyone knows? Seems disingenuous and pro-"authentic" otherwise.

评论 #42181761 未加载

评论 #42181775 未加载

评论 #42181758 未加载

kazinator6 个月前

The funny capitalization of moDernisT instantly gives away that it is an anagram of Tom's Diner.

grishka6 个月前

评论 #42183638 未加载