This is all graphically easy to see on a spectrum analyzer, which is a good visual tool to look at audio the way our ears perceive it. That's why the noise spectrum of dithered silence at 44.1kHz/16 is not -96dB, but significantly lower (depending on FFT parameters).
-
-
@xiphmont has a footnote in the article I linked that notes how with an infinite window size, the dynamic range is effectively infinite; of course our ears don't have an effectively infinite window size. We can come up with a representative approximation for perceptual purposes.1 reply 0 retweets 0 likes -
Ultimately the real point is: the dynamic range of our ears is defined as the difference between the loudest (ear-damage level) sound and the quietest sound *we can perceive*. Since *we can perceive* a sound encoded at <-96dB in 16bit PCM, that is *not* the dynamic range.
1 reply 0 retweets 0 likes -
Remember the absolute threshold of hearing is defined for a *pure tone*, not broadband noise! It would be way higher for broadband white noise.
1 reply 0 retweets 0 likes -
I just did a little test, and I can perceive a pure tone at 800Hz at about -30dB relative to white noise. That means the dynamic range of a 16-bit PCM signal should be somewhere around 96+30 = 126dB. Shaped dither would improve this further.
1 reply 0 retweets 0 likes -
Understood, thanks for the explanation. Just one tiny remark. Ear response is logarithmic (both in amplitude and frequency), so instead of measuring the noise floor using an FFT, it would be more suited to use something like 1/3 octave frequency bands...
2 replies 0 retweets 0 likes -
When using these kind of measurements, if you have white noise, the obtained noise for each frequency band will be higher, as frequency increases (since the higher the frequency is, the more bandwidth the 1/3 octave band has).
1 reply 0 retweets 0 likes -
It could be interesting repeating your experiment for higher frequencies (e.g. 8 kHz) to see if the result differs.
1 reply 0 retweets 0 likes -
With the caveat that I'm not doing this scientifically (not double-blind/ABX, I'm just muting a track in audacity), yeah it's about 20dB up at 8kHz, not 30dB.
1 reply 0 retweets 0 likes -
Makes sense. Thanks for the explanations and the tests!!!
1 reply 0 retweets 0 likes
BTW, we can define a "window size" for the human ear based on the point where tones turn into rhythm, which is about 20Hz. That's 2400 samples at 48kHz. And it so turns out that at least Audacity's FFT at window=2048 puts full scale white noise at -30dB. Food for thought. 
Loading seems to be taking a while.
Twitter may be over capacity or experiencing a momentary hiccup. Try again or visit Twitter Status for more information.