
Bit depth: definition
In digital audio, the bit depth is the number of bits of information in each sample and is closely linked to the resolution of the audio. Unlike an analog signal, which is periodic and is made up of infinite points, digital audio is a discrete signal since it is made up of a finite number of points. Use binary numbers (bits) to determine the number of states available to represent the strength of each audio sample and thus represent the signal. “The quality of the representation generally increases as this number of states increases. For example, […] recording of high-fidelity music is obtained on a CD with 65,536 levels of amplitude. The number of possible states of an n-digit (n-bit) binary system is E = 2 ^ n. ” 1. In summary, it is the resolution, in terms of amplitude, that a digitized signal will have. Determine the dynamic range that said signal has. In the following image we can see how a signal is represented in 4-bit depth. 4 bits generate 16 possible values on the vertical axis.
Requirements
A very important aspect to keep in mind is that at a greater bit depth we are going to need more resources to process the audio and more memory to save it. This is because we will have more information. The size of our audio file will be given by the following account:
Number of bits * Sample rate * number of seconds in duration [* 2 (if it is a stereo signal)]
So, for example, the size of a second of audio on a CD, which works with a depth of 16 bits and a sampling rate of 44,100Hz / second is going to be given by the following account:
1 second = 16 * 44100 * 2 (since it is stereo)
1 second = 1411200 bits (0.1764 Mb)
Comparing different bit depths
In the following table we can compare the dynamic range (in decibels) and the number of possible amplitude values of a digitized signal with different bit depths.
Obviously, the higher the number of bits, the higher the states are possible. The following example compares two pieces of music, leading them to a 16-bit to 4-bit transition. The first piece works in more depth, and the transition is much more noticeable, the result in 4-bits is perceived as the effect of “aliasing”. In the second piece, less dynamic range is used, so the transition it undergoes is almost imperceptible to the ear.











