
Definition
In digital audio, the bit depth is the number of information bits of each sample and is closely linked to the resolution of the audio. Unlike an analog signal, which is periodic and is composed of infinite points, digital audio is a discrete signal since it is composed of a finite number of points. Use binary numbers (bits) to determine the number of available states to represent the strength of each audio sample and thus represent the signal. “The quality of the representation increases, in general, when this number of states is increased. For example, […] high-fidelity music recording is obtained on a CD with 65,536 amplitude levels. The number of possible states of a binary system of n digits (n bits) is E = 2 ^ n. ” 1. In summary, it is the resolution, in terms of amplitude, that will have a digitized signal. Determine the dynamic range of that signal. In the following image we can see how a signal is represented in 4 bits of depth. 4 bits generate 16 possible values on the vertical axis.
Aspects to consider
The accuracy of each sample is determined by its bit depth. Then, the higher the bit depth, the higher the resolution in the digitized signal. In addition, the greater the bit depth, the greater the dynamic range for the signal because it will have more points to represent the amplitude of each audio sample. It follows that low levels of bit depth can affect the shape of the wave and thus not achieve a good representation of the original wave because there are fewer possible points to represent it. For example, in the following graph we can see a sinusoid represented with different bit depths. A depth of 1 bit will generate a wave more similar to the square wave (depending on the quantification) because we only have two possible points on the vertical axis.
Requirements
A very important aspect to keep in mind is that at greater bit depth we will need more resources to process the audio and more memory to save it. This is because we will have more information. The size of our audio file will be given by the following account:
Bit number * Sample rate * number of seconds duration [* 2 (if stereo signal)]
Then, for example, the size of a second of audio on a CD, which works with a depth of 16 bits and a sampling frequency of 44,100Hz / second will be given by the following account:
1 second = 16 * 44100 * 2 (since it is stereo)
1 second = 1411200 bits (0.1764 Mb)







