What is the audio bit depth?


Free Download Mp4Gain
picture

Understand what bit depth is, how it works, and how this feature will affect the quality of music during auditions;

Currently, many of those who are looking for quality audio or quality music keep mentioning “Hi-Res”, FLAC 24-bit, and MQA (Master Quality Audio) files. This is a growing trend in high-end smartphones that are trying to offer higher audio quality both in their DAC and in support of advanced Bluetooth audio codecs like LDAC, developed by Sony. Additionally, there are music streaming services that promise lossless audio quality, like Tidal.

BitDepth

The promise made by audio equipment manufacturers, developers of audio streaming and music streaming formats, is simple: superior audio quality due to the increased amount of data, also known as bit depth or English bit depth . There are 24 bits of 1 and 0 versus 16 bits on the CD. Of course, these high-resolution files are more expensive due to their quality, but the more bits, the better the result will be when listening to music, right?

Bitdepth

Low resolution audio is usually displayed using a jagged wave graph (with ladders). Source: soundguys
Low resolution audio is usually displayed using a jagged wave graph (with ladders). Source: soundguys
Well, the answer to the previous question is: not necessarily. The argument for a value in increasing bit depth is not based on something scientific, but on the distortion of what is actually happening and the exploitation of consumer ignorance about the media they are consuming. That is, it is a fact that stores selling 24-bit tracks reap far more benefits than a real improvement in promised sound quality.

Bit depth and sound quality.

The greatest example of companies selling 24-bit audio is the demonstration of a jagged sine wave, like stairs. When we look at a file that has a resolution of 16 bits, we see an irregular line, but when we look at the same song in 24 bits, it seems to be a practically smooth line, with better definition. It is a basic visual illustration, but depending on the person’s knowledge of the subject, he can be easily fooled.

Why use 24-bit or more audio files?

The utility of using a high-level bit depth applies to studios, because with each filter and conversion that is applied, the background noise increases. This increase in noise occurs due to the insertion of a new wave, as explained above. In other words, when using a higher bit depth level, the sound engineer prevents the original audio from generating noise by manipulating it for mixing and mastering.

However, remember that this will be more useful for audio production and not for the listener, as explained above.

conclusion
What will make the difference will be the balance between the sounds made in the mastering and not the bit depth itself, since the 16 bits of the CD are already more than enough for music listeners.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

Multimedia formats: Digital audio

 

Sound is a continuous signal. To be stored with computer systems
it must be sampled, thus obtaining a digital signal.
The parameters that characterize the sampling are basically three:

 The sample rate
 Bit depth
 The number of channels
these parameters influence both the space occupied and the quality of the audio file
digital obtained.

Digital Audio

Sampling rate

The sampling frequency is the measurement expressed in Hertz (Hz) of the number
of times per second in which an analog signal is measured and stored
in digital form.

Sampling rate
The higher the sampling rate, the more the sequence of the samples
digital will be close to that of the original analog waveform.
Low sampling rates limit the frequency range that is
can record, which in turn can generate a recording that
poorly reproduces the original sound.
Two sampling frequencies:
A. Low sampling rate,
which distorts the wave of the original sound
B. High sampling rate,
which perfectly reproduces the wave of
original sound
To reproduce a certain frequency, the sampling frequency
it must be at least double it (Nyquist theorem).
For example, audio CDs have a sampling rate of 44.100 Hz,
so they can reproduce frequencies up to 22.050 Hz, which are hardly found
beyond the limit of human perception of 20,000 Hz.
The following table shows the most common sampling rates for
digital audio.

Bit depth

The bit depth represents the number of bits used to store a
single digital sample.
When a sound wave is sampled, each sample is assigned
the amplitude value closest to the original wave amplitude. A depth
in high bits it provides as many amplitude values ​​as possible, which results in a
greater dynamic range (the difference in decibels between the maximum volume that the component can sustain without
distort the waves and the background noise it produces), lower and higher background noise
fidelity.
For example if you use 8 bits you have 256 possible values ​​(28
) that, being
relatively few, offer less sound quality than a
tape; if instead 16 bits per sample are used, 65536 values ​​are obtained
possible (216).
The most common examples are the audio CD, recorded in 16 bit, and the DVD, which
supports up to 24 bit depth.

Compression formats

Hand in hand with the advent of digitalization, multimedia applications have
they are increasingly widespread until they become commonplace. One of
multimedia features is certainly the use of digital audio
vowel and sound. The biggest obstacle associated with digitizing audio is
the large size of the files that are produced, which puts them at
sector operators (especially those linked to the internet) the problem of
reduce the space occupied by the data to obtain the double advantage of:
 save in terms of memory occupation;
 save in terms of transfer time on the network.

For this reason, speaking of digitizing the audio, it is necessary to speak
also of data compression techniques. The compression techniques of the
data, of whatever nature they are, are divided into:
 lossless: compress data through a lossless process
of information that takes advantage of redundancies in data encoding
 lossy: compress data through a lossy process
of information that takes advantage of redundancies in the use of data.

Lossless formats

Lossless compression formats are more suitable for archiving rather than
to reproduction, since most of them require complete
decompression before they can be played.
One of the most common lossless compression formats is FLAC (Free Lossless Audio Codec).

Lossy formats

Lossy compression formats use compression algorithms capable of
drastically reduce the amount of data required to store a sound,
guaranteeing however an acceptable and faithful reproduction of the original file uncompressed.

Some details of the sample rate

For many years it was thought that the sample rate or sampling frequency did not decisively influence the final quality of the digital audio; There are currently several engineers who record in 44.1K or 48K without really knowing why they do it. With the advent of new and better computers, interfaces, ports and protocols, 88.2K, 96K and up to 192K entered the discussion table on the best sample rate to use. It has always been the subject of discussion between engineers and audiophiles; some argued that they did hear the difference between different sample rates and others that did not, and the topic has been subjected to millions of A / B tests with very high quality equipment, causing all kinds of opinions found and uncompromising, fights and friendships of years broken

samplerate

While this is a basic issue of digital audio, it is always surrounded by a halo of mystery, mysticism and magic (like every sound theme), which is well worth clarifying.

 What is the sample rate?

This topic, although it occurs in the first or second class of digital audio, is not always understood correctly. In scholastic thinking, sample rate is defined as the amount of audio samples transported and taken per second. Since this is a unit of measurement over a second and with events that occur cyclically, the Hertz (1 / Frequency) is used as a unit. Obviously we cannot talk about this subject without referring to the Nyquist sampling theorem, which was tested by Shannon almost twenty years after its publication and in which it is stated that for a signal of limited bandwidth (B) (for example, a vibraphone reaches 14.917Hz), the sampling frequency must be twice its bandwidth (2 * B). Then, taking the previous example, we can say that: 2 * B → 2 * 14.917Hz → The sampling frequency for 14.917Hz should be 29.834Hz. This would be equivalent to 29,834 samples per second (1/29, 834) to be able to regenerate the signal of a vibraphone without error. Hence, it is taken that the highest frequency that human beings listen to is 20kHz and if we apply Nyquist it should be 40kHz, but it takes 44.1kHz to meet the demanding ears and for a matter of multiples.

44.1K or 48K to 88.2K or 96K, the correct division

At the dawn of the digital audio era, Nyquist was used to use the sampling resolution of 44.1K, used at that time audio CD format that played at 16bit / 44.1kHz. With the advent of DVD and Blu Ray as video and audio formats, resolutions such as 24Bits / 48K or 24Bits / 96kHz began to be used. Although for many years there were recordings that were made in 24Bits / 88.2kHz or 24Bits / 96kHz, at a certain time of mastering, before sending it to the disk duplicator, the audio suffered a mutilation that reduced it to 16Bits / 44.1kHz as It was ordered by the CD format. This process should be carried out with equipment specially designed for this function and in stages so that the audio did not suffer a very noticeable cut and the bad conversion was evidenced. Although the old and dear Dither was applied since then to compensate for this process (something like “grain” in the cinema. Watch a film without “grain” and it will look like HD even though it was filmed in 1980 on tape and goes to notice until the makeup of the actor and the assembly of the special effects, something otherwise disagreeable).

Generally, to prevent the audio from mutilating or applying several conversions that degrade it, it was decided at what resolution to record before pressing the REC button (we will not mention those that come down directly with your DAW from 24Bits / 96kHz to 16Bits / 44.1kHz in one step to export the audio … there is a place reserved especially for them in hell). If the audio was going to end on CD, a 88.2kHz sample rate was generally applied, since at the time of mastering, with the symmetric re-sampling at “half”, it was 44.1kHz.

Sounds better?

The subjective point of this is that we expect recordings to “sound” better at a higher sample rate. The reality is that if we record in high sample rates, with very good sampling, our sound will not “sound better”, but will be more detailed. Obviously, if our sound source is bad, our microphones and preamps too and so on, no matter how much we record at 192K, the result will not be the best. Now, if we use a good sound source, good audio chain and a good converter, everything will be obviously good. But don’t confuse; We are talking about detail here, not if it will sound more “warm,” “fat,” or “full-bodied.” This translates into a more homogeneous capture of the entire frequency spectrum, both audible and non-audible.

sample rate

CPU, disk and plug-ins

Obviously, having a higher sample rate means that our processor must do more calculations, since it has to process more samples (or audio samples). Depending on the amount of plug-ins that we use before a multitrack in high resolution, our use of both DSP and native processors (the computer equipment), will increase significantly, making it very difficult or impossible to work. There are several options to overcome this problem, from buying more processor or DSP, using fewer processes or external equipment (hybrid mixing), to borrowing a machine. The only option that should never go through our minds is to lower the resolution of the audio, process and upload it again. The serious problem that comes with this is a cut in the audio, which is not reversible and what is limited and trimmed, so it stays.

Another aspect to consider is that the storage speed must be in accordance with the audio resolution we use. Suppose we want to record at 24Bits / 96kHz; The transfer rate would be: 2304kbits / second. Now, calculating the amount of tracks, we should use a disc that really reaches us in speed for this transfer rate (topic to be developed in another article).

In these times, storage size is not a problem, but speed is. Having three terabyte disk drives are generally used for 5400 rpm dish disks; the least that should be used if they are not solid state disks, would be 7200 rpm plate disc drives. Obviously, with 5400 rpm discs, we would have a third reduction in the final transfer speed and reading and writing possibilities called “iops” (in out per second or in and out per second), which have a certain number, depending on the disk, capacity and arrangement of the same (RAID) which, depending on how much we demand in the resolution of the audio, amount of channels, processing (plug-ins) and expected latency (if we record with real-time monitoring), we will surely face some problems like “clicks” and / or “pops” in our audio.

Clock

The importance of using a good clock (or clock) and being in sync with all the elements that belong to our audio chain is vital. Recall that a few articles ago we have exposed this topic in detail, but it should be reinforced in this article. Several ADC and DAC converters of economic interfaces do not perform sampling and quantization in the correct or expected manner; External clocks or protocols such as Dante help the synchronization between several devices to be correct and improve the audio quality. Much of the final quality of our work in audio is in this part of the process and it is important that if we take our work and passion seriously, we begin to pay attention to these kinds of details that are generally overlooked.