
For many years it was thought that the sample rate or sampling frequency did not decisively influence the final quality of the digital audio; There are currently several engineers who record in 44.1K or 48K without really knowing why they do it. With the advent of new and better computers, interfaces, ports and protocols, 88.2K, 96K and up to 192K entered the discussion table on the best sample rate to use. It has always been the subject of discussion between engineers and audiophiles; some argued that they did hear the difference between different sample rates and others that did not, and the topic has been subjected to millions of A / B tests with very high quality equipment, causing all kinds of opinions found and uncompromising, fights and friendships of years broken.
While this is a basic issue of digital audio, it is always surrounded by a halo of mystery, mysticism and magic (like every sound theme), which is well worth clarifying.
What is the sample rate?
This topic, although it occurs in the first or second class of digital audio, is not always understood correctly. In scholastic thinking, sample rate is defined as the amount of audio samples transported and taken per second. Since this is a unit of measurement over a second and with events that occur cyclically, the Hertz (1 / Frequency) is used as a unit. Obviously we cannot talk about this subject without referring to the Nyquist sampling theorem, which was tested by Shannon almost twenty years after its publication and in which it is stated that for a limited bandwidth (B) signal (for example, a vibraphone reaches 14.917Hz), the sampling frequency must be twice its bandwidth (2 * B). Then, taking the previous example, we can say that: 2 * B → 2 * 14.917Hz → The sampling frequency for 14.917Hz should be 29.834Hz. This would be equivalent to 29,834 samples per second (1/29, 834) to be able to regenerate the signal of a vibraphone without error. Hence, it is taken that the highest frequency that the human being listens to is 20kHz and if we apply Nyquist it should be 40kHz, but it takes 44.1kHz to meet the demanding ears and for a matter of multiples.
44.1K or 48K to 88.2K or 96K, the correct division
At the dawn of the digital audio era, Nyquist was used to use the sampling resolution of 44.1K, used at that time audio CD format that played at 16bit / 44.1kHz. With the advent of DVD and Blu Ray as video and audio formats, resolutions such as 24Bits / 48K or 24Bits / 96kHz began to be used. Although for many years there were recordings that were made in 24Bits / 88.2kHz or 24Bits / 96kHz, at a certain time of mastering, before sending it to the disk duplicator, the audio suffered a mutilation that reduced it to 16Bits / 44.1kHz as It was ordered by the CD format. This process should be carried out with equipment specially designed for this function and in stages so that the audio did not suffer a very noticeable cut and the bad conversion was evidenced. Although the old and dear Dither was applied since then to compensate for this process (something like “grain” in the cinema. Watch a film without “grain” and it will look like HD even though it was filmed in 1980 on tape and goes to notice until the makeup of the actor and the assembly of the special effects, something otherwise disagreeable).
Generally, to prevent the audio from mutilating or applying several conversions that degrade it, it was decided at what resolution to record before pressing the REC button (we will not mention those that come down directly with your DAW from 24Bits / 96kHz to 16Bits / 44.1kHz in one step to export the audio … there is a place reserved especially for them in hell). If the audio was going to end on CD, a sample rate of 88.2kHz was generally applied, since at the time of mastering, with the symmetrical re-sampling at “half”, it was at 44.1kHz.
Sounds better?
The subjective point of this is that we expect recordings to “sound” better at a higher sample rate. The reality is that if we record in high sample rates, with very good sampling, our sound will not “sound better”, but will be more detailed. Obviously, if our sound source is bad, our microphones and preamps too and so on, no matter how much we record at 192K, the result will not be the best. Now, if we use a good sound source, good audio chain and a good converter, everything will be obviously good. But don’t confuse; We are talking about detail here, not if it will sound more “warm,” “fat,” or “full-bodied.” This translates into a more homogeneous capture of the entire frequency spectrum, both audible and non-audible.





