What Is Audio Sampling Rate: A Comprehensive Explanation

Free Download Mp4Gain

What Is Audio Sampling Rate: A Comprehensive Explanation

Introduction

Audio sampling rate is a fundamental concept in digital audio that refers to the number of samples per second used to represent an analog audio signal in digital form. In this article, we’ll explore the technical details of audio sampling rate, its importance in digital audio, and its impact on audio quality and file size.

Sampling Rate Fundamentals

The concept of audio sampling rate is based on the Nyquist-Shannon sampling theorem, which states that in order to accurately represent an analog signal in digital form, the sampling rate must be at least twice the highest frequency present in the signal. This means that a signal with a highest frequency of 20kHz (the upper limit of human hearing) must be sampled at a rate of at least 40kHz in order to be accurately represented.

Sampling rate is measured in Hertz (Hz), which refers to the number of samples per second. Common sampling rates in digital audio range from 44.1kHz (used in CDs) to 192kHz (used in some high-resolution audio formats).

Sample Rate Conversion

In some cases, it may be necessary to convert audio from one sampling rate to another. Sample rate conversion involves resampling the audio data to a different rate, which can be done using digital signal processing techniques. However, sample rate conversion can introduce artifacts and reduce audio quality, especially when downsampling from a higher rate to a lower rate.

There are various reasons why sample rate conversion may be necessary, such as when mixing audio tracks with different sampling rates, or when preparing audio for distribution on different platforms with varying requirements.

Audio Quality and Sampling Rate

The sampling rate has a significant impact on audio quality, with higher sampling rates generally resulting in better fidelity and more accurate representation of the original signal. However, the benefits of higher sampling rates are limited by the limitations of human hearing and the practical limitations of digital audio technology.

While there is debate about the benefits of “high-resolution audio” formats with sampling rates above 44.1kHz, it is generally accepted that sampling rates above 96kHz provide little additional benefit in terms of audio quality.

Bit Depth and Sampling Rate

The bit depth of an audio sample refers to the number of bits used to represent the amplitude of the signal at each sample point. Higher bit depths allow for more precise representation of the signal, but also result in larger file sizes. The bit depth and sampling rate are related, as increasing the bit depth requires more data to be stored for each sample.

There is a trade-off between sampling rate and bit depth, as higher sampling rates require more data to be stored per second, which can limit the maximum bit depth that can be used without exceeding practical file size limits. However, this trade-off can be mitigated by using efficient audio compression techniques.

Sample Rate in Practice

Common sampling rates in digital audio include 44.1kHz (used in CDs), 48kHz (used in digital video), 88.2kHz, 96kHz, 176.4kHz, and 192kHz. Streaming services such as Spotify and Apple Music typically use lower sampling rates for their audio streams, with 44.1kHz being a common choice.

The Nyquist Theorem, named after the Swedish-American physicist Harry Nyquist, states that the sampling rate should be at least twice the highest frequency component in the signal being sampled. This is why the standard CD quality sampling rate is 44.1 kHz, which is just above the upper limit of human hearing.

However, it is important to note that there are higher sampling rates available, such as 48 kHz, 96 kHz, and even 192 kHz. These higher sampling rates can provide more detail and accuracy in the digital representation of the analog signal. However, they also require more storage space and processing power.

Another important factor to consider is the bit depth, which is the number of bits used to represent each sample. The more bits used, the more accurate and detailed the representation of the analog signal. CD quality uses a bit depth of 16 bits, but higher bit depths such as 24 bits are also available.

It is worth noting that some argue that higher sampling rates and bit depths may not necessarily result in audible improvements in sound quality, especially when considering the limitations of human hearing. Additionally, some argue that the increased storage and processing requirements may not be worth the potential improvements.

In conclusion, the sampling rate is a crucial component in the digital representation of analog audio signals. A higher sampling rate can provide more detail and accuracy in the digital representation, but also requires more storage and processing power. The Nyquist Theorem provides a guideline for choosing the appropriate sampling rate based on the highest frequency component in the signal. Additionally, the bit depth is another factor to consider in the accuracy and detail of the digital representation. While higher sampling rates and bit depths are available, the potential improvements in sound quality must be balanced against the increased storage and processing requirements.

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

What is the fundamental difference between 44100 and 48000 Hz?

In fact, this is just a question of long-standing standards.

44100 vs 48000 hz

44100 is the CD standard.
48000 is the standard for DVD.
The difference in practice is so small that it will be impossible to notice it (I’ll tell you more: many people feel the difference between mp3 and wav, but they can’t tell which is better).
The stereotype has persisted that if you need to work with TV or movies / soundtracks, it is better to do it in 48000, suddenly some old equipment will not understand sampling.
But this is very, very unlikely these days, so there isn’t much of a difference.
It can record at 96000. There is a small chance that some plug-ins / sound effects can handle such recordings better, but it requires more CPU / RAM and much more hard disk space.
Between 16 and 24 bits, it will also be difficult to feel the difference, but at the request of the sound engineer, we wrote in 24 with the same thoughts (for plug-ins).
In general, write to 44100 if you don’t need to work with a specific television crew.

Choosing the Right Sample Rate: 44100 or 48000 hz

In the world of digital audio, the choice between 44,100 Hz and 48,000 Hz sample rates is a critical one. As an audio expert, I’ve spent years diving deep into this topic, examining the real-world scenarios where this choice can make or break a sound. In this article, I’ll guide you through this audio journey, shedding light on the differences and helping you make an informed choice.

44100 Hz – The Analog Heartbeat

When we talk about 44,100 Hz, it’s like stepping into a cozy vinyl record shop, where the warm crackles and pops surround you. This sample rate mirrors the heartbeats of analog audio, capturing the subtleties of your audio source much like a vintage vinyl record player.

Imagine: You’re in a dimly lit jazz club, and a saxophonist takes the stage. You close your eyes as the music begins. 44,100 Hz is akin to capturing every breath, every emotion, and every nuance of the saxophonist’s performance. It’s the sample rate that preserves the soul of analog sound.

48000 Hz – The Digital Precision

Contrastingly, 48,000 Hz feels like entering a state-of-the-art recording studio with a digital mixing console at the heart of it all. It’s the precision tool for audio in the digital age, where every sound wave is charted with utmost accuracy.

Visualize: You’re in a high-tech laboratory, and a scientist is conducting a finely tuned experiment. 48,000 Hz is like the precise instruments that measure every data point with accuracy. It’s the sample rate that excels in capturing the clarity and detail of digital audio.

The Real-World Decision

The choice between 44,100 Hz and 48,000 Hz ultimately depends on the nature of your audio project.

Subtitle: For Vintage Vibes

If you’re aiming for a warm, nostalgic sound reminiscent of classic records, 44,100 Hz is your choice. It’s like using a vintage camera to capture that old-world charm. This sample rate will maintain the character and imperfections of your audio source.

Subtitle: For Contemporary Clarity

When you require crystal-clear audio for modern projects, such as podcasts, video games, or high-quality music production, 48,000 Hz is your ally. Think of it as upgrading to a high-definition TV for the audio world. This sample rate ensures every detail is captured and reproduced faithfully.

Last words about right sample rate for your digital audio

As an audio expert, my journey has led me to understand that the choice between 44,100 Hz and 48,000 Hz is about preserving the essence of your sound in the most appropriate way. Each sample rate has its place in the vast world of audio, just as a painter chooses different brushes for different strokes on their canvas.

So, whether you’re embracing the warmth of the past or striving for the precision of the future, remember that the right choice of sample rate can be the difference between an audio masterpiece and a missed opportunity. Choose wisely, and let your sound shine in all its glory.

The fundamental difference between them in the coverage of the frequency range on the track (from 20Hz), the 44100 sample rate allows you to work in the range up to 22kHz, 48000 to ~ 25kHz, 96000 to ~ 35kHz, etc. 48 parameters o 96kHz are used in large studios where the reproduction of these frequencies and sound engineers strive for the slightest increase in sound quality, before and after conversion to the 44100 standard, the sound of the track objectively looks better, even though the human ear does not hear these frequencies, the psychoacoustic effect remains (the closest example: if you shoot video and plan to play back in fHD, you will prefer to shoot 4k with rear cropping for the sake of image quality, and no one will say there is no point in shooting 4k, the same is here).

It’s even more interesting in movies … because 44100Hz is the playback frequency at 24fps and 48000Hz is 25fps. If you record a video at 25 fps and the sound is separately on the recorder at 44100Hz, then the length of the tracks will not match and you will have to change the timbre of the original with a small time interval.

Samplerate, what is sample rate

The sampling frequency is the time that results from the time between two samples. It is given in samples per second (S / s).

The level of the sampling frequency is a criterion for the reproducibility of the frequency of the sampled signal. The closer the sampling times are, the better the signal can be reproduced.

Sampling rate

Relationship between frequency and sampling frequency

For example, if an analog signal is sampled once per millisecond (ms), the sample rate is 1 kHz and the sample rate is 1000 samples per second. If the sampled signal has a frequency of 1 kHz, the signal is sampled once per period. It cannot be played. If, on the other hand, the frequency of the signal is 100 Hz, the signal is sampled ten times with the same sample rate. Therefore, the signal is easily reproducible. Therefore, the sampling frequency must be in a certain relation to the frequency of the signal. This relationship is through the given sampling theorem. Accordingly, the reproduction of the signal requires a sampling frequency that is at least twice the frequency of the signal. This applies to sine-type signals for their 1st harmonic, but not to square wave or pulse signals.

Audio sampling frequencies

In the case of voice transmission over ISDN with a maximum frequency range of 4 kHz, the sampling frequency is 8 kHz, which corresponds to a sampling interval of 125 µs. For audio with a maximum frequency range of 20 kHz, the sampling frequency is 44.1 kHz (22.67 µs) and 48 kHz (20.83 µs). For high-quality multi-channel audio, the sample rate can be up to 192 kHz. Much higher values are found for video and HDTV. For digital video, this results in a 6.5 MHz bandwidth for the luminance signal, a sampling frequency of more than 13 MHz and a sampling interval of 74 ns. The sample rate for HDTV is even higher with 74 MHz and a sample rate of 13.5 ns.
In the case of pulse-shaped signals, the sampling frequency must be many times greater than its fundamental oscillation, since otherwise important pulse parameters cannot be determined. If the sample rate is many times higher than the theoretically required sample rate, we are talking about oversampling.

Everything you need to know about samples and bits

I started delving into depth and sample rate in my last mixing / mastering tutorial, and while we’re not necessarily digital audio engineers, some background on what bit depth and sample rate is good information for anyone. participate in digital music. It’s something you always work with, whether you know it or not, and it’s great background information for understanding whether understanding the building blocks of digital audio is critical for personal gain or just to be able to sound smart just in case. where the conversation never comes up.

Samplerate

So the first thing to understand is that bit depth and sample rate only exist in digital audio. In digital audio, bit depth describes amplitude (vertical axis) and sample rate describes frequency (horizontal axis). So when we increase the number of bits we are using we are increasing the amplitude resolution of our sound and by increasing the number of samples per second we are using we are increasing the frequency resolution of our sound.

In an analog system (and in nature), the audio is continuous and fluid. In a digital system, the smooth analog waveform is only approximated by the samples and must be set to a limited number of amplitude values. When you sample a sound, the audio is divided into small sections (samples) and these samples are fixed at one of the available amplitude levels. The process of fixing the signal to an amplitude level is called quantization, and the process of creating the sample slices is, of course, called sampling.

In the diagram below you can see a visualization of this where there is an organic sine wave playing for one second. It starts in 0 seconds and ends in 1 second. The blue bars represent the digital approximation of the sine wave where each bar is a sample and has been set to one of the available amplitude levels. (This diagram is obviously much grosser than in real life).

samplerate

This second of audio would have 44.1K, 48K, etc. samples. From left to right depending on the selected sample rate when recording and it will cover -144dB at 0dB at 24bit (or -96dB at 0dB at 16bit bit). The dynamic range resolution (the number of possible amplitude levels for the sample to rest) would be 65,536 at 16 bits, and get this, 16,777,216 when logged at 24 bits.

Therefore, increasing the bit depth greatly increases our amplitude resolution and dynamic range. What is not so obvious is where the increase in dynamic range occurs. The added dB is added to the softest part of the sound since the amplitude can never exceed 0 dB. What this does is allow you to hear more delicate sounds (like a reverb tail running at -130 dB) to be heard, which might otherwise be cut off to a 16-bit, -96 dB sample.

Some details of the sample rate

For many years it was thought that the sample rate or sampling frequency did not decisively influence the final quality of the digital audio; There are currently several engineers who record in 44.1K or 48K without really knowing why they do it. With the advent of new and better computers, interfaces, ports and protocols, 88.2K, 96K and up to 192K entered the discussion table on the best sample rate to use. It has always been the subject of discussion between engineers and audiophiles; some argued that they did hear the difference between different sample rates and others that did not, and the topic has been subjected to millions of A / B tests with very high quality equipment, causing all kinds of opinions found and uncompromising, fights and friendships of years broken

While this is a basic issue of digital audio, it is always surrounded by a halo of mystery, mysticism and magic (like every sound theme), which is well worth clarifying.

What is the sample rate?

This topic, although it occurs in the first or second class of digital audio, is not always understood correctly. In scholastic thinking, sample rate is defined as the amount of audio samples transported and taken per second. Since this is a unit of measurement over a second and with events that occur cyclically, the Hertz (1 / Frequency) is used as a unit. Obviously we cannot talk about this subject without referring to the Nyquist sampling theorem, which was tested by Shannon almost twenty years after its publication and in which it is stated that for a signal of limited bandwidth (B) (for example, a vibraphone reaches 14.917Hz), the sampling frequency must be twice its bandwidth (2 * B). Then, taking the previous example, we can say that: 2 * B → 2 * 14.917Hz → The sampling frequency for 14.917Hz should be 29.834Hz. This would be equivalent to 29,834 samples per second (1/29, 834) to be able to regenerate the signal of a vibraphone without error. Hence, it is taken that the highest frequency that human beings listen to is 20kHz and if we apply Nyquist it should be 40kHz, but it takes 44.1kHz to meet the demanding ears and for a matter of multiples.

44.1K or 48K to 88.2K or 96K, the correct division

At the dawn of the digital audio era, Nyquist was used to use the sampling resolution of 44.1K, used at that time audio CD format that played at 16bit / 44.1kHz. With the advent of DVD and Blu Ray as video and audio formats, resolutions such as 24Bits / 48K or 24Bits / 96kHz began to be used. Although for many years there were recordings that were made in 24Bits / 88.2kHz or 24Bits / 96kHz, at a certain time of mastering, before sending it to the disk duplicator, the audio suffered a mutilation that reduced it to 16Bits / 44.1kHz as It was ordered by the CD format. This process should be carried out with equipment specially designed for this function and in stages so that the audio did not suffer a very noticeable cut and the bad conversion was evidenced. Although the old and dear Dither was applied since then to compensate for this process (something like “grain” in the cinema. Watch a film without “grain” and it will look like HD even though it was filmed in 1980 on tape and goes to notice until the makeup of the actor and the assembly of the special effects, something otherwise disagreeable).

Generally, to prevent the audio from mutilating or applying several conversions that degrade it, it was decided at what resolution to record before pressing the REC button (we will not mention those that come down directly with your DAW from 24Bits / 96kHz to 16Bits / 44.1kHz in one step to export the audio … there is a place reserved especially for them in hell). If the audio was going to end on CD, a 88.2kHz sample rate was generally applied, since at the time of mastering, with the symmetric re-sampling at “half”, it was 44.1kHz.

Sounds better?

The subjective point of this is that we expect recordings to “sound” better at a higher sample rate. The reality is that if we record in high sample rates, with very good sampling, our sound will not “sound better”, but will be more detailed. Obviously, if our sound source is bad, our microphones and preamps too and so on, no matter how much we record at 192K, the result will not be the best. Now, if we use a good sound source, good audio chain and a good converter, everything will be obviously good. But don’t confuse; We are talking about detail here, not if it will sound more “warm,” “fat,” or “full-bodied.” This translates into a more homogeneous capture of the entire frequency spectrum, both audible and non-audible.

CPU, disk and plug-ins

Obviously, having a higher sample rate means that our processor must do more calculations, since it has to process more samples (or audio samples). Depending on the amount of plug-ins that we use before a multitrack in high resolution, our use of both DSP and native processors (the computer equipment), will increase significantly, making it very difficult or impossible to work. There are several options to overcome this problem, from buying more processor or DSP, using fewer processes or external equipment (hybrid mixing), to borrowing a machine. The only option that should never go through our minds is to lower the resolution of the audio, process and upload it again. The serious problem that comes with this is a cut in the audio, which is not reversible and what is limited and trimmed, so it stays.

Another aspect to consider is that the storage speed must be in accordance with the audio resolution we use. Suppose we want to record at 24Bits / 96kHz; The transfer rate would be: 2304kbits / second. Now, calculating the amount of tracks, we should use a disc that really reaches us in speed for this transfer rate (topic to be developed in another article).

In these times, storage size is not a problem, but speed is. Having three terabyte disk drives are generally used for 5400 rpm dish disks; the least that should be used if they are not solid state disks, would be 7200 rpm plate disc drives. Obviously, with 5400 rpm discs, we would have a third reduction in the final transfer speed and reading and writing possibilities called “iops” (in out per second or in and out per second), which have a certain number, depending on the disk, capacity and arrangement of the same (RAID) which, depending on how much we demand in the resolution of the audio, amount of channels, processing (plug-ins) and expected latency (if we record with real-time monitoring), we will surely face some problems like “clicks” and / or “pops” in our audio.

Clock

The importance of using a good clock (or clock) and being in sync with all the elements that belong to our audio chain is vital. Recall that a few articles ago we have exposed this topic in detail, but it should be reinforced in this article. Several ADC and DAC converters of economic interfaces do not perform sampling and quantization in the correct or expected manner; External clocks or protocols such as Dante help the synchronization between several devices to be correct and improve the audio quality. Much of the final quality of our work in audio is in this part of the process and it is important that if we take our work and passion seriously, we begin to pay attention to these kinds of details that are generally overlooked.