What do the audio sample rates and sample sizes mean?


Free Download Mp4Gain
picture

What do the audio sample rates and sample sizes mean?

Sample Rate

You can see that MP3 audio files have audio in the number of bits (in seconds) that the player uses, that is, the bit rate that indicates the quality of the audio.

sample rate

But I am confused with the terms sample rate and sample size. Are they not dependent on bit rate or sound quality? Or can it be explained in understandable terms?

Audio
Bit rate

This is a great article on the three terms you are asking. In summary, here are three definitions.

Bit rate: the amount of data per second. This can be different in the file (variable bit rate) and can have static values.
Sample Rate – The rate at which audio is measured per second. It is usually measured in kilohertz (kHz). The usual number you can see is 44.1 kHz. This is directly related to the bit depth or the number of bits measured in each cycle.
So at this point you need to do some math and you can see that the bitrate is in bits per second (usually measured in megabits per second). Therefore, bit rate = sample rate x bit depth. As far as I know, your sample size is just one of these 1-second chunks of data.

If you run pure math, you will find that these files are very large, but there are some compression algorithms that have been adopted to keep the files low without a significant loss of quality.

Receive your personal cloud storage with 2 Gb of free space
The sample size or bit depth is included, which is a measure of the number of bits in the sample, which is a direct quality measure. However, this only applies to PCM sampling. For irreversible formats like mp3, the sample size doesn’t really define the quality.

See Audio Bit Depth for more information.

Sample rate = No sample rate. Of audio samples transported per second

Sample size = The sample size determines the maximum dynamic range of a digitized sound. Dynamic range is the ratio of the maximum amplitude to the minimum non-zero amplitude of a signal, generally expressed in decibels (dB).

The sampling frequency affects the quality of the recorded sound. Therefore, a higher sample rate will improve the quality as the number of bits increases, but will require more data and result in larger files. The bit rate used to store the samples used to store the sampled data also affects the quality of the recording. Bit rate is the amount of space that can be used to store sampled data per second. The higher the bitrate, the better the sound, but more space is required to store the file.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

The difference between 44,100 Hz (music industry) and 48,000 Hz (video industry)

The difference between 44,100 Hz (music industry) and 48,000 Hz (video industry)

44100 vs 48000 sample rate

In video production, record the frame rate for shooting and the sample rate for recording. Remember this is one of the basics for shooting and recording.

SAMPLE RATE

First, about the difference in sampling frequency. Generally speaking

44,100Hz (44.1kHz) is the standard in the music industry

48,000Hz (48kHz) is the sound standard in the video industry

The difference between the two sample rates is just that. I talked about the sample rate as
the frame rate in video in another article, “Sound Principles Required for Video Production,” Sample Rate and Bit Depth. ”
In other words, the higher the sample rate in Hz, the softer the sound will be.

There are several theories about the historical background of 44,100Hz.
I would like to introduce you to one of the most logical.

First, when sampling sound, you need a sample rate that is at least twice the highest frequency you are recording. This is the sample rate necessary to obtain a minimum of the waveform. This is because it is not possible to record a sound that has the character of a wave if there is only one place to take a sample. Most people say that the audible range is 50 Hz to 16,000 Hz. Double is 32 kHz, but it seems that the harmonic components that make up the tone need to be recorded in order to record the voice correctly. Only when this is taken into account does it appear that up to 44,100Hz is required. Click here for more details.

Sound Processing “I want to hear my voice clearly” (link outside of Vook’s site)

What happens when the sample rate is low?
When digitizing analog information, if the sampling rate is not high, the high-frequency information will be hidden in the low-frequency information.
Then the high-frequency sound will be recorded as low-frequency sound.

In any case, by definition, 48,000 Hz has better sound quality than 44,100 Hz. The video industry has introduced 48,000 Hz.

One problem that sometimes occurs is that “I was recording 48 kHz video and the separately recorded microphone was set to 44.1 kHz.” At first I thought that different sample rates would be a big deal, but it doesn’t really seem to be the case.

Sound recorded at a small sample rate just has a small number of samples per second, but since there is almost no difference between 44.1 kHz and 48 kHz, I think you can barely tell the difference when you listen to the sound normally. At 96 kHz, the sound quality is even higher, but the number of samples is so large that ordinary people cannot hear it at all.

In some cases, the sample rate is really important.

1) By writing the audio actually recorded with a different sample number as a video file. This is because the sample rate must be converted to a video sample rate that is different from the conventional 44.1 kHz and 96 kHz sample rates, that is, 48 ​​kHz. Software that specializes in video editing seems to have sound distortion at this point.

2) When recording a 48 kHz music video to match the music played at 44.1 kHz on the site. In this case, it can be very difficult to match performance lips to post-production due to the different sample rates of the sound being played. It’s called sink drift.

3) Another point It seems that this is a problem that occurs at the time of recording, but there is a problem that the sound changes gradually when the sound recorded separately using a cheap recorder is synchronized with the sound recorded in the reference of the video. it seems that there are moments. In this case, it seems that you need to manually fast-forward the video a little and match it to the audio file, or extract a few frames at the important points in the audio and sync it up. It seems that this has nothing to do with the sample rate, so I will describe it so as not to cause misunderstandings.

What is the sample rate (sample rate)?

What is the sample rate (sample rate)?

Sample Rate

Differences in sound quality and how to check it! It also explains the settings to consider!

sample rate

Adjusting the sample rate Sound quality

The sample rate is mentioned in the DAW and audio interface settings. How should I set it?
I googled “sample rate”, but I don’t understand all the difficult words …
The higher the sample rate, the better the sound quality?

show-frequency-what-is
For those people, here we explain in a way that is easy to understand even for beginners, from the meaning of the sample rate that always appears when starting DTM, the relationship with the sound quality, the confirmation method to the setting method. I would like it in a way.

Please refer to that.

1. What is the sampling frequency?
Adjusting the sample rate Sound quality
Sample rate is a frequency that indicates how accurately sound is captured. .

Sample rate, sample rate Also sometimes called sample rate in English.

The higher the number, the better it will be caught. .

Of course, if you want to import sound to your computer, you have to convert it to data. Capturing sound data is called sampling.

* Currently, sampling mainly refers to capturing existing sounds. (Using existing music, recording material, etc.)

Rate means rate or commission in Japanese.

In other words, sampling = sound data capture. Fee = fee, commission

Literally translated, “sample rate = sound data capture rate. It will be a commission.

In other words, the sample rate indicates how well the sound is captured.

For example, if the sampling frequency is 48 kHz, the data will be taken by dividing it into 48000 times per second.

The higher this value, the finer the sound will be sampled.

It may be easier to understand if you think of the version in which each pixel art block is a sound.

2. Does the sound quality change depending on the sampling frequency?
Sample rate loading sound quality
A common question is whether the sound quality changes based on frequency.

Naturally it changes.

However, beginners and those who do not have expensive equipment will not notice much even if it changes.

So I will show you how to check it!

What is bit?

What is bit?

Sample rate

bit is an abbreviation for binary digits.
16 bits and 24 bits in catalogs, etc. represent the number of binary digits * handled by computers, etc.

SAMPLE RATE

In digital audio, analog sound is converted to a digital signal,
but the number of bits determines how accurately the amplitude value is converted when it is converted to a binary number (quantization) after sampling.
In the case of 1 bit, only 1 or 0 can be judged, but in 8 bit (10001001), 2 raised to the eighth power, that is, 256 steps can be judged in detail.

Currently, the 16-bit mainstream has 65,536 steps and the 24-bit mainstream has 16,777,216 steps.
Now,
there is a part that does not match the actual waveform (analog waveform) and the quantized and sampled digital waveform. This is called quantization noise.
This noise is especially noticeable when the number of bits is small.

So simply increasing the F’s and the number of bits will improve the sound (closer to the original sound)
, but it will consume a lot of memory. Also, in the case of digital recording, it is
very important to manage the input level to bring out the high quality of the sound.
If the recording level is too low, you won’t be able to bring out its goodness.

It is important to configure it so that it is not clipping at the maximum level of the music to be recorded,
but try to increase the overall average level as much as possible to have a wider dynamic range
(recordable high and low level difference) than analog. Make the most of it and record with a good signal-to-noise (SN) ratio.

* The decimal numbers that we usually use are represented by a combination of 10 types of numbers from 0 to 9, but in
binary numbers, are represented by a combination of 0 and 1.

For example, in a 4-digit binary number,

Decimal number 0 1 2 3 Four ・ ・ ・ ・ 14 15
Binary number 0 1 Ten 11 100 ・ ・ ・ 1110 1111
You can express a number from 0 to 15 as.

(5) What is timing?

It is a state in which each device moves in harmony with each other at the same time in the system.

Digital devices use a reference signal called a word clock, and
Each device can be synchronized with a high precision that cannot be compared with analog devices.

For the configuration of each device, the device that supplies the reference word clock is set as the word clock master, and
all other devices are configured as
word clock slaves so that they can operate synchronously in response to the instruction of a unit set by this master increases.

The role of the word clock is similar to that of the conveyor belt used on factory assembly lines.

The digitized audio data is divided into small times, it is
they are transmitted to each device, they are processed and finally the DA converter returns them to an analog audio signal.
What happens if the speed of the conveyor belt changes along the way?
The data will be lost or the time will not match.

If there are devices in the system that are not synced
, problems such as loss of sound and noise mixing will occur due to the same cause.

With regard to synchronization, if each device is precisely configured and word clock transmission between each device is guaranteed,
can achieve high-performance and comfortable operation unique to digital technology.

Difference between digital and analog

Difference between digital and analog

Audio Sample Rate

The sound is analog. And sound is the vibration of the air. How is this sound vibration transmitted?
For example, when a stone is thrown onto a surface of calm water, the ripples spread around it, but if
Cut in the direction of the waves and look at the cut end, the waveform is as shown

sample rate

Air waves spread from the point where sound is emitted even in air. Although invisible to the eye, it has a
similar waveform. This is the analog waveform of sound.

Therefore, although it is digital, when such a sound waveform is recorded or communicated by phone or wireless, as
shown in Fig. 2, the change in the analog waveform is electrically replaced with a series of numerical values ​​according to a certain promise. ..

When recording or communicating, if you handle it as analog, it is easy for noise to enter and the sound quality to deteriorate, but when trying
the waveform of the sound as digital = numerical data, you can eliminate that worry and
maintain a certain quality. You can do various processing while maintaining it.

(2) What is convenient when it is digital

Digital audio signals are convenient because they can be recorded and edited using a personal computer, for example.

In addition, 74 minutes of music can be recorded on a CD with a diameter of only 12 cm, and through digital compression processing
, music of the same length can be recorded on an MD with a smaller diameter.

Since digital signals can be compressed in this way, it is also convenient for storing large amounts of information.
Not only sound, but also video signals with a higher amount of information can be recorded and communicated at high speed through the use of compression technology.

Especially in communication, a two-way digital multiplex communication can be realized communicating multiple pieces of information with a single wire.
In addition to electrical signals, laser light can also be used for optical communication, making communication possible at extremely high speeds.

(3) What is the sampling frequency?

Digital signals are processed at predetermined fixed time intervals.
The sample rate (sample rate) indicates how many times a second is processed and is expressed as Fs or fs.

The sampling frequency unit is Hz (Hertz), and the
44.1 kHz (kilohertz) sample rate means 44,100 pieces of data are processed per second.
(K represents 1000 times)

AD conversion converts a continuous analog signal into a digital signal,
measures the size of the signal at each moment determined by the sampling frequency (sampling) and converts
the result in a binary number (quantization).

On the other hand, DA conversion converts a digital signal into an analog signal,
It reads the digital signal in the sample rate time interval and connects it smoothly.

Since digital signals can be reproduced up to half the sampling frequency, how much
The higher the sample rate, the higher the playable frequency and the better the sound quality.
In familiar areas, 44.1 kHz is used for CD, and 48 kHz is used for DAT and B modes of satellite transmission.

In addition, recent professional equipment uses high sampling frequencies (high sampling), such as 88.2 kHz and 96 kHz,
and are designed to faithfully reproduce even higher frequency sounds to improve sound quality.

Why upsampling? Part 2

Why upsampling? Part 2

Upsampling

For every doubling of the sampling frequency, the spectral density of the noise is reduced by half and the signal-to-noise ratio increases by 3 dB. Since the resolution limit for the pressure level is approximately 1 dB, these decibels are unlikely to have a noticeable effect on sound perception in the high-frequency region. Based on these numbers, it is absolutely impossible to draw tentative conclusions about the change in sound quality.

In order to relate the spectrum of quantization errors, sampling frequency and sound quality, in this article it is proposed to use a tonal signal as a music model, as is usual to evaluate the quality of sound paths. This approach relies heavily on materials published in the “Sound Engineer” magazine.

The results can be summarized as follows. Unlike analog audio, digital audio is the product of amplitude modulation. This is manifested in a rigid functional dependence of the quantization error spectrum of the frequency multiplicity factor of the audio signal F and the sampling frequency fs, represented as the ratio of prime numbers y and x (k = fs / F = y / x). The frequency spectrum of quantization errors is always discrete and is determined solely by the multiplicity factor; the components of this spectrum are also determined solely by the amplitude of the audio signal, expressed in quanta. This means that the mechanism for shaping the quantization error spectrum does not depend on the number of bits used. With an increase in the quantization bit depth, the spectrum does not change in shape and composition, but only changes in level by 6 dB with each additional digit. (There are situations where a change in bit depth leads to a change in spectrum, – Ed.) The auditory perception of the quantization error spectrum is largely determined by the frequency response of hearing, which, in turn, it depends largely on the sound pressure level.

The frequencies of digital sound are divided into multiples when x = 1 and submultiples when x> 1. At multiple frequencies, the spectrum of quantization errors is harmonic and the main pitch is the frequency of the audio signal. If y is an even number, then the spectrum contains only odd harmonics. If y is an odd number, then the odd and even harmonics of the audio signal are present in the spectrum.

At multiple sub-frequencies in the quantization error spectrum, the components appear below the frequency of the audio signal, down to zero, and the lower limit of the spectrum Fn (x) is determined by the formula x – Fn (x) = F / X. In this case, the frequency Fn (x) becomes the fundamental pitch of the sound for quantization errors, and all other components, including the frequency of the sound signal, are converted to its harmonics. If the number is even at the submultiple frequency yskr, then the spectrum contains only odd harmonics of the frequency Fn (x). If yskr is an odd number, then the spectrum contains odd and even harmonics of this frequency. Low-frequency components in the quantization error spectrum lead to the appearance of harmonics in the form of pitch or consonance. They are especially noticeable at high frequencies in the audio signal when there is no frequency masking effect.

To clarify, we will give an example of a quantization error spectrum at an audio signal level of minus 30 dB with 8-bit quantization. Let fs = 48 kHz and F = 12800 Hz, then the multiplicity factor k skr = y / x = 48000/12800 = 15/4 and therefore the lower cutoff frequency Fn (x) = F / x = 3200 Hz, and the spectrum consists of odd and even harmonics of this frequency.

1.jpg

Figure 1. Quantization error spectra at submultiple frequency deviation

When the frequency of an audio signal deviates from a submultiple value by a small amount, sidebands appear around all harmonics of the spectrum, including zero (Fig. 1a), the number of spectrum components increases dramatically, and the limit bottom of the spectrum decreases, since the current value of x increases a lot.

Suppose, for example, that the frequency increment of the audio signal is 1 Hz, then the value of the multiplicity factor k = y / x = 48000/12801 = 16000/4267 and the lower limit frequency of the deviation spectrum becomes Fno = 12801/4267 = 3 Hz, and the interval between the components of the spectrum decreases to 6 Hz (Fig. 1b).

Why upsampling?

Why upsampling?

Upsampling

When it comes to improving digital sound quality, experts in this field agree on only one thing: with an increase in sample rate, sound quality improves dramatically.

Why upsampling?
When it comes to improving digital sound quality, experts in this field agree on only one thing: As the sample rate increases, the sound quality improves dramatically. Also, under the word “improvement”, everyone already understands something for himself. All the variety of opinions on this topic boils down to the following: the sound becomes clearer, softer, more natural, the low frequencies are perceived more clearly.

However, these nuances are only noticed by listeners trained with a good ear for music on specially selected sound material and using technically advanced equipment.

There are many hypotheses that explain why sound quality is improved by higher sampling. Many technicians are inclined to believe that this relationship is due to distortions that arise from filtering and interpolation during audio signal reconstruction.

On a modern technical level, high-quality interpolators may be practically impossible to implement, therefore, instead of improving them, manufacturers simply increase the sample rate. Maybe it’s not about them at all.

Another version, which many music lovers adhere to, is that at a low sampling frequency, for example 44100 Hz, digital sound is completely devoid of nuances of high sounds, the main frequencies of which are above 7 kHz. , and at lower frequencies there are very few harmonics for a high quality perception of music.

In fact, many musical instruments generate vibrations of up to 100 kHz. It is true that the fraction of energy that falls in the frequency band above 20 kHz is 0.01 to 2% for sounds of a harmonic nature and 0.02 to 68% for sounds created by a cymbal, triangle or striking the metal edge of a drum (hoop shot – editor’s note).

Even the frequency range of speech in hissing-hissing sounds extends up to 40 kHz. Supporters of this version are not ashamed that a person cannot perceive sounds with a frequency higher than 20 kHz. Ultrasound is assumed to be perceived bypassing the auditory system, for example through bone conduction.

Discussions that harmonics above 20 kHz make a significant contribution to sounding have culminated in the creation and widespread introduction of analog-to-digital converters using 96 kHz and 192 kHz sample rates; The sample rate is expected to increase to 384 kHz.

Based on modern knowledge of human perception of sound, it must be assumed that the relationship between digital sound quality and sampling frequency is due to the transformation of the quantization error spectrum in the audio frequency range.

In technical literature, this topic is considered only for a particular mathematical model, when music is represented by a signal with a uniform distribution in level and frequency. In this case, the quantization errors are converted to noise with a uniform spectral density from 0 Hz to the Nyquist frequency.

Relationship between sound quality and sample rate

Relationship between sound quality and sample rate

SAMPLE RATE

The conversion of an analog signal to digital consists of two steps: sampling in time and quantization in amplitude.

sample rate

Time sampling means that the signal is represented by a series of samples (samples) taken at regular intervals. For example, when we say that the sample rate is 44.1 kHz, this means that the signal is measured 44 100 times in one second.

The main problem in the first stage of converting an analog to digital signal (digitization) is the choice of the sampling frequency of the analog signal. The higher the frequency, the closer the digital signal is to the analog. However, in proportion to the increase in frequency, they increase:

The intensity of the digital data flow and the bandwidth of the interfaces are not unlimited, especially if several channels are recorded / played simultaneously;
The computational load on digital processors and their computing capabilities are also limited;
The amount of memory required to store the digital signal is increased.
Obviously, a compromise is needed. The choice of the sampling frequency affects the frequency range of the received digital sound and the maximum frequency of the analog signal, correctly represented in the digital one. It is believed that a person hears frequencies in the range of 20 to 20,000 Hz. According to the well-known Kotelnikov theorem, in order for an analog signal (continuous in time) to be accurately reconstructed from its samples, the sampling frequency must be at least twice the maximum audio frequency.

An audio frequency equal to half the sampling frequency is called the Nyquist frequency and is the maximum frequency that a given digital system can store and reproduce correctly. Therefore, if the actual analog signal that we are going to convert to digital format contains frequency components from 0 to 20 kHz, then the sampling frequency of that signal must be at least 40 kHz. The most common sample rates today are 44.1 kHz (CD) and 48 kHz (DAT).

Sample rate, where it comes from

Sample rate, where it comes from

Sample rate

Where does the sample rate for CD-audio 44100 hertz come from?

Sample rate

The standard sample rate for CD-audio is 44100 Hertz. Where and why were these 44100s originally chosen for CD audio production?

Starting from the condition (see Nyquist-Shannon-Kotelnikov) of reproduction of the upper limit of the spectrum at 20 kHz, the sampling frequency should have been chosen above 40 kHz. But at the time of the creation of these standards and the development of CD-DA technology (the second half of the 70s of the last century), there was no generally accepted medium in which to record, edit and store digital sound. And for this, it was decided to use standard VCRs, which in those days worked in U-matic format. The digital signal was encoded by a special encoder into a black and white video pseudo-signal and recorded on a video cassette. The structure of the digital signal had to be linked to the frequency and structure of the fields of the television signal used for recording.

This decision was complicated by the fact that different video recording standards are used in Europe and the US: 525 lines at 60 Hz and 625 lines at 50 Hz, while not all lines can be used to record information. The selected frequency should fit the structure of both video signals. 44100 Hz meet this requirement.

In a 60 Hz NTSC video signal, 35 lines are not used for recording, leaving 490 active lines per frame, or 245 in the field for digital audio recording. When writing three samples to a string, the sample rate will be:

60 × 245 × 3 = 44100.

In a 50Hz PAL signal, 37 lines are not used, leaving 588 active lines per frame, or 249 per field, so the frequency will be:

50 × 249 × 3 = 44100.

Although digital sound at that time had nothing to do with the video signal, video equipment was used in the production of the CD, which determined the choice of sampling frequency.

Basics of digital sound theory Part 4

Basics of digital sound theory Part 4

Sample Rate

The MP3 algorithm allows you to compress the sound 20 to 30 times while maintaining good quality.

Sample Rate

The full quality of the CD is believed to be preserved at a bit rate of approximately 160 Kbps (the concepts of “sample rate” and “sample bit depth” do not apply to MP3 files). However, in most cases, much more compressed audio is quite acceptable. Therefore, in Flash animations, MP3 compression is usually used, which gives a bit rate of the order of 16-32 Kbps. The Flash player supports a range of bit rates ranging from 16 to 160 Kbps. You must select the most suitable based on film size and sound quality requirements. It is often worth leaving the MP3 file at the same quality as imported (therefore, the Use imported mp3 quality setting is on by default). If the quality changes, then the change should be in the direction of decreasing quality, but not increasing.

If the sound is processed in an external editor, you can take into account the fact that the Flash player supports not only the MP3 algorithm, which is part of the MPEG1 Layer 3 standard, but also newer algorithms (MPEG2 and MPEG2.5), that provide better sound quality when bit depth is low. In addition, the player supports MP3 encoding with both constant and variable bit depth (in the latter case, the best compression ratio is achieved).

The MP3 format is optimal for rash projects. Therefore, in practice, it is practically only used. Furthermore, MP3 files can be loaded dynamically, and they also have very useful ID3 tags with information about this sound.

• Nellymoser. A relatively new compression algorithm developed by Nellymoser Inc. Designed to compress human speech. His main idea is that a human voice can include vibrations with frequencies in a fairly narrow range. The upper and lower components can be discarded. Very low amplitude harmonics are also eliminated. The result is compression comparable to MP3 compression, but the sound quality is higher. More details about the Nellymoser algorithm can be found on the developer’s website http://www.nellymoser.com/.

The Nellymoser algorithm codec is included in the player only in Flash MX.

In the Flash IDE, Nellymoser compression is called Speech. You can adjust the quality / size ratio when using Nellymoser compression by changing the sample rate.

You can also include uncompressed audio in your SWF movie. In the development environment, this mode is called Raw. In this case, you can change the bit depth and sample rate. In theory, you can use uncompressed audio if sound quality is significantly more important than movie size (or, even less likely, if you need to save computing resources). In practice, however, it is better to use MP3 compression with a high bit rate (more than 120 Kbps).

Storage formats
There are quite a few audio formats. By default, Flash only allows you to import two of them.

• WAV. The main format for storing uncompressed audio on the Windows platform. Supports mono and stereo audio, various samples, and bit depths. Usually it is WAV where the analog signal is digitized, and only then is one of the compression algorithms applied. WAV files are extremely large, which is why this format has been significantly replaced by MP3. However, WAV is still the main format for professional sound editors like SoundForge.

• MP3. Audio format using the compression algorithm described above. The main format in the case of Flash, as it perfectly combines good sound quality and a small file size. Also, sound files in this format, unlike WAV files, can be dynamically loaded into a movie using the loadSound () method of the Sound class.

If you have QuickTime 4 or higher installed, you can import files in AIFF, QuickTime, Sun AU formats additionally.