Basic concepts of digital sound theory

Free Download Mp4Gain

Basic concepts of digital sound theory

Digital Sound

Sound is, in general, the vibrations of an elastic medium. The sound is caused by mechanical vibrations of some object (this can be a string, vocal cords, etc.) in contact with the environment. The frequency of vibration (measured in Hertz) determines the pitch. The higher the frequency, the louder the sound. The human ear can perceive sound vibrations from the air with a frequency of 20 Hz to 20 kHz. The ear perceives the amplitude of the vibration as volume. The higher the amplitude, the louder the sound.

Digital Sound

Electromagnetic waves are a direct analog of sound waves. The latter are less susceptible to dispersal by the environment, the information they carry is easier to store and process. Electromagnetic waves are the most important secondary carrier of sound. The transformation of acoustic waves into electromagnetic waves (as well as the reverse operation) is carried out due to the usual induction effect, which consists in the appearance of a current in a conductor when it is placed in an alternating magnetic field.

Simply put, the oscillation of the loudspeaker membrane magnet near the coil induces an alternating current in it. If this current is applied to another speaker, then the magnet on its membrane will move, creating a corresponding sound.

This is how the telephone and the radio work.

Sound converted to electromagnetic waveform can be easily stored. For this, some parameter of the carrier must be compared (the depth of the plate track or the degree of magnetization of the film) with the amplitude of the oscillations (that is, the strength of the induced current in the speaker coil) . Sound converted directly to electromagnetic waves is called analog sound. Its main characteristic is the direct correspondence of the electromagnetic waves transmitted or recorded with the acoustic ones.

Digital sound is relatively new. Its main difference from analog is discretion. When digitizing, a special device, an analog-to-digital converter (ADC), measures at regular intervals (approximately 0.001-0.0001 seconds) the magnitude of the amplitude of an electromagnetic wave corresponding to an analog sound form and writes its value to a file with a specified precision. This value is generally called sample, or in jargon, sample (of the sample in English, sample). The same digitization is often called sampling or sampling.

By converting sound from digital to analog (this is done by a device called a digital-to-analog converter (DAC)).

The interpolation (approximation) of the intermediate values of the amplitude is carried out according to the known ones. Since the sampling frequency is usually high, this operation allows you to fairly accurately reconstruct the original analog signal.

The digital form of sound is characterized by five parameters.

1. The sampling rate;
2. Bit size of the samples.
3. The number of channels or tracks.
4. Compression / decompression algorithm (codec).
5. Storage format.

Since each of these parameters is quite specific, we will consider them separately.

Sampling rate
The sample rate determines how many samples per second will be taken when digitizing. If we compare digital sound with digital images, then the sample rate will correspond to the resolution (a more “realistic” analogy is the frame rate in cinema). The higher the sampling frequency, the better it is possible to reconstruct the analog signal based on the digital form of the sound (more precisely, the higher the sampling frequency, the broader the spectrum of frequencies that can be recorded during digitization).
The famous Nyquist-Kotelnikov theorem states that for the correct reconstruction of an analog signal from its digital recording, it is necessary that the sampling frequency be at least twice the maximum sound frequency.

Since the upper listening limit is 20 kHz, ideally the sample rate should be at least 40 kHz. This is why the standard sampling frequency used for recording CDs is 44.1 kHz (so-called CD quality). However, the sample rate can be higher, but this sound quality is only used by recording studios and especially demanding music lovers.

A sample rate of 44.1 kHz is not always ideal. When transmitting data over a low bandwidth network, sound quality must be sacrificed in favor of size, in practice sampling frequencies two, four and eight times lower than 44.1 kHz are often used.

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

Author: R. Arias

R. Arias is the author of this article and has extensive experience for more than 30 years as a recording engineer and audio specialist, as well as more than 20 years of experience creating algorithms related to audio and video. Linkedin View all posts by R. Arias