Is the digital signal distorted during transmission and storage?


Free Download Mp4Gain
picture

Is the digital signal distorted during transmission and storage?

DIGITAL AUDIO

Since any digital signal is represented as a real voltage or current electrical curve, its shape is distorted in one way or another during any transmission, and a signal “frozen” for storage (signalogram) is subject to degradation due to physical reasons. common.

Digital Audio

All of these influences on the shape of the carrier signal are interferences that, up to a certain value, do not change the information content of the signal, since individual distortions and letter loss in words generally do not interfere with the correct understanding of words. words, and information redundancy, such as an increase in the length of the words, increases the probability of successful recognition. … In other words, the carrier signal itself can be distorted, but the information it carries, the encoded audio signal, remains unchanged in the vast majority of cases.

So that the quality of the carrier signal does not deteriorate, any transmission of useful audio information (copying, writing to a carrier and reading it) must necessarily include the operation of restoring the form of the carrier signal, and ideally, and the digital form primary of the information signal, and only after that the newly generated carrier signal can be transmitted to the next consumer. In the case of direct copy without restoration (for example, simply rewriting a video cassette with a digital signal obtained with a PCM decoder in common VCRs), the quality of the digital signal deteriorates, although it still contains all the information it carries. However, after repeated sequential copies or long-term storage, the quality deteriorates so much that unrecoverable errors begin to appear that irreversibly distort the information carried by the signal. Therefore, the copying and transmission of digital signals should be done only on digital devices and, when stored on media, should be “updated” in a timely manner without waiting for irreversible degradation (for magnetic media, this period is estimated to be several years ). A correctly transmitted or updated digital signallogram does not lose quality and can be copied and exist forever in absolutely unaltered form. without waiting for irreversible degradation (for magnetic carriers this period is estimated to be several years). A correctly transmitted or updated digital signallogram does not lose quality and can be copied and exist forever in absolutely unaltered form. without waiting for irreversible degradation (for magnetic carriers this period is estimated to be several years). A correctly transmitted or updated digital signallogram does not lose quality and can be copied and exist forever in absolutely unaltered form.

However, it should not be forgotten that the correctness of any code is finite, and the actual carriers are far from ideal, therefore the occurrence of unrecoverable errors is such a rare thing, especially with careless handling of the carrier. When reading new and correctly stored DAT cassettes or CDs on high-quality and reliable devices, these errors practically do not occur, however, with aging, contamination and damage of media and reading systems, they become more. A single uncorrected error is almost always invisible to the ear due to interpolation, however, it leads to distortion of the original sound signal, and the accumulation of such errors over time begins to be felt in the ear.

A separate problem is the difficulty of recording uncorrected errors, as well as verifying the identity of the original and the copy. Very often, designers of digital audio devices operating in real time do not care about the issue of accurate verification of the reliability of the transmission, considering that the measures taken to correct the errors are sufficient. In the general case, the impossibility of retransmitting an erroneous sample or block leads to interpolation occurring secretly and after copying it is impossible to say with certainty whether the original signal was copied exactly. Error indicators, which are found on some devices, usually light up only at the moment of their appearance, and in the case of single errors, their operation can easily go unnoticed. Even in personal computer-based systems, it is often impossible to control the accuracy of reception through a digital interface or direct reading from a CD; the only way out is to repeat the operation and compare the results.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

What are the pros and cons of digital audio?

What are the pros and cons of digital audio?

Digital Audio

The digital representation of sound is valuable, first of all, for the possibility of endless storage and reproduction without loss of quality; however, the conversion from analog to digital and vice versa inevitably leads to its partial loss.

digital audio

The most unpleasant distortions introduced in the digitizing stage are the granular noise that occurs when the signal is quantized by level due to rounding of the amplitude to the nearest discrete value. Unlike simple broadband noise introduced by quantization errors, granular noise is the harmonic distortion of the signal, most noticeable in the upper part of the spectrum.

The power of the granular noise is inversely proportional to the number of quantization steps; However, due to the logarithmic characteristic of hearing with linear quantization (constant step value), quiet sounds have fewer quantization steps than loud sounds, and as a result, the main density of non-linear distortions falls in the region of sounds. silent. This leads to a limitation of the dynamic range, which ideally (without taking into account harmonic distortion) would be equal to the signal-to-noise ratio, but the need to limit this distortion reduces the dynamic range for 16-bit encoding to 50-60 dB. The situation could have been saved by logarithmic quantification, but its implementation in real time is very difficult and expensive.

The distortion introduced by granular noise can be reduced by adding normal white noise (random or pseudo-random signal) to the signal, with an amplitude of half the least significant bit; such an operation is called dithering. This leads to a slight increase in the noise level, but weakens the correlation of quantization errors with the components of the high-frequency signal and improves subjective perception. Anti-aliasing is also applied before rounding the samples by decreasing their bit depth. Essentially, dithering and noise shaping are special cases of the same technology, with the difference that, in the first case, white noise with a flat spectrum is used and, in the second, noise with a spectrum with a “shape “special.

When restoring audio from digital to analog, there is the problem of smoothing the stepped waveform and suppressing the harmonics introduced by the sample rate. Due to the imperfection of the frequency response of the filters, insufficient suppression of this interference or excessive attenuation of useful high-frequency components may occur. Poorly suppressed sample rate harmonics distort the shape of the analog signal (especially in the high frequency region), resulting in a “rough” and “dirty” sound.

MP3 and audio digitization.

MP3 and audio digitization.

audio digitalization

All of humanity has become accustomed to such everyday things as recording and reproducing sound, be it a voice recorder, an answering machine, or musical recordings of their favorite artists. And people who spend most of their time near the computer probably can’t imagine life without sound. This article will focus on such a common encoding format as MP3.

audio digitalization

Well, Thomas Alva Edison started recording when he yelled the words “Mary had a lamb” on his “Talking Machine”. The “talking machine” was the world’s first device to record and reproduce sound: a phonograph that mechanically recorded a soundtrack on a wax roller. At the time, this was certainly a huge step forward, as at that time, and this was in 1877, no one came up with the idea of ​​creating something similar.

However, the biggest disadvantage of this sound carrier was the fragility of the recording. With the development of science and technology, people learned to record sound not only mechanically, as Edison did, but also electromechanically and photoelectrically, and with the advent of computers, it became possible to record sound in digital form. The main advantage of this recording method is the preservation of sound quality, regardless of how many times it has been played or rewritten, and since digital information can be processed on a computer, this opened wide doors of possibilities for working with sound. . But since in the early stage of digital sound development, recording a composition cost a lot of disk space and magnetic media had a small capacity, software developers began to baffle the fact. how to put a lot of music on a small hard drive. This led to the appearance of various programs – compressors, which reduced the size of the audio file. Compression algorithms provided the removal of certain frequencies, which led to a loss in sound quality, and then the user was faced with the choice of spending money buying additional megabytes and storing uncompressed music files, or saving money. and use compressors.

First, let’s find out what “sound” is in real life. The transmission of information at a distance using acoustic vibrations is only possible due to the properties of the acoustic environment in which these same sound vibrations occur. They are possible due to the presence of elastic bonds between particles in the conductive medium. The sound source creates an area of ​​pressure by compressing air molecules. These molecules transfer their energy to others that are nearby, and these, in turn, to others, etc., which leads to the appearance of areas of increased and decreased pressure in relation to the ambient pressure. This creates a sound wave that is continuous in nature. One of the parameters of the wave is amplitude. Let’s take a simple example: a guitar string. Everyone knows that to increase the volume of the sound it is necessary to pull the string with more force, thus increasing the amplitude of its vibration, which will lead to an increase in the pressure deviation. But a wave is not enough to transmit a sound that can be perceived by the human ear. Another important point is the vibration frequency, that is, the frequency with which the sound source creates a pressure change, and it is this frequency that determines the pitch of the transmitted sound. On a guitar, to change the pitch, you need to hold down the string at a certain fret, that is, change the length of the string and, as a consequence, the frequency of its vibrations. Another important point is the vibration frequency, that is, the frequency with which the sound source creates a pressure change, and it is this frequency that determines the pitch of the transmitted sound. On a guitar, to change the pitch, you need to hold down the string at a certain fret, that is, change the length of the string and, as a consequence, the frequency of its vibrations. Another important point is the vibration frequency, that is, the frequency with which the sound source creates a pressure change, and it is this frequency that determines the pitch of the transmitted sound. On a guitar, to change the pitch, you need to hold down the string at a certain fret, that is, change the length of the string and, as a consequence, the frequency of its vibrations.

Now that we understand the nature of sound a bit, let’s move from analog to digital. To digitize “natural” sound, you must first convert it to an analog electrical signal. In this case, the analog of the amplitude of the sound wave is the amplitude of the voltage change. As mentioned above, the wave and the analog electrical signal are continuous functions, but for digitization they must be represented in discrete form. For this, an ADC (analog-digital converter) is used, which breaks the continuous wave into sections (Sample) and represents the amplitude of the wave in these sections as a number, that is, it quantifies. It is clear that for greater precision and purity of sound, the number of samples must tend to infinity and their size must go to zero. The number of samples per second is called the sample rate or sample rate and is measured in Hz. The question arises, what sample rate to use when digitizing so that the result is the most natural? It is theoretically known that for the most accurate reconstruction of a continuous analog signal from discrete values, it is necessary to use a sampling frequency at least 2 times higher than the frequency of sound (Nyquist’s theorem). It is known that the human ear can perceive sounds with a frequency of 18 to 20,000 Hz. Therefore, the optimal sampling frequency is 40 kHz or more. The most common sampling frequencies are 44.1 kHz, 48 kHz. However, due to the fact that harmonics above 20 kHz also affect the overall sound, encoders with sample rates of 96 and 192 kHz are also used. Also, the sound quality depends on the number of digits used to record the measured amplitude. The quantization error is inversely proportional to the bit width. Therefore, with 8-bit quantization, the sound level is recorded using numbers in the range [-128; 128], with 16 bits from [-32768; 32768]. For example, when recording audio CDs, exactly 16-bit quantization is used, so they have high sound quality.

Let’s make a middle conclusion: the ADC converts the analog signal into numbers and writes them as a sequence. Then comes Wave, a sound format. Note that audio CDs record sound in the same format. However, this storage method is not economical. Many people probably prefer an MP3 disc, which can contain more than 200 songs, than a regular CD. It does this by compressing the Wave file at the expense of quality. But don’t be alarmed, as the human ear is virtually incapable of recognizing the loss of sound quality after compression. Let me explain now. It all started when, in the late 1980s, the International Organization for Standardization (ISO) created the Moving Pictrures Experts Group, whose task was to develop an international standard for the presentation of digital video and audio data. The result of the group’s work is the MPEG-1 Layer 3 format, or MP3 for short, which compresses audio data by 1/12 with virtually no loss of quality. The audio compression algorithm in this format is based on the psychoacoustic characteristics of the human hearing organ, and therefore the removal of elements that are not perceived by the ear does not affect the noticeable deterioration in quality. Suppose there are many people in the room and they are all talking to each other at the top of their voices, and if you try to call a person who is only a few feet from you without raising your voice, don’t expect them to answer your call. , since due to the noise generated, it will not hear you. This is because sounds of the same frequency with higher amplitude mask other frequencies with lower amplitude. However, this unfortunate effect is happily used to compress digitized audio. The wave stream will contain all sound information, even masked, that is not audible to the ear, but after compression this information will be removed, reducing the file size. Another important characteristic of the human hearing organ used for compression is inertia. The ear, to put it vulgarly, is an inertial device, therefore, at the limit of the difference in sound level from highest to lowest for a certain time (~ 100 ms), a person cannot hear a sound of lower amplitude Therefore, the sound in this period may not be saved. It is also possible not to save the sound that is beyond the sensitivity threshold, that is, the sound level of which is below a certain value and is therefore inaudible to a person. Another interesting property used for encoding (but not by ”

Together, therefore, all of this leads to significant savings in the disk space occupied by the audio file. An average music file that occupies 30-40 MB in “full” form, after encoding it in MP3, already occupies 3-4 MB, allowing you to record more than 11 hours of music on a disc. However, this is not the limit. In 2001, the MP3 format had a successor: the MP3Pro format. Its creators are Thomson Multimedia and the Fraunhofer Institute in Germany. A distinctive feature of the new improved format is that, with the same quality, the files in the new format take up 2 times less space compared to normal MP3s. For example, an MP3Pro file with 128 kbps sound quality will be the same size as a 64 kbps MP3 file. Another advantage is

Let’s see how this is achieved. The working principle of the MP3Pro format is quite simple. When encoding, the audio stream is divided into two parts, two streams. The first is the low-frequency one, which is encoded in the usual MP3 format, which, by the way, makes the formats backward compatible, because normal players only play this part of the file. The second stream is high frequency, which is encoded in the part of the MP3 stream that older players ignore. The new decoder combines these two streams, leading to full sound across the entire frequency band.
Regarding the promotion of the new format in the market, compared to its older brother, MP3Pro has not received such a wide distribution. Thomson Multimedia offers a free version of the MP3Pro Player / Encoder for download from their website. The limitations of this version are that only 64 kbps quality is available for encoding. WinAmp lovers have a plugin to play MP3Pro files

Of course, the light did not converge on MP3, there are other digital encoding formats, but despite this, it is still the most popular.

MP3, FLAC, WAV, ALAC: the differences between audio formats

Digital audio formats

Digital Audio

Today, most people listen to music completely digitally. The differences between digital audio formats like WAV, FLAC, MP3, and ALAC are not clear to everyone. We put the facts together.

Digital audio formats

While vinyl is booming and CD sales are slowly but surely falling, today’s music is often heard without any physical medium. Whether you use your smartphone or digital audio player, you can move forward with digital audio formats on the go. After all, no one today wants to carry a Discman and multiple CDs with them when they typically have a powerful pocket computer in the form of a smartphone that can play digital music files. But what are the differences between the individual file formats and what are their advantages and disadvantages?

WAV and AIFF: the uncompressed ones

The Wave container format (.wav) was developed by Microsoft. Saves uncompressed audio content, so files require a lot of storage space (2 minutes can take 20MB of space. WAV is especially important when recording and editing audio content. The downside of .wav files is that they don’t metadata is required (about, Title Artist) can be stored,
the equivalent developed by Apple AIFF (.aif) Due to the fact that Apple computers are very common in music production, this audio format is very common there.

MP3, AAC, WMA, Ogg-Vorbis – compressed to save space, but not lossless

The MP3 file format (.mp3, named for the MPEG-1 Audio Layer 3 compression codec) developed by the Fraunhofer Institute in the 1980s is probably the best-known digital audio format. It gave the MP3 player its name, and for a long time music was digitized almost exclusively as MP3, for example, on the extremely popular and now illegal file-sharing networks around the turn of the millennium. The advantage of MP3 is the small amount of storage space required: on average, it takes up one-tenth the size of the original file. However, one disadvantage that should not be neglected is that it is lossy – frequencies that are inaudible to humans are removed to drastically reduce the memory required. To what extent this affects the sound, you can compare Flac with MP3 Read.

AAC (Advanced Audio Coding) is a successor to the MP3 format, offering slightly better sound quality. Apple continues to mainly offer songs in this audio format on the iTunes store.

WMA stands for Windows Media Audio (.wma), as the name suggests, a development by Microsoft. .Wma is also a lossy compression file format.

A somewhat rarer audio format is Ogg-Vorbis (.ogg), where Vorbis is the music compression technology and .ogg is the container format. Like MP3, .ogg is also lossy, but requires less storage space and better quality.

FLAC / ALAC / WMA lossless – the lossless

Lossless formats were developed to preserve all sound information while keeping the amount of memory required small. With all file formats, the required memory is reduced to about half the original file. With audio conversion software, the file can be converted to other lossless formats, something unthinkable with lossy formats. This is why lossless file formats are popular for archiving music collections in a space-saving way.

FLAC – Free Lossless Audio Code (.flac) is a free audio format, so it is not owned by any major corporation. ALAC: Apple Lossless Audio Codec (.alac) is Apple’s lossless file format, while Microsoft also has its own development on the market with WMA Lossless.

Introduction to digital audio

Introduction to digital audio

Digital audio is the representation of sound signals through a set
of binary data. A complete digital audio system usually begins
with a transceiver (microphone) that converts the pressure wave that represents the
Sound to an analog electrical signal.
This analog signal goes through an analog signal processing system, in
which can be made limitations on frequency, equalization, amplification and
Other processes such as compassion. Equalization aims
counteract the particular frequency response of the transceiver used of
so that the analog signal closely resembles the original audio signal.


After analog processing, the signal is sampled, quantified and encoded. The
sampling takes a discrete number of analog signal values ​​per second
(sampling rate) and quantification assigns discrete analog values ​​to those
samples, which means a loss of information (the signal is no longer the same
than the original). The encoding assigns a sequence of bits to each value
discrete analog The length of the bit sequence is a function of the number of
analog levels used in quantification. The sampling rate and the
number of bits per sample are two of the fundamental parameters to choose from
when you want to digitally process a certain audio signal.
Digital audio formats try to represent that set of samples
digital (or a modification) of them efficiently, so that it is optimized
depending on the application, either the volume of the data to be stored or the
processing capacity necessary to obtain the starting samples. In
in this sense there is a very extended audio format that is not considered audio
digital: the MIDI format. MIDI does not start with digital sound samples, but
stores the musical description of the sound, being a representation of the
score of them.
The digital audio system usually ends the reverse process to that described. From
the stored digital representation is obtained the set of samples that
represent. These samples go through a process of digital analog conversion
providing an analog signal that after processing (filtering,
amplification, equalization, etc.) affect the output transceiver (speaker)
which converts the electrical signal to a pressure wave that represents the sound.

Fundamental parameters of digital audio

The basic parameters to describe the sequence of samples it represents
The sound are:
ƒ The number of channels: 1 for mono, 2 for stereo, 4 for sound
quadraphonic, etc.
ƒ Sampling rate: The number of samples taken per second in each
channel.
ƒ Number of bits per sample: Usually 8 or 16 bits.
As a general rule, multichannel audio samples are usually organized in
frames A plot is a sequence of as many samples as channels,
each one corresponding to a channel. In this sense the number of samples per
second matches the number of frames per second. In stereo, the channel
Left is usually the first.