What you need to know about MP3


Free Download Mp4Gain
picture

What you need to know about MP3

Mp3

What is MP3?

Mp3

MP3 is short for MPEG Layer3. It is one of the transmission formats for storing and transmitting audio in digital form, developed by Fraunhofer IIS and THOMSON, and later approved as part of the MPEG1 and MPEG2 compressed video and audio standards. This scheme is the most complex scheme in the MPEG Layer 1/2/3 family. It requires the most amount of machine time to encode compared to the other two and provides higher encoding quality. It is mainly used for audio CD encoding.

The high degree of compactness of MP3 compared to other formats such as PCM (i.e. normal WAV- file) and similar formats while maintaining similar sound quality (considered 16-bit stereo at 44.1 kHz) is achieved using additional quantization according to a certain scheme, which minimizes the loss of quality. This is achieved by taking into account the peculiarities of human hearing, including the masking effect of a weak signal from one frequency range with a stronger signal from an adjacent range, when it occurs, or a strong signal from the previous frame, which causes a temporary decrease in the ear’s sensitivity to the current frame signal (simply, background sounds are eliminated, which are not heard by the human ear due to the presence at a given / previous moment of another – louder). It also takes into account the inability of most people to distinguish between signals that are below a certain power level,

This is called adaptive coding, and it allows you to save on the less perceptually significant sound details. The compression ratio (and therefore quality) is not determined by the format, but by the width of the data stream when encoded in MP3. The bit rate when encoding a signal similar to an audio CD (44.1 kHz 16 bit stereo) varies from the largest, 320 kbs (320 kilobits per second, also kbs, kbps or kb / s), up to 96 kbs and less.

Why MP3?

MP3 has two huge advantages over other formats available today. It is true that MicroSoft is trying to squeeze MP3 with its new WMA format, and there are also alternative VQF and AAC formats, but they have not yet received proper distribution and the quality is often a little worse. However, WMA is still, in fact, closed for free use, so you have problems with various encoding / listening / maintenance programs (although, who doubts MicroSoft’s mobilization capabilities :-).

The first advantage of MP3 is that none of the existing similar formats can yet be said to fully guarantee the stable preservation of sound quality at sufficiently high bit rates, except MP3, which has stood the test of time with dignity.
The second, no less important advantage: over the next few years, and perhaps the entire decade, MP3 has become the de facto standard, as the parties that use it (eg me 😉 have made a lot of investments in him, including digital radio stations. There are also many easy-to-use software programs written for MP3. Now the production of hardware MP3 players has been launched, both pocket and car. Thus, MP3 became the first massively recognized audio storage format after Audio CD (although it is often illegal).

The most famous encoders

Today there are 3 main sources that have created programs to encode MP3 music. These are Fraunhofer-IIS, Xing Technologies, and ISO itself, which adhere to the ISO MPEG standard developed by it.
Most of the encoders created to date use modified code from one of these organizations. Fraunhofer-IIS based encoders are not very fast, but very high quality, quality optimized for low bit rates.

128 kbps (11: 1)
The most popular bit rate today. The 11: 1 compression ratio is of course an argument, especially for the internet, where every kilobyte counts. However, the high frequencies are not very well preserved and there is some distortion in the sound. At the same time, I can safely say that on an ordinary computer, for example, using an ordinary sound card, computer speakers, albeit of good quality, or output through a simple recorder to your speakers (using the input for a External CD, like me), the difference will not be noticeable unless you are a sound expert.
However, in normal speakers (at least large and expensive), the lack of high frequencies is quite noticeable.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

How is an mp3 analyzed inside?

How is an mp3 analyzed inside?

MP3 is the acronym for MPEG 1 Layer 3 and is a lossy digital audio format developed by MPEG (Moving Picture Experts Group) in conjunction with the Franunhofer Institute of Technology to include it as an audio format for the MPEG-video format. 1. It is currently an ISO (International Organization for Standardization) standard. The reason it has become so popular is that it allows for high sound quality in very little storage space: About 650 songs can be recorded on a 650MB CDROM, in instead of the 15 that we could store following the format of traditional CD-Audio. Furthermore, it is possible to adjust the quality of the output file by adjusting the bitrate (sampling rate and number of bits per sample), which will be proportional to the size of the output file. Thanks to its small size, high quality and versatility, it became a standard for streaming.

It was said at the beginning that MP3 is a lossy algorithm, this means that the original and encoded sound are not exactly the same. For this, the MP3 takes advantage of the “deficiencies” of the human ear, specifically 3 of them:

Limits of hearing in frequency: The human ear is only capable of hearing frequencies that are approximately between 20 and 20,000 KHz, with which the rest are filtered and discarded as they would not add relevant information to the encoded signal. Also, the closer you are to the 2-4 Khz range (and harder to hear as the frequency gets closer to the extremes of hearing), the more audible it will be.
Masking effect: When 2 signals of similar frequency overlap, human hatred is only able to hear the one with the highest power (volume), therefore, the rest can be eliminated without appreciable loss of quality.
Stereo redundancy: Sometimes there is redundancy between the 2 channels and, furthermore, below a certain frequency, the human ear is not able to distinguish the directionality of the sound with which a single channel can be encoded and add to the other certain complementary information to not lose the spatial sensation of the other channel.
To carry out the three previous proposals, a system based on subbands is used in which the signal is filtered using several filters in order to have the signal separated into sub-signals, each covering a frequency range. Each of these bands is compared to a psychoacoustic model that determines which bands are important and which can be removed.

Specifically, a hybrid polyphase / MDCT (Modified Discrete Cosine Transform) filter bank is used: A filter bank is a set of band-pass filters that aim to separate the original signal into several frequency bands; A multiphase / MDCT hybrid filter bank is nothing more than a normal filter bank together with a block capable of doing the discrete cosine transform (MDCT).

The choice of which bands are maintained and which are removed is made by calculating the masking threshold, that is, it analyzes each audio sub-signal and calculates the amount of noise that can be input (signal is replaced by noise to save storage space) in function of the frequency, taking into account that a frequency masks signals of a higher frequency than yours rather than lower, without being noticeable to the human ear.

The following figure outlines the process described above:

The following figure represents the structure of an mp3 file:

As can be seen, an Mp3 file is made up of different frames which in turn are made up of an Mp3 header and MP3 data. Each of the frames is independent, that is, a person can cut the frames of an MP3 file and then play them back. The graph shows that the header consists of a sync word that is used to indicate the beginning of a valid frame. Following are a series of bits that indicate that the analyzed file is a standard MPEG file and whether or not it uses layer 3.

MP3 undoubtedly owes its success to Internet music downloads and portable audio players capable of playing the format. First, Discman compatible with MP3 were born, which allowed transporting 175 songs per cd instead of the usual 6. Subsequently, MP3 players based on a (small back then) flash memory were born. These had the advantage of being much smaller and lighter than portable CD players, but with the initial disadvantage that flash memory was small and expensive. Initially these devices had 64 or 128 MB memory, which allowed them to store between 16 and 32 songs. Currently these devices are sold with a memory of 1,2,4 or even 8GB. This allows them to store between 256 (for the 1Gb model) and 2048 (for the 8GB model)