
Digital audio formats: the MP3 phenomenon

The MP3 music format (MPEG-1 Layer 3) is one of the most widely used digital audio formats in the world.


It is compatible with all portable and stationary audio devices. In May 2017, the developers of the format announced his “death”. On April 23, 2017, the Technicolor and Fraunhofer IIS licensed commercial program was canceled: the last patent included in the program expired, making the format standard in the public domain.
Can we say that the days of the most popular format are numbered? MP3 development began in the late 1980s at the Fraunhofer Institute for Integrated Circuits (IIS). In 1987, the University of Erlangen-Nuremberg and Fraunhofer IIS teamed up to work on the EU147 EUREKA Digital Audio Broadcasting (DAB) project. The first result of the alliance’s work was the LC-ATC codec, which made it possible to encode stereo music in real time.
The next step was the development of an optimal frequency domain (OCF) coding algorithm, which already had some of the characteristics of the future MP3 codec. For the first time, it is possible to encode music in good quality at 64 kbps for a mono signal. OCF was the beginning of the path towards standardization MPEG (Moving Picture Expert): an organization, responsible for the development and implementation of international standards for the compression and transmission of digital video and audio content.
In 1989, MPEG received 14 proposals for the implementation of an audio coding standard, so participants were invited to combine their developments. This led to the emergence of four potential candidates, including MUSICAM from the Institute for Broadcasting Technology IRT and Philips and ASPEC (Adaptive Spectral Perceptual Entropy Coding), which is the result of further enhancements to the OCF Fraunhofer IIS in addition to contributions from the University of Hannover in collaboration with AT&T and Thomson.
After extensive testing, MPEG proposed combining MUSICAM and ASPEC to create a family of three encoding methods: Level 1: a low-complexity version of MUSICAM; level 2 – MUSICAM codec; Level 3 (later called MP3): based on ASPEC. Technical development of the MPEG-1 standard was completed in December 1991. In 1994, Fraunhofer IIS introduced the world’s first MP3 encoder, the L3enc, and in 1995 the Fraunhofer researchers unanimously accepted “.mp3” as the file extension for MPEG Layer 3 [1].
Thanks to the compression algorithm used in the MP3 audio format, the size of the data required to reproduce the recording and ensure the quality of sound reproduction is significantly reduced to 10-12 times the original, depending on the recording bit rate. . Bit rate refers to the encoding / decoding rate of a digital audio stream; sound quality improves with increasing bit rate. The MP3 format has the following bit rates: 32 kbps (very low quality, acceptable only for voice), 96 kbps, 128 kbps (medium quality), 160 kbps, 192 kbps, 256 kbps, 320 kbps (maximum optimal quality). The principle of the compression algorithm is as follows: during the compression process, the audio codecs analyze the signals, focusing on the audible fragments, which are saved for later playback or transmission.
This rules out sounds beyond the perception range of the human ear (20 to 20,000 Hz). That is why MP3 is called lossy. There are three ways to encode MP3 files: constant bit rate (CBR), variable bit rate (VBR), and medium bit rate (ABR). CBR is the default encryption mode. In this mode, the bit rate is constant for the entire file. This means that each part of the MP3 file uses the same number of bits. Regardless of the complexity of a piece of music, the encoder uses the same bit rate, so the quality of the final file is variable.
Complex parts will be of lower quality than simpler ones. The main advantage of this mode is that the size of the final files does not change and can be accurately predicted. When encoding in VBR mode, the user selects the desired quality on a scale of 9 (lowest quality, highest distortion) to 0 (highest quality / lowest distortion). The codec then tries to maintain a certain quality throughout the file by choosing the optimal number of bits for each part of the audio recording. The main advantage is the ability to specify the level of quality to be achieved, but the significant disadvantage is the unpredictability of the final file size.












