Mp3: Variable Bit Rate Encoding

Free Download Mp4Gain

MP3 file format: Understanding Variable Bit Rate Encoding

MP3 file format

The MP3 file format revolutionized the way we listen to music and audio content. It allowed us to store high-quality audio files in a compact size, making it easier to share and transfer them. However, to achieve this level of compression, MP3 file format uses a lossy compression technique that removes certain parts of the audio data. As a result, the audio quality of MP3 files is lower than the original recording.

Variable Bit Rate Encoding

Variable Bit Rate Encoding (VBR) is a technique used by MP3 file format to achieve better compression while maintaining audio quality. Instead of using a constant bit rate for the entire file, VBR adjusts the bit rate according to the complexity of the audio. This means that more complex parts of the audio, such as music with lots of instruments, will have a higher bit rate, while simpler parts, like a solo voice, will have a lower bit rate. This results in smaller file sizes without sacrificing audio quality.
According to the book “Mastering Audio: The Art and the Science” by Bob Katz, “VBR is a much more efficient way of storing audio data…it allows us to use the bits more efficiently.” However, VBR can also be more complex to decode and can cause compatibility issues with some audio players.
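As a rough illustration of the idea (not how any real encoder makes its decisions), the sketch below maps a made-up per-frame complexity score between 0 and 1 to one of the MPEG-1 Layer III bit rates. The scores and the `pick_bitrate` rule are assumptions invented for this example; real encoders rely on a psychoacoustic model instead.

```python
# Bit rates (kbps) defined for MPEG-1 Layer III.
LAYER3_BITRATES = [32, 40, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256, 320]


def pick_bitrate(complexity: float) -> int:
    """Map a hypothetical complexity score in [0, 1] to a bit rate (toy rule)."""
    complexity = min(max(complexity, 0.0), 1.0)
    index = round(complexity * (len(LAYER3_BITRATES) - 1))
    return LAYER3_BITRATES[index]


def average_bitrate(complexities) -> float:
    """Average bit rate over all frames, in kbps."""
    rates = [pick_bitrate(c) for c in complexities]
    return sum(rates) / len(rates)


# Made-up scores: a solo voice, a dense passage, then the voice again.
frames = [0.2, 0.2, 0.9, 1.0, 0.9, 0.2]
vbr_rates = [pick_bitrate(c) for c in frames]
vbr_average = average_bitrate(frames)
```

With these invented scores the simple frames get a low bit rate and the dense frames a high one, so the average lands below what a constant high bit rate would spend on every frame.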

Audio Quality

The goal of VBR is to maintain audio quality while reducing file size. However, the quality of the audio can still be affected by the bit rate used. A higher bit rate will result in better audio quality but also a larger file size, while a lower bit rate will result in a smaller file size but lower audio quality. It’s important to strike a balance between file size and audio quality based on your specific needs.
In the words of filmmaker George Lucas, “Sound is 50 percent of the movie-going experience.” So, whether you’re listening to music or watching a movie, the audio quality should be a top priority.

File Size

One of the main benefits of VBR is that it reduces the file size of MP3 files. However, the file size can still vary depending on the bit rate used and the length of the audio file. A longer audio file with a higher bit rate will result in a larger file size, while a shorter audio file with a lower bit rate will result in a smaller file size.
It’s important to keep file size in mind when sharing and transferring MP3 files. If the file size is too large, it may take longer to upload or download, which can be frustrating for both you and the recipient.
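The relationship between bit rate, duration, and file size can be sketched with a little arithmetic. The helper names below are made up for this example, and the result ignores tags and other overhead, so treat it as an estimate.

```python
def mp3_size_bytes(bitrate_kbps: float, duration_s: float) -> float:
    """Approximate audio size: bits per second times seconds, divided by 8."""
    return bitrate_kbps * 1000 * duration_s / 8


def pcm_size_bytes(duration_s: float, sample_rate: int = 44100,
                   channels: int = 2, bytes_per_sample: int = 2) -> float:
    """Size of uncompressed CD-style audio (16-bit stereo at 44.1 kHz by default)."""
    return duration_s * sample_rate * channels * bytes_per_sample


four_minutes = 4 * 60
size_128 = mp3_size_bytes(128, four_minutes)   # a 4-minute song at 128 kbps
size_320 = mp3_size_bytes(320, four_minutes)   # the same song at 320 kbps
size_cd = pcm_size_bytes(four_minutes)         # the uncompressed original
```

For a VBR file the same formula works if the average bit rate is used in place of the constant one.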

Audio Codecs

MP3 file format is not the only audio codec that uses variable bit rate encoding. Other codecs, such as AAC and Vorbis, also use VBR to achieve better compression and maintain audio quality. It’s important to understand the differences between these codecs and choose the one that best suits your needs.
In conclusion, MP3 file format’s variable bit rate encoding is a powerful tool that allows us to store high-quality audio files in a compact size. However, it’s important to strike a balance between file size and audio quality based on your specific needs. Whether you’re listening to music or watching a movie, the audio quality should always be a top priority.

Final Words

In conclusion, the MP3 file format is an incredibly popular and versatile format for audio files. However, the handling of variable bit rate encoding can be a complex and nuanced topic. It’s important to understand the differences between constant and variable bit rate encoding, as well as the potential trade-offs in file size and sound quality.

At the end of the day, it’s up to the individual user to determine which encoding method works best for their needs. Whether you’re a music lover who wants to store high-quality files on your device or a professional sound engineer who needs to carefully balance file size and audio fidelity, understanding the ins and outs of variable bit rate encoding is an important step.

As David Bowie once said, “I suppose for me as an artist, it wasn’t always just about expressing my work; I really wanted, more than anything else, to contribute in some way to the culture I was living in.” By understanding the technical aspects of audio file formats and encoding methods, we can better appreciate and contribute to the world of music and sound.

Mp3 File Structure

MP3 File Structure: Anatomy of an MP3 File

Understanding the MP3 File Format

As an audio file format, MP3 is known for its ability to compress audio data to a manageable size with little audible loss of quality. The MP3 file format is based on a set of rules that determine how audio data is stored, organized, and compressed. To understand the structure of an MP3 file, it’s important to know its components, which include the headers, audio data, and metadata.
An MP3 file does not have one global header. Instead, the audio is stored as a sequence of frames, and each frame starts with a short header that records the format version, bit rate, and sample rate for that frame. The audio data is the compressed audio stream that makes up the bulk of the file, while metadata includes information like artist name, album name, and track number.

The Components of an MP3 File

To truly understand the structure of an MP3 file, it’s important to break down its components. The audio data is the most important component of the file, as it contains the actual audio content. This data is compressed using various algorithms to reduce its size while maintaining a high level of audio quality.
The header is also important, as it contains information about the format and encoding. Each frame begins with its own header, which gives a decoder the bit rate, sample rate, and other technical details it needs to work out the length of that frame and decode it.

Finally, metadata is an essential component of an MP3 file. Metadata includes information like artist name, album name, track number, and other relevant details about the audio content. This information is used by media players to organize and display audio content in a user-friendly manner.
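One common way MP3 files carry this kind of metadata is an ID3v1 tag, a fixed 128-byte block at the very end of the file. The sketch below reads such a tag from raw bytes. The function name and the `demo` bytes are made up for this example, and newer ID3v2 tags (stored at the start of the file) are not handled.

```python
def parse_id3v1(data: bytes):
    """Return the ID3v1 fields found in the last 128 bytes, or None if absent."""
    if len(data) < 128:
        return None
    tag = data[-128:]
    if tag[:3] != b"TAG":
        return None

    def text(raw: bytes) -> str:
        # Fields are fixed width and padded with NUL bytes or spaces.
        return raw.split(b"\x00", 1)[0].decode("latin-1").strip()

    fields = {
        "title": text(tag[3:33]),
        "artist": text(tag[33:63]),
        "album": text(tag[63:93]),
        "year": text(tag[93:97]),
        "comment": text(tag[97:127]),
        "genre": tag[127],
    }
    # ID3v1.1 stores a track number in the last comment byte,
    # marked by a zero byte just before it.
    if tag[125] == 0 and tag[126] != 0:
        fields["track"] = tag[126]
    return fields


# Hand-built 128-byte tag used only to exercise the parser.
demo = (b"TAG" + b"Song".ljust(30, b"\x00") + b"Artist".ljust(30, b"\x00")
        + b"Album".ljust(30, b"\x00") + b"1999" + b"hi".ljust(28, b"\x00")
        + b"\x00" + bytes([7]) + bytes([17]))
info = parse_id3v1(demo)
```

In practice the bytes would come from reading the end of an actual file; a media player does essentially this lookup before displaying the track title.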

The Anatomy of an MP3 File

The structure of an MP3 file can be likened to the anatomy of a living organism. Each component of the file works together to create a cohesive and functional audio file. The header serves as the brain of the file, providing important information about the file’s structure and format.
The audio data is like the heart of the file, pumping out the compressed audio stream that makes up the bulk of the file. And metadata is like the skin of the file, providing important information about the content and giving it context.

As with any living organism, each component of an MP3 file is essential to its overall function. Understanding the structure and components of an MP3 file is key to creating and working with high-quality audio content.

Final Words:
In conclusion, understanding the structure of an MP3 file is crucial to working with audio content in a digital age. By understanding the anatomy of an MP3 file, you can better appreciate the technical complexity of audio compression and gain a deeper appreciation for the art of digital audio. As a tool for audio normalization and conversion, mp4gain is an excellent choice for anyone looking to optimize their audio content for use in a digital environment.

MP3 Format History

MP3 Format

I still remember the first time I heard an MP3 file. It was the late 90s, and the internet was still in its early days. I was amazed at how a song could be so compressed and still sound decent. Little did I know that this was just the beginning of a revolutionary audio technology that would change the way we listen to music forever.

The Birth of the MP3 File Format

The MP3 file format was first developed in 1987 by a German engineer named Karlheinz Brandenburg. He was working for the Fraunhofer Institute for Integrated Circuits in Erlangen, Germany, where he and his team were tasked with developing a digital audio format that could compress audio files without losing too much quality.

The breakthrough came in the early 90s when the first MP3 encoder was released. It could shrink audio files to roughly one-tenth to one-twelfth of their original size without losing too much quality. This meant that a 50 MB audio file could be compressed down to about 5 MB or less. This was a huge development at the time, as it made it possible to share audio files over the internet, which was still in its infancy.

The Evolution of MP3 Technology

Over the next few years, the MP3 format continued to evolve and improve. In 1998, the first MP3 player was released by Saehan Information Systems in South Korea. It was called the MPMan and was the size of a small portable cassette player. It had a 32 MB memory and could store up to 8 songs.

By the late 90s, MP3 players had become more common, and the MP3 format had become the standard for digital audio. The first iPod was released in 2001, and it revolutionized the way we listen to music. It had a 5 GB hard drive and could store up to 1000 songs. It was sleek, portable, and easy to use, and it quickly became the must-have gadget for music lovers around the world.

The Future of MP3 Technology

Despite its popularity, the MP3 format is not without its flaws. It is a lossy compression format, which means that some of the original audio data is lost during the compression process. This can result in a loss of audio quality, especially at lower bit rates.

However, there are new audio technologies being developed that may one day replace the MP3 format. One of these is the High-Resolution Audio (HRA) format, which is capable of reproducing audio at a much higher quality than the MP3 format. Another is the Master Quality Authenticated (MQA) format, which is designed to deliver studio-quality audio in a compact file size.

In conclusion, the MP3 format has come a long way since its inception in 1987. It has revolutionized the way we listen to music and has made it possible to share audio files over the internet. While it may one day be replaced by newer audio technologies, its legacy will live on.

What is MP3?

MP3 is a type of audio file that is compressed using a specific algorithm to reduce its size while keeping the sound quality close to the original. MP3 stands for MPEG-1 Audio Layer 3; it was standardized in the early 1990s. Since then, it has become one of the most popular audio file formats in the world.

How does MP3 compression work?

MP3 compression works by removing sounds that are less important to the human ear. This process is called psychoacoustic analysis. The MP3 algorithm uses this analysis to determine which sounds to keep and which ones to discard.

For example, when you listen to music, you may not be able to hear sounds that are below a certain volume level. The MP3 algorithm takes advantage of this fact by removing these quieter sounds from the audio file.

In addition to removing quiet sounds, the MP3 algorithm also removes sounds that are masked by louder sounds. For example, if a loud drumbeat is playing at the same time as a quieter guitar solo, the algorithm will remove the guitar sounds that are masked by the drumbeat.
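The "too quiet to hear" part of this idea can be sketched with a widely quoted approximation of the threshold of hearing in quiet (often attributed to Terhardt). This is only an illustration of the principle: the helper names are made up here, and a real MP3 encoder uses a far more elaborate psychoacoustic model that also accounts for masking by louder sounds.

```python
import math


def threshold_in_quiet_db(freq_hz: float) -> float:
    """Approximate level (dB SPL) below which a pure tone is inaudible in silence."""
    khz = freq_hz / 1000.0
    return (3.64 * khz ** -0.8
            - 6.5 * math.exp(-0.6 * (khz - 3.3) ** 2)
            + 1e-3 * khz ** 4)


def is_audible(freq_hz: float, level_db: float) -> bool:
    """True if a tone at this level lies above the approximate threshold."""
    return level_db > threshold_in_quiet_db(freq_hz)


# The ear is most sensitive around 3 to 4 kHz and much less so at the extremes.
for freq in (100, 1000, 3300, 16000):
    print(freq, "Hz:", round(threshold_in_quiet_db(freq), 1), "dB SPL")
```

Under this approximation a quiet 100 Hz tone can fall below the threshold while a tone of the same level near 3 kHz stays clearly audible, which is the kind of difference an encoder exploits when deciding what to discard.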

What are the benefits of using MP3 files?

There are several benefits to using MP3 files. One of the main benefits is that they take up less space than other audio file formats. This makes them easier to store and share.

Another benefit of using MP3 files is that they can be played on a wide variety of devices. Many music players, smartphones, and computers can play MP3 files without the need for additional software.

How are MP3 files encoded?

To create an MP3 file, you need to encode it using an MP3 encoder. The encoder takes the raw audio data and applies the psychoacoustic analysis and compression algorithms to create the final MP3 file.

There are several software programs that can be used to encode MP3 files, including iTunes and Audacity. Many music players also include built-in MP3 encoders.

What are the drawbacks of using MP3 files?

One of the main drawbacks of using MP3 files is that the compression process can result in a loss of sound quality. While the human ear may not be able to hear the sounds that are removed during the compression process, some people may be able to notice a difference in sound quality between an MP3 file and a file that has not been compressed.

Another potential drawback of using MP3 files is that they may not be suitable for all types of music. The psychoacoustic analysis used by the MP3 algorithm is based on the characteristics of typical music. As a result, some types of music, such as classical music, may not be compressed as effectively as other types of music.

What is MP3 Decoding?

MP3 decoding is the process of converting an MP3 file back into a digital audio signal that can be played by speakers or headphones. The decoder reverses the packing steps of the encoder and reconstructs an approximation of the original signal; the data that was discarded during compression cannot be recovered.

To decode an MP3 file, you need to use an MP3 decoder. Many music players and software programs include built-in MP3 decoders that can be used to play MP3 files.

MP4Gain for Converting and Normalizing Audio and Video Formats

If you are looking to convert and normalize various audio and video formats, including MP3, you may want to consider using MP4Gain. MP4Gain is a software program that allows you to adjust the volume and normalize the loudness levels of your audio and video files.

In addition to MP3, MP4Gain supports a variety of other audio and video formats, including MP4, AAC, FLAC, and more. With MP4Gain, you can ensure that your audio and video files are all at a consistent volume level, making it easier to listen to and enjoy your media collection.

Conclusion

MP3 is a popular audio file format that is widely used for storing and sharing music. The MP3 algorithm uses psychoacoustic analysis to compress audio data and remove sounds that are less important to the human ear. While MP3 files have several benefits, they may not be suitable for all types of music, and the compression process can result in a loss of sound quality.

MP3 file format


1. Overview:
MP3 files are made up of frames; the frame is the smallest unit of an MP3 file. The full name of MP3 is MPEG-1 Audio Layer 3. MPEG (Moving Picture Experts Group) refers to a family of compression standards for moving video and audio. The audio part of the MPEG-1 standard is divided into three layers based on compression quality and encoding complexity: Layer 1, Layer 2, and Layer 3, which correspond to the MP1, MP2, and MP3 file types respectively. Different layers are used for different purposes. The higher the layer, the more complex the encoder and the higher the compression ratio. The compression ratios of MP1 and MP2 are 4:1 and 6:1 to 8:1 respectively, while the compression ratio of MP3 is as high as 10:1 to 12:1, meaning that one minute of CD-quality music requires about 10 MB of storage without compression, but only about 1 MB after MP3 encoding. However, MP3 compresses the audio signal with a lossy method. To reduce audible distortion, MP3 uses perceptual coding: the encoder first analyzes the frequency spectrum of the audio, then uses a psychoacoustic model to filter out components that fall below the audible noise level. The remaining signal is quantized and packed into the bitstream, producing an MP3 file with a high compression ratio that still sounds close to the original source during playback.
2. Overall structure of MP3 files:
MP3 files are roughly divided into three parts: TAG_V2 (ID3V2), Frame, TAG_V1 (ID3V1).
ID3V2 contains information such as author, composer, and album. Its length is not fixed, and it extends the amount of information that ID3V1 can hold.
Frame is a series of frames; their number is determined by the size of the file and the frame length. The frame length may or may not be fixed, and is determined by the bit rate. Each frame is divided into two parts: the frame header and the data entity. The frame header records the bit rate, sample rate, version, and other information about the MP3 stream, and the frames are independent of each other.
ID3V1 contains information such as author, composer, and album, and its length is 128 bytes.
3. MP3 frame format:
Each frame has a frame header, FRAMEHEADER, that is 4 bytes (32 bits) long. Two CRC check bytes may follow the frame header; whether they are present depends on bit 16 of the FRAMEHEADER (the protection bit). If this bit is 1, there is no checksum after the frame header; if it is 0, a 2-byte checksum follows the header. The frame entity data comes after that. The format is as follows:
FRAMEHEADER: 4 bytes
CRC (optional): 0 or 2 bytes
MAIN_DATA: length calculated from the frame header
1. The format of the FRAMEHEADER is as follows:
AAAAAAAA AAABBCCD EEEEFFGH IIJJKLMM
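The 32-bit frame header can be decoded with a few shifts and masks. The sketch below is a minimal example that only handles MPEG-1 Layer III. It assumes the usual field layout (11 sync bits, 2 version bits, 2 layer bits, 1 protection bit, 4 bit rate bits, 2 sample rate bits, 1 padding bit, then private, channel mode, mode extension, copyright, original, and emphasis bits), and it treats a protection bit of 0 as "CRC present", as the MPEG audio specification defines it. The function name and the returned dictionary are made up for this example.

```python
import struct

# Lookup tables for MPEG-1 Layer III; None marks "free" or invalid indices.
BITRATES_KBPS = [None, 32, 40, 48, 56, 64, 80, 96, 112,
                 128, 160, 192, 224, 256, 320, None]
SAMPLE_RATES_HZ = [44100, 48000, 32000, None]


def parse_frame_header(header: bytes):
    """Decode a 4-byte MPEG-1 Layer III frame header, or return None."""
    if len(header) != 4:
        return None
    (word,) = struct.unpack(">I", header)
    if (word >> 21) & 0x7FF != 0x7FF:      # A: 11 sync bits, all set
        return None
    version = (word >> 19) & 0x3           # B: 3 means MPEG-1
    layer = (word >> 17) & 0x3             # C: 1 means Layer III
    protection = (word >> 16) & 0x1        # D: 0 means a CRC follows
    bitrate = BITRATES_KBPS[(word >> 12) & 0xF]        # E
    sample_rate = SAMPLE_RATES_HZ[(word >> 10) & 0x3]  # F
    padding = (word >> 9) & 0x1            # G
    channel_mode = (word >> 6) & 0x3       # I: 0 stereo ... 3 mono
    if version != 3 or layer != 1 or bitrate is None or sample_rate is None:
        return None  # outside the scope of this sketch
    # Frame length in bytes for MPEG-1 Layer III, header included.
    frame_length = 144 * bitrate * 1000 // sample_rate + padding
    return {
        "bitrate_kbps": bitrate,
        "sample_rate_hz": sample_rate,
        "has_crc": protection == 0,
        "padding": padding,
        "channel_mode": channel_mode,
        "frame_length": frame_length,
    }


# A typical header: 128 kbps, 44.1 kHz, stereo, no CRC.
example = parse_frame_header(bytes([0xFF, 0xFB, 0x90, 0x00]))
```

Because the frame length follows from the header alone, a player can hop from one frame header to the next without decoding the audio in between.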

The mp3 phenomenon

MP3

The MP3 music format (MPEG-1 Layer 3) is one of the most widely used digital audio formats in the world. It is compatible with virtually all portable and stationary audio devices. In May 2017, the developers of the format announced its “death”.

On April 23, 2017, the MP3 licensing program run by Technicolor and Fraunhofer IIS was terminated: the last patent included in the program had expired, leaving the format free of patent licensing. Can we say that the days of the most popular format are numbered? MP3 development began in the late 1980s at the Fraunhofer Institute for Integrated Circuits (IIS).

In 1987, the University of Erlangen-Nuremberg and Fraunhofer IIS teamed up to work on the EU147 EUREKA Digital Audio Broadcasting (DAB) project. The first result of the alliance’s work was the LC-ATC codec, which made it possible to encode stereo music in real time. The next step was the development of the Optimum Coding in the Frequency Domain (OCF) algorithm, which already had some of the characteristics of the future MP3 codec. For the first time, it became possible to encode music in good quality at 64 kbps for a mono signal. OCF was the first step on the path towards standardization by MPEG (Moving Picture Experts Group), the organization responsible for the development and implementation of international standards for the compression and transmission of digital video and audio content.

In 1989, MPEG received 14 proposals for an audio coding standard, and participants were invited to combine their developments. This led to four potential candidates, including MUSICAM, from the Institute of Broadcasting Technology (IRT) and Philips, and ASPEC (Adaptive Spectral Perceptual Entropy Coding), which resulted from further enhancements to Fraunhofer IIS’s OCF together with contributions from the University of Hannover, AT&T, and Thomson. After extensive testing, MPEG proposed combining MUSICAM and ASPEC to create a family of three encoding methods: Layer 1, a low-complexity version of MUSICAM; Layer 2, the MUSICAM codec; and Layer 3 (later called MP3), based on ASPEC.

Technical development of the MPEG-1 standard was completed in December 1991. In 1994, Fraunhofer IIS introduced the world’s first MP3 encoder, l3enc, and in 1995 the Fraunhofer researchers unanimously chose “.mp3” as the file extension for MPEG Layer 3 [1]. Thanks to the compression algorithm used in the MP3 audio format, the amount of data needed to reproduce a recording at acceptable quality is reduced by a factor of 10 to 12, depending on the bit rate. Bit rate is the number of bits used per second of audio; sound quality generally improves as the bit rate increases. Common MP3 bit rates include 32 kbps (very low quality, acceptable only for voice), 96 kbps, 128 kbps (medium quality), 160 kbps, 192 kbps, 256 kbps, and 320 kbps (highest quality). The principle of the compression algorithm is as follows: during compression, the codec analyzes the signal and concentrates on the audible components, which are kept for later playback or transmission.

Sounds that the human ear cannot perceive, including those outside the range of roughly 20 to 20,000 Hz, are discarded. That is why MP3 is called lossy. There are three ways to encode MP3 files: constant bit rate (CBR), variable bit rate (VBR), and average bit rate (ABR). CBR is the default encoding mode. In this mode, the bit rate is constant for the entire file, so each part of the MP3 file uses the same number of bits. Because the encoder uses the same bit rate regardless of the complexity of the music, the quality of the final file varies: complex parts will be of lower quality than simpler ones. The main advantage of this mode is that the size of the final file can be accurately predicted.

When encoding in VBR mode, the user selects the desired quality on a scale from 9 (lowest quality, highest distortion) to 0 (highest quality, lowest distortion). The codec then tries to maintain that quality throughout the file by choosing the optimal number of bits for each part of the audio recording. The main advantage is the ability to specify the level of quality to be achieved, but a significant disadvantage is the unpredictability of the final file size. In ABR mode, the user sets a target bit rate and the encoder tries to keep the average bit rate at that value while using higher bit rates for the parts of the music that require more bits.

Size and quality of MP3 files

MP3 File

The MP3 file format is an “open format” supported by most manufacturers.

The MP3 format is one of the most common digital audio encoding formats. One feature of MP3 audio encoding is lossy encoding. However, the coding is based on a special model that takes into account the peculiarities of auditory perception. Therefore, the presence of losses does not lead to catastrophic sound degradation.

MP3 files have become a de facto standard and are compatible with the most popular operating systems, many CD and DVD players, and other devices.

Interestingly, the standard describes the actual storage format and not the way the sound is encoded. As a result, there are many different tools available to create and play MP3 audio.

Special codecs are used to encode audio in MP3 format.
An audio codec can be of two types: hardware codec and software codec.

Hardware coding is done by dedicated chips.
Software coding is done using special computer programs.

Audio quality in MP3 format (all other things being equal) depends on the compression ratio (that is, on the amount of loss) and on the encoding program. That is why brand-name players that use well-known codecs and audio signal processing systems are significantly superior in playback quality to generic devices built from off-the-shelf components.

The quality of actual playback depends on the size of the media data stream, sometimes called the stream width. The technical term for it is bit rate. The bit rate is measured in kilobits per second and is written kbs, kbps, or kb/s. A recording can be encoded in several ways, including constant bit rate and variable bit rate. Variable bit rate helps preserve detail by spending more data on the passages that need it.

Not all bit rates are suitable for high-quality music playback

MP3 digital audio format

MP3 File Format

High-quality digitized audio requires a large amount of disk space.

Attempts to reduce the size of files using standard archivers (RAR, GZIP, etc.) do not generate significant gains due to the specificity of the sound data. However, it is possible to achieve a fairly significant level of compression of the audio information using special methods based on the analysis of the data structure and subsequent compression with some loss.
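The claim about general-purpose archivers can be checked with a small experiment. The sketch below builds one second of synthetic 16-bit audio (a 440 Hz tone plus noise, both made up for this example) and compresses it with zlib, the algorithm behind GZIP. Real recordings will behave somewhat differently, but the compressed size typically stays close to the original.

```python
import math
import random
import struct
import zlib


def synth_pcm(seconds: float = 1.0, sample_rate: int = 44100, seed: int = 0) -> bytes:
    """Generate mono 16-bit PCM: a sine tone with added Gaussian noise."""
    rng = random.Random(seed)
    samples = []
    for n in range(int(seconds * sample_rate)):
        tone = 8000 * math.sin(2 * math.pi * 440 * n / sample_rate)
        noise = rng.gauss(0, 1000)
        samples.append(max(-32768, min(32767, int(tone + noise))))
    return struct.pack("<%dh" % len(samples), *samples)


pcm = synth_pcm()
packed = zlib.compress(pcm, 9)
ratio = len(packed) / len(pcm)   # share of the original size that remains
print(f"{len(pcm)} bytes -> {len(packed)} bytes ({ratio:.0%} of original)")
```

The noise-like low-order bits of each sample carry little repetition for a byte-oriented compressor to exploit, which is why audio codecs turn to perceptual, lossy methods instead.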

Digital sound processing comparable in quality to existing analog technology did not become practical until the late 1980s.

In 1988, the International Organization for Standardization (ISO) formed the MPEG (Moving Picture Experts Group) committee, whose main task is to develop standards for the encoding of moving pictures, sound, and their combination. During the ten years of its existence, the committee has developed a series of standards on this subject. As a result of extensive research in this area, several specific formats were recommended for storing data, each offering a good balance between quality and data rate.

There are currently three video storage standards: MPEG-1, MPEG-2, and MPEG-4.

Within the first two formats, there are also formats for storing audio information: Layer-1, Layer-2 and Layer-3. These three audio formats are defined for MPEG-1 and minor extensions are used in MPEG-2. The three formats are similar to each other, but use different levels of trade-off between compression and complexity.

Layer-1 is the simplest: it requires little computation to compress, but it also provides only a modest compression ratio.

Layer-3 is the most computationally demanding and provides the best compression. This format has gained immense popularity. It is often called MP3, a name that comes from the extension of the audio files stored in this format.

The underlying idea behind all lossy audio compression techniques is to neglect the subtle details of the original sound that are beyond the reach of the human ear. Here several points can be highlighted.

Noise level. Sound compression is based on a simple fact: if a person is near a loud siren, they are unlikely to hear the conversation of the people nearby. This happens not so much because the person is paying attention to the loud sound, but because the human ear actually fails to register sounds that are in the same frequency range as a louder sound. This effect is called masking, and it varies with the difference in volume and frequency between the sounds.

The second point is the division of the audio frequency band into subbands, each of which is then processed separately. The encoding program picks out the loudest sounds in each band and uses this information to determine an acceptable noise level for that band. The best encoding programs also take into account the influence of adjacent bands: a very loud sound in one band can also mask sounds in nearby bands.

Another aspect of the coding is the use of a psychoacoustic model based on the peculiarities of the human perception of sound. Compression with this model is based on removing frequencies known to be inaudible, while more carefully preserving sounds that can be easily heard by the human ear. Unfortunately, there can be no exact mathematical formulas here.

The human perception of sound is a complex process that is not fully understood, so the choice of compression methods is based on listening tests in which teams of experts compare sounds compressed in different ways. This leaves practically limitless room for improving psychoacoustic models. Most existing algorithms for encoding the human voice rely on the high predictability of the speech signal; the general-purpose MPEG compression algorithms have tried to apply this technique with varying success.

Another compression technique is the use of so-called joint stereo. The human auditory system is known to determine the direction of a sound mainly from the mid frequencies, while the highest and lowest frequencies are heard, so to speak, independently of where the source is. This means that these frequencies can be encoded as a mono signal. In addition, the compression exploits the difference in complexity between the streams in the two channels.
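One form of joint stereo, mid/side coding, can be sketched in a few lines. Instead of storing left and right separately, the encoder stores their sum (mid) and difference (side); when the two channels are similar, the side signal is close to zero and costs few bits. The function names are made up for this example, and the sketch covers only the channel transform, not the quantization or the mono folding of selected frequencies described above.

```python
import math

SCALE = 1 / math.sqrt(2)


def to_mid_side(left, right):
    """Convert left/right samples to mid (sum) and side (difference) signals."""
    mid = [(l + r) * SCALE for l, r in zip(left, right)]
    side = [(l - r) * SCALE for l, r in zip(left, right)]
    return mid, side


def from_mid_side(mid, side):
    """Recover the left/right samples from mid and side signals."""
    left = [(m + s) * SCALE for m, s in zip(mid, side)]
    right = [(m - s) * SCALE for m, s in zip(mid, side)]
    return left, right


# Nearly identical channels give a side signal that is almost silent.
left = [0.50, 0.25, -0.30, 0.10]
right = [0.50, 0.24, -0.30, 0.11]
mid, side = to_mid_side(left, right)
restored_left, restored_right = from_mid_side(mid, side)
```

The transform itself loses nothing, since the original channels can be recovered from mid and side; the saving comes from how few bits the near-silent side signal needs afterwards.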

Why mp3 is enough for you, but Lossless is not necessary

Did you graduate from a conservatory? No? Then you don’t need lossless: listen to high-quality MP3.

Very often you meet people who despise compressed formats on principle. You should not be guided by their opinion. The same purists, even in a studio, will with 90% probability fail to hear the difference between compressed and uncompressed audio.

MP3 wasn’t invented just to reduce quality. It was developed by the Fraunhofer Society, an association of applied research institutes in Germany. Later they came up with AAC, which could have become the main compressed audio format, but it did not work out that way.

Did you know that MP3 comes with variable (VBR) and constant (CBR) bit rates? With a constant bit rate, every frame is encoded with the same number of bits as the first, whatever its content. This can produce uneven quality, which means that not all sounds will be recorded in high quality.

Since MP3 has been around for a long time, it has many limitations. Bit depth is 16-24 bits. The sample rate is one of the following options, in kHz: 8, 11.025, 12, 16, 22.05, 24, 32, 44.1, or 48. The maximum bit rate does not exceed 320 kbps. The maximum number of channels is 2. But since we are talking about music, that hardly matters: multi-channel recordings are still hard to find.

Now let’s see how well MP3 encoding holds up. The illustration shows the time-frequency distribution of the same recording as an Audio CD, an OGG file, and a well-encoded MP3. The pieces on the right and left almost completely coincide, which means that the MP3 file sounds almost the same as the original CD recording.

Human hearing and its limits – psychoacoustics

The fact is that the main task of the Fraunhofer Society was the development of psychoacoustic models of human perception of sound, and there are many subtleties here. First and foremost, we are not dolphins.

Second, there are certain restrictions on the number of sounds perceived simultaneously. A person cannot hear more than 250 sounds across 24 bands at the same time (and the number of simultaneous sounds within a single band is also quite small).

Third, the audible range is 16 Hz to 20 kHz, and by the age of 60 it shrinks by almost half. And that is the ideal case, with training (yes, hearing has to be trained!).

All frequencies below 100 Hz are perceived not by the hearing cells but by the skin. The low-frequency waves are then reflected into the ear canal, where they are perceived as infrabass. (This belongs to the area of bone conduction.)
Also, the number of cells that register acoustic waves differs from person to person. What is more, even within one individual the number differs between the right and left ear.

By the way, each ear perceives sound differently. Swap the channels of your favorite song and you get a new sound.

If you dig deeper, it turns out that each sound frequency is perceived only above a certain volume. Once that volume is reached, silence gives way abruptly to a clearly audible sound. After that, a person can follow a sound of this frequency at a lower volume.
