
MP3 File Structure Analysis Part 2

Sounds in nature are complex, and so are their waveforms.
Audio is usually digitized with pulse code modulation (PCM). PCM converts a continuously varying analog signal into digital codes in three steps: sampling, quantizing, and encoding.
Decoding: the reverse of the encoding process.
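The three PCM steps, and the reverse decoding step, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names `pcm_encode` and `pcm_decode`, the 16-bit sample width, and the little-endian byte order are choices made here and are not taken from the text above.

```python
import math
import struct

MAX_CODE = 2 ** 15 - 1  # largest 16-bit signed value (assumed sample width)


def pcm_encode(signal, sample_rate, num_samples):
    """Turn a continuous signal (a function of time in seconds) into 16-bit PCM bytes."""
    codes = []
    for i in range(num_samples):
        t = i / sample_rate                      # 1. sampling: pick discrete instants
        x = max(-1.0, min(1.0, signal(t)))       #    clip to the representable range
        codes.append(round(x * MAX_CODE))        # 2. quantizing: map to an integer level
    return struct.pack("<%dh" % num_samples, *codes)  # 3. encoding: write binary codes


def pcm_decode(data):
    """Reverse the encoding: bytes back to amplitudes in [-1.0, 1.0]."""
    count = len(data) // 2
    return [code / MAX_CODE for code in struct.unpack("<%dh" % count, data)]
```

For example, encoding 80 samples of a 440 Hz sine tone at an 8 kHz sampling rate yields 160 bytes, and decoding them returns values that differ from the original samples only by the quantization error.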
1.1.2 Brief Introduction to MP3
The full name of MP3 is MPEG Audio Layer 3. It is an efficient computer audio coding scheme that converts audio into smaller files with the .mp3 extension at a high compression ratio while largely preserving the sound quality of the source file. MP3 is part of the ISO/MPEG standard.
The ISO/MPEG standard describes audio compression using a high-performance perceptual coding scheme. The standard has been updated continuously in pursuit of "high quality and small size", producing three audio codec schemes, MPEG Layer 1, Layer 2, and Layer 3, which correspond to the three audio file types MP1, MP2, and MP3.
MPEG (Moving Picture Experts Group) is an expert group on moving pictures under ISO. The MPEG standards it defines are widely used across multimedia and cover both video and audio. The audio standards developed so far include MPEG-1, MPEG-2, MPEG-2 AAC, and MPEG-4. MPEG-1 and MPEG-2 use the same family of Layer 1, 2, and 3 audio codecs, and most MP3 files use the MPEG-1 standard.
MP3 audio compression consists of two parts: encoding and decoding. Encoding converts the original signal into a bit stream, and decoding is the reverse process. MP3 uses perceptual audio coding, which is a lossy algorithm. The human ear perceives sound from roughly 20 Hz to 20 kHz, and MP3 discards a large amount of redundant and perceptually irrelevant signal content. The encoder transforms the original sound into the frequency domain through a hybrid filter bank and uses a psychoacoustic model to estimate the noise level that is only just perceptible. The spectral values are then quantized and Huffman coded to form the MP3 bit stream. The decoder is much simpler: its task is to recover the sound signal from the encoded spectral line components through inverse quantization and an inverse transform.
MP3 file data consists of multiple frames, and a frame is the smallest unit of an MP3 file. Each frame, in turn, consists of a frame header, side information, and sound data. Each frame plays for about 0.026 seconds (1152 samples at a 44.1 kHz sampling rate), while its length in bytes varies with the bit rate. Some MP3 files also have extra bytes at the end that hold non-audio descriptive information, such as a tag with the title and artist.
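The relationship between frame duration, bit rate, and frame length can be checked with a little arithmetic. The sketch below assumes MPEG-1 Layer III, where a frame carries 1152 samples, and uses the commonly cited frame-length formula `144 * bitrate / sample_rate + padding`. The function names are illustrative and do not come from the text above.

```python
SAMPLES_PER_FRAME = 1152  # samples in one MPEG-1 Layer III frame (assumption stated above)


def frame_duration(sample_rate_hz):
    """Playback time of one frame in seconds; depends only on the sampling rate."""
    return SAMPLES_PER_FRAME / sample_rate_hz


def frame_length_bytes(bitrate_bps, sample_rate_hz, padding=0):
    """Size of one frame in bytes; grows with the bit rate.

    padding is 0 or 1, taken from the padding bit in the frame header.
    """
    return 144 * bitrate_bps // sample_rate_hz + padding


if __name__ == "__main__":
    for kbps in (64, 128, 320):
        size = frame_length_bytes(kbps * 1000, 44100)
        print(f"{kbps} kbps: {size} bytes per frame, "
              f"{frame_duration(44100):.4f} s per frame")
```

At 44.1 kHz the duration stays at about 0.026 seconds regardless of bit rate, while the frame length goes from 417 bytes at 128 kbps to 1044 bytes at 320 kbps (one more byte when the padding bit is set).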



