mp3 enc Archives - mp4gain.com

mp3 audio format, the most popular

Free Download Mp4Gain

mp3 audio format, the most popular

With the rapid development of file compression technology, MP3 has become the most popular music format today.

mp3 audio format, the most popular

MP3 File Format Analysis MP3 file data is made up of multiple frames, and the frame is the smallest unit of the MP3 file. Each frame consists of a frame header, additional information, and sound data. The playback time of each frame is 0.026 seconds, and its duration varies with the bit rate. Some MP3 files have extra bytes at the end to store description information for non-audio data. The structure of the MP3 file is shown in Figure 2. 3.1 Frame header format The frame header is 4 bytes long. For fixed bitrate MP3 files, the frame header format of all frames is the same. The data structure is as follows: typedef FrameHeader{ unsigned int sync:11;//Sync information unsigned int version:2 ;//version unsigned int layer:2;//layer unsigned int protection:1;//CRC check unsigned int bitrate:4;//unsigned bitrate int frequency:2;//unsigned frequency int padding:1;//unsigned frame length setting int private:1;//unsigned reserved word int mode:2; //unsigned channel mode int mode extension:2;//unsigned extended mode int copyright:1;//unsigned copyright int original:1 ;//unsigned original logo int emphasis:2;//emphasis mode }HEADER, *LPHEADER; See Table 1 for a description of the 4 byte frame header. Table 1 Explanation of the use of MP3 frame header bytes Name Length (bits) Description Synchronization information 11 All bits in the 1st and 2nd byte are 1, and the 1st byte is always FF. Version 200-MPEG 2. 5 01-undefined 10-MPEG 2 11-MPEG 1 layer 2 00-undefined 01-Layer 3 10-Layer 2 11-Layer 1 CRC check 1 0-check 1-no check Bit rate 4 The third bit Tuple sampling rate, the unit is kbps, such as MPEG-1 Layer 3, 64 kbps, the value is 0101. Frequency 2 Sampling frequency, for MPEG-1: 00-44.1 kHz 01-48 kHz 10 -32 kHz 11-setting frame length undefined 1 is used to set the length of the file header, 0-no setting, 1-setting, the specific setting calculation method see below. Reserved word 1 is not used. Channel Mode 2 The fourth byte indicates the channel, 00-Stereo 01-Joint Stereo 10-Dual Channel 11-Mono Expansion Mode 2 Only used when the channel mode is 01. Copyright 1 Whether the file is legal or not, 0-Illegal 1-Original logo legal 1 If original, 0-Not original 1-Original emphasis method 2 Used for classification of sound compensation after noise reduction and compression, which is rarely used and is it may not work in the future. 00-Undefined 01-50/15ms 10-Reserved 11-CCITT J.17 MP3 frame length depends on bit rate and frequency, the calculation formula is: frame length = 144×bit rate∕ frequency+padding For example: bit rate is 64kbps, frequency is 44.1kHz, when padding is 1, frame length is 210 bytes. After the table header there is additional information of variable length. For standard MP3 files, their length is 32 bytes, followed by compressed audio data, which will be decoded when the decoder reads here. For Constant Bit Rate (CBR) MP3 files, not all frames are the same length, and some frames may be one or more bytes longer. There is also Variable Bitrate (VBR) MP3, to minimize the length of MP3 file and ensure sound quality, compared to CBR file, except for the first frame, the rest is the same. The first frame of VBR does not contain audio data and its length is 156 bytes, which is used to store information such as standard audio frame header (4 bytes), VBR file identifier, frame number, number file byte, etc. See table 2 for the description of the structure. Table 2 Description of the first byte of the frame structure of the VBR 1-4 file The same standard sound frame header as CBR 5-40 Store the logo of the VBR file “Xing” (58 69 6E 67), the specific position of this logo depends on the adopted MPEG standard and the sound depends on the channel mode.

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

mp3 audio format, the most popular

With the rapid development of file compression technology, MP3 has become the most popular music format today.

The encoder transforms the original sound into the frequency domain through a hybrid filter bank. Using a psychoacoustic model, it is estimated that it may be sufficient to be The perceived noise level is then quantized and converted to Huffman coding to form an MP3 bitstream. The decoder is much simpler and its task is to extract the sound signal from the encoded spectral line components through inverse quantization and inverse transformation.
2.4 Modified Discrete Cosine Transform Modified Discrete Cosine Transform (MDCT) refers to converting a set of time-domain data to frequency-domain data for time-domain variation. MDCT is an enhancement of the DCT algorithm. The first fast algorithm is the Fast Fourier Transform (FFT), but FFT has operations on complex numbers and MDCT are all operations on real numbers, which is convenient for programming. When compressing audio data, first divide the original audio data into fixed blocks, and then perform forward MDCT (Forward MDCT) to convert the value of each block into MDCT 512 coefficients. When decompressing, the reverse MDCT (Reverse MDCT) The 512 coefficients are restored to the original sound data, and the original sound data before and after are inconsistent, because redundant and irrelevant data are removed during the compression process. The FMDCT transformation formula is: k=0, 1,…, N/2-1 where N is the length of the transformation window, that is, the number of sample points per block, N=8, 16 ,… ., 1024, 2048. n0=(N/2+1)/2, X(n) is the value in the time domain, X(k) is the value in the frequency domain. If N takes 1024 points, it will become 512 frequency domain values. The IMDCT transformation formula is: 4 Modified Discrete Cosine Transform Modified Discrete Cosine Transform (MDCT) refers to converting a set of time-domain data to frequency-domain data to learn the changes in the domain. weather. MDCT is an enhancement of the DCT algorithm. The first fast algorithm is the Fast Fourier Transform (FFT), but FFT has operations on complex numbers and MDCT are all operations on real numbers, which is convenient for programming. When compressing audio data, first divide the original audio data into fixed blocks, and then perform forward MDCT (Forward MDCT) to convert the value of each block into MDCT 512 coefficients. When decompressing, the reverse MDCT (Reverse MDCT) The 512 coefficients are restored to the original sound data, and the original sound data before and after are inconsistent, because redundant and irrelevant data are removed during the compression process. The FMDCT transformation formula is: k=0, 1,…, N/2-1 where N is the length of the transformation window, that is, the number of sample points per block, N=8, 16 ,… ., 1024, 2048. n0=(N/2+1)/2, X(n) is the value in the time domain, X(k) is the value in the frequency domain. If N takes 1024 points, it will become 512 frequency domain values. The IMDCT transformation formula is: 4 Modified Discrete Cosine Transform Modified Discrete Cosine Transform (MDCT) refers to converting a set of time-domain data to frequency-domain data to learn the changes in the domain. weather. MDCT is an enhancement of the DCT algorithm. The first fast algorithm is the Fast Fourier Transform (FFT), but FFT has operations on complex numbers and MDCT are all operations on real numbers, which is convenient for programming. When compressing audio data, first divide the original audio data into fixed blocks, and then perform forward MDCT (Forward MDCT) to convert the value of each block into MDCT 512 coefficients. When decompressing, the reverse MDCT (Reverse MDCT) The 512 coefficients are restored to the original sound data, and the original sound data before and after are inconsistent, because redundant and irrelevant data are removed during the compression process. The FMDCT transformation formula is: k=0, 1,…, N/2-1 where N is the length of the transformation window, that is, the number of sample points per block, N=8, 16 ,… ., 1024, 2048. n0=(N/2+1)/2, X(n) is the value in the time domain, X(k) is the value in the frequency domain.

mp3 audio format, the most popular

With the rapid development of file compression technology, MP3 has become the most popular music format today.

High-quality music quickly spreads to all parts of the world with the arrangement of 0 and 1, shaking people’s hearts. What is MP3? The full name of MP3 is MPEG Audio Layer 3. It is an efficient computer audio coding scheme. It converts audio files into smaller files with .MP3 extension with a higher compression ratio and basically maintains the sound quality of the file. original. MP3 is part of the ISO/MPEG standard. The ISO/MPEG standard describes audio compression using a high-performance perceptual coding scheme. This standard has been continuously updated to meet the pursuit of “high quality and small quantity”, and now has formed MPEG Layer 1, Layer 2. Layer 3 three audio encoding and decoding schemes. The compression rate of MPEG Layer 3 can reach from 1:10 to 1:12. A 1M MP3 file can be played for 1 minute, while a 1 minute CD-quality WAV file (44100Hz, 16bit, 2ch, 60sec) occupies 10M of space, so Calculated, the time The playback time of a 650M MP3 disc should be more than 10 hours, while the playback time of a CD with the same capacity is about 70 minutes. The advantages of MP3 are unmatched by CD. 2 Analysis of the principle of MP3 2.1 MPEG audio standard MPEG (Moving Picture Experts Group) is a moving picture expert group under ISO, and the MPEG standard formulated by it is widely used in various multimedia. MPEG standards include video and audio standards, among which MPEG-1, MPEG-2, MPEG-2 AAC, and MPEG-4 audio standards have been developed. The MPEG-1 and MPEG-2 standards use the same family of audio codecs: Layer 1, 2 and 3. A new feature of MPEG-2 is the use of low sample rate expansion kits to reduce data traffic , and another feature is the multi-channel expansion kit, which increases the number of main channels to five. Fraunhofer IIS and AT&T released the MPEG-2 AAC (MPEG-2 Advanced Audio Coding) standard in 1997 to significantly reduce data traffic. The MDCT (Modified Discrete Cosine Transform) algorithm adopted by MPEG-2 AAC, The sampling frequency can be between 8 KHz and 96 KHz, and the number of channels can be between 1 and 48. MPEG Audio Layer 1, 2 and 3 use the same filter bank, bitstream structure, and header information, and the sample rate is either 32 KHz, 44.1 KHz, or 48 KHz. Layer 1 is designed for DCC (digital compact cassette) digital compression tape, the data rate is 384 kbps, and layer 2 has made a compromise between complexity and performance, and the data rate has been reduced to 256 kbps- 192kbps. Layer 3 was designed for low data rate from the beginning, and the data rate is 128Kbps-112Kbps. Layer 3 adds MDCT transform, which makes its frequency resolution 18 times higher than that of Layer 2. Layer 3 also uses information averaging similar to MPEG video entropy coding to reduce redundant information. The vast majority of MP3 uses the MPEG-1 standard. 2.2 The purpose of audio compression The MP3 format began in the mid-1980s, and the Fraunhofer Institute in Erlangen, Germany, was committed to high-quality, low-data-rate audio coding. Let’s look at an example: You want to sample a song you like that is about 4 minutes long, store it on a disc, and sample it in CD-quality WAV format at a sample rate of 44.1 kHz, which means receiving 44100 per second. , stereo, each sample data is 16 bits (2 bytes), so the space occupied by this song is: 44100×2 channels x2 bytes x60 seconds x4 minutes=40.4MB If you download this song from the Internet, assume the transmission speed is of 56kbps, the download time is: 40.4x106x8/56x103x60=96 minutes. Even a 1M broadband network takes more than 5 minutes. It can be seen that audio compression is especially important to reduce the storage space of audio data. 2.3 MP3 encoding and decoding MP3 audio compression involves encoding and decoding in two parts. Encoding is turning the data in a WAV file into a highly compressed bitstream, and decoding is taking the bitstream and reconstructing it into a WAV file. MP3 uses a distortion algorithm called Perceptual Audio Coding. The frequency range of sound perceived by the human ear is from 20 Hz to 20 kHz. MP3 cuts out a lot of redundant and irrelevant signals.