What’s behind the MP3 Audio Format?

Free Download Mp4Gain

What’s behind the MP3 Audio Format?

When most people hear the word MP3, they usually think of songs, podcasts, and other compressed audio files. While it’s worth acknowledging the role these uncompressed files have played in the world of music, the goal of this guide is to explain in detail what’s behind these files, how they work, and what makes them so popular. Through this understanding guide, we hope to cover the core concepts behind the MP3 audio format, such as bitrate and samplerate, as well as offer some tips and tricks to ensure you’re getting the best audio quality from your MP3 files.

What is MP3 Format?

MP3 is a digital audio format used to compress audio files without losing quality. This is made possible by an audio compression algorithm called MPEG-1 Audio Layer 3, also known as MP3. Compression technology involves reducing the amount of data without losing the fundamental attributes of the original audio. Compressed data can be saved as a higher quality audio file in a much smaller size. This means MP3 files are easier to stream and share online.

MP3 files can be compressed at different bit rates depending on the user. Bitrate is generally in kilobits per second. For example, a 128 kbps (kilobits per second) MP3 file uses 128,000 bits to encode the audio every second. While bitrate is an important factor in determining the quality of an audio file, there are other factors as well, such as samplerate. The samplerate is the number of audio samples taken every second. An audio file recorded at a sample rate of 44.1 kHz (kilohertz) means that 44,100 audio samples were taken every second. The higher the samplerate, the better the audio quality.

The magic behind the MP3 format lies in its ability to shed unnecessary data without compromising audio quality. This is accomplished by removing inaudible components from the audio. These inaudible components are called high and low frequencies. MP3 is a lossy audio compression codec, which means that deleted data cannot be recovered. This is why an MP3 file encoded at a small size cannot recover the audio quality of a file encoded at a larger size. MP3 is an extremely popular audio format, as it allows you to compress audio files without losing quality.

How You Can Improve the Quality of MP3 Audio Files

How can you improve the quality of audio files in MP3 format? The answer to this is to use an audio conversion program like MP3gain to adjust the volume of your audio files. MP3Gain is a free and open source tool that you can use to normalize the volume of your audio and video files. This tool is not only useful for improving audio quality, but also for saving space on your hard drive, as MP3 files encoded at lower sample rate and bitrate are smaller in size.

Of course, there is a downside to MP3 audio compression. As with any type of compression, there is a chance that the audio may become distorted or lose quality. While MP3 files encoded at a small size will have lower audio quality than those encoded at a larger size, if the proper bitrate and samplerate are selected, the audio will not be excessively distorted. The key is to find the balance between file size and sound quality.

Conclusion

We hope this guide has provided you with a clear and simple explanation of the concepts behind the MP3 audio format. While this article has mainly focused on the basics and technology behind MP3 audio files, we hope we’ve also provided some helpful tips on how to get the best audio quality out of your MP3 files. Finally, it is also important to mention the importance of using an audio conversion program like MP4Gain to normalize the volume of all audio and video files.

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

The bitrate and its relationship with the audio quality in an MP3

The bitrate is a measure of the amount of audio information that is encoded per second in a compressed audio file, such as an MP3. Bit rate is measured in kilobits per second (kbps).

The higher the bitrate, the higher the audio quality. However, a larger file size will also be required to store the same amount of audio time. Therefore, it is important to choose a suitable bitrate to balance quality and file size.

For music files, a bitrate of at least 128 kbps is recommended for decent sound quality. However, if you want higher sound quality, you can go for a higher bitrate, such as 256 kbps or even 320 kbps.

For voice audio files, a bit rate of 64 kbps is sufficient for clear sound quality. However, if you want higher sound quality, you can go for a higher bitrate, such as 96 kbps or 128 kbps.

In short, bitrate is an important factor in the audio quality of an MP3 file. It is important to choose a suitable bitrate to balance quality and file size.

Also, it’s important to note that bitrate isn’t the only factor that affects the audio quality of an MP3. Other important factors include the sample rate and the number of channels. The sample rate refers to the number of times the sound is measured per second, while the number of channels refers to the number of audio channels in the file.

For example, an audio file with a bit rate of 128 kbps and a sample rate of 44.1 kHz and 2 audio channels will have higher sound quality than a file with the same bit rate but a sample rate of 22 kHz and 1 audio channel.

In conclusion, if you want to get the best audio quality from an MP3 file, it’s important not only to choose a suitable bitrate, but also to consider the sample rate and number of channels. It is advisable to choose an optimal combination of these factors to obtain the best sound quality.

In addition, it is important to mention that there are other audio formats, such as WAV, FLAC, AIFF, which, unlike MP3, are not compressed, which means that they do not lose audio quality to the compression process. However, these formats often have much larger file sizes than compressed formats like MP3.

So, if you want the best audio quality, it’s recommended to use uncompressed formats like WAV or FLAC, but it’s also important to consider storage space and compatibility with different devices and audio players. In case of opting for compressed formats, it is important to choose an appropriate bitrate and take into account other factors such as the sampling frequency and the number of channels.

In summary, bitrate is an important factor in the audio quality of an MP3 file, but it is not the only factor to consider. It is important to choose a suitable bitrate, as well as take into account the sample rate and the number of channels to obtain the best sound quality. In addition, there are other uncompressed audio formats that offer higher sound quality, but also have a larger file size.

Why are MP3 bitrates often multiples of 32? (power of 2) part 2

Depending on the resource, VBR can be encoded by changing the bitrate between a fixed rate above each frame, or by sharing the available bits in adjacent frames (effectively producing a non-standard bitrate for the two frames combined).

the fixed frame depends on the sampling rate, 1152 samples per frame. There is no limit to the size of the frame itself, nor to the base 2 size of the frame (ie 417 bytes for a 128 kbit/s MP3 sampled at 44.1 kHz).

In the end, a file encoded at 126kbps will sound worse than a file encoded at 128kbps, and similarly a file encoded at 131kbps will sound better. However, MP3s are encoded according to the compression psychoacoustic model of a specific encoder. The amount by which a file sounds “better” or “worse” at a given bitrate largely depends on the algorithm used to implement the model, but in general higher bitrates allow for more data, presumably for rebuild a more accurate original transmission. audio signal

Why are MP3 bitrates often multiples of 32? (power of 2)

Some people say:

I understand why multiples of 2 often show up on computers since they are binary, but I can’t figure out how the most common mp3 bitrates (64kbps, 128kbps, 160kbps, 192kbps, 256kbps, 320kbps, etc.) also tend to follow this rule.

Since MP3 is just a sequential encoding of sound waves, why is it important to represent each second in kilobits divisible by 2?
Does a music player like iTunes continue to read the file and play the encoded sound regardless of the second limit, or does it read the file every second?
In the latter case, reading a 256kbps file requires reading slightly fewer memory pages than a 257kbps file, but the player can always read 256KB chunks, regardless of their bitrate, and just process them automatically. incremental, right, Bar?
Are 128kbps MP3 songs popular simply because it’s a generally accepted bitrate, or do they really have any advantages over 126kbps and 131kbps files, apart from a very slight difference in quality/file size?

For constant bit rate (CBR) encoding, the MPEG-1 Audio Layer III standard specifies standard bit rates of 32, 40, 48, 56, 64, 80, 96, 112, 128, 160, 192, 224, 256 and 320 kbit/second. There are a few others defined in the MPEG-2 standard, but they are also multiples of 2 (actually all multiples of 8 in the range 8 to 160 – see the table called “Bitrate Index” in the link above) .

Technically, there is nothing that limits the MP3 bitrate to a multiple of 2, since variable bitrate encoding can be used, or a custom bitrate can be achieved using some flags not used in the MPEG specification ( although this must be implemented manually). . In order for MP3 to be MPEG-compliant, and therefore compatible with most MP3 decoders, it must have a bitrate defined by the specification, so all CBR-encoded MP3 files have a bitrate of two.

Audio bit rate, bit rate

Bit rate refers to the number of bits (bits) transmitted per second. The unit is bps (Bit per second) The higher the bit rate, the faster the data transmission speed.

Bitrate in sound refers to the amount of binary data per unit of time after converting an analog sound signal to a digital sound signal, which is an indirect measure of audio quality.

Bitrate refers to the sampling rate at which digital sound is converted from analog to digital format. The higher the sampling rate, the better the quality of the restored sound. As a benchmark for the efficiency of digital music compression, bit rate indicates the rate of the number of bits bps (bit per second, bits per second) transmitted per unit of time (1 second). Kbps (in layman’s terms is 1000 bits per second) is usually used as the unit. The bit rate of digital music on CD is 1411.2 kbps (that is, to burn 1 second of CD music, 1411.2 × 1024 data bits are required), the high BIT RATE of the digital music file music means that it should be processed in a unit of time (1 second) The amount of data (BIT) is large, which means that the sound quality of the music file is good. However, when the BITRATE is high, the file size increases, which will take up a lot of memory capacity. The most commonly used bitrate for music files is 128 kbps, and MP3 files can generally use 8 to 320 kbps. In the same way, most of them are 32-256 Kbps. Of course, the wider the rate, the better, but 320 Kbps is the highest level at the moment.

Bitrate calculation formula
The basic algorithm is: [Bit rate] (kbps)=[file size] (bytes) X8/[time] (seconds)/1000

Special algorithm for audio files: [bit rate] (kbps) = [quantization sample point] (kHz) × [bit depth] (bit/sampling point) × [number of channels] (typically 2)

For example, the D5 drive has a capacity of 4.3G, which takes into account different audio formats, so it is calculated as 600M (so the remaining capacity is 4.3*1000-600=3700M), so the video file should not be larger than 3.7G, in this example, take The capacity of the video file is 3.446G, and the length of the video is 100 minutes (6000 seconds). The calculation result: the bit rate is approximately equal to 4933kbps.

What is a good bitrate guide for mp3 files? Part 2

To produce high-quality MP3 files of classical and jazz music, the optimal bitrate depends on the characteristics of the song.

Smooth jazz can usually be copied at 192kbps to create a good balance between file size and diminishing returns, although 256kbps may sound better in a home entertainment center. A classical orchestra should be 256kbps for a portable player, but if you want to burn a CD at home or in your car, a 320kbps file might be a better option.

For saturated music such as hard rock, metal, arena, pop, electronic and house music, 320 kbps will provide the best results. The higher the number of bits per second, the more complex acoustic envelope will be preserved.

If possible, it’s best to create MP3 files with variable bit rates. This allows the encoding program to determine if a particular frame of music requires the full bit rate. Otherwise, the program will reduce data retention for that frame, resulting in a smaller file without sacrificing quality. Forcing the program to “oversample” frames can produce artifacts.

While this article is intended as a general guide, he or she may be equally satisfied with a lower bitrate for a particular song or songs in general. Many factors affect our ability to judge the quality of music, not only the devices we use but also our activities while listening to it. For example, for those who listen to MP3 files while exercising or taking a walk, external noise can make it more difficult to tell the difference in quality. Conversely, audiophiles may prefer to sample at 320kbps, regardless of their equipment, type of music, or listening habits.

If you create your own MP3 files, there are other settings that affect quality. LAME is an excellent MP3 encoder that is free and has many graphical interfaces as the interface for this popular command line program. LAME allows users to adjust many settings to generate high-quality MP3 files in seconds. You can also experiment with various bitrates in your source file to find the best subjective balance between quality and file size.

What is a good bitrate guide for mp3 files?

(a good bitrate guideline for mp3 files?)

MP3 files are compressed audio files created from audio formats such as wave (.wav). Wave files replicate analog recordings and digital sound files at the expense of large file size, while MP3 files sacrifice some quality for a smaller footprint. There are several factors that mitigate the quality sacrifice during the conversion process. With the correct bitrate and settings, MP3 files can provide very high quality results, making them very close to the original wave files when played on portable audio players.

An mp3 player.

The balance between file size and quality is somewhat subjective. For audiophiles, any difference is noticeable. Others may simply not be able to tell the difference between a high quality MP3 file and a raw wave source. In many cases, the nuances of the sound environment will only become clearer when played through a high-quality stereo system.

MP3s are compressed digital music files that sacrifice quality for file size.
MP3 files are primarily targeted at portable audio players. In this field, high-quality MP3 files are played with incredible sound due to their small file size. With the limited memory of portable players, it makes sense that one would want MP3 files to be as small as possible while maintaining the highest possible quality.

For this, one of the most important factors when creating MP3 files is the bit rate. In general, the more bits per second that are preserved from the original file, the higher the quality of the MP3 and the larger the file size. Lower bit rates reduce size and quality. The idea is to use the bitrate for maximum realism without saving unnecessary data, which just creates larger files with no noticeable difference to the ear.

For voice recordings such as lectures or language lessons saved to waveforms, a bit rate of 32 kilobits per second (kbps) is acceptable, although 64 kbps may offer better quality, depending on the source. At 32 kbps, the sound may sound “flat”, but that’s understandable. A 64 kbps MP3 file created from a voice recording should sound nearly identical to the original.

Desaturated acoustic music with simple arrangements should work fine at 192kbps bitrate. You can choose 256 kbps if the music will be played on a high quality device. Music that falls into this category includes folk, boy band songs, easy listening, and folk music. There are also works by many classic artists such as James Taylor, Linda Longstadt, Jonny Mitchell, and Simon Garfunkel.

Audio Intro Part 3

WAV

structure
file header
The WAV format follows the RIFF Resource Interchange File Format, so the WAV format is actually a three-layer relationship, which is simplified here. Its file header format is as follows:

Address Carving type content
00H-03H 4 character * 4 RIFF resource file exchange flag
04H-07H 4 unsigned int The number of bytes from the next address to the end of the file.
08H-0BH 4 character * 4 WAV file WAVE logo
0CH-0FH 4 character * 4 fmt wave file flag, the last digit is 0x20 space
10H-13H 4 unsigned int The size of the subchunk file header. For the WAV subfragment, the value is 0x10.
14H-15H 2 short unsigned Format type, when the value is 1, it means the data is linear PCM encoding
16H-17H 2 short unsigned number of channels
18H-1BH 4 int unsigned Sampling rate
1CH-1FH 4 int unsigned Wave file bytes per second = sample rate Bit depth PCM / 8 channels
20H-21H 2 short unsigned DATA data block unit length = number of channels * PCM bit depth / 8
22H-23H 2 short unsigned Bit depth PCM
24H-27H 4 character * 4 data stamp data
28H-2BH 4 unsigned int Total length of data part (bytes)
struct WAVHeader
{ char RIFF[ 4 ]; ///Resource file exchange flag RIFF unsigned LEN; ///Number of bytes from the next address to the end of the file char WAV[ 4 ]; ///WAV file flag WAVE char FMT [ 4 ]; ///Wave fmt file pointer, last digit is 0x20 space unsigned SubchunkSize; ///The size of the sub-chunk file header, for WAV this sub-chunk, the value is 0x10 DATATYPE short unsigned; / //Format type, when the value is 1, it means the data is unsigned linear PCM encoding short CH ; ///Number of unsigned channels F; ///Unsigned sample rate BYTERATE; ///Number of bytes per second of wave file = sample rate*PCM bit depth/8*Number of unsigned channels

short DATAUNITLEN; ///DATA block unit length=channel number*Bit depth PCM/8 unsigned short BITDEPTH; ///Bit depth character PCM DATA[ 4 ]; ///Data flag data unsigned DATALEN ; ///Data partial total length (bytes) };

data organization
After the file header is the data part of the WAV file. Its data organization is: the left channel value of the first sample point, the right channel value of the first sample point, …, the left channel value of the last sample point, the right channel value of the last sample point value. Each value has a bit depth of bits.

Generate a simple wav
First complete the Wav header.

WAVHeader getHeader ( int number )
{
WAV Header res; memcpy (res.RIFF, “RIFF” , sizeof (res.RIFF)); memcpy (res.WAV, “WAVE” , sizeof (res.WAV)); memcpy (res.FMT, “fmt ” , size of ( res.FMT )); res.SubchunkSize= 0x10 ; res.DATATYPE= 1 ; res.CH= 2 ; res.F=F; res.BITDEPTH=DEPTH; res.BYTERATE=res.F*res.BITDEPTH/ 8 *res.CH; res.DATAUNITLEN=res.CH*res.BITDEPTH/ 8 ; memcpy(res.DATA, “data”

, size of ( res.DATA ));
res.DATALEN=num*res.DATAUNITLEN;
res.LEN=res.DATALEN+ 44 -8 ; returnres; }

First, define the key name – frequency comparison table.

const double keyf[]=
{ 27.5 , 29.1352 , 30.8677 , 32.7032 , 34.6478 , 36.7081 , 38.8909 , 41.2034 , 43.6535 , 46.2493 , 48.9994 , 51.9131 , 55 , 58.2705 , 61.7354 , 65.4064 , 69.2957 , 73.4162 , 77.7817 , 82.4069 , 87.3071 , 92.4986 , 97.9989 ,

103.826 , 110 , 116.541 , 123.471 , 130.813 , 138.591 , 146.832 , 155.563 , 164.814 , 174.614 , 184.997 , 195.998 , 207.652 , 220 , 233.082 , 246.942 , 261.626 , 277.183 , 293.665 , 311.127 , 329.628 , 349.228 , 369.994 , 391.995 , 415.305 , 440

, 466.164 , 493.883 , 523.251 , 554.365 , 587.33 , 622.254 , 659.255 , 698.456 , 739.989 , 783.991 , 830.609 , 880 , 932.328 , 987.767 , 1046.5 , 1108.73 , 1174.66 , 1244.51 , 1318.51 , 1396.91 , 1479.98 , 1567.98 , 1661.22 , 1760 , 1864.66 ,

1975.53 , 2093 , 2217.46 , 2349.32 , 2489.02 , 2637.02 , 2793.83 , 2959.96 , 3135.96 , 3322.44 , 3729.31 , 3951.07 , 4186.01 } ___ { “A-0” , “A#0” , “B- 0” , “C-1” , “C#1” , “D-1” , “D#1” , “E-1” , “F-1”, “F#1”

, “G-1” , “G#1” , “A-1” , “A#1” , “B-1” , “C-2” , “C#2” , “D-2″ , ” D#2″ , “E-2” , “F-2” , “F#2” , “G-2” , “G#2” , “A-2” , “A#2” , “B- 2” , “C-3” , “C#3” , “D-3” , “D#3” , “E-3” , “F-3” , “F#3” , “G-3” , “sun#3” ,

Audio Intro Part 2

A wav is 44100 Hz 16-bit stereo or 22050 Hz 8-bit mono, what does that mean? stereo/mono refers to dual/mono.

For monophonic sound files, the sample data is an eight-bit short integer (short int 00H-FFH); for two-channel stereo sound files, each sample data is a 16-bit integer (int) and the upper eight bits (left channel) and lower eight bits (right channel) represent the two channels, respectively.

Sound is a mechanical wave, produced by the vibration of an object, and requires a medium to propagate. So, in essence, a sound is a waveform on an axis over time.

Sound has three elements: pitch, volume, and timbre:

Pitch is determined by the frequency of the sound wave, the higher the frequency, the higher the pitch.
The volume is determined by the amplitude of the sound wave, the larger the amplitude, the louder the sound.
The timbre is determined by the “shape” of the waveform (sounds like square, triangle, and sawtooth are called impulse waves and sound individual).
An audio file is a file obtained by converting an analog signal to a digital signal. In general, there are five important parameters: encoding method, number of channels, sampling rate, bit depth, and bit rate.

Encoding: how this format organizes binary data and how it is compressed.
Number of channels: mono, dual or 5.1 channels, etc.
Sampling rate: The number of samples per second.
Bit Depth: The number of binary bits used to store the y value of the sample point.
Bitrate – The desired number of bits per second for the file.
We know that there is no compression in the WAV format, so its encoding method is to directly write all the sampled points to the file in order.

WAV file size (B) = number of channels * sample rate (Hz) * bit depth (bit) / 8 + file header size (B, it’s 44B)

Implementation
When you open an mp3 or wav file with a text editor, you see numbers like this:

4944 3303 0000 0000 3d48 5459 4552 0000
0006 0000 0032 3031 3800 5444 4154 0000
0006 0000 0032 3230 3300 5449 4d45 0000
0006 0000 0031 3430 3600 5052 4956 0000
168e 0000 584d 5000 3c3f 7870 6163 6b65
7420 6265 6769 6e3d 22ef bbbf 2220 6964
3d22 5735 4D30 4D70 4365 6869 487A 7265
537A 4E54 637A 6B63 3964 223F 3E0A 3A78
6D70 6D65 7461 2078 6D6C 6E78 3D22
6F62 653A 6574 612F
5249 4646 2e3d 0e05 5741 5645 666d 7420
1200 0000 0300 0200 44ac 0000 2062 0500
0800 2000 0000 6461 7461 a026 0e05 8089
00bc 00e8 f0bb c09e 8dbc 00c2 87bc 80f1
d3bc 8063 ccbc c030 fcbc 8012 f4bc 20bb
13bd e051 0fbd c0b0 2dbd 6079 28bd 4012
46bd 6032 40bd c0e3 5dbd 6040 57bd c015
7cbd e035 74bd b058 8dbd 50e2 88bd f0a7 9dbd e0dd 98bd 70d3 acbd e0a9 a7bd
d043 b8bd b0da b2bd
00e3 c4bd 605c bfbd

This one above is the mp3/wav format of the same song. What is the difference between them?

Audio intro

An mp3 is 320kbps, 44100hz, what does this mean?

44100Hz represents the sample rate of the signal. The so-called sampling consists of obtaining the value y of the sound wave at the current moment every unit of time. Sampling is the process of discretizing continuous data (converting an analog signal to a digital signal).
image source

The sampling method mentioned above is called PCM (Pulse Code Modulation). According to the Nyquist-Shannon sampling law, the sampling rate must be at least twice the highest target frequency. The hearing range of the human ear is about 20Hz-20,000Hz (if you’re curious how loud you can hear, you can click here to test your ears), although recording software often has a 48,000 option Hz, but we can safely conclude: 44100Hz can meet almost all our needs, higher is just a waste of your memory and CPU. More than 48,000 samples are meaningless to the human ear, which is similar to 24 frames per second on a movie. 44100Hz happens to be the standard sample rate for almost all music released. In fact, for vocals and many instruments, high-frequency sounds are noise, so high sample rates can sometimes worsen sound quality (which is why we need to adjust the equalizer).

320 kbps represents your bitrate/bitrate, which is shorthand for kilobits per second, which represents the size of the data used to describe sound. In CD (uncompressed audio file), the bit rate is 1411.2kbps, and the mp3 sound quality to achieve CD quality should be higher than 128kbps/44100Hz (128kbps can be said to be the most common bit rate). Generally, a higher number means better quality. The quality depends on many factors (such as the encoding algorithm). Many times we don’t need too high bitrate: our device can play mp3 and CD without difference (sound/sound card is normal).