mp3 bitrate list Archives - Page 2 of 2

Audio Intro Part 2

Free Download Mp4Gain

Audio Intro Part 2

A wav is 44100 Hz 16-bit stereo or 22050 Hz 8-bit mono, what does that mean? stereo/mono refers to dual/mono.

For monophonic sound files, the sample data is an eight-bit short integer (short int 00H-FFH); for two-channel stereo sound files, each sample data is a 16-bit integer (int) and the upper eight bits (left channel) and lower eight bits (right channel) represent the two channels, respectively.

Sound is a mechanical wave, produced by the vibration of an object, and requires a medium to propagate. So, in essence, a sound is a waveform on an axis over time.

Sound has three elements: pitch, volume, and timbre:

Pitch is determined by the frequency of the sound wave, the higher the frequency, the higher the pitch.
The volume is determined by the amplitude of the sound wave, the larger the amplitude, the louder the sound.
The timbre is determined by the “shape” of the waveform (sounds like square, triangle, and sawtooth are called impulse waves and sound individual).
An audio file is a file obtained by converting an analog signal to a digital signal. In general, there are five important parameters: encoding method, number of channels, sampling rate, bit depth, and bit rate.

Encoding: how this format organizes binary data and how it is compressed.
Number of channels: mono, dual or 5.1 channels, etc.
Sampling rate: The number of samples per second.
Bit Depth: The number of binary bits used to store the y value of the sample point.
Bitrate – The desired number of bits per second for the file.
We know that there is no compression in the WAV format, so its encoding method is to directly write all the sampled points to the file in order.

WAV file size (B) = number of channels * sample rate (Hz) * bit depth (bit) / 8 + file header size (B, it’s 44B)

Implementation
When you open an mp3 or wav file with a text editor, you see numbers like this:

4944 3303 0000 0000 3d48 5459 4552 0000
0006 0000 0032 3031 3800 5444 4154 0000
0006 0000 0032 3230 3300 5449 4d45 0000
0006 0000 0031 3430 3600 5052 4956 0000
168e 0000 584d 5000 3c3f 7870 6163 6b65
7420 6265 6769 6e3d 22ef bbbf 2220 6964
3d22 5735 4D30 4D70 4365 6869 487A 7265
537A 4E54 637A 6B63 3964 223F 3E0A 3A78
6D70 6D65 7461 2078 6D6C 6E78 3D22
6F62 653A 6574 612F
5249 4646 2e3d 0e05 5741 5645 666d 7420
1200 0000 0300 0200 44ac 0000 2062 0500
0800 2000 0000 6461 7461 a026 0e05 8089
00bc 00e8 f0bb c09e 8dbc 00c2 87bc 80f1
d3bc 8063 ccbc c030 fcbc 8012 f4bc 20bb
13bd e051 0fbd c0b0 2dbd 6079 28bd 4012
46bd 6032 40bd c0e3 5dbd 6040 57bd c015
7cbd e035 74bd b058 8dbd 50e2 88bd f0a7 9dbd e0dd 98bd 70d3 acbd e0a9 a7bd
d043 b8bd b0da b2bd
00e3 c4bd 605c bfbd

This one above is the mp3/wav format of the same song. What is the difference between them?

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

Audio intro

An mp3 is 320kbps, 44100hz, what does this mean?

44100Hz represents the sample rate of the signal. The so-called sampling consists of obtaining the value y of the sound wave at the current moment every unit of time. Sampling is the process of discretizing continuous data (converting an analog signal to a digital signal).
image source

The sampling method mentioned above is called PCM (Pulse Code Modulation). According to the Nyquist-Shannon sampling law, the sampling rate must be at least twice the highest target frequency. The hearing range of the human ear is about 20Hz-20,000Hz (if you’re curious how loud you can hear, you can click here to test your ears), although recording software often has a 48,000 option Hz, but we can safely conclude: 44100Hz can meet almost all our needs, higher is just a waste of your memory and CPU. More than 48,000 samples are meaningless to the human ear, which is similar to 24 frames per second on a movie. 44100Hz happens to be the standard sample rate for almost all music released. In fact, for vocals and many instruments, high-frequency sounds are noise, so high sample rates can sometimes worsen sound quality (which is why we need to adjust the equalizer).

320 kbps represents your bitrate/bitrate, which is shorthand for kilobits per second, which represents the size of the data used to describe sound. In CD (uncompressed audio file), the bit rate is 1411.2kbps, and the mp3 sound quality to achieve CD quality should be higher than 128kbps/44100Hz (128kbps can be said to be the most common bit rate). Generally, a higher number means better quality. The quality depends on many factors (such as the encoding algorithm). Many times we don’t need too high bitrate: our device can play mp3 and CD without difference (sound/sound card is normal).

What is bit rate? Knowledge of the MP3 audio format. Part 2

Bitrate is a benchmark indicator of the efficiency of digital music compression.

The bit rate represents the number of bits bps (bit per second, bits per second) transmitted per unit of time (1 second). We usually use kbps (in simple terms, it is per second) clock 1000 bits) as the unit. The bit rate of digital music on CD is 1411.2 kbps (ie recording 1 second of CD music requires 1411.2 × 1024 bits of data). The higher the bit rate of the music file, the more data (Bit) must be processed in a unit of time (1 second), and the better the sound quality of the music file. However, when the bit rate is high, the file size increases, which will occupy a large amount of storage capacity. 8 to 320 kbps.

1. WMA (Windows Media Audio, Windows Media Audio)

As a Microsoft media compression method, it is a part of the technology that only compresses audio data in Windows Media Technologies. The sound quality is similar to MP3 and can be compressed with half the technology of MP3. It has the copyrighted Windows Media Rights Manager and can be played by installing it in WMP (Windows Media Player, Windows Media Player). Due to the strong influence of Microsoft and Windows, as well as major copyright reasons, the major American record companies, EMI and BMG, officially confirmed that they use the WMA method developed and produced by Microsoft. It is believed that this advanced method will become even more popular in the future.

2. MP3 (CBR, VBR, ABR)

MP3 is currently the most widely used and widely used lossy compressed digital audio format. It has been explained above and will not be repeated here.

CBR (constant bit rate)

CBR is the oldest and simplest MP3 encoding (compression) method. When this method is used for encoding, the bit rate of the entire file is the same, in other words, the bit rate used by the MP3 file per second is the same. Although the music file has sections of varying complexity, the encoder always keeps the bitrate constant, unless you use the highest sound quality; otherwise the sound quality of the different sections of the MP3 file will vary. The more complex the passage, the worse the sound quality. Its biggest advantage is that the file size is fixed, which is convenient for calculating storage space.

VBR (Variable Bit Rate, Variable Bit Rate)

VBR is a variable encoding rate MP3 compression method. Its principle is to encode the complex part of a song with a high bit rate and the simple part with a low bit rate. Through this dynamic adjustment of the encoding rate, the sound quality can be improved. additionally obtained and the size of the file. Its main advantage is that the entire song can approximately meet our sound quality requirements, but the disadvantage is that the size of the compressed file cannot be estimated during encoding.

Most MP3 players released now support VBR, but although some machines can play songs in VBR format, they can’t display the playing time correctly. Nowadays, a lot of high-quality MP3 music is encoded in VBR.

What is bit rate? Knowledge of the MP3 audio format.

Digital audio formats are audio signals that are recorded, processed, and reproduced in digital form.

The emergence of digital audio formats is to meet the needs of high-fidelity playback, storage and transmission. Simply put, early analog audio formats had issues with playback distortion and glitches due to media wear. Since the advent of the CD, digital format audio files have become popular, but another problem has arisen: the limitation of the storage volume, and the CD still has the phenomenon of wear. Saving to hard drive (relatively longer storage time) is not a good solution when storage media (mainly hard drives) are still expensive at the time. The rise of the Internet has created a requirement for long-distance file transmission. Under the restriction of bandwidth, the demand to reduce file size has become more intense. All this has led to the generation of lossy compressed digital audio formats from external factors!

In terms of internal factors, with the improvement of computing and coding capabilities, the progress of various acoustic psychological models has promoted the emergence of various lossy compressed digital audio formats. Some of the most commonly used audio formats in MP3 players are briefly introduced below: MP3 (CBR, VBR, ABR), WMA, WAV, ADPCM, and the emerging audio formats AAC, ASF, and OGG.

Before introducing various digital audio formats, let’s clarify one concept: bitrate.

In the field of computing, all information is digitized. Bit is the smallest unit of data in a computer, it refers to a number of 0 or 1, which is a mathematical binary number, a “0” or “1” , is a bit. For example, when we say a 2-digit number, it means that it is a two-digit binary number, and there are 4 combinations of “00”, “01”, “10” and “11”, which represent 0, 1, 2 and 3 is four numbers.

Everything you wanted to ask about the mp3

WHAT IS A CODEC AND WHAT IS IT FOR?

CODEC is an abbreviation of CODifier-DECodifier, which already indicates its functions. In the case of MP3, it deals with processing the sound, applying various compression methods to reduce the size necessary for its storage, and conversely, to reconstruct the sound from the encoded information. The codecs can be found in the form of an executable file, a DLL, or less frequently, incorporated into the code of another program. The DLL format allows the same codec to be easily used by different programs, while the executable format is usually more flexible in terms of configuration, although it is more complex to use.

WHAT DOES “PSYCHOCAUSTIC COMPRESSION” MEAN?

This very complicated word refers to the main compression technique that MP3 uses (although it is not the only one), which simply consists of eliminating the information (sound, in this case) that, in theory, is not able to perceive the ear human. I say “in theory” because, depending on the compression ratio that is applied, that loss may be more or less noticeable, hence some files do not sound as good as would be desired. The different codec also do not use psychoacoustic principles in the same way, so the quality they achieve will not be the same.

DOES THE SOURCE ORIGIN INFLUENCE THE MP3 QUALITY?

Evidently. The ideal is to start from an original CD. Analog sources, such as vinyl records or tapes, have lower quality in themselves, no matter how much we “digitize” or “improve” them with some programs. Recorded CDs may contain errors (which will end up being noises when played), and we usually do not know from which source they have been recorded (perhaps from previously encoded MP3 files). Re-encoding an already compressed music file, in the same or another format, is also a mistake, since quality is always lost.

DO THE PROGRAMS WE USE TO EXTRACT AND CODIFY THE QUALITY OF MP3?

It also seems obvious, so much so that this point should not be here. But many people do not take it into account, without knowing the reasons very well.

DOES THE QUALITY OF PLAYERS INCLUDE IN THE QUALITY OF MP3?

To hear it, you already know the answer, don’t you? J. To have it stored on the hard drive or on a CD, nope. Jokes aside, and in theory, all decoders should get the same sound from an MP3 file. But unfortunately this is not the case, usually because they do not strictly conform to the MP3 standard. You can find the best explanations on this topic here.

DOES THE QUALITY OF THE SOUND CARD AND SPEAKERS INCLUDE IN THE QUALITY OF THE MP3?

Well, I think you already know where the shots are going, we will not continue along this line. But nobody says “is that MP3 does not sound good” when it has a Sound Blaster 16 (what times those!) And some 800 W TurboPowerSound speakers that cost € 18 uros.

WHAT IS BITRATE?

It is the number of bits that are used to encode a second of sound, if we talk about MP3. It is measured in Kbps (kilobits per second).

As there is often confusion, we will give an example. If we have a 4-minute MP3 encoded at 128 Kbps:

-One second occupies: 128 Kbits x (1 Byte / 8 bits) = 16 KBytes.

-The complete song: 16 KBytes / second x (4 minutes x 60 seconds / 1 minute) = 3840 KBytes = 3.75 MBytes.

The formulas could be more correct, but this is not a mathematical thesis and I think they are enough to understand us.

WHAT DOES CBR, VBR and ABR MEANS, AND WHAT IS THE DIFFERENCE?

They are different coding methods. CBR means “Constant BitRate”, VBR “Variable BitRate” and ABR “Average BitRate”. It is easy to deduce that if CBR is used, the bitrate remains fixed, and if VBR is used, the bitrate changes. ABR can be considered a variant of VBR in which the objective is to achieve a certain final size, since we can set the average bitrate to be used (although it will never be as accurate as CBR).

The great advantage of VBR is that it uses more bits when the fragment to be encoded is more complex, and less when there are silences or it is simpler, so that we will achieve the highest quality using the minimum possible space. If the final size is not the deciding factor, it is the most recommended option.

WHAT IS THE BEST MP3 CODEC TO COMPRESS MUSIC? WHAT IS YOUR BEST CONFIGURATION?

The first question is easy to answer: LAME. In many different tests, done independently and with means that probably none of us will ever have, LAME is considered again and again as the best codec. And since there is no commercial motivation, since it is free distribution software, we can give full validity to these tests. The Fraunhofer codec and some of its “versions” could be valid in certain circumstances, but … do we need to use a commercial or pirate codec with a free and legal substitute? I think not (and from here, what each one does, is his own thing). For me, codec MP3 = LAME.

The second question is much harder to answer. There is no configuration that is perfect for everything (because if not, the modifiable options would be left over and everything would be much easier) J. But there are some recommended options in general, which we can establish with the assurance that the result will be of good quality. I have to admit that I barely change them, but as in this case (and in many others) there are no absolute truths, everyone has to try until they establish the one that best suits their preferences (or their ears).

The first thing that we must be clear about is that, in an MP3 file, size and quality are conflicting terms. We have to look for a practical middle ground. I put that term between 128 and 192 Kbps (some will say it is too low … I will try to justify my opinion). In principle, the purists established the CD quality at 256 kbps CBR. That is, using that bitrate, the sound of the CD and that of the MP3 file were absolutely indistinguishable. But, right now, LAME is already very developed. It uses the most advanced techniques, and manages to do perfectly things that other codecs are not able to solve properly (VBR, Joint-Stereo), which allows you to improve the quality and reduce the size.

In r3mix.net, after the corresponding tests, they have established an “ideal” configuration based on quality criteria. This configuration is so accepted, that in the latest versions of CDex, and other programs, it is already included as one of the preset configurations. But be careful: this does not mean that this is the only option, nor that it will always achieve the highest possible quality. Another thing is that using this option our ear can differentiate the results of the original audio CD, and this is where the “trick” of MP3 compression is.

After having used this configuration, I have observed that, although there are important variations, the average of the files is about 170-175 kbps, and that they rarely go beyond those 192 Kbps that I mentioned before, so I consider that overcoming that bitrate does not it will hardly improve the quality, but it will significantly increase the size. Although I repeat, there will always be someone to whom this bitrate seems insufficient, and will use even 320 Kbps, which is the maximum supported by LAME.

The second part is to take into account the equipment we have. If the MP3s are not going to sound through a card, an amplifier and very good quality speakers, the bitrate can be adjusted a little more without practically noticing variation in quality. The average 160 Kbps seems like a very reasonable amount to me, and that’s how I usually code my CDs. And I can say that they don’t sound bad at all. If someone is able to do a “blind” test and differentiate between several files encoded (correctly) at 160 and 192 Kbps, you really have to congratulate him on his ear.

Finally, if we consider size as a decisive factor, we could go down to 128 Kbps, but considering that the quality may already be too compromised. Summarizing:

Configuration that seems more appropriate for practical purposes, and looking for a high quality (about 192 Kbps):

-Joint-Stereo -> Get reduce the final size without losing quality. Sometimes, where there is a lot of variation between channels and with high bitrates, it could be configured as Stereo.

-VBR MTRH Quality 1 or R3Mix Preset -> Change the bitrrate according to the needs of each moment, improving the quality without increasing the size with respect to CBR (normally).

-Bitrate minimum 64 Kbps -> To compress to the maximum silences and very simple fragments.

-Bitrate maximum 256 Kbps -> So you have room to improve the quality in complicated fragments.