Comparison and implementation of MP3, WAV


Free Download Mp4Gain
picture

Comparison and implementation of MP3, WAV

WAV vs MP3
WAV vs MP3

Sound has three elements: pitch, volume, and timbre:

WAV vs MP3
WAV vs MP3

Pitch is determined by the frequency of the sound wave, the higher the frequency, the higher the pitch.
The volume is determined by the amplitude of the sound wave, the larger the amplitude, the louder the sound.
The timbre is determined by the “shape” of the waveform (sounds like square, triangle, and sawtooth are called impulse waves and sound individual).
An audio file is a file obtained by converting an analog signal to a digital signal. In general, there are 5 important parameters: encoding method, number of channels, sampling rate, bit depth, and bit rate.

Encoding: how this format organizes binary data and how it is compressed.
Number of channels: mono, dual or 5.1 channels, etc.
Sampling rate: The number of samples per second.
Bit Depth: The number of binary bits used to store the y value of the sample point.
Bitrate – The desired number of bits per second for the file.
We know that there is no compression in the WAV format, so its encoding method is to directly write all the sampled points to the file in order.

WAV file size (B) = number of channels * sample rate (Hz) * bit depth (bit) / 8 + the file header size (B, it’s 44B)

Implementation
When you open an mp3 or wav file with a text editor, you see numbers like this:
4944 3303 0000 0000 3d48 5459 4552 0000
0006 0000 0032 3031 3800 5444 4154 0000
0006 0000 0032 3230 3300 5449 4d45 0000
0006 0000 0031 3430 3600 5052 4956 0000
168e 0000 584d 5000 3c3f 7870 6163 6b65
7420 6265 6769 6e3d 22ef bbbf 2220 6964
3d22 5735 4D30 4D70 4365 6869 487A 7265
537A 4E54 637A 6B63 3964 223F 3E0A 3A78
6D70 6D65 7461 2078 6D6C 6E78 3D22
6F62 653A 6574 612F
5249 4646 2e3d 0e05 5741 5645 666d 7420
1200 0000 0300 0200 44ac 0000 2062 0500
0800 2000 0000 6461 7461 a026 0e05 8089
00bc 00e8 f0bb c09e 8dbc 00c2 87bc 80f1
d3bc 8063 ccbc c030 fcbc 8012 f4bc 20bb
13bd e051 0fbd c0b0 2dbd 6079 28bd 4012
46bd 6032 40bd c0e3 5dbd 6040 57bd c015
7cbd e035 74bd b058 8dbd 50e2 88bd f0a7 9dbd e0dd 98bd 70d3 acbd e0a9 a7bd
d043 b8bd b0da b2bd
00e3 c4bd 605c bfbd
This one above is the mp3/wav format of the same song. What is the difference between them?

WAV
structure
file header
The WAV format follows the RIFF Resource Interchange File Format, so the WAV format is actually a three-layer relationship, which is simplified here. Its file header format is as follows:

Address Carving type content
00H-03H 4 character * 4 RIFF resource file exchange flag
04H-07H 4 unsigned int The number of bytes from the next address to the end of the file.
08H-0BH 4 character * 4 WAV file WAVE logo
0CH-0FH 4 character * 4 fmt wave file flag, the last digit is 0x20 space
10H-13H 4 unsigned int The size of the subchunk file header. For the WAV subfragment, the value is 0x10.
14H-15H 2 short unsigned Format type, when the value is 1, it means the data is linear PCM encoding
16H-17H 2 short unsigned number of channels
18H-1BH 4 int unsigned Sampling rate
1CH-1FH 4 int unsigned Wave file bytes per second = sample rate Bit depth PCM / 8 channels
20H-21H 2 short unsigned DATA data block unit length = number of channels * PCM bit depth / 8
22H-23H 2 short unsigned Bit depth PCM
24H-27H 4 character * 4 data stamp data
28H-2BH 4 unsigned int Total length of data part (bytes)
struct WAVHeader
{ char RIFF[ 4 ]; ///Resource file exchange flag RIFF unsigned LEN; ///Number of bytes from the next address to the end of the file char WAV[ 4 ]; ///WAV file flag WAVE char FMT [ 4 ]; ///Wave fmt file pointer, last digit is 0x20 space unsigned SubchunkSize; ///The size of the sub-chunk file header, for WAV this sub-chunk, the value is 0x10 DATATYPE short unsigned; / //Format type, when the value is 1, it means the data is unsigned linear PCM encoding short CH ; ///Number of unsigned channels F; ///Unsigned sample rate BYTERATE; ///Number of bytes per second of wave file = sample rate*PCM bit depth/8*Number of unsigned channels

short DATAUNITLEN; ///DATA block unit length=channel number*PCM bit depth/unsigned 8 short BITDEPTH; ///PCM bit depth character DATA[ 4 ]; ///Unsigned data mark data DATALEN ; ///Total data section length (bytes) };


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

Comparison and implementation of MP3, WAV

Comparison and implementation of MP3, WAV

WAV vs. MP3
WAV vs. MP3

An mp3 is 320kbps, 44100hz, what does this mean?

WAV vs. MP3
WAV vs. MP3

44100Hz represents the sample rate of the signal. The so-called sampling consists of obtaining the value y of the sound wave at the current moment every unit of time. Sampling is the process of discretizing continuous data (converting an analog signal to a digital signal).
image source

The sampling method mentioned above is called PCM (Pulse Code Modulation). According to the Nyquist-Shannon sampling law, the sampling rate must be at least twice the highest target frequency. The hearing range of the human ear is about 20Hz-20,000Hz (if you’re curious how loud you can hear, you can click here to test your ears), although recording software often has a 48,000 option Hz, but we can safely conclude: 44100Hz can meet almost all our needs, higher is just a waste of your memory and CPU. More than 48,000 samples are meaningless to the human ear, which is similar to 24 frames per second on a movie. 44100Hz happens to be the standard sample rate for almost all music released. In fact, for vocals and many instruments, high-frequency sounds are noise, so high sample rates can sometimes worsen sound quality (which is why we need to adjust the equalizer).

320kbps represents your bit rate/bit rate, which is short for kilobits per second, which represents the size of the data used to describe sound. In CD (uncompressed audio file), the bit rate is 1411.2 kbps, and the mp3 sound quality to achieve CD quality should be higher than 128 kbps / 44100 Hz (128 kbps can be said to be the most common bit rate). Generally, a higher number means better quality. The quality depends on many factors (such as the encoding algorithm). Many times we don’t need too high bitrate: our device can play mp3 and CD without difference (sound/sound card is normal).

A wav is 44100 Hz 16-bit stereo or 22050 Hz 8-bit mono, what does this mean? stereo/mono refers to dual/mono. For monophonic sound files, the sample data is an eight-bit short integer (short int 00H-FFH); for two-channel stereo sound files, each sample data is a 16-bit integer (int) and the upper eight bits (left channel) and lower eight bits (right channel) represent the two channels, respectively.

Sound is a mechanical wave, produced by the vibration of an object, and requires a medium to propagate. So, in essence, a sound is a waveform on an axis over time.

When did mp3 music files first appear?

When did mp3 music files first appear?

MP3

MPEG-1 Audio Layer 3, often referred to as MP3, is one of the most popular lossy compression and digital audio encoding formats today.

mp3

 

There is no noticeable drop in sound quality compared to the original uncompressed audio. It was invented and standardized in 1991 by a group of engineers at the Fraunhofer-Gesellschaft research organization in Erlangen, Germany.
MPEG-1 Audio Layer 3, often referred to as MP3, is one of the most popular lossy compression and digital audio encoding formats today. There is no noticeable drop in sound quality compared to the original uncompressed audio. It was invented and standardized in 1991 by a group of engineers at the Fraunhofer-Gesellschaft research organization in Erlangen, Germany.

The audio format supported by the MP3 player is not only MP3 format, but also WMA, WAV, MP3Pro, ASF, AAC and VQF, etc. The WMA format can reach CD quality when compressed to 64 kbps, and output is only half the size of the corresponding MP3 file. This is very important for models with only 32 MB of flash memory. WMA and RA formats are supported, which means FlashMemory space is almost doubled. If it’s hard, be sure to ask this question when purchasing.
Among all the music formats supported by MP3, the most common ones are MP3, WMA and WAV. Others are unpopular or too bulky to be practical.

MP3 File Structure Analysis Part 2

MP3 File Structure Analysis Part 2

mp3

Sounds in nature are very complex and waveforms are extremely complex.

Mp3

Usually we use pulse code modulation coding, that is, PCM coding. PCM converts continuously changing analog signals into digital codes through three steps of sampling, quantizing, and encoding.

u Decode:

Reverse encoding process

1.1.2 Brief introduction of MP3
The full name of MP3 is MPEG Audio Layer 3. It is an efficient computer audio coding scheme. It converts audio files into smaller files with a .mp3 extension with a higher compression ratio, essentially maintaining the sound quality of the source file. MP3 is part of the ISO/MPEG standard,

The ISO/MPEG standard describes audio compression using a high performance perceptual coding scheme. This standard has been continuously updated to meet the pursuit of “high quality and low quality”. Three audio codec schemes, MPEG Layer1, Layer2 and Layer3, have been formed, respectively, corresponding to the three sound files MP1, MP2 and MP3

MPEG (Moving Picture Experts Group) is a group of moving picture experts under ISO. The MPEG standard it specifies is widely used in various multimedia. The MPEG standard includes video and audio standards. Audio standards have developed MPEG-1, MPEG -2, MPEG-2 ACC, MPEG-4. The MPEG-1 and MPEG-2 standards use the same family of Layer1, 2, 3 audio codecs, and most MP3s use the MPEG1 standard.

MP3 audio compression consists of two parts: encoding and decoding. Encoding is the process of converting the original signal to a level signal, and decoding is the reverse process. MP3 uses the PerceptualAudio Coding distortion algorithm. The frequency range of sound perceived by the human ear is 20 Hz to 20 kHz. MP3 cuts out a lot of redundant signals and irrelevant signals. The encoder transforms the original sound into the frequency domain through a mixed filter bank and uses a psychoacoustic model. to estimate that it may be only The perceived noise level is quantized and converted to Huffman coding to form an MP3 bit stream. The decoder is much simpler, its task is to extract the sound signal from the encoded spectral line components through inverse quantization and inverse transformation.

MP3 file data consists of multiple frames, and a frame is the smallest unit of an MP3 file. Each frame, in turn, consists of a frame header, additional information, and sound data. The playback time of each frame is 0.026 seconds and its duration varies with the bit rate. Some MP3 files have extra bytes at the end that contain description information for non-audio data.

MP3 file structure analysis

MP3 file structure analysis

MP3 FORMAT

ü ID3:

mp3 format

 

Usually located in several bytes at the beginning or end of an mp3 file, it records the singer, title, album name, era, style, and other mp3 file information.

ID3 is divided into two versions, the V1 ID3 version is fixed at the end of the 128-word file section, it begins with the TAG character, if there is no ID3V1 information, it is considered that there is no ID3V1 information, the V2 ID3 version is found. at the beginning of mp3 and the length is variable.

ü Sampling rate:

The number of samples extracted from a continuous signal to form a discrete signal per second. It is expressed in Hertz (Hz). Sampling rate refers to the sampling frequency when converting an analog signal to a digital signal, i.e. how many points are sampled per unit of time. The higher the sample rate, the more realistic and natural the sound will be. On today’s major capture cards, the sample rate is generally divided into three levels: 22.05 KHz, 44.1 KHz, and 48 KHz. 22.05 KHz can only achieve the sound quality of FM radio, and 44.1 KHz is the theoretical limit of CD sound quality, and 48 KHz is more accurate.

ü Bit rate:

Bit rate refers to the number of bits (bits) transmitted per second. The unit is bps (bit per second). The higher the bit rate, the more information transmitted. In the audio and video fields, bit rate often translates to bit rate. The bit rate indicates how many bits per second the encoded (compressed) audio and video data should represent, and a bit is the smallest unit in binary. 0 or 1. The relationship between bitrate and audio and video compression is simply that the higher the bitrate, the better the quality of the audio and video, but the larger the encoded file; if the bitrate is lower, the situation is just the opposite.

Bit rate = sample rate * number of samples * number of channels

ü Bitrate/Stream/Bitrate:

It refers to the data stream used by audio and video files in a unit of time. The popular understanding is the sample rate, which is the most important part of quality control in audio and video encoding. Generally, the units we use are Kb/s and Mb/s. . Generally speaking, the higher the code stream, the lower the compression ratio and the higher the quality. The higher the code stream, the higher the sampling rate per unit time, the higher the data stream, the higher the accuracy, and the closer the processed file is to the original file.

ü Code:

From the point of view of information theory, the data that describes the source of information is the sum of the redundancy of information and data, namely: data = information + data redundancy. The audio signal has correlation in the time domain and the frequency domain, that is, there is data redundancy. Taking audio as the source, the essence of audio encoding is to reduce redundancy in the audio.

Mp3 to wav: Mp3 to wav converter

Mp3 to wav: Mp3 to wav converter

MP3 to WAV converter

An mp3 file is a file that has been encoded to save a lot of disk space, as it can make the mp3 take up about a tenth of what the original file did.

Mp3 to wav converter is a need that exists without a doubt, many people look for a way to convert mp3 files to wav format.

Mp3 to Wav converter

Today that the size of an audio file is no longer as important as it was before, but today quality is the priority for many users, it is possible to use a tool like Mp4Gain to convert an mp3 to wav and give it a realize, not only normalizing the volume, but correcting the equalization, etc.

Furthermore, we would say that only Mp4Gain can convert mp3 to wav, and among many other audio and video formats. The main audio formats can be converted to each other (FLAC, OGG, AAC, A4C, WAV, etc.) and they can even be extracted from the most important video formats.

MP3 TO WAV CONVERTER

A converter must contribute, and Mp4Gain does, with a number of features that it offers in addition to the conversion. From normalization, to modifying the tempo or pitch, etc.

Why WAV?

As we said before, today size is already a priority as it was 20 years ago. Today it is no longer necessary to reduce audio files to the maximum, at least not for all users.
The WAV format has a whole series of advantages and there are those who want to enjoy their music or audio in that format.

One of many advantages is that many devices will natively play the WAV format and have a lot of sound quality. If by the way, when converting it, you take advantage of the fact that it is possible to normalize the volume level using Mp4Gain, then you already have an additional advantage.

Especially when the normalization that today can be done with the Mp4Gain, if the user wishes, is a dynamic normalization of the volume, which goes further and achieves fantastic results.

Definitely if you are wanting to convert mp3 to WAV, it is most possible that Mp4Gain is the ideal option to achieve the best results.