Audio Intro Part 2


A WAV file might be described as 44100 Hz 16-bit stereo or 22050 Hz 8-bit mono. What does that mean? Stereo and mono refer to two channels and one channel, respectively.

In an 8-bit mono file, each sample is a single unsigned byte (00H-FFH). In an 8-bit stereo file, each sample frame occupies 16 bits: eight bits for the left channel followed by eight bits for the right channel. At 16-bit depth, each channel’s sample is a 16-bit signed integer instead, so bit depth and channel count are independent settings.
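As a minimal sketch of this layout, assuming 8-bit unsigned PCM with the left-channel byte stored before the right-channel byte in each frame (the usual WAV convention), the following Python splits raw sample bytes into two channels. The function name `split_stereo_8bit` and the sample bytes are hypothetical.

```python
def split_stereo_8bit(raw):
    """Split interleaved 8-bit stereo frames (L, R, L, R, ...) into two lists."""
    if len(raw) % 2:
        raise ValueError("stereo data must contain whole 2-byte frames")
    left = list(raw[0::2])   # bytes at even offsets: left channel
    right = list(raw[1::2])  # bytes at odd offsets: right channel
    return left, right


# Two made-up frames: (0x80, 0x7F) and (0x90, 0x70)
frames = bytes([0x80, 0x7F, 0x90, 0x70])
left, right = split_stereo_8bit(frames)
print(left, right)  # [128, 144] [127, 112]
```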

Sound is a mechanical wave, produced by the vibration of an object, and it requires a medium to propagate. In essence, a sound can be drawn as a waveform: amplitude on one axis plotted against time on the other.

Sound has three elements: pitch, volume, and timbre:

Pitch is determined by the frequency of the sound wave: the higher the frequency, the higher the pitch.
Volume is determined by the amplitude of the sound wave: the larger the amplitude, the louder the sound.
Timbre is determined by the “shape” of the waveform (square, triangle, and sawtooth waves, for example, each have their own distinctive sound).
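
A minimal sketch of these three elements, using the hypothetical helpers `sine` and `square`: the frequency argument sets the pitch, the amplitude argument sets the volume, and swapping one waveform shape for the other changes the timbre while the other two stay fixed.

```python
import math


def sine(freq_hz, amplitude, t):
    """Value of a sine wave at time t (seconds)."""
    return amplitude * math.sin(2 * math.pi * freq_hz * t)


def square(freq_hz, amplitude, t):
    """Square wave with the same frequency and amplitude, but a different shape."""
    return amplitude if math.sin(2 * math.pi * freq_hz * t) >= 0 else -amplitude


# Same pitch (440 Hz) and volume (0.5), sampled one eighth of a period in.
t = 1 / (8 * 440)
print(round(sine(440, 0.5, t), 4))  # 0.3536
print(square(440, 0.5, t))          # 0.5
```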
An audio file is a file obtained by converting an analog signal to a digital signal. In general, there are five important parameters: encoding method, number of channels, sampling rate, bit depth, and bit rate.

Encoding: how the format organizes binary data and how it is compressed.
Number of channels: mono, stereo, 5.1 channels, etc.
Sampling rate: the number of samples taken per second.
Bit depth: the number of binary bits used to store the y value of each sample point.
Bit rate: the number of bits used per second of audio in the file.

A WAV file normally stores uncompressed PCM data, so its encoding method is simply to write all the sampled points to the file in order.

WAV file size (B) = number of channels * sample rate (Hz) * bit depth (bit) / 8 * duration (s) + file header size (B, typically 44 B)
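
The size calculation can be sketched in Python as follows. The function name `wav_size_bytes` is hypothetical, the duration in seconds is an explicit input, and the 44-byte header is only the common default (files with extra chunks have larger headers).

```python
def wav_size_bytes(channels, sample_rate_hz, bit_depth, duration_s, header_bytes=44):
    """Estimate the size of an uncompressed PCM WAV file in bytes."""
    bytes_per_second = channels * sample_rate_hz * bit_depth // 8
    return int(bytes_per_second * duration_s) + header_bytes


# One minute of 44100 Hz 16-bit stereo audio
print(wav_size_bytes(2, 44100, 16, 60))  # 10584044
```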

Implementation
When you open an mp3 or wav file in a hex editor, you see numbers like this:

4944 3303 0000 0000 3d48 5459 4552 0000
0006 0000 0032 3031 3800 5444 4154 0000
0006 0000 0032 3230 3300 5449 4d45 0000
0006 0000 0031 3430 3600 5052 4956 0000
168e 0000 584d 5000 3c3f 7870 6163 6b65
7420 6265 6769 6e3d 22ef bbbf 2220 6964
3d22 5735 4D30 4D70 4365 6869 487A 7265
537A 4E54 637A 6B63 3964 223F 3E0A 3A78
6D70 6D65 7461 2078 6D6C 6E78 3D22
6F62 653A 6574 612F
5249 4646 2e3d 0e05 5741 5645 666d 7420
1200 0000 0300 0200 44ac 0000 2062 0500
0800 2000 0000 6461 7461 a026 0e05 8089
00bc 00e8 f0bb c09e 8dbc 00c2 87bc 80f1
d3bc 8063 ccbc c030 fcbc 8012 f4bc 20bb
13bd e051 0fbd c0b0 2dbd 6079 28bd 4012
46bd 6032 40bd c0e3 5dbd 6040 57bd c015
7cbd e035 74bd b058 8dbd 50e2 88bd f0a7
9dbd e0dd 98bd 70d3 acbd e0a9 a7bd d043
b8bd b0da b2bd 00e3 c4bd 605c bfbd

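The dumps above show the same song in two formats. The first is the MP3, which begins with the bytes 49 44 33 (the ASCII text “ID3”, the start of an ID3v2 metadata tag); the second is the WAV, which begins at 52 49 46 46 (“RIFF”). What is the difference between them?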
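
As a partial answer, the WAV bytes can be decoded directly. The sketch below copies the 46 bytes from `5249 4646` through the data chunk size out of the dump and reads them with Python's `struct` module. It assumes the standard RIFF/WAVE layout with the `fmt ` chunk first and the `data` chunk immediately after it, which holds for these bytes but not for every WAV file. The function name `parse_wav_header` is hypothetical.

```python
import struct

# Bytes copied from the WAV dump above (RIFF header, fmt chunk, data chunk header).
HEADER_HEX = """
5249 4646 2e3d 0e05 5741 5645 666d 7420
1200 0000 0300 0200 44ac 0000 2062 0500
0800 2000 0000 6461 7461 a026 0e05
"""


def parse_wav_header(data):
    """Decode the RIFF/WAVE header fields; all integers are little-endian."""
    riff, riff_size, wave = struct.unpack_from("<4sI4s", data, 0)
    if riff != b"RIFF" or wave != b"WAVE":
        raise ValueError("not a RIFF/WAVE file")
    fmt_id, fmt_size = struct.unpack_from("<4sI", data, 12)
    if fmt_id != b"fmt ":
        raise ValueError("expected the fmt chunk first")
    (audio_format, channels, sample_rate,
     byte_rate, block_align, bits) = struct.unpack_from("<HHIIHH", data, 20)
    # The next chunk header starts right after the fmt payload.
    next_id, next_size = struct.unpack_from("<4sI", data, 20 + fmt_size)
    return {
        "audio_format": audio_format,  # 1 = integer PCM, 3 = IEEE float
        "channels": channels,
        "sample_rate": sample_rate,
        "byte_rate": byte_rate,
        "block_align": block_align,
        "bits_per_sample": bits,
        "next_chunk": next_id,
        "next_chunk_size": next_size,
    }


header = parse_wav_header(bytes.fromhex("".join(HEADER_HEX.split())))
print(header["channels"], header["sample_rate"], header["bits_per_sample"])  # 2 44100 32
print(round(header["next_chunk_size"] / header["byte_rate"], 1))  # 240.4 (seconds of audio)
```

Read this way, the dump describes a 44100 Hz, 32-bit floating-point stereo file about four minutes long. Its `fmt ` chunk is 18 bytes, so this particular header is 46 bytes rather than the usual 44.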



An MP3 might be described as 320 kbps, 44100 Hz. What does this mean?

44100 Hz is the sample rate of the signal. Sampling means recording the value y of the sound wave once per fixed unit of time. It is the process of discretizing continuous data, that is, converting an analog signal to a digital signal.
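
A minimal sketch of sampling in Python, assuming the "analog" signal is a pure sine tone. The helpers `sample_sine` and `quantize_16bit` are hypothetical names; the second one rounds each sample to a 16-bit signed integer, which is the bit-depth step.

```python
import math


def sample_sine(freq_hz, sample_rate_hz, n_samples, amplitude=1.0):
    """Take n_samples readings of a sine tone, one every 1/sample_rate_hz seconds."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * n / sample_rate_hz)
            for n in range(n_samples)]


def quantize_16bit(samples):
    """Map samples in [-1.0, 1.0] to 16-bit signed integers."""
    return [max(-32768, min(32767, round(s * 32767))) for s in samples]


# An 11025 Hz tone sampled at 44100 Hz gives exactly four samples per cycle.
samples = sample_sine(11025, 44100, 4)
print(quantize_16bit(samples))  # [0, 32767, 0, -32767]
```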

The sampling method described above is called PCM (Pulse Code Modulation). According to the Nyquist-Shannon sampling theorem, the sampling rate must be at least twice the highest frequency to be captured. The hearing range of the human ear is about 20 Hz to 20,000 Hz, so although recording software often offers a 48,000 Hz option, we can safely conclude that 44100 Hz meets almost all our needs; anything higher mostly wastes memory and CPU. Sample rates above 48,000 Hz add little that the human ear can perceive, much as 24 frames per second is enough for a movie.

44100 Hz also happens to be the standard sample rate for almost all released music. In fact, for vocals and many instruments, very high frequencies are mostly noise, so a high sample rate can sometimes worsen sound quality (which is why we need to adjust the equalizer).
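
The "twice the highest frequency" rule can be illustrated with a short sketch. It assumes an ideal pure tone and no anti-aliasing filter, and the names `nyquist_limit` and `alias_frequency` are hypothetical. A tone above half the sample rate is not lost; it folds back and shows up as a lower, wrong frequency.

```python
def nyquist_limit(sample_rate_hz):
    """Highest frequency that a given sample rate can represent."""
    return sample_rate_hz / 2


def alias_frequency(freq_hz, sample_rate_hz):
    """Apparent frequency of a pure tone after sampling at sample_rate_hz."""
    folded = freq_hz % sample_rate_hz
    return min(folded, sample_rate_hz - folded)


print(nyquist_limit(44100))           # 22050.0
print(alias_frequency(20000, 44100))  # 20000 (below the limit, unchanged)
print(alias_frequency(30000, 44100))  # 14100 (above the limit, folds back)
```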

320 kbps is the bit rate. The unit is shorthand for kilobits per second, and it measures how much data is used to describe each second of sound. On a CD (uncompressed audio), the bit rate is 1411.2 kbps, while an MP3 at 44100 Hz should be higher than 128 kbps to approach CD quality (128 kbps is arguably the most common bit rate). Generally, a higher number means better quality, although quality also depends on other factors such as the encoding algorithm. Often we don’t need a very high bit rate: on ordinary speakers and sound cards, an MP3 and a CD are hard to tell apart.
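
The 1411.2 kbps figure follows from the CD parameters (44100 Hz, 16-bit, two channels). A short sketch, with the hypothetical helper `pcm_bit_rate_kbps`:

```python
def pcm_bit_rate_kbps(sample_rate_hz, bit_depth, channels):
    """Bit rate of uncompressed PCM audio, in kilobits (1000 bits) per second."""
    return sample_rate_hz * bit_depth * channels / 1000


cd = pcm_bit_rate_kbps(44100, 16, 2)
print(cd)                  # 1411.2
print(round(cd / 128, 1))  # 11.0, i.e. a 128 kbps MP3 uses about 1/11 of the data
```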

What is bit rate? Knowledge of the MP3 audio format. Part 2


Bitrate is a benchmark indicator of the efficiency of digital music compression.

The bit rate is the number of bits transmitted per unit of time (1 second), measured in bps (bits per second). We usually use kbps (simply put, 1000 bits per second) as the unit. The bit rate of digital music on CD is 1411.2 kbps (i.e., recording 1 second of CD music requires 1411.2 × 1000 bits of data). The higher the bit rate of a music file, the more data (bits) must be processed per second, and the better the sound quality. However, a high bit rate also increases the file size, which takes up more storage. MP3 bit rates range from 8 to 320 kbps.
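
The trade-off between bit rate and file size can be sketched as follows. The helper name `audio_size_bytes` and the four-minute duration are assumptions for illustration, and container and tag overhead are ignored.

```python
def audio_size_bytes(bit_rate_kbps, duration_s):
    """Approximate size of an audio stream; 1 kbps is 1000 bits per second."""
    return bit_rate_kbps * 1000 * duration_s / 8


# A four-minute (240 s) song at three bit rates
for kbps in (128, 320, 1411.2):
    print(kbps, round(audio_size_bytes(kbps, 240)))
```

With these inputs the 128 kbps stream is 3,840,000 bytes, the 320 kbps stream is 9,600,000 bytes, and the CD-rate stream is 42,336,000 bytes.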

1. WMA (Windows Media Audio)

WMA is Microsoft’s audio compression method, the part of Windows Media Technologies that compresses audio data only. Its sound quality is similar to MP3, and it is claimed to reach that quality at about half the file size of MP3. It supports copy protection through Windows Media Rights Manager and can be played in WMP (Windows Media Player). Because of the strong influence of Microsoft and Windows, and for copyright-protection reasons, the major record companies EMI and BMG officially confirmed that they use the WMA method developed by Microsoft. It is believed that this method will become even more popular in the future.

2. MP3 (CBR, VBR, ABR)

MP3 is currently the most widely used lossy compressed digital audio format. It has been explained above and will not be repeated here.

CBR (Constant Bit Rate)

CBR is the oldest and simplest MP3 encoding (compression) method. With this method the bit rate is the same throughout the file; in other words, every second of the MP3 uses the same number of bits. A piece of music has passages of varying complexity, but the encoder always keeps the bit rate constant, so unless you encode at the highest quality setting, the sound quality of different passages will vary: the more complex the passage, the worse it sounds. The biggest advantage of CBR is that the file size is predictable, which makes it easy to calculate storage space.

VBR (Variable Bit Rate)

VBR is an MP3 compression method with a variable encoding rate. Its principle is to encode the complex parts of a song at a high bit rate and the simple parts at a low bit rate. This dynamic adjustment of the encoding rate gives a good balance between sound quality and file size. Its main advantage is that the whole song can roughly meet our sound quality requirements; its disadvantage is that the size of the compressed file cannot be predicted before encoding.
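
To see why the size of a VBR file is only known once the encoder has chosen a rate for each passage, here is a rough sketch. The segment list is made up for illustration, real encoders choose a rate per frame rather than per passage, and the helper names are hypothetical.

```python
def vbr_size_bytes(segments):
    """Total size of a stream given (bit_rate_kbps, duration_s) pairs."""
    return sum(kbps * 1000 * secs / 8 for kbps, secs in segments)


def average_bit_rate_kbps(segments):
    """Duration-weighted average bit rate of the same segments."""
    total_bits = sum(kbps * 1000 * secs for kbps, secs in segments)
    total_secs = sum(secs for _, secs in segments)
    return total_bits / total_secs / 1000


# Hypothetical 3-minute song: quiet intro, complex chorus, moderate verse
song = [(96, 60), (256, 30), (128, 90)]
print(round(vbr_size_bytes(song)))            # 3120000
print(round(average_bit_rate_kbps(song), 1))  # 138.7
```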

Most MP3 players released now support VBR, although some machines that can play VBR songs can’t display the playing time correctly. Nowadays, a lot of high-quality MP3 music is encoded in VBR.

What is bit rate? Knowledge of the MP3 audio format.


Digital audio formats store audio signals that are recorded, processed, and reproduced in digital form.

Digital audio formats emerged to meet the needs of high-fidelity playback, storage, and transmission. Simply put, early analog audio formats suffered from playback distortion and glitches caused by media wear. Digital audio became popular with the advent of the CD, but other problems arose: storage capacity was limited, and CDs still wear out. Saving to a hard drive lasts relatively longer, but it was not a good solution at a time when storage media (mainly hard drives) were still expensive. The rise of the Internet then created a need to send files over long distances, and limited bandwidth made the demand for smaller files more intense. These external factors led to the creation of lossy compressed digital audio formats.

As for internal factors, improvements in computing and coding capabilities and progress in psychoacoustic models have promoted the emergence of various lossy compressed digital audio formats. Some of the audio formats most commonly used in MP3 players are briefly introduced below: MP3 (CBR, VBR, ABR), WMA, WAV, ADPCM, and the emerging audio formats AAC, ASF, and OGG.

Before introducing various digital audio formats, let’s clarify one concept: bitrate.

In computing, all information is digitized. A bit is the smallest unit of data in a computer: a single binary digit, either “0” or “1”. For example, a 2-bit number is a two-digit binary number with 4 possible combinations, “00”, “01”, “10”, and “11”, which represent the four numbers 0, 1, 2, and 3.
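
The same enumeration can be written as a short Python sketch (the helper name `bit_patterns` is hypothetical). It also shows why each extra bit doubles the number of values that can be represented.

```python
from itertools import product


def bit_patterns(n_bits):
    """All binary strings of length n_bits, in counting order."""
    return ["".join(bits) for bits in product("01", repeat=n_bits)]


patterns = bit_patterns(2)
print(patterns)                       # ['00', '01', '10', '11']
print([int(p, 2) for p in patterns])  # [0, 1, 2, 3]
print(2 ** 8, 2 ** 16)                # 256 65536 (levels for 8-bit and 16-bit samples)
```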