What is Audio Normalization?


Free Download Mp4Gain
picture

What is Audio Normalization?

Audio Normalization
Audio Normalization

Audio normalization is the process of adjusting the volume of an audio file to a desired level without changing its dynamic range, unlike compression that changes volume over time in varying amounts. There are two main reasons to normalize audio: getting the maximum volume and matching volumes. The first reason is when you have a quiet audio file and you want to make it as loud as possible (0 dBFS) without changing its dynamic range, and the second reason is when you have a group of audio files at different volumes, and you want to make them all as close as possible to the same volume.

Audio Normalization
Audio Normalization

Peak volume detection is the method of measuring the volume of audio that only considers how loud the peaks of the waveform are for deciding the overall volume of the file. This is the best method if you want to make the audio as loud as possible. RMS volume detection considers the overall loudness of a file, and it takes an average and calls that the volume. This method is closer to how the human ear works and will create more natural results across varying audio files.

The new standard in broadcast audio, EBU R 128 volume detection, is similar to RMS but can be thought of as emulating a human ear. It listens to the volume intelligently and thinks about how we will hear it. It understands that we hear frequencies between 1000 – 6000 Hz as louder and takes that into account.

Normalization can be performed in an audio editor or inside a DAW, but it is a destructive process that can change the sound quality of the file. This was a bigger issue when digital files were all stored as 16 bit. If you turned the volume down, you effectively reduced the bit depth. Your CD-quality 16-bit file could end up 12-bit or less, even if you turned it up with peak normalization. Nowadays, audio editing software works internally at a much higher bit depth, often 32-bit floating point, which means that calculations are done more accurately and affect the sound quality far less. To take advantage of the high quality of high bit depth inside audio editing software, it is essential to keep the file at the higher resolution once it has been processed. Finally, peak normalization to 0 dBFS is a bad idea for any parts to be used in a multi-track recording, as it may overload DAW or plugins.

What is RMS?

RMS stands for Root Mean Square and is a measure of the average power of a signal. It’s commonly used in electrical engineering and other fields that deal with signals, such as audio processing.

To calculate the RMS value of a signal, you first square each value in the signal and then take the average of all the squared values. Finally, you take the square root of that average. Mathematically, it can be expressed as:

RMS = sqrt((1/N) * sum(x^2))

Where N is the number of samples in the signal and x is the value of each sample.

The resulting RMS value represents the equivalent DC voltage that would produce the same amount of heat in a resistor as the original AC signal. In other words, it’s a measure of the signal’s power level.

RMS is particularly useful when dealing with signals that have both positive and negative values, as it takes into account the magnitude of both. It’s also commonly used to specify the power of audio signals, such as in the specification of the power output of an amplifier.

Overall, RMS is a useful tool for understanding the power level of signals and can help in the design and analysis of electrical and audio systems.

 

What is Bit Depht?

Bit depth refers to the number of bits used to represent the amplitude of an audio signal. In digital audio, the amplitude is quantized into a finite number of levels, which are then represented by binary numbers. The bit depth determines the number of possible levels, and therefore, the resolution of the digital signal.

For example, with a bit depth of 16 bits, there are 2^16, or 65,536 possible levels. With a bit depth of 24 bits, there are 2^24, or 16,777,216 possible levels. This means that a higher bit depth provides a more accurate representation of the original analog signal.

The bit depth of an audio signal affects its dynamic range and signal-to-noise ratio. Dynamic range refers to the difference between the loudest and softest parts of the signal, while signal-to-noise ratio refers to the ratio of the signal to any background noise present.

With a higher bit depth, the dynamic range is increased, allowing for a greater difference between the loudest and softest parts of the signal to be accurately represented. Similarly, a higher bit depth also increases the signal-to-noise ratio, since there are more levels available to represent the signal and less quantization noise is introduced.

However, a higher bit depth also requires a larger data rate and storage space, and may not be necessary for all types of audio signals. For example, speech and other types of less complex signals may not require a high bit depth, while music with a wide dynamic range and complex sounds may benefit from a higher bit depth.

In summary, the bit depth of an audio signal determines the resolution of the digital signal and affects the dynamic range and signal-to-noise ratio. A higher bit depth provides a more accurate representation of the original analog signal, but also requires a larger data rate and storage space. The appropriate bit depth for a given audio signal depends on the complexity of the signal and the desired quality.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

DIFFERENCES BETWEEN NORMALIZE AND MASTERIZE

The process and the differences between normalizing and mastering are often confused. Although it may seem to be the same, it is not.

analog recording

Mastering can be of crucial importance according to which processes, for example: in musical matters, there are mastering engineers who are dedicated exclusively to that.

That does not mean that we cannot learn or acquire the necessary knowledge to be able to properly use some processing effect or some plugin in an appropriate way to be able to get more out of our audio file.

But you have to keep in mind that this audio processing helps your audio montage, song … sound with more punch, more strength, more energy, have more life.

16 bits vs 24 bits

Is mastering compressed or limited?

Rather those two processes and some more are done.

Its mission is to maintain the same volume amplitude throughout the audio file, that is, it compresses when it has to compress and limits when it has to limit.

I’m going to give a rough example of what manual mastering would be like.

Can you still imagine the sound technician who detects when the signal volume is too high (the singer gets too close to the microphone, shouts …) and lowers the fader. Or the opposite case, when it detects the low volume (the singer moves too far from the microphone, does not speak with enough force …) and raises the fader. Always trying to maintain the same volume amplitude.

I’m going to give you a homemade definition: “lower what is high and raise what is low”.

As before it was an invented example, to do the job of processing the sound we regulate the different parameters available to the “processor” (Mastering is also called “processing” since in the past a device called “processor” was used which comes from “dynamics processor”). These parameters are:

The threshold (threshold): fundamental characteristic of the compressor that represents the point or level from which if the volume of the sound exceeds or lowers it, the dynamics processor is put into operation.

Ratio (Attenuation or Gain Ratio): Defines the amount of attenuation or gain that is applied to the signal. At noise gates the attenuation can be preset so that it really is a mute.

Attack time: This is the time it takes for the signal to attenuate, limit, mute or amplify. In general, slower times work best at low frequencies and fast ones at high frequencies. When processing a signal containing all frequencies, a compromise situation is forced.

To maximize the energy of the signals, particularly in broadcasting applications, there are multiband compressors that divide the spectrum into several bands and apply different times to each.

Release time: It is the opposite of the attack time, that is, the time it takes to go from the state where the processing is running to rest. They are usually longer times than those of attack.

Hold (maintenance time): Specifies the minimum time that processing will take place.

Stereo link (stereo link): With dynamics processors in general when used to process a two-channel (stereo) signal, it is necessary to link the processing action of both channels to happen on both at the same time. Otherwise, the sound image will be confusing and changing from the center to one side or the other.

Automatic: This function allows you to control any of the parameters listed automatically depending on the characteristics of the signal.

By pass (deactivation): Activating it allows you to hear the unprocessed signal, while if it is not activated you hear the processed signal.

Normalization is a process by which the highest peak is sought and reduced or increased (dB) as adjusted. Never pass the 0dB in normalization or mastering, because then it would be itching “clipping”.

If when capturing the sound, the highest peak (in amplitude of volume) is close to 0 dB. Normalization will have no effect. On the other hand, if we process it with the presets of some “Dynamics Processor” effect, it will be noticed that the wave gets fatter where it is thinner and will become thinner where it is thicker