There are several types such as WAV, MP3 and FLAC, but what is the difference?


Free Download Mp4Gain
picture

There are several types such as WAV, MP3 and FLAC, but what is the difference?

Music File Formats

Comprehensive research on file formats
Do they like music to all?

audio file formats

I think many people enjoy music by downloading or playing streaming instead of CD these days, but what kind of format music is stored as data?

I researched the differences, advantages and disadvantages of each format.

Types of audio compression

There are three types of music file formats: “Not compressed”, “compressed with loss” and “compressed without loss”.

File-based compression can reduce the size of the file and reduce the download time and storage capacity.

“Uncompressed format”
As its name indicates it, it is an uncompressed file. The point are the original data.

The advantage is the accuracy of the data. However, it also has the disadvantage of a large file, which uses about 10 MB for a 1-minute audio file.

The most used are “WAV” and “AIFF”. It is common to use “WAV” for Windows and “AIFF” for Macintosh.

“Compression format with loss”
This compression format is probably the one that most uses.

Although there are individual differences, it is said that the human audible range is 20 Hz to 20000 Hz. This file format has the advantage that the file size can be deleted while maintaining a certain level of sound quality when it is removed and compressed Other parts difficult to identify.

Even so, the disadvantage is that the sound quality is lower than the original data. It is not exaggerated to say that “MP3” is the best-known file format. As successors, “AAC” is often used, which was created to achieve higher sound quality. In addition, “WMA”, “Vorbis”, etc. They are also in this format.

“Format of compression without loss”
This format compresses the original data while preserves them.

During playback, you can decompress and return to the original uncompressed format, so the sound quality is the same as that of the original data. Although the file size can be reduced compared to the uncompressed format, it is still about half, so the size of the file is greater than that of the compressed format with loss and takes more time coding and decoding.

In addition, it is currently not handled by the main music distribution sites, and the number of devices that can be reproduced is less than that of uncompressed formats. The most common of these formats is “FLAC”, and Apple uses a single format called “ALAC”.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

What is the difference between MP3 and AAC? Part 3

What is the difference between MP3 and AAC? Part 3

aac vs mp3

Finally, let’s review the important MP3 and AAC compatible devices.

 

MP3 o AAC

MP3s are compatible with almost all music-playing devices, such as personal computers, smartphones, and audio devices. Therefore, it is generally better to save the file in MP3 format.

As for AAC, it is a recommended storage format for iPad / iPhone users and those who use iTunes, Apple’s official music player, because it is compatible with Apple devices. Even if you import the sound source from a CD with iTunes or purchase paid music content from iTunes, it will be saved in AAC format.

Which is better, MP3 or AAC?

Unsplash logo Photo by marksolarski
In this article, I explained about MP3 and AAC music file storage formats. Finally, if you do not know what is the storage format that I should use, to leave and save the music files in MP3 format, it can be said that it is the best. MP3s have many playable devices and the sound quality is not that different from AAC.

Also, files saved in MP3 format can be easily converted to AAC files using iTunesw. When saving as MP3, it can also support post file conversion, so if you have problems with the save format, it is better to select the MP3 format. Also, if you’re targeting Apple devices, try saving music by choosing an AAC format that doesn’t need to be converted.

What is the difference between MP3 and AAC? Part 2

What is the difference between MP3 and AAC? Part 2

AAC Vs. MP3

The disadvantage of the “lossy compression format” used by MP3 / AAC is that the compressed data file cannot be restored to its original size (the original sound quality of music content).

aac vs mp3

There is a music content compression format called “lossless compression format”, which has a relatively large data size and can restore the original sound quality when playing music files.

There is not much difference between the two compression formats, but if you want better sound quality, you should use the “lossless compression format”. In the next chapter, we will further compare the differences between MP3 and AAC formats.

Comparison of MP3 and AAC

Unsplash Logo Photo by duck58cth
We will compare MP3 and AAC in terms of sound quality and compression rate (bit rate).

Sound quality

As mentioned above, MP3 and AAC are compression methods that use the “lossy compression format”, so there is not a big difference in sound quality between the two formats. The sound quality of MP3 and AAC differs depending on the compression rate (bit rate) of the music content file. So what exactly is the compression rate (bitrate)? The next section describes (compression bit rate).

Compression rate (bit rate)

format Compression rate (bit rate)
64kbbp 96kbbp 128 kbbp 160 kbp 192 kbbp 256 kbbp 320kbbp

MP3 Upper Limit Frequency 8.3 kHz 11.7 kHz 15.2 kHz 18.6 kHz 21.3 kHz 22.0 kHz 22.0 kHz

MP3 size 123kB 184kB 245kB 308kB 372kB 494kB 616kB

AAC upper limit frequency 13.5 kHz 15.2 kHz 18.7 kHz 19.1 kHz 19.6 kHz 20.0 kHz 20.0 kHz

AAC size 127kB 188kB 249 KB 310kB 368kB 490kB 613kB

The comparison table above shows the compression rate (bit rate) of MP3 and AAC, and the upper limit of frequency and data size (capacity). The higher the value of the compression rate (bit rate), the higher the upper limit for frequency and data size (capacity), and the better the sound quality.

Although the compression rate (bit rate) of MP3 and AAC is the same, the data size (capacity) and the upper limit frequency are different, so please compare the sound quality using the table above as a guide.

Proper use of MP3 and AAC

In this chapter at the end of this article, we will introduce how to use MP3 and AAC correctly. Even if you read the comparative explanation of MP3 and AAC presented so far, many people may not get it right. How to use MP3 and AAC correctly should be judged by the compression rate (bit rate).

Bit rate

As discussed in the previous chapter, the higher the value expressed in “kbbs” called the compression rate (bit rate), the better the sound quality. However, MP3 and AAC have the same compression rate (bit rate) but different upper limit frequencies.

128 kbps to 160 kbps is adopted for general MP3 / AAC music files. This is because you can watch various music genres like J-POP and Jazz with high sound quality. You can fully enjoy music with your PC speakers and audio equipment.

Then I will explain the characteristics of the compression rate (bit rate) of 128 kbps or less and the compression rate (bit rate) of 160 kbps or more, and which file format should be used, MP3 or AAC.

Less than 128 kbps

A compression rate (bit rate) of less than 128 kbps does not improve the sound quality of music, etc., and is not suitable for the music content storage file format. Suitable as a conversation-focused radio sound source. Also, if it is less than 128 kbps, the AAC format has a higher frequency upper limit, so the sound quality is said to be better than MP3.

192 kbps or higher

The compression rate (bit rate) of 192 kbps or higher reproduces even delicate sounds like classical music. If you have a good ear, you can clearly tell the difference in sound quality.

When the compression rate (bit rate) is 192 kbps or higher, the MP3 format has a higher upper limit frequency and is said to have a higher sound quality. The data size (capacity) is almost the same as that of AAC, so it is recommended to save it in MP3 format.

Digital audio formats

Digital audio formats

Digital Audio

The digital audio format is a format for presenting audio data used in digital audio recording, as well as for additional storage of recorded material on a computer and other electronic media, so-called audio media.

digital audio

The audio file (a file containing a sound recording) is a computer file consisting of information about the amplitude and frequency of sound, saved for later playback on a computer or player.

Varieties of digital audio formats.

There are several concepts of audio format.

The digital representation of the audio data depends on how the digital-to-analog converter (DAC) quantizes. In sound engineering, two types of quantization are currently the most common:

pulse code modulation

sigma delta modulation

Quantization bit depth and sample rate are often specified for various audio recording and playback devices as a digital audio rendering format (24-bit / 192 kHz; 16-bit / 48 kHz).

The file format determines the structure and presentation characteristics of the audio data when stored on a PC storage device. To eliminate the redundancy of the audio data, audio codecs are used, with the help of which the audio data is compressed. There are three groups of audio file formats:

uncompressed audio formats like WAV, AIFF

lossless compressed audio formats (APE, FLAC)

lossy compressed audio formats (mp3, ogg)

Modular music file formats are highlighted. Created synthetically or from prerecorded live instrument samples, they are primarily used to create modern electronic music (MOD). Also, this can be attributed to the MIDI format, which is not a sound recording, but at the same time, using a sequencer, it allows you to record and play music using a certain set of commands in the form of text.

Digital audio media formats are used for both mass distribution of sound recordings (CD, SACD) and professional sound recording (DAT, minidisc).

For surround sound systems, sound formats can also be distinguished, which are mainly multichannel sound accompaniments for movies. These systems have complete format families from two major competitors, Digital Theater Systems Inc. – DTS and Dolby Laboratories Inc. – Dolby Digital.

The format is also called the number of channels in multichannel sound systems (5.1; 7.1). This system was originally developed for movie theaters, but has since been expanded for home theater systems.

What is digital audio and how does it work

What is digital audio and how does it work

Digital Audio

Regardless of the path chosen, after connecting the source, the sound from the source will be sent to a microprocessor called a digital audio converter (DAC for short), where there will be 2 stages:

Digital Audio

1) Conversion from analog to digital (a / d);

2) Conversion from digital to analog (d / a).

This processor is sometimes called an ad / da converter. Here, the analog audio signal is processed into digital, then redirected to the central processor and memory, and then to the storage medium. Stored digital recordings (often in .WAV format) are sent back to memory and the CPU, and then converted back to analog by the DAC.

The digital audio / MIDI sequencer allows you to record the sound of synthesizers, guitars, and microphones to files with the .wav extension. No matter how sound is transferred to the computer, it will still go to the DAC, computer memory, and hard drive. The resulting data type is called digital audio data. If you record in “CD quality” (among other things one of the lowest possible), every second of the sound is divided into 44,100 pieces. What is this data? Only numbers. But unlike the MIDI format that encodes the notes played, digital audio data is a digital representation of the actual sound wave. This is the same sound described in numbers. Can you guess that this format takes up thousands of times more space than midi data? This is true.

It is a graphical representation of digital audio data. For a computer, this is a sequence of numbers. With this data, you can perform various operations to change and improve. Outwardly, the signals appear to undergo a series of effects, but in reality what happens is a mathematical process.

How MIDI is converted to sound
You may be wondering how to convert MIDI to audio, is there a “convert” utility for that? Connect the output jacks of your synthesizer to your sound card (or audio interface, or mixer with firewire, etc.) and start recording. Analog waves go through a digital converter (DAC), are converted into numbers, and voila! you will receive digital audio data. The nice thing about a sequencer is that you first record a MIDI track and then refine it. in editors and translate it to digital audio for a perfect recording (well maybe not perfect, there is nothing perfect in the world). Yes; you are using synthesizer software, the process will be called slightly differently, but the gist is the same. The computer creates an audio track based on MIDI data and records it in audio format.

Time to process the resulting files perfectly in sync with plugins or effects. You can also save the finished tracks in MIDI format (then you can edit them at any time) and add the sound of vocals, guitars, or whatever else you want. The sequencer can work simultaneously with MIDI files and digital audio.

Effects types
One of the main and most used effects is VIBRATO.
Distinguish amplitude vibrato, when the amplitude of the signal changes periodically. The frequency of change should be small, from a few fractions of a hertz to 10-12 Hz. Tremolo is a type of amplitude vibrato. The frequency of vibration in the case of a tremolo is not less than 10-12 Hz, and the resulting signal is output in portions.

Frequency vibrato. In a non-electronic way, it was done with electric guitars. By changing the tension of the strings with a special lever, the musician changes the pitch (understand – frequency) and achieves the effect of frequency vibrato. The same can be done with synthesizers and midi keyboards using a special wheel or lever. In music editors, you can also adjust the frequency of the sound, change it within the specified or desired limits.

Ring vibrato. The signal passes through a filter, the settings of which are periodically changed. An interesting and beautiful sound is obtained due to periodic changes in the coloration of the timbre.

Effects: Reverb, Chorus, Flanger, Phaser, Delay: effects based on the delay of the signal.

Reverberation: the effect is created by mixing the main signal with copies lagged for different periods of time, obtained as a result of the reflection of various obstacles (walls, objects, etc.) The number of copies can be infinite, the reflected signal can return to reflected from another obstacle (the delay increases naturally) and again summarized with the main one. With a short delay, the effect results in an immersive and booming sound experience. .

Benefits of “digital audio”

Benefits of “digital audio”

Digital Audio

The digitized audio signal has the following advantages:

DIGITAL AUDIO

-the possibility of infinitely long storage without loss of original quality,

-the ability to reproduce for a long time without losing the original quality,

-the possibility of infinite reproduction without loss of original quality,

-simplicity and wide possibilities of processing by modern means,

-Resistance to interference in signal transmission lines.

From CD to Super Audio CD and DVD Audio

CD (Compact Disk) is a type of removable plastic disk with optical reading of information.

In 1979, Sony and Philips proposed the Red Book standard for digital audio recording.

Analog sound is digitized and recorded as a spiral track of alternating zeros and ones (micron holes and a smooth surface) on a 12 cm polycarbonate disc, slightly thicker than a millimeter, covered with the thinner layer gold (later aluminum).

The player’s laser illuminates the disc and detects binary “zeros” and “ones”, which, after processing, are converted back to sound. It is almost impossible to mistake zero for one. Possible problems associated with read errors and scratches on the disc surface were compensated for using digital error correction.

As a result, not only did the physical dimensions of the record holder decrease compared to vinyl record, but also the musical capacity increased significantly: up to 74 minutes (the then owner of Sony wanted his favorite Beethoven Ninth Symphony to fit into a disk).

In 1982 in Langenhagen (Germany) the mass production of compact discs (CD) began with the “Alpine Symphony” by I. Strauss.

Real

High-quality audio is now recorded in Super Audio CD and DVD Audio formats, which:

use a DVD media,

use multichannel recording (up to 5.1),

sampling rate up to 192 kHz,

quantization level: up to 24 bits (each bit doubles the precision of sound transmission and, at such a depth of quantization, the dynamic range of the reproduced sounds can exceed 130 dB).

The new recording formats offer the highest quality, are expensive ($ 15 per disc), and are not popular because most listeners, sadly, don’t care too much about sound quality.

Digital audio options

The important parameters of the digital representation of sound are the sample rate of the audio signals and the quantization of bits.

Quantization rates indicate how many times per second a signal is sampled (measured in amplitude) for conversion to digital code.
For CD standard it is 44KHz (44 thousand times per second), for SACD 192KHz

The quantization bit characterizes the number of signal steps and is measured by the power of 2.

For the CD standard, 16-bit audio adapters are used, which have 65,536 quantization steps (2 to the 16 power), as in an audio CD. For standard and 24-bit SACD.

Digital audio storage

About digitizing sound has a set of signal amplitude values ​​taken at regular intervals and can be written to file sequence numbers (amplitude values).

Two methods are widely used to encode audio information:

PCM (pulse code modulation)

ADPCM (Adaptive Relative Pulse Code Modulation)

PCM (Pulse Code Modulation) is a method of digitally encoding a signal by recording the absolute values ​​of the amplitudes. This is how data is recorded on all audio CDs.

ADPCM (Adaptive Delta PCM) – Records signal values ​​in relative amplitude changes (increments), allowing you to simplify data to take up less memory.

Lossless encoding (for lossless data odirovanie) allows data recovery from fully compressed (20-50%) stream.

Popular L ossless encoding algorithms:

Windows Wave (WAV) is the primary audio file format for Windows.
The Audio Interchange File Format (AIFF) is the primary audio format for the Macintosh.

L ossy encoding (lossy data encoding) enables you to achieve sound similarity of the reconstructed signal to the original with the highest possible data compression (10-1 5 times).

The basis of lossy-encoders is the use of psychoacoustic models: certain portions of the signal, in certain frequency ranges that are inaudible to the human ear, nuances (masked or inaudible frequencies) and occurs to remove them from the original signal.

Digital audio

Digital audio

Digital Audio

what happens to sound within computer programs

Digital Audio

Digital audio is a representation of analog sound used by computers and various digital devices to record and reproduce audio information. Like the frames of a movie, a digital audio signal is created from a series of sound fragments that are played when we press the play button. There are many different digital audio formats, they differ from each other in the transmission quality of the audio information.

About Pulse Code Modulation – PCM

If we talk about an acoustic sound or an analog signal, we are always talking about the propagation of sound waves in space. Whereas digital audio is only a rough description of what happens to sound or should happen within computer programs or digital devices.

This article will discuss pulse code modulation (PCM), the most common digital audio decoding system. Besides PCM, there are also DTS and Dolby Digital systems, but these are mainly applicable in the field of film and video production. Today we will not talk about them.

In pulse code modulation, a signal is read many times per second. At each reading moment the amplitude of the sound wave is recorded and reproduced. As mentioned above, a digital signal is just a rough copy of an analog signal, since an analog wave cannot be recreated with perfect precision. The values ​​of each fragment are rounded to the nearest most accurate, then all the fragments are played and we hear a copy of the original analog sound.

“What meanings are we talking about?” – you ask. Just as analog audio is defined by frequency and amplitude, digital audio is determined by two important values: the sample rate and the bit depth. The sample rate means how many times per second the fragments of the audio signal are read, and the bit depth is the value of the dynamic range of each fragment of the audio signal.

Sampling rate

The standard 44.1 kHz sample rate used for recording audio to CDs (remember those?) Might seem like a random number. But this is not the case at all. This value was chosen based on Kotelnikov’s theorem, which essentially states that the sampling frequency must be more than 2 times higher than the maximum value of the reading frequency. As you know, the upper limit of audibility of the human ear’s frequency range is 20 kHz. It turns out that the sampling frequency must be higher than 40 kHz. An additional 4.1 kHz is added to avoid distortion, the so-called aliasing effect. In theory, 44.1 kHz should be sufficient to accurately reproduce an audio signal, however there are higher values.

For example, 48 kHz is the dominant standard in film and video production. As in the case of cinema, sound is synchronized at a frame rate of 24 frames per second. We won’t go into the details of why exactly 24 frames per second was chosen, in other words, this is the minimum frequency at which we can see a smooth, eye-pleasing image. The sample rate must match this frame rate. Using a frequency of 44.1 kHz can cause a noticeable out of sync of the picture and sound. Again, based on Kotelnikov’s theorem.

Even higher sample rates are repelled by these two base frequencies of 44.1 or 48 kHz, multiplying them by multiples of 2. That is, 88.2, 96, 192 kHz are the standard sample rates for all audio equipment. modern audio.

Bit depth

The bitness or bitness of an audio file tells us about its dynamic resolution or, more simply, clarity. You can draw an analogy with digital photography: the higher the resolution of the photo, the clearer and better the image will be.

It is important to note here that we are not talking about the loudness of the signal, but about a more realistic, clean and clear sound. More accurate transmission of the audio signal.

Bit depth can be compared to text in the book. The lower the bit depth, the less meaningful the text will make. That is, lowering the bitness leads to the fact that some letters begin to disappear from words, punctuation marks from sentences. At the moment, we will still be able to grasp the meaning of the text, but if the bit depth continues to decrease, the information will become so distorted that we simply stop understanding what we are talking about. The same goes for sound: the lower the bit depth, the more distorted we hear the sound.

MP3: the digital audio revolution

Perhaps not many people know that in 1992 a silent and unstoppable revolution of digital audio began for mass, until then essentially represented by CD-Audio. This was, in fact, the year that the algorithm underlying the MP3 format was born by the Fraunhofer-Institut für Integrierte Schaltungen (IIS).

Mp3

Part of a European research project called EUREKA, which started in 1987 and ended in 1994, the then-MPEG 1 Layer 3 was one of the most important and mature fruits in the field of psychoacoustic compression algorithms. This family of compression algorithms, whose first studies date back to 1979 by Manfred R. Schroeder, German physicist at AT & T-Bell Labsc, aims to reduce the amount of information capable of describing an audio sequence, from the assumption that the human ear, fortunately for us, is not perfect. The basic idea is to exploit the inability of the man’s auditory system to recognize certain sounds and frequencies, when they are masked by others.

MP3

Audio masking is detected at two levels: frequency and temporal masking. To explain the principle quickly, let’s take an example: in the presence of two tones, depending on their frequency and intensity, our ears will be able to recognize both or only one.

In the latter case, we have a frequency masking, and therefore information related to the least audible tone can be discarded. What happens, however, if the most intense tone is lost? It will happen that the tone that was not noticed before, will now return to the foreground. However, for the hearing system to notice, time will inevitably pass, because the membrane needs to stop vibrating and readjust.

We speak, of course, of times in the order of milliseconds, which are however precious, because the sound that falls within this time will be cut by the compression algorithm and, consequently, will help to reduce the amount of information necessary to describe what is audible.

The first MP3 encoder, called l3enc, was released by the Fraunhofer Society on July 7, 1994, while the MP3 extension was officially born on July 15 of the following year.

Those who lived through this time know that we are talking about years in which ADSL did not exist, hard drives were a few hundred MB in size, and in general, both from the point of view of communications and data storage, the figures they were far from being as generous as they are today. With these limitations in mind, I want to remind you that an uncompressed audio file in PCM WAV format, with a resolution of 44 kHz and 16 bits, stereo, as required by the CD-Audio standard, has a bit rate equal to 1411.2 kbit / s. This means that if you want to rip a song from an audio CD on your hard drive, the occupied space in uncompressed WAV format is approximately 10MB per minute. Today perhaps it would not be a problem to have this space, but in the mid-nineties it was a notable limitation.

The compactness of the MP3 format combined with the more than acceptable quality (a very optimistic estimate is a bit rate of 128 kbit / s to obtain a quality comparable to CD-Audio), made it in a few years the vehicle of transmission par excellence for music. The milestones that contributed to this unstoppable technological success were the launch of the Winamp player software by Nullsoft in 1997, and the arrival on the market just one year after the first portable media players: the MPMan F10 from Eiger Labs and the Rio PMP300 from Diamond. Multimedia.

Finally, it is impossible not to mention the birth of peer-to-peer networks aimed at exchanging MP3 files with Napster, one of the most famous applications in history, both for the innovative service that was made accessible and for the inevitable judicial events that followed and which decreed its closure in 2001.

In the same year, another symbol of the multimedia revolution, the result of the same technological horizon drawn by the MP3 format, appeared on the market: the Apple iPod.
Continuing until today we find, in parallel with the birth of new and more efficient compression formats, increasingly evident examples of the revolution, also social and commercial, that led to the arrival of the MP3 format.

There was a time when playlists were decided exclusively by record companies that were mixed into albums with mediocre songs, greatest hits; Today you can create your favorite playlist, selecting the songs and the order of play without any difficulty.

DIGITAL AUDIO explained

Audio is the electronic information that represents sound, or rather, having sound of a temporary nature is the flow of information that represents it.

Sound is made up of pressure waves traveling in space, therefore it is represented by a sinusoidal.

Digital Audio

The characteristics of a sound are:

Amplitude: Measured in Hertz (Hz) and determined by the frequency of a sound, the higher the frequency, the louder the sound, the lower it is, the lower the sound.

Intensity: it is measured in decibels (db) and is determined by the power of a sound, the more intense a sound is, the greater its volume.

Duration: It is measured in seconds (s) and dermal how long a sound lasts over time.

Timbre: It is not directly measurable, but it is that sound parameter that allows us to distinguish a trumpet from a drum. It constitutes the trace of a sound and is characterized by harmonics.

digital audio

ANALOGUE AND DIGITAL

There are two different ways of representing sound as electronic, analog and digital information.

Analog audio was the first, in chronological order, to be developed.

The information varies similarly to the information it represents and can (in theory) assume any value.

If we greatly expand the sine wave that describes an analog sound, we would see that it is a continuous line without interruptions.

Instead, digital audio is encoded with a number system, which allows discretization (transition from analog to digital), during this step information is lost, but once the sound is written as a series of numbers (digital information) it is possible to reproduce it. , transmit and modify it without losing anything in terms of quality, which is impossible with analog information.

If we greatly expand the sine wave that represents a digital sound, we would realize that it is not a continuous line as in the previous case, but a series of points very close to each other.

The amount of these points in one second of information will define the “sampling frequency”.

The amount of information that each point can contain is called “bit depth”.

THE CHARACTERISTICS OF DIGITAL SOUND

Sampling rate

Determine the number of samples contained in one second of information.

It is expressed in hertz (Hz) and generally assumes the following values ​​in the musical field: 22050Hz, 44100Hz, 96000Hz.

According to Nyquist’s theorem, each sampling frequency can record and reproduce sounds that have a maximum frequency equal to half of the chosen sampling frequency, this means that a piece sampled at 44Mhz can assume values ​​of up to 22Mhz only

Bit depth

Determine the amount of information contained in each sample.

It is expressed in Bit (bit) and generally assumes the following values ​​in the musical field 8Bit, 16Bit and 24Bit.

Above all, this is the parameter that depends on the quality of a sound.

Transmission rate (bit rate)

It is a characteristic of codecs, that is, of the “machine language” used to describe a sound.

Sets the total amount of information needed to play a second of a sound.

It is expressed in Bit / s.

AUDIO PROCESSING

Whether you’re talking about studio recording or live performances, the audio signal is never sent directly from the microphone to the speakers / recording medium, but is always processed first, through tools that allow you to perform different interventions. in the sound

These instruments can be analog, therefore they have the instrument physically in the studio (which is usually inserted inside a shelf), which must be connected between the microphone and the mixer or between the mixer and the speakers / recording medium.

Or you can simulate them through some plugins for your computer.

It is necessary to have a Daw (Digital Audio Workstation), which is the workspace in which all editing operations are performed. (Ableton, Cubase, Fruitloops, Logic, Reaper).

Within this software it is possible to install smaller ones, called VST (Virtual Studio Technology) that simulate the circuits of the studio equipment, emulating the effect.

(There are also other proprietary plugins with extensions other than the classic VST like .component or .au).

Some tools are essential and are used in all audio recordings, others are used only in particular situations or to obtain / avoid certain effects.

The main ones are:

Equalizer, is used to emphasize or attenuate some frequencies, this way you get a cleaner sound and a less “mixed” mix where all the instruments occupy only the correct frequencies, without overlapping.

The compressor, as the name suggests, serves to compress the dynamic range, so that the sound is more consistent and less dispersive.

Amp, wavering of different kinds, is used to increase the intensity of a sound.

Limiter works in a similar way to the compressor, but instead of compressing all frequencies, it attenuates those that exceed a predetermined threshold (threshold), avoids entering faults.

Reverb adds a slight reverb that makes a sound recorded in a soundproof studio much more natural than it would be too “dry”.

Filters (high / low cut) allow you to cut some useless and sumptuous frequencies too low or too high. (They are just 1 band parametric equalizers).

Digital audio formats on the network

Digital audio formats on the network:

WAV: Waveform files (or simply wave) are the most common sound formats on Windows platforms. WAV files can also be played on Mac and other systems with player software.

MPEG (MP3): The Motion Pictures Experts Group (MPEG) format is a standard format with significant compression capability. MPEG level 3 or MP3 files are frequently used for web music distribution. However, due to their size, MPEG files must be downloaded completely before playing them.

RealAudio (.rm): Real Audio is the technology that currently predominates on the Web. You need a proprietary player, but the basic versions of the player are available for free.
MIDI: The Musical Instrument Digital Interface format is not a digital audio format. It represents notes and other information so that music can be synthesized. MIDI has good support and its files are very small, but it is only useful for certain applications because of the quality of its sound when played on PC hardware.

AU: The u-law format is one of the oldest sound formats on the Internet. Players are available for almost all platforms.

RMF: The Rich Music Format supported by Beatnik (www.beatnik.com) is a high quality audio format, primarily for “download-and-play”, which is becoming increasingly popular.

AIFF: The Audio Interchange File Format is very common on Macs. It is widely used in multimedia applications, but it is not very common on the Web.

Flac: Free Lossless Audio Codec (FLAC) (Lossless audio compression codec) Ogg project format without loss. The initial file can be completely recomposed with the disadvantage that the file occupies much more space than would be obtained when applying lossy compression or Lossy.

Digital audio on the network:

The digital sound is measured by the sampling frequency, or how many times the sound is digitized over a certain period of time. The sampling frequencies are indicated in kilohertz (kHz), which indicate the number of times the sound is sampled per second. The CD sound quality is obtained with 44.1 kHz, or 44,100 samples per second. For stereo sound, two channels are required, each 8 bits; At 16 bits per sample, this results in 705,600 bits of data on a CD, producing high quality sound, at the request of the end user. In reality, the transmission of this amount of data would occupy almost half the bandwidth of the T1 network. As the average user of the Web does not have this bandwidth, another solution is necessary. One possible solution is to decrease the sampling rate when digital sound is created for sending through the Web. A sampling frequency of 8 kHz, in mono, would produce acceptable results for simple applications, such as language, especially if we consider that the playback hardware generally consists of a combination of a simple sound card and a small speaker. Low quality audio does not require more than 64,000 bits of data per second, but the end user still has to wait to download the sound. Modern users need several seconds to receive, even in the best conditions, a single second of low quality sound, making continuous sound impossible.