Misconceptions about digital audio


Free Download Mp4Gain
picture

Misconceptions about digital audio

Digital Audio

The higher the bitrate, the better the track

This is not always the case. For starters, let me remind you what bitrate t (bitrate, instead of bitraid). In fact, this is the data rate in kilobits per second during playback. That is, if we take the size of the track in kilobits and divide it by its duration in seconds, we get its bit rate, the call. File-based bitrate (FBR), usually not too different from the bitrate of the audio stream (the reason for the differences is the presence of metadata on the track: tags, “embedded” images, etc.) .

Digital audio

Now let’s take an example: the uncompressed PCM audio bit rate recorded on a normal audio CD is calculated as follows: 2 (channels) × 16 (bits per sample) × 44100 (samples per second) = 1411200 (bps ) = 1411.2 kbps … Now let’s grab and compress the track with any lossless codec (“lossless” – “lossless”, that is, one that does not lead to data loss), for example, the FLAC codec. As a result, we will get a lower bit rate than the original, but the quality will remain unchanged; here is your first rebuttal.

Something else is worth adding here. The lossless compression output bitrate can be very different (but is generally lower than uncompressed audio); It depends on the complexity of the compressed signal, or rather on data redundancy. So simpler signals will compress better (ie we have smaller file size for the same duration => lower bitrate), and more complex signals will be worse. That’s why lossless classical music has a lower bitrate than, say, rock. But it must be emphasized that the bit rate here is in no way an indicator of the quality of the sound material.

Now let’s talk about lossy compression. First of all, you need to understand that there are many different encoders and formats, and even within the same format, the encoding quality for different encoders can differ (for example, QuickTime AAC encodes much better than outdated FAAC), not to mention the superiority of modern formats (OGG Vorbis, AAC, Opus) in MP3. Simply put, from two identical tracks encoded by different encoders with the same bit rate, some will sound better and some will sound worse.

Also, there is upconversion. That is, you can take a track in MP3 format with 96 kbps bit rate and convert it to 320 kbps MP3. Not only will the quality not improve (after all, data lost during the previous 96 kbit / s encoding cannot be returned), it will even get worse. It’s worth noting that at each lossy encoding stage (at any bit rate and any encoder), a certain amount of distortion is introduced into the audio.

And even more. There is one more nuance. If, say, the bitrate of an audio stream is 320 kbps, this does not mean that the 320 kbps was spent encoding that very second. This is typical for constant bit rate encoding and for those cases where a person, hoping to get the highest quality, forces a constant bit rate too high (for example, setting CBR to 512 kbps for Nero AAC ). As you know, the number of bits assigned to a particular frame is regulated by the psychoacoustic model. But in case the allocated amount is much lower than the set bitrate, even the bit deposit is not saved (for terms see the article “What is CBR, ABR, VBR?”) – as a result, we get useless “zero bits” that simply “wrap up” the frame size to the desired one (that is, increase the size of the stream to the specified size). By the way, this is easy to check: compress the resulting file with a filing cabinet (preferably 7z) and look at the compression ratio – the more, the more zero bits (as they lead to redundancy), the more space wasted.

Lossy codecs (MP3 and others) can cope with modern electronic music, but cannot efficiently encode classical (academic), live and instrumental music.
The “irony of fate” here is that, in fact, everything is the exact opposite. As you know, academic music in the vast majority of cases follows melodic and harmonic principles, as well as instrumental composition. From a mathematical point of view, this leads to a relatively simple harmonic composition of the music.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

Choose the correct audio format

Digital music: audio formats and their basic differences

Digital audio

The formats used to be clearly specified by the player. Those who had a VHS player bought VHS cassettes and those who had a Betamax payer, well, they were unlucky. It was similar a few decades later with Blu-ray and HD-DVD. If you could bet on the wrong horse with the respective playback devices, at least the purchase decision regarding the individual media was clearly defined. In the age of digital music, one has the advantage of a nearly universal player in the form of a computer and huge media libraries, but even more difficult because choosing the most sensible format in which to buy or convert your music is more versatile.

Digital Audio

What points determine the choice of the correct audio format?

First of all, of course, it should be noted that not all programs can play all formats. But especially DJ programs like Traktor or Virtual DJ deal with a variety of formats, which doesn’t make the decision for you at first and requires knowledge of other factors. The question of the correct format is particularly important for DJs, because individual formats differ significantly in terms of handling and quality! So now we want to explain to you where the differences lie between individual audio files so that later you can decide which format is the most suitable for you! We limit ourselves to the six common formats MP3, AAC, WAV, AIFF, FLAC and ALAC.

“To compress an MP3 file, what humans cannot hear is simply cut off.”

A distinction must first be made between simple files and cabinet files. Individual files contain little information beyond the song. Cabinet files are individual file packages that together form a meaningful whole. Here, for example, song texts or album covers, including the actual audio file, can be put together in one package. Additionally, there are different audio tracks that can be contained as individual files within the container, allowing for more accurate use of the audio material.

To individual audio formats: outdated variants

Everyone knows: MPEG1 Audio Layer III or just for short: MP3. The format developed by Moving Experts Group uses psychoacoustic findings to compress the original file. In other words: what the person doesn’t hear is simply cut off. Unfortunately, since this is only what humans with primitive audio technology cannot hear, the format not only requires little hard disk space, but also offers little acoustic enjoyment – loss of important audio information is characteristic of MP3.

In addition to the advantage of the small file size, the outdated format has the main disadvantage of clipped sound quality. What cannot be heard on small, private systems is quickly noticeable at clubs or festivals. The “thump” is missing because the dynamics of some frequencies are cut off, which means that the energy of the track does not reach the listener. If you still want to use MP3, you should definitely opt for encoding with 320 kBit / s, the maximum data rate supported by the MP3 format.

Another lossy format is AAC (Advanced Audio Coding) and it also comes from the ranks of the Moving Picture Experts Group. Similar to MP3, but with the help of a different technology, the audio signal is compressed simply by filtering out what the human ear presumably cannot perceive. AAC also saves a lot of storage space. However, thanks to the improved technology, it is possible to produce a significantly better sound experience than that reserved for MP3 even at lower data rates.

The most accurate error correction and the most efficient encoding algorithms create this superiority over an MP3 file with a comparable data rate. The efficiency of the algorithms is not only noticeable in the sound: with the same audio quality, AAC files are about a quarter smaller than their counterparts in MP3 format.

Why does digital music need to be normalized?

Why does digital music need to be normalized?

For younger consumers, the focus is often on the computer, which plays MP3s through the PC’s speakers. “They’re made to rumble a lot during games,” says “c’t” expert Zota. This can be useful when reproducing the explosions in a shooting game. However, when listening to music, such boxes disappoint.

Digital Music

Other consumers use their iPod with clip-on speakers, and mini systems like Bose’s “Wave Music System” are enjoying best-sellers. Of course, they cannot match the tonal volume of a full floor standing speaker.
monitor

Digital music

Those who decide to buy a high-quality music system generally turn to home theater systems. These are multi-channel systems with up to eight speakers and multiple power amplifiers. Their specialty is DVD playback, where they evoke powerful bass thanks to the subwoofers.

The viewer also physically experiences an earthquake in the movie because the shelves begin to shake. Solo: Compared to pure stereo systems, some home theater systems are disappointing. Some subwoofers are too inaccurate to play music. Above all, the quality is significantly more expensive compared to stereo systems. “The budget has to be divided into many more individual parts than with a stereo system,” says Besic, specialist in “Stereoplay”. For 1000 euros there is a decent stereo, but only a lousy home theater system. According to GfK, Germans spend an average of just over 400 euros on complete home theater systems, and 800 euros if these consist of the individual components of an amplifier, CD player and speaker cabinets.

Music producers flatten recordings

But it’s not just bad speakers that degrade sound quality. Music producers also contribute. They have been making their songs louder and louder since the mid-1990s. In pop, hip hop, rock, and electronic dance music, there are practically no quiet passages. At the same time, musical recordings have lost their dynamism. The mids are emphasized, but very high and fine sounds, as well as very deep bass, are often missing. The idea behind it: the songs should appear and assert themselves against loud advertising on the radio or background noise in the pub.

Additionally, sound engineers increasingly manipulate the sound of rock bands and pop singers with just a few clicks. Engineers use computer programs to smooth the edges and eliminate the smallest errors. For example, the pitch of the song is fine-tuned later; and hand-played drums sound accurate after computer processing, but like a machine and somehow always the same. Not much remains of the musicians’ own sound.

“In addition, the generally short time due to lower budgets also plays a role. In the past, you had much more production time, which of course was reflected in the end result in better quality and creativity, ”says Gerhard Wölfle, director of Dorian Gray Studios in Eichenau, near Munich. Wölfle has recorded CDs with the bands Guano Apes, Reamonn and The Donots. In the past, around six weeks of production time was the guideline for such albums. Today, studio professionals are satisfied when the music industry and artists spend half their time on them. Gerhard Wölfle says: “The excessive volume due to the massive use of compressors and limiters definitely gives many productions to the rest”.

An excellent example of an extremely loud album is the album “What People Say I Am, That’s What I’m Not” by English band Arctic Monkeys from 2006. The fully adjusted mix quickly rose to the top of audience favor. . The single “I bet you look good on the dance floor” (see the band’s MySpace profile) became a number one hit.

All this has generated a problem in matters such as the loudness of the music, which almost necessarily must be normalized to get them to sound at a similar volume.

Mp4Gain is the perfect choice to get a boost to the loudness of a song or to make all instruments sound clearly and audible.

Mp4Gain offers the latest technology and algorithms to make your music sound great today.

MP3, FLAC, WAV, ALAC: the differences between audio formats

Digital audio formats

Digital Audio

Today, most people listen to music completely digitally. The differences between digital audio formats like WAV, FLAC, MP3, and ALAC are not clear to everyone. We put the facts together.

Digital audio formats

While vinyl is booming and CD sales are slowly but surely falling, today’s music is often heard without any physical medium. Whether you use your smartphone or digital audio player, you can move forward with digital audio formats on the go. After all, no one today wants to carry a Discman and multiple CDs with them when they typically have a powerful pocket computer in the form of a smartphone that can play digital music files. But what are the differences between the individual file formats and what are their advantages and disadvantages?

WAV and AIFF: the uncompressed ones

The Wave container format (.wav) was developed by Microsoft. Saves uncompressed audio content, so files require a lot of storage space (2 minutes can take 20MB of space. WAV is especially important when recording and editing audio content. The downside of .wav files is that they don’t metadata is required (about, Title Artist) can be stored,
the equivalent developed by Apple AIFF (.aif) Due to the fact that Apple computers are very common in music production, this audio format is very common there.

MP3, AAC, WMA, Ogg-Vorbis – compressed to save space, but not lossless

The MP3 file format (.mp3, named for the MPEG-1 Audio Layer 3 compression codec) developed by the Fraunhofer Institute in the 1980s is probably the best-known digital audio format. It gave the MP3 player its name, and for a long time music was digitized almost exclusively as MP3, for example, on the extremely popular and now illegal file-sharing networks around the turn of the millennium. The advantage of MP3 is the small amount of storage space required: on average, it takes up one-tenth the size of the original file. However, one disadvantage that should not be neglected is that it is lossy – frequencies that are inaudible to humans are removed to drastically reduce the memory required. To what extent this affects the sound, you can compare Flac with MP3 Read.

AAC (Advanced Audio Coding) is a successor to the MP3 format, offering slightly better sound quality. Apple continues to mainly offer songs in this audio format on the iTunes store.

WMA stands for Windows Media Audio (.wma), as the name suggests, a development by Microsoft. .Wma is also a lossy compression file format.

A somewhat rarer audio format is Ogg-Vorbis (.ogg), where Vorbis is the music compression technology and .ogg is the container format. Like MP3, .ogg is also lossy, but requires less storage space and better quality.

FLAC / ALAC / WMA lossless – the lossless

Lossless formats were developed to preserve all sound information while keeping the amount of memory required small. With all file formats, the required memory is reduced to about half the original file. With audio conversion software, the file can be converted to other lossless formats, something unthinkable with lossy formats. This is why lossless file formats are popular for archiving music collections in a space-saving way.

FLAC – Free Lossless Audio Code (.flac) is a free audio format, so it is not owned by any major corporation. ALAC: Apple Lossless Audio Codec (.alac) is Apple’s lossless file format, while Microsoft also has its own development on the market with WMA Lossless.

MP3: the digital audio revolution

Perhaps not many people know that in 1992 a silent and unstoppable revolution of digital audio began for mass, until then essentially represented by CD-Audio. This was, in fact, the year that the algorithm underlying the MP3 format was born by the Fraunhofer-Institut für Integrierte Schaltungen (IIS).

Mp3

Part of a European research project called EUREKA, which started in 1987 and ended in 1994, the then-MPEG 1 Layer 3 was one of the most important and mature fruits in the field of psychoacoustic compression algorithms. This family of compression algorithms, whose first studies date back to 1979 by Manfred R. Schroeder, German physicist at AT & T-Bell Labsc, aims to reduce the amount of information capable of describing an audio sequence, from the assumption that the human ear, fortunately for us, is not perfect. The basic idea is to exploit the inability of the man’s auditory system to recognize certain sounds and frequencies, when they are masked by others.

MP3

Audio masking is detected at two levels: frequency and temporal masking. To explain the principle quickly, let’s take an example: in the presence of two tones, depending on their frequency and intensity, our ears will be able to recognize both or only one.

In the latter case, we have a frequency masking, and therefore information related to the least audible tone can be discarded. What happens, however, if the most intense tone is lost? It will happen that the tone that was not noticed before, will now return to the foreground. However, for the hearing system to notice, time will inevitably pass, because the membrane needs to stop vibrating and readjust.

We speak, of course, of times in the order of milliseconds, which are however precious, because the sound that falls within this time will be cut by the compression algorithm and, consequently, will help to reduce the amount of information necessary to describe what is audible.

The first MP3 encoder, called l3enc, was released by the Fraunhofer Society on July 7, 1994, while the MP3 extension was officially born on July 15 of the following year.

Those who lived through this time know that we are talking about years in which ADSL did not exist, hard drives were a few hundred MB in size, and in general, both from the point of view of communications and data storage, the figures they were far from being as generous as they are today. With these limitations in mind, I want to remind you that an uncompressed audio file in PCM WAV format, with a resolution of 44 kHz and 16 bits, stereo, as required by the CD-Audio standard, has a bit rate equal to 1411.2 kbit / s. This means that if you want to rip a song from an audio CD on your hard drive, the occupied space in uncompressed WAV format is approximately 10MB per minute. Today perhaps it would not be a problem to have this space, but in the mid-nineties it was a notable limitation.

The compactness of the MP3 format combined with the more than acceptable quality (a very optimistic estimate is a bit rate of 128 kbit / s to obtain a quality comparable to CD-Audio), made it in a few years the vehicle of transmission par excellence for music. The milestones that contributed to this unstoppable technological success were the launch of the Winamp player software by Nullsoft in 1997, and the arrival on the market just one year after the first portable media players: the MPMan F10 from Eiger Labs and the Rio PMP300 from Diamond. Multimedia.

Finally, it is impossible not to mention the birth of peer-to-peer networks aimed at exchanging MP3 files with Napster, one of the most famous applications in history, both for the innovative service that was made accessible and for the inevitable judicial events that followed and which decreed its closure in 2001.

In the same year, another symbol of the multimedia revolution, the result of the same technological horizon drawn by the MP3 format, appeared on the market: the Apple iPod.
Continuing until today we find, in parallel with the birth of new and more efficient compression formats, increasingly evident examples of the revolution, also social and commercial, that led to the arrival of the MP3 format.

There was a time when playlists were decided exclusively by record companies that were mixed into albums with mediocre songs, greatest hits; Today you can create your favorite playlist, selecting the songs and the order of play without any difficulty.

DIGITAL AUDIO explained

Audio is the electronic information that represents sound, or rather, having sound of a temporary nature is the flow of information that represents it.

Sound is made up of pressure waves traveling in space, therefore it is represented by a sinusoidal.

Digital Audio

The characteristics of a sound are:

Amplitude: Measured in Hertz (Hz) and determined by the frequency of a sound, the higher the frequency, the louder the sound, the lower it is, the lower the sound.

Intensity: it is measured in decibels (db) and is determined by the power of a sound, the more intense a sound is, the greater its volume.

Duration: It is measured in seconds (s) and dermal how long a sound lasts over time.

Timbre: It is not directly measurable, but it is that sound parameter that allows us to distinguish a trumpet from a drum. It constitutes the trace of a sound and is characterized by harmonics.

digital audio

ANALOGUE AND DIGITAL

There are two different ways of representing sound as electronic, analog and digital information.

Analog audio was the first, in chronological order, to be developed.

The information varies similarly to the information it represents and can (in theory) assume any value.

If we greatly expand the sine wave that describes an analog sound, we would see that it is a continuous line without interruptions.

Instead, digital audio is encoded with a number system, which allows discretization (transition from analog to digital), during this step information is lost, but once the sound is written as a series of numbers (digital information) it is possible to reproduce it. , transmit and modify it without losing anything in terms of quality, which is impossible with analog information.

If we greatly expand the sine wave that represents a digital sound, we would realize that it is not a continuous line as in the previous case, but a series of points very close to each other.

The amount of these points in one second of information will define the “sampling frequency”.

The amount of information that each point can contain is called “bit depth”.

THE CHARACTERISTICS OF DIGITAL SOUND

Sampling rate

Determine the number of samples contained in one second of information.

It is expressed in hertz (Hz) and generally assumes the following values ​​in the musical field: 22050Hz, 44100Hz, 96000Hz.

According to Nyquist’s theorem, each sampling frequency can record and reproduce sounds that have a maximum frequency equal to half of the chosen sampling frequency, this means that a piece sampled at 44Mhz can assume values ​​of up to 22Mhz only

Bit depth

Determine the amount of information contained in each sample.

It is expressed in Bit (bit) and generally assumes the following values ​​in the musical field 8Bit, 16Bit and 24Bit.

Above all, this is the parameter that depends on the quality of a sound.

Transmission rate (bit rate)

It is a characteristic of codecs, that is, of the “machine language” used to describe a sound.

Sets the total amount of information needed to play a second of a sound.

It is expressed in Bit / s.

AUDIO PROCESSING

Whether you’re talking about studio recording or live performances, the audio signal is never sent directly from the microphone to the speakers / recording medium, but is always processed first, through tools that allow you to perform different interventions. in the sound

These instruments can be analog, therefore they have the instrument physically in the studio (which is usually inserted inside a shelf), which must be connected between the microphone and the mixer or between the mixer and the speakers / recording medium.

Or you can simulate them through some plugins for your computer.

It is necessary to have a Daw (Digital Audio Workstation), which is the workspace in which all editing operations are performed. (Ableton, Cubase, Fruitloops, Logic, Reaper).

Within this software it is possible to install smaller ones, called VST (Virtual Studio Technology) that simulate the circuits of the studio equipment, emulating the effect.

(There are also other proprietary plugins with extensions other than the classic VST like .component or .au).

Some tools are essential and are used in all audio recordings, others are used only in particular situations or to obtain / avoid certain effects.

The main ones are:

Equalizer, is used to emphasize or attenuate some frequencies, this way you get a cleaner sound and a less “mixed” mix where all the instruments occupy only the correct frequencies, without overlapping.

The compressor, as the name suggests, serves to compress the dynamic range, so that the sound is more consistent and less dispersive.

Amp, wavering of different kinds, is used to increase the intensity of a sound.

Limiter works in a similar way to the compressor, but instead of compressing all frequencies, it attenuates those that exceed a predetermined threshold (threshold), avoids entering faults.

Reverb adds a slight reverb that makes a sound recorded in a soundproof studio much more natural than it would be too “dry”.

Filters (high / low cut) allow you to cut some useless and sumptuous frequencies too low or too high. (They are just 1 band parametric equalizers).

Basics of digital audio

Basics of digital audio:

Before the computer can record, manipulate, and reproduce sound, sound must be transformed from an audible analog form to a computer-acceptable digital form, using a process called analog-to-digital conversion (ADC). Once the sound data has been stored as bytes in the computer, the power of the computer’s CPU can be used to transform this sound in thousands of ways. Finally, when you are ready to listen to the result, the digital-to-analog conversion (DAC) process transforms the sound bytes back into an analog electrical signal from the speakers.

Sampling: Analog to Digital Conversion

Given an analog signal, discrete values ​​of its amplitude are taken at small time intervals, obviously the more reliable the reproduction the more samples per second are taken. These obtained values ​​are assigned a digital value that the computer can understand and process as required. We can use 8 or 16 bit words, thus obtaining 256 or 65536 different combinations and obtaining higher resolution.

 

SAMPLE FREQUENCY: According to the Nyquist theorem, it is possible to accurately repeat a waveform if the sampling frequency is at least twice the frequency of the component with the highest frequency. The highest frequency that the human ear can perceive is close to 20 kHz, so the 44.1 kHz sampling rate of sound cards is more than enough. This value is the one used today by CD audio players.

SAMPLE SIZE: The sample size controls the dynamic range that can be recorded. For example, 8-bit samples limit the dynamic range to 256 steps (50 dB range). In contrast, a 16-bit sample has a dynamic range of 65,536 steps (90 dB range) a substantial improvement. The human ear perceives a whole world of differences between these two sample sizes. Ears are more sensitive to detecting differences in pitch than intensity, but are even more sensitive to the strength of sound.

From the previous processes we can get an audio file, such as (and since it is the best known), a WAV audio file. It is the own format of Windows. They can be 8 or 16 bit with sampling rates of 11,025 kHz, 22.05 kHz, or 44.1 kHz and generally have good sound quality.

Digital audio compression

It could be assumed that all you have to do to get good sound is to record at the 44.1 kHz speed limit with 16-bit (2-byte) samples. The only problem that appears if recording in stereo, sampling simultaneously on the left and right channels at 44.1 kHz, a one minute sound sample needs a 10.58MB storage space. This involves using large disk spaces to store these sound files. Many compressed file formats (codecs) have been developed that enable high-quality recording without the need for so much disk space.

Most common audio formats:

With the simple objective of listing a series of codecs used by different operating systems to perform audio compression. Later, a more complete description of the most used is made: MP3.

Therefore, some of the most used are:

Advanced Audio Coding (AAC): used by Apple computers. More efficient than MP3.

Audio for Unix (AU): Acoustic standard for the JAVA programming language.

Windows Media Audio (WMA)

Ogg Vorbis: It is free, open and not patented.

Atrac: compression and playback technology for minidisc.

 

The codec par excellence: the MP3

Its origin and current

The abbreviations MP3 respond to the abbreviation of MPEG (Moving Picture Expert Group) 1 Layer 3, which is a perceptual coding algorithm. This among others was developed by the Moving Picture Expert Group (MPEG) (http://www.cselt.it/mpeg/) together with the Fraunhofer Institute of Technology (http://www.ipa.fhg.de/english/ ).

Moving Picture Expert Group is an ISO / IEC research committee. MPEG is in charge of the international development of compression, decompression, processing and encoded rendering standards for movies, audio and the combination of both. It is a non-profit institution created in 1988, which brings together 300 experts from 20 countries three times a year.

Digital audio formats on the network

Digital audio formats on the network:

WAV: Waveform files (or simply wave) are the most common sound formats on Windows platforms. WAV files can also be played on Mac and other systems with player software.

MPEG (MP3): The Motion Pictures Experts Group (MPEG) format is a standard format with significant compression capability. MPEG level 3 or MP3 files are frequently used for web music distribution. However, due to their size, MPEG files must be downloaded completely before playing them.

RealAudio (.rm): Real Audio is the technology that currently predominates on the Web. You need a proprietary player, but the basic versions of the player are available for free.
MIDI: The Musical Instrument Digital Interface format is not a digital audio format. It represents notes and other information so that music can be synthesized. MIDI has good support and its files are very small, but it is only useful for certain applications because of the quality of its sound when played on PC hardware.

AU: The u-law format is one of the oldest sound formats on the Internet. Players are available for almost all platforms.

RMF: The Rich Music Format supported by Beatnik (www.beatnik.com) is a high quality audio format, primarily for “download-and-play”, which is becoming increasingly popular.

AIFF: The Audio Interchange File Format is very common on Macs. It is widely used in multimedia applications, but it is not very common on the Web.

Flac: Free Lossless Audio Codec (FLAC) (Lossless audio compression codec) Ogg project format without loss. The initial file can be completely recomposed with the disadvantage that the file occupies much more space than would be obtained when applying lossy compression or Lossy.

Digital audio on the network:

The digital sound is measured by the sampling frequency, or how many times the sound is digitized over a certain period of time. The sampling frequencies are indicated in kilohertz (kHz), which indicate the number of times the sound is sampled per second. The CD sound quality is obtained with 44.1 kHz, or 44,100 samples per second. For stereo sound, two channels are required, each 8 bits; At 16 bits per sample, this results in 705,600 bits of data on a CD, producing high quality sound, at the request of the end user. In reality, the transmission of this amount of data would occupy almost half the bandwidth of the T1 network. As the average user of the Web does not have this bandwidth, another solution is necessary. One possible solution is to decrease the sampling rate when digital sound is created for sending through the Web. A sampling frequency of 8 kHz, in mono, would produce acceptable results for simple applications, such as language, especially if we consider that the playback hardware generally consists of a combination of a simple sound card and a small speaker. Low quality audio does not require more than 64,000 bits of data per second, but the end user still has to wait to download the sound. Modern users need several seconds to receive, even in the best conditions, a single second of low quality sound, making continuous sound impossible.

Introduction to digital audio

Introduction to digital audio

Digital audio is the representation of sound signals through a set
of binary data. A complete digital audio system usually begins
with a transceiver (microphone) that converts the pressure wave that represents the
Sound to an analog electrical signal.
This analog signal goes through an analog signal processing system, in
which can be made limitations on frequency, equalization, amplification and
Other processes such as compassion. Equalization aims
counteract the particular frequency response of the transceiver used of
so that the analog signal closely resembles the original audio signal.


After analog processing, the signal is sampled, quantified and encoded. The
sampling takes a discrete number of analog signal values ​​per second
(sampling rate) and quantification assigns discrete analog values ​​to those
samples, which means a loss of information (the signal is no longer the same
than the original). The encoding assigns a sequence of bits to each value
discrete analog The length of the bit sequence is a function of the number of
analog levels used in quantification. The sampling rate and the
number of bits per sample are two of the fundamental parameters to choose from
when you want to digitally process a certain audio signal.
Digital audio formats try to represent that set of samples
digital (or a modification) of them efficiently, so that it is optimized
depending on the application, either the volume of the data to be stored or the
processing capacity necessary to obtain the starting samples. In
in this sense there is a very extended audio format that is not considered audio
digital: the MIDI format. MIDI does not start with digital sound samples, but
stores the musical description of the sound, being a representation of the
score of them.
The digital audio system usually ends the reverse process to that described. From
the stored digital representation is obtained the set of samples that
represent. These samples go through a process of digital analog conversion
providing an analog signal that after processing (filtering,
amplification, equalization, etc.) affect the output transceiver (speaker)
which converts the electrical signal to a pressure wave that represents the sound.

Fundamental parameters of digital audio

The basic parameters to describe the sequence of samples it represents
The sound are:
ƒ The number of channels: 1 for mono, 2 for stereo, 4 for sound
quadraphonic, etc.
ƒ Sampling rate: The number of samples taken per second in each
channel.
ƒ Number of bits per sample: Usually 8 or 16 bits.
As a general rule, multichannel audio samples are usually organized in
frames A plot is a sequence of as many samples as channels,
each one corresponding to a channel. In this sense the number of samples per
second matches the number of frames per second. In stereo, the channel
Left is usually the first.