
In the era of broadband connections, fiber optics and HD videos on YouTube, MP3 remains the reference format for audio files. We are now so used to listening to music in compressed formats, and often through poor quality playback systems, that it is difficult for us to remember what listening to music really means. The recent evolution from download to hit-and-run streaming has only made the situation worse by further devaluing the value of music. When was the last time you listened to a record from start to finish without interruption, spending those 30-40 minutes on “simple” listening activity?

Premise: This post is not a crusade against Spotify because I use it myself for new releases or to have some background music at work, it is not even an analog vs. digital (or vinyl vs. CD vs. MP3) post because on this topic en Much has already been said. My goal is to make you understand what you are missing, in qualitative terms, if you listen to music in compressed formats.

Sampling and theoretical aspects.
Audio recording on a computer or digital medium assumes that the signal passes through an analog> digital (AD) converter, so that the continuous electrical signal generated by microphones or musical instruments is transformed into a digital signal (series of 0 and 1) This process is called sampling. The final quality of the recording depends on several factors: converter quality, sample rate, and bit depth.
To make an easily understandable comparison: When shooting a movie, the “analog” reality perceived by our eye is stored in a movie that takes 24 frames per second. If we consider the standard of the audio CD (44.1 kHz, 16 bits), for every second of music 44100 pictures are taken from the computer to the continuous electrical signal. If with the sampling frequency we have simply established how many times in a second the waveform will be analyzed, with the bit depth we assign to each sample a numerical value: 2 ^ 16 = 65,536 possible values.
If you wonder how it got to 44,100, I refer you to the Nyquist-Shannon sampling theorem.
When we press the record button on our computer, through the PCM (pulse code modulation) sampling process described above, the files are saved in uncompressed WAV or AIFF format.
Lossless files and lossy files
PCM files take up a lot of space on our hard drives because, as we have seen, there is the data necessary to describe the analog waveform in as much detail as possible. Indicatively, a WAV or AIFF file as audio CD will occupy 10 MB for every minute of music.
To overcome this problem, remember that in the early 2000s storage space cost around $ 10 / GB, while today the price is around $ 0.03 / GB (source): Audio formats have been introduced that , through an algorithm encodes and decodes information, reduces the size of the file. These codecs fall into two categories: formats with lossless compression and formats with lossy compression.
As the name implies, lossless compression indicates a reduction in file weight (usually around 50%) without loss of information. Leaving the world of audio aside for a second, ZIP and RAR files are clear examples of this type of compression: at any time we can “unzip” such a file and have access to the original information again without this no way has changed.
The most common file formats are: FLAC (Free Lossless Audio Codec) and ALAC (Apple Lossless Audio Codec).
Lossy compression, on the other hand, implies that some of the original audio information is somehow removed to obtain a file that weighs even 90% less than the PCM.
By what criteria is information removed without “compromising” the original audio too much? Since our hearing is an imperfect instrument, codecs exploit two principles of psychoacoustics: the minimum threshold of audibility (the human ear does not perceive all frequencies in the range between 20Hz and 20kHZ equally) and masking (a weaker sound). is masked, making it inaudible, by a louder sound.)
Compression algorithms, however advanced, introduce a number of artifacts into audio files that, if played back in discrete quality audio systems, can be easily recognized or at least noticed even by an inexperienced ear. Several studies have shown that an untrained ear does not distinguish the difference between an uncompressed file and an MP3 with a bit rate equal to 256kb / s or more.
The most common lossy formats are: MP3, OGG Vorbis, AAC.
The victory of MP3
Since its introduction in the mid-1990s, MP3 has established itself as the industry-standard consumer format fueled by file-sharing through peer-to-peer channels, where, with slow connections, the heaviest file was the one it was downloaded, the longer it took to obtain it, and since the market introduction of MP3 players in which we tried to store as much music as possible and, therefore, we resorted to very compressed files.
In the transition from the era of downloading to that of small transmission files, they ensure smoother and smoother data transmission.
Despite, therefore, the evolution that has taken place in recent years in the speed of Internet connections and the reduction in the price of storage systems, only in recent years have services been created to buy files from High-quality online audio (HD tracks) or HD streaming services (Tidal).
Examples and audio files.
The main services we use to buy or listen to music use these compression levels (all information is taken from the official websites of each service at the time this publication was written).
Spotify: OGG Vorbis files at 96 kb / s (normal mobile quality), 160 kb / s (normal desktop and web player quality, high mobile quality), 320 kb / s (premium users: high desktop quality, very high quality mobile).
iTunes: By default, CDs are imported into 128 kb / s AAC files. Files in the iTunes Store are of this quality, except for “iTunes Plus” songs converted to AAC at 256 kb / s.
Pandora: 64kb / s AAC (free users), 192kb / s AAC (premium users).
YouTube: HD videos (720 or 1080p) have an audio quality equal to 384kb / s, SD videos (360, 480p) have an audio quality equal to 128kb / s.
















