Digital Audio Quality


Free Download Mp4Gain
picture

Digital Audio Quality

Digital Audio Quality
Digital Audio Quality

Data rate refers to the data flow used by a video file in a unit of time, also called bit rate or bit stream rate.

Digital Audio Quality
Digital Audio Quality

The popular interpretation is the sampling rate, which is the most important part of image quality control in video encoding. Generally, the units we use are kb/s or Mb/s. Generally speaking, at the same resolution, the higher the code stream of the video file, the lower the compression ratio and the higher the image quality. The higher the code stream, the higher the sampling rate per unit time, the higher the data stream, the higher the accuracy, the closer the processed file is to the original file, the better the image quality, the clearer the image quality and the higher the decoding capability of the playback device is required.

Of course, the larger the code stream, the larger the file size. The calculation formula is file size = time X code rate/8. For example, a 720P RMVB file with a 1 Mbps stream of 90 minutes is common on the Internet and its volume is = 5400 seconds × 1 Mb/8 = 675 MB.

Generally speaking, a video file includes images and sounds, just like an RMVB video file, which contains video information and audio information. Audio and video have their own sampling methods and different bit rates, that is, the same video Audio and video file bit rate is not the same. And what we’re talking about is the bitrate of a video file, which generally refers to the sum of the bitrate of the audio and video information in the video file.

Taking the most popular and familiar RMVB video file in China as an example, VB in RMVB refers to VBR, which is short for Variable Bit Rate. The Chinese meaning is variable bit rate, which means that RMVB adopts dynamic encoding. In this way, a higher sample rate is used for complex dynamic images (singing and dancing, flying cars, wars, actions, etc.), while a lower sample rate is used for static images, and the resources are use rationally to achieve image quality and volume .Effect.

The most fundamental difference between code rate and sample rate is that the code rate is for the source file.

 

2. Sampling rate

Sample rate (also called sample rate or sample rate) defines the number of samples per second taken from a continuous signal to form a discrete signal, and is expressed in hertz (Hz). Sampling rate refers to the sampling frequency when converting an analog signal to a digital signal, i.e. how many points are sampled per unit of time. How many bits are in the data for a sample point? Bit rate refers to the number of bits (bits) transmitted per second. The unit is bps (bit per second). The higher the bitrate, the more data transmitted and the better the sound quality. Bit rate = sample rate x number of bits used x number of channels.

The sample rate is similar to the number of frames of moving images. For example, the sampling rate of movies is 24 Hz, the sampling rate of PAL format is 25 Hz, and the sampling rate of NTSC format is 30 Hz. When we play back the still images sampled at the same rate as the sampling frequency, we see a continuous image. In the same way, when a CD recorded at a sampling rate of 44.1 kHz is played back at the same rate, a continuous sound can be heard. Obviously, the higher the sample rate, the more coherent the sound will be heard and the picture will be seen. Of course, the sampling rate that human auditory and visual organs can distinguish is limited, which is basically higher than sound sampled at 44.1 kHz, and most people haven’t noticed the difference.

The number of digits in the sound is equivalent to the number of colors on the screen, indicating the amount of data per sample. Of course, the larger the amount of data, the more accurate the playback sound, so as not to confuse the sound. of the teapot with the train whistle. In the same way, it is more clear and precise for the image, so as not to confuse blood and ketchup. However, limited by the function of human organs, 16-bit sound and 24-bit image are basically the limits of ordinary humans, and the highest digits can only be distinguished by instruments.


Free Download Mp4Gain
picture


Mp4Gain Main Window
picture


Mp4Gain Features
picture


Free Download Mp4Gain
picture

Detailed Music Format Part 2

Detailed Music Format Part 2

Music Format
Music Format

Music CD

Music Format
Music Format

 

That is, CD records. A CD can play sound files of approximately 74 minutes. The Windows system comes with a CD player. Also, the software that comes with most sound cards provides CD playback functionality, and even some CD-ROM drives are offline. from computer Can be used as a stand-alone CD player when powered on.

WMA with unlimited potential

In developing its own network media service platform, Microsoft primarily promotes ASF (Audio Streaming Format), which is an open standard that supports data transmission over various networks and protocols. It supports audio, video, and a variety of other types of multimedia. And WMA is short for Windows Media Audio, which is equivalent to an ASF file that contains only audio.
The compression ratio of WMA files can be as high as 1:18 in 80kbps 44kHz mode, which is basically the same as VQF. And the compression speed is doubled compared to MP3. So it should be more competitive than VQF.

Vorbis free music format

To avoid rising royalties charged by MP3 music companies, programmers at GMGI’s iCast company developed a new free music format, Vorbis, that rivals or even exceeds MP3 in sound quality. And it will be released over the internet and can be downloaded for free without worrying about infringement issues. But MP3 has become very popular on the Internet, and Microsoft’s Windows Media technology has also started to spread, and Vorbis’s outlook is still not optimistic.

Other audio formats

AIF/AIFF: A sound file format developed by Apple, supported by the MAC platform, and supports 16-bit stereo at 44.1 kHz.
AU: SUN’s AU Compressed Sound File Format, which only supports 8-bit sound, is a commonly used sound file format on the Internet, mainly created by SUN workstations.
CDA: CD audio track file.
CMF: A MIDI-like sound file developed by CREATIVE.
DSP: Abbreviation for digital signal processing. By improving the signal processing method, sound quality will be greatly improved and songs will be more pleasing to the ear.
S3U: MP3 playback file list
RMI: MIDI Instrument Sequence

Lossy compression:

AAC – Sound quality is second only to MPC at high bit rates and looks good at both high and low bit rates. The encoding speed is too slow!
MPC: Performance is average at low bitrate, not as good as MP3 and OGG encoded by Mp3Pro, sound quality is best at high bitrate, and encoding speed is
fast.OGG: The sound quality is better at a low bitrate, and the same is true at a high bitrate. Encoding is slightly slower.
MP3 (MP3Pro): Sound quality is lower than OGG at low bit rate and other aspects are the same as MP3
WMA: High and low bit rates are average, VBR is not supported and the highest is 192Kbit/s

lossless compression:

FLAC – Worst compression ratio of the four, decent encoding speed, good platform support.
PAC: Slightly slower encoding speed, third in compression ratio, good platform support.
APE: The fastest encoding speed, the best compression rate, and the platform is generally supported.
WV: The encoding speed is very fast, the compression rate is second among the four types, and it is only supported by the Windows platform.

Detailed music format

Detailed music format

Audio File Formats
Audio File Formats

classic wave

Audio File Formats
Audio File Formats

As the most classic Windows media audio format, the WAVE file is widely used, which uses three parameters to represent sound: the number of sampled bits, the sample rate, and the number of channels.
The channels are divided into mono and stereo, and the sample rates are generally 11025 Hz (11 kHz), 22050 Hz (22 kHz), and 44100 Hz (44 kHz). The capacity occupied by the WAVE file = (sampling frequency × sampling bits × channel) × time/8 (1 byte = 8 bits).

traditional mod

MOD is a wavetable-like music format, but its structure is similar to MIDI, it uses real samples, and the volume is small. In the earlier DOS era, MOD was often used as background music for games. Modern mods can contain many audio tracks in many formats, such as S3M, NST, 669, MTM, XM, IT, XT, and RT.

midi music computer

MIDI is short for Musical Instrument Data Interface. Records the sound played by the instrument digitally (each note is recorded as a number), and then synthesizes these records via FM or wavetable during playback: FM synthesis is the sound of the instrument is simulated by mixing the multi-frequency sounds; wavetable synthesis consists of storing the sound samples of the instrument in the wavetable of the sound card and extracting the sound from the wavetable as you play.

Boss Boss MP3

It can be said that MP3 is famous, it uses MPEG Audio Layer 3 technology to compress the sound with a compression ratio of 1:10 or even 1:12, with a sampling rate of 44kHz and a bit rate of 112kbit/s. .
MP3 music is music stored in digital form. If you want to play it, you must have a corresponding digital playback and decoding system. Generally, MP3 digital music is decoded by special software and then restored to a waveform sound signal for playback output. This type of software is called For MP3 players, such as Winamp, etc.

Overlord RA series online

RA, RAM, and RM are Real’s mature network audio formats, using “streaming audio” technology, making them well suited for network streaming. Information such as copyright, singer, producer, mail and song title can be added during production.
RA can be called the supreme lord of multimedia communication on the Internet. It is suitable for streaming on the Internet and is currently the best format for listening to online music online.

VQF with high compression ratio

VQF or TwinVQ is an audio compression technology developed by Nippon Telegraph and Telephone and Yamaha Corporation.
The audio compression rate of VQF is almost twice that of standard MPEG audio and can reach approximately 1:18 or even higher. And popular compression formats like MP3 and RA are usually only around 1:12. But it still won’t affect the sound quality, when VQF compress music at 44kHz-80kbit/s audio sampling rate, its sound quality will be better than 44kHz-128kbit/s MP3, when compress at 44kHz-96kbit/s , the music is close to 44kHz-256kbit/s MP3.

MD minidisc

MD (ie MiniDisc) is a comprehensive portable music format released by SONY in 1992. The compression algorithm it uses is ATRAC technology (the compression ratio is 1:5). MD is divided into Recordable MD (Recordable, with two heads of magnetic head and laser head) and Single Play MD (Prerecorded, only laser head).
The powerful editing function is the strong point of MD. You can quickly select tracks, move tracks, merge, split, delete and edit track titles. It is more personalized than CD and you can have your own MD album at any time. MD products include MD Walkman, MD bedside audio, MD car audio, MD recording deck, MD camera gun and MD driver, etc.

Digital audio formats or how sound is stored on a computer

Digital audio formats or how sound is stored on a computer

Digital Audio Formats

Today there are about three dozen common digital audio formats. Why you need to create so many types of sound files to store one type of content and how to manage all this, you will learn from this material.

Audio format developments | Digital audio | How to Create Digital Media  Infographics Using ConceptDraw PRO | Audio Infographic

Surely many users prefer to use their home computer not only as a workhorse, but also as a multimedia center, where they can watch movies or family photos, as well as listen to their favorite music. Although compact digital players or mobile phones are certainly more suitable for listening to musical compositions, but unlike them, a computer can not only play music.

No matter how big the built-in memory of your music player is, it will most likely be difficult to store your entire music library on it. Plus, you can create, edit, organize, and search for music with your PC. Also, don’t forget that there are around three dozen common digital audio formats today, and most players are far from omnivorous and can only play a few of them.

So why do you need to create so many music formats to store one type of content? The fact is that, in the vast majority of cases, the sound is stored in “compressed” form, since one minute of uncompressed composition occupies about 10 MB on the hard disk. On the one hand, this seems not to be much, but on the other, if you are a music lover and your collection consists of several hundred or even thousands of songs, then it is clear that the sound must be compressed to reduce the space it occupies in electronic media.

Various special algorithms are used to compress music files, which subsequently determine the structure and presentation of the audio data, or so-called digital audio file formats. All audio formats can be divided into three groups: uncompressed audio formats, lossless compression, and lossy compression.

No compression
One of the most widespread formats related to this type is the well-known WAV. The sound of files with this extension is stored without compression or changes. It is true that much more space is required to store uncompressed files and therefore WAV is more widely used only in professional audio and video applications, where the sound should not have a loss of quality before processing. Keeping ordinary musical compositions in this form is unwarranted waste.

To play WAV files, you do not need any special software, as all media players understand this format, including the standard Windows Media audio player built into the Windows system.

Another format used to store uncompressed audio that is worth mentioning is Apple’s development called AIFF (Audio Interchange File Format). As you may have guessed, it is most commonly used on Macintosh computers running Mac OS X.

Lossless compression (lossless)
Lossless compression algorithms for audio files work on the principle of conventional file cabinets. They do not provide the highest level of compression (40 to 60%), while they have virtually no effect on sound quality. It is also worth noting that in this case, the encrypted data can be fully restored to its original form. Therefore, the use of lossless compression is most often used when it is important to keep the compressed data identical to the original.

The most popular audio formats in this group are FLAC (Free Lossless Audio Codec), APE (Monkey’s Audio), WMA (Windows Media Lossless), and ALAC (Apple Lossless Audio Codec). Each has its own pros and cons. For example, the APE codec offers slightly better compression gains, while FLAC is more common. In general, all true music lovers store their music collections in lossless formats, as they do not remove any data from the audio stream and the files created with these codecs can be listened to even on high-quality stereos.

To play lossless compressed formats, as a rule, third-party players (except WMA) are used, such as MPlayer, foobar, AIMP, Winamp, VLC and others, since all the necessary codecs are already built into them. Another option is to separately install an additional codec pack (for example, K-Lite), after which you can listen to files in lossless format from almost any audio player.

Lossy compression
This is the most popular group of algorithms that provides the maximum audio compression ratio (up to 10 times or more). However, unlike previous formats, the audio file loses quality here, and how much depends

Varieties of digital audio formats.

Varieties of digital audio formats.

Audio Formats

There are several concepts of audio format.

Audio Format

The audio data presentation format in digital form depends on the quantization method of a digital-to-analog converter (DAC). The sound equipment at the present time the most common two types of quantization:

Pulse – code modulation
sigma – delta – modulation
Often bit quantization and frequency sampling point for various audio devices that record and play back as digital audio presentation format (24-bit / 192 kHz, 16-bit / 48 kHz).

The file format determines the structure and presentation of the audio characteristics of the data when stored on a PC storage device. To eliminate redundancy of audio data using audio codecs, with the help of which compression of audio data is carried out. There are three groups of audio file formats:

uncompressed audio formats, such as WAV, AIFF
lossless compressed audio formats (APE, FLAC)
audio formats, with the use of lossy compression (mp3, ogg)
There are only modular music format files. By synthetically or sampled pre-recorded live instruments, they are, in the main, used for the creation of modern electronic music (MOD). Also here the format of MIDI can be attributed, which is not a sound recording, but in this with the help of a sequencer it allows to record and play music, using a specific set of commands in the form of text.

Sound digital media formats are used as that of mass-propagated sound recordings (the CD, the SACD), so and in a professional recording (the DAT, MiniDisc).

For surround sound systems and you can select sound formats, in a multi-channel accompaniment largely without sound for movies. Such systems have a set family of two large formats that compete the companies of the Digital Theater then Systems Inc. – DTS and Dolby Laboratories Inc. – Dolby Digital.

Also called format the number of channels in multi-channel sound systems (5. 1; 7. 1). Initially, this system was designed for the cinema, but later it was extended to home theater systems.

What formats are used to represent digital audio?

What formats are used to represent digital audio?

Audio Formats

The format is used in two different ways.

Digital Audio Formats

When using a specialized medium or recording method and special read / write devices, the concept of format includes both physical characteristics of a sound carrier: the dimensions of a cassette with a magnetic tape or disk, the tape itself, or a disc, recording method, signal parameters, encoding and error protection principles, etc. .P. When using a universal information medium of wide application, for example, a flexible computer or a hard disk, the format is understood only as a method of encoding a digital signal, the peculiarities of the arrangement of bits and words and the structure of service information; all the “low-level” part directly related to working with the media, in this case, remains under the control of the computer and its operating system.

Of the specialized digital audio formats and media, the following are the best known today:

CD (Compact Disc) is a 120mm or 90mm single sided optical laser read / write disc, containing a maximum of 74 minutes of stereo sound at 44.1 kHz sampling rate and 16 linear quantization bits. The system is offered by Sony and Philips and is called CD-DA (Compact Disc – Digital Audio). For error protection, Cross Interleaved Reed-Solomon code (CIRC) and Hamming code 8-14 modulation (Eight to Fourteen Modulation, EFM) are used. A distinction is made between stamped compact discs (CD) write-only (CD-R) and rewritable (CD-RW).
PCM decoder (PCM deck): a system for converting the digital audio signal into a pseudo-video signal compatible with popular video formats (NTSC, PAL / SECAM) and vice versa. PCM decoders are used in combination with home (VHS) or studio (S-VHS, Beta, U-Matic) VCRs, using them as read / write devices. The devices operate with 16-bit linear quantization at sample rates of 44.056 kHz (NTSC) and 44.1 kHz (PAL / SECAM) and can record a two- or four-channel digital signal. In fact, such a decoder is a modem (modulator-demodulator) for a video signal.
S-DAT (Fixed Head Digital Audio Tape – Fixed Head Digital Audio Tape) is a system similar to a conventional cassette recorder, in which recording and reading is performed by a block of thin film fixed heads in a 3.81 mm wide tape in a double-sided cassette with dimensions of 86 x 55.5 x 9.5 mm. It implements two- or four-channel 16-bit recording at 32, 44.1, and 48 kHz.
R-DAT (Rotating Head Digital Audio Tape) is a VCR-like system with cross-tilted rotating head recording. The most popular tape-based digital recording format, R-DAT systems are often referred to simply as DAT. The R-DAT uses a 73 x 54 x 10.5mm cassette, with a 3.81mm wide tape, and the cassette and tape system itself is very similar to a typical VCR. The basic belt speed is 8.15mm / s, the rotation speed of the main unit is 2000rpm. R-DAT operates with a two-channel signal (on some models, four channels) at sample rates of 44.1 and 48 kHz with 16-bit linear quantization and 32 kHz with 12-bit non-linear quantization. To guard against errors, a double Reed-Solomon code and modulation with an 8-10 code are used. Cassette capacity – 80. .240 minutes depending on speed and belt length. Domestic DAT recorders are usually equipped with a phonogram illegal copy protection system, which does not allow recording from the analog input at a frequency of 44.1 kHz, as well as direct digital copying in the presence of SCMS prohibition codes (Serial Code Managenent System). Studio tape recorders have no such restrictions.
DASH (Digital Audio Stationary Head) is a 6.3 and 12.7 mm wide magnetic tape recording system with fixed heads. Belt speed is 19.05, 38.1, 76.2 cm / sec. Implements 16-bit recording with sample rates of 44.056, 44.1 and 48 kHz from 2 to 48 channels.
ADAT (Alesis DAT) is a proprietary system for recording eight-channel audio on S-VHS videotape, developed by Alesis. It uses linear quantization of 16 bits at 48 kHz, the capacity of the cassette is up to 60 minutes per channel. ADAT tape recorders can be cascaded so that a 128-channel synchronous recording system can be assembled.

Digital audio file formats wav, mp3, aiff, ogg, flac, m4a

Digital audio file formats wav, mp3, aiff, ogg, flac, m4a

digital audio formats

The last five years gave a great boost to the development of portable and stationary audio systems, and with this support for a variety of digital audio formats.

DIGITAL AUDIO FORMATS

Small pocket devices have a large internal memory and fixed audio equipment has become even smarter and more demanding. That is why, now, we can not save space on the player and download songs that weigh between 15 and 30 MB each, but at home, listen to digital music in a quality equal to the sound of an analog vinyl.

Description of popular digital audio formats
However, the most widespread audio formats still have their pros and cons, and even in an urgent matter like digital audio, a “panacea” has not yet been found. Classic digital audio formats are divided into “compressed” and “uncompressed” streams, as well as “lossless” formats, which exclude loss of sound.

Description of digital audio formats Description of digital audio formats

Wav audio format
The waveform audio file format (WAVE, WAV – “in waveform”) is a file format for storing a recording of an uncompressed digitized audio sequence. In general, this is the most common format for working in the studio and in broadcasting. allows you to get the most honest sound quality. For example, the standard audio CD format is an LPCM audio stream, with parameters: 2ch (stereo), 44-100Hz, 16bit.

Mp3 audio format
MPEG-1/2 Audio Layer 3: (MP3) is the most popular digital format for storing compressed audio. The MP3 format uses a special algorithm designed to greatly reduce the size of the original file. This format allows you to keep the audio close to the original sound, but thanks to a variety of settings, extremely small size.
Compared to the standard audio CD format, a file in MP3 format and a bit rate of 128 kbps will be approximately 1/11 the size of the original file.

FLAC audio format
FLAC (Free Lossless Audio Codec) is a popular free codec designed for lossless compression of audio data. What does that mean? Unlike lossy audio codecs such as MP3 or OGG, the FLAC audio codec does not remove any information from the audio stream. This format is ideal for audiophiles who create their own music collections and listen to music on high-quality equipment.

Ogg audio format
OGG is a format that has not gained great popularity, but is nonetheless used by a fairly large audience. The OGG format, similar to MP3, compresses audio with loss of quality, but is fundamentally different in practical conversions. This made it possible to get better quality with a smaller file size and to display this codec as absolutely independent. In addition to similar formats that convert lossy audio, OGG has the ability to adjust container properties.

Aiff audio format
The Audio Interchange File Format (AIFF) is a fairly universal audio file format developed by Apple, which is used to store audio data. Like its counterpart, the WAV format, it is uncompressed audio and is widely used in professional recordings and music production.
The .aiff and .aif files created by Apple Loops are used by GarageBand and Logic Audio music editors.

M4a audio format
Apple Losseles (also known as Apple Lossless Encoder, ALE or Apple Lossless Audio Codec, ALAC) (m4a) is another Apple development. This audio format refers to uncompressed audio, which provides lossless playback. It is a fairly specific format, which is mainly supported by products of the creator company, and in some cases, as in the iPhone system sounds, where it is possible to use exclusively the m4a format.

ABOUT DIGITAL AUDIO FORMATS

ABOUT DIGITAL AUDIO FORMATS

Digital Audio Formats

Today, there are several digital audio formats that are superior in quality to compact discs and are available on both physical media and the Internet. What are advanced sound lovers listening to now? Let’s find out.

Digital Audio Formats

The capabilities and quality of the CD-DA format were initially limited by the capabilities of CD as a medium. Legend has it that the standard 74-minute compact disc capacity was chosen in order to be able to record long classical pieces without splitting into two discs. And to be absolutely precise, this figure appeared thanks to Beethoven’s Ninth Symphony: it lasts exactly 74 minutes. Another default parameter was the 44.1 kHz sample rate. This figure defines the upper limit of the reproduced frequency range. For a CD that had to reproduce frequencies up to 20 kHz, this was the lowest possible carrier frequency. As a result, the only field of maneuver was the bit depth, the level of which was 16 bits. With regard to sound recording, bit depth determines its dynamic range and resolution.

The CD cannot be copied into the memory of the computer in the usual way, since we usually copy files. To save a CD-DA, you need a special program, a program that allows you to convert data recorded on an audio disc to PCM format (WAV file). A properly organized CD-DA ripping process allows you to get a completely identical digital copy on your hard drive. Audio CDs are generally saved on a computer as a large FLAC audio file (also WAV, WV, or APE) with a CUE index card or as separate tracks.

As the best digital audio format, the CD did not last that long, just over ten years. In the mid-nineties, the first format appeared that allows for better sound quality. HDCD was an improved version of CD-DA. Their difference consisted in a special recording algorithm that made it possible to save additional data on the sampling depth in a standard CD format. With an HDCD decoder, the output signal received not 16, but 20 bits, which did not give the standard of 96, but up to 120 dB of dynamic range and a very noticeable increase in recording resolution. At the same time, devices without an HDCD decoder played discs like normal CD-DAs. Interestingly, when saving such a disk on a PC in the same way,

The next leap in terms of sound quality came at the beginning of the new millennium. Two HD audio formats were introduced to the audiophile audience at once, appearing almost simultaneously. DVD-Audio, a further development of the traditional recording method and promoted by Panasonic and Toshiba. It is capable of recording 24-bit / 192 kHz in stereo mode and 24-bit / 96 kHz in multi-channel mode.

The SACD format competed with it, which, by the way, looked much less like a normal CD, although it was called “super CD”. Super Audio CD, developed by Sony, was based on the revolutionary DSD encoding algorithm. This digitizing method assumed one-bit sampling at an ultra-high frequency of 2.8224 MHz. The encoding and decoding principles of a DSD stream are much simpler than in high-bit formats and are essentially closer to the principles of analog technology. At the same time, the SACD format retains all the advantages of the advanced digital format and has output characteristics comparable to DVD-Audio in both sound quality and number of channels.

Both DVD-Audio and SACD were designed with a high level of copy protection, but inquisitive minds have already won over both formats, so if desired, the content of both disc types can be saved to a PC as images. ISO (without changing the structure and original codec) or FLAC tracks in 24-bit / 96 kHz or 24-bit / 192 kHz. Almost simultaneously with the DVD-Audio and SACD formats, another original format for publishing high-quality music was born: DAD 24/96. DAD stands for Digital Audio Disk, but it is essentially a DVD-Video with a high-quality still image and sound that can be played on any standard DVD player or PC.

Obviously, with this approach, Blu-ray media, with its HD sound formats, recorded in high quality without compression, is quite applicable for recording music in high quality. However, at the moment there are few such publications, and a special version of the BD-Audio format has every chance of not seeing the light of day, as the sale of high-quality audio material is already very active on the Internet. Anyone who does not want to convert DVD-Audio, DAD and SACD discs to the FLAC format on their own can officially buy albums already converted in 24-bit / 96 kHz or 24-bit / 192 kHz quality.

Advantages and disadvantages of popular audio formats

Advantages and disadvantages of popular audio formats

 Audio File Formats

In today’s music world, there are a large number of audio file formats that are often confusing to the unprepared user. To understand all this, to find out what they are and what they are used for, the presented review will help.

Audio formats

Types of audio formats

Today is the time when all music lovers, not to mention professional musicians and audio editors, need to understand concepts like audio file formats, bit rates, extensions, bit depth, sample rate and many others. to achieve high quality sound. Sound has gone digital, which means that it can be used for various purposes, eg for listening to evidence, for presentations, video dubbing. In fact, digital sound, like an image, is a collection of individual pixels, and the more there are, the better the sound image. This “pixelated” sound can be edited and processed.

An important role in evaluating the quality of audio formats and consequently sound quality is a parameter such as bit rate, which shows how many bits or kilobits it takes to record one second of sound. Low bit rates mean low quality sound, high bit rates mean high quality sound.

But for the storage and further use of audio in one form or another, audio formats are used – digital recordings of audio data. We can say that the format is a kind of container where the sound is stored. Virtually all audio formats can be divided into two broad categories: lossless compressed and lossy compressed.

No loss, no loss

To avoid as much as possible a decrease in sound quality during the compression of an audio file, special methods have been developed to store audio information, avoiding losses, which in fact can be compared with the file when the information is simply packed in a zip file, the size of which is noticeably smaller than the original data. Later, this data can be clearly restored on each bit. And the bitrate itself is not important for these files. These audio files are collectively called Lossless, Music As Is. These algorithms allow you to compress files two to three times. As a result, the size becomes quite large, but at the same time the original sound is preserved.

Digital audio formats

Digital Audio Formats

Now there are several formats, but a basic distinction is made between lossless and lossy formats and compressed or uncompressed formats. Lossy formats are always compressed, which means a reduction in required storage space, but at the expense of playback quality. Lossless compressed formats offer faithful playback with low memory requirements.

However, the savings are less than with lossy formats. Lossless and uncompressed formats offer true-to-original music reproduction, but require a comparatively large amount of storage space. In return, they sometimes support even higher resolutions than compressed formats.

digital audio formats

What are sample rates and bit depth?

When talking about the resolution of digital music, two numbers are often mentioned. For CD quality around 44.1 kHz and 16 bit. The first number is the sample rate of the file. Describes how often the computer or network player extracts a signal from the file and processes it. 44.1 kHz means that a certain amount of data is transmitted 44,100 times per second. This amount of data is described by the bit depth (also word depth), the second number.

At the quality described, 16 bits of data are transmitted 44,100 times per second. If you want to determine the actual amount of data per second, you need to multiply these two numbers and get 705,600 accordingly. Since this is a stereo file with 2 channels, this number should be taken twice.

With CD quality music, 1,411,200 bits per second or, for the sake of simplicity, 1,411.2 kilobits are transmitted. A good MP3 file only transmits 320 kbps, so it only contains about a third of the information on a CD. Compared to 192 kHz 24-bit files, even less.What is the difference between compressed and uncompressed formats?
Uncompressed formats like

WAV do not affect music in any way. Frequencies and information are stored exactly as they are read during encoding. Therefore, uncompressed formats require more storage space in the first place than compressed formats. However, compressed does not automatically mean lossy. Formats like Apple’s FLAC or ALAC save music losslessly as a WAV file. However, they pack existing data more neatly without removing any information, thus requiring less storage space. Normally, there should be no effects on music information.

Why aren’t MP3 files high fidelity?

The MP3 format was introduced in 1992. It was revolutionary for the time, because by encoding music in MPEG-Audio Layer III, the full name of the format, you could achieve file compression of at least 4: 1, usually even 10: 1, compared to the classic CD. . This is possible because encoding in MP3 format removes the parts of the original file that are considered the least useful.

You can never make an exact copy of a music file in MP3 format and you cannot add information that has been deleted. So there is no point in converting an MP3 back to a lossless format. The AAC format used by Apple also cuts information from the original file to save space during compression.

We speak here of lossy or in English also of “lossy”, in contrast to the formats without loss or “without loss”. Meanwhile, it doesn’t really make sense to use such formats anymore, as more storage space shouldn’t be a problem today, unlike in 1992. The sound quality of MP3s is also significantly lower than that of other formats, as only 320 kbps is transmitted here at best, usually only 192 kbps or 256 kbps.

What is metadata?

Metadata are files attached to a file that contain additional information. In the case of digital music, these typically include things like sample rate, bit depth, and file format. In the best case, information about the song title, artist, album, composer, track number, etc. is also attached to the file. Modern streaming clients display this information when they play games on their screen or in an app. Also, these hidden attachments are often responsible for how the music in memory is organized.