Structure of an mp3

Structure of an mp3

 

Structure of an mp3
Structure of an mp3

audio compression

Structure of an mp3
Structure of an mp3

 

The MP3 format began in the mid-1980s and the Fraunhofer Institute in Erlangen, Germany, was committed to high-quality, low-data-rate audio coding.

MP3 audio compression includes encoding and decoding in two parts. Encoding is converting the data in the WAV file into a highly compressed bitstream format, and decoding is accepting the bitstream and reconstructing it into the WAV file.

MP3 uses the distortion algorithm of Perceptual Audio Coding (PerceptualAudioCoding). The frequency range of sound perceived by the human ear is from 20 Hz to 220 kHz. MP3 cuts out a lot of redundant signals and irrelevant signals. The encoder transforms the original sound into the frequency domain through a hybrid filter bank. Using the psychoacoustic model, it is estimated that it may simply be The perceived noise level is quantized and converted to Huffman coding to form an MP3 bitstream. The decoder is much simpler and its task is to extract the sound signal from the encoded spectral line components through inverse quantization and inverse transformation.

When compressing audio data, the original sound data is first divided into fixed blocks, and then direct MDCT is performed. MDCT itself does not perform data compression, but only converts a set of time-domain data to frequency-domain data to obtain time-domain data. In case of change, the direct MDCT converts the value of each block into 512 MDCT coefficients. Quantization compresses data, and when bits are allocated to transformed samples after quantization, it is necessary to consider making the entire quantized block the smallest, which becomes lossy compression. When decompressing, the 512 coefficients are restored to the original sound data by reverse MDCT, and the original sound data before and after are inconsistent, because redundant and irrelevant data are removed during the compression process.

 

MP3 file structure
MP3 files are roughly divided into three parts: TAG_V2(ID3V2), Frame, TAG_V1(ID3V1)

ID3V2 Contains information such as author, composer, album, etc., the duration is not fixed, expanding the amount of information of ID3V1
framework

 

 

 

A series of frames, the number is determined by the file size and frame length

The length of each frame can be variable or fixed, determined by the bit rate.

Each FRAME is divided into two parts: frame header and data entity

The frame header records the bitrate, sample rate, version, and other mp3 information, and each frame is independent of each other.

ID3V1    Contains author, composer, album and other information, length is 128BYTE

Structure of an mp3

Structure of an mp3

 

Structure of an mp3
Structure of an mp3

The full name of MP3 is MPEG Audio Layer3, which is an efficient computer audio coding scheme.

Structure of an mp3
Structure of an mp3

It converts audio files into smaller files with .MP3 extension with a higher compression ratio and basically keeps the sound quality of the original file. MP3 is part of the ISO/MPEG standard. The ISO/MPEG standard describes audio compression using a high-performance perceptual coding scheme. This standard has been continuously updated to meet the pursuit of “high quality, low volume”. MPEGLayer1, Layer2 , and Layer 3 have now formed three audio codec schemes. The compression rate of MPEGLayer3 can reach from 1:10 to 1:12. A 1M MP3 file can play for 1 minute, while a 1 minute CD-quality WAV file (44100 Hz, 16-bit, two channels, 60 seconds) will take up 10M of space. , A 650M MP3 disc should play for more than 10 hours, while a CD with the same capacity should play for about 70 minutes. The advantages of MP3 are unmatched by CD.

MPEG audio standard
MPEG (Motion Picture Experts Group) is a moving picture expert group under ISO, and the MPEG standard it creates is widely used in various multimedia. MPEG standards include video and audio standards, among which MPEG-1, MPEG-2, MPEG-2AAC, and MPEG-4 audio standards have been developed.

The MPEG-1 and MPEG-2 standards use the same family of audio codecs: Layer 1, 2, and 3. A new feature of MPEG-2 is the use of low sample rate expansion to reduce data traffic, and another feature is multi-channel expansion, which increases the number of main channels to five. The MPEG-2AAC (MPEG-2 Advanced Audio Coding) standard was released by FraunhoferIIS and AT&T in 1997, with the goal of significantly reducing data traffic. MPEG22AAC adopts the Modified Discrete Cosine Transform (MDCT) algorithm and the sampling rate can be between 8 KHz and 96 KHz. The number of channels can be between 1 and 48.

MPEG Audio Layer1, 2, and 3 use the same filter bank, bitstream structure, and header information, and the sample rate is either 32 KHz, 4411 KHz, or 48 KHz. Layer1 is designed for DCC (DigitalCompactCassette) digital compression tape, the data rate is 384kbps, Layer2 has made a compromise between complexity and performance, and the data rate has been reduced to 256kbps-192kbps. Layer 3 was designed for low data traffic from the start, and data traffic ranges from 128 kbps to 112 kbps. Layer 3 adds MDCT transform, making its frequency resolution 18 times higher than Layer 2. Layer 3 also uses EntropyCoding similar to MPEGVid2eo, reducing redundant information. The vast majority of MP3s use the MPEG21 standard.

What are MP3 files?

What are MP3 files?

What are MP3 files?

 

The audio format is directly related to the quality and purpose of the audio track, i.e. where and on which device it will be played and what is its purpose.

What are MP3 files?

But before you can figure out the difference between them and choose the best audio format for your music, you need to know what categories they fall into. Let’s keep going!

Uncompressed audio is like a picture, and uncompressed audio is of better quality, larger file size, safer to copy, and nearly identical in detail to the original sound.

WAV is the most widely used of these audio formats and plays music just as accurately as it records it.

compressed audio
When music is compressed, the files become smaller and can be easily stored on a device. Due to this advantage, users tend to choose compressed audio more.

However, it must be remembered that some audio formats in this category may lose quality depending on the option selected, just like MP3 and AAC.

What is the best audio format?
As we said before, the first step in deciding on an audio format is to know the final objective of the track. Whether it’s for music lessons, performances, karaoke, auditions, or recording versions, you need to understand the pros and cons of each option.

WAV
WAV (Waveform Audio File Format) is an uncompressed format and therefore requires ample storage space. This is suitable for those who already work with music, such as subject matter experts, or users who want to edit audio.

At high fidelity rates, WAV faithfully reproduces the elements and characteristics of the original soundtrack. Also, this format allows you to choose between different sample rates and bit rates and can be used on multiple platforms.

FLAC
FLAC (Free Lossless Audio Codec) is one of the most widely used compression formats by music lovers these days.

Digital audio encoding allows you to preserve its quality, but the resulting file will be smaller. Over the years, this format has become more widely used and compatible with different devices and platforms.

FLAC is free and open source, ready to use and can be easily played on smartphones and other devices.

MP3
Before deciding on the best audio format, it is worth taking a look at the most famous format in the world of music: MP3.

MP3 is one of the leading audio compression formats, and has become synonymous with the convenience and efficiency of producing files quickly, with smaller files, and at a certain level of quality.

Many devices and programs can play this format. But MP3 is difficult to use in professional audio processing and advanced audio editing.

As is known, this format exists on almost all platforms and is ideal for sharing audio.

Another interesting factor is its bitrate, although in a compressed format it can vary depending on the user’s objectives and quality improvements.

AAC Like MP3, Advanced Audio Coding (AAC) is a more efficient audio format than its predecessor.

If you need to create smaller files with less storage space, AAC is a great choice, reducing the file size for the user while maintaining a high-quality audio track.

Compatible with different platforms and devices, it is convenient to apply in different situations.

Analysis of the above audio formats leads to the conclusion that it is impossible to say which format is better than the other, just that each target has its own ideal format. So before downloading or uploading a file, check what platform the music will play on and what it is for.

What are MP3 files?

What are MP3 files?

What are MP3 files?
What are MP3 files?

A file with the .mp3 extension is a digitally encoded file format for audio files, officially based on MPEG-1 Audio Layer III or MPEG-2 Audio Layer III.

What are MP3 files?
What are MP3 files?

It was developed by the Moving Picture Experts Group (MPEG) using Layer 3 audio compression. The compression achieved by the MP3 file format is 1/10 the size of a .WAV or .AIF file. This format offers the advantage of streaming such audio files over the Internet for online listening, which was previously not possible due to the large size of audio files. The sound quality of MP3 audio files can be controlled by setting parameters such as bit rate, sample rate, common or normal stereo.

A brief history of MP3

The MP3 format was invented and developed by a German company, Fraunhofer-Gesellshart. The algorithm has licensed patents for the compression techniques it uses. Here’s a helpful MP3 schedule:

• 1987 : The Fraunhofer Institute in Germany begins research on high-quality, low-bitrate audio coding. It’s called the EUREKA project EU147, Digital Audio Broadcasting.

• January 1988: The Moving Picture Experts Group (MPEG) is formed.

• **April 1989**: Fraunhofer patented the MP3 in Germany.

• 1992-Dieter Seitzer, who helped Fraunhofer with his research, integrated his audio encoding with MPEG-1.

• 1993 – Publication of the MPEG-1 standard.

• 1994 – The MPEG-2 standard was developed and released a year later.

• November 26, 1996 : US patent for MP3 is published.

• September 1998 – Fraunhofer begins to enforce the patent. People who used the MP3 audio codec paid Fraunhofer a license fee.

• February 1999 – SubPop, a record label, releases music in MP3 format, the first to do so.

• 1999 – The first portable MP3 player appears.

File format MP3##
MP3 files consist of MP3 frames, where each frame consists of a header and a data block. Frames are not independent and generally cannot be mined at arbitrary frame boundaries. The data blocks of a file contain frequency and amplitude information about the audio. The sync word in the header identifies the start of a valid frame. This is followed by 3 bits where the first bit indicates that it is an MPEG standard and the remaining 2 bits indicate that layer 3 is used; therefore, MPEG-1 Audio Layer 3 or MP3. After this, the value will vary depending on the MP3 file. ISO/IEC 11172-3 defines the range of values for each part of the header and the header specification. Most current MP3 files contain ID3 metadata, which precedes or follows the MP3 frame, as shown. Data streams may contain an optional checksum.

Detailed music format

Detailed music format

Audio File Formats
Audio File Formats

classic wave

Audio File Formats
Audio File Formats

As the most classic Windows media audio format, the WAVE file is widely used, which uses three parameters to represent sound: the number of sampled bits, the sample rate, and the number of channels.
The channels are divided into mono and stereo, and the sample rates are generally 11025 Hz (11 kHz), 22050 Hz (22 kHz), and 44100 Hz (44 kHz). The capacity occupied by the WAVE file = (sampling frequency × sampling bits × channel) × time/8 (1 byte = 8 bits).

traditional mod

MOD is a wavetable-like music format, but its structure is similar to MIDI, it uses real samples, and the volume is small. In the earlier DOS era, MOD was often used as background music for games. Modern mods can contain many audio tracks in many formats, such as S3M, NST, 669, MTM, XM, IT, XT, and RT.

midi music computer

MIDI is short for Musical Instrument Data Interface. Records the sound played by the instrument digitally (each note is recorded as a number), and then synthesizes these records via FM or wavetable during playback: FM synthesis is the sound of the instrument is simulated by mixing the multi-frequency sounds; wavetable synthesis consists of storing the sound samples of the instrument in the wavetable of the sound card and extracting the sound from the wavetable as you play.

Boss Boss MP3

It can be said that MP3 is famous, it uses MPEG Audio Layer 3 technology to compress the sound with a compression ratio of 1:10 or even 1:12, with a sampling rate of 44kHz and a bit rate of 112kbit/s. .
MP3 music is music stored in digital form. If you want to play it, you must have a corresponding digital playback and decoding system. Generally, MP3 digital music is decoded by special software and then restored to a waveform sound signal for playback output. This type of software is called For MP3 players, such as Winamp, etc.

Overlord RA series online

RA, RAM, and RM are Real’s mature network audio formats, using “streaming audio” technology, making them well suited for network streaming. Information such as copyright, singer, producer, mail and song title can be added during production.
RA can be called the supreme lord of multimedia communication on the Internet. It is suitable for streaming on the Internet and is currently the best format for listening to online music online.

VQF with high compression ratio

VQF or TwinVQ is an audio compression technology developed by Nippon Telegraph and Telephone and Yamaha Corporation.
The audio compression rate of VQF is almost twice that of standard MPEG audio and can reach approximately 1:18 or even higher. And popular compression formats like MP3 and RA are usually only around 1:12. But it still won’t affect the sound quality, when VQF compress music at 44kHz-80kbit/s audio sampling rate, its sound quality will be better than 44kHz-128kbit/s MP3, when compress at 44kHz-96kbit/s , the music is close to 44kHz-256kbit/s MP3.

MD minidisc

MD (ie MiniDisc) is a comprehensive portable music format released by SONY in 1992. The compression algorithm it uses is ATRAC technology (the compression ratio is 1:5). MD is divided into Recordable MD (Recordable, with two heads of magnetic head and laser head) and Single Play MD (Prerecorded, only laser head).
The powerful editing function is the strong point of MD. You can quickly select tracks, move tracks, merge, split, delete and edit track titles. It is more personalized than CD and you can have your own MD album at any time. MD products include MD Walkman, MD bedside audio, MD car audio, MD recording deck, MD camera gun and MD driver, etc.

Differences between audio formats and how to convert them to MP3, OGG, WAV, WMA, MKA, FLAC, APE, AAC, AIFF, etc.Gain

Differences between audio formats and how to convert them to MP3, OGG, WAV, WMA, MKA, FLAC, APE, AAC, AIFF, etc.

How to convert MP3, WMA, APE, FLAC, AAC, MMF, AMR, M4A, OGG, WAV, WAVPack and MP2 audio formats with iWisoft Free Video Converter, Mp4Gain (Best Option hands down) or Audacity.

ogg

The most commonly used audio format online is undoubtedly the MP3 format, which does not lose much quality despite compression. As we have already seen with image file formats, audio formats are also divided into lost and lossless. Lossless formats keep quality intact but are heavier, while lost formats are compressed to be lighter but can lose quality.

The MP3 format is a good compromise between these two needs, as it maintains exceptional quality against minimal space. NO LOSSES AND LOSSES OF AUDIO FILES WAV is a universal lossless format that is a copy of the original audio source AIFF is another lossless format developed by Apple FLAC or Free Lossless Audio Codec is another lossless format and probably the most widespread.

losseless

Despite the compression, it retains its original quality and is free and open source. ALAC or Apple Lossless is similar to FLAC, which compresses it without data loss. This is a file compatible with iTunes and iOS. The APE is a compressed file and the sound quality is the same as the source, and the compression is better than in FLAC and ALAC files.

However, it is not universally compatible. MP3 is the most widespread file at a loss and has become synonymous with music downloaded from the internet. This is not the best quality option, but it is certainly the most compatible format. Advanced audio encoding or AAC is very similar to MP3, more efficient but less compatible OGG Vorbis is another lost format and is open source, so it is not limited by patents, but is less popular and compatible than MP3 and ACC WMA or Windows Media Audio is Microsoft’s lossy format, similar to MP3 and AAC.

The difference in quality between different formats is also mainly given by the bit rate or bit rate used for analog to digital conversion. the quality basically depends on the number of bits processed in the time unit. Just an example for the most common files on the web, which are MP3 files, they can have a creature speed ranging from 32Kb / sec to 320Kb / sec.

However, there is no compression in lossless files and the bit rate is comparable to an audio CD. However, the reality is a little different, as it has been shown that the human ear can barely detect quality differences between a 32Kb / sec compressed file compared to a 320Kb / sec compressed file.

Ogg Vorbis. The sound of the future

We all know that MP3 is the standard in audio compression, but there is a solution on the market with a future Ogg format, which unlike the rest has no use limit and its developers do not charge anyone for its use and much less do they impose their patent. In this article you are going to immerse yourself in the new revolution of sound for computers.

” A little history

We all know the MP3 music format which allows you to take music on the Internet with a quality similar to that of music CDs, exchange it with others, store it on your computer, save music CDs on your hard drive, listen to music on a small portable device no moving parts.

The future of MP3 is at stake. And now they are not the lawyers, it turns out that the format itself is patented from the beginning and they will ask for a commission for use shortly, so for a long time the one that will be the most advanced successor is being perfected: the OGG.

Programmers have used MP3 freely without problems since it was born, but the fact is that the institute has the intellectual property of the format.

In September 1998, Fraunhofer began sending letters to software developers saying that they plan to start charging for licenses to use MP3. Fraunhofer and the other members of the MPEG Consortium claim that it is impossible to create an mp3 encoder without infringing on their patents.

Ogg Vorbis is a high-quality, general-purpose compressed audio format (44.1-48.0kHz, 16+ bit and polyphonic, supporting up to 255 independent audio channels), putting Vorbis in the same category as MPEG-1 audio layer 3, MPEG-4 audio (AAC and TwinVQ), and PAC.

To create or use an encoder, the law says that royalties must be paid both to the institute and to other members of the consortium. In other words, you can listen to MP3, but you cannot contribute by recording anything to mp3.

It is a problem, the patent can limit the growth and make that only those who can afford it use the mp3. They say that there is no problem without solution and OGG Vorbis is the technological solution to the MP3 patent challenge.

In fact we can talk about Ogg Vorbis as an MPEG-4 compressor, which is trying to lead the rest of the competitors that exist in this format, specifically we are talking about AAC and TwinVQ.

Ogg Vorbis format files have the ogg extension and are just the beginning of a family of multimedia products that OggSquish is developing as part of the Xiphophorus project.

»OGG Vorbis the solution to the problem

It is an open format, that is, without an owner and without the possibility of being patented, created by volunteers in the style of free software and, therefore, more technologically advanced when receiving contributions and ideas from a huge community of programmers.

It supports high quality audio, in variable bitrates, several channels and for now up to 128kb / channel. This puts OGG on the same footing currently as MP3, MP4 (AAC, and TwhinVQ), and PAC.

The leader of the project is Christopher Montgomery and he started coding ogg from the moment he received the news of the patent collection threats from the German institute. Since then, many volunteers have joined Montgomery while contributing ideas and lines of code, making OGG files 25% smaller on average than mp3s of the same quality.

OGG Vorbis has been designed to be used in a final way, that is, you can encode everything in OGG without paying patents and never have to go back to MP3, so you can also share the OGG format on P2P networks. The most popular players already support OGG with or without extensions, as well as many reprogrammable hardware players.

The license is the GPL, it is the seed of the entire free software movement, and which allows no one to take advantage of and take ownership of the code that volunteers selflessly provide.

The fact that it is an open format ensures that OGG grows and improves. MP3 is defined from the first moment, and will never have more quality than it corresponds to nor will it be smaller or more compressed, because it is closed.

OGG however will benefit from the improvements that research brings and gradually it will be more compressed, more optimized and will sound better than it already sounds.

Live audio streaming is an important component of Vorbis. The format has been designed to be easily transmitted live.

The designers of Vorbis are working hand in hand with the creators of Icecast (a program for live broadcasts) to make Icecast compatible with Vorbis.

Likewise, they are working on a player that supports live ogg files. In addition, soon from the ogg website these components will be available as accessories for current players. This will be when Vorbis version 1.0 comes out.