sample rate and bit depth for music Archives

Bit Depth and Sample Rate PART 2

Free Download Mp4Gain

Bit Depth and Sample Rate PART 2

Fade processing

Bit Depth and Sample Rate

We now know that digital signal processing is bound to be very buggy. So the approximation of the total will also have a lot of error. These errors not only render the audio unrecoverable, but also introduce an unnatural sound.

To remove these artifacts, we add computed low-amplitude noise to the signal, which we call dithering. The amplitude of the jitter noise is very low, and although some is still heard, it is better than no addition.

Note that jitter noise accumulates. When you add noise to a signal, the signal-to-noise ratio decreases. If the operation is repeated, this ratio will continue to decrease, adding uncertainty to the signal. This is why dithering is often applied as the last step in mastering, and only once.

Dithering has quite an interesting history:

The first dither processing appeared during World War II. Bombers use mechanical computers for navigation and ballistic calculations. Interestingly, these computers are more precise in their processing performance in the air. Engineers realized that vibrations from the plane reduced errors in moving parts. His movements become more continuous, rather than sudden vibrations. Computers have little vibrating motors, and their vibrations are called oscillation, which is derived from the medieval English word “didderen,” meaning “to shake.” Modern dictionaries define dither as a state of high tension, confusion, or anxiety. Dithering brings digital systems closer to analog systems in some way.

– Ken Pohlmann, Digital Audio Rules

Sampling rate
According to theory, the sampling rate of 44.1 K per second is sufficient to cover the hearing range of the human ear. You may have inadvertently learned about Nyquist’s theorem, which states how to avoid aliasing (a type of distortion) and how to reconstruct all frequencies by sampling, which requires sampling at twice the highest frequency of the signal (this theorem also applies to non-audio media, we won’t go into that here).

The human ear has a hearing range of up to 20kHz (most studies show that this number is actually around 17K), so a sample rate of 40K is enough to hear every frequency clearly. 44.1K is the industry standard, which was determined by SONY, which was an oligopoly at the time, for a few reasons.

In a nutshell, the digital audio samples must be above the Nyquist frequency because, in practice, the samples are low-pass filtered during the digital-to-analog conversion process to prevent aliasing. The smoother the slope of the low pass filter, the lower the manufacturing cost. So an audio signal that normally uses a low pass filter will have a smooth slope at 2 kHz. For example, to keep the full spectrum below 20kHz, it should be done at a 44kHz sample rate (20K[highest frequency]+2K[low pass filter slope]x2[Nyquist theory]=44K)

Ultimately, the 44.1K standard was resolved in a battle between Sony and Philips (both had similar end goals). This is also based on the math behind audio sample rate and videotape anatomy. In this way, audio and video can coexist on the same video tape, which has a higher cost performance. However, 48K is the standard for video related to audio. CD audio remains at 44.1K.

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

Bit depth and sample rate

The first thing to understand is that bit depth and sample rate only exist in digital audio.

In digital audio, bit depth describes amplitude (vertical axis) and sample rate describes frequency (horizontal axis). So increasing the number of bits we use increases the resolution of the sound’s amplitude, and increasing the number of samples per second increases the resolution of the sound’s frequency.

In an analog system (the natural world), the audio is continuous and smooth. In digital systems, smooth analog waveforms can only be roughly sampled and limited to a certain amplitude range. When sampling a sound, the audio is divided into small segments (samples) that are fixed at an amplitude level. The process of correcting a signal to a certain amplitude level is called quantization, and the process of creating a sample segment is called sampling.

In the graph below, a natural sine wave is displayed for up to 1 s, starting from 0 and ending at 1 s. The blue bars represent approximations of the digital quantization of the sine wave, and each bar is a sample, clipped to the approximate available amplitude level. (Of course, the graph is more incomplete than reality).

Depending on the choice made during recording, an audio of 1 s duration can have samples of 44.1K, 48K and, in the case of 24 bits, contains an amplitude level of -144 dB at 0 dB (- 96dB to 0dB for 16bit). The dynamic range resolution (the number of amplitude level units that can be used for a sample, ie the number of rectangles displayed) is 65536 at 16 bits and 16777216 at 24 bits.

Therefore, increasing bit depth can greatly improve amplitude resolution and dynamic range. So where does the increase in dynamic range appear? Since the amplitude cannot exceed 0dB, the added dB is distributed to samples with smaller amplitudes. So one can hear more small sounds (such as a reverb track stretching at -130dB) that would cut off at 16 bits, -96dB.

round and discard

In digital audio, each sample is analyzed, processed, converted to audio, and then played through speakers. When a sample is processed in your DAW (gain, distortion, etc.), they go through basic multiply and divide operations that allow you to change the digital representation of the sample. Very simply, if we don’t do the rounding process (the 1dB gain must be multiplied by 1.122018454), even 8 or 4 bits of sample precision will exceed the 24-bit space.

So since we only have 24 bits, these long numbers need to fit in this space. To do this, the DSP rounds or discards the least significant bit (LSB, the last digit in the number of bits, for example, the 16th digit in a 16-bit sample). Rounding is fairly straightforward and uses algorithms that you are familiar with. Discard discards the information after the least significant bit without analysis.

Both processes have certain errors, they will introduce errors into the equation, these errors accumulate through signal chain processing and are eventually reflected. On the plus side, the LSB is the digital bit with the smallest amplitude, so the error occurs at -96dB for 16-bit samples and -144dB for 24-bit samples. At the same time, the different structures and methods of digital signal processors will also lead to different results.

Difference between digital and analog

Difference between digital and analog

Difference Between Analog Signal and Digital Signal

The sound is analog. And sound is the vibration of the air. How is this sound vibration transmitted?

Analog VS Digital
For example, when a stone is thrown into a calm water surface, the ripples spread around it, but if
Cut in the direction of the waves and look at the cut end, the waveform is as shown in Fig.1.

Air waves spread from the point where sound is emitted even in air. Although it is invisible to the eye, it has a
similar waveform. This is the analog waveform of sound.

Therefore, although it is digital, when such a sound waveform is recorded or communicated by phone or wireless, as
shown in Fig. 2, the change in the analog waveform is electrically replaced with a series of numerical values according to a certain promise. ..

When recording or communicating, if you handle it as analog, it is easy for noise to enter and the sound quality to deteriorate, but when trying
the waveform of the sound as digital = numerical data, you can eliminate that worry and
maintain a certain quality. You can do various processing while maintaining it.

(2) What is convenient when it is digital

Digital audio signals are convenient because they can be recorded and edited using a personal computer, for example.

In addition, 74 minutes of music can be recorded on a CD with a diameter of only 12 cm, and through digital compression processing
, music of the same length can be recorded on an MD with a smaller diameter.

Since digital signals can be compressed in this way, it is also convenient for storing large amounts of information.
Not only sound, but also video signals with a higher amount of information can be recorded and communicated at high speed by using compression technology.

Especially in communication, a two-way digital multiplex communication can be realized communicating multiple pieces of information with a single wire.
In addition to electrical signals, laser light can also be used for optical communication, making communication possible at extremely high speeds.

(3) What is the sampling frequency?

Digital signals are processed at predetermined fixed time intervals.
The sample rate (sample rate) indicates how many times a second is processed and is expressed as Fs or fs.

The sampling frequency unit is Hz (Hertz), and the
44.1 kHz (kilohertz) sample rate means 44,100 pieces of data are processed per second.
(K represents 1000 times)

AD conversion converts a continuous analog signal into a digital signal,
measures the size of the signal at each moment determined by the sampling frequency (sampling) and converts
the result in a binary number (quantization).

On the other hand, DA conversion converts a digital signal into an analog signal,
It reads the digital signal in the sample rate time interval and connects it smoothly.

Since digital signals can be reproduced up to half the sampling frequency, how much
The higher the sample rate, the higher the playable frequency and the better the sound quality.
In familiar areas, 44.1 kHz is used for CD, and 48 kHz is used for DAT and B modes of satellite transmission.

In addition, recent professional equipment uses high sampling frequencies (high sampling), such as 88.2 kHz and 96 kHz,
and are designed to faithfully reproduce even higher frequency sounds to improve sound quality.

What is the audio bit rate? Relationship between “bit depth” and “sample rate” and sound quality PART 2

What is the audio bit rate? Relationship between “bit depth” and “sample rate” and sound quality PART 2

Audio bit rate

What is the bit depth that determines the sound quality?
What is the bit depth that determines the quality of the sound?
The bit rate is calculated by multiplying the two factors of bit depth and sample rate. Bit depth represents the amount of data per divided sample and is an element that affects the quality and expressiveness of the sound.

The sample rate is the finely divided horizontal axis and the bit depth is the one that overlaps the vertical axis. By providing a large amount of data (bit depth) to each sample, it is possible to create finer and more accurate audio data.

Bit depth is the precision of each image in animation production. The higher the bit depth, the more expressive the sound and effects will be, and the higher the sound quality will feel.

What is the sample rate that determines the smoothness?
Of the bit rates that determine audio quality, the sample rate is an element that represents the number of data divisions per second on the horizontal axis. It shows how many tens of thousands of data are divided per second and the higher the number of divisions, the higher the sound reproducibility and sound quality.

It is even easier to understand if you consider it a mechanism similar to animation production. The more images you use per second, the smoother your character will move, and the higher the sample rate (the number of data divisions), the smoother your sound will sound. Also, the amount of data increases according to the size of the sample rate.

In conclusion
To improve sound quality, it is important to increase the audio bit rate. The audio bit rate is determined by two factors: the bit depth, which determines the expressiveness of the sound, and the sampling rate, which determines the smoothness of the sound.

However, keep in mind that the higher the audio bit rate, the more beautiful the audio will be and will also be affected by the original sound source. If you want to create high-quality data from the moment of recording, why not ask a production company that has high-quality recording equipment?

I think there are many people who are concerned about the balance between capacity and sound quality when it comes to the audio system mounted on a computer. Professional engineers and mixers will come up with the best balance, so don’t hesitate to contact us on these points.

What is the audio bit rate? Relationship between “bit depth” and “sample rate” and sound quality

What is the audio bit rate? Relationship between “bit depth” and “sample rate” and sound quality

Digital Audio Basics

Bit rate is one of the factors that determine the quality of a video job.

BitRate

Among them, the one that determines the audio finish is the audio bit rate. Understanding the bit rate will allow you to control the sound quality and create better quality audio.

So on this occasion, I will explain bit depth and sample rate which are indispensable when talking about sound quality.

What is a bit rate?
What is a bit rate?
■ Bit rate represents high quality

Bit rate is a numerical value that represents the amount of information in the data, and the height of the bit rate is proportional to the quality of the data.

By looking at the bit rate, you can see how much data is packed in one second. Generally, the higher the bit rate, the higher the sound quality, and the sound and video are more realistic.

■ Audio bit rate and video bit rate

In the case of video, the bit rate is divided into two types, video and audio, and the overall quality of the video is determined by the height of each bit rate.

The “video data rate” shown in the video properties is the video bit rate and the bit rate assigned to “audio” is the audio bit rate. The “total bit rate” is the total of the two types of bit rates, video and audio, and is the bit rate of all video.

However, high bit rate audio is not always good. This is because if the quality of the original audio data or the equipment used for recording is poor, poor quality audio will be played. For example, if the original voice contains noise, the noise part will be reproduced realistically and the voice will be difficult to hear. Similarly, if the bit rate of the video is low, the movement will be choppy and unnatural, or the video will be uneven.

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 7

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 7

DSD vs. PCM

MQA encoding processing can be performed on 44.1 kHz to 768 kHz linear PCM sound sources and can be stored in existing file formats (container formats) such as FLAC, ALAC, and WAV. If you use a compatible device equipped with a dedicated decoder, you can “open origami” and demonstrate the original quality. It can be played with an unsupported device, but in that case, “Origami cannot be opened”, so normal PCM playback will be performed that does not include the original MQA information.

MQA is available for download at e-onkyo music. It has the advantage that it can be used with a low communication volume even with streaming type distribution, and abroad, the “TIDAL” music distribution is developing a high quality 96 kHz / 24 bit MQA streaming distribution ” TIDAL Masters “.

High Resolution Portable Player “DP-X1” Quickly Achieved MQA Compatibility
Current Status of Supported High-Resolution Formats and Music Distribution Services (as of July 2017)
WAV FLAC ALAC DSD MQA others
e-onkyo music 〇〇 — 〇〇 Dolby TrueHD
blackberry — 〇 — 〇 — —
OTOTOY 〇〇〇〇 — —
Recochoku — 〇 — — — —
My sound — 〇 — — — —
slots — 〇 — 〇 — —
Is FLAC a powerful option right now?
Generally speaking, the high resolution format has been “PCM or DSD”, and for PCM, the choice has been “lossless compression” or “uncompressed”. There may be circumstances on the side of the playback device and sound preferences, but in terms of the balance between data compression effect (file capacity) and sound quality, “lossless” is a reasonable existence.

Among them, FLAC, which is easy to use on smartphones, will remain high resolution as source code is published and royalties are not incurred, it is already compatible with many compatible high resolution audio devices, and will be supported in the system level in the next iOS (iOS 11), will be the centerpiece of the format.

Looking around, there are signs of change, such as the emergence of a new face, MQA, and the spread of streaming services, but the audio format must have attractive content. Above all, I would like to hope that the record company / distribution service company is actively working in high resolution.

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 6

PCM DSD

Review the basics of “high resolution”. What is the difference between DSD, FLAC, MQA, etc.?

There are several formats, even if it says “high resolution”. If the format changes, the amount of sound information and thus the sound quality will change, the file size will also change, and whether the playback device / software will support it or not, it will also change, so choose a format is important. We will explain the main formats incorporating common technologies and unique pieces.

There are several high-resolution formats, but …
What is the sampling frequency?
Most digital sound sources are “linear PCM”. This is data obtained by digitizing (sampling) the sound waveform (analog signal) in a canned cycle, and that cycle is called the “sample rate.” If sampling is done every 1/44100 of a second it will be “44.1 kHz”, if it is 1/96000 of a second it will be “96 kHz”, if it is 1/192000 of a second it will be “192kHz”. This means that the implementation cycle of is shorter and the amount of information is greater. In other words, if you look at this number, you can see “how finely the sound was measured with respect to time.”

What is the number of quantization bits?
Value that indicates the number of steps in which the amplitude of a signal is expressed when an analog signal is converted into a signal.

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 4

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 4

DSD vs PCM

The number of audio devices that support ALAC has increased in recent years, and some high-resolution distribution sites handle it as well, but FLAC is compatible with more software / hardware, and there is a significant advantage in sound quality over FLAC. . it will eventually restore to the same linear PCM data), so there is no reason to choose it aggressively except for users who are unified with Apple products.

DSD vs PCM

“ALAC”, which is compatible with Apple software / hardware, can also be used for high resolution playback.
Wav
Classification:
PCM compression: no
extension: .wav

It is widely used as a linear PCM container format (it can store various types of data, and the codec can be selected for musical use), and it has been widely adopted by high-resolution distribution sites. Since linear PCM is not compressed, the file size is large, but since no decoding (decoding) processing is required, there are quite a few people who dare to choose WAV from a sound quality point of view.

It is playable on most digital audio software / devices, but tags / metadata such as artist names and album images cannot be expected to display a lot. It is possible to store data, but the format is not officially defined, so it can be displayed in one environment but not another. FLAC and ALAC are more convenient than WAV when using an application like Roon that searches for information, such as related artists, by referencing metadata.

Linear PCM with a sampling frequency of 352.8 kHz (8 times 44.1 kHz) or 384 kHz (8 times 48 kHz) and a quantization bit rate of 24 bits or more is called “DXD” (Digital eXtreme Definition). Originally intended for SACD production (because DSD is not suitable for editing), it is now used as a distribution format as well.

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 3

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 3

DSD vs PCM

PCM or DSD

PCM to DSD

So far, we have looked at high resolution definition on the premise of PCM, but “DSD (Direct Stream Digial)” is also recognized as a type of high resolution sound source. Regarding DSD, which is not clarified in the JEITA definition, the Japan Audio Association definition recognizes products that support 2.8MHz / 5.6MHz DSD playback as compatible high-resolution devices.

Sound reproduction methods are almost completely different between PCM and DSD. DSD always records 1 bit width (1 bit is represented by “0” or “1”) subdividing only in the direction of the time axis.

In the case of PCM, the resolution can be increased and the dynamic range can be expanded by increasing the number of bits, but on the other hand, quantization noise occurs (distortion caused by an error in rounding up / down the amplitude value real). . DSD is set to 1 bit to create a state in which no quantization noise is produced and the amount of information is ensured by significantly increasing the sample rate.

DSD has a 1-bit amplitude of “0 or 1” and records sound information with little information in the direction of the time axis.
(The picture is taken from the product information page of Sony HDD audio player “HAP-Z1ES”).
Main high-resolution formats
Data as uncompressed or compressed, lossless or lossy as compressed, PCM or DSD is primarily a high resolution format, and depends on the playback device / software to be used in addition to the taste of the sound. Let’s explain the characteristics of each format for the formats handled by the so-called music distribution sites.

FLAC
Classification:
PCM compression: Yes (lossless)
Extension: .flac

“FLAC” is a representative of high resolution audio sources. It is an open source lossless codec (software whose source code is open to the public and can be freely improved / redistributed), and one of the reasons for its widespread use is that it does not generate royalties. It is thought that the sense of security that allows us to confirm that we are there also helped spread the word.

It is often created (encoded) using uncompressed linear PCM as the source, and can be compressed to a file size of around 60% compared to the source. There are many supported hardware / software. At the moment, iOS is late (some apps have already supported it), but it almost certainly is if it is a device that claims to support high resolution. If you are wondering which format to choose, choose FLAC and you are right.

Most of the hardware / software and distribution sites that claim to support high resolution support FLAC (image is blackberry screen).
ALAC (Apple Lossless Audio Codec)
Classification:
PCM compression: Yes (lossless)
Extensions: .m4a, .mov, .alac

Introduced when Apple realized lossless audio streaming using a compact router (AirTunes). Initially it was an original technology, but then it was made open source and is now popular as a lossless codec alongside FLAC.

It has been well supported by Apple products since its introduction, and the files can be generated (encoded) using iTunes. However, Apple products are designed not to consider high resolution playback, so even if you play high resolution ALAC files on iOS devices, the information will be cut (reduced) to CD quality.

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 2

“High resolution” basics. What is the difference between DSD, FLAC, MQA, etc.? Part 2

Bit Depth, Sample Rate

Thus, there is no strict unified standard for determining “whether or not the sound source is high resolution”, and whether or not it is high resolution is not determined only by the older formats such as FLAC and WAV.

High Resolution Audio

Since the amount of data is required to be greater than or equal to that of an audio CD and, in particular, that the number of quantization bits must be 24 bits or more, a sound source with a “sample rate of 44.1 kHz or more and a number of quantization bits of 24 bits or more “is high resolution. It seems good to think. Of course, it should be noted that the sound quality is not always high quality sound because it meets the high resolution conditions because it is not an audible standard.

Relationship between audio format and codec (for PCM)
Uncompressed or lossless compressed
Linear PCM is the sampled data itself, and theoretically does not deteriorate unless conversion processing is performed. However, since efficient data storage is not considered, the file size increases as the sample rate and the number of quantization bits increase. This is why multi-MB songs with compressed sound sources like MP3 are converted to tens of MB and hundreds of MB in high resolution.

That’s where the “lossless” codec is used. The purpose is to make the data compact by signing the linear PCM (processing to organize the data arrangement / storage pattern according to a certain rule). During playback, it is converted to the original linear PCM in real time, and in theory there is no deterioration in sound quality. “FLAC” and “ALAC” are typical examples, and linear PCM can be reduced to a data size of approximately 60%. The characteristic is that the sound quality does not deteriorate theoretically because the original information is left completely when encoded.

On the other hand, the “lossy compression” codec can achieve a high compression rate so that the data size is 10% of the original, but it is said to sound out of the audible band (human ears cannot perceive) during encoding. High frequency band) is removed. Sounds processed by lossy compression codecs are not classified as high resolution because the presence of sounds outside the audible band is believed to have a great influence on the expression of the realistic sound field and depth, which is You can tell that it is the advantage of high resolution. The way of thinking is dominant. In fact, even the JEITA and Japan Audio Association definition mentioned above does not include high resolution lossy compressed sound sources.

Compressibility and sound quality trends of the main digital audio formats
method Typical format Sound quality Compression rate
(assuming PCM is 100%)
Uncompressed WAV ◎ 100%
AIFF
Lossless compression FLAC ◎ Approximately 60-70%
A THE C
Lossy compression MQA About 15-25%
MP3 △ ～ ○ About 10 to 20%
CAA
Ogg Vorbis
WMA