basic principles of digital video compression techniques Archives

Digital Video Compression Part 6

Free Download Mp4Gain

Digital Video Compression Part 6

General description

DCT is a transformation widely used in image compression. The static JPEG compression standard, the H.263 video conferencing standard, and the MPEG digital video standards (MPEG-1, MPEG-2, and MPEG-4) use DCT. These standards use, in particular, two-dimensional DCT, applied sequentially to 8 x 8 pixel image blocks. DCT calculates 64 (8×8 = 64) coefficients, which are then quantized, thus providing true compression. In most images, most DCT coefficients, due to their smallness, are zeroed after quantization. This property of DCT is the basis for many compression algorithms that use DCT.

Furthermore, the human eye is known to be much less sensitive to high-frequency image components represented by large DCT coefficients. At these larger values of the coefficients, a larger quantization factor can (and usually does) be applied. In particular, the matrix of 64 quantization factors for each of the 64 DCT coefficients used in the JPEG algorithm has large quantization factors for the higher frequency DCT coefficients, respectively. Once quantized, the coefficients are subjected to the RLE algorithm. In addition, for frequent combinations, short code words are used, for the rarer ones, relatively long. A probabilistic coding is carried out.

DCT, in turn, is best explained with the example of a one-dimensional DCT. Two-dimensional DCT is a one-dimensional DCT that is applied sequentially for each row (row) of a block of pixels and each column of a block of pixels obtained from a one-dimensional DCT of rows. One-dimensional DCT applied to N samples (pixels in an image or samples in an audio file). DCT is an NxN matrix whose rows are cosine functions:

DCT (m, n) = sqrt ((1 – delta (m, 1)) / N) * cos ((pi / N) * (n – 1/2) * (m-1))
where
DCT (m, n) is a one-dimensional DCT matrix
m, n = 1, …, N
pi = 3.14159267 …
N = number of samples per block
delta (m, 1) = 1 if m = 1 and 0 otherwise
cos (x) = cosine of x, measured in radians.
Naturally, using DCT in a block of N samples will require N * N multiplication and addition operations. However, due to the recursive structure of the DCT array, actually much fewer math operations are required, namely N log (N). This property makes DCT really applicable in modern personal computer math processors.

At the beginning At the beginning
Discrete Wavelet Transform (DWT)
Compressors using DWT (Discrete Wavelet Transform): Intel Indeo 5.x; Intel Indeo 4.x

Advantages and disadvantages

Most of the static and dynamic images compressed using the DWT algorithm do not have the characteristic block structure of the DCT algorithm.
The relative quality of DWT compressed images is superior to DCT compressed images with the same compression ratios.
DWT smudges a bit, rounds out the sharp outlines of the image. The so-called edge noise or Gibbs effect.
General description

The DWT algorithm is based on the transmission of a signal, such as an image, through a pair of filters: low pass and high pass. A low pass filter produces a coarse waveform of the original signal. A high pass filter produces a difference signal or additional detail.

In turn, the result at the output of the high-pass filter (additional detail signal) can be subjected to the same procedure, and so on.

A simple example of DWT is DWT Hara:

The input signal x [n] is a plurality of samples with index n. Haar’s low-pass filter is the arithmetic average of two successful samples:

g [n] = 1/2 * (x [n] + x [n + 1])

Haar’s high-pass filter is the average difference of two successful samples:

h [n] = 1/2 * (x [n + 1] – x [n])

Note that:

x [n] = g [n] – h [n] x [n + 1] = g [n] + h [n]

The output sequences g [n] and h [n] contain redundant information. Therefore, it is clear that to reproduce the original signal x [n], it is sufficient to take only odd or even samples. As a rule, even samples are taken. Therefore, the original signal x [n] is obtained only from: g [0], g [2], g [4], …. h [0], h [2], h [4], .. …

x [0] = g [0] – h [0]

x [1] = g [0] + h [0] x [2] = g [2] – h [2] x [3] = g [2] + h [2] and so on …

The low pass filter output, as noted, is a rough analogy to the original signal. If the original signal is an image, the low-pass filter output will produce a blurry, fuzzy, and low-resolution image. The high-frequency signal output adds detail to the image. In combination with the low-pass filter output, the original image can be reproduced in this way.

Free Download Mp4Gain

Mp4Gain Main Window

Mp4Gain Features

Free Download Mp4Gain

Digital Video Compression Part 5

Digital Video Compression Part 5

General description

The basic idea behind vector quantization is to divide the image into blocks (4×4 pixels in the YUV color scheme for Indeo and Cinepak compressors). As a general rule, some blocks are similar to each other. In this case, the compressor identifies a class of similar blocks and replaces them with a common block. Furthermore, a binary table (map) of said common blocks is generated from the shorter code words. The VQ decoder then uses a table to assemble the image block by block from common blocks. It is clear that this encoding method is lossy, since, strictly speaking, the similarity of blocks is very relative. Here the approximation of blocks of real images to the common that joins them is allowed. The encoding process is long and laborious, since the encoder needs to identify the membership of each block of the image to some common block. However, the decoding task in this case is reduced to the task of constructing an image from a given map from common blocks and does not require much hardware and time resources. A table or map is also called a codebook, and the binary codes included in it are codewords, respectively. The highest compression using the VQ algorithm is achieved by reducing the number of common block classes, that is, by assuming the similarity of a relatively large number of image blocks and consequently reducing the codebook. As the size of the codebook decreases, the quality of the played video deteriorates. As a result, an artificial “lock” appears on the image. (Vector quantization and signal compression, A. Gersho and R. Gray, Boston, MA: Kluwer, 1992). A table or map is also called a codebook, and the binary codes included in it are codewords, respectively. The highest compression using the VQ algorithm is achieved by reducing the number of common block classes, that is, by assuming the similarity of a relatively large number of image blocks and consequently reducing the codebook. As the size of the codebook decreases, the quality of the played video deteriorates. As a result, an artificial “lock” appears on the image. (Vector quantization and signal compression, A. Gersho and R. Gray, Boston, MA: Kluwer, 1992). A table or map is also called a codebook, and the binary codes included in it are codewords, respectively. The highest compression using the VQ algorithm is achieved by reducing the number of common block classes, that is, by assuming the similarity of a relatively large number of image blocks and consequently reducing the codebook. As the size of the codebook decreases, the quality of the played video deteriorates. As a result, an artificial “lock” appears on the image. (Vector quantization and signal compression, A. Gersho and R. Gray, Boston, MA: Kluwer, 1992). that is, an assumption about the similarity of a relatively large number of image blocks and, as a consequence, a decrease in the codebook. As the size of the codebook decreases, the quality of the played video deteriorates. As a result, an artificial “lock” appears on the image. (Vector quantization and signal compression, A. Gersho and R. Gray, Boston, MA: Kluwer, 1992). that is, an assumption about the similarity of a relatively large number of image blocks and, as a consequence, a decrease in the codebook. As the size of the codebook decreases, the quality of the played video deteriorates. As a result, an artificial “block” appears in the image. (Vector quantization and signal compression, A. Gersho and R. Gray, Boston, MA: Kluwer, 1992).

A simple example: compare the following three 4 x 4 blocks.

(Block 1)
128 128 128 128

128 128 128 128

(Block 2)
128 127 128 128

128 128 128 128

128, 128, 127, 128

128 128 128 128

(Box 3)
128 127 126 128

128 128 128 128

127 128 128 128

128 128 128 128

These three blocks are indistinguishable to the human eye. Therefore, the second and third block can be safely replaced with the first. So the codebook will look like this:

Codebook [1] = 128128128128

128 128 128 128

In conclusion, we note that an important feature of VQ technology is that when compressing video, the same codebook can be used for multiple picture frames.

At the beginning At the beginning
Discrete Cosine Transform (DCT)
Compressors using DCT: Motion JPEG; Editable MPEG; MPEG-1; MPEG-2; MPEG-4.

Advantages and disadvantages

Digital Video Compression Part 4

Digital Video Compression Part 4

Lossy compression

(2) Compression can be lossy if information is lost during the compression process. However, from the point of view of human perception, lossy compression should be considered only that compression in which it is possible to distinguish with the naked eye the result of compression from the original.

Thus, despite the fact that the two images, the original and the result of compression with one or the other compressor, may not coincide little by little, however, the difference between them may be completely imperceptible. Examples include the JPEG algorithm for compressing static graphics and the M-JPEG algorithm for compressing video.

Compression without loss of perception
(3) Although technically lossy compression, a compression scheme can at the same time appear to be lossless in terms of human perception. Most lossy compression technologies have a so-called Compression Quality Factor (QF), which characterizes the perceived quality side and ranges from 0 to 100. With a compression quality factor of 100, the perceived quality characteristics of the compressed video are indistinguishable from the original. .

Lossy compression
(4) JPEG and MPEG and other lossy compression technologies are sometimes compressed, losslessly, exceeding the limits of compression in terms of perception of video information. However, compressed video and still images are quite acceptable for proper human perception. In other words, in this case the so-called natural degradation of the image is observed, in which some small details of the scene are lost. Something similar can happen in natural conditions, for example, rain or fog. The image in such conditions is usually distinguishable, but its detail is reduced.

Unnatural quality loss compression
(5) Poor compression quality, which greatly distorts the image and introduces artificial details (not present in the original) of the scene, is called unnatural lossy compression. An example of this is true “blocking” in highly compressed MPEG and other compressors using BDKP technology. Unnaturalness lies mainly in the violation of the most important features of the image from the point of view of human perception – the outlines. Experience shows that it is precisely the contours that allow a person’s perceptual apparatus to correctly identify a particular visual object.

In conclusion, we note that all widely used video compressors use lossy compression technologies. At high enough compression ratios, they will all compress with an unnatural loss of quality.

Thus, when choosing one or the other compressor to compress digital video, it is necessary to achieve compression, at least with a natural loss of quality.

Video compression algorithms and technologies overview
Run length encoding
The compressors that use RLE technology are:

Microsoft RLE (MRLE) RLE is also used to encode coefficients in the BDCP used in MPEG-1234, H.261, H.263, and JPEG.

Advantages and disadvantages

It works exclusively with 8-bit images.
Not suitable for full color video compression.
General description

RLE encodes a sequence of repeating picture elements or monochrome elements with one code word. For example, a sequence of pixels 77 77 77 77 77 77 77 can be encoded as 7 77 (for seven 77-rock). RLE is good for compressing images in which there is a repetition of the outlines or colors of individual elements. There is much less color repetition in full color images, so compressing full color video with RLE technology is pointless.

Vector quantization (VQ)
The compressors that use VQ technology are Indeo 3.2 and Cinepak. They both use the YUV (not RGB) color scheme.

Advantages and disadvantages

The encoding process is very time consuming and practically impossible without special additional equipment.
The decoding process is very fast.
Locks out distortion at high compression ratios.
With technologies that use BDKP algorithms, Fibreboard can achieve higher compression levels.

Digital Video Compression Part 3

Digital Video Compression Part 3

VDOWave or VDOLive from VDONet (VDOM, VDOW)

VDONet releases a wavelet-based video compressor included with the 32-bit implementation of Video for Windows. Microsoft uses VDOWave as part of NetShow. There are currently two versions of the VDOWave compressor:

VDOWave 2.0 is a fixed rate video codec.

VDOWave 3.0 is a “scalable” video codec.

The NetShow 2.0 standard kit only installs the VDOWave decoder. The NetShow 2.0 development tool installs both the VDONet VDOWave encoder and decoder. According to some tests, VDOWave significantly outperforms MPEG-1 and other compressors based on the block discrete cosine transform algorithm in compression, but only at low kbps rates.

MPEG-4 (MPG4)
Microsoft’s NetShow 2.0 installs the MPEG-4 compressor, which is a new international standard that, however, has not yet been officially recognized.

AVI compressor performance
To test the performance of various compressors, a series of experiments was performed on a 10-second video clip recorded at 30 frames per second, 320 x 240, in a 24-bit-per-pixel color palette.

Furthermore, this video fragment was sequentially compressed by various video compressors using Microsoft Video for Windows 1.1. The results of the comparison are shown in the table. Except for the cases mentioned, the quality obtained did not differ significantly from the original.

Legend:

FCC, fluctuating in the range of 0 .. 100 and configurable compression quality factor.

TC: The technology used in compression.

CC X: keyframe every X.

Digital video compression technologies
There are many technologies for compressing digital video, but we will focus on those that formed the basis of the most popular video compressors. Some of the compressors considered use more than one compression technology, but some combination of them. For example, both Indeo 3.2 and Cinepak use vector quantization. The international standards MPEG-1, MPEG-2, MPEG-4, H.261 and H.263 use a combined BDKP technology and motion compensation. Some modern algorithms use Discrete Wavelet Transform (DWT) technology. Other technologies include fractal image compression.

First, let’s define the term “video compression quality”. Despite all the subjectivity of this concept, we will try to conditionally divide the compression quality into the following levels (compression ratios).

Lossless compression
(1) Image compression can be performed without quality loss only if there was no data loss during the compression process. As a result, the image obtained after decompression will exactly match (bit by bit) the original. Examples of such compression are GIF for static graphics and GIF89a for video.

Digital Video Compression Part 2

Digital Video Compression Part 2

Individual images are made up of macroblocks. The macroblock is the main structural unit of image fragmentation.

Corresponds to an area of the image with a size of 16 * 16 pixels. It is for them that the displacement vectors are determined relative to the I or P images. The total number of macroblocks in the image is 396. To increase the resistance of the image recovery process to possible data transmission errors, the Consecutive macroblocks are combined into independent sections (slices), with a maximum number of 396. In the limiting case of “clean transmission”, only one section of the image falls on the image. 396 macroblocks. In turn, each macroblock consists of six blocks, four of which contain information about the brightness Y, and one at a time determine the U and V components of color. Each block is an 8 * 8 matrix.

In conclusion, we would like to emphasize an important feature of MPEG streams: they do not allow frame-by-frame editing. In fact, due to the complex dependency on the images described above, making changes to one of them will inevitably lead to the need for complex calculation of many parameters across the entire group.

Motion JPEG or M-JPEG (MJPG)
Most desktop video capture and video editing systems use this method when recording video to AVI files. In Motion JPEG, each video frame in an image is compressed separately using the JPEG standard. In this case no other additional algorithms are used. The undoubted advantage of this method is the possibility of editing video without loss of quality, since the frames are independent. This, in fact, determines the use of this method precisely as a video storage mechanism that serves for editing, and not for distribution. Motion JPEG uses the Block Discrete Cosine Transform (DCT) algorithm to compress images.

Editable MPEG (XMPG. MPGI)
Editable MPEG, such as M-JPEG, used for digital video editing is an AVI file consisting only of MPEG type i frames. However, all other MPEG compression mechanisms are involved here. It is included in the standard delivery of Microsoft Video for Windows 1.1. and is used by desktop digital video editing systems like Adobe Premiere.

Although the above compressors are quite popular, they are far from all standards for compressing AVI files. In characterizing this group of compressors, it can be seen that they were designed and created primarily as a means of compressing video and audio data stored on hard drives and CDs, and this, in turn, indicates their low compression capacity and relatively high quality. high during playback. .

With the advent of the Internet, methods and tools for compressing video and audio data are becoming increasingly popular, allowing, using advanced technologies (sophisticated motion estimation and compensation, wavelets, fractals, and others) to achieve kilobits. / second smaller. ratios that allowed, for example, videoconference sessions over the Internet. It is clear that such compression methods provide significantly higher compression ratios with relatively low quality.

Compressor generation – H.XXX
More recently (5/18/97), a new series of digital video compressors was officially registered, defining trends in the development of video compression mechanisms. Some compressors in the H.XXX family, such as H.261, are quite popular, others are little known and use advanced and improved technologies such as wavelets.

A distinguishing feature of the H.XXX family compressors is their focus on reducing the flow of digital video over the Internet, which naturally leads to the quality factor fading into the background. Microsoft is improving some of the products in the H.XXX family, and some are already included in teleconferencing packages such as NetShow and NetMeeting.

Digital video compression

Digital video compression

The number of different formats for presenting digital videos is huge.

One of the most common problems faced by a stumped user is how do they differ and which one is the best? This article is an attempt to help a user lost in many standards. Furthermore, the subject of this article is a review and comparative analysis of various compressors for digital video compression, as well as an overview of the main ideas and algorithms used by these compressors. In addition, this article reveals the capabilities of the new compressors used to transmit video information over the Internet. The article is useful both for beginners who want to understand the variety of digital video presentation formats and understand how they differ, and for programmers looking for answers to specific questions.

AVI (Audio Video Interlive)
AVI files are a special case of RIFF files. (short for Resource Interchange File Format). This format, originally intended for the exchange of multimedia data, was Microsoft and IBM. This format is the most common form of video presentation on personal computers. AVI files come in different standards depending on the way the video data is presented.

AVI file standards
Video data in AVI can be formed and compressed in various ways. For example, Video for Windows 1.1 comes with a standard set of compressors. Namely:

Intel Indeo (version 3.2);
Microsoft Video 1;
Microsoft RLE (run-length encoding)
Cinepak;
….
Each standard has a so-called Four Character Code (FOURCC).

Full frames (uncompressed)
You can keep AVI files as uncompressed frames. You don’t need any compressor for this.

The 4 character code for this standard is ‘DIB’, (Microsoft Device Independent Bitmap).

However, there are at least three more FOURCCs for uncompressed AVI files:

‘RGB’; ‘RAW’; 0x00000000 (FOURCC, whose hexadecimal value is 0)

Variable Length Encoding (MRLE)
The peculiarity of this video image encoding scheme is that MRLE is capable of encoding video only in an 8-bit palette (256 colors). It does not support modes called “High Color” (2 ^ 16 colors) or “True Color” (2 ^ 24 or 2 ^ 32 colors).

Microsoft Video 1 (MSVC or CRAM)
Microsoft Video 1 only supports the 8-bit or 16-bit color gamut.

Intel Realtime Video 2.1 (Indeo 2.1) (RT21)
Included in the standard distribution of Microsoft Video for Windows 1.1.

Intel Indeo 3.1 / 3.2 (IV31 and IV32)
A feature of the method is the use of the vector image quantization algorithm (see below).

Intel® Indeo® Multimedia Kit 5.0

Cinepak (CVID)
One of the most popular and widely used video compressors for Windows. Provides the fastest video playback. Unlike the Indeo 32, which provides slightly better quality, but significantly overloads the processor during decompression, Cinepak offloads the processor resource as much as possible.

Today there are at least three standards for Cinepak.

Cinepak SuperMac (original 16-bit compressor)
Cinepak Radius (new and improved 16-bit compressor);
Cinepak Radius [32] (32-bit version of Radius Cinepak included with Windows 95).
A feature of the method is the use of the vector image quantization algorithm in conjunction with the frame difference algorithm (see below).

MPEG (Moving Image Expert Group)
One of the most popular digital video presentation formats. Currently, there are three of its specifications: MPEG-1, MPEG-2 and MPEG-4. Despite the great flexibility of the standard, which allows various applications to change the values of most of its parameters (such as image resolution, aspect ratio, frame rate) within a wide range, its developers initially focused on it. use of compact discs. (CD-ROM) with a data transfer rate of 50Kbps. As a result, the basic algorithm (MPEG 1) limits the bit rate to the range of 150-225Kbps, the resolution of images (frames) as 352 * 288 (Pal) or 320 * 240 (NTSC), the frequency of its change is 25 (PAL) or 30 (NTSC)). Also, for simplicity, we just consider the Pal system.

MPEG compression uses the following basic ideas:

Digital Video Compression Methods Part 3

Digital Video Compression Methods Part 3

Compression rules

You don’t need to know all the intricacies of the compression process if you follow these general rules of thumb:

If compression isn’t absolutely necessary, don’t use it! Until the editing and design of the material is completed, it is better to work with the original source to have a sufficient quality margin and compress it only at the last stage, that is, immediately before the output to the medium.
If compression is unavoidable, try to use the lowest possible compression ratio. In other words, always use the highest possible bit rate.
Do not turn on the compression systems one after the other and, if possible, avoid compression. This always leads to a loss of quality, and the lower the flow rate, the more pronounced this deterioration will be. The quality will be even worse if between applying the codecs you perform some editing actions or apply special effects.
Digital encoding systems always introduce a time delay and complicate video editing, and the use of compression can lead to desynchronization of audio and video streams.
Any compression system works best with the purest video footage. Signal noise, jitter, graininess and poor lighting when shooting, as well as poor decoding of the original video signal make compression difficult and produce poor results.
Compressed data is always much more susceptible to transmission errors than uncompressed data.
Do not believe the claims of manufacturers comparing codec quality to analog video (“VHS quality”, “quality comparable to Betacam SP”, etc.), because the defects of a compressed digital image are different from those of a Analog video, even if Analog video is played from a consumer VCR known for its defects.
The quality of the image obtained by this or that codec depends directly on the type and condition of the source material. Therefore, question even the most compelling demos, as you can use specially selected material to ensure that a particular codec works perfectly. Always use your own material for testing, selected based on a reasonable balance of “easy” and “hard” stories.
The quality of compression can only be assessed subjectively.

Many types of digital video compression are based on the fact that high-amplitude signals do not occur simultaneously at all frequencies, and when compressing audio signals, it is necessary to perform a frequency analysis of the input signal to develop a masking effect. . Spectral transformations are typically used for such purposes, which are also used in the phase correlation method to measure motion. A special case of spectral transformations are Fourier transforms, which are used in most of the known compression algorithms for audio and video signals. Fourier transforms are a processing technique used to analyze changes in signals over time and express them as a spectrum.

Digital Video Compression Methods Part 2

Digital Video Compression Methods Part 2

For video storage and transmission, joint (composite) coding of video signals is also used, which can be characterized as a method of strong analog compression, as it allows the transmission of the three-component color images described above in the allocated bandwidth. for one-component black and white images.

After digitizing the analog signal, you can apply digital compression to it based on one or another digital encoding algorithm. In principle, it is possible to compress any signal presented in digital form, however, for composite video, only serial double compression (first strong analog and then digital) will be possible, which cannot be considered satisfactory, and the digital compression ratio will be it will inevitably be limited due to the high noise analog signal. So, for example, the most widespread digital compression standard today, MPEG, was developed exclusively for separate (component) video signals and is essentially a method of replacing full-color analog signals with digital signals, if possible. no preliminary analog compression.

Interlaced
Ideosystems traditionally use so-called interlaced scanning, when the lines of each frame are subdivided into even and odd lines, and the aggregates of even and odd lines are called fields. The alternating transmission of odd and even fields instead of a full frame was necessary at one point, not for a good life, so doubling the image refresh rate was guaranteed without doubling the required bandwidth.

Therefore, interlaced scanning can be thought of as a kind of analog compression. The defects of such “compression” are the well-known flickering between lines, poor resolution when transmitting fast-moving objects, and so on.

For best results, digital compression should ideally be done using material from a non-interlaced source, as otherwise it will revert to being a consecutive double compression (as is the case with a composite signal), from which you cannot expect nothing good.

The current prevalence of interlaced scanning in existing video systems means that, in practice, digital compression encoders must also accept interlaced source materials. Interlacing creates a particular difficulty for compression using motion compensation algorithms (such as MPEG), since it is quite difficult to track changes in a situation where successive fields do not describe the same picture points. Furthermore, interlacing complicates the formation of a differential image signal between fields.

It follows from all this that if your goal is to make DVD-video (MPEG-2), even if your player does not support progressive (progressive) scanning and your camcorder has such a mode, it is better to use it.

At the beginning At the beginning
Quantification
The term “quantization” appeared at the junction of analog and digital technologies and does not mean more than a method to represent the value of the signal at a reference point as a binary number with some precision. Therefore, a hardware quantizer divides the range (or voltage scale of an analog signal) into several equal intervals, each of which is represented by a separate number. Consequently, the quantizer generates the number of the interval in which the analog voltage falls (the value itself, in general terms, can be determined from the table).

If information about the position of the analog voltage within the range is lost, then we speak of a quantization error (or error), but since it cannot be greater than the quantization range, then this amount of error can be minimized using a number large enough of intervals.

Therefore, an 8-bit converter has 256 quantization ranges (providing a signal-to-noise ratio of about 50 dB), in a 10-bit converter, respectively, 1024 code words (a signal-to-noise ratio is 12 dB more high) etc.

By the way, the signal is restored in the same way: only the number of the quantization interval can be stored, and the restored value is taken from the corresponding table.

At first
Digital compression
Digital compression, also generally known as data compression or bit stream reduction, is a more sophisticated way of obtaining the most audiovisual results at the lowest cost than analog. And therefore, with digital encoding, sound and video can be brought to the viewer, significantly reducing stream or bandwidth.

Digital video compression methods

Digital video compression methods

Gradually, all the equipment for the production of television and video programs in the world is going digital. Why is this happening?

It is not at all due to the fact that everyone admires digital methods and that the image quality is radically improving, the reasons for such a transition are mainly economic, because now both the price and the cost of operating a digital complex are lower. than a traditional analog with the same functions. Now that digital technology has reached a certain maturity, analog equipment is suddenly less efficient, less reliable and much less profitable.

EN The very foundations of video production and delivery of images to the viewer are currently being revised, and the dynamics of change in the current age – digital – is much higher than in analog. Today, for example, the 4: 3 aspect ratio is gradually being replaced by the 16: 9 aspect ratio, interlaced-progressive (non-interlaced) scanning and changing encoding standards increase image quality. The economy also dictates the need to compress video information. Naturally, any compression degrades the quality of the rendering and is applied not for a good life, but out of necessity.

Do not think that video compression was not used before and that it only appeared with the advent of digital technologies. No, compression has always been used, but only before it was analog, and today it is necessary to make it completely digital, getting rid, as much as possible, of double compression, that is, both analog and digital.

Modern digital compression methods, which replaced analog, especially when combined with computer technology, can not only improve the quality of the video itself, but also expand the possibilities of video production and optimize the display of audiovisual products.

Basic concepts of digital transformation

Essentially, a digital representation, or digitization, is a partition of the domain of a continuous function in some intervals and the representation of this function as a set of values at the end of these intervals. So, for a digital audio signal, the second interval on the time scale is divided into 32, 44 or 48 intervals, in each of these samples the sound is measured and its value is stored with a certain representation precision, generally 14 to 20 bits. These operations are called sampling processes, that is, the representation of any continuous quantity (in this case, sound) through periodic discrete measurements. After that, they say that the sound is digitized with a sample rate of 32 to 48 kHz, respectively, and a bit depth of 14 to 20 bits. Therefore, the digital current

In digital video, the sampling process for a value is simply generalized to a multivalued function that has a range of values, not a number, but an image (that is, it becomes three-dimensional) to break solid lines of a value. image (scanning beam) in so-called pixels, or image elements, our three-dimensional space will be because when the image is presented at each moment of time, not a value is involved, as in sound, but the whole image – a frame . As a result, the video signal is also represented as a sequence of integers, in other words, as a stream of data.

In the case of (separate) component analog video signals, each channel must simultaneously transmit three color components (red – red, green – green and blue – blue, i.e. RGB) or a luminance signal together with signals of color difference (YUV). The conversion of RGB color components to YUV components is one by one, but the latter method makes it possible to highlight the change in signal brightness, for which the human eye provides a separate perception mechanism, with the help of rod cells. in the retina, in contrast to the perception of color by cone cells. Rods are more sensitive than cones, so in systems with separate component coding (e.g.

Sampling theory, in turn, requires that the sampling frequency be at least twice the bandwidth of the signal to be digitized. In the case of a broadband signal (for sound, that is, for example, the simultaneous reproduction of several octaves of frequencies), the sampling frequency must be at least twice the highest frequency of the input signal analog (Fig. 1).

Figure: 1. An example of sampling an analog signal: a – correct; b – wrong. In the first case, the original signal is easily reconstructed in the form of a sample envelope, and in the second, when the signal frequency exceeds half the frequency

What is digital video compression?

What is digital video compression?

Compression is the process of removing or restructuring data to reduce file size. When 3DS МАХ writes render to an AVI file, the information in each frame is compressed based on a selection from a list of software controlled encoders / decoders (codecs).

Codecs are divided into three main categories. First of all, they have loss or not. Lossless compression preserves all data in the image, usually using a technique called Run-Length Encoding (RLE). RLE removes continuous areas of the same color by marking the file with a code that restores the duplicate color during decompression. RLE is very efficient for computer-generated graphics with large areas of the same pixels and not very efficient with digitized analog photos and videos, as these images tend to contain few areas of continuous color.

Lossy codecs are designed to recognize and permanently remove information from an image that the viewer is unlikely to see. The algorithm is sensitive to decimation and spreading in the frame rate. Lossy codecs have quality parameters that control the amount of loss (and therefore the size of the file). These codecs are effective for compressing animated ZO geometry combined with analog video.

A second important category of codecs refers to how compression fits into an entire frame, spatially or temporally. Spatial compression examines each frame, recognizes and removes the detail within the frame. Time compression (temporal compression) compares frames over time in order to strategically and gradually remove data. An important type of temporal compression (frame differentiation) remembers only changed pixels of information from successive frames.

Finally, the codecs support specific pixel depths and are platform specific. Some codecs will only compress 8-bit, others support 16-bit and 24-bit compression. Some only play Video for Windows. Decompression (and recompression) occurs when the movie is played. To do this, the View File in 3D Studio MAX mechanism launches MS Windows Media Player (NT or WIN95).

Codecs are essential when animations are played from a CD-ROM drive or hard disk. The codec selected in the output of the 3D Studio MAX render file affects the visual quality and playback speed of digital animation. As shown in Figure 27.3, the codecs are accessed after specifying the output file name with the AVI extension and clicking the Configure button in the File Browsing dialog.

AVI files can be compressed using any of the software codecs that come with Video for Windows. Most likely, you have at least one codec installed with your operating system. Other codecs are installed with multimedia products and Internet browsers. You can also add third-party codecs to your system for a variety of compression formats. To get the results you want for your project, understand how codecs work and experiment a lot.