
MPEG format: specifications and capabilities Part 3
Due to the fact that MPEG is developed in an organization as accredited as ISO and is a fairly universal compression method (it can be used in video recording, television broadcasting, home video editing, multimedia programs (educational, games), teleconferencing , creating videos for Internet presentations), has become the dominant standard for digital video compression, eliminating the need to use many incompatible video compression methods that existed before.

How MPEG Video Works
A digital color image of the compressed sequence is converted to the YUV (YCbCr) color space. The Y component represents intensity and the U and V components represent chromaticity. Since the human eye is less sensitive to chromaticity than intensity, the resolution of color components can be reduced 2 times vertically, or both vertically and horizontally. For high-quality studio video and animation, downsampling is not applied to preserve quality, and for consumer applications, where streams are lower and equipment is cheaper, this action does not lead to a Noticeable loss in visual perception, while saving valuable bits of data. .
MPEG format
MPEG format
MPEG format
The basic idea of the whole scheme is to predict the movement from one frame to another and then apply a discrete cosine transform (DCT) to redistribute the redundancy in space. DCT is carried out in blocks of 8×8 points, the prediction of movement is carried out in the intensity channel (Y) in blocks of 16×16 points or, depending on the characteristics of the original image sequence (interlaced, content), in blocks of 16×8 points. In other words, a given 16×16 dot block is searched in the current frame in the corresponding larger area in the previous or subsequent frames. The DCT coefficients (original data or the difference of this block and its corresponding) are quantized, that is, they are divided by a certain number to discard insignificant bits. Many coefficients after such an operation turn out to be zero.
Frame rates and types
There are three types of encoded frames. I-frames are frames encoded as still images, without reference to the next or the next. They are used as starting points. P-frames are predicted frames from previous I or P-frames. Each macroblock in a P frame can come with a vector and the DCT coefficient difference of the corresponding block from the last decoded I or P, or can be encoded as in I, if the corresponding block was not found.
And finally, there are B-frames that are predicted from the two closest I or P-frames, one before and one after. The corresponding blocks are searched in these frames and the best of them is selected. The forward vector is searched, then the inverse, and the average between the corresponding macroblocks in the past and the future is calculated. If this doesn’t work, then the block can be encoded as an I-frame.









