
MPEG format: specifications and capabilities Part 5
Frame sequence can have any frame arrangement

I, P and B. In industrial practice, it is common to have a fixed sequence (such as IBBPBBPBBPBBPBB), however more powerful encoders can optimize frame type selection based on the context and global characteristics of the footage.
Each type of framing has its own advantages depending on the characteristics of the image (movement activity, temporary masking effects, …).
For example, if the sequence of images changes little from frame to frame, it makes sense to encode more B-frames than P. Since B-frames are not used in the subsequent decoding process, they can be further compressed without affecting the overall video. . quality.
Specific application requirements also affect the choice of frame type: keyframes, channel switching, program indexing, error recovery, etc.
The following statistical characteristics are used for video compression:
1. Spatial correlation: 8 × 8 point discrete cosine transform.
MPEG format
2. Characteristics of human vision – immunity to high frequency components: scalar quantification of DCT coefficients with loss of quality.
MPEG format
3. Large spatial correlation of the entire image: prediction of the first low-frequency transformation coefficient in an 8×8 block (mean value of the entire block).
4. Statistics of occurrence of syntactic elements in the most probable coding stream: optimal coding of motion vectors, DCT coefficients, types of macroblocks, and so on.
5. Sparse matrix of quantized DCT coefficients: encoding of repeated zero elements with the designation of the end of the block.
6. Spatial masking: the degree of quantization of the macroblock.
7. Coding of areas taking into account the content of the scene: the degree of quantization of the macroblock.
8. Adaptation to the characteristics of the local image: block coding, macroblock type, adaptive quantization.
9. Constant adaptive quantization step size: The new quantization degree is set only by a special macroblock type and is not transmitted by default.
10. Temporal redundancy: 16×16 pixel macroblock level forward and backward motion vectors.
11. Coding of the macroblock prediction error taking into account perception: adaptive quantization and quantization of transformation coefficients.
12. Small prediction error: no error can be reported for a macroblock.
13. Macroblock level prediction error fine encoding: Each of the blocks within the macroblock can be encoded or omitted.
14. Motion vectors: slow motion of a fragment of an image with a complex pattern: predicting motion vectors.
15. Appearances and disappearances: forward and backward prediction in B-frames.
16. Interprediction precision: bilinear interpolated (filtered) block differences. In the real world, the movement of objects from one frame to another rarely falls within the bounds of the points. Interpolation allows you to find out the actual position of the object, which often increases the compression efficiency by 1 dB.
MPEG format
17. Motion activity limited in P-frames: macroblocks skipped. When the motion vector and the prediction error are zero. Skipped macroblocks are very desirable in the codestream, as they take no bits except in the header of the next macroblock.
18. Coplanar movement in B-frames: macroblocks skipped. When the motion vector is the same and the prediction error is zero.









