
Video encoding, how it works (part 2)

So far, we’ve only talked about image compression. But a full video also involves an audio component. CD-quality sound is believed to need to be digitized at 44.1 kHz at 16 bits per channel, which is equivalent to 706 Kbps per channel (1.4 Mbps for stereo). The quality of the DAT signal determines the sampling rate of 48 KHz (frequency band 4-24000 Hz) and increases the stream to 768 Kbps per channel.
The information compression approach is the same: discarding the part that is not very important for the human ear to perceive. The MPEG standard allows 3 layers of audio compression. Layer 1 uses the simplest algorithm with minimal compression, assuming 192 Kbps per channel. The Layer 2 algorithm is more complex, but the compression rate is higher, only 128 Kbps per channel. A powerful CD-quality digital audio compression algorithm (11 times lossless distinguishable by the human ear) Layer 3 provides the highest possible sound quality with severe transmission restrictions – no more than 64 Kbps per channel. It is primarily intended for the Internet. Its importance is so great that it has received a special abbreviation MP3, which stands for MPEG Layer 3. There are many Internet sites that contain hundreds of thousands of MP3 files of popular music. With the help of special playback programs (Real Audio), MP3 music can be listened to in real time over the Internet, copied indefinitely (note that a typical song is 2-8MB), and illegally distributed. There are already portable MP3 players priced around $ 200 (like the Diamond Rio). The music industry, with tangible losses, began an active fight against MP3 sites (the Recording Industry Association of America found and closed most of them). But the gin is out, you can’t close everyone. Adaptec predicts that billions of songs will be downloaded from the Internet in the coming years and announces MP3 support in the next version of EasyCD Creator. However, in digital editing tasks, audio signal compression is not used, therefore, in allowable stream calculations, it is necessary to allocate up to 1.5 Mbps to the audio component.
MPEG2 for non-linear editing tasks
The term non-linear editing does not correspond to the essence of the process, but only reflects one of its characteristics. In fact, we are talking about video editing, done in digital format on computers. In this case, the original video fragments are subject to mandatory digitization and recording on the hard disk in the form of appropriate files. Unlike tape drives, accessing any of these fragmented files does not require tedious rewinding (and this process is linear), meaning all video frames are available in random order. This important property gave rise to the name of digital editing as non-linear, although, obviously, the possibilities of digital processing are much broader and richer.
Remember that according to the ITU-R BT.601 recommendation, a television frame is a 720×576 matrix. Taking into account the television frame rate of 25 Hz, we conclude that one second of digital video in 4: 2: 2 representation requires 25x2x720x576 = 20,736,000 bytes, that is, the data stream is 21 MBps. Recording these streams is technically feasible, but difficult, expensive, and inefficient in terms of post-processing. The real possibilities of practice require a significant reduction in flows. Many algorithms are known to perform lossless compression, but even the most effective ones do not provide more than 2x compression on typical images.
Until recently, M-JPEG reigned supreme in the world of non-linear video editing systems. The different solutions differed in the degree of compression, which corresponded to different levels of quality of the resulting video. Quite conditionally, 4 levels can be distinguished here: Standard Video (VHS, C-VHS, Video8), Super-Video (SVHS, C-SVHS, Hi8), Digital Video (Betacam SP, DV / DVCAM / DVCPRO, mini -DV, Digital8) and Studio Video (Digital-S, DVCPRO50). For simplicity, we will refer to them as Video, S-Video, DV, and Studio-TV in what follows. Quantitatively, they are generally characterized by horizontal resolution (the number of distinguishable elements in a line: television lines). Video is considered to provide a resolution of up to 280 lines and corresponds to an MJPEG stream of approximately 2 MBps.





