Video Demystified: A Handbook for the Digital Engineer, Fourth Edition

Figure 13.14 illustrates the basic motion compensation process. Motion compensation forms predictions from previously decoded pictures, which are in turn combined with the coefficient data (error terms) from the IDCT.
Prediction for P pictures is made from the two most recently decoded reference fields. The simplest case is shown in Figure 13.15, used when predicting the first picture of a frame or when using field prediction within a frame.
Predicting the second field of a frame also requires the two most recently decoded reference fields. This is shown in Figure 13.16 where the second picture is the bottom field and in Figure 13.17 where the second picture is the top field.
Field prediction for B pictures is made from the two fields of the two most recent reference frames, as shown in Figure 13.18.
Prediction for P pictures is made from the most recently decoded picture, as shown in Figure 13.19. The reference picture may have been coded as either two fields or a single frame.
Frame prediction for B pictures is made from the two most recent reference frames, as shown in Figure...