Embedding supplemental data in a digital video signal
An MPEG-encoded video signal includes groups of pictures (GOPs), each GOP having an intraframe coded (I) picture and a series of predictively encoded (P) pictures and bi-directionally predictively (B) pictures. Usually, the GOP structure IBBPBBP . . . is used. However, in order to embed a watermark in the MPEG-encoded video signal, the MPEG encoder is forced to produce a GOP structure which does not normally occur, e.g., a GOP including a BPP sequence. Different symbol values can be assigned to different positions of the BPP sequence in the GOP.
1. Field of the Invention
The invention relates to a method and arrangement for embedding supplemental data in a digital video signal. The invention also relates to an arrangement for decoding the embedded supplemental data.
2. Description of the Related Art
Video and audio signals are increasingly transmitted and recorded in a digitally encoded form, for example, an MPEG bit stream. There is a growing need to accommodate supplemental data in the signal, for example, a watermark to classify the signal as authentic program material. Watermarking digital signals is particularly useful in copy protection applications. The watermark can effectively take the form of a single bit indicating that the signal constitutes copy protected material, or a multi-bit code representing the originator of the material.
In the known MPEG standard for audio and video compression, a copy protection bit has been defined for that purpose. However, a disadvantage of this known method is that the protection bit can easily be modified to circumvent the copyright protection mechanism.
SUMMARY OF THE INVENTIONIt is an object of the invention to provide a method and arrangement for embedding a watermark in a video signal in such a manner that the embedded watermark can easily be detected but is difficult to remove.
To this end, the invention provides a method of embedding supplemental data in a video signal comprising the step of encoding the video signal in groups of pictures comprising an intraframe (I) coded picture and a series of predictively (P) and bidirectionally predictively (B) coded pictures, characterized by encoding the video signal in such a manner that the pattern of picture coding types in a group of pictures (GOP) represents a supplemental data value.
With the invention, it is achieved that a watermark can easily be detected. The picture coding types are accommodated in the picture headers of an MPEG bit stream and can easily be read. However, changing the picture coding type in a picture header to remove the watermark renders the picture data no longer compliant with the coding standard. The MPEG bit stream can no longer be decoded by a compliant decoder. The relevant picture must be transcoded to comply with the new picture coding type, e.g., by decoding the picture and encoding it again.
It should be noted that the general idea of generating a predetermined sequence of I, P and B pictures in an MPEG signal so as to mark a digital video signal has also been proposed in Applicant's International Patent Application WO 97/13248, corresponding to U.S. Pat. No. 6,131,161 (Attorney Docket No. PHN 15,391). However, this application was published after the priority date of the present invention and fails to disclose the representation of a supplemental data value by a pattern of picture coding types within a group of pictures.
Preferably, the supplemental data value is represented by a given pattern of B and P picture coding types in a GOP, for example, by the position of a BPP pattern in a GOP. Herewith, it is achieved that changing a picture coding type also changes its reference to other pictures within the GOP and, consequently, ripples through the remainder of the GOP. To remove a watermark, a substantial number of pictures in the GOP must now be transcoded rather than a single picture. There is one exception, i.e., a P picture can be transcoded into an I picture without requiring other pictures to be transcoded as well. However, the I picture must then be encoded with the low amount of bits used for the P picture. This affects the quality of the I picture as well as any P picture referring to this I picture. Consequently, a watermark cannot be removed from a GOP without either transcoding the remainder of the GOP or suffering severe decrease in quality for the remainder of the GOP.
BRIEF DESCRIPTION OF THE DRAWINGS
First, the basic principles of MPEG which are essential to the watermarking method in accordance with the invention will be briefly described.
To achieve efficient video compression, an MPEG encoder encodes pictures in accordance with one of three different coding methods. Some pictures are autonomously encoded, i.e., without any reference to another picture in the video sequence. These pictures are denoted intraframe coded pictures or I pictures. Other pictures are predictively encoded, using a motion-compensated previous picture as a reference (prediction) image. They are denoted P pictures. The previous picture to which a P picture refers, may be an I picture or another P picture. Yet other pictures are bi-directionally predictively encoded. They refer to a previous as well as a future I or P picture, and are denoted B pictures.
Generally, the amount of bits required to represent a picture is most for I pictures, less for P pictures and least for B pictures. The amount of compression and the quality of the decoded video sequence largely depends on the performance of the motion estimation process in the encoder. Motion estimation is the most complicated and computational intensive operation of an MPEG encoder. It is this operation which will make professional MPEG encoders far superior over cost-effective consumer encoders for a long time.
In order to inform an MPEG decoder whether a received picture is an I, P or B picture, a parameter, picture_type, in each picture header of an MPEG video bit stream, describes how the relevant picture has been encoded. If the picture coding type is I, the decoder reconstructs the picture completely from the received picture data. If the picture coding type is P, the decoder reconstructs the picture from the received picture data and an already displayed I or P picture. If the picture coding type is B, it is reconstructed on the basis of a preceding as well as a succeeding I or P picture. It should be noted that the parameter picture_type implicitly specifies the reference picture(s), i.e., a P picture refers to the most recent I or P picture, a B picture refers to the most recent and the next I or P picture.
A series of an I picture and consecutive B and P pictures are called a Group of Pictures (GOP). According to the MPEG standard, an encoder is free to choose the optimum sequence of I, B and P picture coding types. However, only a few GOP-structures are used in practice:
-
- IPPP . . . is used by cheap encoders that do not have access to large amounts of memory;
- IBPBP . . . is used by more advanced encoders; and
- IBBPBBP . . . is commonly used by professional encoders.
MPEG encoders currently under development optimize the GOP structure a little further than the conventional sequences listed above, usually in that an I picture is chosen when a hard scene change occurs.
As
A watermark in the form of a BPP pattern can easily be detected because the picture coding type is included in the respective picture header. However, it is impossible to remove the watermark by merely changing the parameter picture_type. For example, if the parameter picture_type of picture 6 in
Similarly, if the watermark is removed by changing the parameter picture_type of picture 7 in
Neither can the parameter picture_type of P picture 6 or P picture 7 be changed from P into I because, in that case, a predictively encoded picture is then interpreted as an autonomously encoded picture (pixel differences are interpreted as pixels).
Accordingly, in order to remove a watermark, the relevant picture has to be transcoded, i.e., decoded into the pixel domain and encoded again in accordance with its modified picture coding type. That is not attractive for a hacker because, as mentioned before, high-quality encoding involves complicated motion estimation circuitry, unless a severe degradation of quality is accepted. In this respect, it is to be noted that not only the picture whose picture_type parameter has been changed has to be re-encoded. Pictures referring to the modified picture are to be re-encoded as well. For example, if picture 7 in
The occurrence of two consecutive P pictures rarely occurs by accident. The number of false alarms (watermark detected in a non-watermarked signal) is thus limited. To further reduce the false alarm possibility, a requirement can be imposed on the maximal amount of GOPs between two watermarked GOPs. For example, a video stream is specified to be copyright protected if watermarked GOPs occur in small enough intervals.
The above-described concept of watermarking a GOP allows messages of any length to be embedded in an MPEG video signal. To this end, different supplemental data values are assigned to different positions of the BPP pattern in the GOP. A first example thereof is shown in
It will be appreciated that the alphabet of supplemental data values can be further enlarged. For example, six different message symbols 0-5 may be assigned to GOP structures in accordance with the following Table I:
The three MPEG encoding modes (I, P, B) are symbolized in
The current encoding mode (I, P, B) is controlled by the control circuit 40 which controls the selection switch 36 through a picture coding type signal PT in accordance with a received watermark message w to be embedded.
In summary, a method of embedding a watermark in an MPEG-encoded video signal is disclosed. An MPEG-encoded video signal includes groups of pictures (GOPs), each GOP comprising an intraframe coded (I) picture and a series of predictively encoded (P) pictures and bi-directionally predictively (B) pictures. Usually, the GOP structure IBBPBBP . . . is used. In accordance with the invention, the video signal is watermarked by forcing the MPEG encoder to produce a GOP structure which does normally not occur, e.g., a GOP including a BPP sequence. Different symbol values can be assigned to different positions of the BPP sequence in the GOP.
Claims
1-8. (canceled)
9. An encoded video signal with embedded supplemental data, comprising groups of intraframe (I), predictively (P) and bidirectionally predictively (B) coded pictures, characterized in that supplemental data values are represented by respective predetermined patterns of picture coding types (I, B and P) in a group of pictures.
10. The signal as claimed in claim 9, wherein the supplemental data value is represented by the position of a BPP pattern in a group of pictures.
11. A storage medium stored thereon an encoded video signal with embedded supplemental data as claimed in claim 9.
12. A storage medium having stored thereon an encoded video signal with embedded supplemental data as claimed in claim 10.
Type: Application
Filed: Dec 6, 2004
Publication Date: May 5, 2005
Inventors: Emmanuel Frimout (Eindhoven), Stephanus Nijssen (Eindhoven), Johan Linnartz (Eindhoven)
Application Number: 11/005,260