METHOD AND APPARATUS FOR HIERARCHICAL PICTURE ENCODING AND DECODING
A residual picture is decoded first by demultiplexing an input enhancement layer bitstream into one or more bitstreams corresponding to an order of quality layers. Next, a residual picture is restored by independently decoding the bitstream for each quality layer. The restored residual picture is stored in a frame buffer. A finally restored residual picture is output by adding, to the restored residual picture, a picture that has undergone motion compensation in a reference frame within a same enhancement layer.
Latest Samsung Electronics Patents:
- Multi-device integration with hearable for managing hearing disorders
- Display device
- Electronic device for performing conditional handover and method of operating the same
- Display device and method of manufacturing display device
- Device and method for supporting federated network slicing amongst PLMN operators in wireless communication system
This application claims the benefit under 35 U.S.C. §119(a) of U.S. Provisional Application No. 61/382,594, filed in the United States Patent and Trademark Office on Sep. 14, 2010, which is incorporated by reference, herein, in its entirety.
FIELDThe inventive concept relates generally to hierarchical (or layered) picture encoding/decoding technology and, more particularly, to a hierarchical picture encoding/decoding method and apparatus for refining picture quality using residual pictures in video compression codec processing videos.
BACKGROUNDGenerally, in video compression technology, pictures (or images) are processed in units of macro blocks having M×N pixels per block. In video processing, macro blocks are encoded and decoded in either an intra mode or an inter mode. The term ‘macro block,’ as used herein, may be understood to refer to a set of pixel blocks that each have a specific size. A ‘frame’ includes multiple macro blocks. The typical video compression technology using macro blocks may include the use of compression standards such as MPEG and H.26x.
The basic concept of video compression is to remove spatially and temporally redundant data from the original picture data. The intra mode is a scheme for removing spatial redundancy. In other words, this mode is a scheme for removing redundancy, between pixels in macro blocks having a specific size, in the current frame. The inter mode is a scheme for removing temporal redundancy. In other words, this mode is a scheme for estimating a difference between macro blocks of a current frame and corresponding macro blocks of a previous or of a following frame (referred to as a “reference” frame) by estimating motions between corresponding macro blocks in the two adjacent frames. The motion estimation is a process of searching for macro blocks in the reference frame, which are similar to macro blocks to be encoded in the current frame. In picture encoding, motion compensation is performed using the macro blocks in the reference frame, which were searched for by the motion estimation process and were found. A picture encoder entropy-encodes a difference between the “found” macro blocks in the reference frame and macro blocks in the current frame, along with a motion vector representing the location of the reference frame during its transmission.
Typically, in motion compensation, a residual encoder/decoder calculates multiple residual differential pictures for the multiple quality layers constituting its input residual picture, and also performs motion compensation on the residual differential pictures independently, causing an increase in its complexity and encoding/decoding time.
SUMMARYAn aspect of the exemplary embodiments is to provide a method and apparatus for reducing complexity and refining picture quality during picture encoding, especially during residual encoding in hierarchical picture processing technology.
Another aspect of the exemplary embodiments is to provide a method and apparatus for reducing complexity and refining picture quality during picture decoding, especially during residual decoding in hierarchical picture processing technology.
In accordance with one exemplary embodiment, a residual picture of picture data in hierarchical picture processing is encoded, and this includes a basement layer and at least one enhancement layer. Also, a picture of the enhancement layer is divided into one or more quality layers. The method includes combining the residual picture with a picture that has already undergone motion compensation in a reference frame within a same enhancement layer. A bitstream is generated for each quality layer by encoding the combined picture independently for each quality layer. Residual differential pictures generated in the process of encoding the combined picture are stored, in associated frame buffers, independently for each quality layer. A motion-compensated picture is output by selectively performing motion compensation on the pictures stored in the frame buffers; and outputting an enhancement layer bitstream by multiplexing the bitstreams for the individual quality layers.
In accordance with another exemplary embodiment, an apparatus encodes a residual picture of picture data in hierarchical picture processing. In the apparatus, one picture includes a basement layer and at least one enhancement layer. A picture of the enhancement layer is divided into one or more quality layers. The apparatus includes a quality layer encoder for combining the residual picture with a picture that has already undergone motion compensation, in a reference frame within the same enhancement layer. The encoder also generates a bitstream and a residual differential picture for each quality layer. It does this by encoding the combined picture independently for each quality layer. The apparatus also has a frame buffer for storing the residual differential pictures generated by the quality layer encoder. The apparatus also has a motion compensator for outputting a motion-compensated picture by selectively performing motion compensation on the pictures stored in the frame buffer. Additionally, there is a multiplexer for outputting an enhancement layer bitstream by multiplexing the bitstreams for the individual quality layers.
In accordance with further exemplary embodiment, there is provided a method for decoding a residual picture of picture data, in hierarchical picture processing, in which one picture includes a basement layer and at least one enhancement layer. Also, a picture of the enhancement layer is divided into one or more quality layers. The method includes demultiplexing an input enhancement layer bitstream into one or more bitstreams. The bitstreams correspond to an order of the quality layers. In this method, a residual picture is restored by decoding the bitstream independently for each quality layer. The restored residual picture is stored in a frame buffer. The finally restored residual picture is output by adding to the restored residual picture a picture that has undergone motion compensation in a reference frame within the same enhancement layer.
In accordance with yet another exemplary embodiment, an apparatus decodes a residual picture of picture data in hierarchical picture processing in which one picture includes a basement layer and at least one enhancement layer, and a picture of the enhancement layer is divided into one or more quality layers. The apparatus includes a demultiplexer for demultiplexing an input enhancement layer bitstream into one or more bitstreams corresponding to an order of the quality layers. The apparatus also includes a quality layer decoder for restoring a residual picture by decoding the bitstream for each the quality layer independently. A frame buffer stores the residual picture that was restored in the equality layer decoder. A motion compensator outputs the finally restored residual picture by adding, to the restored residual picture, a picture that has undergone motion compensation in a reference frame within the same enhancement layer.
The above and other aspects, features and advantages of certain exemplary embodiments will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:
Exemplary embodiments will now be described in detail, with reference to the accompanying drawings. In the following description, specific details (such as detailed configuration and particular components) are provided merely to assist the overall understanding of the exemplary embodiments. Therefore, it should be apparent to those skilled in the art that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the inventive concept. In addition, descriptions of well-known functions and constructions have been omitted for clarity and conciseness, and to avoid obscuring the inventive concept.
The inventive concept includes, among others, a method and apparatus for performing residual encoding and/or decoding on multiple quality layers, constituting an enhancement layer bitstream/picture in a hierarchical picture encoding/decoding process.
First, the terms used herein will be defined, in brief, as follows.
Quality refinement: a process for refining the quality of residual samples reconstructed using refined data.
Quality layers: one or more layers used in the quality refinement process.
Quality basement layer: a layer representing the lowest-quality picture among reconstructed pictures, among multiple quality layers in one picture.
Quality enhancement layer: a layer representing a high-quality picture among reconstructed pictures, among multiple quality layers in one picture.
A general hierarchical picture encoding/decoding apparatus will now be described.
Referring to
It is assumed, in the example of
The format up-converter 105 is configured to perform an up-conversion in terms of, for example, the size (or the frame rate) or a view point of an input picture, and may be considered to perform a process that is a reverse of the process of the format down-converter 101. The format up-converter 105 up-converts the reconstructed basement layer picture into a picture having the same format as that of the enhancement layer. The input picture that is input to the format down-converter 101 is also input to a subtractor 107. The subtractor 107 outputs residual data obtained by subtracting the up-converted picture, output by format up-converter 105, from the input picture. A residual mapping/scaling unit 109 converts the residual data into a residual picture. The residual picture is input to residual encoder 111, which outputs an enhancement layer bitstream by performing residual encoding on the input residual picture.
A method for encoding a residual picture to gradually refine the quality in one enhancement layer will now be described.
The residual encoder 111 includes a quality layer encoder (not shown) and a motion compensator (not shown). The quality layer encoder performs encoding on quality layers independently so as to encode the residual picture in multiple quality layers.
Referring to
It is assumed, in the example of
The residual decoder 205 includes a quality layer decoder (not shown) and a motion compensator. The quality layer decoder performs decoding on the quality layers independently so as to reconstruct the residual picture in multiple quality layers.
Referring to
A residual picture input to the residual encoder is encoded into a first bitstream and a first residual differential picture by the first quality layer encoder 301. The second quality layer encoder 303 encodes a picture, corresponding to a difference between the residual picture and the first residual differential picture, into a second bitstream and a second residual differential picture. The third quality layer encoder 305 encodes a picture, corresponding to a difference between the residual picture and the second residual differential picture, into a third bitstream and a third residual differential picture. The first to third residual differential pictures are inputs of the selective motion compensator 307. The selective motion compensator 307 carries out motion compensation, depending on a motion vector, and outputs a motion-compensated picture. A picture corresponding to a difference between the residual picture and the motion-compensated picture is input back to the first quality layer encoder 301. The first to third bitstreams are integrated (or multiplexed) into one bitstream (i.e., an enhancement layer bitstream) by a multiplexer (not shown), and then transmitted to a decoder over a transport channel.
Referring to
In order to achieve gradual quality refinement for a residual picture, as for residual differential pictures reconstructed by the quality layer encoders in
Referring to
Referring to
Referring to
The first bitstream is converted, in the first variable-length decoder 711, into quantized level values using variable-length decoding. Next, it is converted into a transform coefficient for an N×N block, in the first de-quantizer 713, using a de-quantization operation. The transform coefficient is inverse-transformed on an N×N block basis, in the first inverse-transform encoder 715, and then added to a motion-compensated picture from motion compensator 740. The result is stored in a first frame buffer 750.
The second bitstream is converted, in the second variable-length decoder 721, into quantized level values using variable-length decoding Next, it is converted into a transform coefficient for an N×N block , in the second de-quantizer 723, using a de-quantization operation. The transform coefficient is inverse-transformed on an N×N block basis, in the second inverse-transform encoder 725, and then added to the sum of the picture restored in the first quality layer decoder 710 and the motion-compensated picture from motion compensator 740. The result is delivered to the third quality layer decoder 730.
The third bitstream is converted, in the third variable-length decoder 731, into quantized level values using variable-length decoding. Next, it is converted into a transform coefficient for an N×N block, in the third de-quantizer 733, using a de-quantization operation. The transform coefficient is inverse-transform encoded on an N×N block basis in the third inverse-transform encoder 735, and then added to a sum of the pictures restored in the first and second quality layer decoders 710 and 720 and of the motion-compensated picture from motion compensator 740. The result is stored in a third frame buffer 760 The picture stored in the third frame buffer 760 is output as a residual picture finally restored in the residual decoder.
The pictures stored in the first and third frame buffers 750 and 760 are used for motion compensation in the motion compensator 740.
Referring to
In other words, when the quality refinement between pictures is applied in the hierarchical picture decoder as in the exemplary embodiment, a residual block is restored by adding a motion-compensated block to an inverse-transform encoded block in a quality basement layer (or a first quality layer), and the restored residual samples constitute a picture of the quality basement layer. In addition, inverse-transform blocks of quality enhancement layers (e.g., second and third quality layers in
Referring to
Referring to
As is apparent from the foregoing description, according to the exemplary embodiments, multiple residual pictures have to undergo motion compensation only once, thus ensuring a lower-complexity residual encoder/decoder.
According to the exemplary embodiments, the drift occurs not in the original picture but in the residual picture, and this reduces the degradation in picture quality due to drift, and thereby ensures effective, scalable encoding.
While the inventive concept has been shown and described with reference to certain exemplary embodiments thereof, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the inventive concept as defined by the appended claims and their equivalents.
Claims
1. A method for encoding a residual picture of picture data in hierarchical picture processing in which one picture includes a basement layer and one or more enhancement layers, and in which a picture of the enhancement layer is divided into one or more quality layers, comprising:
- combining the residual picture with a picture that has undergone motion compensation in a reference frame within a same enhancement layer, and generating a bitstream for each quality layer by independently encoding the combined picture for each quality layer;
- storing, in associated frame buffers, residual differential pictures generated in the process of independently encoding the combined picture for each quality layer;
- outputting a motion-compensated picture by performing motion compensation on the pictures stored in the frame buffers; and
- outputting one enhancement layer bitstream by multiplexing together the bitstreams for the individual quality layers.
2. The method of claim 1, wherein:
- a picture corresponding to a difference, between (1) a picture obtained by combining the residual picture with the picture that has undergone motion compensation in the reference frame, and (2) a residual differential picture generated in a process of encoding a picture for a low quality layer is used as an input signal in an encoding process for each quality layer.
3. The method of claim 1, wherein the storing comprises storing, in associated frame buffers, residual differential pictures generated in an encoding process of a highest one of the one or more quality layers and a lowest one of the one or more quality layers.
4. The method of claim 3, wherein the outputting of a motion-compensated picture comprises outputting the motion-compensated picture by performing motion compensation on all of the residual differential pictures generated in the encoding process of the highest one of the one or more quality layers and the lowest one of the one or more quality layers.
5. An apparatus for encoding a residual picture of picture data in hierarchical picture processing in which one picture includes a basement layer and one or more enhancement layers, and in which a picture of the enhancement layer is divided into one or more quality layers, comprising:
- a quality layer encoder adapted to combine the residual picture with a picture that has undergone motion compensation in a reference frame within a same enhancement layer, and adapted to generate a bitstream and a residual differential picture for each quality layer by independently encoding the combined picture for each quality layer;
- a frame buffer adapted to store residual differential pictures generated in the quality layer encoder;
- a motion compensator adapted to output a motion-compensated picture by performing motion compensation on the pictures stored in the frame buffer; and
- a multiplexer adapted to output a single enhancement layer bitstream by multiplexing the bitstreams for the individual quality layers.
6. The apparatus of claim 5, wherein the quality layer encoder uses, as an input signal, a picture corresponding to a difference between: (1) a picture obtained by combining the residual picture with the picture that has undergone motion compensation in the reference frame, and (2) a residual differential picture generated in a process of encoding a picture for a lower quality layer.
7. The apparatus of claim 5, wherein the frame buffer stores residual differential pictures generated in an encoding process of a highest one of the one or more quality layers and a lowest one of the one or more quality layers.
8. The apparatus of claim 7, wherein the motion compensator outputs the motion-compensated picture by performing motion compensation on all of residual differential pictures generated in the encoding process of the highest one of the one or more quality layers and the lowest one of the one or more quality layers.
9. A method for decoding a residual picture of picture data in hierarchical picture processing in which one picture includes a basement layer and one or more enhancement layers, and in which a picture of the enhancement layer is divided into one or more quality layers, comprising:
- demultiplexing an input enhancement layer bitstream into one or more bitstreams corresponding to an order of the one or more quality layers;
- restoring a residual picture by decoding the bitstream for each quality layer independently, and storing the restored residual picture in a frame buffer; and
- outputting a finally restored residual picture by adding a picture, that has undergone motion compensation in a reference frame within a same enhancement layer, to the restored residual picture.
10. The method of claim 9, wherein the outputting of a finally restored residual picture comprises outputting a finally restored residual picture by adding (1) a picture that has undergone motion compensation in a reference frame within a same enhancement layer, to (2) a picture obtained by adding up residual pictures restored in each quality layer decoding process.
11. The method of claim 9, wherein the storing comprises:
- storing in a first frame buffer a picture obtained by adding (1) the picture that has undergone motion compensation in the reference frame to (2) a residual picture restored in a lowest quality layer decoding process, the first frame buffer corresponding to a lowest one of the one or more quality layers; and
- storing the finally restored residual picture in a second buffer corresponding to a highest one of the one or more quality layer.
12. The method of claim 11, wherein the picture stored in at least one of the first and second frame buffers is used to restore a residual picture of a next frame after undergoing motion compensation.
13. The method of claim 9, wherein the storing comprises:
- storing a residual picture restored in a lowest quality layer decoding process, in a first frame buffer corresponding to the lowest one of the one or more quality layers; and
- outputting a picture obtained by adding (1) the picture that has undergone motion compensation in the reference frame to (2) the residual picture restored in a highest quality layer decoding process, as the finally restored residual picture, and storing the finally restored residual picture in a second frame buffer corresponding to a highest one of the one or more quality layers.
14. The method of claim 13, wherein the picture stored in at least one of the first and second frame buffers is used to restore a residual picture of a next frame after undergoing motion compensation.
15. An apparatus for decoding a residual picture of picture data in hierarchical picture processing in which one picture includes a basement layer and one or more enhancement layers, and in which a picture of the enhancement layer is divided into one or more quality layers, comprising:
- a demultiplexer adapted to demultiplex an input enhancement layer bitstream into one or more bitstreams corresponding to an order of the one or more quality layers;
- a quality layer decoder adapted to restore a residual picture by independently decoding the bitstream for each quality layer;
- a frame buffer storing the residual picture restored in the quality layer decoder; and
- a motion compensator adapted to output a finally restored residual picture by adding (1) a picture that has undergone motion compensation in a reference frame within a same enhancement layer, to (2) the restored residual picture.
16. The apparatus of claim 15, wherein the motion compensator outputs a finally restored residual picture by adding (1) a picture that has undergone motion compensation in a reference frame within a same enhancement layer, to (2) a picture obtained by adding up residual pictures restored in each decoding process of the one or more quality layers.
17. The apparatus of claim 15, wherein the frame buffer comprises:
- a first frame buffer storing a picture obtained by adding (1) the picture that has undergone motion compensation in the reference frame to (2) a residual picture restored in a lowest quality layer decoder; and
- a second frame buffer storing the finally restored residual picture.
18. The apparatus of claim 17, wherein the motion compensator performs motion compensation on the picture stored in at least one of the first and second frame buffers, and uses the motion-compensated picture to restore a residual picture of a next frame.
19. The apparatus of claim 15, wherein the frame buffer comprises:
- a first frame buffer storing a residual picture restored in a lowest quality layer decoder; and
- a second frame buffer storing, as the finally restored residual picture, a picture obtained by adding (1) the picture that has undergone motion compensation in the reference frame to (2) the residual picture restored in a highest quality layer decoder.
20. The apparatus of claim 19, wherein the motion compensator performs motion compensation on the picture stored in at least one of the first and second frame buffers, and uses the motion-compensated picture to restore a residual picture of a next frame.
21. A residual encoder, comprising:
- a plurality of sequenced quality layer encoders, including at least a first quality layer encoder and a last quality layer encoder; a selective motion compensator that outputs a motion-compensated picture;
- wherein: the first quality layer encoder generates a first residual differential picture based on a difference between a residual picture and the motion-compensated picture, the first residual differential picture being output to the selective motion compensator and to a subsequent one of the plurality of sequenced quality layer encoders; each subsequent one of the plurality of sequenced quality layer encoders generates a respective residual differential picture based on a difference between the residual picture and the picture output by an immediately preceding one of the plurality of sequenced quality layer encoders, the respective residual differential picture being output to the selective motion compensator and to another subsequent one of the plurality of sequenced quality layer encoders; and the last quality layer encoder generates a last residual differential picture based on a difference between a residual picture and the respective residual differential picture received from the immediately preceding one of the plurality of sequenced quality layer encoders, the last residual differential picture being output to the selective motion compensator.
22. The residual encoder as set forth in claim 21, further comprising frame buffers storing only the first and the last residual differential pictures.
23. The residual encoder as set forth in claim 21, further comprising frame buffers storing a respective residual differential picture corresponding to each of the plurality of sequenced quality layer encoders.
Type: Application
Filed: Sep 14, 2011
Publication Date: Mar 15, 2012
Applicant: SAMSUNG ELECTRONICS CO., LTD. (Suwon-si)
Inventors: Woong-Il CHOI (Hwaseong-si), Min-Woo PARK (Yongin-si), Dae-Hee KIM (Suwon-si)
Application Number: 13/232,613
International Classification: H04N 7/26 (20060101);