Method and Apparatus for Intra Coding for a Block in a Coding System
A method and apparatus of Intra coding for a block in a coding system. The method includes receiving input data associated with a current block; determining a current Intra prediction mode for the current block; deriving prediction samples for the current block based on reconstructed neighboring samples according to the current Intra prediction mode, wherein a predicting value representing a single prediction value for the current block is determined based on at least two corner samples corresponding to the current block according to the current Intra prediction mode; and applying encoding or decoding to the input data associated with the current block using the prediction samples derived for the current block.
The present invention is a Continuation of pending U.S. patent application No. 14/480,788, filed on Sep. 9, 2014, which claims priority to U.S. Provisional Patent Application, Ser. No. 61/892,240, filed on Oct. 17, 2013, entitled “Methods of Intra Coding Improvement”. The priority applications are hereby incorporated by reference in their entireties.
FIELD OF THE INVENTIONThe present invention relates to video coding. In particular, the present invention relates to Intra coding.
BACKGROUND AND RELATED ARTThree-dimensional (3D) television has been a technology trend in recent years that is targeted to bring viewers sensational viewing experience. Multi-view video is a technique to capture and render 3D video. The multi-view video is typically created by capturing a scene using multiple cameras simultaneously, where the multiple cameras are properly located so that each camera captures the scene from one viewpoint. The multi-view video with a large number of video sequences associated with the views represents a massive amount data. Accordingly, the multi-view video will require a large storage space to store and/or a high bandwidth to transmit. Therefore, multi-view video coding techniques have been developed in the field to reduce the required storage space and the transmission bandwidth. In three-dimensional and multi-view coding systems, the texture data as well as depth data are coded.
For depth map, the Simplified Depth Coding (SDC), which is also termed as Segment-wise DC Coding, is an alternative Intra coding mode. Whether SDC is used is signalled by a SDC flag at coding unit (CU) level. For SDC, the depth block is Intra predicted by a conventional Intra mode or depth modelling mode 1. The partition size of SDC-coded CU is always 2N×2N and therefore there is no need for signaling in the bitstream regarding the block size of SDC-coded CU. Furthermore, instead of coded as quantized transform coefficients, the SDC-coded residuals are represented by one or two constant residual values depending on whether the depth block is divided into one or two segments.
According to existing three-dimensional video coding based on HEVC (3D-HEVC), certain information is signalled for SDC-coded blocks. The information signalled includes:
1. type of segmentation/prediction of the current block. Possible values are
-
- i. DMM (Depth Modelling Mode) Mode 1—Explicit Wedgelets (2 segments)
- ii. Planar (1 segment)
2. For the DMM, additional prediction information (e.g., partition information) is coded.
3. For each resulting segment, a residual value (in the pixel domain) is signalled in the bitstream
In the depth coding process, the depth samples are first mapped to limited depth values. The limited depth values are represented by a Depth Lookup Table (DLT), and after mapping depth samples into limited depth values, these limited depth values are used as the indexes pointing to entries of the DLT. Consequently, residuals can be generated by taking the difference between the DLT index of the predictor and the original depth samples; the index residual can then be coded and sent to the decoder. The depth values present in a depth map are usually limited to a number smaller than the total number that can be represented by a depth capture device. Therefore, the use of DLT can reduces the bit depth required for residual magnitudes. This mapping table is transmitted to the decoder so that the inverse lookup from an index to a valid depth value can be performed at the decoder.
At the encoder side, the residual index iresi to be coded into the bitstream, is determined according to:
iresi=I(dorig)−I(dpred), (1)
where dorig denotes an original depth value determined for the depth block (e.g., the mean value of the original depth block), dpred denotes the predicting depth value (e.g., the mean value of the prediction samples for the depth block), and I(.) denotes the Index Lookup Table. The computed residual index iresi is then coded with a significance flag, a sign flag and with ┌log2 dvalid┐ bits for the magnitude of the residual index, where dvalid denotes the number of valid depth values and ┌x┐ is a ceiling function corresponding to the smallest integer not less than x.
The Depth Lookup Table takes advantage of the sparse property of the depth map, where only a small number of depth values out of a full available depth range (e.g., 28) will typically be present in the depth map. In the encoder, a dynamic depth lookup-table is constructed by analyzing a number of frames (e.g. one Intra period) of the input sequence. This depth lookup-table is used during the coding process to reduce the effective signal bit-depth of the residual signal.
In order to reconstruct the lookup table, the encoder reads a pre-defined number of frames from the input video sequence to be coded and scans all samples for presence of the depth values. During this process a mapping table is generated that maps depth values to existing depth values based on the original uncompressed depth map.
The Depth Lookup Table D(.), the index Lookup Table I(.), the Depth Mapping Table M(.) and the number of valid depth values dvalid are derived by the following process that analyses the depth map Dt:
-
- 1. Initialization
- boolean vector B(d)=FALSE for all depth values d,
- index counter i=0.
- 2. Process each pixel position p in Dt for multiple time instances t:
- Set B(Dt(p))=TRUE to mark valid depth values.
- 3. Count the number of TRUE values in B(d). The result is set to the value for dvalid.
- 4. For each d with B(d)==TRUE:
- Set D(i)=d,
- Set M(d)=d,
- Set I(d)=i, and
- i=i+1.
- 5. For each d with B(d)==FALSE:
- Find {circumflex over (d)}=arg min|d−{circumflex over (d)}| and B({circumflex over (d)})==TRUE
- Set M(d)={circumflex over (d)}.
- 6. Set I(d)=I({circumflex over (d)}) .
- 1. Initialization
As mentioned above, there are two types of segmentation and prediction in the existing SDC. The respective processes for the two types of segmentation and prediction are described as follows.
-
- DMM mode:
- Edge information is defined by start/end side and corresponding index.
- The DC prediction value (i.e., predicting depth value (dpred)) for each segment are predicted by neighboring depth values (also referred as neighboring depth samples or neighboring samples in this disclosure) as shown in
FIG. 1 . Two examples of depth blocks (110 and 120) are shown inFIG. 1 , where each depth block is divided into two segments as shown by the dashed line. The reconstructed neighboring depth samples used to generate prediction samples for block 110 are indicated by reference numbers 112 and 114, and the reconstructed neighboring depth samples used to generate prediction samples for block 120 are indicated by reference numbers 122 and 124 inFIG. 1 .
- Planar mode:
- Generate the predictors (i.e., prediction samples) of the Planar mode as shown in
FIG. 2 . Linear interpolation is used to generate predictors for the right column and the bottom row as shown inFIG. 2A . For the right column, the linear interpolation is based on depth values at A and Z. For the bottom row, the linear interpolation is based on depth values at B and Z. After the right column and the bottom row are interpolated, the predictors for the rest of depth positions are bilinear interpolated using four respective depth samples from four sides as shown inFIG. 2B . - The DC prediction value (i.e., a single predicting depth value (dpred) for the block) is the mean of the predictors of the Planar mode.
- Generate the predictors (i.e., prediction samples) of the Planar mode as shown in
- DMM mode:
In the above derivation processes, prediction sample refers to the predicted value generated by the Intra coding mode, which may be DMM Mode 1 or the Planar mode in the existing 3D-HEVC. The reconstruction process for the Planar mode at the decoder side is illustrated in
The reconstruction process for the DMM Mode 1 at the decoder side is illustrated in
In the conventional SDC for depth block coding, only limited prediction modes are provided. It is desirable to extend the prediction modes to improve coding efficiency.
BRIEF SUMMARY OF THE INVENTIONA method and apparatus for a block in a coding system are disclosed. In certain embodiments, a method of Intra coding for a block in a coding system comprises: receiving input data associated with a current block; determining a current Intra prediction mode for the current block; deriving prediction samples for the current block based on reconstructed neighboring samples according to the current Intra prediction mode, wherein a predicting value representing a single prediction value for the current block is determined based on at least two corner samples corresponding to the current block according to the current Intra prediction mode; and applying encoding or decoding to the input data associated with the current block using the prediction samples derived for the current block.
In other embodiments, an apparatus for Intra coding of a block in a coding system comprises one or more electronic circuits configured to: receive input data associated with a current block; determine a current Intra prediction mode for the current block; derive prediction samples for the current block based on reconstructed neighboring samples according to the current Intra prediction mode, wherein a predicting value representing a single prediction value for the current block is determined based on at least two corner samples corresponding to the current block according to the current Intra prediction mode; and apply encoding or decoding to the input data associated with the current block using the prediction samples derived for the current block.
In order to improve the coding efficiency of SDC coding, embodiments according to the present invention uses extended prediction modes for SDC coding, where the extended prediction modes includes Horizontal Intra mode, Vertical Intra mode or both in additional to the conventional Planar mode and DMM Mode 1. The Intra prediction mode for SDC is also referred as Intra prediction type in this disclosure. The Horizontal Intra mode and the Vertical Intra mode are also referred as Horizontal mode and Vertical mode in this disclosure. For example, the set of extended Intra prediction types, of segmentation/prediction for the current block may correspond to:
a. DMM Mode 1—Explicit Wedgelets (2 segments)
b. Planar (1 segment)
c. Horizontal (1 segment)
d. Vertical (1 segment)
For the DMM mode, additional prediction information is coded to indicate the partition. For each resulting segment, a residual value (in the pixel domain) is signalled in the bitstream. The two additional types or modes, i.e., Horizontal and Vertical Intra prediction process is described as follows
-
- Horizontal Mode:
- Generate the predictors of the Horizontal Intra mode as shown in
FIG. 5 , where the reconstructed neighboring depth column (510) adjacent to the current left block boundary of the current block (520) is used to generate the prediction samples. Each sample in the neighboring column is used to generate the prediction samples for a corresponding row of the current block. - The DC prediction value (i.e., a single predicting depth value (dpred) for the block) is the mean of the predictors or the mean of the subset of the predictors of the Horizontal mode as shown in
FIG. 5 .
- Generate the predictors of the Horizontal Intra mode as shown in
- Vertical Mode:
- Generate the predictors of the Vertical Intra mode as shown in
FIG. 6 , where the reconstructed neighboring depth row (610) adjacent to the current top block boundary of the current block (620) is used to generate the prediction samples. Each sample in the neighboring row is used to generate the prediction samples for a corresponding column of the current block. - The DC prediction value (i.e., a single predicting depth value (dpred) for the block) is the mean of the predictors or the mean of the subset of the predictors of the Vertical mode as shown in
FIG. 6 .
- Generate the predictors of the Vertical Intra mode as shown in
- Horizontal Mode:
An example of the reconstruction process for the Horizontal mode and the Vertical mode at the decoder side is illustrated in
drec=I−1(iresi+I(dpred)), (2)
with I−1(.) denoting the inverse Index Lookup table. At the decoder side, the reconstructed residual R, is derived by
Rrec=drec−dpred. (3)
Each prediction sample of the current block or prediction unit (PU) is then updated with the reconstructed residual, i.e., the reconstructed residual is added to each prediction sample as the new prediction sample. Accordingly, the prediction samples of Horizontal/Vertical mode will be the prediction samples of the Horizontal/Vertical mode plus an offset value, where the offset value is derived from the residual index.
In another example, all the prediction samples of the current block or PU will be replaced by a same reconstructed depth value derived from
drec=I−1(iresi+I(dpred)),
with I−1(.) denoting the inverse Index Lookup table.
In the above example, the set of extended Intra prediction types of segmentation/prediction corresponds to {DMM Mode 1, Planar mode, Horizontal mode, Vertical mode}. However, other sets of extended Intra prediction types may also be used to practice the present invention. For example, the set of extended Intra prediction types may correspond to:
-
- {DC mode, DMM Mode 1, Horizontal mode, Vertical mode}
- {Planar mode, DMM Mode 1, Horizontal mode, Vertical mode}
- {Planar mode, DC Mode, Horizontal mode, Vertical mode}
- {DMM mode 1, Planar mode, Horizontal mode, Vertical mode, Diagonal Mode 2, Diagonal Mode 8}
where Diagonal Mode 2 and Diagonal Mode 8 are the Intra prediction modes for texture video data as described in the HEVC standard (H.265: High efficiency video coding, Recommendation ITU-T H.265. April 2013).
The order of the type for generating prediction samples may also be changed. Based on this order, a truncated unary code can be used to signal the type selected for the current block. An example of the modified coding table with these two additional modes, i.e., Horizontal and Vertical modes are shown in Table 1.
An example of the binarization for depth Intra mode with additional Horizontal and
Vertical modes is shown in Table 2.
For the set of extended Intra prediction types for SDC coding according to embodiments of the present invention, the predicting depth value (d pred) in those Intra prediction modes can be calculated from all prediction samples, or the prediction samples after sub-sampling, or a subset of the prediction samples. Since the prediction samples are derived based on reconstructed neighboring depth samples, the predicting depth value (d pred) may also be derived directly using the reconstructed neighboring depth samples of the current coding block. The function used to derive the predicting depth value can be the mean, median, maximum, minimum, or a linear combination of the prediction samples, the prediction samples after sub-sampling, the subset of the prediction samples, or the neighboring reconstructed samples of the current coding block.
When a subset of the prediction samples is used to derive the predicting depth value (dpred) for the Intra prediction modes, any of the following subsets can be used:
The first row and first column of current coding block,
The center prediction sample of current coding block,
The four corner samples of current coding block,
The two corner samples of the first/last/one row of current coding block, and
The two corner samples of the left-most/right-most/one column of current coding block.
In another example, different subsets of the prediction samples can be used for difference Intra prediction modes for SDC. For example, for the Horizontal Intra mode, the two corner samples (shown as two black dots) of the left-most column can be used to calculate the predicting depth value as shown in
In another embodiment of the present invention, the neighboring reconstructed samples are used to derive the predicting depth value. For example, for the Horizontal Intra mode, all samples, a single sample, partial samples, or two corner samples of the reconstructed neighboring depth column (1110) adjacent to the left block boundary of the current block 1120 can be used to calculate the predicting depth value as shown in
The performance of a 3D video coding system incorporating an extended Intra prediction mode set according to an embodiment of the present invention is compared to that of a conventional system based on HTM-8.0 (High Efficiency Video Coding based 3D Coding Test Model, version 8.0). The types of Intra prediction for the HTM-8.0 include DMM Mode 1 and Planar mode. The embodiment according to the present invention further includes Horizontal mode and Vertical mode. The performance comparison is based on different sets of test data listed in the first column. The test results of the system incorporating an embodiment of the present invention under the common test conditions (CTC) and under the all-Intra (AI) test conditions are shown in Table 3 and Table 4, respectively. As shown in the tables, the system with extended Intra prediction mode set according to the present invention can achieve over 0.1% BD-rate saving in some cases. The average decoding time can be reduced by more than 3%.
The flowchart shown above is intended to illustrate an example of Simplified Depth Coding (SDC) with an extended Intra prediction mode set. A person skilled in the art may modify each step, re-arranges the steps, split a step, or combine steps to practice the present invention without departing from the spirit of the present invention.
The above description is presented to enable a person of ordinary skill in the art to practice the present invention as provided in the context of a particular application and its requirement. Various modifications to the described embodiments will be apparent to those with skill in the art, and the general principles defined herein may be applied to other embodiments. Therefore, the present invention is not intended to be limited to the particular embodiments shown and described, but is to be accorded the widest scope consistent with the principles and novel features herein disclosed. In the above detailed description, various specific details are illustrated in order to provide a thorough understanding of the present invention. Nevertheless, it will be understood by those skilled in the art that the present invention may be practiced.
Embodiment of the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program code integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program code to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware code may be developed in different programming languages and different formats or styles. The software code may also be compiled for different target platforms. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
Claims
1. A method of Intra coding for a block in a coding system, the method comprising:
- receiving input data associated with a current block;
- determining a current Intra prediction mode for the current block;
- deriving prediction samples for the current block based on reconstructed neighboring samples according to the current Intra prediction mode, wherein a predicting value representing a single prediction value for the current block is determined based on at least two corner samples corresponding to the current block according to the current Intra prediction mode; and
- applying encoding or decoding to the input data associated with the current block using the prediction samples derived for the current block.
2. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise two corner prediction samples of a left-most column of the current block if the Horizontal mode is selected.
3. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise two corner reconstructed samples of the reconstructed neighboring column adjacent to a left block boundary of the current block if the Horizontal mode is selected.
4. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise two corner prediction samples of a top row of the current block if the Vertical mode is selected.
5. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise two corner reconstructed samples of the reconstructed neighboring row adjacent to a top block boundary of the current block if the Vertical mode is selected.
6. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise four corner prediction samples of the current block.
7. The method of claim 6, wherein predicting value is determined using a mean median, maximum, minimum, or linear combination of said four corner prediction samples of the current block.
8. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise two corner prediction samples of a line of the current block adjacent to a block boundary of the current block according to the current Intra prediction mode for the current block.
9. The method of claim 1, wherein the at least two corner samples corresponding to the current block comprise two corner reconstructed samples of reconstructed neighboring line adjacent to a block boundary of the current block according to the current Intra prediction mode for the current block.
10. An apparatus for Intra coding of a block in a coding system, the apparatus comprising one or more electronic circuits configured to:
- receive input data associated with a current block;
- determine a current Intra prediction mode for the current block;
- derive prediction samples for the current block based on reconstructed neighboring samples according to the current Intra prediction mode, wherein a predicting value representing a single prediction value for the current block is determined based on at least two corner samples corresponding to the current block according to the current Intra prediction mode; and apply encoding or decoding to the input data associated with the current block using the prediction samples derived for the current block.
Type: Application
Filed: Oct 21, 2016
Publication Date: Feb 9, 2017
Inventors: Jian-Liang LIN (Su'ao Township), Yi-Wen CHEN (Taichung)
Application Number: 15/299,939