IMAGE SIGNAL TRANSFORMING AND INVERSE-TRANSFORMING METHOD AND COMPUTER PROGRAM PRODUCT WITH PRE-ENCODING FILTERING FEATURES
An aspect of an image signal transforming method is a method of generating one or more transformed samples from a plurality of input samples, which includes a first transformed sample generating step of performing a first filtering process by a filter, on at least one first input sample (an input sample from a terminal) out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process (subtraction by a subtractor) on another first input sample not used for the generation of the first filtered data (an input sample from another terminal), and the first filtered data generated, to generate the first transformed sample.
Latest NTT DoCoMo, Inc Patents:
This application is a Continuation of and is based upon and claims the benefit of priority under 35 U.S.C. §120 for U.S. Ser. No. 11/317,014, filed Dec. 27, 2005, the entire contents of each which are incorporated herein by reference. This application also claims the benefit of priority under 35 U.S.C. §119 from Japanese Patent Application No. 2005-002995, filed Jan. 7, 2005 and Japanese Patent Application No. 2005-141669, filed May 13, 2005.
BACKGROUND OF THE INVENTION1. Field of the Invention
The present invention relates to image signal transforming method, image signal inversely-transforming method, image encoding apparatus, image encoding method, image encoding program, image decoding apparatus, image decoding method, and image decoding program.
2. Related Background Art
Conventionally, the compressive coding techniques are used for efficiently performing transmission and storage of still image and moving image data. Particularly, MPEG (Moving Picture Experts Group) 1, 2, and 4 and H.261-H.264 systems are used in the case of moving images, and JPEG (Joint Photographic Experts Group) and JPEG2000 are used in the case of still images.
In most of these coding systems, an image as an object for coding is decomposed into a plurality of blocks and the discrete cosine transform (hereinafter referred to as “DCT”) is applied to transform each block itself or a differential signal between the block and a prediction signal for the block, into data in the frequency domain. The transformation coefficients obtained by the transformation are quantized to compress the data volume of the signal of the original image. In reconstruction, the compressed data is dequantized into a signal of each block, and the inverse discrete cosine transform (hereinafter referred to as “IDCT”) is applied to restore a signal or differential signal in the pixel domain. The coding using DCT is described, for example, in U.S. Pat. No. 5,196,946.
As described above, the encoding apparatus is able to express an input signal in a compact form through the transformation of the image into the frequency domain and thus to achieve efficient coding.
SUMMARY OF THE INVENTIONHowever, since the conventional transformation method is to transform signals as objects for transformation by one type of transformation system, it is difficult to realize concentration of energy exceeding the original characteristics of the signals as objects for transformation. Namely, where there is a high correlation between signals as objects, the concentration of energy will be high enough to achieve efficient coding, but, where the original correlation between signals is low, coefficients by DCT will spread over a wide range in the frequency domain. In consequence, it is difficult to achieve efficient coding.
In general, signals of taken still images and moving images demonstrate a high correlation and thus can be efficiently encoded by use of DCT. In contrast to it, in the case where a difference is taken between a prediction signal obtained by intra-frame prediction or inter-frame prediction and an image signal as an object for coding, the correlation of the differential signal is not so high. Therefore, it is difficult to express the differential signal in a compact form even with the use of DCT.
An object of the present invention is to solve the above problem and thereby to provide image signal transforming method, image signal inversely-transforming method, image encoding apparatus, image encoding method, image encoding program, image decoding apparatus, image decoding method, and image decoding program capable of efficiently expressing a signal by concentration of energy of the signal even in the case where the correlation of the signal as an object for transformation is not high.
In order to achieve the above object, an image signal transforming method according to the present invention is an image signal transforming method of generating one or more transformed samples from a plurality of input samples, comprising: a first transformed sample generating step of performing a first filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and said first filtered data generated, to generate the first transformed sample.
The present invention involves performing the predetermined filtering on the signal as an object, and thereby provides the effect of capability of realizing concentration of energy exceeding the original characteristic of the signal and efficiently encoding the signal.
The image signal transforming method according to the present invention can be applied to either of the transformation process and inverse transformation process in an image encoding apparatus described hereinafter and can also be applied to either of the transformation process and inverse transformation process in an image decoding apparatus described hereinafter.
The image signal transforming method preferably further comprises a second transformed sample generating step of performing a second filtering process on the first transformed sample generated in the first transformed sample generating step, to generate second filtered data, and performing a second arithmetic process on at least one second input sample used for generation of a second transformed sample, and said second filtered data generated, to generate the second transformed sample.
The image signal transforming method preferably further comprises a second transformed sample generating step of performing a second filtering process on at least one second input sample out of a plurality of second input samples used for generation of a second transformed sample, to generate second filtered data, and performing a second arithmetic process on another second input sample not used for the generation of the second filtered data, and said second filtered data generated, to generate the second transformed sample.
An image encoding apparatus according to the present invention is an image encoding apparatus comprising: importing means for importing an input image as an object for coding; region decomposing means for decomposing the input image imported by the importing means, into a plurality of coding regions; predicting means for obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition by the region decomposing means, and for generating the obtained differential signal as a transformation object signal; transforming means for generating a transformed sample, using the transformation object signal generated by the predicting means, as an input sample, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and for defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and encoding means for encoding the transformation coefficient obtained by the transforming means.
In the image encoding apparatus, preferably, the transforming means selects and uses a filter to maximize a correlation of the transformation object signal, out of plural types of filters in the filtering process, and further encodes identification information for identification of the selected filter.
An image decoding apparatus according to the present invention is an image decoding apparatus comprising: importing means for importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding; decoding means for restoring transformation coefficients corresponding to said respective regions from the compressed data imported by the importing means, and for generating the resultant transformation coefficients as restored transformation coefficients; and inversely transforming means for generating a transformed sample, using the restored transformation coefficients generated by the decoding means, as input samples, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and for defining the generated transformed sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
In the image decoding apparatus, preferably, the compressed data contains filter identification information for identification of a filter used in the filtering process, and the decoding means decodes the filter identification information and performs the filtering process using the filter corresponding to the decoded filter identification information.
An image encoding method according to the present invention is an image encoding method comprising: an importing step of importing an input image as an object for coding; a region decomposing step of decomposing the input image imported in the importing step, into a plurality of coding regions; a predicting step of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition in the region decomposing step, and generating the obtained differential signal as a transformation object signal; a transforming step of generating a transformed sample, using the transformation object signal generated in the predicting step, as an input sample, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and an encoding step of encoding the transformation coefficient obtained in the transforming step.
An image decoding method according to the present invention is an image decoding method comprising: an importing step of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding; a decoding step of restoring transformation coefficients corresponding to said respective regions from the compressed data imported in the importing step, and generating the resultant transformation coefficients as restored transformation coefficients; and an inversely transforming step of generating a transformed sample, using the restored transformation coefficients generated in the decoding step, as input samples, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
An image encoding program according to the present invention is an image encoding program for letting a computer execute the following steps: an importing step of importing an input image as an object for coding; a region decomposing step of decomposing the input image imported in the importing step, into a plurality of coding regions; a predicting step of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition in the region decomposing step, and generating the obtained differential signal as a transformation object signal; a transforming step of generating a transformed sample, using the transformation object signal generated in the predicting step, as an input sample, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and an encoding step of encoding the transformation coefficient obtained in the transforming step.
An image decoding program according to the present invention is an image decoding program for letting a computer execute the following steps: an importing step of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding; a decoding step of restoring transformation coefficients corresponding to said respective regions from the compressed data imported in the importing step, and generating the resultant transformation coefficients as restored transformation coefficients; and an inversely transforming step of generating a transformed sample, using the restored transformation coefficients generated in the decoding step, as input samples, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
In order to solve the above problem, another image signal transforming method according to the present invention is an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, comprising: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples.
The present invention involves performing the predetermined weighting process on the signal as an object, whereby the signal is transformed by a transformation basis suitable for the original characteristic thereof. For this reason, it is feasible to enhance the degree of energy concentration and to achieve efficient coding of the signal.
An image signal inversely-transforming method according to the present invention is an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, comprising: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples.
The image signal transforming method according to the present invention can also be applied to a transformation process in an image encoding apparatus. Namely, an image encoding apparatus is one comprising: importing means for importing an input image as an object for coding; region decomposing means for decomposing the input image imported by the importing means, into a plurality of coding regions; predicting means for obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition by the region decomposing means, and for generating the differential signal as a transformation object signal; transforming means for generating a transformed sample, using the transformation object signal generated by the predicting means, as an input sample, based on an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples, and for defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and encoding means for encoding the transformation coefficient obtained by the transforming means.
In the image encoding apparatus, preferably, the transforming means selects a weighting factor to maximize a degree of energy concentration of the transformation object signal, out of a plurality of weighting factors in the intermediate value generating step and in the transformation coefficient generating step, and identification information of the weighting factor is further encoded.
Another image encoding method according to the present invention is an image encoding method comprising: an importing step of importing an input image as an object for coding; a region decomposing step of decomposing the input image imported in the importing step, into a plurality of coding regions; a predicting step of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition in the region decomposing step, and generating the differential signal as a transformation object signal; a transforming step of generating a transformed sample, using the transformation object signal generated in the predicting step, as an input sample, based on an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and an encoding step of encoding the transformation coefficient obtained in the transforming step.
Similarly, the image signal inversely-transforming method according to the present invention can also be applied to an inverse transformation process in an image decoding apparatus. Namely, an image decoding apparatus is one comprising: importing means for importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and for performing transformation and coding; decoding means for restoring transformation coefficients corresponding to the respective regions from the compressed data imported by the importing means, and for generating the resultant transformation coefficients as restored transformation coefficients; and inversely transforming means for generating an output sample, using the restored transformation coefficients generated by the decoding means, as transformed samples, based on an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples, and for defining the output sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
In the above image decoding apparatus, preferably, the compressed data contains identification information of the weighting factor used in the intermediate value generating step or in the output value generating step, and the decoding means decodes the identification information and performs the process of the intermediate value generating step or the output value generating step, using the weighting factor corresponding to the identification information.
Another image decoding method according to the present invention is an image decoding method comprising: an importing step of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding; a decoding step of restoring transformation coefficients corresponding to the respective regions from the compressed data imported in the importing step, and generating the resultant transformation coefficients as restored transformation coefficients; and an inversely transforming step of generating an output sample, using the restored transformation coefficients generated in the decoding step, as transformed samples, based on an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples, and defining the output sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
Furthermore, the encoding technology according to the present invention can also be applied to a program. Namely, an encoding program is an image encoding program for letting a computer execute the following processes: a process of importing an input image as an object for coding; a process of decomposing the input image imported, into a plurality of coding regions; a process of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition, and generating the differential signal as a transformation object signal; a process of generating a transformed sample, using the generated transformation object signal as an input sample, based on an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and a process of encoding the transformation coefficient obtained.
Similarly, a decoding program according to the present invention is an image decoding program for letting a computer execute the following processes: a process of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding; a process of restoring transformation coefficients corresponding to the respective regions from the compressed data imported, and generating the resultant transformation coefficients as restored transformation coefficients; and a process of generating an output sample, using the restored transformation coefficients as transformed samples, based on an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples, and defining the output sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
As described above, the present invention provides the effect of capability of realizing the concentration of energy exceeding the original characteristic of the signal and achieving efficient coding of the signal.
The first embodiment of the present invention will be described below using
Concerning the image signal transforming apparatus 300 constructed as described above, the operation thereof will be described below. This image signal transforming apparatus 300 imports image signal 501 consisting of 4×4 pixels shown in
Pixels a0 (502)-a3 (505) in
In the transformation module 313 in
Next, an inverse transformation process of an image signal will be described using
The image signal inversely-transforming apparatus 400 in
In the present embodiment the transformation modules are arranged to add the result of subtraction, but there is also an implementation method of subtracting the result of addition, in which the multiplier coefficients may be set so as to maintain the input energy of the transformer and the output energy of the inverse transformer. The above described the input signals of four pixels, and, in the case of input signals of N pixels (N is an arbitrary integer), data may be processed by locating a corresponding filter before each arithmetic unit (addition/subtraction) in the conventional N×N DCT apparatus and IDCT apparatus.
The filters used in
The relation of two filters used in the transformation modules (313, 314, 315, 316 in
The filtering processes used in the transformation and inverse transformation methods of image signals according to the present invention require pixels located in a region across a boundary of an object block in certain cases. In particular, in a case where a column or row at a boundary of a block is transformed, the boundary value problem arises. In this case the filtering process may be carried out using pixel values in a region of an adjacent block, but the present embodiment is arranged to perform the filtering process repeatedly using pixel values at the boundary.
In
Next, an inverse transformation process of an image signal will be described with reference to
An image encoding apparatus, method, and program using the image signal transforming process according to the embodiment of the present invention will be described below.
Concerning the image encoding apparatus constructed as described above, the operation thereof will be described below. A plurality of images constituting a motion picture are imported via input terminal 801 and each image is decomposed into blocks of N×M pixels by block decomposer 802. In the present embodiment N=M=8, but N does not have to equal M. It is also possible to adopt decomposition in the other sizes than 8 pixels. A block as an object for coding is fed via line L820a to intra-frame predictor 803 and to inter-frame predictor 804. The intra-frame predictor 803 imports a block signal as an object for coding and a previously reconstructed image signal forming the same frame stored in frame memory 812, and generates a frame prediction signal similar to that in Standard H.264. The inter-frame predictor 804 imports a block as an object for coding and a previously reconstructed signal of a different frame stored in frame memory 812, and generates an inter-frame prediction signal by motion detection prediction similar to that in Standard H.264. The encoding apparatus according to the present invention provides for a case without any input by terminal 805. Namely, an original signal directly becomes an object for coding. The selector switch 809 selects a mode to minimize the number of bits, out of three cases of the prediction signal by the intra-frame predictor 803, the prediction signal by the inter-frame predictor 804, and no prediction signal. The prediction signal determined as described above is fed to adder 811 and a difference is determined from a block as an object for coding. The differential signal determined is fed to filter determiner 818. The filter determiner 818 transforms the differential signal by the aforementioned image signal transforming method with a plurality of filters to estimate the number of bits of transformation coefficients.
The present embodiment is arranged to determine filters to minimize the number of bits after entropy coding of transformation coefficients and to feed an identifier to identify each filter, to transformer 813. The transformer 813 performs the transformation, using the filters determined by the filter determiner 818. In the present embodiment, a block of 8×8 pixels is further divided into 4×4 pixel units and the transformation is performed to filter only columns of each 4×4 block. The transformation coefficients obtained in this manner are fed to quantizer 814 to be quantized. The quantized coefficients are fed to entropy encoder 819 to be encoded by variable length coding, and coded data is outputted from output terminal 820. On the other hand, the quantized coefficients are dequantized by dequantizer 815, the dequantized coefficients are inversely transformed by the filters determined by the filter determiner (at inverse transformer 816), the result is added to the prediction signal (fed via line L811) by adder 817, to generate a reconstructed signal, and the reconstructed signal is stored in frame memory 812. The identifiers of the filters determined by the filter determiner 818 are fed via line L814 to entropy encoder 819 and they, together with other data, are outputted from output terminal 820.
Next, an image encoding program for letting a computer operate as an image encoding apparatus according to the present invention will be described.
As shown in
As shown in
As shown in
Imported via input terminal 1100 is compressed data generated by performing either intra-frame prediction or inter-frame prediction on an image decomposed into a plurality of regions and then performing transformation and coding. The data analyzer 1101 analyzes the compressed data and performs an entropy decoding process, and it also extracts quantized transformation coefficients, information about quantization, mode information about generation of the prediction signal, and identifiers indicating the filters to be used in the inverse transformation process. The quantized transformation coefficients and the information about quantization are fed via line L1102 to dequantizer 1102, which generates dequantized transformation coefficients. The dequantized transformation coefficients are fed via line L1105 and the identifiers indicating the filters to be used in the inverse transformation process are fed via line L1104 to inverse transformer 1103, which performs the inverse transformation using the designated filters to generate an inversely transformed signal. The inverse transformation process is the one as described above. The mode information about generation of the prediction signal is fed via line L1103 to the prediction signal generator 1105, which determines the intra-frame prediction or inter-frame prediction, or no prediction, based on the information, to generate the prediction signal. The inversely transformed signal and the prediction signal obtained in this manner are added at adder 1104 and the result is stored in frame memory 1106 and also outputted via output terminal 1107 in order to display it.
Next, an image decoding program for letting a computer operate as an image decoding apparatus according to the present invention will be described.
As shown in
As described above, the signal transformation process is carried out to transform the signals after the filtering process to enhance the correlation of the input signals, thereby achieving the effect of capability of expressing the signals in a more compact form and achieving efficient coding of the image signal.
Second EmbodimentAn image signal transforming apparatus according to the second embodiment will be described below with reference to the accompanying drawings.
The image signal transforming apparatus 300A imports image signal 501 consisting of 4×4 pixels shown in
The pixels “a0” (502) to “a3” (505) in
The adder 321 adds the intermediate value to twice the pixel “a3” obtained by the multiplier 320. An intermediate value resulting from the addition is outputted to connection terminal 308. Namely, the image signal transforming apparatus 300A includes a module consisting of the weighting device 318A, arithmetic unit 317, and arithmetic units 320, 321 as fundamental transformation module 313 and transforms the input signals.
The transformation is completed by the above-described processing in the case where the target signal as an object of transformation is composed of two pixels, but, because the present embodiment handles the transformation object of four pixels, the pixels “a1” and “a2” are also similarly transformed by transformation module 314 consisting of weighting device 323A, arithmetic unit 322, and arithmetic units (325 and 326). In the present embodiment the weighting device 323A performs the weighting process with a weighting factor different from that of the weighting device 318A, but they may be arranged to use the same weighting factor.
The intermediate values obtained by the transformation modules 313, 314 are outputted to transformation module 315 and to transformation module 316 and are subjected to similar processes. The weighting device 329A, arithmetic units (328, 327), and arithmetic units (332, 333) in the transformation module 315 are different from those in the transformation modules 313, 314. However, the fundamental process is much the same, which includes performing a weighting process on one input (input from connection terminal 306) signal, subtracting the weighted signal from another input (input from connection terminal 305) signal, and again adding the result of the subtraction to the input signal from the connection terminal 306.
The signals of intermediate values imported from connection terminals 307, 308 are also similarly processed, and coefficients of frequency components thus transformed are outputted from terminals 309-312. If the weighting factors w1-w4 shown in
In the transformation module 313 of
In the transformation module 315, the weighting factor w3 in the weighting device 329A corresponds to the “m-th weighting factor” stated in Claims, and the subtraction process by multiplier 328 and subtractor 327 and the addition process by multiplier 332 and adder 333 correspond to the “second transformation arithmetic” stated in Claims. In the transformation module 316, the weighting factor w4 in the weighting device 335A corresponds to the “m-th weighting factor” stated in Claims, and the subtraction process by subtractor 334 and the addition process by multiplier 337 and adder 338 correspond to the “second transformation arithmetic” stated in Claims.
Next, an inverse transformation process of an image signal will be described with reference to
The image signal inversely-transforming apparatus 400 performs an inverse process to the image signal transforming apparatus 300A. The image signal inversely-transforming apparatus 400 is an apparatus for inversely transforming coefficients in the frequency domain into signals in the pixel domain and is composed of four fundamental transformation modules 413-416. As shown in
The arithmetic result is halved (at 439) and the result is outputted as an intermediate value to connection terminal 406. At the same time, this subtraction result is weighted by a factor w3 in the weighting device 420A and the weighted result is subjected to an arithmetic operation with the input signal from input terminal 401. This arithmetic process is executed by multipliers 421, 422 and adder 437. Similarly, the transformation coefficients imported from input terminals 403, 404 are processed by transformation module 414. The intermediate values obtained are outputted to connection terminals 405-408 and thereafter inversely transformed into signals in the pixel domain by transformation modules 415, 416. The multipliers 422, 425, 431, 436 in
In the second embodiment the transformation modules are configured to add the subtraction result, but, contrary to it, it is also possible to adopt a method of subtracting an addition result. In this case, the image signal inversely-transforming apparatus may be configured to set the multiplier coefficients so as to maintain the input energy of the transformer and the output energy of the inverse transformer. The above described the input signal of four pixels, but the input signal of N pixels (N is an arbitrary natural number) can also be processed by providing a corresponding weight, prior to the arithmetic process (addition/subtraction) in the conventional N×N DCT apparatus and IDCT apparatus, and processing resultant data.
Now, let us describe the weights used in the transformation and inverse transformation of image signal. In the second embodiment a factor to maximize the degree of energy concentration of the signal as a processing object is selected from a total of sixteen factors of 19/16, 18/16, 17/16, . . . , 5/16, and 4/16. The factors w1-w4 take an identical value, but different factors may also be used. Depending upon signals, w3 may be fixed to “1” and the other factors may be any one of the above-described values. Alternatively, each of w1-w3 may be fixed to “1” and only w4 may be variable. Furthermore, the factors may be those other than the aforementioned sixteen factors.
The image signal inversely-transforming apparatus 400 weights the input signals by the weighting factors w1-w4, whereby it substantially changes the transformation basis. Such processing will be described with reference to
By comparison between
Subsequently, an image encoding apparatus, method, and program using the aforementioned image signal transforming process will be described.
When a plurality of images constituting a motion picture are imported via input terminal 801, each of these images is decomposed into blocks of N×M pixels by block decomposer 802. In the present embodiment N=M=8, but N does not always have to equal M. In addition, the decomposition may be one other than 8-pixel decomposition. A block as an object for coding is fed via line L820a to the intra-frame predictor 803 and to the inter-frame predictor 804.
The intra-frame predictor 803 imports a block signal as an object for coding and a previously reconstructed image signal forming the same frame stored in frame memory 812, and generates a frame prediction signal similar to that in Standard H.264. The inter-frame predictor 804 imports the block as an object for coding and a previously reconstructed signal of a different frame stored in frame memory 812, and generates an inter-frame prediction signal by motion detection prediction similar to that in Standard H.264.
The image encoding apparatus 800 assumes a case where no input is supplied from terminal 805, or where an original signal is used directly as an object for coding. The selector switch 809 selects a mode to minimize the number of bits, out of three cases of a prediction signal by the intra-frame predictor 803, a prediction signal by the inter-frame predictor 804, and no prediction signal. The prediction signal determined as described above is fed to adder 811 and a difference is determined from a block as an object for coding. Where the selector switch 809 is connected to terminal 808, the input from terminal 805 is “0”, and the output of adder 811 is nothing but the coding object block. The differential signal is outputted to the weight determiner 818.
The weight determiner 818 executes the aforementioned image signal transforming process with a plurality of weighting factors on the differential signal, and thereafter estimates the number of bits of the transformation coefficients. In the present embodiment, the weight determiner 818 determines weights to minimize the number of bits after entropy encoding of the transformation coefficients and outputs identifiers (corresponding to identification information) for identification of the weights to the transformer 813. The transformer 813 performs the transformation using the weights determined by the weight determiner 818. The transformer 813 further decompose a block of 8×8 pixels in units of 4×4 pixels, and performs the weighted transformation for each 4×4 block.
The apparatus may also be configured as follows: the process executed by the weight determiner 818 is incorporated in the transformer 813, and the transformer 813 determines the optimal weighted transformation while performing a plurality of weighted transformations, and generates the transformation coefficients. The present embodiment is arranged to perform the process of the same weighted transformation for all the four 4×4 blocks included in an 8×8 block, but the transformer 813 may be arranged to perform different weighted transformations for the respective 4×4 blocks.
The transformation coefficients obtained in this manner are fed to the quantizer 814 to be quantized. The quantized coefficients are fed to the entropy encoder 819 to be encoded by variable length coding, and thereafter the coded data is outputted from output terminal 820. On the other hand, the quantized coefficients are dequantized by the dequantizer 815 and the dequantized coefficients are inversely transformed with the weight determined by the weight determiner 818, by the inverse transformer 816. The adder 817 adds the result to the prediction signal fed via line L811, to generate a reconstructed signal. The reconstructed signal thus generated is stored in frame memory 812. The identifiers about the weighting factors determined by the weight determiner 818 are fed via line L814 to the entropy encoder 819 and thereafter they, together with the other data, are outputted from the output terminal 820.
The following will describe the operation of the image encoding apparatus according to the present invention with reference to
S4 is to generate a differential signal from a difference between the prediction signal and the block as an object for coding. S5 is to perform the transformation process by the above method with a plurality of weighting factors on the differential signal decomposed in 4×4 sample units, and to determine weighting factors to achieve the most compact form of signals. For example, where the number of bits or image quality is enhanced, the image encoding apparatus 800 sets the weighting factors w1-w3 to “1” and the weighting factor w4 to one of the aforementioned sixteen values. On the other hand, where the number of bits or image quality is lowered, the image encoding apparatus 800 sets the weighting factor w3 to “1” and the weighting factors w1, w2, w4 to one of the aforementioned sixteen values.
S6 is to execute the transformation process according to the weighting factors determined at S5. Thereafter, the transformation coefficients are quantized to generate quantized transformation coefficients (S7). The quantized transformation coefficients are dequantized (S8) and thereafter they are inversely transformed using the weighting factors determined at S5. As a result, reconstructed coefficients are generated (S9). S10 is to add the reconstructed coefficients thus generated, to the prediction signal determined at S3. This results in generating a reconstructed block. The reconstructed block is temporarily stored in frame memory 812. At the same time, the quantized transformation coefficients and the identifiers of the weighting factors are entropy encoded and coded data is outputted (S11).
The sequential processes of S3-S11 are executed for all the regions generated by the decomposition process at S2 (S12; NO). The image encoding process will end at the time of completion of the processing for all the regions. If the image encoding apparatus 800 is configured to output the transformation coefficients acquired in the determining process of the weighting factors for the transformation at S5, the transformation process does not have to be performed again, and the process of S6 can be omitted.
An image decoding apparatus, method, and program using the above-described image signal inversely-transforming process will be described.
Imported via input terminal 1100 is compressed data generated by performing intra-frame prediction or inter-frame prediction for an image decomposed into a plurality of regions and then performing the transformation and coding. The data analyzer 1101 analyzes the compressed data and performs the entropy decoding process. It also extracts the quantized transformation coefficients, the information about quantization, the mode information about generation of the prediction signal, and the identifiers (corresponding to the identification information) of the weighting factors used in the inverse transformation process.
The dequantizer 1102 imports the quantized transformation coefficients and the information about quantization via line L1102 and generates dequantized transformation coefficients. The inverse transformer 1103 imports the dequantized transformation coefficients via line L1105 and imports the identifiers of the weighting factors to be used in the inverse transformation process, via line L1104. Then it performs the inverse transformation, using the designated weighting factors, to generate an inversely transformed signal.
The prediction signal generator 1105 imports the mode information about generation of the prediction signal via line L1103 and then selects an optimal mode from the intra-frame prediction, inter-frame prediction, and no prediction with reference to the information to generate a prediction signal. The adder 1104 adds the inversely transformed signal imported via line L1106, to the prediction signal imported via line L1107. The frame memory 1106 stores the result of the addition and the output terminal 1107 implements a display thereof.
The following will describe the operation of the image decoding apparatus according to the present invention with reference to
T4 is to dequantize the quantized transformation coefficients and T5 is to perform the inverse transformation process according to the weighting factors designated by the weighting factor identifiers. As a result, an inversely transformed signal is generated. T6 is to add the prediction signal generated at T3, to the inversely transformed signal generated at T5, to generate a reconstructed block signal. At T7, this reconstructed block signal is temporarily stored in frame memory 1106. The sequential processes of T2-T7 are executed for all the compressed data imported at T1 (T8; NO). The image decoding process will end at the time of completion of the processing for all the data.
The image encoding technology according to the present invention can also be implemented as an image encoding program for letting a computer operate as image encoding apparatus 800.
As shown in
The image decoding technology according to the present invention can also be implemented as an image decoding program for letting a computer operate as the image decoding apparatus 1110 shown in
As shown in
When the recording medium 10 is set in the reading device 12, the computer 30 becomes accessible to the image encoding program or image decoding program recorded in the recording medium 10, through the reading device 12. The computer 30 lets CPU 26 execute the image encoding program, whereby it operates as the aforementioned image encoding apparatus 800. Similarly, the computer 30 lets the CPU 26 execute the image decoding program, whereby it can operate as the aforementioned image decoding apparatus 1110.
As shown in
As described above, the image encoding/decoding technology (apparatus, methods, and programs) according to the second embodiment is to perform the weighted transformation process to maximize the correlation of input signals in the signal transformation process. This enables signals to be expressed in a more compact form and thus achieves efficient coding of the image signal.
The disclosure of Japanese Patent Application No. 2005-002995 filed Jan. 7, 2005 including specification, drawings and claims, and the disclosure of Japanese Patent Application No. 2005-141669 filed May 13, 2005 including specification, drawings and claims are incorporated herein by reference in its entirety.
Claims
1. An image signal transforming method of generating one or more transformed samples from a plurality of input samples, comprising:
- a first transformed sample generating step of performing a first filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and said first filtered data generated, to generate the first transformed sample.
2. The image signal transforming method according to claim 1, further comprising a second transformed sample generating step of performing a second filtering process on the first transformed sample generated in the first transformed sample generating step, to generate second filtered data, and performing a second arithmetic process on at least one second input sample used for generation of a second transformed sample, and said second filtered data generated, to generate the second transformed sample.
3. The image signal transforming method according to claim 1, further comprising a second transformed sample generating step of performing a second filtering process on at least one second input sample out of a plurality of second input samples used for generation of a second transformed sample, to generate second filtered data, and performing a second arithmetic process on another second input sample not used for the generation of the second filtered data, and said second filtered data generated, to generate the second transformed sample.
4. An image encoding apparatus comprising:
- importing means for importing an input image as an object for coding;
- region decomposing means for decomposing the input image imported by the importing means, into a plurality of coding regions;
- predicting means for obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition by the region decomposing means, and for generating the obtained differential signal as a transformation object signal;
- transforming means for generating a transformed sample, using the transformation object signal generated by the predicting means, as an input sample, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and for defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and
- encoding means for encoding the transformation coefficient obtained by the transforming means.
5. The image encoding apparatus according to claim 4, wherein the transforming means selects and uses a filter to maximize a correlation of the transformation object signal, out of plural types of filters in the filtering process, and further encodes identification information for identification of the selected filter.
6. An image decoding apparatus comprising:
- importing means for importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding;
- decoding means for restoring transformation coefficients corresponding to said respective regions from the compressed data imported by the importing means, and for generating the resultant transformation coefficients as restored transformation coefficients; and
- inversely transforming means for generating a transformed sample, using the restored transformation coefficients generated by the decoding means, as input samples, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and for defining the generated transformed sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
7. The image decoding apparatus according to claim 6, wherein the compressed data contains filter identification information for identification of a filter used in the filtering process, and
- wherein the decoding means decodes the filter identification information and performs the filtering process using the filter corresponding to the decoded filter identification information.
8. An image encoding method comprising:
- an importing step of importing an input image as an object for coding;
- a region decomposing step of decomposing the input image imported in the importing step, into a plurality of coding regions;
- a predicting step of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition in the region decomposing step, and generating the obtained differential signal as a transformation object signal;
- a transforming step of generating a transformed sample, using the transformation object signal generated in the predicting step, as an input sample, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and
- an encoding step of encoding the transformation coefficient obtained in the transforming step.
9. An image decoding method comprising:
- an importing step of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding;
- a decoding step of restoring transformation coefficients corresponding to said respective regions from the compressed data imported in the importing step, and generating the resultant transformation coefficients as restored transformation coefficients; and
- an inversely transforming step of generating a transformed sample, using the restored transformation coefficients generated in the decoding step, as input samples, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
10. An image encoding program for letting a computer execute the following steps:
- an importing step of importing an input image as an object for coding;
- a region decomposing step of decomposing the input image imported in the importing step, into a plurality of coding regions;
- a predicting step of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition in the region decomposing step, and generating the obtained differential signal as a transformation object signal;
- a transforming step of generating a transformed sample, using the transformation object signal generated in the predicting step, as an input sample, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and
- an encoding step of encoding the transformation coefficient obtained in the transforming step.
11. An image decoding program for letting a computer execute the following steps:
- an importing step of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding;
- a decoding step of restoring transformation coefficients corresponding to said respective regions from the compressed data imported in the importing step, and generating the resultant transformation coefficients as restored transformation coefficients; and
- an inversely transforming step of generating a transformed sample, using the restored transformation coefficients generated in the decoding step, as input samples, based on an image signal transforming method of generating one or more transformed samples from a plurality of input samples, which comprises a first transformed sample generating step of performing a filtering process on at least one first input sample out of a plurality of first input samples used for generation of a first transformed sample, to generate first filtered data, and performing a first arithmetic process on another first input sample not used for the generation of the first filtered data, and the first filtered data generated, to generate the first transformed sample, and defining the generated transformed sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
12. An image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, comprising:
- an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and
- a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples.
13. An image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, comprising:
- an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and
- an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples.
14. An image encoding apparatus comprising:
- importing means for importing an input image as an object for coding;
- region decomposing means for decomposing the input image imported by the importing means, into a plurality of coding regions;
- predicting means for obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition by the region decomposing means, and for generating the differential signal as a transformation object signal;
- transforming means for generating a transformed sample, using the transformation object signal generated by the predicting means, as an input sample, based on an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples, and for defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and
- encoding means for encoding the transformation coefficient obtained by the transforming means.
15. The image encoding apparatus according to claim 14, wherein the transforming means selects a weighting factor to maximize a degree of energy concentration of the transformation object signal, out of a plurality of weighting factors in the intermediate value generating step and in the transformation coefficient generating step, and wherein identification information of the weighting factor is further encoded.
16. An image decoding apparatus comprising:
- importing means for importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and for performing transformation and coding;
- decoding means for restoring transformation coefficients corresponding to the respective regions from the compressed data imported by the importing means, and for generating the resultant transformation coefficients as restored transformation coefficients; and
- inversely transforming means for generating an output sample, using the restored transformation coefficients generated by the decoding means, as transformed samples, based on an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples, and for defining the output sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
17. The image decoding apparatus according to claim 16, wherein the compressed data contains identification information of the weighting factor used in the intermediate value generating step or in the output value generating step, and
- wherein the decoding means decodes the identification information and performs the process of the intermediate value generating step or the output value generating step, using the weighting factor corresponding to the identification information.
18. An image encoding method comprising:
- an importing step of importing an input image as an object for coding;
- a region decomposing step of decomposing the input image imported in the importing step, into a plurality of coding regions;
- a predicting step of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition in the region decomposing step, and generating the differential signal as a transformation object signal;
- a transforming step of generating a transformed sample, using the transformation object signal generated in the predicting step, as an input sample, based on an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and
- an encoding step of encoding the transformation coefficient obtained in the transforming step.
19. An image decoding method comprising:
- an importing step of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding;
- a decoding step of restoring transformation coefficients corresponding to the respective regions from the compressed data imported in the importing step, and generating the resultant transformation coefficients as restored transformation coefficients; and
- an inversely transforming step of generating an output sample, using the restored transformation coefficients generated in the decoding step, as transformed samples, based on an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples, and defining the output sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
20. An image encoding program for letting a computer execute the following processes:
- a process of importing an input image as an object for coding;
- a process of decomposing the input image imported, into a plurality of coding regions;
- a process of obtaining a differential signal by either intra-frame prediction or inter-frame prediction, for each of the coding regions resulting from the decomposition, and generating the differential signal as a transformation object signal;
- a process of generating a transformed sample, using the generated transformation object signal as an input sample, based on an image signal transforming method of generating 2N transformed samples from 2N (N is a natural number) input samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one input sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of input samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and a transformation coefficient generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of transformed samples, and defining the generated transformed sample as a transformation coefficient, thereby transforming the transformation object signal into the transformation coefficient; and
- a process of encoding the transformation coefficient obtained.
21. An image decoding program for letting a computer execute the following processes:
- a process of importing compressed data generated by performing either intra-frame prediction or inter-frame prediction, for an image decomposed into a plurality of regions, and performing transformation and coding;
- a process of restoring transformation coefficients corresponding to the respective regions from the compressed data imported, and generating the resultant transformation coefficients as restored transformation coefficients; and
- a process of generating an output sample, using the restored transformation coefficients as transformed samples, based on an image signal inversely-transforming method of generating 2N output samples from 2N (N is a natural number) transformed samples in accordance with a predetermined transformation rule, which comprises: an intermediate value generating step of weighting one transformed sample out of an n-th pair (1≦n≦2N−1, n is a natural number) of transformed samples determined in accordance with the transformation rule, by an n-th weighting factor, and performing a first transformation arithmetic to generate an n-th pair of intermediate values; and an output value generating step of importing 2N intermediate values generated in the intermediate value generating step, weighting one intermediate value out of an m-th pair (1≦m≦2N−1, m is a natural number) of intermediate values determined in accordance with the transformation rule, by an m-th weighting factor, and performing a second transformation arithmetic to generate an m-th pair of output samples, and defining the output sample as inversely transformed data, thereby transforming the restored transformation coefficients into the inversely transformed data.
Type: Application
Filed: Sep 18, 2009
Publication Date: Jan 14, 2010
Applicant: NTT DoCoMo, Inc (Tokyo)
Inventors: Choong Seng BOON (Yokohama-shi), Thiow Keng Tan (Singapore)
Application Number: 12/562,717
International Classification: G06K 9/36 (20060101); G06K 9/46 (20060101);