Motion estimation method for motion picture encoding and recording medium having program recorded thereon to implement the motion estimation method
A motion estimation method in a hierarchical frame structure, applied to motion picture encoding, the method including, calculating, an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame, calculating a reference motion vector using the calculated initial motion vector for motion estimation between the current frame and a reference frame, adjusting a resolution of the calculated reference motion vector to a high resolution, determining a search area within the reference frame based on the adjusted reference motion vector and calculating a final motion vector by performing motion estimation within the determined search area.
Latest Samsung Electronics Patents:
This application claims the priority of Korean Patent Application No. 2003-88779, filed on Dec. 8, 2003 in the Korean Intellectual Property Office, the disclosure of which is incorporated herein by reference.
BACKGROUND OF THE INVENTION1. Field of the Invention
The present invention relates to motion picture encoding, and more particularly, to a motion estimation method in a hierarchical frame structure in which a motion vector is calculated based on a reference candidate motion vector that is calculated at each hierarchical frame level.
2. Description of the Related Art
In general, a part that requires the largest amount of computation in a motion picture encoder is a motion estimation unit. Fast motion estimation algorithms are intended to reduce the amount of computation of such a motion estimation unit. Fast motion estimation algorithms perform faster computation than full search block matching algorithms without performance degradation. Full search block matching algorithms divide a current frame and a reference frame into blocks having a predetermined size, search for the location of the best-matching block by comparing each block of the current frame with every block included in a search area of the reference frame according to given matching standards, and estimate a two-dimensional motion vector for each block. The sum of absolute difference (SAD) is one type of reference information used in such block matching.
Among fast motion estimation algorithms, hierarchical, multi-resolution search algorithms have been preferred. An example of such hierarchical, multi-resolution search algorithms is disclosed in Korean Patent No. 10-275694 titled “High-speed motion vector estimation for real-time based motion picture encoding”.
At level 0, one macroblock is composed of 16×16 pixels. At level 1, one macroblock is composed of 8×8 pixels and level 1 is obtained by averaging adjacent 2×2 pixels at level 0. At level 2, one macroblock is composed of 4×4 pixels and level 2 is obtained by averaging adjacent 2×2 pixels at level 1. Although the hierarchical frame structure has three levels in
Referring to
General hierarchical motion estimation includes:
-
- (i) performing a full search at the lowest-resolution level and determining search points having the minimum SAD obtained through the full search to be initial search points of the intermediate resolution level (operation 220);
- (ii) performing local searching in a narrow area surrounding the determined search points, and determining search points having the minimum SAD to be initial search points of the highest-resolution level (operation 240); and
- (iii) selecting a final motion vector by performing local searching in a narrow area surrounding the determined search points at the highest-resolution level (operation 260).
As shown in
Accordingly, motion estimation of MPEG-2 motion picture encoders requires frame motion estimation and field motion estimation, and moreover, hierarchical searching should be performed both in forward directions and backward directions in the case of B frames. As a result, if hierarchical searching is applied to MPEG-2 motion estimation, a memory for frame motion estimation and a memory for field motion estimation are separately required and a vast amount of computation is required for motion estimation.
SUMMARY OF THE INVENTIONAccording to an aspect of the present invention there is provided a motion estimation method by which motion estimation is performed on a reference frame by using a reference motion vector calculated at a low resolution with reference to an adjacent frame, so as to reduce the amount of computation and memory required for motion estimation, and a computer readable recording medium having a program recorded thereon to implement the motion estimation method.
According to one aspect of the present invention, there is provided a motion estimation method for encoding motion pictures. The motion estimation method includes calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame, calculating a reference motion vector with reference to the calculated initial motion vector for motion estimation between the current frame and a reference frame, and adjusting a resolution of the reference motion vector to correspond to a high resolution, determining a search area within the reference frame at the high resolution based on the adjusted reference motion vector, and calculating a final motion vector by performing motion estimation within the determined search area.
According to another aspect of the present invention, there is provided a computer readable recording medium having recorded thereon a program for a motion estimation method for encoding motion pictures. The motion estimation method includes calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame, calculating a reference motion vector with reference to the calculated initial motion vector for motion estimation between the current frame and a reference frame, and adjusting a resolution of the reference motion vector to correspond to a high resolution, determining a search area within the reference frame at the high resolution based on the adjusted reference motion vector, and calculating a final motion vector by performing motion estimation within the determined search area.
According to still another aspect of the present invention, there is provided a motion estimation method for encoding motion pictures. The motion estimation method includes calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame, calculating a reference motion vector based on the calculated initial motion vector for motion estimation between the current frame and a reference frame, determining a first search area in the reference frame based on the reference motion vector and calculating a candidate motion vector by performing motion vector estimation within the determined first search area at the low resolution, and adjusting a resolution of the candidate motion vector to correspond to a high resolution, determining a second search area based on the adjusted candidate motion vector, and calculating a final motion vector by performing motion estimation within the determined second search area at the high resolution.
According to yet another aspect of the present invention, there is provided a computer readable recording medium having recorded thereon a program for a motion estimation method for encoding motion pictures. The motion estimation method includes calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame, calculating a reference motion vector based on the calculated initial motion vector for motion estimation between the current frame and a reference frame, determining a first search area in the reference frame based on the reference motion vector and calculating a candidate motion vector by performing motion vector estimation within the determined first search area at the low resolution, and adjusting a resolution of the candidate motion vector to correspond to a high resolution, determining a second search area based on the adjusted candidate motion vector, and calculating a final motion vector by performing motion estimation within the determined second search area at the high resolution.
Additional aspects and/or advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
BRIEF DESCRIPTION OF THE DRAWINGSThese and/or other aspects and advantages of the present invention will become more apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:
Reference will now be made in detail to the embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. The embodiments are described below to explain the present invention by referring to the figures.
Input image data is made up of group of picture (GOP) units. A discrete cosine transform (DCT) unit 420 performs DCT on an 8×8 block-by-block basis to obtain spatial redundancy of the input image data. A quantization (Q) unit 430 quantizes the image data that is discrete cosine transformed by the DCT unit 420. A dequantization (Q−1) unit 450 dequantizes the image data that is quantized by the quantization unit 430. An inverse DCT (IDCT) unit 460 performs IDCT on the image data that is dequantized by the dequantization unit 450. A frame memory unit 470 stores the image data that is inverse discrete cosine transformed by the IDCT unit 460 on a frame-by-frame basis. A motion estimation (ME)/motion compensation (MC) unit 480 estimates a motion vector (MV) and an SAD for each macroblock using image data of a current input frame and image data of a previous frame stored in the frame memory unit 470. A variable length-coding (VLC) unit 440 removes statistical redundancy of the quantized image data according to the motion vector estimated by the ME/MC unit 480.
Referring to
The current frame and the reference frame are formed into hierarchical structures through low-pass filtering and sub sampling.
At least one initial search point (vector) having a minimum SAD is searched by performing a full search at a lowest-resolution level (level 2) for motion estimation on a frame-by-frame basis (520).
A motion vector having a minimum SAD on a frame-by-frame basis is obtained by performing local searches for motion estimation on a frame-by-frame basis at an intermediate resolution level (level 1) using the found initial motion vectors of the lowest-resolution level (level 2) (520). At this time, the motion vector obtained at the intermediate-resolution level (level 1) is referred to as a based motion vector.
The based motion vector of the intermediate-resolution level (level 1) is commonly applied to frame motion estimation and field motion estimation at a highest-resolution level (level 0) (560). In other words, after local searches for motion estimation on a frame-by-frame basis and local searches for motion estimation on a field-by-field basis are performed using the based motion vector of the intermediate-resolution level (level 1) at the highest-resolution level (level 0), the motion vector having the minimum SAD on a frame-by-frame basis and the motion vector having the minimum SAD on a field-by-field basis are simultaneously estimated. At this time, a motion vector with respect to motion estimation between a top field and a top field (Top2Top) and with respect to motion estimation between a bottom field and a bottom field (Bot2Bot) are estimated with reference to SAD for each field which is automatically obtained in motion estimation. The based motion vector cannot be applied to the motion estimation between a top field and a bottom field (Top2Bot) and motion estimation between a bottom field and a top field (Bot2Top) because the distances between fields are different. Thus, the motion estimation between a top field and a bottom field (Top2Bot) and the motion estimation between a bottom field and a top field (Bot2Top) by selecting motion vectors having the minimum SAD between the top field and the bottom field (Top2Bot) and between the bottom field and the top field (Bot2Top) after performing local searches based on a based motion vector that is scaled, is based on the distance between fields.
As such, field motion estimation does not perform searches at the lowest-resolution level (level 2) and the intermediate-resolution level (level 1). Instead, local searches on a narrow search area are performed at the highest-resolution level (level 0) using search points obtained through frame motion estimation at the intermediate-resolution level (level 1).
In this embodiment, it is assumed that motion estimation is performed in a frame structure having three-level resolutions. However, it is natural that motion estimation be performed in a frame structure having resolutions of more than 3 levels. In the present invention, resolutions in the horizontal direction at levels 1, 2, and 3 are half resolutions of the vertical direction as shown in
The size of a macroblock at level 0 is equal to the original image (level 0) in the vertical direction and ½ of the original image (level 0) in the horizontal direction. Such a frame structure can be created by performing filtering on the original image in the vertical and horizontal directions using low-pass filtering (LPF) having the proper number of taps. For example, LPF may be {1, 2, 1}/4. Alternatively, another type of LPF may be used. Also, only sub sampling may be performed without performing LPF.
Here, LPF is performed. Proper 1-D LPF is applied to an image at level 0 only in the horizontal direction and is then 2:1 sub sampled to obtain the image at level 0 as shown in
Therefore, the size of the macroblock at level 0 is 8×16, the size of the macroblock at level 1 is 4×8, and the size of the macroblock at level 2 is 2×4.
In
In this embodiment, motion estimation is only performed at level 2 and 1, and motion estimation at level 0 and half-pixel accuracy motion estimation are not performed. Alternatively, motion estimation may be only performed at level 1 and motion estimation at level 0 and half-pixel accuracy motion estimation may be omitted.
If a result of first motion estimation of the current macroblock, which is obtained with reference to the adjacent frame B2, i.e., the initial motion vector 1, is MVL1, an initial point for final motion estimation that refers to the reference frame I0 is calculated based on the initial motion vector 1. The initial point is determined by the reference motion vector 2 that is an extension of the initial motion vector 1 to fit the distance between the current frame P3 and the reference frame I0.
In the case of
In this embodiment, the search area is determined based on the initial point determined by the reference motion vector 2, motion estimation is performed at level 1, and then motion estimation is performed at level 0. However, alternatively, the resolution of the reference motion vector 2 is adjusted, the search area is determined based on the adjusted resolution and the initial point determined, for example, at level 0, and motion estimation is performed at the determined search area.
In
Backward-direction motion estimation is performed using a motion vector obtained in forward-direction motion estimation as its initial point. In this embodiment, backward-direction motion estimation does not use a motion vector at level 0 of the forward direction motion estimation or a half-pixel accuracy motion vector as its initial point, but is performed based on a motion vector at level 1 of the forward direction motion estimation.
At this time, assuming that the initial motion vector 1 at level 1 of the forward direction is MVL1
In
Forward-direction motion estimation is performed using the motion vector obtained in backward-direction motion estimation as its initial point. In this embodiment, forward-direction motion estimation does not use a motion vector at level 0 of the backward direction motion estimation or a half-pixel accuracy motion vector as its initial point, but is performed based on a motion vector at level 1 of the backward direction motion estimation.
At this time, assuming that the initial motion vector 1 at level 1 of the forward direction is MVL1
The motion vectors of adjacent macroblocks used in the present invention are the motion vectors at level 1. Thus, in the case of
In operation 1120, an initial motion vector of a predetermined block of a current frame is calculated with reference to an adjacent frame of the current frame. Referring to
In this embodiment, the initial motion vector is calculated using hierarchical motion estimation based on the frame structure shown in
Alternatively, shown in
At this time, the candidate motion vector can be obtained by taking the medians of x coordinates and y coordinates of motion vectors of adjacent blocks. For example, if the motion vectors of adjacent blocks are (5, 6), (3, 8), and (7, 7), {median (5, 3, 7), median (6, 7, 8)} are equal to (5, 7).
In operation 1140, based on the calculated initial motion vector 1, a reference motion vector (2 of
In operation 1160, a resolution of the calculated reference motion vector 2 is adjusted, and a search area in the reference frame is determined at the adjusted resolution based on the adjusted reference motion vector. In this embodiment, the resolution of the calculated reference motion vector is adjusted from level 1 to level 0, and a search area in level 0 is newly determined using the adjusted reference motion vector.
In operation 1180, a motion vector corresponding to a search point having the minimum SAD is determined to be a final motion vector by performing motion estimation within the search area determined in operation 1160.
In operation 1220, an initial motion vector of a predetermined block of a current frame is calculated with reference to an adjacent frame of the current frame. The current frame is represented by P3, the adjacent frame is represented by B2, and the initial motion vector is represented by 1.
In this embodiment, the initial motion vector is calculated using hierarchical motion estimation based on the frame structure shown in
Alternatively, shown in
In operation 1240, based on the calculated initial motion vector 1, a reference motion vector (2 of
In operation 1260, a first search area is determined in the reference frame I0 based on the calculated reference motion vector 2 and a candidate motion vector is calculated by performing motion vector estimation within the determined search area.
In operation 1280, a resolution of the candidate motion vector is adjusted and a search area within the reference frame is newly determined at the adjusted resolution based on the adjusted candidate motion vector.
In operation 1300, a motion vector corresponding to a search point having the minimum SAD is determined to be a final motion vector by performing motion vector within the newly determined search area. In this embodiment, the resolution of the newly determined search area is adjusted from level 1 to level 0 and the motion vector corresponding to the search point having the minimum SAD is determined to be the final motion vector by performing motion estimation at level 0 within the newly determined search area.
As described above, in a motion estimation method according to the present invention, by performing motion estimation on a reference frame using a motion vector calculated at a low resolution with reference to an adjacent frame, it is possible to minimize the amount of computation and memory required for motion estimation.
The present invention can be implemented as computer readable codes in computer readable recording media. The computer readable recording media include all kinds of recording media such as ROM, RAM, CD-ROM, magnetic tapes, hard disk, floppy disk, flash memory, optical data storage media, and carrier wave (e.g., transmissions via the Internet). Also, the computer readable recording media is distributed over a computer system connected by network and stored and implemented as computer readable codes.
As described above, by adopting calculation of an absolute difference value according to the present invention, it is possible to reduce the number of adders used for calculation of an absolute difference value and alleviate the load of the apparatus for calculating the absolute difference value, the motion estimation apparatus, and the motion picture encoding apparatus.
While the present invention has been particularly shown and described with reference to exemplary embodiments thereof, it will be understood by those of ordinary skill in the art that various changes in form and details may be made therein without departing from the spirit and scope of the present invention as defined by the following claims.
Claims
1. A motion estimation method encoding motion pictures, the motion estimation method comprising:
- calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame;
- calculating a reference motion vector with reference to the calculated initial motion vector for motion estimation between the current frame and a reference frame; and
- adjusting a resolution of the calculated reference motion vector to a high resolution, determining a search area within the reference frame at the high resolution based on the adjusted reference motion vector, and calculating a final motion vector by performing motion estimation within the determined search area.
2. The motion estimation method of claim 1, wherein the reference motion vector is calculated based on the initial motion vector and a scaling coefficient based on a distance between the frames.
3. The motion estimation method of claim 1, wherein a resolution of a horizontal direction of the predetermined block is lower than a resolution of a vertical direction of the predetermined block.
4. The motion estimation method of claim 1, wherein the initial motion vector is calculated using a candidate motion vector that is calculated based on motion vectors of adjacent blocks of the predetermined block.
5. The motion estimation method of claim 4, wherein the candidate motion vector is a median value of the motion vectors of the adjacent blocks.
6. The motion estimation method of claim 4, wherein the motion vectors of the adjacent blocks, used for calculation of the candidate motion vector, are calculated at the low resolution.
7. The motion estimation method of claim 1, wherein the motion estimation method is applied to an IBBP frame according to MPEG 2 standards; and
- if the current frame is a P frame, the adjacent frame is a B frame that is adjacent to the P frame and the reference frame is an I frame that is previous to the P frame; and
- if the initial motion vector is MVL1, then the reference motion vector is MVL1×3.
8. The motion estimation method of claim 1, wherein the motion estimation method is applied to an IBBP frame according to MPEG 2 standards; and
- if the current frame is a B frame that is adjacent to an I frame, then the adjacent frame is the I frame and the reference frame is a P frame; and
- if the initial motion vector is MVL1, then the reference motion vector is MVL1×(−2).
9. The motion estimation method of claim 1, wherein the motion estimation method is applied to an IBBP frame according to MPEG 2 standards; and
- if the current frame is a B frame that is previous to a P frame, then the adjacent frame is the P frame and the reference frame is an I frame; and
- if the initial motion vector is MVL1, then the reference motion vector is MVL1×(−2).
10. The motion estimation method of claim 1, wherein an initial point for final motion estimation is determined by the reference motion vector.
11. The motion estimation method of claim 10, wherein the search area is determined based on the initial point.
12. The motion estimation method of claim 1, wherein the initial motion vector is calculated using hierarchical motion estimation based on a structure of the frames.
13. A computer readable recording medium having recorded thereon a program for a motion estimation method encoding motion pictures, the motion estimation method comprising:
- calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame;
- calculating a reference motion vector with reference to the calculated initial motion vector for motion estimation between the current frame and a reference frame; and
- adjusting a resolution of the calculated reference motion vector to a high resolution, determining a search area within the reference frame at the high resolution based on the adjusted reference motion vector, and calculating a final motion vector by performing motion estimation within the determined search area.
14. A motion estimation method encoding motion pictures, the motion estimation method comprising:
- calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame;
- calculating a reference motion vector based on the calculated initial motion vector for motion estimation between the current frame and a reference frame;
- determining a first search area in the reference frame based on the reference motion vector and calculating a candidate motion vector by performing motion vector estimation within the determined first search area at a low resolution; and
- adjusting a resolution of the calculated candidate motion vector to a high resolution, determining a second search area based on the adjusted candidate motion vector, and calculating a final motion vector by performing motion estimation within the determined second search area at the high resolution.
15. The motion estimation method of claim 14, wherein the reference motion vector is calculated based on the initial motion vector and a scaling coefficient based on a distance between the frames.
16. The motion estimation method of claim 14, wherein the motion estimation method is applied to an IBBP frame according to MPEG 2 standards; and
- if the current frame is a P frame, then the adjacent frame is a B frame that is adjacent to the P frame and the reference frame is an I frame that is previous to the P frame; and
- if the initial motion vector is MVL1, the reference motion vector is MVL1×3.
17. The motion estimation method of claim 14, wherein the motion estimation method is applied to an IBBP frame according to MPEG 2 standards; and
- if the current frame is a B frame that is adjacent to an I frame, then the adjacent frame is the I frame and the reference frame is a P frame; and
- if the initial motion vector is MVL1, then the reference motion vector is MVL1×(−2).
18. The motion estimation method of claim 14, wherein the motion estimation method is applied to an IBBP frame according to MPEG 2 standards; and
- if the current frame is a B frame that is previous to a P frame, then the adjacent frame is the P frame and the reference frame is an I frame; and
- if the initial motion vector is MVL1, then the reference motion vector is MVL1×(−2).
19. A computer readable recording medium having recorded thereon a program for a motion estimation method encoding motion pictures, the motion estimation method comprising:
- calculating an initial motion vector of a predetermined block of a current frame at a low resolution with reference to an adjacent frame of the current frame;
- calculating a reference motion vector based on the calculated initial motion vector for motion estimation between the current frame and a reference frame;
- determining a first search area in the reference frame based on the reference motion vector and calculating a candidate motion vector by performing motion vector estimation within the determined first search area at the low resolution; and
- adjusting a resolution of the calculated candidate motion vector to a high resolution, determining a second search area based on the adjusted candidate motion vector, and calculating a final motion vector by performing motion estimation within the determined second search area at the high resolution.
20. A motion estimation method of encoding motion pictures, the motion estimation method comprising:
- calculating an initial motion vector of a predetermined block of a current frame at a first resolution with respect to an adjacent frame;
- calculating a reference motion vector between the current frame and a reference frame based on the calculated initial motion vector; and
- adjusting a resolution of the calculated reference motion vector to a second resolution, determining a search area within the reference frame at the second resolution based on the adjusted reference motion vector, and calculating a final motion vector by performing motion estimation within the determined search area.
21. The motion estimation method of claim 20, wherein the reference motion vector is calculated based on the initial motion vector and a scaling coefficient based on a distance between the frames.
22. The motion estimation method of claim 20, wherein a resolution of a horizontal direction of the predetermined block is lower than a resolution of a vertical direction of the predetermined block.
23. The motion estimation method of claim 20, wherein the initial motion vector is calculated using a candidate motion vector calculated based on motion vectors of adjacent blocks of the predetermined block.
24. The motion estimation method of claim 23, wherein the candidate motion vector is a median value of the motion vectors of the adjacent blocks.
25. The motion estimation method of claim 23, wherein the motion vectors of the adjacent blocks, used for calculation of the candidate motion vector, are calculated at the first resolution.
Type: Application
Filed: Nov 23, 2004
Publication Date: Jun 9, 2005
Applicant: Samsung Electronics Co., Ltd. (Suwon-si)
Inventors: Byung-cheol Song (Suwon-si), Kang-wook Chun (Hwassono-gun)
Application Number: 10/994,435