Method and system for image scaling
A method for image scaling. Specifically, one embodiment of the present invention discloses a method for image scaling that begins by selecting an input pixel from a first image. An input edge characteristic is determined from an input window associated with the input pixel. The input edge characteristic is determined from a plurality of predetermined edge characteristics that are associated with sets of filter coefficient vectors. An output window is generated by filtering the input window with a corresponding set of filter coefficients associated with the input edge characteristic. An output image is generated by repeating the above for a plurality of input pixels associated with the first image.
Latest Hewlett Packard Patents:
The present invention relates to the field of image scaling.
BACKGROUND ARTFor many computer process applications, it is necessary to scale input data of lower resolution into output data of higher resolution in order to achieve acceptable, usable, or improved results.
Representative data often presents a low resolution input data set that is a poor replication of the original source. The data presents something much less than a true-life rendering. Many types of source data can benefit by data resolution enhancement: computer image data, digital or analog video data, geographic data measured on a regular grid, vector field data, speech synthesis signals data, and the like.
As to more specific examples in the field of image processing, such as, digital scanning, dot matrix printing, digital camera to video display conversion, digital photography, Internet provider data, JPEG computer image standards, CDROM, magnetic resonance imaging, interframe interpolation for analog video, and the like, the source of the imaging data often provides graphical image data (including bit mapped alphanumeric text images) which is of a relatively low resolution. The term “graphics” as used herein shall be understood to be generic for all forms of imaging. Therefore, output renderings made directly from the source data may be of a low quality.
Moreover, source images, such as, JPEG photographs, Internet files, are often compressed using relatively high data loss compression algorithms to reduce memory storage requirements for the providers. In some cases, decompression algorithms applied when the data is retrieved from memory essentially interpolate the reduced data using simple linear interpolation techniques that lack the precision necessary to reproduce the original image.
For example, in color printing, such as with laser or ink-jet dot matrix hard copy apparatus, resolution enhancement may be necessary to create a print with higher resolution in order to provide a sharper, less blocky, and smoother textured image. In another case, resolution enhancement may be necessary to zoom in and crop a video image before printing, or merely to enhance a particular detail within the original image.
Some prior image scaling techniques provide resolution enhancement through linear filtering approaches. These approaches tend to be less computationally intensive by averaging pixel values across a particular direction. A particular drawback to linear filtering is that sharp edges in the higher resolution image appear blurry and blocky. Previously, the linear filtering approaches were not able to distinguish between sharp edge regions and smooth transition regions to adapt their processing in or around the regions containing sharp edges.
For example, B-spline interpolators, such as image picture element (“pixel”) replication and bilinear interpolation, are well known in the art. The technique of B-spline interpolation specifies an interpolating function which approximates the original image by sampling across a range of the source image data. The interpolating function is defined on a continuous-valued parameter and may be evaluated at arbitrary points. The resulting values comprise the interpolated image pixels. However, a major drawback to B-spline interpolation is the amount of computation needed before recovering the desired output samples, which increases the complexity and cost of B-spline interpolation techniques, especially for higher orders of B-spline interpolation techniques, such as cubic B-spline interpolations. Moreover, B-spline interpolation techniques still suffer from blurring around sharp edges in the higher resolution image.
In particular, both pixel replication and bilinear interpolation perform satisfactorily for interpolating smooth textures. However, these single linear filter approaches tend to average image data across high-contrast boundaries like edges and rapid contrast change zones, causing a blocky or blurry printed result. Thus, these techniques are not especially effective in terms of edge and detail rendition.
More specifically, in pixel replication, the interpolating function is piecewise constant. As such, each output pixel simply takes the value of the closest input pixel. Pixel replication especially tends to produce blocking where the sharp edge appears as stair step edge renderings. Also, pixel replication produces aliasing artifacts where undesirable spurious marks appear.
In bilinear interpolation, the interpolating function is piecewise linear, such that, known data samples are connected with straight lines. As such, each output pixel may be computed as a linear combination of up to four input pixels. Since bilinear interpolation is a simple averaging algorithm, it produces results that can be blurry.
Nonlinear scaling techniques provide an improvement to the linear scaling techniques particularly when dealing with regions with sharp edges. Primarily, the nonlinear scaling techniques locate the regions with sharp edges and treat them differently to avoid blurring cross the edge boundary. In areas not containing sharp edges, the nonlinear scaling techniques use linear scaling techniques, such as, bilinear interpolation. In general, these edge enhancement and smoothing nonlinear techniques find sharp edges in the source data and render them continuous, rather than appearing to have steps, and sharp at the target resolution without blurring the values of pixels which lie on different sides of the edge.
However, these nonlinear scaling techniques often rely on ad hoc characterizations for edge structures which may not reflect the true edge structure. Moreover, some nonlinear scaling techniques, such as those relying on stochastic models for image data, often require iterative optimization which can require excessive computational and/or storage resources.
Two prior approaches include U.S. Pat. No. 6,075,926 (Atkins et al. assigned to the common assignee of the present invention, entitled, “Computerized Method for Improving Data Resolution), and U.S. Pat. No. 6,058,248 (Atkins et al. assigned to the common assignee of the present invention, entitled, “Computerized Method for Improving Data Resolution”). Both approaches use a process to determine a class index for each input pixel, where the class refers to edge sharpness and orientation. However, these previous classification processes require that a feature vector be extracted from the window surrounding the input pixel, and further that distances be calculated between the feature vector and a predetermined set of vectors that represent the different classes requiring additional computational time and resources.
Therefore, prior art methods suffer from either being too computationally complex or too simplistic. For example, linear filtering techniques in the prior art, while less computationally intensive, are disadvantageous because of their inability to maintain sharpness of edges when increasing resolution of an image resulting in blocky or blurry results. In addition, prior art nonlinear filtering techniques, while an improvement to linear filtering, suffer from procedures that are too computationally intensive, such as, iterative operations to distinguish between sharp and smooth edges.
DISCLOSURE OF THE INVENTIONA method for image scaling is disclosed. Specifically, one embodiment of the present invention discloses a method for image scaling that begins by selecting an input pixel from a first image. An input edge characteristic is determined from an input window associated with the input pixel. The input edge characteristic is determined from a plurality of predetermined edge characteristics that are associated with sets of filter coefficients. An output window is generated by filtering the input window with a corresponding set of filter coefficients associated with the input edge characteristic. An output image is generated by repeating the above for a plurality of input pixels associated with the first image.
Reference will now be made in detail to the preferred embodiments of the present invention, a method of image scaling. While the invention will be described in conjunction with the preferred embodiments, it will be understood that they are not intended to limit the invention to these embodiments. On the contrary, the invention is intended to cover alternatives, modifications and equivalents, which may be included within the spirit and scope of the invention as defined by the appended claims.
Furthermore, in the following detailed description of the present invention, numerous specific details are set forth in order to provide a thorough understanding of the present invention. However, it will be recognized by one of ordinary skill in the art that the present invention may be practiced without these specific details. In other instances, well known methods, procedures, components, and circuits have not been described in detail as not to unnecessarily obscure aspects of the present invention.
Embodiments of the present invention can be implemented on software running on a computer system. The computer system can be a personal computer, notebook computer, server computer, mainframe, networked computer, handheld computer, personal digital assistant, workstation, and the like. This software program is operable for providing image scaling. In one embodiment, the computer system includes a processor coupled to a bus and memory storage coupled to the bus. The memory storage can be volatile or non-volatile and can include removable storage media. The computer can also include a display, provision for data input and output, etc. Other embodiments of the present invention can be implemented on hardware (e.g., application specific integrated circuits (ASICs)) if sufficient resources are available.
Accordingly, the present invention provides a method and system for image scaling. The present invention provides image scaling results that are of considerably higher quality than the approaches based on linear filtering (e.g., bilinear and bicubic interpolation). In addition, the present invention, while giving image scaling results comparable to the above non-linear approaches, is not computationally intensive. Furthermore, since the present invention can be implemented using integer arithmetic, hardware implementation of the image scaling technique achieves superior results that are faster and more efficient over previous techniques in the prior art.
In step 120, the present embodiment proceeds by determining an input edge characteristic of an input window surrounding the input pixel. The input edge characteristic is determined from a plurality of predetermined edge characteristics. Further, the plurality of predetermined edge characteristics are associated with corresponding filter coefficients. In one embodiment, each of the plurality of predetermined edge characteristics is associated with a unique set of filter coefficient vectors. As such, the input edge characteristic is associated with a corresponding set of predetermined filter coefficient vectors. Moreover, the input window is comprised of the pixels in the first image associated with the input pixel. A first vector of pixels defines the pixel values corresponding to the pixels in the input window. In one embodiment, the input window surrounds the input pixel.
In step 130 of flow chart 100, the present embodiment proceeds by identifying a class index based on the input edge characteristic that is associated with the corresponding set of predetermined filter coefficient vectors. The class index is selected from a predetermined set of class indexes each uniquely corresponding to one of said plurality of predetermined edge characteristics. Furthermore, each of the predetermined set of class indexes corresponds to an associated set of predetermined filter coefficient vectors.
The use of the class index is for clarity and computational efficiency, since the class index uniquely corresponds to one of the plurality of predetermined edge characteristics. A complete description of the present invention may be made without reference to class indexes. As such, embodiments of the present invention are well suited to methods of image scaling that do not employ the use of class indexes as identified in step 130.
In step 140, the present embodiment proceeds by generating a block of output pixels, or an output window, by filtering the input window with a corresponding set of filter coefficient vectors that is associated with the selected class index. In one embodiment, each pixel in the output window is generated by multiplying the first vector and a corresponding filter coefficient vector from the corresponding set of filter coefficient vectors.
In step 150, the present embodiment proceeds by generating an output image having higher resolution than the first image by repeating steps 110, 120, 130, and 140 for each of the plurality of input pixels in the first image.
Part of the analysis and resolution synthesis may require access to a look up table 240 by the resolution synthesis module 230. The look up table 240 may contain a plurality of class indexes 245, each of which is associated with a particular edge characteristic. Each of the plurality of class indexes 245 corresponds to and is associated with a unique set of predetermined filter coefficient vectors. As such, the plurality of class indexes 245 is associated with a plurality of sets of predetermined filter coefficient vectors. Within each of the predetermined filter coefficient vectors 247, the corresponding filter coefficients that comprise the predetermined filter coefficient vectors 247 are also stored in the look up table 240.
The data analysis and resolution synthesis module 230 generates an output of high resolution data. The high resolution output data can be stored in storage apparatus 255 contained within the output device 250, in accordance with one embodiment of the present invention. In addition, the high resolution output data may be further modified, or enhanced to conform the output data to the resolution capabilities of the output device 250 (e.g., a variety of user output options provided in a color printer) in order to create a high resolution output image 260.
Referring now to
The higher resolution output image 260 is of higher resolution than the in put image 220 by an image scaling factor L. For an input image 220 that is a first image, the resolution of the output image is an integer factor L (e.g., L is greater than or equal to 2) greater than the input image. For example, if the input image 220 is 100×100 pixels, then the output image has one of the following dimensions: 200×200, 300×300, 400×400, etc. The image scaling factor L represents the integer scaling factor. As such, L correspondingly can take any of the following integer values: 2, 3, 4, etc.
In one embodiment, the input and output images are gray scale. Although the present embodiment is employed with gray scale images for purposes of illustration, brevity, and clarity, it should be understood that the present invention can be implemented and extended to generate color output images from input color images. For example, in one embodiment, the method of image scaling would be implemented on each of the colors red, green, and blue, in an RGB color scheme. In addition, embodiments of the present invention are well suited to images represented by using other color schemes.
Returning back to
In one embodiment, the input window 320 of pixels surrounds the input pixel 310 and is of a S×S pixel dimension. For example, in
Other embodiments of the present invention are well suited to input windows 320 of various shapes and sizes that are associated with the input pixel 310. The various shapes and sizes of the input window 320 chosen can be determined from computational resource considerations. For example, a smaller input window 320, than that of
The input window 320 can be represented by an input window vector Z 325. The components of the vector Z 325 are the pixel values of the input window 320 associated with the input pixel 310. For example, for the pixel 321 of input window 320, the pixel value z1, is also the first component of the vector Z 325. Correspondingly, for the pixel 327 of input window 320, the pixel value z25 is also the last component of the vector Z 325. In one embodiment the vector Z 325 is a column vector.
For each input pixel 310 in the input image, a class index of the associated input window 320 is determined in the class identifier module 235. In one embodiment, there are m predetermined class indexes {1 . . . m}. The class index identifies the edge characteristics of the input window 320 that is associated with the input pixel 310. In general, the class index represents edge attributes, such as edge angle and edge transition sharpness, which are critical factors in image scaling.
Further, each of the predetermined class indexes is associated with a unique set of filter coefficient vectors. The filter coefficient vectors are of a dimension to correspond to the input window 320 and the dimension of the vector Z 325. In one embodiment, the filter coefficient vectors are of S×S dimension in the form of row vectors. In the present embodiment, the filter coefficient vectors are of a 25 component dimension. In addition, each set contains up to L×L (L2) filter coefficient vectors to correspond to the L×L block of output pixels in the output window 350. Also, each filter coefficient vector is for computing at least one output pixel in the output window 350.
In the present embodiment, the input pixel 310 is classified with class index j 335, with j being a number between 1 and m inclusive. As such, for input pixel 310, a class index j 335 is associated with a corresponding set of predetermined filter coefficient vectors 330 of 25 component dimension, as follows: vector aj,1=[ ]1×25 through vector aj,(L×L)=[ ]1×25.
Returning now to
In one embodiment, each of the output pixels in the output window 350 is computed by multiplying the corresponding predetermined filter coefficient vector in the set 330 of predetermined filter coefficient vectors with vector Z 325 to obtain a scalar value. For example for output pixel 351, the value is computed by multiplying vector aj,1 with the vector Z 325, to produce a scalar value. For output pixel 359, the value is computed by multiplying vector aj,(L×L) with the vector Z 325. This form of computation is commonly referred to as the dot product between the corresponding predetermined filter coefficient vector and the vector Z 325 to generate the scalar value for an associated output pixel. Vector notation is being used here to illustrate the form of the filtering process. However, the vector notation is not necessary. In fact, one skilled in the art will recognize this form of calculation is exactly the same as the filtering process more commonly known as convolution.
Formally, the output pixels are computed in the following equation:
where OPi is output pixel i (where the variable i indexes through the block of output pixels corresponding to the input pixel: e.g., when L=3, i ranges from 1 to 9); where zn is the nth element of the input window vector Z 325; and where aj,i(n) is the nth element of vector aj,i, which is the filter coefficient vector associated with class index j used for computing the ith output pixel. The output window 350 is computed for OPi for i between 1 and L2.
The output pixels in the output window 350 associated with the high resolution output image 260 may be generated in any order, in one embodiment, or even in parallel if resources allow, in another embodiment. Further, in still another embodiment, output pixels in the high resolution output image 260 are generated by processing the input pixels in raster order.
The input pixel 310 classification process is illustrated in the flow chart 400 of
In one embodiment, there are 25 different class indexes describing edge characteristics, which can be divided into 4 groups. In the first group, classes 1 through 8 represent relatively smooth transitions having different angular orientations. The angles can be quantized into 8 separate bins, each subtending 45 degrees of arc. For each class in the first group, there is a corresponding class in the second and third group. The corresponding class in the second and third group represents a sharper edge with the same angular orientation. However, in the second group, classes 9 through 16 are for pixels that lie on the dark side of sharp edges. Also, in the third group, classes 17 through 24 are for pixels that lie on the light side of sharp edges. The second and third group represent relatively sharp edge transitions. Finally, in the fourth group, class zero (O) usually corresponds to the case where there are no transitions or edges at all.
Although the present embodiment is employed with 25 different class indexes for purposes of illustration, brevity, and clarity, it should be understood that other lesser or greater numbers of index classes could be employed in conjunction with the present invention.
The class index selection is based on gradient information along two different directions which conveys the angle of any edge present at the input pixel 310 within the input pixel window 320. The gradient information can be summarized by edge angle coefficients Gx and Gy, where x and y represent two different directions. In one embodiment, x and y are perpendicular to each other.
Depending on the gradient information, the class index selection may also be based on curvature information (A, B, and C), which describes the side of the edge the input pixel lies within (e.g., the darker side or the lighter side).
Referring now back to
where zn is the nth element of the input window vector Z 325, and Cx(n) is the nth element of vector Cx which comprises the coefficients in operator 540, with an order analogous to the order of elements in input window vector Z 325.
Similarly, the scalar value of edge angle coefficient Gy 530 is computed by the following equation:
where Zn is the nth element of the input window vector Z 325, and Cy(n) is the nth element of vector Cy which comprises the coefficients in operator 550, with an order analogous to the order of elements in input window vector Z 325.
Returning back to
Edge Sharpness Metric=(Gx)(Gx)+(Gy)(Gy) (4)
In decision step 420, the edge sharpness metric is compared to a first threshold value (THRESH1) to determine whether an edge is present in the input window 320. In one embodiment, THRESH1 is equal to zero, which indicates the complete absence of an edge. In one embodiment, the value of THRESH1 was determined empirically, and remained fixed for all input pixels in the input image. In another embodiment, the value of THRESH1 can change from input image to input image, or from input pixel to input pixel.
If the edge sharpness metric is less than or equal to THRESH1, the present embodiment proceeds to step 425 where the class index j 335 is defined to be zero, and the classification process for input pixel 310 ends.
On the other hand, if the edge sharpness metric is greater than THRESH1, this indicates the presence of an edge within the input window 320. The present embodiment then proceeds to step 430 to determine an angular orientation of the edge within the input window 320, and based on the angular orientation to determine an initial or tentative class index between 1 and 8 based on Gx 520 and Gy 530.
In decision step 610, the present embodiment determines if Gx 520 is greater than zero. If Gx 520 is less than or equal to zero, then the present embodiment proceeds to decision step 620 to determine if Gy 530 is less than zero. If Gy 530 is greater than or equal to zero, then the present embodiment determines if Gx 520 is less than the negative of Gy 530. If Gx 520 is greater than or equal to the negative of Gy 530, then the tentative class index is assigned as index 8 in step 688. On the other hand, if Gx 520 is less than the negative of Gy 530, then the tentative class index is assigned as index 7 in step 687.
Returning back to decision step 620, if Gy 530 is less than zero, then the present embodiment proceeds to decision step 640 to determine if Gx 520 is greater than Gy 530. If Gx 520 is less than or equal to Gy 530, then the tentative class index is assigned as class index 6 in step 686. On the other hand, if Gx 520 is greater than Gy 530, then the tentative class index is assigned as class index 5 in step 685.
Returning back to decision step 610, if Gx 520 is greater than zero, then the present embodiment proceeds to decision step 650 to determine if Gy 530 is greater than zero. If Gy 530 is less than or equal to zero, then the present embodiment proceeds to decision step 660 to determine if Gx 520 is greater than the negative of Gy 530. If Gx 520 is less than or equal to the negative of Gy 530, then the present embodiment assigns the tentative class index as class index 4 in step 684. On the other hand, if Gx 520 is greater than the negative of Gy 530, then the tentative class index is assigned as class index 3 in step 683.
Returning back to decision step 650, if Gy 530 is greater than zero, then the present embodiment proceeds to decision step 670 to determine if Gx 520 if less than Gy 530. If Gx 520 is greater than or equal to Gy 530, then the present embodiment assigns the tentative class index as class index 2 in step 682. On the other hand, if Gx 520 is less than Gy 530, then the present embodiment assigns the tentative class index as class index 1 in step 681.
Returning back to
On the other hand, if the edge sharpness metric is greater than or exceeds the threshold value THRESH2, then the present embodiment proceeds to step 450 to compute the curvature coefficients A 710, B 720, and C 730. The diagram 700 in
Returning back to
(½)A(Gx)2+BGxGy+(½)C(Gy)2 (5)
In decision step 470, the present embodiment determines whether the curvature metric, as determined in Equation 5, is greater than zero. If the curvature metric is less than or equal to zero, indicating pixels that lie on the light side of the edge, then the value of sixteen is added to the tentative class index, as determined in step 430, to determine the class index j 335 of the input pixel 310. On the other hand, if the curvature metric is greater than zero, indicating pixels that lie on the dark side of the edge, then the value of eight is added to the tentative class index to determine the class index j 335 of the input pixel 310.
Although the present embodiment discloses the computation of the curvature coefficients A 710, B 720, and C 730, as well as, the curvature metric, other embodiments are well suited to determining the class index without computing the curvature coefficients A 710, B 720, and C 730, or the curvature metric. In that case, from decision step 445, if the edge sharpness metric is greater than THRESH2, the present embodiment would proceed to step 490. The tentative class index would be determined from a group of class indexes that represent relatively sharp edges, such as those in the second and third groups of class indexes, as discussed previously.
Referring now to
Other embodiments of the present invention are well suited to other means or methods for determining filter coefficients used for generating higher resolution images that may or may not involve the use of sample images (e.g., heuristic techniques, etc.).
The filter coefficients are generated for a given integer image scaling factor (e.g., L). The image scaling factor can be any integer greater than 1, as previously discussed.
The present embodiment generates lower resolution images from sample images 805, in step 810. The sample images 805 are digital images, such as stock photographs from an image library. For each sample image, a lower resolution image is created based on the corresponding sample image. The lower resolution image has a lower resolution than the corresponding sample image by the image scaling factor L. As such, the sample image is also of a higher resolution than its corresponding lower resolution image by the image scaling factor L.
In one embodiment, each of the lower resolution images is generated by block averaging its corresponding sample image. As such, each pixel in the lower resolution image is generated as the average of the corresponding L×L sample output block 1033 of pixels in the original higher resolution sample image, where L is the image scaling factor. For example, the sample output block window 1033 of the sample image 1030 is averaged to obtain the value for pixel 1015 in the lower resolution image, a portion of which is shown by sample input window 1020 associated with sample input pixel 1015.
In step 820, for pixels in the lower resolution image, a vector pair is created of a low resolution vector from the lower resolution image corresponding to the sample input window 1020, and its corresponding higher resolution vector from the sample output block 1033 from the sample image 1030. The vector pair is denoted as (X,Z): where vector X is the higher resolution vector and vector Z is the lower resolution vector. For any given pixel in the lower resolution sample images, the lower resolution vector includes the window of pixels associated with that pixel, and the higher resolution vector includes the L×L block of high-resolution pixels that correspond to the low-resolution pixel. A plurality of vector pairs is created for all the sample input images 805.
In step 830 of
In step 840, once all the vector pairs are classified, a unique set of predetermined filter coefficient vectors can be calculated for each of the plurality of class indexes, each of which are associated with a predetermined edge characteristic. As such, an output of a plurality of sets of filter coefficient vectors 850 is generated that are the predetermined filter coefficients as used for generating a higher resolution image of an input image as disclosed in
From the vector pairs, classified as being associated with class j in step 840, the average of the low resolution Z vectors, which correspond to the windows surrounding the input pixels from the lower resolution images (e.g., input window 1020), is computed in the following equation:
In addition, the average of the corresponding X vectors, which correspond to the output block window (e.g., block 1033) from the high resolution image sample (e.g., sample image 1033) is computed from the following equation:
Also, the sample covariance matrix of the Z vectors, and the sample cross covariance matrix of the X and Z vectors are computed from the following equations, respectively:
Each of the quantities computed in Equations 6A through 6D is computed separately for each class index value of j between 1 and m inclusive. For any given value j, Nj is the number of sample vector pairs classified into class j, as described in step 830. In addition, for any given value of j, the sums in Equations 6A through 6D include only the vectors classified into class j in step 830. Superscript T in Equations 6C and 6D denotes the vector transpose, which is well known to one skilled in the art. Superscript (−1) denotes the matrix inverse, which is well known to one skilled in the art.
Thereafter, the image scaling filter coefficients for each of the sets of filter coefficient vectors, which correspond to each of the class indexes are calculated according to the following equation:
Aj=ΓXZ,j(ΓZ,j)−1 (7)
where Aj is a matrix having L×L rows and a number of columns equal to the dimension of the low resolution Z vector, whose rows are the filter coefficient vectors for class index j. The quantity in Equation (7) is computed for each class index value of j between 1 and m inclusive.
Image scaling results of the present invention are of considerably higher quality than the approaches based on linear filtering, such as bilinear and bicubic interpolation.
The present invention, while giving image scaling results comparable to the above non-linear approaches, is not computationally intensive. For example, in one embodiment, the method of image scaling is not iterative, but rather requires only one pass through the input image. Further, the method of image scaling does not rely on ad-hoc models for high-resolution image edge structure but rather makes use of specialized prediction filters to compute the output pixels as a function of the pixels in the vicinity of the input pixel. As such, the classification of edge characteristics of input pixels in input images is more efficient and faster.
Furthermore, this invention may be implemented using only integer arithmetic. This may be a significant advantage for hardware implementation of the method for image scaling as described in
The preferred embodiment of the present invention, a method for image scaling, is thus described. While the present invention has been described in particular embodiments, it should be appreciated that the present invention should not be construed as limited by such embodiments, but rather construed according to the below claims.
Claims
1. A method of image scaling comprising:
- selecting an input pixel associated with a first image;
- determining an input edge characteristic, from a plurality of predetermined edge characteristics associated with a plurality of sets of filter coefficients, of an input window associated with said input pixel, wherein said input edge characteristic defines an edge angle and an edge sharpness of said input window;
- generating an output window by filtering said input window with a corresponding set of filter coefficients based on said input edge characteristic, wherein said output window is scaled by an image scaling factor L over said input pixel; and
- generating an output image of a higher resolution than said first image by said image scaling factor L by repeating said steps of selecting, determining, generating said output window, and generating said output image for a plurality of input pixels associated with said first image without requiring distances to be calculated between vectors.
2. The method of image scaling as described in claim 1, further comprising:
- determining filter coefficients for said plurality of sets of filter coefficients according to said image scaling factor L, said determining of filter coefficients performed before said selecting said input pixel.
3. The method of image scaling as described in claim 2, wherein said determining filter coefficients comprises:
- generating a lower resolution image by said image scaling factor L of a sample image;
- creating a plurality of window pairs for pixels in said lower resolution image comprising a lower resolution window of a first dimension equal to said input window, and a higher resolution window of a second dimension equal to said output window;
- classifying every window pair as one of said plurality of predetermined edge characteristics; and
- for every predetermined edge characteristic, calculating its associated set of predetermined filter coefficients from associated window pairs.
4. The method of image scaling as described in claim 3, further comprising:
- repeating steps for generating said lower resolution, creating said plurality of window pairs, and classifying every window pair for a plurality of sample images, said repeating performed before the step of calculating its associated set of predetermined filter coefficients.
5. The method of image scaling as described in claim 1, wherein said generating said output window comprises:
- generating an output pixel in said output window by computing the dot product of a first vector of pixels associated with said input window and a corresponding filter coefficient vector associated with said corresponding set of filter coefficients.
6. The method of image scaling as described in claim 1, wherein the steps of selecting, determining, generating said output window, and generating said output are performed using integer arithmetic.
7. The method of image scaling as described in claim 1, wherein said determining said input edge characteristic further comprises:
- determining an edge angle metric;
- determining an edge sharpness metric;
- determining said input edge characteristic based on said edge sharpness metric and said edge angle metric.
8. The method of image scaling as described in claim 7, further comprising:
- determining edge angle coefficients, said determining edge angle coefficients performed before the step of determining said edge angle metric, wherein said edge angle metric and said edge sharpness metric are based on said angle coefficients.
9. The method of image scaling as described in claim 7, further comprising:
- determining the absence of an edge when said edge sharpness metric is less than or equal to a first threshold;
- determining a smooth edge transition when said edge sharpness metric is less than or equal to a second threshold; and
- determining a sharp edge when said edge sharpness metric is greater than said second threshold.
10. The method of image scaling as described in claim 7, wherein said determining said edge angle metric further comprises:
- determining edge angle coefficients; and
- quantizing said edge angle coefficients into edge angles for determining said edge angle metric.
11. The method of image scaling as described in claim 1, wherein said input window surrounds said input pixel.
12. The method of image scaling as described in claim 1, wherein said output window is of L×L pixel dimension.
13. A method of image scaling comprising:
- selecting an input pixel associated with a first image of a first resolution, said first image comprising a plurality of input pixels;
- determining an input edge characteristic, from a plurality of predetermined edge characteristics, of an input window associated with said input pixel and comprising a first vector of pixels from said input window, wherein said input pixel and characteristic defines an edge angle and an edge sharpness of said input window;
- identifying a class index based on said input edge characteristic from a predetermined set of class indexes corresponding to said plurality of predetermined edge characteristics, said class index associated with a corresponding set of predetermined filter coefficient vectors;
- generating an output window by filtering said input window with said corresponding set of predetermined filter coefficient vectors, wherein said output window is scaled by an image scaling factor L over said input pixel; and
- generating an output image by repeating said steps of selecting said input pixel, determining said input edge characteristic, identifying said class index, and generating said output window for said plurality of input pixels without requiring distances to be calculated between vectors.
14. The method of image scaling as described in claim 13, wherein said output image is of a second resolution that is greater than said first resolution by an image scaling factor L that is an integer greater than or equal to two, and wherein said output window is of L×L dimension.
15. The method of image scaling as described in claim 13, wherein said input window surrounds said input pixel and is of S×S pixel dimension, and wherein each filter coefficient vector of said corresponding set of predetermined filter coefficient vectors has S×S coefficients.
16. The method of image scaling as described in claim 15, wherein said input window is of 5×5 pixel dimension.
17. The method of image scaling as described in claim 13, wherein said determining said input edge characteristic further comprises:
- calculating a first edge angle coefficient along a first direction;
- calculating a second edge angle coefficient along a second direction;
- computing an edge sharpness metric based on said first and second edge angle coefficients;
- determining whether an edge is present in said input window based on said edge sharpness metric;
- assigning a first predetermined edge characteristic as said input edge characteristic if said edge is not present; and
- determining a second edge characteristic from said plurality of predetermined edge characteristics for said input edge characteristic if said edge is present.
18. The method of image scaling as described in claim 17, wherein said determining said second edge characteristic comprises:
- determining an edge angle metric based on said first and second edge angle coefficients;
- determining whether said edge sharpness metric indicates less than a degree of sharpness;
- selecting one of a first group of edge characteristics if said edge sharpness metric indicates less than said degree of sharpness; and
- selecting one of a second group of edge characteristics if said edge sharpness metric does not indicate less than said degree of sharpness.
19. The method of image scaling as described in claim 17, wherein said computing said edge sharpness metric further comprises:
- computing said edge sharpness metric by adding the square of said first edge angle coefficient to the square of the second edge angle coefficient.
20. The method of image scaling as described in claim 17, wherein said determining whether said edge is present further comprises:
- comparing said edge sharpness metric to a threshold, said edge not present when said edge sharpness metric is less than or equal to said threshold.
21. The method of image scaling as described in claim 18, wherein said determining said edge angle metric further comprises:
- quantizing said edge angle coefficients into edge angles for determining said edge angle metric.
22. The method of image scaling as described in claim 13, wherein said generating said output window comprises:
- generating each pixel scalar value in said output window by computing the dot product of said first vector and a corresponding filter coefficient vector from said corresponding set of filter coefficient vectors.
23. The method of image scaling as described in claim 18, wherein said determining whether said edge sharpness metric indicates less than a degree of sharpness further comprises:
- comparing said edge sharpness metric to a threshold, said edge sharpness metric not indicating said degree of sharpness if said edge sharpness metric is less than or equal to said threshold.
24. The method of image scaling as described in claim 13, further comprising:
- determining filter coefficients for said plurality of class indexes.
25. The method of image scaling as described in claim 24, wherein said determining filter coefficients comprises:
- generating a lower resolution image of a sample image by an image scaling factor L;
- creating a plurality of vector pairs for pixels in said lower resolution image comprising a lower resolution vector of a first dimension equal to said first vector, and a higher resolution vector of a corresponding output window in said sample image;
- classifying each of said plurality of vector pairs into one of said plurality of predetermined edge characteristics; and
- for every predetermined edge characteristic, calculating its associated set of predetermined filter coefficient vectors from associated vector pairs.
26. The method of image scaling as described in claim 25, further comprising:
- repeating steps of generating said lower resolution image, creating said plurality of vector pairs, and classifying each of said plurality of vector pairs for a plurality of sample images, said repeating performed before the steps of performing the step of calculating its associated set of predetermined filter coefficient vectors.
27. A method of image scaling comprising:
- generating a plurality of lower resolution images of a plurality of sample images by an image scaling factor L;
- pairing input windows associated with pixels in said plurality of lower resolution images with corresponding output windows associated with pixels in said plurality of sample images into a plurality of window pairs;
- classifying each of said plurality of window pairs into a plurality of predetermined edge characteristics, wherein each of said plurality of predetermined input edge characteristics defines an edge angle and an edge sharpness of a corresponding input window;
- calculating filter coefficients for said plurality of predetermined edge characteristics; and
- generating an output image of a higher resolution than an input image comprising a plurality of input pixels by said image scaling factor L by calculating an L×L block of output pixels for each of said plurality of input pixels using said filer coefficients without requiring distances to be calculated between vectors.
28. The method of image scaling as described in claim 27, wherein said generating said lower resolution images further comprises:
- generating said lower resolution images by block averaging said sample images.
29. The method of image scaling as described in claim 27, wherein said generating said output image further comprises:
- selecting an input pixel associated with said input image;
- determining an input edge characteristic, from a plurality of predetermined edge characteristics associated with said filter coefficients, of an input window associated with said input pixel;
- generating an output window by filtering said input window with a corresponding set of filter coefficients associated with said input edge characteristic; and
- generating an output image by repeating said steps of selecting said input pixel, determining said input edge characteristic, and generating said output window for said plurality of input pixels.
30. The method of image scaling as described in claim 27, wherein said plurality of predetermined edge characteristics further comprises:
- a first group representing relatively smooth edge transitions;
- a second group representing relatively sharp edge transitions; and
- a third group representing the absence of an edge.
31. The method of image scaling as described in claim 27, wherein said output window is generated for each color per pixel in said plurality of input pixels.
32. A computer system comprising:
- a processor: and
- a computer readable memory coupled to said processor and containing program instructions that, when executed, implement a method of image scaling comprising:
- selecting an input pixel associated with a first image;
- determining an input edge characteristic, from a plurality of predetermined edge characteristics associated with a plurality of sets of filter coefficients, of an input window associated with said input pixel, wherein said input edge characteristic defines an edge angle and an edge sharpness of said input window;
- generating an output window by filtering said input window with a corresponding set of filter coefficients based on said input edge characteristic, wherein said output window is scaled by an image scaling factor L over said input pixel; and
- generating an output image of a higher resolution than said first image by said image scaling factor L by repeating said steps of selecting said input pixel, determining said input edge characteristic, generating said output window, and generating said output image for a plurality of input pixels associated with said first image without requiring distances to be calculated between vectors.
33. The computer system as described in claim 32, wherein said method further comprises:
- determining filter coefficients for said plurality of sets of filter coefficients according to said image scaling factor L, said determining of filter coefficients performed before said selecting said input pixel.
34. The computer system as described in claim 33, wherein said determining filter coefficients in said method comprises:
- generating a lower resolution image by said image scaling factor L of a sample image;
- creating a plurality of window pairs for pixels in said lower resolution image comprising a lower resolution window of a first dimension equal to said input window, and a higher resolution window of a second dimension equal to said output window;
- classifying every window pair as one of said plurality of predetermined edge characteristics; and
- for every predetermined edge characteristic, calculating its associated set of predetermined filter coefficients from associated window pairs.
35. The computer system as described in claim 34, wherein said method further comprises:
- repeating said steps of generating said lower resolution image, creating said plurality of window pairs, and classifying every window pair for a plurality of sample images, said repeating performed before performing said step of calculating its associated set of predetermined filter coefficients.
36. The computer system as described in claim 32, wherein said generating said output window in said method comprises:
- generating an output pixel in said output window by computing the dot product of a first vector of pixels associated with said input window and a corresponding filter coefficient vector associated with said corresponding set of filter coefficients.
37. The computer system as described in claim 32, wherein said steps of selecting said input pixel, determining said input edge characteristic, generating said output window, and generating said output image in said method are performed using integer arithmetic.
38. The computer system as described in claim 32, wherein said determining said input edge characteristic in said method further comprises:
- determining an edge angle metric;
- determining an edge sharpness metric;
- determining said input edge characteristic based on said edge sharpness metric and said edge angle metric.
39. The computer system as described in claim 38, wherein said method further comprises:
- determining edge angle coefficients, said determining edge angle coefficients performed before said step of determining said edge angle metric, wherein said edge sharpness metric are based on said angle coefficients.
40. The computer system as described in claim 38, wherein said method further comprises:
- determining the absence of an edge when said edge sharpness metric is less than or equal to a first threshold;
- determining a smooth edge transition when said edge sharpness metric is less than or equal to a second threshold; and
- determining a sharp edge when said edge sharpness metric is greater than said second threshold.
41. The computer system as described in claim 38, wherein said determining said edge angle metric in said method further comprises:
- determining edge angle coefficients; and
- quantizing said edge angle coefficients into edge angles for determining said edge angle metric.
42. The computer system as described in claim 32, wherein said input window in said method surrounds said input pixel.
43. The computer system as described in claim 32, wherein said output window in said method is of L×L pixel dimension.
4896364 | January 23, 1990 | Lohscheller |
5131057 | July 14, 1992 | Walowit et al. |
5327257 | July 5, 1994 | Hrytzak et al. |
5351305 | September 27, 1994 | Wood et al. |
5406334 | April 11, 1995 | Kondo et al. |
5469216 | November 21, 1995 | Takahashi et al. |
5852470 | December 22, 1998 | Kondo et al. |
5917963 | June 29, 1999 | Miyake |
5953465 | September 14, 1999 | Saotome |
6058248 | May 2, 2000 | Atkins et al. |
6075926 | June 13, 2000 | Atkins et al. |
6175659 | January 16, 2001 | Huang |
6281875 | August 28, 2001 | Zhao et al. |
6384828 | May 7, 2002 | Arbeiter et al. |
6400413 | June 4, 2002 | Miyake |
6519368 | February 11, 2003 | Kondo et al. |
6714210 | March 30, 2004 | Yip et al. |
6714692 | March 30, 2004 | Kim et al. |
6717622 | April 6, 2004 | Lan |
20010035969 | November 1, 2001 | Kishimoto |
Type: Grant
Filed: Apr 23, 2002
Date of Patent: Dec 12, 2006
Patent Publication Number: 20030198399
Assignee: Hewlett-Packard Development Company, L.P. (Houston, TX)
Inventor: C. Brian Atkins (Mountain View, CA)
Primary Examiner: Bhavesh M Mehta
Assistant Examiner: Manav Seth
Application Number: 10/131,691
International Classification: G06K 9/32 (20060101); G06K 9/40 (20060101); G09G 5/00 (20060101); G09G 5/02 (20060101); H04N 5/44 (20060101); H04N 9/74 (20060101); H04N 3/323 (20060101); G06K 9/48 (20060101); H04N 1/393 (20060101); G06F 7/00 (20060101); G06F 15/00 (20060101);