Digital image processing method having an exposure correction based on recognition of areas corresponding to the skin of the photographed subject
A digital image processing method includes extracting chromatic information of an image taken by an image taking device and related to a human subject; detecting visually interesting regions; and exposure correcting of the taken image by normalizing a grey scale of the taken image based on the visually interesting regions. Advantageously, the method includes recognizing areas corresponding to the skin of the subject, these areas being used as the visually interesting regions for the exposure correction step.
Latest STMicroelectronics S.A. Patents:
- Non-volatile memory cell with single poly, floating gate extending over two wells
- Double-channel HEMT device and manufacturing method thereof
- MOSFET device with shielding region and manufacturing method thereof
- Driver circuit for phase-change memory cells and method of driving phase-change memory cells
- Pulse width check circuit for laser diode pulse generator
1. Field of the Invention
The present invention relates to a digital image processing method. The invention relates particularly, but not exclusively, to an image processing method of human subjects being photographed by portable image taking devices, particularly of backlit subjects and the following description is made with reference to this field of application for convenience of illustration only.
2. Description of the Related Art
As is well known, one of the main problems limiting photographic image quality involves the generation of sub-optimal photographs due to the wrong exposure to light of the photographed subject.
This problem is particularly suffered in portable devices such as mobile phones, wherein several factors concur in obtaining photographs that are wrongly exposed: the smallness of the available optical device, the lack of a flash device and the like. Moreover, the portable device nature and the traditional use of the photographs produced therefrom, particularly linked to the so-called multimedia messaging services or MMS, cause the acquisition of photographs of the type shown in
Although it is impossible to provide a precise definition of a correct exposure, since the exposure depends on the photographed subject as well as on the personal taste of the person looking at the photograph, it is however possible to state that, for “normal” subjects (and thus not considering extreme cases, like a snow-covered landscape whose correct acquisition would involve an intentional photograph overexposure), a correct exposure is obtained when the main features of the photographic image are reproduced by using an intermediate grey level.
In the image processing field several techniques for improving the tone quality of photographic images are well known, such as histogram equalization, grey-level slicing, and histogram stretching.
Although advantageous under many aspects, these prior art techniques have several drawbacks mainly linked to the fact of being independent from the visual content of the photographed images.
The article entitled “Automated Global Enhancement of Digitized Photographs” by Bhukhanwale et al., published on the IEEE Transaction on Consumer Electronics, vol. 40, no. 1, 1994, which is hereby incorporated by reference in its entirety, describes instead an algorithm being capable to identify visually important regions in a photographic image, by adjusting the image exposure so that these regions occupy intermediate tone levels.
Moreover, the European patent application no. EP 01830803.1 filed in the name of STMicroelectronics, the assignee of the present application, which is hereby incorporated by reference in its entirety, describes an algorithm being similarly capable to identify visually important regions in a photographic image in order to replace them at intermediate tone levels. This algorithm directly processes images of the Bayer Pattern type and simplifies the statistical measures used to detect regions in the image having a high information content, i.e., visually important regions.
The algorithms provided in this document directly operate on the image in the Bayer Pattern format and they comprise the following steps:
-
- extraction of the Bayer Pattern green plane or channel G: this plane provides a good approximation of the luminance Y.
- visual analysis: once the channel G has been extracted, the visually interesting regions are identified on this channel. For this purpose, the green plane is split into N blocks having the same size and the following statistical values are calculated for each block:
- focus: it characterizes the block sharpness and it is used for identifying the regions comprising high-frequency components, corresponding to details of the photographed image;
- contrast: it is related to the image tone range—the higher the contrast, the higher the insulation of the so-called clusters of points in the block, i.e., the higher the block visual impact.
- extraction of the Bayer Pattern green plane or channel G: this plane provides a good approximation of the luminance Y.
In order to obtain important visual features, independently from the lighting conditions of the photographed image, the visual analysis is performed on an image having an intermediate luminosity produced by making a temporary correction only based on the average value of the channel G calculated on the whole plane. The algorithms further perform exposure adjustment: once the visually interesting regions have been detected, the exposure adjustment is performed by using the average grey levels of these regions as reference values. In greater detail, the photographed image is changed so to bring the average value of these regions to a target value T by changing all the pixels belonging to the Bayer Pattern. This target value T should be a value ranging around 128 and it should take into consideration a possible correction range performed after the color reconstruction of the corrected Bayer Pattern. This means that, in certain cases, the target value T could be substantially lower than 128.
To this aim, a simulated response curve of a digital image taking device or camera is used, schematically shown in
This curve gives an evaluation of how the light values picked up by the camera are turned into pixel values, i.e., it represents the function:
ƒ(q)=I (1)
q being the light amount and I the final pixel value.
This simulated response function (1) of a camera can be expressed in a parametric way:
A and C being the control parameters of the curve shape and the value q being expressed in base 2 logarithmic units (also known with the name “stops”. It is possible to evaluate these control parameters A and C by using the information comprised in the article by Mann et al. entitled “Comparametric Equations with Practical Applications in Quantigraphic Image Processing”, IEEE Transactions on Image Processing, Vol. 9, no. 8, 2000, which is hereby incorporated by reference in its entirety.
It is also possible to obtain experimentally the values of these parameters A and C or to set them in order to realize a particular final effect (for example, a more or less marked improvement of the contrast). In particular,
By using this simulated response curve f and an average grey level avg for the visually important regions, the distance Δ of an ideal exposure situation is expressed as:
Δ=ƒ−1(128)−ƒ−1(avg) (3)
and the grey value I(x, y) of a pixel with position (x, y) is thus changed in:
I′(x,y)=ƒ(ƒ−1(I(x,y))+Δ) (4)
It is worth noting that all the grey values of the pixels are corrected.
In particular, the above-mentioned changes are substantially a look-up table (LUT) transformation (i.e., they can be put in a table. in order to be then referred to) and
It is worth noting that the distance or offset of the value 128 is 1.24 for f1 and 0.62 for f2 respectively (starting from a same input value equal to 72).
From the
Although advantageous under several aspects, these prior art techniques are not very effective in the case of portable devices like mobile phones for which the photographic images are often backlit and they are mainly focused on human figures, when the user uses the image transmission for videophony, as shown in
One embodiment of the present invention provides an image processing method having such features as to overcome the limits still affecting prior art techniques.
One embodiment of the present invention detects the features in the photograph of the skin of the subject being photographed in order to select and find convenient interesting regions on whose base an exposure adjustment/correction is applied.
One embodiment of the present invention is directed to a digital image processing method that includes: extracting chromatic information of an image taken by an image taking device and related to a human subject; detecting visually interesting regions in the taken image by recognizing areas corresponding to skin of the subject, wherein the recognized areas are the visually interesting regions; and correcting exposure of the taken image by normalizing a grey scale of the taken image based on the visually interesting regions.
The features and advantages of the method according to the invention will be apparent from the following description of an embodiment thereof given by way of non-limiting example with reference to the attached drawings.
BRIEF DESCRIPTION OF THE DRAWINGSIn the drawings:
An image processing method according to one embodiment of the invention performs an exposure correction of a digital photographed image taken by an image taking device on the basis of a recognition algorithm of the skin of the photographed subject, thus improving the final photographic image quality, in a decisive way in the case of backlit subjects as in
In particular, the method comprises the following steps:
-
- 1) a first extraction step of chromatic information from the photographic image;
- 2) a second visual analysis step using a recognition method of the areas corresponding to the skin of the subject photographed in the photographic image; and
- 3) a third exposure adjustment step of the obtained photographic image.
1) Extraction Step
As has been seen with reference to known image processing methods, the method provides the extraction of the green channel G of the image taken when the images are in the Bayer format.
On the contrary, it provides the extraction of the luminance channel Y for images of the YcbCr type obtained from RGB images.
2) Visual Analysis Step
This analysis can be performed on:
-
- 1. images in the RGB format;
- 2. images in the Bayer Pattern format generating, from an initial image, a conveniently sub-sampled RGB copy.
In particular, by using this skin recognition method, a plurality of visually interesting regions corresponding to the skin of the photographed subject are detected.
The chromatic information obtained during the first extraction step is thus used.
In particular, using Bayer data, it is possible to operate on three color planes and on sub-samples having a size corresponding to a quarter of the initial data, as schematically shown in
3) Third Exposure Adjustment Step
This adjustment can be performed in two ways:
-
- 1. correction of images in the RGB format;
- 2. correction of images in the Bayer Pattern format before a following color interpolation algorithm.
In the case of the correction of images in the RGB format, once the visually important pixels have been detected as above-mentioned (i.e., the pixels belonging to the area corresponding to the skin of the photographed subject), a known exposure correction algorithm is used, wherein the average grey level of the known pixel clusters is considered as belonging to the skin of the photographed subject.
In other words, the pixels belonging to the subject skin are placed at the intermediate level of the image grey scale and all the remaining image pixels are placed once again based on this average level.
In particular, once the luminance value has been corrected from an original value Y to a revised value Y′ that reflects the average grey level of the know pixel clusters corresponding to the skin of the subject, according to the above-mentioned steps (2), (3) and thus using the information comprised in the article by Sakaue et al. entitled “Adaptive Gamma Processing of the Video Cameras for the Expansion of the Dynamic Range”, IEEE Transaction on Consumer Electronics, Vol. 41, n. 3, August 1995, which is hereby incorporated by reference in its entirety, starting from a curve of the type shown in
R, G, B being the color values of the input pixels.
In the case of the correction of images in the Bayer format the formulas (5), (6) and (7) cannot be used and the output product will be obtained by simply applying the relation (4) to all the pixels of the pattern.
The recognition method of the areas corresponding to the skin of the subject photographed in the photographic image will be now described in greater detail.
Several recognition methods of the color of the skin of the photographed subject are known, substantially based on the application of a threshold to a color probability measure for the skin.
In fact, the colors of the human skin belong to a particular color category, different from the colors of most natural objects. In particular, in the article by Zarti et al. entitled “Comparison of Five Color Models in Skin Pixel Classification”, Proc. Of Int. Workshop on Recognition, Analysis and Tracking of Faces and Gestures in Real-Time Systems, IEEE Computer Society, Corfu, Greece, pages 58-63, 1999, which is hereby incorporated by reference in its entirety, it has been shown that the colors of the human skin are clustered and that the skin changes between each subject are substantially due to a different intensity and they can thus be reduced by using only the chrominance component of the subject image.
Moreover, in the article by Yang et al. entitled “Skin-Color Modeling and Adaptation”, Technical Report CMU-CS-97-146, School of Computer Science, Carnegie Mellon University, 1997, which is hereby incorporated by reference in its entirety, it has been shown that the human skin color slicing can be represented by a bidimensional Gaussian function in the chrominance plane. The center of this slicing is determined by the mean vector {right arrow over (μ)} and the amplitude of the bell thereof is determined by the covariance matrix Σ, these two values being evaluated starting from a convenient group of test data.
The conditional probability p({right arrow over (x)}|s) of a pixel block to belong to a human skin color class starting from the chrominance vector thereof {right arrow over (x)} is thus given by:
d({right arrow over (x)}) being the so-called Mahalonobis distance of the chrominance vector {right arrow over (x)} of the mean vector {right arrow over (μ)}, defined as:
[d({right arrow over (x)})]2=({right arrow over (x)}−{right arrow over (μ)})′Σ−1({right arrow over (x)}−{right arrow over (μ)}) (9)
In other words, the value of the Mahalonobis distance d({right arrow over (x)}) of a pixel block with chrominance vector {right arrow over (x)} determines the probability of this block to belong to a predetermined human skin color class. The higher the Mahalonobis distance d({right arrow over (x)}) is, the lower the probability of the block belonging to this human skin color class.
Given the considerable amount of color types, of distance measures and of bidimensional slicings, a considerable variety of image processing algorithms can be considered. Moreover, the lighting conditions and the color models can change according to the image taking modes.
Advantageously, the method comprises a step of recognizing a portion of the photographic image corresponding to the subject skin.
In a first embodiment, this recognition step is substantially based on a probabilistic function.
In particular, for each pixel of an image taken in the YcrCb format a probabilistic slicing is prepared to evaluate if this pixel must be classified as belonging to the subject skin. Based on this slicing, a new image is thus processed with a normalized grey scale, wherein the subject skin is highlighted as indicated in
Based on this first embodiment of the recognition step of the photographed subject skin, the image pixels with higher grey values are considered as belonging to the skin of the photographed subject.
The areas being detected by using this first embodiment of the recognition step of the skin of the photographed subject on an image shown in
In a second embodiment, the recognition step of the image areas corresponding to the skin of the photographed subject is substantially based on a single threshold area and it provides an RGB-format image processing in order to produce a chrominance slicing histogram starting from normalized channels r and g as described in the article by Soriano et al. entitled “Skin Color Modeling Under Varying Illumination Conditions Using the Skin Locus for Selecting Training Pixels”, Real-time Image Sequence Analysis (RISA2000, August 31-September 1, Finland), which is hereby incorporated by reference in its entirety. In particular, the normalized channels r and g are defined as:
r=R/(R+G+B) (10)
g=G/(R+G+B) (11)
The resulting bidimensional histogram shows the chrominance slicing in the image and the areas having the right human skin chrominance slicing are thus detected by applying a single threshold area. In particular, the pixels of the processed image belonging to the threshold area are classified as belonging to the subject skin.
Similarly,
In particular, on a taken image (
An exposure correction step can thus be performed by using the areas detected as belonging to the skin of the photographed subject in order to normalize the grey levels obtaining a final processed image (
Advantageously, the regions being detected in the recognition step as belonging to the subject's skin are used as visually important images for the following exposure adjustment step of the photographic image.
In a preferred embodiment of the image processing method according to the invention, the recognition step of the areas belonging to the photographed subject's skin processes a 8-bit image of the Bayer type, constructing a color image of sub-samples with size corresponding to a quarter of the initial data, obtained as previously described and schematically shown in
Starting from this color image of sub-samples, a recognition step of the areas belonging to the photographed subject's skin is performed using a chrominance slicing histogram according to the first probabilistic embodiment or the normalized channels r and g according to the second threshold embodiment.
However, in this second case, the normalized channels r and g are defined as:
r=R/(R+G*+B) (12)
g=G*/(R+G*+B) (13)
being
G*=(G1+G2)/2 (14)
The resulting bidimensional histogram shows the chrominance slicing of the processed image, therefore the areas corresponding to the photographed subject skin, as schematically shown in FIGS. 8A-C, showing in series a taken image of the Bayer type (
The method finally comprises a reconstruction step of the color of the image taken according to the relations (5) to (7), already shown with reference to the prior art, R, G, B and R′, G′, B′ being the red, green and blue values of the images being respectively taken and processed.
The step sequence being described is suitable for a simple change allowing a correction to be performed directly on images in the Bayer Pattern format in favor of a further simplification from the calculation point of view. In fact, once the image for the skin detection according to the diagram of
It is however worth noting that the color reconstruction formulas described in the equations (5), (6), (7) cannot be used in this case and the output product of the corrected Bayer Pattern will be obtained by simply applying the relation (4) to all the model pixels.
In other words, the grey value I(x, y) of a pixel with position (x, y) is modified in:
I′(x,y)=ƒ(ƒ−1(I(x,y))+Δ), (15)
where Δ is the distance of the ideal exposure situation as expressed in relation (3). The image processing of a backlit subject being performed by using a CMOS-VGA sensor and an evaluation kit on the Windows® platform is shown in
Similarly,
All of the above U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in the Application Data Sheetare incorporated herein by reference, in their entirety.
From the foregoing it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.
Claims
1. A digital image processing method, comprising the steps of:
- extracting chromatic information of an image taken by an image taking device and related to a human subject;
- detecting visually interesting regions in the taken image by recognizing areas corresponding to skin of the subject, wherein the recognized areas are the visually interesting regions; and
- correcting an exposure of said taken image by normalizing a grey scale of said taken image based on said visually interesting regions.
2. A digital image processing method according to claim 1, wherein said recognizing step comprises constructing a probabilistic slicing of said image taken in a YcrCb format to evaluate if pixels of said image must be classified as belonging to said areas corresponding to the skin of said subject.
3. A digital image processing method according to claim 2, wherein pixels with higher grey values are classified as belonging to said areas corresponding to the skin of said photographed subject.
4. A digital image processing method according to claim 1, wherein said recognizing step comprises applying a threshold area of said image taken in an RGB format to evaluate if pixels of said image must be classified as belonging to said areas corresponding to the skin of said subject.
5. A digital image processing method according to claim 4, wherein applying said threshold area comprises constructing a chrominance slicing histogram of said taken image.
6. A digital image processing method according to claim 5, wherein constructing said chrominance slicing histogram uses normalized channels r and g of the type: r=R/(R+G+B) g=G/(R+G+B) R, G and B being red, green and blue values of each pixel of said taken RGB image.
7. A digital image processing method according to claim 5, wherein said recognizing step uses said chrominance slicing histogram to detect said areas corresponding to the skin of said subject formed by the pixels of said taken image belonging in said chrominance slicing histogram to said threshold area.
8. A digital image processing method according to claim 5, wherein said taken image is of a Bayer type, the method further comprising:
- sub-sampling the image according to G*=(G1+G2)/2, wherein G1 and G2 are first and second green channels of the image, and said step of constructing said chrominance slicing histogram uses normalized channels r and g of the type:
- r=R/(R+G*+B) g=G*/(R+G*+B),
- R, G and B being red, green and blue values of each pixel of said taken RGB image.
9. A digital image processing method according to claim 1, wherein said taken image is of a Bayer type.
10. A digital image processing method according to claim 9, wherein said exposure correction of said taken image uses:
- a simulated response function of a type:
- f ( q ) = 255 ( 1 + ⅇ - ( Aq ) ) C
- A and C being predetermined control parameters and q being a light quantity value expressed in base 2 logarithmic units; and
- a grey average level (avg) calculated on said visually interesting regions, in order to calculate a distance Δ of an ideal exposure situation using:
- Δ=ƒ−1(128)−ƒ−1(avg)
- and in order to change a luminance value Y(x, y) of a pixel with position (x, y) in:
- Y′(x,y)=ƒ(ƒ−1(Y(x,y))+Δ).
11. A digital image processing method according to claim 1, further comprising a final color reconstruction step.
12. A digital image processing method according to claim 11, said image being taken in an RGB format, wherein said final color reconstruction step comprises the relations: R ′ = 0.5 · ( Y ′ Y · ( R + Y ) + R - Y ) G ′ = 0.5 · ( Y ′ Y · ( G + Y ) + G - Y ) B ′ = 0.5 · ( Y ′ Y · ( B + Y ) + B - Y ) R, G, B, and Y being respective red, green, blue, and luminance values of said taken image, Y′ being a desired luminance value, and R′, G′, and B′ being respective red, green, and blue values of the image after said final color reconstruction step.
13. A digital image processing method according to claim 11, said image being taken in a Bayer Pattern format, wherein said final color reconstruction step provides that a grey value I(x, y) of a pixel with position (x, y) is changed in: I′(x,y)=ƒ(ƒ−1(I(x,y))+Δ), where Δ is a distance of an ideal exposure situation.
14. A digital image processing method, comprising:
- extracting chromatic information from an image of a human subject;
- detecting, based on the extracted chromatic information, which areas of the image correspond to skin of the subject; and
- normalizing grey scale values of the image based on the areas of the image that are detected as corresponding to the skin of the subject.
15. The method of claim 14, wherein the detecting step comprises constructing a probabilistic slicing of the image taken in a YcrCb format to evaluate if pixels of the image belong to the areas corresponding to the skin of the subject.
16. The method of claim 14, wherein the detecting step comprises applying a threshold area of the image taken in an RGB format to evaluate if pixels of the image belong to the areas corresponding to the skin of the subject.
17. The method of claim 16, wherein applying the threshold area comprises constructing a chrominance slicing histogram of the image and using the chrominance slicing histogram to detect the areas corresponding to the skin of the subject formed by the pixels of the image belonging in the chrominance slicing histogram to the threshold area.
18. The method of claim 16, wherein applying the threshold area comprises constructing a chrominance slicing histogram of the image using normalized channels r and g of the type: r=R/(R+G+B) g=G/(R+G+B) R, G and B being red, green and blue values of each pixel of the image.
19. The method of claim 16, wherein applying the threshold area comprises constructing a chrominance slicing histogram of the image and the image is of a Bayer type, the method further comprising:
- sub-sampling the image according to G*=(G1+G2)/2, wherein G1 and G2 are first and second green channels of the image, and said step of constructing said chrominance slicing histogram uses normalized channels r and g of the type:
- r=R/(R+G*+B) g=G*/(R+G*+B),
- R, G and B being red, green and blue values of each pixel of said taken RGB image.
20. The method of claim 14, wherein the normalizing step performs exposure correction of the image that includes:
- using a simulated response function of a type:
- f ( q ) = 255 ( 1 + ⅇ - ( Aq ) ) C
- A and C being predetermined control parameters and q being a light quantity value expressed in base 2 logarithmic units; and
- calculating a grey average level (avg) of the areas corresponding to the skin;
- calculating a distance Δ of an ideal exposure situation using:
- Δ=ƒ−1(128)−ƒ−1(avg); and
- changing a luminance value Y(x, y) of a pixel with position (x, y) in:
- Y′(x, y)=ƒ(ƒ−1(Y(x, y))+Δ).
21. The method of claim 14, further comprising a final color reconstruction step using the relations: R ′ = 0.5 · ( Y ′ Y · ( R + Y ) + R - Y ) G ′ = 0.5 · ( Y ′ Y · ( G + Y ) + G - Y ) B ′ = 0.5 · ( Y ′ Y · ( B + Y ) + B - Y ) R, G, B, and Y being respective red, green, blue, and luminance values of the image, Y′ being a desired luminance value, and R′, G′, and B′ being respective red, green, and blue values of the image after the final color reconstruction step.
22. The method of claim 14, further comprising a final color reconstruction step that changes a grey value I(x, y) of a pixel with position (x, y) using: I′(x,y)=ƒ(ƒ−1(I(x,y))+Δ),
- where Δ is a distance of an ideal exposure situation.
23. A digital image processor, comprising:
- means for extracting chromatic information from an image of a human subject;
- means for detecting, based on the extracted chromatic information, which areas of the image correspond to skin of the subject; and
- means for normalizing grey scale values of the image based on the areas of the image that are detected as corresponding to the skin of the subject.
Type: Application
Filed: May 18, 2004
Publication Date: Jan 27, 2005
Applicants: STMicroelectronics S.A. (Montrouge), STMicroelectronics S.r.I. (Agrate Brianza)
Inventors: Giuseppe Messina (Giardini Naxos), Sebastiano Battiato (Acicatena), Alfio Castorina (Linera), Laurent Plaza (Aix en Provence)
Application Number: 10/848,815