APPARATUS AND METHOD FOR AUTOMATICALLY ADJUSTING SIZE OF CHARACTERS USING CAMERA

A method is provided for automatically adjusting a size of characters using a camera. The method includes receiving an image with characters; adjusting a focus of the image with characters and detecting a region and a size of characters in the image; determining whether the size of the characters in the image falls within a preset range; recognizing the characters in the image and displaying the recognition results, if the size of the characters falls within the preset range; and automatically adjusting a zoom ratio of the image and recognizing the characters in the resized image, if the size of the characters does not fall within the preset range.

Description
PRIORITY

This application is a Continuation Application of U.S. application Ser. No. 14/022,799, which was filed in the U.S. Patent and Trademark Office on Sep. 10, 2013 as a Divisional Application of U.S. application Ser. No. 12/632,101, which was filed in the U.S. Patent and Trademark Office on Dec. 7, 2009 and issued as U.S. Pat. No. 8,553,083 on Oct. 8, 2013, and claims priority under 35 U.S.C. § 119(a) to a Korean Patent Application filed in the Korean Intellectual Property Office on Dec. 5, 2008 and assigned Serial No. 10-2008-0123019, the contents of each of which are incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates generally to character recognition using a camera, and more particularly, to an apparatus and method for optimizing user views by adjusting a size of characters in an image received by means of a camera.

2. Description of the Related Art

Camera-based Optical Character Recognition (OCR) may be classified into Snapshot OCR (OCR based on a view of an image to be photographed) and Preview OCR (OCR based on a preview image or viewfinder images). In Snapshot OCR, an image is acquired according to the same method as a normal method for acquiring an image for a photograph. The image acquisition process includes, in turn, photographing desired characters by means of a camera, performing a character recognition process on the photographed characters, and outputting the recognition results to an output unit.

By contrast, in Preview OCR, images are input at a preset rate of Frames Per Second (FPS). Therefore, in Preview OCR, character recognition is possible in real time. Generally, an image acquisition process of Preview OCR includes scanning characters by a user by means of a camera, and continuously performing a recognition operation on an image received through the camera using a recognizer in the camera.

The largest difference between Snapshot OCR and Preview OCR lies in the resolution of the image to be recognized. While Preview OCR recognizes a small Quarter Video Graphics Array (QVGA; 320×240) image for real-time recognition, Snapshot OCR recognizes an image of 1 megapixel or more, and thus has a longer recognition time than Preview OCR. Due to the use of high-resolution images, Snapshot OCR is far superior to Preview OCR in character recognition capability. Recently, as the performance of cameras improves and accurate recognition becomes increasingly important, Snapshot OCR has become more widely used than Preview OCR.
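
The resolution gap between the two modes can be made concrete with a short calculation. The following Python sketch is illustrative only: it takes the 320×240 QVGA frame size from the text and uses 1,000,000 pixels as the lower bound of "1 megapixel or more"; the names `pixel_ratio` and `SNAPSHOT_MIN_PIXELS` are hypothetical and do not come from the original disclosure.

```python
# Illustrative arithmetic only: compares the pixel counts named in the text.
QVGA_WIDTH, QVGA_HEIGHT = 320, 240   # Preview OCR frame size given in the text
SNAPSHOT_MIN_PIXELS = 1_000_000      # assumed lower bound for "1 megapixel or more"


def pixel_ratio(snapshot_pixels: int = SNAPSHOT_MIN_PIXELS) -> float:
    """Return how many times more pixels a snapshot has than one QVGA frame."""
    return snapshot_pixels / (QVGA_WIDTH * QVGA_HEIGHT)


qvga_pixels = QVGA_WIDTH * QVGA_HEIGHT   # 76800 pixels per preview frame
ratio = round(pixel_ratio(), 1)          # 13.0 times more pixels, at minimum
```

Under these assumptions a snapshot carries at least about thirteen times as many pixels as a preview frame, which is consistent with the longer recognition time and the higher recognition capability described above.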

Snapshot OCR can recognize all characters in an input image, or only the characters in a particular region of the input image. Hence, a user may perform full recognition on the image displayed on a Liquid Crystal Display (LCD) screen of the camera, or perform partial recognition on a particular designated region.

FIG. 1 illustrates a conventional process of recognizing characters in an image.

In step 101, a mobile terminal receives an image with characters using a camera module. In step 103, the mobile terminal may acquire a focused image from the received image using an Automatic Focusing (Auto-Focus) technique. In step 105, the mobile terminal stores the acquired focused image in a separate storage. In step 107, the mobile terminal recognizes characters in the stored image. The mobile terminal may recognize all the characters in the image or only characters in a region of the image selected by the user. In step 109, the mobile terminal displays the recognition results on its display.

Unlike conventional name-card recognition, Snapshot OCR places no limit on the distance from which the user photographs the characters, so characters in the input image may vary in size. For example, when an image containing both very large and very small characters is resized for display on an LCD, the displayed characters also appear very large or very small, in proportion to their sizes in the actual image. The excessively large or small size of the characters displayed on the LCD may reduce recognition efficiency or cause recognition failure.

SUMMARY OF THE INVENTION

An aspect of the present invention is to address at least the above-mentioned problems and/or disadvantages and to provide at least the advantages described below. Accordingly, an aspect of the present invention provides an apparatus and method for optimizing a size of characters in an image by automatically adjusting a zoom ratio using a zoom function of a camera when the size of the characters in the image does not fall within a predetermined range.

According to one aspect of the present invention, an electronic device is provided that includes a camera, a display and a processor that is configured to control the camera to obtain an image containing characters, detect a region and a size of the characters in the image, control determining whether the size of the characters in the image falls within a recognizable character size range, and in response to detecting that the size of the characters is less than the recognizable character size range, control the camera to automatically adjust an optical zoom ratio of the camera and resize the obtained image so that the size of the characters in the image falls within the recognizable character size range, and recognize the characters in the resized image.

According to another aspect of the present invention, a method is provided that includes controlling a camera of the electronic device to obtain an image containing characters; detecting a region and a size of the characters in the image; determining whether the size of the characters in the image falls within a recognizable character size range; and in response to detecting that the size of the characters is less than the recognizable character size range, controlling the camera to automatically adjust an optical zoom ratio of the camera and resizing the obtained image so that the size of the characters in the image falls within the recognizable character size range, and recognizing the characters in the resized image.

According to a further aspect of the present invention, an electronic device is provided that includes a camera, a display and a processor configured to control to display an image obtained by the camera corresponding to an object including at least one character displayed on the display, detect an area of the object, wherein the at least one character included in the area is displayed based on a first size according to a specified distance between the camera and the object including the at least one character, in response to the specified distance being a first distance, display again the at least one character based on a second size, wherein the second size is larger than the first size, in response to the specified distance being a second distance, display again the at least one character based on a third size, wherein the third size is smaller than the first size, and display, on the display, a process result for one of the at least one character being displayed based on the second size and the at least one character being displayed based on the third size.

BRIEF DESCRIPTION OF THE DRAWINGS

The above and other aspects, features and advantages of certain embodiments of the present invention will be more apparent from the following description taken in conjunction with the accompanying drawings, in which:

FIG. 1 is a flowchart illustrating a conventional process of recognizing characters in an image;

FIG. 2 is a diagram illustrating an internal structure of a mobile terminal for recognizing characters according to an embodiment of the present invention;

FIG. 3 is a flowchart illustrating a process of recognizing characters in an image according to an embodiment of the present invention; and

FIGS. 4 and 5 are flowcharts illustrating a process of enlarging a size of characters in an image using a zoom function according to an embodiment of the present invention.

DETAILED DESCRIPTION OF EMBODIMENTS

The matters defined in the description such as a detailed construction and elements are provided to assist in a comprehensive understanding of embodiments of the invention. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the invention. Also, descriptions of well-known functions and constructions are omitted for clarity and conciseness.

Since conventional mobile terminals for recognizing characters in an image have no limitations on recognizable distances desired by users, excessively large or small characters in an input image may not be properly recognized in a character recognition process. To address these and other problems and/or disadvantages, the present invention provides an apparatus and method for receiving an image with characters having a proper size, by means of a zoom function, for precise recognition of the characters. Features of the present invention will be more apparent by describing certain embodiments of the present invention.

FIG. 2 illustrates an internal structure of a mobile terminal for recognizing characters according to an embodiment of the present invention. The mobile terminal includes a recognizer module 207, an OCR recognition engine module 209, a camera module 201, a display module 203, and a dictionary module 205.

Referring to FIG. 2, the recognizer module 207, a module for controlling the overall operation of an application called Snapshot OCR, mainly initiates and terminates an application for recognizing characters in an image, such as camera-based character recognition. In accordance with an embodiment of the present invention, the recognizer module 207 implements pre-recognition, including resizing an image received from the camera module 201 for recognition of characters in the image.

The OCR recognition engine module 209 receives the image with characters from the recognizer module 207, performs character recognition on the received image, and transmits the recognition results back to the recognizer module 207. As to the criteria for recognizing characters, the OCR recognition engine module 209 determines whether a size of the characters falls within a preset range of a recognizable character size, and transmits the determined results back to the recognizer module 207.

The camera module 201 receives an image with characters by driving and controlling a camera. In the process of receiving an image, the camera module 201 captures a clear image using an Auto-Focus function, and if the size of the characters in the image is excessively large or small, captures an image with characters having a size desired by the user using a zoom function. Auto-Focus, an important function for increasing an image recognition rate, serves to increase a resolution of an input image.

The display module 203 outputs at least one of a camera preview image, a photographed image, and character recognition results on a screen.

The dictionary module 205 provides a dictionary definition of the characters recognized by the OCR recognition engine module 209 based on a dictionary search interface function. A variety of information in the dictionary module 205 is stored in a DataBase (DB) in the mobile terminal. However, if required information is not stored in the DB in the mobile terminal, the dictionary module 205 may receive the required information from a DB in a server. A process of recognizing characters in an image using the aforesaid structure will be described as follows.

FIG. 3 illustrates a process of recognizing characters in an image according to an embodiment of the present invention.

Referring to FIG. 3, the mobile terminal receives an image with characters using a camera module, in step 301. The received image may be displayed for viewing by a user. In step 303, the mobile terminal acquires a focused image by adjusting the focus of the received image using an automatic focusing technique. In step 305, the mobile terminal detects a character region within the acquired image by means of a character size measurer, and then detects a size of the characters within the character region by detecting a line on which characters exist, in the X- and Y-axes with use of a threshold, and calculates an average size of characters on the line.
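
The size measurement of step 305 can be sketched with projection profiles. The following Python example is a minimal sketch under stated assumptions, not the measurer of the embodiment: it assumes a binarized image given as rows of 0 (background) and 1 (ink), treats every run of rows whose ink count exceeds the threshold as one text line, and treats every run of columns within that line as one character. The names `runs_above` and `average_char_size` are hypothetical.

```python
from typing import List, Tuple

Image = List[List[int]]  # assumed input: 1 = ink pixel, 0 = background


def runs_above(profile: List[int], threshold: int) -> List[Tuple[int, int]]:
    """Return (start, end) pairs, end exclusive, where profile > threshold."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value > threshold and start is None:
            start = i                      # a run begins
        elif value <= threshold and start is not None:
            runs.append((start, i))        # the run ends before index i
            start = None
    if start is not None:
        runs.append((start, len(profile)))
    return runs


def average_char_size(image: Image, threshold: int = 0) -> Tuple[float, float]:
    """Return (average width, average height) of characters, or (0.0, 0.0)."""
    row_profile = [sum(row) for row in image]            # projection on the Y-axis
    widths, heights = [], []
    for top, bottom in runs_above(row_profile, threshold):
        line = image[top:bottom]                         # one detected text line
        col_profile = [sum(col) for col in zip(*line)]   # projection on the X-axis
        for left, right in runs_above(col_profile, threshold):
            widths.append(right - left)
            heights.append(bottom - top)                 # line height used as height
    if not widths:
        return 0.0, 0.0
    return sum(widths) / len(widths), sum(heights) / len(heights)


sample = [
    [0, 0, 0, 0, 0, 0, 0],
    [1, 1, 0, 1, 1, 1, 0],
    [1, 1, 0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
size = average_char_size(sample)   # (2.5, 2.0): widths 2 and 3, height 2
```

In this sketch the height of each character is approximated by the height of its text line, which is a simplification chosen for brevity.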

In step 307, the mobile terminal compares the measured character size with a preset range of a recognizable character size, and determines whether the measured character size falls within the range of the recognizable character size. If the measured character size does not fall within the range, the mobile terminal automatically adjusts a zoom ratio, or a zoom magnification, so that the measured character size falls within the range, and then receives the resized image in step 309. After performing step 309, the method returns to step 303, where the mobile terminal adjusts the focus of the zoomed image, and then re-performs the character region and size detection in step 305.

If the measured character size falls within the range of the recognizable character size in step 307, the mobile terminal stores the image for accurate character recognition in step 311. The mobile terminal recognizes characters in the stored image in step 313, and displays the recognition results corresponding to the information desired by the user, on a display in step 315.
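
The comparison of step 307 and the zoom adjustment of step 309 can be illustrated as follows. The text does not state how the new zoom ratio is computed, so this Python sketch makes two assumptions: the measured character size scales linearly with the zoom ratio, and the adjustment aims at the midpoint of the recognizable range. The function name `zoom_for_range` and its parameters are hypothetical.

```python
def zoom_for_range(size: float, zoom: float, lo: float, hi: float) -> float:
    """Return a zoom ratio expected to place the character size inside [lo, hi].

    Assumes (not stated in the source) that character size is proportional
    to the zoom ratio and that the midpoint of the range is the target.
    """
    if size <= 0:
        raise ValueError("character size must be positive")
    if lo <= size <= hi:
        return zoom                    # step 307: size is already recognizable
    target = (lo + hi) / 2
    return zoom * target / size        # step 309: enlarge or reduce the image


enlarged = zoom_for_range(size=10, zoom=1.0, lo=20, hi=40)   # 3.0
reduced = zoom_for_range(size=60, zoom=2.0, lo=20, hi=40)    # 1.0
```

After the zoom ratio changes, the flow of FIG. 3 returns to step 303, so the size is measured again on the refocused image rather than trusted from this estimate.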

In FIG. 3, when the measured character size does not fall within the range of the recognizable character size, the character size may be excessively large or excessively small. When the character size is excessively large, the mobile terminal may simply decrease the zoom ratio and receive the resized image. However, if the character size is excessively small and the zoom ratio has already been increased to the maximum zoom ratio, the increased character size may still not fall within the recognizable character size range. In this case, characters in the image may be recognized as follows.

FIGS. 4 and 5 illustrate processes of enlarging a size of characters in an image using a zoom function according to an embodiment of the present invention. As an example, FIG. 4 illustrates a method for a mobile terminal that supports only an optical zoom function, while FIG. 5 illustrates a method for a mobile terminal supporting both an optical zoom function and digital image zoom function. In both FIGS. 4 and 5, the mobile terminal first uses an optical zoom function, which does not influence the image quality. In FIG. 5, the mobile terminal uses a digital image zoom function, when the mobile terminal cannot recognize characters only with the optical zoom function.

Referring to FIG. 4, the mobile terminal receives an image with characters using a camera module in step 401. The received image may be displayed for viewing by a user. In step 403, the mobile terminal acquires a focused image by adjusting the focus of the received image using the automatic focusing technique. In step 405, the mobile terminal detects a character region from the acquired image by means of a character size measurer, and then detects a size of the characters within the character region.

In step 407, the mobile terminal determines whether the detected character size is greater than a predetermined minimum size of a distinguishable character. If the measured character size is greater than the minimum size, the mobile terminal stores the image for precise character recognition in step 409, recognizes characters in the stored image in step 411, and displays the recognition results corresponding to the information desired by the user, on a display in step 413.

However, if the measured character size is less than the size of the minimum distinguishable character in step 407, the mobile terminal automatically adjusts a zoom ratio in step 415. In step 417, the mobile terminal determines whether the adjusted zoom ratio is the maximum zoom ratio. If the adjusted zoom ratio is not the maximum zoom ratio, the mobile terminal re-receives the image enlarged at the adjusted zoom ratio in step 419. However, if the adjusted zoom ratio is the maximum zoom ratio, the mobile terminal cannot recognize the detected characters as the mobile terminal cannot further increase the zoom ratio. Therefore, the mobile terminal displays a manual action request message on an output unit or a display for the user in step 421, informing that it is not possible to distinguish the detected characters at the current location of the user. In this case, the user may acquire a desired size of characters by moving the mobile terminal closer to the subject having the characters.
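
The flow of FIG. 4 can be sketched as a loop. The following Python example is a simulation under stated assumptions, not the control code of the embodiment: `FakeCamera` is a hypothetical stand-in whose character size grows linearly with the zoom ratio, the zoom ratio is raised in fixed increments, and the returned strings merely label the two outcomes.

```python
from dataclasses import dataclass


@dataclass
class FakeCamera:
    """Hypothetical camera model: character size is proportional to zoom."""
    base_char_size: float   # character size in pixels at zoom ratio 1.0
    max_zoom: float         # maximum optical zoom ratio
    zoom: float = 1.0

    def measure_char_size(self) -> float:
        return self.base_char_size * self.zoom


def recognize_with_optical_zoom(camera: FakeCamera, min_size: float,
                                step: float = 0.5) -> str:
    """Follow the FIG. 4 flow; return 'recognized' or 'move closer'."""
    while True:
        size = camera.measure_char_size()                        # steps 403-405
        if size > min_size:                                      # step 407
            return "recognized"                                  # steps 409-413
        camera.zoom = min(camera.zoom + step, camera.max_zoom)   # step 415
        if camera.zoom >= camera.max_zoom:                       # step 417
            return "move closer"                                 # step 421
        # step 419: the enlarged image is received again and the loop repeats


near = FakeCamera(base_char_size=10, max_zoom=4.0)
far = FakeCamera(base_char_size=2, max_zoom=2.0)
near_result = recognize_with_optical_zoom(near, min_size=25)   # 'recognized'
far_result = recognize_with_optical_zoom(far, min_size=25)     # 'move closer'
```

As in the description, reaching the maximum zoom ratio in step 417 leads directly to the manual action request of step 421.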

Referring to FIG. 5, the mobile terminal receives an image with characters using a camera module in step 501. The received image may be displayed for viewing by a user. In step 503, the mobile terminal acquires a focused image by adjusting the focus of the received image using the automatic focusing technique. In step 505, the mobile terminal detects a character region from the acquired image by means of a character size measurer, and then detects a size of the characters.

In step 507, the mobile terminal determines whether the detected character size is greater than a predetermined minimum size of a distinguishable character. If the measured character size is greater than the minimum size, the mobile terminal stores the image for precise character recognition in step 509. The mobile terminal recognizes characters in the stored image in step 511, and displays the recognition results corresponding to the information desired by the user, on its display in step 513.

If the measured character size is less than the size of the minimum distinguishable character in step 507, the mobile terminal automatically adjusts a zoom ratio in step 515. In step 517, the mobile terminal determines whether the adjusted zoom ratio is the maximum zoom ratio. If the adjusted zoom ratio is not the maximum zoom ratio, the mobile terminal re-receives the image enlarged at the adjusted zoom ratio in step 519. However, if the adjusted zoom ratio is the maximum zoom ratio, the mobile terminal provides the user with a message indicating that the character size is too small for recognition, and stores the image in step 521. This message is displayed on an output unit or display of the mobile terminal. In step 523, the mobile terminal enlarges a region corresponding to characters in the stored image using a digital image zoom function. Thereafter, the mobile terminal recognizes characters in the enlarged image using the image zoom in step 509, and displays the recognition results in step 513.
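
The digital image zoom of step 523 can be illustrated by cropping the character region and enlarging it in software. The text does not name an interpolation method, so the following Python sketch assumes nearest-neighbour replication with an integer factor; the function name `digital_zoom` and the region coordinates are hypothetical.

```python
from typing import List

Image = List[List[int]]  # assumed input: rows of pixel values


def digital_zoom(image: Image, top: int, left: int, bottom: int, right: int,
                 factor: int) -> Image:
    """Crop rows top..bottom and columns left..right (end exclusive) and
    enlarge the crop by an integer factor using nearest-neighbour replication."""
    if factor < 1:
        raise ValueError("factor must be >= 1")
    zoomed: Image = []
    for row in image[top:bottom]:
        # repeat every pixel `factor` times horizontally
        wide = [pixel for pixel in row[left:right] for _ in range(factor)]
        # repeat the widened row `factor` times vertically
        zoomed.extend(list(wide) for _ in range(factor))
    return zoomed


stored = [
    [0, 0, 0],
    [0, 1, 2],
    [0, 3, 4],
]
enlarged_region = digital_zoom(stored, top=1, left=1, bottom=3, right=3, factor=2)
# enlarged_region == [[1, 1, 2, 2], [1, 1, 2, 2], [3, 3, 4, 4], [3, 3, 4, 4]]
```

Unlike the optical zoom used first in FIGS. 4 and 5, this enlargement adds no new detail to the image, which is consistent with its use only after the maximum optical zoom ratio has been reached.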

As is apparent from the foregoing description, embodiments of the present invention can optimize the size of characters in an image so that the user may easily carry out an operation of distinguishing and recognizing characters in the image on the LCD, thereby increasing the character recognition rate.

While an apparatus and method for adjusting a size of characters using a camera according to the present invention have been shown and described with reference to certain embodiments of the present invention, it will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims and their equivalents.

Claims

1. An electronic device comprising:

a camera;
a display; and
a processor configured to: control the camera to obtain an image containing characters, detect a region and a size of the characters in the image, control to determine whether the size of the characters in the image falls within a recognizable character size range, and in response to detecting that the size of the characters is less than the recognizable character size range, control the camera to automatically adjust an optical zoom ratio of the camera and resize the obtained image so that the size of the characters in the image falls within the recognizable character size range, and recognize the characters in the resized image.

2. The electronic device of claim 1, wherein the processor is further configured to control the camera to adjust a focus of the camera using an automatic focusing technique.

3. The electronic device of claim 1, wherein the processor is further configured to:

detect a line on which the characters in the image exist, wherein the line is on one of an X-axis and a Y-axis; and
wherein the region and the size of the characters is detected by calculating a height and a length of the characters on the detected line.

4. The electronic device of claim 1, wherein the processor is further configured to recognize the characters in the image and control the display to display recognition results, in response to detecting that the size of the characters is greater than the recognizable character size range.

5. The electronic device of claim 1, wherein the character recognition is performed on at least two characters in an enlarged region formed by the resizing of the obtained image.

6. The electronic device of claim 1, wherein the processor is further configured to generate a notification informing a user that the size of the characters is less than a lower limit of the recognizable character size range, in response to detecting that the size of the characters is less than a lower limit of the recognizable character size range, and if a zoom ratio needed to compensate the image is greater than a maximum zoom ratio.

7. The electronic device of claim 6, wherein the notification comprises a message indicating that the size of the characters is less than the lower limit of the recognizable character size range.

8. A method for controlling an electronic device, the method comprising:

controlling a camera of the electronic device to obtain an image containing characters;
detecting a region and a size of the characters in the image;
determining whether the size of the characters in the image falls within a recognizable character size range; and
in response to detecting that the size of the characters is less than the recognizable character size range, controlling the camera to automatically adjust an optical zoom ratio of the camera and resizing the obtained image so that the size of the characters in the image falls within the recognizable character size range, and recognizing the characters in the resized image.

9. The method of claim 8, further comprising controlling the camera to adjust a focus of the camera using an automatic focusing technique.

10. The method of claim 8, further comprising:

detecting a line on which the characters in the image exist, wherein the line is on one of an X-axis and a Y-axis; and
wherein the region and the size of the characters is detected by calculating a height and a length of the characters on the detected line.

11. The method of claim 8, further comprising recognizing the characters in the image and controlling a display of the electronic device to display recognition results, in response to detecting that the size of the characters is greater than the recognizable character size range.

12. The method of claim 8, wherein the character recognition is performed on at least two characters in an enlarged region formed by the resizing of the obtained image.

13. The method of claim 8, further comprising generating a notification informing a user that the size of the characters is less than a lower limit of the recognizable character size range, in response to detecting that the size of the characters is less than a lower limit of the recognizable character size range, and if a zoom ratio needed to compensate the image is greater than a maximum zoom ratio.

14. The method of claim 13, wherein the notification comprises a message indicating that the size of the characters is less than the lower limit of the recognizable character size range.

15. An electronic device comprising:

a camera;
a display; and
a processor configured to control to:
display an image, obtained by the camera, corresponding to an object including at least one character on the display, detect an area of the object, wherein the at least one character included in the area is displayed based on a first size according to a specified distance between the camera and the object including the at least one character, in response to the specified distance being a first distance, display again the at least one character based on a second size, wherein the second size is larger than the first size, in response to the specified distance being a second distance, display again the at least one character based on a third size, wherein the third size is smaller than the first size, and display, on the display, a process result for one of the at least one character being displayed based on the second size and the at least one character being displayed based on the third size.

16. The electronic device of claim 15, wherein the processor is further configured to control to receive a user input for storing at least one of the at least one character being displayed based on the second size and the at least one character being displayed based on the third size.

17. The electronic device of claim 15, wherein the at least one character includes a plurality of characters.

18. The electronic device of claim 17, wherein the processor is further configured to change each of the plurality of characters to one of the second size and the third size.

Patent History
Publication number: 20180367737
Type: Application
Filed: Aug 28, 2018
Publication Date: Dec 20, 2018
Inventors: Hyun-Soo Kim (Gyeonggi-do), Sang-Ho Kim (Gyeonggi-do), Seong-Taek Hwang (Gyeonggi-do), Sang-Wook Oh (Gyeonggi-do), Sung-Cheol Kim (Gyeonggi-do)
Application Number: 16/115,042
Classifications
International Classification: H04N 5/232 (20060101); G09G 5/00 (20060101); G09G 5/22 (20060101); G06K 9/20 (20060101);