Image dictionary creating apparatus, coding apparatus, image dictionary creating method
An image processor obtains character images included in input image and the character codes for identifying the characters represented by the character images, classifies the character images included in the input image into a plurality of character image groups based on the obtained character codes, determines typical image patterns constituting the input image based on the character images classified in the character image groups, assigns indices for identifying the image patterns to the determined image patterns, and codes the occurrence position information of the character images included in the input image and the indices of the image patterns corresponding to the character images so as to be associated with each other.
Latest FUJI XEROX CO., LTD. Patents:
- System and method for event prevention and prediction
- Image processing apparatus and non-transitory computer readable medium
- PROTECTION MEMBER, REPLACEMENT COMPONENT WITH PROTECTION MEMBER, AND IMAGE FORMING APPARATUS
- PARTICLE CONVEYING DEVICE AND IMAGE FORMING APPARATUS
- TONER FOR DEVELOPING ELECTROSTATIC CHARGE IMAGE, ELECTROSTATIC CHARGE IMAGE DEVELOPER, TONER CARTRIDGE, PROCESS CARTRIDGE, IMAGE FORMING APPARATUS, AND IMAGE FORMING METHOD
1. Field of the Invention
The present invention relates to a coding apparatus that creates an image dictionary in which the image patterns constituting the input image and the identification information of the image patterns are associated with each other, and applies the created image dictionary to the coding processing.
2. Description of the Related Art
For example, it is known to provide an image recording apparatus in which image information having a first image comprising a photographic image or graphics and a second image comprising characters is inputted, the area of the second image in the image information is detected, and the area of the second image is extracted from the image information and recorded. This enables the characters within the area of the second image to be converted into character codes and recorded for use as a keyword for retrieval. It is also known to provide a character area coding method in which a font database common to the coding and decoding sides is provided and the character codes, the font type and the like are coded.
SUMMARY OF THE INVENTIONThe present invention has been made in view of the above circumstances and provides a coding apparatus that creates an image dictionary realizing high coding efficiency and applies this image dictionary for coding.
According to one aspect of the invention, there is provided with an image dictionary creating apparatus including: an information obtaining section for obtaining a result of a character recognition processing on an input image; a character classifying section for classifying character images included in the input image into a plurality of character image groups based on the result of the character recognition processing obtained by the information obtaining section; typical pattern determining section for determining typical image patterns constituting the input image based on the character images classified into the character image groups by the character classifying section; and an identification information assigning section for assigning identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section.
According to another aspect of the invention, the information obtaining section obtains character identification information for identifying a character represented by each character image as the result of the character recognition processing, and the character classifying section classifies the character images included in the input image into the plurality of character image groups based on the character identification information obtained by the information obtaining section.
According to another aspect of the invention, the character identification information includes character codes, the character classifying section classifies the character images included in the input image based on the character codes, and based on the character images classified as having the same character code, the typical pattern determining section determines image patterns corresponding to the character images.
According to another aspect of the invention, the information obtaining section obtains character area information representative of an area of each character image in the input image, character image extracting section is further provided for extracting the character images from the input image based on the character area information obtained by the information obtaining section, the character classifying section classifies the character images extracted by the character image extracting section into a plurality of character image groups, and the typical pattern determining section determines the image patterns by comparing the character images extracted by the character image extracting section with each other.
According to another aspect of the invention, the following are further provided: coincidence determining section for determining a degree of coincidence of a plurality of character images extracted by the character image extracting section by comparing the character images with each other at a plurality of relative positions; area correcting section for correcting character area information of each character image based on the degree of coincidence determined at each relative position by the coincidence determining section; and outputting section for outputting the character area information of each of the character images corresponding to the image patterns so as to be associated with the identification information of the image patterns.
According to another aspect of the invention, coincidence determining section is further provided for determining a degree of coincidence of a plurality of character images classified into the character image groups by the character classifying section by comparing the character images with each other, the typical pattern determining section determines at least one image pattern with respect to the plurality of character images classified into the same character image group based on the degree of coincidence determined by the coincidence determining section, and the identification information assigning section assigns the identification information for identifying the image pattern to at least one image pattern determined with respect to the same character image group by the typical pattern determining section.
According to another aspect of the invention, coincidence determining section is provided for determining the degree of coincidence of each area included in a plurality of character images classified into the character image groups by the character classifying section by comparing the character images with each other, and the typical pattern determining section determines a plurality of image patterns constituting the character images based on the degree of coincidence determined with respect to each area by the coincidence determining section.
According to another aspect of the invention, the information obtaining section further obtains recognition accuracy information of the character recognition processing so as to be associated with the character identification information or the character area information thereof, and the character classifying section classifies the character images included in the input image into a plurality of character image groups based on the recognition accuracy information and the corresponding character identification information or character area information.
According to another aspect of the invention, there is provided with an image dictionary creating apparatus including: a pattern storing section for storing typical image patterns constituting an input image; coincidence determining section for determining a degree of coincidence by comparing a newly inputted character image and the image patterns stored by the pattern storing section; and typical pattern determining section for storing the newly inputted character image into the pattern storing section as the image pattern according to the degree of coincidence determined by the coincidence degree determining section.
According to another aspect of the invention, the pattern storing section stores identification information of characters represented by the image patterns so as to be associated with the image patterns, weight assigning section is further provided for performing weight assignment on the degree of coincidence determined with respect to the image patterns by the coincidence degree determining section based on the identification information of a character represented by the inputted character image and the identification information of the image patterns stored in the pattern storing section, and the typical pattern determining section determines whether to permit registration of the image pattern based on the inputted character image or not based on the degree of coincidence on which weight assignment has been performed by the weight assigning section.
According to another aspect of the invention, the identification information of the characters are character codes, the weight assigning section performs weight assignment so that the degree of coincidence is higher when a character code of the inputted character image coincides with a character code of any of the image patterns than when the character codes are different from each other, and the typical pattern determining section prohibits the registration of the image pattern based on the character image when the degree of coincidence is higher than a reference, and permits the registration of the image pattern based on the character image when the degree of coincidence is equal to or less than the reference.
According to another aspect of the invention, character codes are ones determined by the character recognition processing on the input image, and based on a recognition accuracy of the character recognition processing on the inputted character image, the weight assigning section performs weight assignment such that the higher the recognition accuracy is, the higher the degree of coincidence is.
According to another aspect of the invention, there is provided with an image dictionary creating apparatus including: an information obtaining section for obtaining character images included in an input image and character identification information for identifying characters represented by the character images; character classifying section for classifying the character images included in the input image into a plurality of character image groups based on the character identification information obtained by the information obtaining section; typical pattern determining section for determining typical image patterns constituting the input image based on the character images classified into the character image groups by the character classifying section; and identification information assigning section for assigning identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section.
According to another aspect of the invention, there is provided with an image dictionary creating apparatus including: an information obtaining section for obtaining character area information representative of an area of each character image in the input image; character image extracting section for extracting the character images from the input image based on the character area information obtained by the information obtaining section; typical pattern determining section for determining typical image patterns constituting the input image based on the character images extracted by the character image extracting section; and identification information assigning section for assigning identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section.
According to another aspect of the invention, there is provided with a coding apparatus including: an information obtaining section for obtaining character images included in an input image and character identification information for identifying characters represented by the character images; character classifying section for classifying the character images included in the input image into a plurality of character image groups based on the character identification information obtained by the obtaining section; typical pattern determining section for determining typical image patterns constituting the input image based on the character images classified into the character image groups by the character classifying section; identification information assigning section for assigning identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section; and coding section for coding character area information representative of areas of the character images included in the input image and identification information of image patterns corresponding to the character images so as to be associated with each other.
According to another aspect of the invention, there is provided with a coding apparatus including: an information obtaining section for obtaining character area information representative of an area of each character image in the input image; character image extracting section for extracting the character images from the input image based on the character area information obtained by the information obtaining section; character classifying section for classifying the character images extracted by the character image extracting section into a plurality of character image groups; typical pattern determining section for determining typical image patterns constituting the input image based on the character images classified into the character image groups by the character classifying section; identification information assigning section for assigning identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section; coincidence determining section for determining a degree of coincidence of the character images by comparing the plurality of character images classified into the character image groups by the character classifying section with each other at a plurality of relative positions; area correcting section for correcting the character area information of the character images based on the degree of coincidence determined at the relative positions by the coincidence determining section; and coding section for coding at least the character area information of the character images corrected by the area correcting section and the identification information of the image patterns determined based on the character image groups into which the character images are classified.
By thus configuration, a high compression rate can be realized while character readability is maintained.
According to another aspect of the invention, there is provided with an image dictionary creating method including: character images included in an input image and character identification information for identifying characters represented by the character images are obtained, the character images included in the input image are classified into a plurality of character image groups based on the obtained character identification information, typical image patterns constituting the input image are determined based on the character images classified into the character image groups, and identification information for identifying the image patterns is assigned to the determined image patterns.
BRIEF DESCRIPTION OF THE DRAWINGSEmbodiments of the present invention will be described in detail based on the following figures, wherein:
First, to aid the understanding of the present invention, the background and outline thereof will be described.
An image processor 2 is capable of realizing a high compression rate, for example, by coding the identification information, the positions of occurrence and the like of the character images included in the input image instead of coding the respective character images.
As shown in
However, according to the coding method premised on the presence of the font database, it is necessary to provide the font database on each of the coding and decoding sides, so that a large part of the storage area compressed by the font database. Moreover, when the font database on the coding side is updated, it is necessary to update the font database on the decoding side accordingly. Moreover, support for handwritten characters and the like is insufficient such that reproducibility is reduced because handwritten characters and the like are replaced with font images and that the code amount cannot be reduced because handwritten characters are treated as non-character images.
Accordingly, in the image processor 2 of the present embodiment, as shown in
As described above, the image processor 2 is capable of realizing a high compression rate without being premised on the presence of the common database by creating an image dictionary according to the input image and transmitting and receiving it. Moreover, it is unnecessary to synchronize the font databases on the coding and decoding sides. Further, also for handwritten characters and the like, the code amount can be reduced while sufficient reproducibility is maintained. To reduce the code amount, it is desirable that the image dictionary be also coded.
As illustrated in
The next problem is the reference based on which the image patterns are extracted from the input image and registered as the image dictionary. This is a problem because the code amount of the input image differs according to the sizes and the frequencies of occurrence of the extracted image patterns. For example, as illustrated in
In the case where the image patterns are extracted in units smaller than character images, although the frequencies of occurrence of the image patterns are frequently high (for example, the vertical bar part of “1” occurs as parts of “L” and “J”), the number of image patterns to be registered in the image dictionary is large, so that the data amount of the image dictionary is large.
On the other hand, when the image patterns are extracted in units of character images, since many characters of the same language, the same font type and the same font size appear in the same document, a high frequency of occurrence can be expected although the image pattern size is large.
Moreover, when a certain degree of irreversibility is allowed with the aim of achieving a high compression rate, in the image processor on the coding side, not only the partial images the same as the image patterns but also the partial images similar to the image patterns are replaced with indices and coded. In this case, if each part of a character image is replaced with a similar image pattern, there is a possibility that readability is lost by the character image being decoded into a completely different character as a whole. However, when the image patterns are extracted in units of character images, since the overall shape of the character image is replaced with a similar image pattern (for example, a numeric character “1” and a letter “I” of the alphabet), a certain degree of readability is maintained.
Thus, in the image processor 2 of the present embodiment, the image patterns are extracted from the input image in units of character images and registered in the image dictionary.
Next, the hardware configuration of the image processor 2 will be described.
As illustrated in
The image processor 2 is, for example, a general-purpose computer in which a coding computer-readable record medium 5 (herein after referred to as a coding program 5) is installed as part of the printer driver, and obtains image data through the communication device 22 or the storage device 24, codes the obtained image data and transmits the coded image data to a printer 10. Moreover, the image processor 2 obtains the image data optically read by the scanner function of the printer 10, and codes the obtained image data.
As illustrated in
In the coding program 5, the image input portion 40 (information obtaining section) obtains the image data read by the scanner function of the printer 10 or the PDL (Page Description Language) image data obtained through the communication device 22, the storage device 24 or the like, converts the obtained image data into raster data and outputs the raster data to the image dictionary creating portion 50. Moreover, the image input portion 40 has a character recognizing portion 410 that recognizes character images from optically read image data or the like and a PDL decomposer 420 that analyzes PDL image data and generates raster data.
The character recognizing portion 410 recognizes the characters included in the inputted image data (hereinafter, referred to as input image), and outputs the character identification information of the recognized characters and the character area information of the recognized characters to the image dictionary creating portion 50 as the result of the character recognition processing. Here, the character identification information is information for identifying characters, and is, for example, versatile character codes (ASCII codes, shifted JIS codes, etc.) or a combination of a character code and a font type. The character area information is information representative of the areas of character images in an input image, and is, for example, the positions, the sizes or the ranges of the character images or a combination thereof.
The PDL decomposer 420 analyzes PDL image data and generates rasterized image data (raster data), and outputs the character identification information and the character area information of the character images in the generated image data together with the generated image data to the image dictionary creating portion 50.
The image dictionary creating portion 50 creates an image dictionary used for the coding processing of the input image based on the input image inputted from the image input portion 40, and outputs the created image dictionary and the input image to the coding portion 60. More specifically, the image dictionary creating portion 50 extracts the image patterns in units of character images from the input image based on the character identification information and the character area information inputted from the character recognizing portion 410 or the PDL decomposer 420, assigns indices to the extracted image patterns to create an image dictionary, and outputs the image dictionary to the coding portion 60.
The coding portion 60 (coding section) codes the input image based on the image dictionary inputted from the image dictionary creating portion 50, and outputs the coded input image and the image dictionary to the storage device 24 (
As shown in
The character image extracting portion 510 extracts the character images from the input image based on the character position information. That is, the character image extracting portion 510 extracts the areas represented by the character area information from the input image as character images. The extracted character images are areas determined to be character images by the character recognizing portion 410. The character images may be outputted to the image dictionary creating portion 50 in a condition of being extracted from the input image by the character recognizing portion 410 or the PDL decomposer 420.
The character classifying portion 520 classifies the character images extracted from the input image into a plurality of character image groups based on the character codes. For example, the character classifying portion 520 classifies the character images whose character codes coincide with each other, into the same character image group.
The coincidence determining portion 530 compares a plurality of character images extracted from the input image with each other and determines the degree of coincidence. Here, the degree of coincidence is information representative of the extent to which a plurality of images coincide with each other, for example, in a case where binary images are compared, the number of overlapping pixels when two character images are overlaid one on another (hereinafter, referred to as the number of coinciding pixels), a coinciding pixel rate which is the number of coinciding pixels that is normalized (for example, the number of coinciding pixels divided by the total number of pixels), or the pixel distribution (histogram) when a plurality of character images are overlaid one on another.
Moreover, the coincidence determining portion 530 compares a plurality of character images at a plurality of relative positions and determines the degree of coincidence. That is, the coincidence determining portion 530 compares a plurality of character images while shifting them from each other in order to calculate the highest degree of coincidence.
For example, the coincidence determining portion 530 calculates the coinciding pixel rate while shifting two character images (character images whose character codes coincide with each other) classified into the same character image group, and outputs, to the storage portion 500, the highest value of the coinciding pixel rate and the shifting vector at the highest value.
The dictionary determining portion 540 determines the image patterns to be registered in the image dictionary based on the character images included in each character image group. That is, the dictionary determining portion 540 determines the image patterns to be registered based on a plurality of character images whose character codes coincide with each other. For example, the dictionary determining portion 540 selects the sum coupling pattern of a plurality of character images whose character codes coincide with each other (character images having undergone position correction described later) as the image pattern to be registered. The sum coupling pattern is the configuration of a union when a plurality of images are overlaid on each other.
The position correcting portion 550 corrects the position information of the character images based on the shifting vector outputted from the coincidence determining portion 530. That is, the position correcting portion 550 corrects the position information inputted from the image input portion 40 so that the degree of coincidence of a plurality of character images whose character codes coincide with each other is the highest.
The index assigning portion 560 assigns indices for identifying the image patterns to the image patterns determined based on the input image, associates the assigned indices with the image patterns, and outputs them to the storage portion 500.
As shown in
The pattern determining portion 610 compares the image patterns registered in the image dictionary with the partial images included in the input image, and determines the image patterns corresponding to the partial images (the same or similar image patterns). More specifically, the pattern determining portion 610 overlays the partial images (ones corrected by the position correcting portion 550) extracted in units of character images from the input image with the image patterns, calculates the degree of coincidence by the same method as that used by the coincidence determining portion 530 (
When corresponding image patterns are found, the pattern determining portion 610 outputs the position information of the partial images to the position information coding portion 620 and outputs the indices of the image patterns to the index coding portion 630, and when no corresponding image patterns are found, the pattern determining portion 610 outputs the partial images to the image coding portion 640.
The position information coding portion 620 codes the position information inputted from the pattern determining portion 610 (that is, the position information of the partial images (character images) corrected by the position correcting portion 550), and outputs it to the selecting portion 660. For example, the position information coding portion 620 codes the position information by applying LZ coding, arithmetic coding or the like.
The index coding portion 630 codes the indices inputted from the pattern determining portion 610, and outputs them to the selecting portion 660. For example, the index coding portion 630 assigns, to each index, a code that is different in code length according to the frequency of occurrence of the index.
The image coding portion 640 codes the partial images inputted from the pattern determining portion 610 by applying a coding method appropriate to the images, and outputs the coded partial images to the selecting portion 660.
The dictionary coding portion 650 codes the image dictionary (image patterns and indices associated with each other) inputted from the image dictionary creating portion 50 (
When image patterns corresponding to the partial images are found by the pattern determining portion 610, the selecting portion 660 associates the coded data of the position information inputted from the position information coding portion 620 with the coded data of the indices inputted from the index coding portion 630 and outputs them to the code output portion 670, and when no image patterns corresponding to the partial images are found by the pattern determining portion 610, the selecting portion 660 outputs the coded data of the partial images coded by the image coding portion 640 to the code output portion 670.
The code output portion 670 associates the coded data inputted from the selecting portion 660 (the coded data of the position information, the indices and the partial images) with the coded data inputted from the dictionary coding portion 650 (the coded data of the image dictionary) and outputs them to the printer 10 (
Next, the general operation of the coding processing by the image processor 2 will be described.
As shown in
At step 20 (S20), the storage portion 500 of the image dictionary creating portion 50 stores the input image inputted from the image input portion 40, the character codes and the position information into the memory 204 (
The character image extracting portion 510 determines the range of the character images in the input image based on the position information stored by the storage portion 500, extracts the character images from the determined range, and stores them into the storage portion 500. The character image extraction is performed on the entire area of the input image to be coded (for example, one page or one document).
At step 30 (S30), the character classifying portion 520 classifies the character images extracted by the character image extracting portion 510, based on the character codes inputted from the character recognizing portion 410 (
At step 40 (S40), the coincidence determining portion 530, the dictionary determining portion 540 and the position correcting portion 550 cooperate to determine the image patterns registered in the image dictionary based on the character images classified based on the character classifying portion 520 and store the determined image patterns into the storage portion 500 as the image dictionary.
At step 50 (S50), the index assigning portion 560 assigns indices to the determined image patterns, associates the assigned indices with the image patterns and stores them into the storage portion 500. The assigned indices at least uniquely identify the image patterns with respect to all the input images inputted as the objects of coding.
When the image pattern determination and the index assignment are finished with respect to all the input images inputted as the objects of coding, these image patterns and indices are outputted to the coding portion 60 as the image dictionary.
At step 60 (S60), the coding portion 60 compares the image patterns registered in the image dictionary and the partial images included in the input image, and when partial images coinciding with the image patterns are present, the coding portion 60 codes the partial images after replacing them with indices and position information, and codes the partial images not coinciding with the image patterns as they are. Moreover, the coding portion 60 codes the image dictionary.
At step 70 (S70), the coding portion 60 outputs the coded data of the indices, the position information and the partial images and the coded data of the image dictionary to the printer 10 or the like.
As shown in
That is, when the pixel distribution of the character image group is Q(x), the pixel value of each character image is P(i, x), the position vector is x, each character image belonging to the character image group is i(1 to N: N is the number of character images belonging to the character image group) and the shifting vector of the character image i is vi, the coincidence determining portion 530 calculates the number of coinciding pixels by the following expression:
(The number of coinciding pixels K)=Σ{Q(x)*P(i, x−vi)}
(Σ represents the sum total with respect to the variable x)
When i=1,
Q(x)=P(1, x), and
-
- when i>1,
Q(x)=P(1, x)+P(2, x−v2)+ . . . +P(i−1, x−v(i−1)).
- when i>1,
At step 404 (S404), the position correcting portion 550 determines the correction vector of the position information inputted from the character recognizing portion 410 based on the number of coinciding pixels (the degree of coincidence) calculated at a plurality of relative positions by the coincidence determining portion 530. Specifically, the position correcting portion 550 selects, as the correction vector, the shifting vector vi when the number of coinciding pixels K calculated by the coincidence determining portion 530 becomes highest (the two-dimensional vector where the character image is displaced with respect to the position information inputted from the character recognizing portion 410).
At step 406 (S406), the coincidence determining portion 530 compares a plurality of character images classified into the same character image group (ones position-corrected by the correction vector), and calculates the degree of coincidence of the pixel values in each area. Specifically, the coincidence determining portion 530 overlays all the character images included in the character image group at the relative position where the number of coinciding pixels is largest, and adds the black pixels in each area to create the pixel distribution (histogram). That is, the coincidence determining portion 530 calculates Q(x) by the following expression with respect to all the character images (1 to N) included in each character image group:
Q(x)=ΣP(i, x−vi)
At step 408 (S408), the dictionary determining portion 540 performs threshold processing to remove the distribution number not more than the threshold value, on the degree of coincidence (pixel distribution) calculated by the coincidence determining portion 530. Specifically, the dictionary determining portion 540 normalizes Q(x) calculated by the coincidence determining portion 530 to calculate Q′(x) and performs the threshold processing on the calculated Q′ (x). That is, the dictionary determining portion 540 calculates the distribution probability Q′ (x) by the following expression:
Q′(x)=Q(x)/N
Then, by the following conditional expression, the coincidence determining portion 530 removes the part where the distribution probability Q′(x) is lower than the reference value, and calculates Q″(x):
when Q′(x)>a threshold value A, Q″(x)=1, and
in the other cases, Q″(x)=0
At step 410 (S410), with respect to the pixel distribution after the threshold processing, the dictionary determining portion 540 determines whether the area where the distribution number is not 0 (the black pixel area) is larger than the reference or not. When the area is not less than the reference, the process shifts to the processing of S412, and when the area is smaller than the reference, with respect to the character image group, the image pattern determination processing (S40) is finished without the image patterns being registered.
Specifically, the dictionary determining portion 540 determines whether the number of pixels where the above-mentioned Q″(x) is 1 is not less than the reference value or not. The dictionary determining portion 540 performs image pattern registration when the number is not less than the reference value, and does not perform image pattern registration when the number is smaller than the reference value.
At step 412 (S412), the dictionary determining portion 540 determines the image patterns based on the pixel distribution. Specifically, the dictionary determining portion 540 determines the patterns of Q″(x) as the image patterns to be registered in the image dictionary.
As shown in
At step 604 (S604), the pattern determining portion 610 determines whether coinciding image patterns are present or not. Specifically, the pattern determining portion 610 determines whether the number of coinciding pixels calculated with respect to each image pattern is within the permissible range (for example, not less than 90% of all the pixels of the partial image) or not. When the number is within the permissible range, the process shifts to the processing of S606, and when the number is outside the permissible range, the process shifts to the processing of S610.
At step 606 (S606), the pattern determining portion 610 reads out the index of, of the image patterns whose numbers of coinciding pixels are within the permissible range, the image pattern whose number of coinciding pixels is the largest from the image dictionary, outputs the read index to the index coding portion 630, and outputs the position information (one corrected by the position correcting portion 550) of the character image to the position information coding portion 620.
The index coding portion 630 codes the index inputted from the pattern determining portion 610, and outputs the coded data of the index to the selecting portion 660.
At step 608 (S608), the position information coding portion 620 codes the position information inputted from the pattern determining portion 610, and outputs the coded data of the position information to the selecting portion 660.
The selecting portion 660 associates the coded data of the index inputted from the index coding portion 630 and the coded data of the position information inputted from the position information coding portion 620 with each other, and outputs them to the code output portion 670. That is, the selecting portion 660 outputs, to the code output portion 670, the index and the position information so as to be associated with each other for each partial image.
At step 610 (S610), the pattern determining portion 610 outputs the partial image (that is, the character image whose corresponding image pattern is absent in the image dictionary) to the image coding portion 640.
The image coding portion 640 codes the image data of the partial image inputted from the pattern determining portion 610, and outputs the coded data of the partial image to the selecting portion 660.
The selecting portion 660 outputs the coded data of the partial image inputted from the image coding portion 640 to the code output portion 670.
At step 612 (S612), the pattern determining portion 610 determines whether the coding processing is finished with respect to all the partial images or not. When a partial image that is not coded is present, the process returns to the processing of S602 to perform the coding processing of the next partial image, and when all the partial images are coded, the process shifts to the processing of S614.
At step 614 (S614), the dictionary coding portion 650 codes the image dictionary (one in which the image patterns and the indices are associated with each other) inputted from the image dictionary creating portion 50, and outputs the coded data of the image dictionary to the code output portion 670.
As described above, in the image processor 2 of the present embodiment, since the image patterns are determined based on the character image groups classified based on character codes and are registered in the image dictionary, efficient coding (coding of a high compression rate) is realized while the readability of document images is maintained.
Moreover, in the image processor 2, since the character images belonging to the same character image group are compared with each other to correct the extraction position of the character images (the position information of the character images), the character image shift caused by a character image extraction error, a font difference or the like is corrected, so that the positions of the characters can be reproduced with high accuracy. While in the present embodiment, the image processor 2 classifies the character images into character image groups based on character codes, it may classify the character images by use of another character identification information such as the font type or the character image size, or may classify the character images by combining at least two of the character codes, the font type and the character image size.
Next, a first modification of the above-described embodiment will be described.
As illustrated in
Moreover, as illustrated in
Accordingly, the image processor 2 of the first modification enables the following: When character images having different shapes are mixed in the character image groups classified based on character codes, at least two image patterns are determined with respect to the character image groups and the image patterns are registered in the image dictionary.
As shown in
At step 422 (S422), the dictionary determining portion 540 determines the sum coupling pattern Q″(x) based on the pixel distribution.
At step 424 (S424), the dictionary determining portion 540 extracts, as a common pattern, the part (intersection) common to the sum coupling pattern Q″(x) and the character images belonging to the character image group, and determines the extracted common pattern as the image pattern. The dictionary determining portion 540 does not register all the common patterns extracted based on the character images in the image dictionary but registers them in the image dictionary while excluding overlapping common patterns. Moreover, the dictionary determining portion 540 may exclude, of a plurality of common patterns extracted based on the character images, ones where the difference in the black pixel area is not more than the threshold value, as overlapping patterns.
As shown in
The dictionary determining portion 540 generates, at the above-described S422, a sum coupling pattern Q″(x) where the part not less than a threshold value B is 1 and the part less than the threshold value B is 0.
Then, the dictionary determining portion 540 extracts, at S424, the parts common to the sum coupling pattern Q″(x) and the character images belonging to the character image group as common patterns. That is, the dictionary determining portion 540 multiplies the sum coupling pattern Q″(x) and the pixel distribution P(i, x−vi) of each character image. By this, an image pattern #1 and an image pattern #2 which are typical character images being present in a plurality of numbers in the character image group are extracted.
As described above, the image processor 2 according to the first modification is capable of determining a plurality of typical image patterns based on a plurality of image characters classified as having the same character code. By this, when the frequency of occurrence is high, character images of different fonts, sizes and the like are also registered in the image dictionary as image patterns, and these character images are reproduced in the decoded image.
Next, a second modification of the above-described embodiment will be described.
As illustrated in
As illustrated in
Accordingly, in the second image processor 2 according to the second modification, when a plurality of character images having different shapes are mixed in the character image groups classified based on character codes, the common shapes and the different shapes of these character images are hierarchically registered in the image dictionary as image patterns.
As shown in
At step 442 (S442), the dictionary determining portion 540 extracts a part higher in the distribution probability Q′(x) than a first level threshold value as the image pattern of the common shape (that is, the first level pattern). Here, the first level threshold value is a reference for separating the common shape common to the character images belonging to the character image group and the other shapes, and in this example, is a distribution probability when black pixels are present in substantially all the character images belonging to the character image group (0.95. etc.).
At step 444 (S444), the dictionary determining portion 540 generates a distribution probability Q1(x) where the part corresponding to the first level pattern is removed from the distribution probability Q′(x) by making the part corresponding to the first level pattern 0.
Then, the dictionary determining portion 540 generates a sum coupling pattern Q1″(x) where the first level pattern is removed by making the part not less than the second level threshold value 1 and making the part lower than the second level threshold value 0 in the distribution probability Q1(x) Here, the second level threshold value is a reference for separating one which is a different shape included in each character image as the difference of the common shape and is high in the frequency of occurrence in the character image group. The second level threshold value is lower than the first level threshold value, and is set, for example, according to the target code amount and image quality.
The dictionary determining portion 540 multiplies the sum coupling pattern Q1″ and the pixel distribution P(i, x−vi) of each character image, extracts Ri(x) which is the part common to these, and determines the extracted Ri(x) as the image pattern of the different shape (that is, the second level pattern).
At step 446 (S446), the dictionary determining portion 540 registers the first level pattern and the second level pattern generated from the first character image group in the image dictionary so as to be associated with each other. More specifically, the dictionary determining portion 540 registers the first level pattern and the second level pattern in the image dictionary so as to be hierarchically associated with each other based on their indices.
As shown in
The dictionary determining portion 540 has the first level threshold value for extracting the common shape (the area where the distribution probability is substantially highest) and the second level threshold value for extracting the different shape occurring at high frequency.
First, the dictionary determining portion 540 performs the threshold processing on the distribution probability Q′(x) by the first level threshold value, and extracts the part corresponding to the first level pattern (common shape). Then, the part corresponding to the extracted first level pattern is removed from the distribution probability Q′(x) and converted into “1” or “0” with the second level threshold value as the reference, thereby generating a second level sum coupling pattern Q1″(x).
The dictionary determining portion 540 multiplies the second level sum coupling pattern Q1″(x) and the pixel distribution p(i, x) of each character image i to thereby extract the second level pattern which is their common part. In this example, the common part of the second level sum coupling pattern Q1″(x) and the “character image #1” is the second level pattern a, and the common part of the second level sum coupling pattern Q1″ (x) and the “character image #2” is the second level pattern b.
By this, the “character image #1” can be replaced with the index as the sum of the first level pattern and the second level pattern a, and the “character image #2”•can be replaced with the index as the sum of the first level pattern and the second level pattern b.
As shown in
At step 642 (S642), the pattern determining portion 610 determines whether the first level pattern that coincides with the inputted partial image is present or not. Specifically, the pattern determining portion 610 determines whether the number of coinciding pixels calculated with respect to each first level pattern is within the permissible range (for example, not less than 90% of all the pixels of the partial image) or not. When the number is within the permissible range, the process shifts to the processing of S644, and when the number is outside the permissible range, the process shifts to the processing of S610.
At step 644 (S644), the pattern determining portion 610 reads out the index of, of the first level patterns whose numbers of coinciding pixels are within the permissible range, the first level pattern whose number of coinciding pixels is the largest from the image dictionary, outputs the read index to the index coding portion 630, and outputs the position information (one corrected by the position correcting portion 550) of the character image to the position information coding portion 620.
At step 646 (S646), the pattern determining portion 610 extracts the different shape between the shape of the first level pattern and the inputted partial image.
At step 648 (S648), the pattern determining portion 610 reads out the second level pattern associated with the first level pattern from the image dictionary, compares the read second level pattern with the extracted different shape, and calculates the number of coinciding pixels. The pattern determining portion 610 determines whether the number of coinciding pixels calculated with respect to each second level pattern is within the permissible range (for example, not less than 90% of all the pixels of the different shape) or not. When the number is within the permissible range, the process shifts to the processing of S650, and when the number is outside the permissible range, the process shifts to the processing of S608.
At step 650 (S650), the pattern determining portion 610 reads out the index of, of the second level patterns whose numbers of coinciding pixels are within the permissible range, the second level pattern whose number of coinciding pixels is the largest from the image dictionary, associates the read index with the index of the first level pattern, and outputs it to the index coding portion 630.
While the black pixels coinciding with none of the first level pattern and the second level pattern are removed as noise or the like in this example, these black pixels themselves may be coded separately.
As described above, in the image processor 2 according to the second modification, a plurality of character images classified as having the same character code are compared with each other, the common shape and the different shape are extracted, and the extracted common shape and different shape are hierarchically associated with each other and registered in the image dictionary. By this, the reproducibility of the character images can be improved while a high compression rate is realized.
Moreover, the image processor 2 is capable of realizing progressive transmission that gradually improves the image quality according to the amount of coded data decoded on the decoding side by hierarchizing the common shape and the different shape and outputting them successively from the coded data of the common shape.
Next, a third modification of the above-described embodiment will be described.
The correctness (accuracy) of the character recognition by the character recognizing portion 410 sometimes differs among the character images included in the input image. Therefore, there is a possibility that character images whose shapes are not similar are mixed in the character image groups classified based on the character recognition result (character codes) When character images whose shapes are not similar are mixed in the character image groups like this, a problem can occur such that the image patterns registered in the image dictionary are distorted.
Accordingly, the image processor 2 according to the third modification creates an image dictionary for each accuracy of the character recognition processing.
As illustrated in
Further, the character classifying portion 520 further classifies each character image group based on character codes.
For each of the character image groups classified based on the accuracy and character codes, the coincidence determining portion 530 compares the character images included in the character image group with each other and determines the degree of coincidence. The dictionary determining portion 540 determines the image pattern for each of the character image groups classified based on the accuracy and character codes. When the image pattern is determined with respect to a character image group with low accuracy, the dictionary determining portion 540 first compares the image pattern determined with respect to a character image group with high accuracy with the character images belonging to this character image group (the character image group with low accuracy), and determines whether these coincide with each other or not. When they coincide with each other, the character image is excluded from the character image group.
As described above, the image processor 2 according to the third modification is capable of minimizing the influence of a character recognition processing error on the image dictionary by creating the image dictionary for each accuracy of the character recognition processing.
Next, a second embodiment will be described.
While the image dictionary creating portion 50 creates the image dictionary by using all the character images of one page or one document in the above-described first embodiment, a second image dictionary creating portion 52 according to the present embodiment compares the character images one character at a time and registers them in the image dictionary.
As shown in
The registration controller 570 controls new registration in the image dictionary by determining whether to register the character image extracted by the character image extracting portion 510 in the image dictionary as an image pattern or not. More specifically, the registration controller 570 determines whether to register the newly inputted character image in the image dictionary as an image pattern or not based on the degree of coincidence between the newly inputted character image and the image patterns registered in the image dictionary. For example, when the degrees of coincidence between the already registered image patterns and the newly inputted character image are all equal to or less than the reference, the registration controller 570 instructs the dictionary determining portion 540 to register this character image in the image dictionary as the image pattern.
As shown in
At step 22 (S22), the storage portion 500 of the image dictionary creating portion 50 stores, into the memory 204 (
At step 46 (S46), the registration controller 570 determines whether to register the newly inputted character image in the image dictionary as an image pattern or not in cooperation with the coincidence determining portion 530 and the position correcting portion 550.
At step 52 (S52), when the registration controller 570 determines that the inputted character image is registered in the image dictionary as an image pattern, the processing shifts to the processing of S54, and when the registration controller 570 determines that the inputted character image is not registered as an image pattern, the process shifts to the processing of S62. The position information of the character image determined not to be registered is associated with the index of the image pattern most similar to the character image, and outputted to the coding portion 60.
At step 54 (S54), the index assigning portion 560 assigns an index to the newly inputted character image (one registered as the image pattern), outputs this character image and the index to the storage portion 500, and outputs the position information of this character image and the index of this character image to the coding portion 60.
At step 62 (S62), the coding portion 60 codes the index and the position information inputted from the image dictionary creating portion 52 (
At step 80 (S80), the coding program 5 determines whether the processing is finished for all the character images included in the input image or not. When determining that the processing is finished, the coding program 5 codes the image dictionary, outputs it to the printer 10 or the like, and ends the operation (S2). When determining that the processing is not finished, the coding program 5 returns to the processing of S22 and performs the processing on the next character image.
As shown in
The number of coinciding pixels K is calculated by the following expression:
(The number of coinciding pixels K)=Σ{S(x)*P(i, x−vi)}
where S(x) is the pixel distribution of the image pattern.
Then, the coincidence determining portion 530 normalizes the calculated number of coinciding pixels K to calculate the coinciding pixel rate K′.
The coinciding pixel rate K′ is calculated by the following expression:
(The coinciding pixel rate K′)=K/M
where M is the number of pixels constituting the character image.
At S404, the position correcting portion 550 determines the correction vector of the position information inputted from the character recognizing portion 410 based on the degree of coincidence calculated at a plurality of relative positions by the coincidence determining portion 530. Specifically, the position correcting portion 550 sets, as the correction vector, the shifting vector vi when the coinciding pixel rate K′ calculated by the coincidence determining portion 530 becomes highest.
At step 464 (S464), the registration controller 570 compares the coinciding pixel rate K′ determined by the coincidence determining portion 530 with the reference value for determining whether to register the image pattern or not. When the coinciding pixel rate K′ is not less than the reference value, the process shifts to the processing of S466, and when the coinciding pixel rate K′ is lower than the reference value, the process shifts to the processing of S470.
At step 466 (S466), the registration controller 570 prohibits the registration of this character image in the image dictionary. Further, the registration controller 570 reads out the index of the image pattern most similar to this character image (that is, the image pattern whose coinciding pixel rate K′ is the highest) from the image dictionary.
At step 467 (S467), the registration controller 570 outputs the read index and the position information of this character image (one corrected by the position correcting portion 550) to the coding portion 60.
At step 470 (S470), the registration controller 570 instructs the dictionary determining portion 540 to register the newly inputted character image in the image dictionary as an image pattern.
The dictionary determining portion 540 registers the inputted character image (one position-corrected by the position correcting portion 550) in the storage portion 500 (image pattern storing section) as an image pattern.
As described above, in the image processor 2 according to the second embodiment, the character images are extracted one character at a time based on the position information of the character images, whether to register the extracted character images in the image dictionary or not is determined one character at a time, and coding is performed based on the result of the determination. By this, the coding processing can be performed sequentially.
Next, a modification of the second embodiment will be described.
The image dictionary creating portion 52 stores the character images registered in the image dictionary as image patterns so as to be associated with character codes, and compares the newly inputted character image with the image pattern whose character code coincides with priority to determine the degree of coincidence. For example, when the character images are registered in the image dictionary as image patterns, the dictionary determining portion 540 registers the character codes of these character images so as to be associated with the image patterns. The coincidence determining portion 530 reads out only the image pattern whose character code coincides with that of the newly inputted character image from the image dictionary, compares the read image pattern with the inputted character image, and calculates the coinciding pixel rate K′. The registration controller 570 determines whether to register the inputted character image in the image dictionary as an image pattern or not based on only the coinciding pixel rate K′ between the image pattern whose character code coincides and the inputted character image.
This enables the image dictionary creating portion 52 to reduce the number of times of the coincidence degree determination processing.
Moreover, the image dictionary creating portion 52 may assign a weight to the determined degree of coincidence based on the character code of the inputted character image. Moreover, the image dictionary creating portion 52 may assign a weight to the determined degree of coincidence based on the accuracy of the character recognition processing by the character recognizing portion 410.
As illustrated in
The weight assignment processing according to the character codes is not limited to the one based on whether the character codes coincide with each other or not. For example, the weight assignment processing may be performed by setting a weight assignment coefficient corresponding to a combination of specific character codes (for example, a combination of ones having similar shapes such as a numeric character “1” and a letter “l”) so that the weight assignment processing is performed based on the combination of the character codes, or the weight assignment processing may be performed by setting a weight assignment coefficient according to the requirement of the readability (for example, weight assignment that differs between the character code of a kanji character and the character code of a hiragana character).
Moreover, as illustrated in
As shown in
At step 482 (S482), the registration controller 570 multiplies the coinciding pixel rate K′ calculated with respect to each image pattern by the weight assignment coefficient according to whether the character codes coincide with each other or not.
Further, the registration controller 570 obtains the accuracy of the character recognition processing on the character image from the character recognizing portion 410, and multiplies the coinciding pixel rate K′ by the weight assignment coefficient according to the obtained accuracy of the character recognition processing.
At S464, the registration controller 570 compares the coinciding pixel rate K′ on which weight assignment has been performed based on whether the character codes coincide with each other or not and the accuracy of the character recognition processing with the reference value, and determines whether to register this character image in the image dictionary as an image pattern or not.
As described above, the image dictionary creating portion 52 is capable of registering more appropriate image patterns in the image dictionary by assigning a weight to the determined degree of coincidence based on whether the character codes coincide with each other or not and the accuracy of the character recognition processing.
-
- Coding side
- Character codes, font types, positions of occurrence, etc.
- Decoding side
- Different coding methods for characters and images
- Font DB
- Association is required
- Image dictionary, indices, positions of occurrence, etc.
FIG. 3 - 2 Image processor
- 10 Printer
- 20 Controller
- 22 Communication device
- 24 Storage device
- 26 UI device
- 204 Memory
- 240 Storage medium
FIG. 4 - To scanner
- To storage device, etc.
- 40 Image input portion
- 410 Character recognizing portion
- 420 PDL decomposer
- 50 Image dictionary creating portion
- 60 Coding portion
- Coding program 5
FIG. 5 - Image dictionary creating portion 50
- Character codes
- Character area information
- Input image data
- Dictionary data
- Character area information (after correction)
- Input image data
- 500 Storage portion
- 510 Character image extracting portion
- 520 Character classifying portion
- 530 Coincidence determining portion
- 540 Dictionary determining portion
- 550 Position correcting portion
- 560 Index assigning portion
FIG. 6
- Coding portion 60
- 610 Pattern determining portion
- 620 Position information coding portion
- 630 Index coding portion
- 640 Image coding portion
- 650 Dictionary coding portion
- 660 Selecting portion
- 670 Code output portion
FIG. 7 - Start
- S10 Obtain image data
- S20 Extract the character images
- S30 Classify the character images based on character codes
- S40 Determine the image patterns from the classified character images
- S50 Assign indices to the image patterns
- S60 Coding processing
- S70 Output the coded data
- End
FIG. 8 - Image pattern determination processing S40
- S402 Compare the character images of the same classification while shifting them
- S404 Determine the correction amount of the character position
- S406 Calculate the degree of coincidence of the character images of the same classification
- S408 Coincidence degree threshold processing
- S410 Pixels of the number larger than the reference value present?
- S412 Determine image patterns
FIG. 9 - Coding processing S60
- S602 Compare the partial images and the image patterns
- S604 Coinciding patterns present?
- S606 Code the index of the coinciding image pattern
- S608 Code the position information of the partial image
- S610 Code the partial image
- S612 Finished with respect to all the images?
- S614 Code the dictionary data
FIG. 10 - The same character code
- Character recognition error
- The same character code (“1” or “I”)
FIG. 11 - Image pattern determination processing S42
- S402 Compare the character images of the same classification while shifting them
- S404 Determine the correction amount of the character position
- S406 Calculate the degree of coincidence of the character images of the same classification
- S408 Coincidence degree threshold processing
- S410 Pixels of the number larger than the reference value present?
- S422 Determine the sum coupling pattern of the same classification
- S424 Multiply the sum coupling pattern and the character images
FIG. 12 - Distribution probability Q′(x)
- Threshold value B
- Pixel position x
- Generate sum coupling pattern
- Sum coupling pattern Q″ (x)
- Character image #1
- Image pattern #1
FIG. 13 - The same character code
- Second level pattern a
- First level pattern
FIG. 14 - Image pattern determination processing S44
- S402 Compare the character images of the same classification while shifting them
- S404 Determine the correction amount of the character position
- S406 Calculate the degree of coincidence of the character images of the same classification
- S408 Coincidence degree threshold processing
- S410 Pixels of the number larger than the reference value present?
- S442 Extract the first level pattern
- S444 Extract the second level pattern
- S446 Hierarchize and output the level patterns
FIG. 15 - Distribution probability Q′(x)
- First level threshold value
- Second level threshold value
- Pixel position x
- (extracted according to the degree of coincidence)
- First level pattern
- Second level sum coupling pattern Q1″(x)
- Character image #1
- Second level pattern a
FIG. 16 - Coding processing S64
- S602 Compare the partial image and the image pattern
- S610 Code the partial image
- S642 Coinciding first level pattern present?
- S644 Code the index of the first level pattern
- S646 Extract the difference from the first level pattern
- S648 Coinciding second level pattern present?
- S650 Code the index of the second level pattern
- S608 Code the position information of the partial image
- S612 Finished with respect to all the images?
- S614 Code the dictionary data
FIG. 17 - Accuracy
- Character code
- Character image
- Not less than 90%
- Not less than 70% and less than 90%
- Less than 70%
- The same character code
FIG. 18 - Image dictionary creating portion 52
- Character codes
- Character area information
- Input image data
- Dictionary data
- Character area information (after correction)
- Input image data
- 500 Storage portion
- 510 Character image extracting portion
- 520 Character classifying portion
- 530 Coincidence determining portion
- 540 Dictionary determining portion
- 550 Position correcting portion
- 560 Index assigning portion
- 570 Registration controller
FIG. 19
- Start
- S10 Obtain image data
- S22 Extract character images and transfer them one character image at a time
- S46 Determine register/non-register as an image pattern or not
- S52 Register?
- S54 Assign an index to the image pattern
- S62 Coding processing
- S80 Finished with respect to all the character images?
- End
FIG. 20 - Image pattern registration determination processing S46
- S462 Compare with the image patterns while shifting
- S404 Determine the correction amount of the character position
- S464 The degree of coincidence not less than the reference value?
- S466 Select the image pattern with the highest coinciding pixel rate
- S468 Output the index of this image pattern
- S470 Determine the new registration of the inputted character image
FIG. 21 - Character code Weight assignment
- Coincidence 1.2 times
- Noncoincidence 1 time
- Recognition accuracy Weight assignment
- Not less than 90% 1.2 times
- Not less than 70% and less than 90% 1 time
- Less than 70% 0.8 times
FIG. 22 - Image pattern registration determination processing S48
- S462 Compare with the image patterns while shifting
- S404 Determine the correction amount of the character position
- S482 Assign a weight based on the character codes and the accuracy
- S464 The degree of coincidence not less than the reference value?
- S466 Select the image pattern with the highest coinciding pixel rate
- S468 Output the index of this image pattern
- S470 Determine the new registration of the inputted character image
Claims
1. An image dictionary creating apparatus comprising:
- an information obtaining section that obtains a result of a character recognition processing on an input image;
- a character classifying section that classifies a character image included in the input image into a plurality of character image groups based on the result of the character recognition processing obtained by the information obtaining section;
- a typical pattern determining section that determines a typical image pattern constituting the input image based on the character image classified into the character image groups by the character classifying section; and
- an identification information assigning section that assigns identification information for identifying the image pattern, to the typical image pattern determined by the typical pattern determining section.
2. An image dictionary creating apparatus according to claim 1, wherein the information obtaining section obtains character identification information for identifying a character represented by each character image as a result of the character recognition processing, and
- wherein the character classifying section classifies the character image included in the input image into the plurality of character image groups based on the character identification information obtained by the information obtaining section.
3. An image dictionary creating apparatus according to claim 2, wherein the character identification information includes a character code,
- wherein the character classifying section classifies the character image included in the input image based on the character code, and
- wherein based on the character image classified as having the same character code, the typical pattern determining section determines an image pattern corresponding to the character image.
4. An image dictionary creating apparatus according to claim 1,
- wherein the information obtaining section obtains character area information representative of an area of each character image in the input image,
- wherein the information obtaining section further includes a character image extracting section that extracts the character image from the input image based on the character area information obtained by the information obtaining section,
- wherein the character classifying section classifies the character image extracted by the character image extracting section into a plurality of character image groups, and
- wherein the typical pattern determining section determines the image pattern by comparing the character image extracted by the character image extracting section with each other.
5. An image dictionary creating apparatus according to claim 4, further comprising:
- a coincidence determining section that determines a degree of coincidence of a plurality of character images extracted by the character image extracting section by comparing the character image with each other at a plurality of relative positions;
- an area correcting section that corrects character area information of each character image based on the degree of coincidence determined at each relative position by the coincidence determining section; and
- an outputting section for outputting the character area information of each of the character images corresponding to the image patterns so as to be associated with the identification information of the image patterns.
6. An image dictionary creating apparatus according to claim 1, further comprising:
- a coincidence determining section that determining a degree of coincidence of a plurality of character images classified into the character image groups by the character classifying section by comparing the character image with each other,
- wherein the typical pattern determining section determines at least one image pattern with respect to the plurality of character images classified into the same character image group based on the degree of coincidence determined by the coincidence determining section, and
- wherein the identification information assigning section assigns the identification information for identifying the image pattern to at least one image pattern determined with respect to the same character image group by the typical pattern determining section.
7. An image dictionary creating apparatus according to claim 1, further comprising:
- a coincidence determining section that determines a degree of coincidence of each area included in a plurality of character images classified into the character image groups by the character classifying section by comparing the character images with each other,
- wherein the typical pattern determining section determines a plurality of image patterns constituting the character image based on the degree of coincidence determined with respect to each area by the coincidence determining section.
8. An image dictionary creating apparatus according to claim 1,
- wherein the information obtaining section further obtains recognition accuracy information of the character recognition processing so as to be associated with the character identification information or the character area information thereof, and
- wherein the character classifying section classifies the character images included in the input image into a plurality of character image groups based on the recognition accuracy information and the corresponding character identification information or character area information.
9. An image dictionary creating apparatus comprising:
- a pattern storing section that stores a typical image pattern constituting an input image;
- a coincidence determining section that determines a degree of coincidence by comparing a newly inputted character image and the typical image pattern stored by the pattern storing section; and
- a typical pattern determining section that stores the newly inputted character image into the pattern storing section as the image pattern according to the degree of coincidence determined by the coincidence degree determining section.
10. An image dictionary creating section according to claim 9,
- wherein the pattern storing section stores identification information of characters represented by the image pattern so as to be associated with the image pattern,
- wherein the pattern storing section further includes a weight assigning section for performing weight assignment on a degree of coincidence determined with respect to the image patterns by the coincidence degree determining section based on the identification information of a character represented by the inputted character image and the identification information of the image patterns stored in the pattern storing section, and
- wherein the typical pattern determining section determines whether to permit registration of the image pattern based on the inputted character image or not based on a degree of coincidence on which weight assignment has been performed by the weight assigning section.
11. An image dictionary creating apparatus according to claim 10,
- wherein the identification information of the characters is character codes,
- wherein the weight assigning section performs weight assignment so that the degree of coincidence is higher when a character code of the inputted character image coincides with a character code of any of the image patterns than when the character codes are different from each other, and
- wherein the typical pattern determining section prohibits the registration of the image pattern based on the character image when the degree of coincidence is higher than a reference, and
- the typical pattern determining section permits a registration of the image pattern based on the character image when the degree of coincidence is not more than the reference.
12. An image dictionary creating apparatus according to claim 11,
- wherein the character code are one determined by the character recognition processing on the input image, and
- wherein based on a recognition accuracy of the character recognition processing on the inputted character image, the weight assigning section performs weight assignment such that the higher the recognition accuracy is, the higher the degree of coincidence is.
13. An image dictionary creating apparatus comprising:
- an information obtaining section that obtains a character image included in an input image and character identification information that identifies a character represented by the character image;
- a character classifying that classifies the character image included in the input image into a plurality of character image groups based on the character identification information obtained by the information obtaining section;
- a typical pattern determining section that determines a typical image pattern constituting the input image based on the character image classified into the character image groups by the character classifying section; and
- an identification information assigning section that assigns an identification information for identifying the image patterns, to the image pattern determined by the typical pattern determining section.
14. An image dictionary creating apparatus comprising:
- an information obtaining section that obtaining a character area information representative of an area of each character image in the input image;
- a character image extracting section that extracts the character image from the input image based on the character area information obtained by the information obtaining section;
- a typical pattern determining section that determines a typical image pattern constituting the input image based on the character image extracted by the character image extracting section; and
- an identification information assigning section that assigns an identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section.
15. A coding apparatus comprising:
- an information obtaining section that obtaining a character image included in an input image and a character identification information that identifying a character represented by the character image;
- a character classifying section that classifies the character image included in the input image into a plurality of character image groups based on the character identification information obtained by the obtaining section;
- a typical pattern determining section that a determining typical image pattern constituting the input image based on the character images classified into the character image groups by the character classifying section;
- an identification information assigning section that assigns an identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section; and
- a coding section that codes a character area information representative of areas of the character image included in the input image and identification information of image patterns corresponding to the character images so as to be associated with each other.
16. A coding apparatus comprising:
- an information obtaining section that obtains a character area information representative of an area of each character image in the input image;
- a character image extracting section that extracts the character image from the input image based on the character area information obtained by the information obtaining section;
- a character classifying section that classifies the character image extracted by the character image extracting section into a plurality of character image groups;
- a typical pattern determining section that determines a typical image pattern constituting the input image based on the character image classified into the character image groups by the character classifying section;
- an identification information assigning section that assigns an identification information for identifying the image patterns, to the image patterns determined by the typical pattern determining section;
- a coincidence determining section that determines a degree of coincidence of the character images by comparing the plurality of character images classified into the character image groups by the character classifying section with each other at a plurality of relative positions;
- an area correcting section that corrects the character area information of the character images based on the degree of coincidence determined at a relative position by the coincidence determining section; and
- a coding section that codes at least the character area information of the character image corrected by the area correcting section and the identification information of the image patterns determined based on the character image groups into which the character image are classified.
17. An image dictionary creating method comprising:
- obtaining a character image included in an input image and character identification information for identifying characters represented by the character image;
- classifying the character image included in the input image into a plurality of character image groups based on the obtained character identification information,
- determining a typical image pattern constituting the input image based on the character image classified into the character image groups, and
- assigning an identification information for identifying the image pattern to the determined image pattern.
Type: Application
Filed: Mar 1, 2005
Publication Date: Oct 13, 2005
Patent Grant number: 7889926
Applicant: FUJI XEROX CO., LTD. (Minato-ku)
Inventors: Shunichi Kimura (Kanagawa), Yutaka Koshi (Kanagawa)
Application Number: 11/067,904