Distinguishing Text From Other Regions Patents (Class 382/176)

SYSTEM AND METHOD FOR EXTRACTING FLOWCHART INFORMATION FROM DIGITAL IMAGES

Publication number: 20120213429

Abstract: A system and method for extracting flowchart information from digital images is provided. The method includes converting the digital flowchart image into a grayscale image and then binarizing the image. The method further includes extracting and masking text data from the binarized image. Further, flow lines connecting geometric components within the flowchart image are extracted and masked. The geometric components are classified into one or more categories and the flow line relationships between the geometric components are extracted. Finally, the extracted text data, flow line relationship information and geometric component information is stored in a database.

Type: Application

Filed: March 28, 2011

Publication date: August 23, 2012

Applicant: INFOSYS TECHNOLOGIES LIMITED

Inventors: Bintu Gopalan Vasudevan, Sorawish Dhanapanichkul, Rajesh Balakrishnan
RECEIPTS SCANNER AND FINANCIAL ORGANIZER

Publication number: 20120213441

Abstract: The system contains a scanner, an apparatus for scanning receipts into a computer and a unique software program which automatically processes, organizes and saves expense information that can be viewed in various formats, namely, tabular statements, pie-charts, etc. The scanner, which accommodates paper of differing sizes, is used to input bills, receipts, bank statements, etc. The scanner is usually connected to a computer through a Universal Serial Bus or a parallel port for easy installation. The software program creates a text file of the scanned data by inclusion of sorting, categories, etc., and automatically saves the information in Quicken Interchange Format, allowing it to be imported into any financial management software for further processing. Each receipt is treated as an individual transaction. Multiple items in the receipt are used to create a “split” transaction with proper customizable categories added. Further, the software also allows for record keeping, budgeting and budget balancing.

Type: Application

Filed: April 30, 2012

Publication date: August 23, 2012

Applicant: KRIS ENGINEERING, INC.

Inventor: Radha K. C. Pandipati
Recording medium for recording logical structure model creation assistance program, logical structure model creation assistance device and logical structure model creation assistance method

Patent number: 8249351

Abstract: A method for assisting in the creation of a logical structure model, which stores, from an image in which character strings associated respectively with a plurality of logical elements constituting a logical structure are described, the logical elements, character strings associated with the logical elements, and the logical structure, wherein character strings in an input image and the logical structure among the character strings in the input image are extracted, a logical element is selected among the plurality of logical elements according to the degrees of similarity between the extracted character strings and the character string associated respectively with the plurality of logical elements stored in the logical structure model, a character string associated with the selected logical element and a character string in the input image associated with the logical element based on the logical structure among the extracted character strings in the input image are extracted.

Type: Grant

Filed: December 4, 2008

Date of Patent: August 21, 2012

Assignee: Fujitsu Limited

Inventors: Noriaki Ozawa, Yoshinobu Hotta, Hiroaki Takebe, Yusaku Fujii, Akihiro Minagawa, Hiroshi Tanaka, Katsuhito Fujimoto
Image forming apparatus and method thereof

Patent number: 8248662

Abstract: An image forming apparatus and a method of using the same, the image forming apparatus including: a detection unit to detect an edge of an input image; a categorization unit to categorize the detected edge, according to a gray value and line width; and a compensation unit to compensate the gray value according to the categorized edge type.

Type: Grant

Filed: November 20, 2007

Date of Patent: August 21, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventor: Hyeon-seok Seo
Method and system for searching for information on a network in response to an image query sent by a user from a mobile communications device

Patent number: 8249347

Abstract: Present invention relates to a method and system for automatic searching for information on a network in response to an image query sent by a user. The image query includes an image that is captured by using a mobile communications device with a camera. The image is processed to detect the text present in it. The detected text is then recognized using an OCR. Subsequently, the text is searched for matches in the corresponding domain database, selected from the various domain databases present in the network. Thereafter, selected matches and additional related information is sent to the user.

Type: Grant

Filed: May 19, 2011

Date of Patent: August 21, 2012

Assignee: A9.com, Inc.

Inventors: Gurumurthy D. Ramkumar, Raghavan Manmatha, Supratik Bhattacharyya, Gautam Bhargava, Mark Ruzon
Image evaluation for reading mode in a reading machine

Patent number: 8249309

Abstract: A portable reading machine detects poor image conditions for performing optical character recognition processing. The portable reading machine receives an image of sufficient resolution to distinguish lines of text but not necessarily of sufficient resolution to distinguish individual characters and processes the image to determine imaging conditions from the image. The reading machine reports imaging conditions to the user.

Type: Grant

Filed: April 1, 2005

Date of Patent: August 21, 2012

Assignee: K-NFB Reading Technology, Inc.

Inventors: Raymond C. Kurzweil, Paul Albrecht, James Gashel, Lucy Gibson, Lev Lvovsky
SYSTEMS AND METHODS FOR REPLACING NON-IMAGE TEXT

Publication number: 20120207390

Abstract: Systems and methods for replacing non-image text are provided. One method for replacing non-image text includes padding a first data representing an image of text to create an image segment. The method includes replacing a second data representing non-image text with the image segment.

Type: Application

Filed: February 14, 2011

Publication date: August 16, 2012

Inventors: Craig P. Sayers, Prakash Reddy
INTERACTIVE PAPER SYSTEM

Publication number: 20120207391

Abstract: A printer, scanner device and methods for using same are described herein. A printer device may include a dedicated input that, when actuated, generates and sends a request to a computer for known data or a predetermined print job, e.g., schedule information from a personal information management (PIM) application. A scanner device may include another dedicated input that, when actuated, automatically scans a document fed to the device by the user and sends the scanned image to IM (or other) software on a computer, bypassing the need to manipulate the scanned image using scanner software. The device may be used with printed metapaper, which includes a barcode or other indicia identifying the metapaper and corresponds to a stored template image of the metapaper. When the metapaper is rescanned, the scan can be compared to the stored template information to identify changes and synchronize the changes with the IM software.

Type: Application

Filed: February 3, 2012

Publication date: August 16, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Daniel Allen Rosenfeld, Kumar H. Chellapilla
SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR DETERMINING WHETHER TEXT WITHIN AN IMAGE INCLUDES UNWANTED DATA, UTILIZING A MATRIX

Publication number: 20120201458

Abstract: A system, method, and computer program product are provided for determining whether text within an image includes unwanted data, utilizing a matrix. In operation, a matrix corresponding to an image is generated. Additionally, text within the image is identified utilizing the matrix. Furthermore, it is determined whether the text includes unwanted data.

Type: Application

Filed: April 16, 2012

Publication date: August 9, 2012

Inventor: Udhayakumar Lakshmi Narayanan
FINDING REPEATED STRUCTURE FOR DATA EXTRACTION FROM DOCUMENT IMAGES

Publication number: 20120201457

Abstract: Methods and system employing the same for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records from the candidate fields are selected using a probabilistic model and an optimal record set is determined from the best candidate records.

Type: Application

Filed: February 8, 2011

Publication date: August 9, 2012

Applicant: PALO ALTO RESEARCH CENTER INCORPORATED

Inventors: Evgeniy Bart, Prateek Sarkar, Eric Saund
Similar image search apparatus and computer readable medium

Patent number: 8238663

Abstract: A similar image search apparatus includes a storage unit, a search unit, a text feature selection unit, an image feature transformation unit and a similar image search unit. The storage unit stores images and pieces of text information associated with the respective images. The search unit retrieves candidate images. Each candidate image has a similar image feature to a image feature of a key image. The text feature selection unit select a text feature of the respective candidate images which satisfies a given selecting condition. The image feature transformation unit, base on the selected text feature, transforms the image features. The similar image search unit retrieves similar images from the candidate images based on the transformed image features. The image features of the similar images are similar to the image feature of the key image.

Type: Grant

Filed: August 13, 2008

Date of Patent: August 7, 2012

Assignee: Fuji Xerox Co., Ltd.

Inventor: Noriji Kato
TECHNIQUES INCLUDING URL RECOGNITION AND APPLICATIONS

Publication number: 20120195505

Abstract: Methods are systems are provided that include obtaining a digital image from a digital photograph, such as may be taken by a digital camera or a camera phone. The digital image includes, for example, a URI or URL, which may be contained within a visible frame. A character recognition technique, such as an optical character recognition technique, may be used to recognize the URI or URL from the digital image. The URI or URL may be used to access a corresponding Web page. The character recognition technique may be applied on the digital camera or cell phone itself, or remotely.

Type: Application

Filed: January 31, 2011

Publication date: August 2, 2012

Applicant: Yahoo! Inc.

Inventor: Jin Suk Park
Image processing method and image processing apparatus

Patent number: 8233713

Abstract: An image processing method, for receiving an input image and separating pixels having text characteristics and pixels having figure characteristics, includes: applying a first filtering processing for the input image to derive a first image processing result; applying a second filtering processing for the first image processing result to derive a second image processing result, wherein a distribution of filtering parameters of the first filtering processing is different from a distribution of filtering parameters of the second filtering processing; deriving a set of first reference values according to the first image processing result and the second image processing result; and determining whether each pixel within the input image is a text pixel or a figure pixel according to at least the set of the first reference values and a predetermined threshold.

Type: Grant

Filed: January 14, 2010

Date of Patent: July 31, 2012

Assignee: Primax Electronics Ltd.

Inventors: Hui-Jan Chien, Tsai-Hsing Chen, Li-Kai Cho, Chiung-Sheng Wang, Sung-Hui Lin
IMAGE PROCESSING APPARATUS, IMAGE PROCESSING SYSTEM AND IMAGE PROCESSING METHOD

Publication number: 20120189202

Abstract: A handwritten area is separated from image data of printed material in which handwriting has been inserted, and the separated handwritten area is identified as an enclosing line or a class symbol. An image area enclosed within the handwritten area identified as the enclosing line is extracted and acquired as an extracted image. The class symbol is correlated to an enclosing line drawn nearest the class symbol, and the extracted images are classified into groups according to the image areas within the enclosing line correlated to the type of class symbols. The grouped images are organized as listed data.

Type: Application

Filed: January 3, 2012

Publication date: July 26, 2012

Applicant: MURATA MACHINERY LTD.

Inventor: Nariyasu KAN
Image encoding apparatus, image processing apparatus and control method thereof

Patent number: 8229238

Abstract: The invention provides an image encoding apparatus which can improve image quality of an output image while further reduce the amount of attribute. A determination unit determines an area including a character/line drawing as a foreground image area based on an input multi-valued image. A foreground image generator generates foreground image in binary representation so that a first encoder performs MMR encoding on the foreground image. A background image generator generates multi-valued background image data by replacing the value of a multi-valued pixel in a position of the character/line drawing in the foreground image area with a replacement value calculated from the pixel values in a position of the non-character/line drawing pixel. A second encoder performs JPEG encoding on the background image. A mask unit masks attribute for pixels within the foreground image area with a predetermined value to output the masked data to a third encoder.

Type: Grant

Filed: February 5, 2009

Date of Patent: July 24, 2012

Assignee: Canon Kabushiki Kaisha

Inventor: Yuki Matsumoto
Method and system for selective bitmap edge smoothing

Patent number: 8228561

Abstract: An image processing system utilizes an image type classification circuit to identify inputted image data as picture image data or text/graphics image data. A halftone circuit, operatively connected to the image type classification circuit, converts the inputted image data, identified as picture image data, to halftone image data. Moreover, a tile pattern circuit, operatively connected to the image type classification circuit, to replace the inputted image data, identified as text/graphics image data, with tile patterns. The tile patterns are encoded with a predetermined pattern. A bitmap rendering circuit combines the halftone image data with the encoded tile patterns to render a bitmap, wherein the bitmap can be used by a print engine to reproduce the image.

Type: Grant

Filed: March 30, 2007

Date of Patent: July 24, 2012

Assignee: Xerox Corporation

Inventor: Michael Dale Stevens
SYSTEM, METHOD, AND COMPUTER PROGRAM PRODUCT FOR PREVENTING IMAGE-RELATED DATA LOSS

Publication number: 20120183174

Abstract: A system, method, and computer program product are provided for preventing data loss associated with an image. In use, an image is identified, and it is determined whether the image includes predetermined data. In addition, an action is performed based on the determination, for preventing data loss.

Type: Application

Filed: March 24, 2012

Publication date: July 19, 2012

Inventors: Prasanna Ganapathi Basavapatna, Gopi Krishna Chebiyyam
AUTOMATIC TABLE LOCATION IN DOCUMENTS

Publication number: 20120177290

Abstract: A method for locating tables in documents includes defining a plurality of tiles for a document, for each tile, determining a horizontal profile and a vertical profile, determining the location of lines by means of gradients of the horizontal profiles and the vertical profiles, selecting from the lines, the lines that are persistent, determining a rectangle in at least one corner of the document based on the persistent lines, and applying heuristics in order to accept or reject a determined rectangle as a table of the document. An apparatus for automatically locating a table in a document applies the method for locating tables in documents.

Type: Application

Filed: January 27, 2012

Publication date: July 12, 2012

Applicant: OCE TECHNOLOGIES B.V.

Inventors: Vincent Jean-Marie Noël Le Glaunec, Christophe Antoine Leynadier
Image processing apparatus, image processing method and image processing means

Patent number: 8218863

Abstract: An image processing apparatus which extracts, from image data, drawing-photograph pixels forming a drawing or a photograph, the image processing apparatus including a pixel value replacement unit configured to replace pixel values of image data with plural representative pixel values; a candidate region extraction unit configured to extract plural candidate regions; a feature value acquisition unit configured to acquire a feature value indicating a degree of contained symbol pixels forming symbols; a feature value determination units.

Type: Grant

Filed: January 21, 2009

Date of Patent: July 10, 2012

Assignee: Ricoh Company, Ltd.

Inventor: Fumihiro Hasegawa
Method and system for preprocessing an image for optical character recognition

Patent number: 8218875

Abstract: A method and system for preprocessing an image for Optical Character Recognition (OCR), wherein the image includes a plurality of columns is disclosed. Each column includes one or more of Arabic text and non-text items. The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. On determining the plurality of components, a line height and a column spacing is determined for the plurality of components. The plurality of components are then associated with a column of the plurality of columns based on the line height and the column spacing. Subsequently, a set of characteristic parameters are calculated for each column and the plurality of components of each column are merged based on the set of characteristic parameters to form sub-words and words.

Type: Grant

Filed: June 12, 2010

Date of Patent: July 10, 2012

Inventors: Hussein Khalid Al-Omari, Mohammad Sulaiman Khorsheed
Method for video mode detection

Patent number: 8213718

Abstract: A method for video mode detection, wherein video input data (VID) corresponding to a video picture (P) is received and a video mode is determined for said video picture (P). The determining of said video mode depends on a local video mode (LVM) and a global video mode (GVM) of said video picture (P). Said global video mode (GVM) is determined for said video picture (P) based on said video input data (VID) or a derivative (m1) thereof. For determining said local video mode (LVM), first said video picture (P) is subdivided into a ticker area (TA) and a remaining area (RA), thereby generating ticker area data (TAD). Then, said local video mode (LVM) is determined for said ticker area (TA) based on said ticker area data (TAD). When determining said local video mode (LVM), said ticker area (TA) is subdivided into n sub-areas, and at least one of said n sub-areas (1 . . . 6) is selected as selected sub-area (SSA).

Type: Grant

Filed: March 22, 2007

Date of Patent: July 3, 2012

Assignee: Sony Deutschland GmbH

Inventors: Sergio Mayoral, Oliver Erdler
Methods and apparatus for performing image binarization

Patent number: 8213735

Abstract: Methods and apparatus for binarizing images represented by sets of multivalent pixel values in a computationally efficient manner are described In a grayscale image to be binarized, one group of pixel values represents “foreground”, e.g., text to be converted to black, while another group represents a shaded “background” region to be converted, e.g., to white. The difference between foreground and background is often a function of the scale of the image components, e.g., text and/or other images. Filters in the form of morphological operators, computationally efficient quick-open and quick-close morphological operators are employed to binarize images, e.g., grayscale images. The methods and apparatus effectively handle both smooth and sharp image background structures in a computationally efficient manner.

Type: Grant

Filed: October 9, 2009

Date of Patent: July 3, 2012

Assignee: Accusoft Corporation

Inventors: Erica Drew Cooksey, William Douglas Withers
Document processing apparatus, document processing method, recording medium and data signal

Patent number: 8213717

Abstract: A document processing apparatus includes a marking detection part that detects a marking written on the form from data read by a first reading part, an attribute name extraction part that extracts a character string described beforehand within or near a marking area of the detected marking as an attribute name, an attribute name detection part that detects the attribute name, extracted by the attribute name extraction part, stored in an attribute information memory and specifies the descriptive position of the detected attribute name from the data read by a second reading part that reads the form on which the attribute values are entered, and an attribute value extraction part that extracts the character string around the detection position of the attribute name detected from the read data, and registers the extracted character string as the attribute value of the attribute associated with the attribute name in the attribute information memory.

Type: Grant

Filed: August 2, 2007

Date of Patent: July 3, 2012

Assignee: Fuji Xerox Co., Ltd.

Inventors: Yuuya Konno, Masahiro Kato, Katsuhiko Itonori, Etsuko Ito
Removing character from text in non-image form where location of character in image of text falls outside of valid content boundary

Publication number: 20120163718

Abstract: Data representing an image of text is received, as is data representing the text in non-image form. A valid content boundary within the image of the text is determined. For each character within the text in the non-image form, a location of the character within the image of the text is determined. Where the location of the character within the image of the text falls outside the valid content boundary, the character is removed from the data representing the text in the non-image form.

Type: Application

Filed: December 28, 2010

Publication date: June 28, 2012

Inventor: Prakash Reddy
Image reading apparatus and method to prevent image erasing due to erroneously line-shaped noise detection

Patent number: 8208171

Abstract: The present invention aims to prevent a problem that an image on a document sheet is erased due to misdetection of a line-shaped noise. A copy machine 1 compares RGB values of a target pixel with averaged RGB values (Step S103). If only one of the RGB values has a difference that is greater than a prescribed value Ref2 (Step S103: YES), the copy machine 1 extracts the target pixel as a line-shaped noise pixel, and moves to a line-shaped noise correction (Step S108) while holding the address of the target pixel in a line-shaped noise address storing area 49b. If two of the RGB values have differences (Step S103: NO, Step S104: YES) and a difference between these two of the RGB values is no greater than a prescribed value Ref3 (Step S105: YES), the copy machine 1 extracts the target pixel as a line-shaped noise pixel, and moves to the line-shaped noise correction (Step S108) while holding the address of the target pixel in the line-shaped noise address storing area 49b.

Type: Grant

Filed: December 10, 2008

Date of Patent: June 26, 2012

Assignee: Konica Minolta Business Technologies, Inc.

Inventors: Hiroaki Kubo, Nobuhiro Mishima
Methods and systems for identifying captions in media material

Patent number: 8208737

Abstract: The present invention relates to systems and methods for identifying captions associated with images in media material. A captioner includes a selector module and a caption identifier module. The selector module identifies text-blocks potentially associated with images in the media material. The caption identifier module identifies which text-blocks are captions associated with images in the media material, based on the textual and proximity features of the text-block and the images. The captioner may also include a caption feedback module to modify the determining of the caption identifier module.

Type: Grant

Filed: April 17, 2009

Date of Patent: June 26, 2012

Assignee: Google Inc.

Inventor: Eugene Ie
Image processing apparatus capable of accurately and quickly determining character part included in image

Patent number: 8208744

Abstract: An image processing apparatus separates in a scanned image a text area from a graphic area primarily including a graphic form or a graph. For the text area, neighboring black pixels are connected to perform character determination in a unit of a rectangle obtained by connecting the black pixels. For the graphic area, labeling processing is used to extract a circumscribed rectangle of consecutive black pixels, without connecting the black pixels, to perform character determination in a unit of the circumscribed rectangle.

Type: Grant

Filed: June 6, 2006

Date of Patent: June 26, 2012

Assignee: Konica Minolta Business Technologies, Inc.

Inventor: Toshihiro Mori
Method for interactively segmenting structures in image data records and image processing unit for carrying out the method

Patent number: 8200015

Abstract: In the method according to at least one embodiment of the invention, an image data record having a structure to be segmented is first of all displayed by display equipment. Using an input apparatus, a segmentation algorithm to be used is selected from a group of different segmentation algorithms, including a contour-based segmentation algorithm, a region-based segmentation algorithm and manual segmentation, based on the local image contrast in a region to be segmented in the image data record. A region to be segmented in the image data record is marked, and the structure to be segmented in the marked region is segmented using the selected segmentation algorithm, and a segmentation result of the segmentation is displayed. This procedure (selecting a segmentation algorithm/marking a region/segmenting the region/displaying) is repeated until the structure to be segmented is completely segmented in the displayed image data record and a boundary line of the structure is produced as the final segmentation result.

Type: Grant

Filed: June 18, 2008

Date of Patent: June 12, 2012

Assignee: Siemens Aktiengesellschaft

Inventors: Matthias Fenchel, Andreas Schilling, Stefan Thesen
Image analyser and adaptive image scaling circuit and methods

Patent number: 8200044

Abstract: An image analyser analyses regions of an image. An image scaler may then scale the image adaptively, in dependence on the nature of region of the image being scaled. In one embodiment, adjacent pixels are analysed to determine their frequency content. This frequency analysis provides an indication of whether the pixels likely contain hard edges, discontinuities or variations typical of computer generated graphics. As a result of the analysis, the type of scaling suited for scaling the image portion containing the pixels may be assessed. Adjacent pixels having high frequency components may be scaled by a scaling circuit that introduces limited ringing. Adjacent pixels having lower frequency components may be scaled using a higher-order multi-tap scaler. Resulting scaled pixels may be formed as a blended combination of the two different scaling techniques.

Type: Grant

Filed: January 3, 2007

Date of Patent: June 12, 2012

Assignee: Broadcom Corporation

Inventor: Edward George Callway
Image determination apparatus, image search apparatus and computer readable recording medium storing an image search program

Patent number: 8200012

Abstract: A preprocessing section binarizes input image data and calculates a total black pixel ratio. A feature extracting section detects connected components contained in the binarized image data and detects circumscribing bounding boxes that circumscribe these connected components, respectively. Based on sizes of the circumscribing bounding boxes detected and numbers of black pixels contained therein, predetermined connected components are removed. A determining section generates an edge map by using the residual connected components, and performs two-dimensional fast Fourier transform thereon to generate spectral data. The determining section performs two-dimensional fast Fourier transform on template images to generate spectral data. The determining section determines, based on these pieces of spectral data, whether or not a circular shape is contained in the input image data.

Type: Grant

Filed: February 26, 2009

Date of Patent: June 12, 2012

Assignee: Sharp Kabushiki Kaisha

Inventors: Jilin Li, Zhi-Gang Fan, Yadong Wu, Bo Wu
Device and method of processing image for power consumption reduction

Patent number: 8194092

Abstract: An image processing method for reducing a power consumption. The image processing method may reduce the power consumption by classifying an input content into a conversion target region and a preservation target region and by converting a luminance of pixels included in the conversion target region. Also, the image processing method may effectively perform a luminance conversion for pixels by separating the input content into the conversion target region and the preservation target region based on a luminance of the pixels of the input content. The image processing method may convert the luminance of the pixels of the conversion target region to maintain a contrast between text pixels and background pixels.

Type: Grant

Filed: March 20, 2009

Date of Patent: June 5, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventors: Young Ran Han, Seung Sin Lee, Du-Sik Park
Document-image-data providing system, document-image-data providing device, information processing device, document-image-data providing method, information processing method, document-image-data providing program, and information processing program

Patent number: 8194982

Abstract: In a document-image-data providing device, a document image inputting unit is configured to input document image data. An area recognition unit is configured to recognize a text area of a document image element containing text data among document image elements constituting the document image data, and another area of a document image element containing data other than the text data. A text data acquiring unit is configured to acquire text data contained in the recognized text area. A providing unit is configured to provide, in response to a document image data request received from the information processing device, both image data generated from the input document image data to have a resolution lower than a resolution of the input document image data and the text data acquired by the text data acquiring unit, to the information processing device.

Type: Grant

Filed: September 12, 2008

Date of Patent: June 5, 2012

Assignee: Ricoh Company, Ltd.

Inventor: Masajiro Iwasaki
RECTIFICATION OF CHARACTERS AND TEXT AS TRANSFORM INVARIANT LOW-RANK TEXTURES

Publication number: 20120134588

Abstract: A “Text Rectifier” provides various techniques for processing selected regions of an image containing text or characters by treating those images as matrices of low-rank textures and using a rank minimization technique that recovers and removes image deformations (e.g., affine and projective transforms as well as general classes of nonlinear transforms) while rectifying the text or characters in the image region. Once distortions have been removed and the text or characters rectified, the resulting text is made available for a variety of uses or further processing such as optical character recognition (OCR). In various embodiments, binarization and/or inversion techniques are applied to the selected image regions during the rank minimization process to both improve text rectification and to present the resulting images of text to an OCR engine in a form that enhances the accuracy of the OCR results.

Type: Application

Filed: December 3, 2011

Publication date: May 31, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Xin Zhang, Zhengdong Zhang, Xiao Liang, Zhouchen Lin, Yi Ma
Methods and systems for locating text in a digital image

Patent number: 8189917

Abstract: Aspects of the present invention are related to systems and methods for locating text in a digital image.

Type: Grant

Filed: September 25, 2008

Date of Patent: May 29, 2012

Assignee: Sharp Laboratories of America, Inc.

Inventor: Richard John Campbell
Method for ad-hoc parallel processing in a distributed environment

Patent number: 8184335

Abstract: An overall processing time to rasterize, at the first device, the electronic document to be rendered is computed. Also, a rendering time to render, at the first device, the electronic document to be rendered is computed. When the overall processing time to rasterize at the first device is greater than the rendering time to render at the first device, the electronic document to be rendered is parsed into a first document and sub-documents. A productivity capacity of each node is determined, the productivity capacity being a measured of the processing power of the node and the communication cost of exchanging information between the first device and the node. A sub-document is rasterized at a node when a productivity capacity of the node reduces the processing time to rasterize the electronic document to be rendered to be less than the computed overall processing time.

Type: Grant

Filed: March 25, 2008

Date of Patent: May 22, 2012

Assignee: Xerox Corporation

Inventors: Hua Liu, Steven J. Harrington
System, method, and computer program product for determining whether text within an image includes unwanted data, utilizing a matrix

Patent number: 8180152

Abstract: A system, method, and computer program product are provided for determining whether text within an image includes unwanted data, utilizing a matrix. In operation, a matrix corresponding to an image is generated. Additionally, text within the image is identified utilizing the matrix. Furthermore, it is determined whether the text includes unwanted data.

Type: Grant

Filed: April 14, 2008

Date of Patent: May 15, 2012

Assignee: McAfee, Inc.

Inventor: Udhayakumar Lakshmi Narayanan
3+1 layer mixed raster content (MRC) images having a black text layer

Patent number: 8180153

Abstract: A method, system and data structure for providing a 3+1 layer MRC image, including a black text layer. The black text layer includes pixel data corresponding to black text in an image and may be assigned a predetermined value for the color of black. According to one or more embodiments, using thresholding processing along with various morphological operations, the black text layer may be generated.

Type: Grant

Filed: December 5, 2008

Date of Patent: May 15, 2012

Assignee: Xerox Corporation

Inventors: Amal Malik, Xing Li
Method and System for Identifying Addressing Data Within a Television Presentation

Publication number: 20120114242

Abstract: Characters represented within a frame of a television presentation are identified. A pattern formed by a subset of the characters is identified if the pattern is indicative of an addressing datum. A provision is made for a selection of characters that form the pattern indicative of the addressing datum. In one embodiment, a web page is displayed upon a selection of characters that form a pattern indicative of a uniform resource locator for the web page.

Type: Application

Filed: January 19, 2012

Publication date: May 10, 2012

Applicant: JLB VENTURES LLC

Inventor: Dan Kikinis
USING EXTRACTED IMAGE TEXT

Publication number: 20120114241

Abstract: Methods, systems, and apparatus including computer program products for using extracted image text are provided. In one implementation, a computer-implemented method is provided. The method includes receiving an input of one or more image search terms and identifying keywords from the received one or more image search terms. The method also includes searching a collection of keywords including keywords extracted from image text, retrieving an image associated with extracted image text corresponding to one or more of the image search terms, and presenting the image.

Type: Application

Filed: January 13, 2012

Publication date: May 10, 2012

Applicant: GOOGLE INC.

Inventors: Luc Vincent, Adrian Ulges
Device adaptively switching image processes in accordance with characteristic of object included in image

Patent number: 8175397

Abstract: The present invention relates to a device configured to determine whether to perform transform processing from image data into vector data in accordance with characteristics of an object. The device is configured: to separate an object from image data; then to determine whether to transform image data corresponding to the object into vector data; subsequently, to extract contour data of the object that has been determined to be transformed into vector data; and to perform function approximation for the extracted contour data.

Type: Grant

Filed: September 18, 2008

Date of Patent: May 8, 2012

Assignee: Canon Kabushiki Kaisha

Inventor: Naoki Ito
Recognizing text at multiple orientations

Patent number: 8175388

Abstract: Systems, methods, and apparatus, including software tangibly stored on a computer readable medium, involve identifying text in an electronic document. An electronic document that includes an image object is received. In a first region of the image object, a first set of text characters having a first orientation in the image object are recognized. In a second region of the image object, a second set of text characters having a second orientation in the image object are recognized. The electronic document is modified to include a first text object containing an identification of the first set of text characters and a second text object containing an identification of the second set of text characters. The identification of the first set of text characters includes a first set of values. Each value in the first set of values represent an individual text character recognized in the first region. The identification of the second set of text characters includes a second set of values.

Type: Grant

Filed: January 30, 2009

Date of Patent: May 8, 2012

Assignee: Adobe Systems Incorporated

Inventor: Maurice D. Fisher
Image acquiring apparatus and control method thereof

Patent number: 8175386

Abstract: An image acquiring apparatus includes: an image sensor which senses light reflected from an object to be read, and which detects an analog pixel value corresponding to the sensed reflected light; and a compensator which compensates the analog pixel value in its analog form for removing a background of the object contemporaneously as the object is being read in line(s).

Type: Grant

Filed: October 24, 2008

Date of Patent: May 8, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventor: Seok-Ho Kim
Apparatus and method for improving text recognition capability

Patent number: 8175380

Abstract: Disclosed are an apparatus and a method for text recognition capability using a camera provided in a mobile communication terminal. Image pre-processing discriminates a text color and a text-background color in an input image, and unifies regions except the text into the text-background color, so that a text region and a background region surrounding the text region can be precisely separated. The image pre-processing method is adaptive to a photographing environment, whereby stable text recognition capability can be expected even if the photographing environment is variously changed.

Type: Grant

Filed: February 23, 2010

Date of Patent: May 8, 2012

Assignee: Samsung Electronics Co., Ltd.

Inventors: Sang-Ho Kim, Sang-Wook Oh, Yun-Je Oh, Seong-Taek Hwang
Method of describing the structure of graphical objects

Patent number: 8171391

Abstract: The proposed technical solution allows processing of machine-readable forms of unfixed format. It comprises a method of specifying the logical structure of a document characterized by: preliminary specification of the list and descriptions of varieties of elements which may be present in the form, specifying an algorithm of setting the search constraints for every element, description of at least the following characteristics of search for every simple or compound element—the spatial characteristics of the search area and the parametric characteristics of the element, description of the method of identification of obtained elements, testing the type of the element, testing the properties which are typical of the type, testing the completeness of composition of the parts of the element.

Type: Grant

Filed: November 3, 2006

Date of Patent: May 1, 2012

Assignee: ABBYY Software, Ltd

Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
Image text to character information conversion

Patent number: 8155444

Abstract: Converting text may be provided. A user selectable element may be used to select a text. The selected text may include a first text within an electronic document and a second text within an image. The second text within the image may be converted to character information by receiving the image. The image may have image character information and an image type. An aspect of the received image may be adjusted based on the image type. Optical character recognition may be performed on the adjusted image to extract character information. The character information may include characters and corresponding location information for the characters. The extracted character information may be evaluated to improve the recognition quality of the extracted character information as compared to the image character information.

Type: Grant

Filed: January 15, 2007

Date of Patent: April 10, 2012

Assignee: Microsoft Corporation

Inventors: Alex J. Simmons, Radoslav P. Nickolov, Peter Baer, Vincent Lascaux, Igor Kofman
Image processing apparatus, method, and processing program for image inversion with tree structure

Patent number: 8155445

Abstract: The present invention relates to an image processing method, an image processing apparatus and an image processing program for dealing with inverted characters (outlined characters) constituted by white pixels on a black ground in a tree structure same as that of normal characters constituted by black pixels on a white ground. In the present invention, black pixel blocks and white pixel blocks are sampled recursively from a binary image, tree structure data indicating a positional relation between the sampled black pixel blocks and white pixel blocks is created, an inverted image is created by white-black-inverting the insides of black pixel blocks that can include inverted characters, of black pixel blocks included in the tree structure data, white pixel blocks and black pixel blacks are sampled from the created inverted image, and data regarding the sampled white pixel blocks and black pixel blocs is added to corresponding nodes of the tree structure data.

Type: Grant

Filed: September 25, 2007

Date of Patent: April 10, 2012

Assignee: Canon Kabushiki Kaisha

Inventor: Tomotoshi Kanatsu
Automated processing of paper forms using remotely-stored templates

Patent number: 8150156

Abstract: A computer-implemented method for processing paper forms includes accepting a filled-in paper form conforming to a template at a computer system having a local memory, wherein the template is not stored in the local memory. Identification information is extracted from the filled-in paper form using the computer system. The identification information indicates a network address of a remote storage location external to the computer system, in which the template is stored. The template is retrieved responsively to the identification information by communication with the remote storage location via a wide area network (WAN). The filled-in paper form is processed responsively to the retrieved template.

Type: Grant

Filed: January 4, 2006

Date of Patent: April 3, 2012

Assignee: International Business Machines Corporation

Inventors: Amir Geva, Ehud Karnin, Eugeniusz Walach
Information processing apparatus, method, and recording medium storing program for modifying print instructions

Patent number: 8149432

Abstract: An information processing apparatus that can be connected to an image-forming apparatus, a method, and a program used for the information processing apparatus are disclosed. The information processing apparatus comprises a control unit for controlling print-setting information set for document data to be printed, a recognition unit for recognizing information about a first function specified by the print-setting information by translating the print-setting information controlled by the control unit, an obtaining unit for obtaining information about a second function of the image-forming apparatus connected to the information processing apparatus, a determination unit for determining whether or not the image-forming apparatus can perform the first function recognized by the recognition unit based on the second-function information obtained by the obtaining unit, and a modification unit for modifying the print-setting information controlled by the control unit based on the determination result.

Type: Grant

Filed: October 19, 2010

Date of Patent: April 3, 2012

Assignee: Canon Kabushiki Kaisha

Inventors: Junichiro Kizaki, Satoshi Nishikawa
Methods and Systems for Automatic Extraction and Retrieval of Auxiliary Document Content

Publication number: 20120076413

Abstract: Aspects of the present invention are related to systems and methods for automatically extracting, from a document image, references to relevant external content and automatically retrieving the external content associated with the references.

Type: Application

Filed: September 27, 2010

Publication date: March 29, 2012

Inventor: Ahmet Mufit FERMAN
External Image Based Summarization Techniques

Publication number: 20120076414

Abstract: Techniques involve visually summarizing documents (e.g., search results, a collection of documents, etc.) using images which are visually representative of the documents for which the images represent. The images representing the documents may be external images obtained from sources other than the documents. The external images may be obtained from the sources other than the documents by performing a separate image based search using key phrases from the documents rather than extracting the images directly from within the documents themselves. Alternatively, an algorithm may be used to determine an image type, which may be chosen from a selection of external images, thumbnail images, or internal imaged taken directly from the collection of documents, that is suited to represent each document in the collection of documents. A snippet of the documents may be displayed along with the images which visually represent each of the documents.

Type: Application

Filed: September 27, 2010

Publication date: March 29, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Jizheng Xu, Binxing Jiao, Feng Wu

prev … 8 9 10 11 12 13 14 15 16 … next