Distinguishing Text From Other Regions Patents (Class 382/176)
  • Publication number: 20120213429
    Abstract: A system and method for extracting flowchart information from digital images is provided. The method includes converting the digital flowchart image into a grayscale image and then binarizing the image. The method further includes extracting and masking text data from the binarized image. Further, flow lines connecting geometric components within the flowchart image are extracted and masked. The geometric components are classified into one or more categories and the flow line relationships between the geometric components are extracted. Finally, the extracted text data, flow line relationship information and geometric component information is stored in a database.
    Type: Application
    Filed: March 28, 2011
    Publication date: August 23, 2012
    Applicant: INFOSYS TECHNOLOGIES LIMITED
    Inventors: Bintu Gopalan Vasudevan, Sorawish Dhanapanichkul, Rajesh Balakrishnan
  • Publication number: 20120213441
    Abstract: The system contains a scanner, an apparatus for scanning receipts into a computer and a unique software program which automatically processes, organizes and saves expense information that can be viewed in various formats, namely, tabular statements, pie-charts, etc. The scanner, which accommodates paper of differing sizes, is used to input bills, receipts, bank statements, etc. The scanner is usually connected to a computer through a Universal Serial Bus or a parallel port for easy installation. The software program creates a text file of the scanned data by inclusion of sorting, categories, etc., and automatically saves the information in Quicken Interchange Format, allowing it to be imported into any financial management software for further processing. Each receipt is treated as an individual transaction. Multiple items in the receipt are used to create a “split” transaction with proper customizable categories added. Further, the software also allows for record keeping, budgeting and budget balancing.
    Type: Application
    Filed: April 30, 2012
    Publication date: August 23, 2012
    Applicant: KRIS ENGINEERING, INC.
    Inventor: Radha K. C. Pandipati
  • Patent number: 8249351
    Abstract: A method for assisting in the creation of a logical structure model, which stores, from an image in which character strings associated respectively with a plurality of logical elements constituting a logical structure are described, the logical elements, character strings associated with the logical elements, and the logical structure, wherein character strings in an input image and the logical structure among the character strings in the input image are extracted, a logical element is selected among the plurality of logical elements according to the degrees of similarity between the extracted character strings and the character string associated respectively with the plurality of logical elements stored in the logical structure model, a character string associated with the selected logical element and a character string in the input image associated with the logical element based on the logical structure among the extracted character strings in the input image are extracted.
    Type: Grant
    Filed: December 4, 2008
    Date of Patent: August 21, 2012
    Assignee: Fujitsu Limited
    Inventors: Noriaki Ozawa, Yoshinobu Hotta, Hiroaki Takebe, Yusaku Fujii, Akihiro Minagawa, Hiroshi Tanaka, Katsuhito Fujimoto
  • Patent number: 8248662
    Abstract: An image forming apparatus and a method of using the same, the image forming apparatus including: a detection unit to detect an edge of an input image; a categorization unit to categorize the detected edge, according to a gray value and line width; and a compensation unit to compensate the gray value according to the categorized edge type.
    Type: Grant
    Filed: November 20, 2007
    Date of Patent: August 21, 2012
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Hyeon-seok Seo
  • Patent number: 8249347
    Abstract: Present invention relates to a method and system for automatic searching for information on a network in response to an image query sent by a user. The image query includes an image that is captured by using a mobile communications device with a camera. The image is processed to detect the text present in it. The detected text is then recognized using an OCR. Subsequently, the text is searched for matches in the corresponding domain database, selected from the various domain databases present in the network. Thereafter, selected matches and additional related information is sent to the user.
    Type: Grant
    Filed: May 19, 2011
    Date of Patent: August 21, 2012
    Assignee: A9.com, Inc.
    Inventors: Gurumurthy D. Ramkumar, Raghavan Manmatha, Supratik Bhattacharyya, Gautam Bhargava, Mark Ruzon
  • Patent number: 8249309
    Abstract: A portable reading machine detects poor image conditions for performing optical character recognition processing. The portable reading machine receives an image of sufficient resolution to distinguish lines of text but not necessarily of sufficient resolution to distinguish individual characters and processes the image to determine imaging conditions from the image. The reading machine reports imaging conditions to the user.
    Type: Grant
    Filed: April 1, 2005
    Date of Patent: August 21, 2012
    Assignee: K-NFB Reading Technology, Inc.
    Inventors: Raymond C. Kurzweil, Paul Albrecht, James Gashel, Lucy Gibson, Lev Lvovsky
  • Publication number: 20120207390
    Abstract: Systems and methods for replacing non-image text are provided. One method for replacing non-image text includes padding a first data representing an image of text to create an image segment. The method includes replacing a second data representing non-image text with the image segment.
    Type: Application
    Filed: February 14, 2011
    Publication date: August 16, 2012
    Inventors: Craig P. Sayers, Prakash Reddy
  • Publication number: 20120207391
    Abstract: A printer, scanner device and methods for using same are described herein. A printer device may include a dedicated input that, when actuated, generates and sends a request to a computer for known data or a predetermined print job, e.g., schedule information from a personal information management (PIM) application. A scanner device may include another dedicated input that, when actuated, automatically scans a document fed to the device by the user and sends the scanned image to IM (or other) software on a computer, bypassing the need to manipulate the scanned image using scanner software. The device may be used with printed metapaper, which includes a barcode or other indicia identifying the metapaper and corresponds to a stored template image of the metapaper. When the metapaper is rescanned, the scan can be compared to the stored template information to identify changes and synchronize the changes with the IM software.
    Type: Application
    Filed: February 3, 2012
    Publication date: August 16, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Daniel Allen Rosenfeld, Kumar H. Chellapilla
  • Publication number: 20120201458
    Abstract: A system, method, and computer program product are provided for determining whether text within an image includes unwanted data, utilizing a matrix. In operation, a matrix corresponding to an image is generated. Additionally, text within the image is identified utilizing the matrix. Furthermore, it is determined whether the text includes unwanted data.
    Type: Application
    Filed: April 16, 2012
    Publication date: August 9, 2012
    Inventor: Udhayakumar Lakshmi Narayanan
  • Publication number: 20120201457
    Abstract: Methods and system employing the same for finding repeated structure for data extraction from document images are provided. A reference record and one or more reference fields thereof are identified from a document image. One or more candidate fields are generated for each of the reference fields. One or more best candidate records from the candidate fields are selected using a probabilistic model and an optimal record set is determined from the best candidate records.
    Type: Application
    Filed: February 8, 2011
    Publication date: August 9, 2012
    Applicant: PALO ALTO RESEARCH CENTER INCORPORATED
    Inventors: Evgeniy Bart, Prateek Sarkar, Eric Saund
  • Patent number: 8238663
    Abstract: A similar image search apparatus includes a storage unit, a search unit, a text feature selection unit, an image feature transformation unit and a similar image search unit. The storage unit stores images and pieces of text information associated with the respective images. The search unit retrieves candidate images. Each candidate image has a similar image feature to a image feature of a key image. The text feature selection unit select a text feature of the respective candidate images which satisfies a given selecting condition. The image feature transformation unit, base on the selected text feature, transforms the image features. The similar image search unit retrieves similar images from the candidate images based on the transformed image features. The image features of the similar images are similar to the image feature of the key image.
    Type: Grant
    Filed: August 13, 2008
    Date of Patent: August 7, 2012
    Assignee: Fuji Xerox Co., Ltd.
    Inventor: Noriji Kato
  • Publication number: 20120195505
    Abstract: Methods are systems are provided that include obtaining a digital image from a digital photograph, such as may be taken by a digital camera or a camera phone. The digital image includes, for example, a URI or URL, which may be contained within a visible frame. A character recognition technique, such as an optical character recognition technique, may be used to recognize the URI or URL from the digital image. The URI or URL may be used to access a corresponding Web page. The character recognition technique may be applied on the digital camera or cell phone itself, or remotely.
    Type: Application
    Filed: January 31, 2011
    Publication date: August 2, 2012
    Applicant: Yahoo! Inc.
    Inventor: Jin Suk Park
  • Patent number: 8233713
    Abstract: An image processing method, for receiving an input image and separating pixels having text characteristics and pixels having figure characteristics, includes: applying a first filtering processing for the input image to derive a first image processing result; applying a second filtering processing for the first image processing result to derive a second image processing result, wherein a distribution of filtering parameters of the first filtering processing is different from a distribution of filtering parameters of the second filtering processing; deriving a set of first reference values according to the first image processing result and the second image processing result; and determining whether each pixel within the input image is a text pixel or a figure pixel according to at least the set of the first reference values and a predetermined threshold.
    Type: Grant
    Filed: January 14, 2010
    Date of Patent: July 31, 2012
    Assignee: Primax Electronics Ltd.
    Inventors: Hui-Jan Chien, Tsai-Hsing Chen, Li-Kai Cho, Chiung-Sheng Wang, Sung-Hui Lin
  • Publication number: 20120189202
    Abstract: A handwritten area is separated from image data of printed material in which handwriting has been inserted, and the separated handwritten area is identified as an enclosing line or a class symbol. An image area enclosed within the handwritten area identified as the enclosing line is extracted and acquired as an extracted image. The class symbol is correlated to an enclosing line drawn nearest the class symbol, and the extracted images are classified into groups according to the image areas within the enclosing line correlated to the type of class symbols. The grouped images are organized as listed data.
    Type: Application
    Filed: January 3, 2012
    Publication date: July 26, 2012
    Applicant: MURATA MACHINERY LTD.
    Inventor: Nariyasu KAN
  • Patent number: 8229238
    Abstract: The invention provides an image encoding apparatus which can improve image quality of an output image while further reduce the amount of attribute. A determination unit determines an area including a character/line drawing as a foreground image area based on an input multi-valued image. A foreground image generator generates foreground image in binary representation so that a first encoder performs MMR encoding on the foreground image. A background image generator generates multi-valued background image data by replacing the value of a multi-valued pixel in a position of the character/line drawing in the foreground image area with a replacement value calculated from the pixel values in a position of the non-character/line drawing pixel. A second encoder performs JPEG encoding on the background image. A mask unit masks attribute for pixels within the foreground image area with a predetermined value to output the masked data to a third encoder.
    Type: Grant
    Filed: February 5, 2009
    Date of Patent: July 24, 2012
    Assignee: Canon Kabushiki Kaisha
    Inventor: Yuki Matsumoto
  • Patent number: 8228561
    Abstract: An image processing system utilizes an image type classification circuit to identify inputted image data as picture image data or text/graphics image data. A halftone circuit, operatively connected to the image type classification circuit, converts the inputted image data, identified as picture image data, to halftone image data. Moreover, a tile pattern circuit, operatively connected to the image type classification circuit, to replace the inputted image data, identified as text/graphics image data, with tile patterns. The tile patterns are encoded with a predetermined pattern. A bitmap rendering circuit combines the halftone image data with the encoded tile patterns to render a bitmap, wherein the bitmap can be used by a print engine to reproduce the image.
    Type: Grant
    Filed: March 30, 2007
    Date of Patent: July 24, 2012
    Assignee: Xerox Corporation
    Inventor: Michael Dale Stevens
  • Publication number: 20120183174
    Abstract: A system, method, and computer program product are provided for preventing data loss associated with an image. In use, an image is identified, and it is determined whether the image includes predetermined data. In addition, an action is performed based on the determination, for preventing data loss.
    Type: Application
    Filed: March 24, 2012
    Publication date: July 19, 2012
    Inventors: Prasanna Ganapathi Basavapatna, Gopi Krishna Chebiyyam
  • Publication number: 20120177290
    Abstract: A method for locating tables in documents includes defining a plurality of tiles for a document, for each tile, determining a horizontal profile and a vertical profile, determining the location of lines by means of gradients of the horizontal profiles and the vertical profiles, selecting from the lines, the lines that are persistent, determining a rectangle in at least one corner of the document based on the persistent lines, and applying heuristics in order to accept or reject a determined rectangle as a table of the document. An apparatus for automatically locating a table in a document applies the method for locating tables in documents.
    Type: Application
    Filed: January 27, 2012
    Publication date: July 12, 2012
    Applicant: OCE TECHNOLOGIES B.V.
    Inventors: Vincent Jean-Marie Noël Le Glaunec, Christophe Antoine Leynadier
  • Patent number: 8218863
    Abstract: An image processing apparatus which extracts, from image data, drawing-photograph pixels forming a drawing or a photograph, the image processing apparatus including a pixel value replacement unit configured to replace pixel values of image data with plural representative pixel values; a candidate region extraction unit configured to extract plural candidate regions; a feature value acquisition unit configured to acquire a feature value indicating a degree of contained symbol pixels forming symbols; a feature value determination units.
    Type: Grant
    Filed: January 21, 2009
    Date of Patent: July 10, 2012
    Assignee: Ricoh Company, Ltd.
    Inventor: Fumihiro Hasegawa
  • Patent number: 8218875
    Abstract: A method and system for preprocessing an image for Optical Character Recognition (OCR), wherein the image includes a plurality of columns is disclosed. Each column includes one or more of Arabic text and non-text items. The method includes determining a plurality of components associated with one or more of the Arabic text and the non-text items, wherein a component includes a set of connected pixels. On determining the plurality of components, a line height and a column spacing is determined for the plurality of components. The plurality of components are then associated with a column of the plurality of columns based on the line height and the column spacing. Subsequently, a set of characteristic parameters are calculated for each column and the plurality of components of each column are merged based on the set of characteristic parameters to form sub-words and words.
    Type: Grant
    Filed: June 12, 2010
    Date of Patent: July 10, 2012
    Inventors: Hussein Khalid Al-Omari, Mohammad Sulaiman Khorsheed
  • Patent number: 8213718
    Abstract: A method for video mode detection, wherein video input data (VID) corresponding to a video picture (P) is received and a video mode is determined for said video picture (P). The determining of said video mode depends on a local video mode (LVM) and a global video mode (GVM) of said video picture (P). Said global video mode (GVM) is determined for said video picture (P) based on said video input data (VID) or a derivative (m1) thereof. For determining said local video mode (LVM), first said video picture (P) is subdivided into a ticker area (TA) and a remaining area (RA), thereby generating ticker area data (TAD). Then, said local video mode (LVM) is determined for said ticker area (TA) based on said ticker area data (TAD). When determining said local video mode (LVM), said ticker area (TA) is subdivided into n sub-areas, and at least one of said n sub-areas (1 . . . 6) is selected as selected sub-area (SSA).
    Type: Grant
    Filed: March 22, 2007
    Date of Patent: July 3, 2012
    Assignee: Sony Deutschland GmbH
    Inventors: Sergio Mayoral, Oliver Erdler
  • Patent number: 8213735
    Abstract: Methods and apparatus for binarizing images represented by sets of multivalent pixel values in a computationally efficient manner are described In a grayscale image to be binarized, one group of pixel values represents “foreground”, e.g., text to be converted to black, while another group represents a shaded “background” region to be converted, e.g., to white. The difference between foreground and background is often a function of the scale of the image components, e.g., text and/or other images. Filters in the form of morphological operators, computationally efficient quick-open and quick-close morphological operators are employed to binarize images, e.g., grayscale images. The methods and apparatus effectively handle both smooth and sharp image background structures in a computationally efficient manner.
    Type: Grant
    Filed: October 9, 2009
    Date of Patent: July 3, 2012
    Assignee: Accusoft Corporation
    Inventors: Erica Drew Cooksey, William Douglas Withers
  • Patent number: 8213717
    Abstract: A document processing apparatus includes a marking detection part that detects a marking written on the form from data read by a first reading part, an attribute name extraction part that extracts a character string described beforehand within or near a marking area of the detected marking as an attribute name, an attribute name detection part that detects the attribute name, extracted by the attribute name extraction part, stored in an attribute information memory and specifies the descriptive position of the detected attribute name from the data read by a second reading part that reads the form on which the attribute values are entered, and an attribute value extraction part that extracts the character string around the detection position of the attribute name detected from the read data, and registers the extracted character string as the attribute value of the attribute associated with the attribute name in the attribute information memory.
    Type: Grant
    Filed: August 2, 2007
    Date of Patent: July 3, 2012
    Assignee: Fuji Xerox Co., Ltd.
    Inventors: Yuuya Konno, Masahiro Kato, Katsuhiko Itonori, Etsuko Ito
  • Publication number: 20120163718
    Abstract: Data representing an image of text is received, as is data representing the text in non-image form. A valid content boundary within the image of the text is determined. For each character within the text in the non-image form, a location of the character within the image of the text is determined. Where the location of the character within the image of the text falls outside the valid content boundary, the character is removed from the data representing the text in the non-image form.
    Type: Application
    Filed: December 28, 2010
    Publication date: June 28, 2012
    Inventor: Prakash Reddy
  • Patent number: 8208171
    Abstract: The present invention aims to prevent a problem that an image on a document sheet is erased due to misdetection of a line-shaped noise. A copy machine 1 compares RGB values of a target pixel with averaged RGB values (Step S103). If only one of the RGB values has a difference that is greater than a prescribed value Ref2 (Step S103: YES), the copy machine 1 extracts the target pixel as a line-shaped noise pixel, and moves to a line-shaped noise correction (Step S108) while holding the address of the target pixel in a line-shaped noise address storing area 49b. If two of the RGB values have differences (Step S103: NO, Step S104: YES) and a difference between these two of the RGB values is no greater than a prescribed value Ref3 (Step S105: YES), the copy machine 1 extracts the target pixel as a line-shaped noise pixel, and moves to the line-shaped noise correction (Step S108) while holding the address of the target pixel in the line-shaped noise address storing area 49b.
    Type: Grant
    Filed: December 10, 2008
    Date of Patent: June 26, 2012
    Assignee: Konica Minolta Business Technologies, Inc.
    Inventors: Hiroaki Kubo, Nobuhiro Mishima
  • Patent number: 8208737
    Abstract: The present invention relates to systems and methods for identifying captions associated with images in media material. A captioner includes a selector module and a caption identifier module. The selector module identifies text-blocks potentially associated with images in the media material. The caption identifier module identifies which text-blocks are captions associated with images in the media material, based on the textual and proximity features of the text-block and the images. The captioner may also include a caption feedback module to modify the determining of the caption identifier module.
    Type: Grant
    Filed: April 17, 2009
    Date of Patent: June 26, 2012
    Assignee: Google Inc.
    Inventor: Eugene Ie
  • Patent number: 8208744
    Abstract: An image processing apparatus separates in a scanned image a text area from a graphic area primarily including a graphic form or a graph. For the text area, neighboring black pixels are connected to perform character determination in a unit of a rectangle obtained by connecting the black pixels. For the graphic area, labeling processing is used to extract a circumscribed rectangle of consecutive black pixels, without connecting the black pixels, to perform character determination in a unit of the circumscribed rectangle.
    Type: Grant
    Filed: June 6, 2006
    Date of Patent: June 26, 2012
    Assignee: Konica Minolta Business Technologies, Inc.
    Inventor: Toshihiro Mori
  • Patent number: 8200015
    Abstract: In the method according to at least one embodiment of the invention, an image data record having a structure to be segmented is first of all displayed by display equipment. Using an input apparatus, a segmentation algorithm to be used is selected from a group of different segmentation algorithms, including a contour-based segmentation algorithm, a region-based segmentation algorithm and manual segmentation, based on the local image contrast in a region to be segmented in the image data record. A region to be segmented in the image data record is marked, and the structure to be segmented in the marked region is segmented using the selected segmentation algorithm, and a segmentation result of the segmentation is displayed. This procedure (selecting a segmentation algorithm/marking a region/segmenting the region/displaying) is repeated until the structure to be segmented is completely segmented in the displayed image data record and a boundary line of the structure is produced as the final segmentation result.
    Type: Grant
    Filed: June 18, 2008
    Date of Patent: June 12, 2012
    Assignee: Siemens Aktiengesellschaft
    Inventors: Matthias Fenchel, Andreas Schilling, Stefan Thesen
  • Patent number: 8200044
    Abstract: An image analyser analyses regions of an image. An image scaler may then scale the image adaptively, in dependence on the nature of region of the image being scaled. In one embodiment, adjacent pixels are analysed to determine their frequency content. This frequency analysis provides an indication of whether the pixels likely contain hard edges, discontinuities or variations typical of computer generated graphics. As a result of the analysis, the type of scaling suited for scaling the image portion containing the pixels may be assessed. Adjacent pixels having high frequency components may be scaled by a scaling circuit that introduces limited ringing. Adjacent pixels having lower frequency components may be scaled using a higher-order multi-tap scaler. Resulting scaled pixels may be formed as a blended combination of the two different scaling techniques.
    Type: Grant
    Filed: January 3, 2007
    Date of Patent: June 12, 2012
    Assignee: Broadcom Corporation
    Inventor: Edward George Callway
  • Patent number: 8200012
    Abstract: A preprocessing section binarizes input image data and calculates a total black pixel ratio. A feature extracting section detects connected components contained in the binarized image data and detects circumscribing bounding boxes that circumscribe these connected components, respectively. Based on sizes of the circumscribing bounding boxes detected and numbers of black pixels contained therein, predetermined connected components are removed. A determining section generates an edge map by using the residual connected components, and performs two-dimensional fast Fourier transform thereon to generate spectral data. The determining section performs two-dimensional fast Fourier transform on template images to generate spectral data. The determining section determines, based on these pieces of spectral data, whether or not a circular shape is contained in the input image data.
    Type: Grant
    Filed: February 26, 2009
    Date of Patent: June 12, 2012
    Assignee: Sharp Kabushiki Kaisha
    Inventors: Jilin Li, Zhi-Gang Fan, Yadong Wu, Bo Wu
  • Patent number: 8194092
    Abstract: An image processing method for reducing a power consumption. The image processing method may reduce the power consumption by classifying an input content into a conversion target region and a preservation target region and by converting a luminance of pixels included in the conversion target region. Also, the image processing method may effectively perform a luminance conversion for pixels by separating the input content into the conversion target region and the preservation target region based on a luminance of the pixels of the input content. The image processing method may convert the luminance of the pixels of the conversion target region to maintain a contrast between text pixels and background pixels.
    Type: Grant
    Filed: March 20, 2009
    Date of Patent: June 5, 2012
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Young Ran Han, Seung Sin Lee, Du-Sik Park
  • Patent number: 8194982
    Abstract: In a document-image-data providing device, a document image inputting unit is configured to input document image data. An area recognition unit is configured to recognize a text area of a document image element containing text data among document image elements constituting the document image data, and another area of a document image element containing data other than the text data. A text data acquiring unit is configured to acquire text data contained in the recognized text area. A providing unit is configured to provide, in response to a document image data request received from the information processing device, both image data generated from the input document image data to have a resolution lower than a resolution of the input document image data and the text data acquired by the text data acquiring unit, to the information processing device.
    Type: Grant
    Filed: September 12, 2008
    Date of Patent: June 5, 2012
    Assignee: Ricoh Company, Ltd.
    Inventor: Masajiro Iwasaki
  • Publication number: 20120134588
    Abstract: A “Text Rectifier” provides various techniques for processing selected regions of an image containing text or characters by treating those images as matrices of low-rank textures and using a rank minimization technique that recovers and removes image deformations (e.g., affine and projective transforms as well as general classes of nonlinear transforms) while rectifying the text or characters in the image region. Once distortions have been removed and the text or characters rectified, the resulting text is made available for a variety of uses or further processing such as optical character recognition (OCR). In various embodiments, binarization and/or inversion techniques are applied to the selected image regions during the rank minimization process to both improve text rectification and to present the resulting images of text to an OCR engine in a form that enhances the accuracy of the OCR results.
    Type: Application
    Filed: December 3, 2011
    Publication date: May 31, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Xin Zhang, Zhengdong Zhang, Xiao Liang, Zhouchen Lin, Yi Ma
  • Patent number: 8189917
    Abstract: Aspects of the present invention are related to systems and methods for locating text in a digital image.
    Type: Grant
    Filed: September 25, 2008
    Date of Patent: May 29, 2012
    Assignee: Sharp Laboratories of America, Inc.
    Inventor: Richard John Campbell
  • Patent number: 8184335
    Abstract: An overall processing time to rasterize, at the first device, the electronic document to be rendered is computed. Also, a rendering time to render, at the first device, the electronic document to be rendered is computed. When the overall processing time to rasterize at the first device is greater than the rendering time to render at the first device, the electronic document to be rendered is parsed into a first document and sub-documents. A productivity capacity of each node is determined, the productivity capacity being a measured of the processing power of the node and the communication cost of exchanging information between the first device and the node. A sub-document is rasterized at a node when a productivity capacity of the node reduces the processing time to rasterize the electronic document to be rendered to be less than the computed overall processing time.
    Type: Grant
    Filed: March 25, 2008
    Date of Patent: May 22, 2012
    Assignee: Xerox Corporation
    Inventors: Hua Liu, Steven J. Harrington
  • Patent number: 8180152
    Abstract: A system, method, and computer program product are provided for determining whether text within an image includes unwanted data, utilizing a matrix. In operation, a matrix corresponding to an image is generated. Additionally, text within the image is identified utilizing the matrix. Furthermore, it is determined whether the text includes unwanted data.
    Type: Grant
    Filed: April 14, 2008
    Date of Patent: May 15, 2012
    Assignee: McAfee, Inc.
    Inventor: Udhayakumar Lakshmi Narayanan
  • Patent number: 8180153
    Abstract: A method, system and data structure for providing a 3+1 layer MRC image, including a black text layer. The black text layer includes pixel data corresponding to black text in an image and may be assigned a predetermined value for the color of black. According to one or more embodiments, using thresholding processing along with various morphological operations, the black text layer may be generated.
    Type: Grant
    Filed: December 5, 2008
    Date of Patent: May 15, 2012
    Assignee: Xerox Corporation
    Inventors: Amal Malik, Xing Li
  • Publication number: 20120114242
    Abstract: Characters represented within a frame of a television presentation are identified. A pattern formed by a subset of the characters is identified if the pattern is indicative of an addressing datum. A provision is made for a selection of characters that form the pattern indicative of the addressing datum. In one embodiment, a web page is displayed upon a selection of characters that form a pattern indicative of a uniform resource locator for the web page.
    Type: Application
    Filed: January 19, 2012
    Publication date: May 10, 2012
    Applicant: JLB VENTURES LLC
    Inventor: Dan Kikinis
  • Publication number: 20120114241
    Abstract: Methods, systems, and apparatus including computer program products for using extracted image text are provided. In one implementation, a computer-implemented method is provided. The method includes receiving an input of one or more image search terms and identifying keywords from the received one or more image search terms. The method also includes searching a collection of keywords including keywords extracted from image text, retrieving an image associated with extracted image text corresponding to one or more of the image search terms, and presenting the image.
    Type: Application
    Filed: January 13, 2012
    Publication date: May 10, 2012
    Applicant: GOOGLE INC.
    Inventors: Luc Vincent, Adrian Ulges
  • Patent number: 8175397
    Abstract: The present invention relates to a device configured to determine whether to perform transform processing from image data into vector data in accordance with characteristics of an object. The device is configured: to separate an object from image data; then to determine whether to transform image data corresponding to the object into vector data; subsequently, to extract contour data of the object that has been determined to be transformed into vector data; and to perform function approximation for the extracted contour data.
    Type: Grant
    Filed: September 18, 2008
    Date of Patent: May 8, 2012
    Assignee: Canon Kabushiki Kaisha
    Inventor: Naoki Ito
  • Patent number: 8175388
    Abstract: Systems, methods, and apparatus, including software tangibly stored on a computer readable medium, involve identifying text in an electronic document. An electronic document that includes an image object is received. In a first region of the image object, a first set of text characters having a first orientation in the image object are recognized. In a second region of the image object, a second set of text characters having a second orientation in the image object are recognized. The electronic document is modified to include a first text object containing an identification of the first set of text characters and a second text object containing an identification of the second set of text characters. The identification of the first set of text characters includes a first set of values. Each value in the first set of values represent an individual text character recognized in the first region. The identification of the second set of text characters includes a second set of values.
    Type: Grant
    Filed: January 30, 2009
    Date of Patent: May 8, 2012
    Assignee: Adobe Systems Incorporated
    Inventor: Maurice D. Fisher
  • Patent number: 8175386
    Abstract: An image acquiring apparatus includes: an image sensor which senses light reflected from an object to be read, and which detects an analog pixel value corresponding to the sensed reflected light; and a compensator which compensates the analog pixel value in its analog form for removing a background of the object contemporaneously as the object is being read in line(s).
    Type: Grant
    Filed: October 24, 2008
    Date of Patent: May 8, 2012
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Seok-Ho Kim
  • Patent number: 8175380
    Abstract: Disclosed are an apparatus and a method for text recognition capability using a camera provided in a mobile communication terminal. Image pre-processing discriminates a text color and a text-background color in an input image, and unifies regions except the text into the text-background color, so that a text region and a background region surrounding the text region can be precisely separated. The image pre-processing method is adaptive to a photographing environment, whereby stable text recognition capability can be expected even if the photographing environment is variously changed.
    Type: Grant
    Filed: February 23, 2010
    Date of Patent: May 8, 2012
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Sang-Ho Kim, Sang-Wook Oh, Yun-Je Oh, Seong-Taek Hwang
  • Patent number: 8171391
    Abstract: The proposed technical solution allows processing of machine-readable forms of unfixed format. It comprises a method of specifying the logical structure of a document characterized by: preliminary specification of the list and descriptions of varieties of elements which may be present in the form, specifying an algorithm of setting the search constraints for every element, description of at least the following characteristics of search for every simple or compound element—the spatial characteristics of the search area and the parametric characteristics of the element, description of the method of identification of obtained elements, testing the type of the element, testing the properties which are typical of the type, testing the completeness of composition of the parts of the element.
    Type: Grant
    Filed: November 3, 2006
    Date of Patent: May 1, 2012
    Assignee: ABBYY Software, Ltd
    Inventors: Konstantin Zuev, Diar Tuganbaev, Irina Filimonova
  • Patent number: 8155444
    Abstract: Converting text may be provided. A user selectable element may be used to select a text. The selected text may include a first text within an electronic document and a second text within an image. The second text within the image may be converted to character information by receiving the image. The image may have image character information and an image type. An aspect of the received image may be adjusted based on the image type. Optical character recognition may be performed on the adjusted image to extract character information. The character information may include characters and corresponding location information for the characters. The extracted character information may be evaluated to improve the recognition quality of the extracted character information as compared to the image character information.
    Type: Grant
    Filed: January 15, 2007
    Date of Patent: April 10, 2012
    Assignee: Microsoft Corporation
    Inventors: Alex J. Simmons, Radoslav P. Nickolov, Peter Baer, Vincent Lascaux, Igor Kofman
  • Patent number: 8155445
    Abstract: The present invention relates to an image processing method, an image processing apparatus and an image processing program for dealing with inverted characters (outlined characters) constituted by white pixels on a black ground in a tree structure same as that of normal characters constituted by black pixels on a white ground. In the present invention, black pixel blocks and white pixel blocks are sampled recursively from a binary image, tree structure data indicating a positional relation between the sampled black pixel blocks and white pixel blocks is created, an inverted image is created by white-black-inverting the insides of black pixel blocks that can include inverted characters, of black pixel blocks included in the tree structure data, white pixel blocks and black pixel blacks are sampled from the created inverted image, and data regarding the sampled white pixel blocks and black pixel blocs is added to corresponding nodes of the tree structure data.
    Type: Grant
    Filed: September 25, 2007
    Date of Patent: April 10, 2012
    Assignee: Canon Kabushiki Kaisha
    Inventor: Tomotoshi Kanatsu
  • Patent number: 8150156
    Abstract: A computer-implemented method for processing paper forms includes accepting a filled-in paper form conforming to a template at a computer system having a local memory, wherein the template is not stored in the local memory. Identification information is extracted from the filled-in paper form using the computer system. The identification information indicates a network address of a remote storage location external to the computer system, in which the template is stored. The template is retrieved responsively to the identification information by communication with the remote storage location via a wide area network (WAN). The filled-in paper form is processed responsively to the retrieved template.
    Type: Grant
    Filed: January 4, 2006
    Date of Patent: April 3, 2012
    Assignee: International Business Machines Corporation
    Inventors: Amir Geva, Ehud Karnin, Eugeniusz Walach
  • Patent number: 8149432
    Abstract: An information processing apparatus that can be connected to an image-forming apparatus, a method, and a program used for the information processing apparatus are disclosed. The information processing apparatus comprises a control unit for controlling print-setting information set for document data to be printed, a recognition unit for recognizing information about a first function specified by the print-setting information by translating the print-setting information controlled by the control unit, an obtaining unit for obtaining information about a second function of the image-forming apparatus connected to the information processing apparatus, a determination unit for determining whether or not the image-forming apparatus can perform the first function recognized by the recognition unit based on the second-function information obtained by the obtaining unit, and a modification unit for modifying the print-setting information controlled by the control unit based on the determination result.
    Type: Grant
    Filed: October 19, 2010
    Date of Patent: April 3, 2012
    Assignee: Canon Kabushiki Kaisha
    Inventors: Junichiro Kizaki, Satoshi Nishikawa
  • Publication number: 20120076413
    Abstract: Aspects of the present invention are related to systems and methods for automatically extracting, from a document image, references to relevant external content and automatically retrieving the external content associated with the references.
    Type: Application
    Filed: September 27, 2010
    Publication date: March 29, 2012
    Inventor: Ahmet Mufit FERMAN
  • Publication number: 20120076414
    Abstract: Techniques involve visually summarizing documents (e.g., search results, a collection of documents, etc.) using images which are visually representative of the documents for which the images represent. The images representing the documents may be external images obtained from sources other than the documents. The external images may be obtained from the sources other than the documents by performing a separate image based search using key phrases from the documents rather than extracting the images directly from within the documents themselves. Alternatively, an algorithm may be used to determine an image type, which may be chosen from a selection of external images, thumbnail images, or internal imaged taken directly from the collection of documents, that is suited to represent each document in the collection of documents. A snippet of the documents may be displayed along with the images which visually represent each of the documents.
    Type: Application
    Filed: September 27, 2010
    Publication date: March 29, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Jizheng Xu, Binxing Jiao, Feng Wu