Segmenting Individual Characters Or Words Patents (Class 382/177)

Separating touching or overlapping characters (Class 382/178)

Segmenting hand-printed characters (Class 382/179)

Identification of regions of a document

Patent number: 8832549

Abstract: Some embodiments provide a for analyzing a document that includes a number of primitive elements. The method identifies boundaries between sets of primitive elements and identifies regions bounded by the boundaries. The method uses the identified regions to define structural elements for the document. The method defines a structured document based on the primitive elements and the structural elements.

Type: Grant

Filed: June 7, 2009

Date of Patent: September 9, 2014

Assignee: Apple Inc.

Inventors: Philip Andrew Mansfield, Michael Robert Levy
Generation of document fingerprints for identification of electronic document types

Patent number: 8831350

Abstract: Candidate identification utilizing fingerprint identification is disclosed.

Type: Grant

Filed: August 29, 2011

Date of Patent: September 9, 2014

Assignee: DST Technologies, Inc.

Inventor: Joshua O. Highley
Automatic forms processing systems and methods

Patent number: 8818100

Abstract: Systems and methods analyze the physical structure of text rows in a document image, including the positions of one or more alignments of one or more character blocks in one or more text rows of the document image. The systems and methods determine one or more groups of text rows that are placed into a class based on the structures of the text rows, such as the positions of the one or more alignments of the one or more character blocks in each text row.

Type: Grant

Filed: August 24, 2010

Date of Patent: August 26, 2014

Assignee: Lexmark International, Inc.

Inventors: Jose Eduardo Bastos dos Santos, Brian G. Anderson, Scott T. R. Coons, David E. Kelley, Humayun H. Khan, Jess B. Sturgeon, Richard L. Taylor
Methods for digital mapping and associated apparatus

Patent number: 8805078

Abstract: A method comprises extracting a local identifier (130, 730a, 730b) from an image (100, 500, 700), the image (100, 500, 700) also having positional data (120) relating to the location at which the image (100, 500, 700) was captured; and associating the extracted local identifier (130, 730a, 730b) with the corresponding positional data (120) to allow for associating the extracted local identifier with a digital map (300, 600, 800).

Type: Grant

Filed: February 8, 2010

Date of Patent: August 12, 2014

Assignee: TomTom Germany GmbH & Co. KG

Inventors: Heiko Mund, Oleg Schmelzle
Methods and systems for automatic extraction and retrieval of auxiliary document content

Patent number: 8805074

Abstract: Aspects of the present invention are related to systems and methods for automatically extracting, from a document image, references to relevant external content and automatically retrieving the external content associated with the references.

Type: Grant

Filed: September 27, 2010

Date of Patent: August 12, 2014

Assignee: Sharp Laboratories of America, Inc.

Inventor: Ahmet Mufit Ferman
SYSTEM AND METHODS FOR ARABIC TEXT RECOGNITION BASED ON EFFECTIVE ARABIC TEXT FEATURE EXTRACTION

Publication number: 20140219562

Abstract: A method for automatically recognizing Arabic text includes building an Arabic corpus comprising Arabic text files written in different writing styles and ground truths corresponding to each of the Arabic text files, storing writing-style indices in association with the Arabic text files, digitizing an Arabic word to form an array of pixels, dividing the Arabic word into line images, forming a text feature vector from the line images, training a Hidden Markov Model using the Arabic text files and ground truths in the Arabic corpus in accordance with the writing-style indices, and feeding the text feature vector into a Hidden Markov Model to recognize the Arabic words.

Type: Application

Filed: April 23, 2014

Publication date: August 7, 2014

Applicant: King Abdulaziz City for Science & Technology

Inventors: Mohammad S. Khorsheed, Hussein K. Al-Omari
Electronic book pagination

Patent number: 8798366

Abstract: An electronic book can be paginated by reference to a print version of the same book. Pages of the print version are scanned to obtain text strings and page labels corresponding to each of the pages. The text strings are then compared to the electronic book to find the best matching positions within the electronic book. The matching positions within the electronic book are then associated with the page numbers of the pages from which the matching text strings were obtained. Autocorrelation can be used to determine matching positions.

Type: Grant

Filed: December 28, 2010

Date of Patent: August 5, 2014

Assignee: Amazon Technologies, Inc.

Inventors: Derek T. Jones, Oleksandr Y. Berezhnyy
Reader with optical character recognition

Patent number: 8783570

Abstract: An imaging-based bar code reader that includes an imaging and decoding system. Focusing optics and a sensor array define a field of view. A data processor has a memory for storing a pattern definition of previously imaged OCR characters and comparing a format of said previously stored characters to a present image to determine a character content of the present image.

Type: Grant

Filed: August 21, 2007

Date of Patent: July 22, 2014

Assignee: Symbol Technologies, Inc.

Inventors: Xiaomei Wang, Christopher J. Fjellstad
System and method for performing automatic font definition

Patent number: 8787660

Abstract: A method of defining model characters of a font. The method includes receiving a string of characters, receiving an image that includes an occurrence of the string, identifying objects in the image, determining, for each respective object, which of the objects satisfies first criteria indicating that the respective object likely corresponds to a character in the string, determining, for each respective object satisfying the first criteria, which of the objects satisfies second criteria indicating that the respective object belongs to a sequence of objects likely to correspond to the string, and defining, for each respective object satisfying the second criteria, a model character for each character of the string based upon a corresponding object of the sequence of objects. The first criteria may include aspect ratio criterion, size criterion, or both, and the second criteria may include alignment criterion, spacing criterion contrast criterion, encompassment criterion, or combinations thereof.

Type: Grant

Filed: November 23, 2005

Date of Patent: July 22, 2014

Assignee: Matrox Electronic Systems, Ltd.

Inventors: Christian Simon, Sylvain Chapleau
Character recognition preprocessing method and apparatus

Patent number: 8787671

Abstract: Disclosed is a character recognition preprocessing method and apparatus for correcting a nonlinear character string into a linear character string. A binarized character string region is divided into character regions on a character-by-character basis. Upper and lower feature points of each character region are derived, and an upper boundary line, which is a curve connecting the upper feature points of the character regions, and a lower boundary line, which is a curve connecting the lower feature points of the character regions, are generated by applying cubic spline interpolation. Nonlinearity is corrected through adaptive region enlargement by using the maximum horizontal length and the maximum height of the divided character regions.

Type: Grant

Filed: March 21, 2011

Date of Patent: July 22, 2014

Assignees: Samsung Electronics Co., Ltd, Industry Foundation of Chonnam National University

Inventors: Hee-Bum Ahn, Jong-Hyun Park, Soo-Hyung Kim, Hyung-Jung Yang, Guee-Sang Lee
Recognition of numerical characters in digital images

Patent number: 8781227

Abstract: Recognition of numerical characters is disclosed, including: extracting a subimage from a received image comprising information pertaining to a plurality of numerical characters, wherein the extracted subimage is associated with one of the plurality of numerical characters; and performing recognition based at least in part on a set of topological information associated with the subimage, including: processing the subimage to obtain the set of topological information associated with the subimage; comparing the set of topological information associated with the subimage with a preset set of stored topological information; determining that in the event that the set of topological information associated with the subimage matches the preset set of stored topological information, the subimage is associated with a recognized numerical character associated with the preset set of stored topological information.

Type: Grant

Filed: August 25, 2011

Date of Patent: July 15, 2014

Assignee: Alibaba Group Holding Limited

Inventor: Xiang Sun
Triggering actions in response to optically or acoustically capturing keywords from a rendered document

Patent number: 8781228

Abstract: A system for processing text captured from rendered documents is described. The system receives a sequence of one or more words optically or acoustically captured from a rendered document by a user. The system identifies among words of the sequence a word with which an action has been associated. The system then performs the associated action with respect to the user.

Type: Grant

Filed: September 13, 2012

Date of Patent: July 15, 2014

Assignee: Google Inc.

Inventors: Martin T. King, Dale L. Grover, Clifford A. Kushler, James Q. Stafford-Fraser
System for extracting text from a plurality of captured images of a document

Patent number: 8768058

Abstract: A system including a data processing system, a network interface for communicating over a network, and a program memory storing instructions configured to cause the data processing system to implement a method for extracting textual information from images of a document containing text characters. The method includes receiving a plurality of digital images of the document over the network. Each of the captured digital images is automatically analyzed using an optical character recognition process to determine extracted textual data. The extracted textual data for the captured digital images are merged to determine the textual information for the document, wherein differences between the extracted textual data for the captured digital images are analyzed to determine the textual information for the document.

Type: Grant

Filed: May 23, 2012

Date of Patent: July 1, 2014

Assignee: Eastman Kodak Company

Inventors: Andrew C. Blose, Peter O. Stubler
Systems and methods for challenge-response animation and randomization testing

Patent number: 8769707

Abstract: Systems and methods are provided for challenge/response animation. In one implementation, a request for protected content may be received from a client, and the protected content may comprise data. A challenge phrase comprising a plurality of characters may be determined, and a computer processor may divide the challenge phrase into at least two character subsets selected from the characters comprising the challenge phrase. Each of the at least two character subsets may include less than all of the characters comprising the challenge phrase. The at least two character subsets may be sent to the client in response to the request; and an answer to the challenge phrase may be received from the client in response to the at least two character subsets. Access to the protected content may be limited based on whether the answer correctly solves the challenge phrase.

Type: Grant

Filed: September 14, 2012

Date of Patent: July 1, 2014

Assignee: AOL Inc.

Inventor: Scott Dorfman
Accurate text classification through selective use of image data

Patent number: 8768050

Abstract: Product images are used in conjunction with textual descriptions to improve classifications of product offerings. By combining cues from both text and image descriptions associated with products, implementations enhance both the precision and recall of product description classifications within the context of web-based commerce search. Several implementations are directed to improving those areas where text-only approaches are most unreliable. For example, several implementations use image signals to complement text classifiers and improve overall product classification in situations where brief textual product descriptions use vocabulary that overlaps with multiple diverse categories. Other implementations are directed to using text and images “training sets” to improve automated classifiers including text-only classifiers.

Type: Grant

Filed: June 13, 2011

Date of Patent: July 1, 2014

Assignee: Microsoft Corporation

Inventors: Anitha Kannan, Partha Pratim Talukdar, Nikhil Rasiwasia, Qifa Ke, Rakesh Agrawal
Segmentation of textual lines in an image that include western characters and hieroglyphic characters

Patent number: 8768059

Abstract: An image processing apparatus segments Western and hieroglyphic portions of textual lines. The apparatus includes an input component that receives an input image having at least one textual line. The apparatus also includes an inter-character break identifier component that identifies candidate inter-character breaks along a textual line and an inter-character break classifier component. The inter-character break classifier component classifies each of the candidate inter-character breaks as an actual break, a non-break or an indeterminate break based at least in part on the geometrical properties of each respective candidate inter-character break and the bounding boxes adjacent thereto. A character recognition component recognizes the candidate characters based at least in part on a feature set extracted from each respective candidate character that can be histogram features, Gabor features or any other feature set applicable to character recognition.

Type: Grant

Filed: January 23, 2013

Date of Patent: July 1, 2014

Assignee: Microsoft Corporation

Inventor: Ivan Mitic
Character recognition apparatus and method based on character orientation

Patent number: 8761514

Abstract: A character recognition apparatus and method based on a character orientation are provided, in which an input image is binarized, at least one character area is extracted from the binarized image, a slope value of the extracted at least one character area is calculated, the calculated slope value is set as a character feature value, and a character is recognized by using a neural network for recognizing a plurality of characters by receiving the set character feature value. Accordingly, the probability of wrongly recognizing a similar character decreases, and a recognition ratio of each character increases.

Type: Grant

Filed: February 28, 2011

Date of Patent: June 24, 2014

Assignee: Samsung Electronics Co., Ltd

Inventors: Jeong-Wan Park, Sang-Wook Oh, Do-Hyeon Kim, Hee-Bum Ahn
Information processing apparatus performing character recognition and correction and information processing method thereof

Patent number: 8755603

Abstract: An information processing apparatus includes an identifying unit, a character recognition unit, an obtaining unit, a correcting unit, and an output unit. The identifying unit identifies a still image included in a moving image. The character recognition unit performs character recognition on the still image identified by the identifying unit. The obtaining unit obtains information about the moving image. The correcting unit corrects, on the basis of the information obtained by the obtaining unit, a character recognition result generated by the character recognition unit. The output unit outputs the character recognition result corrected by the correcting unit in association with the moving image.

Type: Grant

Filed: February 17, 2012

Date of Patent: June 17, 2014

Assignee: Fuji Xerox Co., Ltd.

Inventors: Takeshi Nagamine, Tsutomu Abe
Character image extracting apparatus and character image extracting method

Patent number: 8750616

Abstract: In an extracting step, the extracting portion obtains a linked component composed of a plurality of mutually linking pixels from a character string region composed of a plurality of characters, and extracts section elements from the character string region, the section elements each being surrounded by a circumscribing figure circumscribing to the linked component. In the first altering step, the first altering portion combines section elements at least having a mutually overlapping part among the extracted section elements so as to prepare a new section element. In the first selecting step, the first selecting portion determines a reference size in advance and selects section elements having a size greater than the reference size, from among the section elements altered in the first altering step.

Type: Grant

Filed: December 21, 2007

Date of Patent: June 10, 2014

Assignee: Sharp Kabushiki Kaisha

Inventors: Bo Wu, Jianjun Dou, Ning Le, Yadong Wu, Jing Jia
Character region extracting apparatus and method using character stroke width calculation

Patent number: 8744189

Abstract: A character region extracting apparatus and method which extract a character region through the calculation of character stroke widths are provided. The method includes producing a binary image including a candidate character region from an original image; extracting a character outline from the candidate character region; acquires character outline information for the extracted outline; setting a representative character stroke width and a representative character angle in each of the pixels forming the outline, based on the character outline information; and determining a character existing region in the candidate character region by confirming the ratio of effective representative stroke widths and effective angles as compared to the entire length of the outline. Accordingly, it is possible to efficiently determine whether one or more characters exist in the candidate character region.

Type: Grant

Filed: February 17, 2011

Date of Patent: June 3, 2014

Assignees: Samsung Electronics Co., Ltd, Korea University Research and Business Foundation

Inventors: Sang-Wook Oh, Sang-Hoon Sull, Myung-Hoon Kim, Hoon-Jae Lee, Soon-Hong Jung, Jun-Sic Youn
ENTRIES TO AN ELECTRONIC CALENDAR

Publication number: 20140146200

Abstract: An example method of entering calendar events into an electronic calendar involves capturing a digital image of a document that contains a written calendar event; analyzing the digital image of the document containing the written calendar event to extract text information appearing on the digital image of the document; matching the extracted text information in the digital image of the written calendar event document to a date in the electronic calendar; and populating the extracted text information to at least one field of the electronic calendar associated with the date.

Type: Application

Filed: November 28, 2012

Publication date: May 29, 2014

Applicant: RESEARCH IN MOTION LIMITED

Inventors: Sherryl Lee Lorraine SCOTT, Scott David REEVE, Julia Murdock THOMPSON, Jodie Elizabeth FLETCHER
Method and device for generating character data, method and control device for displaying character data, and navigation apparatus

Patent number: 8730244

Abstract: A device includes a character-data rotating section that rotates a regular-position character by a predetermined angle with respect to a reference point that is the center point of the background area of the regular-position character by using regular-position character data having a rotation angle of 0° and a center-point matching processing section that horizontally and/or vertically enlarges the background area of the rotated character data to cause the center point of the rotated character and the center point of BMP data to match each other even with respect to rotated character data. Thus, when multiple pieces of character data are arranged so that the center points thereof lie on a reference line, not only are the center points of the characters aligned along the reference line, but also bottom portions of the characters aligned with respect to the reference line.

Type: Grant

Filed: July 1, 2008

Date of Patent: May 20, 2014

Assignee: Alpine Electronics, Inc.

Inventor: Noboru Yamazaki
Apparatus and method for digitizing documents with extracted region data

Patent number: 8718364

Abstract: An apparatus according to the present invention comprises: a region extraction unit configured to extract region data for each object from document image data including tables; a table structure analysis unit configured to analyze the region data relating to table objects out of the extracted region data and extract table structure information on each of the table objects; a sheet generation unit configured to generate a display sheet for reproducing a layout of the object in the document image data and an edit sheet for each table for editing the table, by using the region data and the table structure information on each object; and an electronic-document generation unit configured to generate an electronic document which associated the display sheet with the edit sheet.

Type: Grant

Filed: December 29, 2010

Date of Patent: May 6, 2014

Assignee: Canon Kabushiki Kaisha

Inventor: Makoto Enomoto
Method of producing probabilities of being a template shape

Patent number: 8711440

Abstract: A method of identifying a printed page from a scan of the printed page is disclosed. The method comprises the steps of generating a page key of the printed page on the basis of the scan (710), searching a database (199) for a similar page key (730). For each found similar page key (740), the method further comprises; retrieving from the database an instance key location (750), generating an instance key for the printed page (530), based on the retrieved instance key location of the referenced page instance; and comparing the generated instance key for the printed page with the retrieved instance key of the referenced page instance (770). A match between the instance keys indicates that the printed page is the referenced page instance.

Type: Grant

Filed: November 10, 2009

Date of Patent: April 29, 2014

Assignee: Canon Kabushiki Kaisha

Inventor: Stephen James Hardy
System and Method for Selecting Segmentation Parameters for Optical Character Recognition

Publication number: 20140105496

Abstract: A computer-implemented method for selecting at least one segmentation parameter for optical character recognition is provided. The method can include receiving an image having a character string that includes one or more characters. The method can also include receiving a character string identifying each of the one or more characters. The method can also include automatically generating at least one segmentation parameter. The method can also include performing segmentation on the image having the character string using the at least one segmentation parameter. The method can also include determining if a resultant segmentation satisfies one or more criteria and if the resultant segmentation satisfies the one or more criteria, selecting the at least one segmentation parameter.

Type: Application

Filed: October 17, 2012

Publication date: April 17, 2014

Applicant: Cognex Corporation

Inventors: Ali Zadeh, John Petry
System and Method for Selecting and Displaying Segmentation Parameters for Optical Character Recognition

Publication number: 20140105497

Abstract: A computer-implemented method for selecting at least one segmentation parameter for optical character recognition is provided. The method can include receiving an image having a character string that includes one or more characters. The method can also include receiving a character string identifying each of the one or more characters. The method can also include automatically generating at least one segmentation parameter. The method can also include performing segmentation on the image having the character string using the at least one segmentation parameter. The method can also include determining if a resultant segmentation satisfies one or more criteria and if the resultant segmentation satisfies the one or more criteria, selecting the at least one segmentation parameter.

Type: Application

Filed: November 21, 2012

Publication date: April 17, 2014

Applicant: Cognex Corporation

Inventors: Ali Zadeh, John Petry, Kim Marie Steiner, Steven Patrick Shuman
Variable glyph system and method

Patent number: 8699794

Abstract: Using methods, computer-readable storage media, and apparatuses for computer-implemented processing, a passage of text may be variably rendered. For each glyph in the passage of text, a glyph representation is varied according to a geometric transformation that was determined from statistical measurements of at least one geometric property from an ensemble of representations of the current glyph. Each varied glyph representation is included in renderable output data, such that when the passage of text is rendered to an output device, a given rendered representation of a given glyph subtly differs from other rendered representations of the given glyph.

Type: Grant

Filed: January 7, 2013

Date of Patent: April 15, 2014

Assignee: Gracious Eloise, Inc.

Inventors: Eloise Bune D'Agostino, Michael Bennett D'Agostino, Bryan Michael Minor, Tamas Frajka, Michel Francois Pettigrew
Method for omnidirectional processing of 2D images including recognizable characters

Patent number: 8682077

Abstract: The invention is a method for omnidirectional recognition of recognizable characters in a captured two-dimensional image. An optical reader configured in accordance with the invention searches for pixel groupings in a starburst pattern, and subjects located pixel groupings to a preliminary edge crawling process which records the pixel position of the grouping's edge and records the count of edge pixels. If two similar-sized pixel groupings are located that are of sizes sufficient to potentially represent recognizable characters, then the reader launches “alignment rails” at pixel positions substantially parallel to a centerline connecting the center points of the two similarly sized groupings. A reader according to the invention searches for additional recognizable characters within the rail area, and subjects each located pixel grouping within the rail area to a shape-characterizing edge crawling process for developing data that characterizes the shape of a pixel grouping's edge.

Type: Grant

Filed: June 11, 2010

Date of Patent: March 25, 2014

Assignee: Hand Held Products, Inc.

Inventor: Andrew Longacre, Jr.
Method, system and computer program product for transmitting data from a document application to a data application

Patent number: 8667410

Abstract: In a method for computer-aided transfer of data from a document application into a data application having a set of data fields, a document is displayed in the document application opened on a computer with a display device, and wherein from the document data are to be transferred into the data application also opened on the computer. A name of a data field into which data are to be transferred is displayed on the display device. Via identification of a corresponding data value in the document on the display device, a character string representing the data value is automatically read out from the document and entered into the data field corresponding to the data field name in the data application via actuation of a predetermined button.

Type: Grant

Filed: July 4, 2006

Date of Patent: March 4, 2014

Assignee: Open Text S.A.

Inventor: Johannes Schacht
Techniques for shape clustering and assignment of character codes in post optical character recognition processing

Patent number: 8666174

Abstract: Systems, methods and computer program products on storage devices for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process. The output of an OCR process is classified into a plurality of clusters of clip images and a representative image for each cluster is generated to identify clusters whose clip images were incorrectly assigned character codes by the OCR process.

Type: Grant

Filed: January 17, 2012

Date of Patent: March 4, 2014

Assignee: Google Inc.

Inventors: Luc Vincent, Raymond W. Smith
Accuracy of recognition by means of a combination of classifiers

Patent number: 8660371

Abstract: In one embodiment, there is provided a method for an Optical Character Recognition (OCR) system. The method comprises: recognizing an input character based on a plurality of classifiers, wherein each classifier generates an output by comparing the input character with a plurality of trained patterns; grouping the plurality of classifiers based on a classifier grouping criterion; and combining the output of each of the plurality of classifiers based on the grouping.

Type: Grant

Filed: May 6, 2010

Date of Patent: February 25, 2014

Assignee: ABBYY Development LLC

Inventor: Diar Tuganbaev
Image processing apparatus including an obtaining unit, an isolating unit, a classifying unit, information processing apparatus, and image reading apparatus

Patent number: 8660354

Abstract: An image processing apparatus includes: a memory; an obtaining unit that obtains image data representing an image including concatenated pixels; an isolating unit that isolates a rendering element, the rendering element being an image surrounded by border lines of a color in an image represented by the image data; and a classifying unit that, in a case where a plurality of rendering elements has been isolated by the isolating unit, and in a case where color difference between two of the plural rendering elements or the distance between the two rendering elements is less than a threshold, classifies the two rendering elements into the same group, associates pieces of image data that represent rendering elements belonging to the same group with one another, and stores the pieces of image data in the memory.

Type: Grant

Filed: October 5, 2009

Date of Patent: February 25, 2014

Assignee: Fuji Xerox Co., Ltd.

Inventors: Chihiro Matsuguma, Hiroyoshi Uejo, Kazuhiro Ohya, Ayumi Onishi, Katsuya Koyanagi, Hiroshi Niina, Zhenrui Zhang
APPARATUS, METHOD AND COMPUTER READABLE RECORDING MEDIUM FOR ANALYZING VIDEO USING IMAGE CAPTURED FROM VIDEO

Publication number: 20140050400

Abstract: An apparatus for analyzing a video capture screen includes: a video frame extracting unit extracting at least one frame from a video having a plurality of frames; an extracted frame digitizing unit digitizing features of each of the at least one frame extracted by the video frame extracting unit; an image digitizing unit digitizing features of at least one collected search target image; an image comparing and searching unit comparing the search target image with the at least one frame extracted from the plurality of frames by digitized values of the collected search target image and the at least one frame; and a search result processing unit mapping related information of the collected search target image to a frame coinciding with the search target image and storing the related information in a database, when the extracted at least one frame coinciding with the search target image is present in a comparison result.

Type: Application

Filed: July 12, 2013

Publication date: February 20, 2014

Inventors: Gunhan PARK, Jeanie JUNG
Character recognition

Patent number: 8644616

Abstract: Systems and methods for character recognition by performing lateral view-based analysis on the character data and generating a feature vector based on the lateral view-based analysis.

Type: Grant

Filed: January 31, 2013

Date of Patent: February 4, 2014

Assignee: University of Calcutta

Inventors: Nabendu Chaki, Soharab Hossain Shaikh
TRELLIS BASED WORD DECODER WITH REVERSE PASS

Publication number: 20140023273

Abstract: Systems, apparatuses, and methods to relate images of words to a list of words are provided. A trellis based word decoder analyses a set of OCR characters and probabilities using a forward pass across a forward trellis and a reverse pass across a reverse trellis. Multiple paths may result, however, the most likely path from the trellises has the highest probability with valid links. A valid link is determined from the trellis by some dictionary word traversing the link. The most likely path is compared with a list of words to find the word closest to the most.

Type: Application

Filed: March 14, 2013

Publication date: January 23, 2014

Applicant: QUALCOMM Incorporated

Inventors: Pawan Kumar Baheti, Kishor K. Barman, Raj Kumar Krishna Kumar
System and method for identifying pictures in documents

Patent number: 8634644

Abstract: A system and method to identify pictures in documents. An image representing a page of a document is received. The image is analyzed to identify text objects in the page. A masked image is generated by masking out regions of the image including the text objects in the page. Groups of pixels in the masked image are identified, wherein a respective group of pixels corresponds to at least one picture in the page. When there is one or more groups of pixels, regions for pictures are identified based on the one or more groups of pixels. Metadata tags for the pictures are stored, wherein a respective metadata tag for a respective picture includes information about a respective bounding box for the respective picture.

Type: Grant

Filed: August 25, 2009

Date of Patent: January 21, 2014

Assignee: Fuji Xerox Co., Ltd.

Inventors: Patrick Chiu, Francine Chen, Laurent Denoue
Determining a class associated with an image

Patent number: 8620078

Abstract: The technology is directed to determining a class associated with an image. In some examples, a method determines the class associated with an image. The method can include determining a segmentation score for an image segment based on a comparison of the image segment and a region of an image. The region of the image can be associated with the image segment. The method further includes determining a confidence score for the image segment based on the segmentation score and a classification score. The classification score can be indicative of a similarity between the image segment and at least one class. The method further includes determining a class associated with the image based on the confidence score. The method further includes outputting the class associated with the image.

Type: Grant

Filed: July 14, 2010

Date of Patent: December 31, 2013

Assignee: Matrox Electronic Systems, Ltd.

Inventors: Sylvain Chapleau, Vincent Paquin
Utilizing subtitles in multiple languages to facilitate second-language learning

Patent number: 8620139

Abstract: Processing video for utilization in second language learning is described herein. A video file includes spoken words in a source language, subtitles in the source language, and subtitles in a native language of an end user (a target language). The subtitles in the source language are synchronized with the spoken words in the video, and the subtitles in the source language are mapped to the subtitles in the target language. Both sets of subtitles are displayed simultaneously as the video is played by the end user.

Type: Grant

Filed: April 29, 2011

Date of Patent: December 31, 2013

Assignee: Microsoft Corporation

Inventors: Chi Ho Li, Matthew Robert Scott
OCR multi-resolution method and apparatus

Patent number: 8611661

Abstract: In some embodiments, provided are procedures for processing images that may have different font sizes. In some embodiments, it involves OCR'ing with multiple passes at different resolutions.

Type: Grant

Filed: December 26, 2007

Date of Patent: December 17, 2013

Assignee: Intel Corporation

Inventors: Oscar Nestares, Badusha Kalathiparambil
Text detection using multi-layer connected components with histograms

Patent number: 8611662

Abstract: A digital image is converted to a multiple level image, and multiple scale sets are formed from connected components of the multiple level image such that different ones of the scale sets define different size spatial bins. For each of the multiple scale sets there is generated a count of connected components extracted from the respective scale set for each spatial bin; and adjacent spatial bins which represent connected components are linked. Then the connected components from the different scale sets are merged and text line detection is performed on the merged connected components. In one embodiment each of the scale sets is a histogram, and prior to linking all bins with less than a predetermined count are filtered out; and each histogram is extended such that counts of adjacent horizontal and vertical bins are added (single region bins are filtered out) and the linking is on the extended histograms.

Type: Grant

Filed: November 21, 2011

Date of Patent: December 17, 2013

Assignee: Nokia Corporation

Inventors: Shang-hsuan Tsai, Vasudev Parameswaran, Radek Grzeszczuk
Image processing apparatus, image processing method and computer-readable medium

Patent number: 8611669

Abstract: An image processing apparatus includes a line information reception unit, a line extraction unit, an inversion unit and a determination unit. The line information reception unit receives a set of information indicating (i) information on an image having a possibly of being a line and (ii) line elements being a rectangular pixel lump which constitutes a line. The line extraction unit extracts a line by tracing from a first start point to an end point of the line, based on the received information indicating the line elements and a tracing direction of the line. The inversion unit inverts the tracing direction of the line, sets the extracted end point of the line as a second start point and sends the second start point and the inverted tracing direction to the line extraction unit. The determination unit determines whether or not to cause the inversion unit to perform a process.

Type: Grant

Filed: April 21, 2011

Date of Patent: December 17, 2013

Assignee: Fuji Xerox Co., Ltd.

Inventor: Eiichi Tanaka
Method and apparatus for reducing memory capacity in encoding image data

Patent number: 8594427

Abstract: A filter unit performs an analysis filtering operation by segmenting a frequency component to generate a plurality of subbands composed of coefficient data segmented on a per frequency band basis. A coefficient storage unit stores the coefficient data on a subband by subband basis, with each subband corresponding to a respective area of the coefficient storage unit. The filter unit writes the coefficient data onto the coefficient storage unit via a write buffer in response to a determination that the analysis filtering operation has not reached a final segmentation level. An entropy encoding unit entropy encodes the coefficient data. The filter unit writes the coefficient data on an area of the coefficient storage unit corresponding to the subband to which the coefficient data belongs. The entropy encoding unit reads the coefficient data, stored on different areas on a subband by subband basis, from the coefficient storage unit.

Type: Grant

Filed: April 25, 2008

Date of Patent: November 26, 2013

Assignee: Sony Corporation

Inventors: Katsutoshi Ando, Takahiro Fukuhara
Method of authenticating and/or identifying an article

Patent number: 8590800

Abstract: The present invention relates to a method of authenticating and/or identifying an article containing a chemical marking agent, which is substantially inseparably enclosed in a marker as a carrier and contains selected chemical elements and/or compounds in the form of marker elements, in concentrations based on a predetermined encryption code, which method comprises the steps of: i) qualitatively and/or quantitatively identifying the marker elements of the chemical marking agent, and ii) comparing the values identified in step (i) with the predetermined encryption code.

Type: Grant

Filed: December 8, 2009

Date of Patent: November 26, 2013

Assignee: Polysecure GmbH

Inventor: Thomas Baque
System and Method for Processing Image for Identifying Alphanumeric Characters Present in a Series

Publication number: 20130272607

Abstract: A system and a method for identification of alphanumeric characters present in a series in an image are disclosed. The system and method captures the image and further processes it for binarization by computing a pattern of the image. The generated binarized images are then filtered for removing unwanted components. Candidate images are identified out of the filtered binarized images. All the obtained candidate images are combined to generate a final candidate image which is further segmented in order to recognize a valid alphanumeric character present in the series.

Type: Application

Filed: March 25, 2013

Publication date: October 17, 2013

Applicants: Indian Statistical Institute, Tata Consultancy Services Limited

Inventors: Tata Consultancy Services Limited, Indian Statistical Institute
METHODS AND SYSTEMS FOR ASSESSING THE QUALITY OF AUTOMATICALLY GENERATED TEXT

Publication number: 20130259378

Abstract: A set of ordered characters is received in association with information specifying the locations of the characters within the image of the document. Language-conditional character probabilities for each character are determined based on a set of language models and the ordering of the characters. Neighbor characters associated with a target character are identified based on the locations of the characters. Language-conditional character probabilities associated with the neighbor characters and language-conditional character probabilities associated with the target character are combined to generate a local language-conditional likelihood associated with the target character, the local language-conditional likelihood representing a concordance of the target character to a language model.

Type: Application

Filed: April 16, 2013

Publication date: October 3, 2013

Inventor: Ashok Popat
Method and system for preprocessing an image for optical character recognition

Patent number: 8548246

Abstract: A method and system for preprocessing an image, wherein the image includes a plurality of columns, or regions, of text is disclosed. A plurality of components associated with the text is determined. On determining the plurality of components, a line height and a column spacing is determined for the components. The components are then associated with a column based on the line height and the column spacing. A set of characteristic parameters are calculated for each column and the plurality of components of each column are merged based on the characteristic parameters to form sub-words and words. A first plurality of words and/or subwords is merged and processed as a first region and a second plurality of words and/or subwords is merged and processed as a second region wherein at least a portion of the second region vertically overlaps at least a portion of the first region.

Type: Grant

Filed: May 9, 2012

Date of Patent: October 1, 2013

Assignee: King Abdulaziz City for Science & Technology (KACST)

Inventors: Hussein Khalid Al-Omari, Mohammad Sulaiman Khorsheed
Method for omnidirectional processing of 2D images including recognizable characters

Patent number: 8548242

Abstract: The invention is a method for omnidirectional recognition of recognizable characters in a captured two-dimensional image. An optical reader configured in accordance with the invention searches for pixel groupings in a starburst pattern, and subjects located pixel groupings to a preliminary edge crawling process which records the pixel position of the grouping's edge and records the count of edge pixels. If two similar-sized pixel groupings are located that are of sizes sufficient to potentially represent recognizable characters, then the reader launches “alignment rails” at pixel positions substantially parallel to a centerline connecting the center points of the two similarly sized groupings. A reader according to the invention searches for additional recognizable characters within the rail area, and subjects each located pixel grouping within the rail area to a shape-characterizing edge crawling process for developing data that characterizes the shape of a pixel grouping's edge.

Type: Grant

Filed: June 11, 2010

Date of Patent: October 1, 2013

Assignee: Hand Held Products, Inc.

Inventor: Andrew Longacre, Jr.
VARIABLE GLYPH SYSTEM AND METHOD

Publication number: 20130251263

Abstract: Using methods, computer-readable storage media, and apparatuses for computer-implemented processing, a passage of text may be variably rendered. For each glyph in the passage of text, a glyph representation is varied according to a geometric transformation that was determined from statistical measurements of at least one geometric property from an ensemble of representations of the current glyph. Each varied glyph representation is included in renderable output data, such that when the passage of text is rendered to an output device, a given rendered representation of a given glyph subtly differs from other rendered representations of the given glyph.

Type: Application

Filed: January 7, 2013

Publication date: September 26, 2013

Applicant: GRACIOUS ELOISE, INC.

Inventors: Eloise Bune D'AGOSTINO, Michael Bennett D'AGOSTINO, Bryan Michael MINOR, Tamas FRAJKA, Michel Francois PETTIGREW
System and method to enable correction to application of substantially colorless material over identified text via segmentation

Patent number: 8542925

Abstract: Disclosed is a processor-implemented method for processing image data using an image processing apparatus. The processor is configured to receive a PDL file of image data and raster image process (RIP) the PDL file to determine pixels representing text. The ripped file is then segmented to determine at least any pixels representing text that were not initially indicated or identified by the ripped file. The results are combined to determine locations for marking onto a substrate by the output device using substantially colorless marking material over text pixels (to coat marked text pixels). In some instances, locations for covering text pixels with substantially colorless marking material can be tagged during segmenting image data, using a tag plane.

Type: Grant

Filed: September 12, 2011

Date of Patent: September 24, 2013

Assignee: Xerox Corporation

Inventors: David Robinson, Katherine Loj
System and method to enable correction of text handling mismatches via segmentation

Patent number: 8538152

Abstract: Disclosed is a processor-implemented method for processing image data using an image processing apparatus. The processor is configured to receive a PDL file of image data and raster image process (RIP) the PDL file to determine at least pixels representing text of a predetermined colorant. The ripped file is then segmented to determine at least any text pixels of the predetermined colorant not initially indicated by the ripped file. The results are combined to determine text pixels of the predetermined colorant for marking onto a substrate using marking material (e.g., ink). In some instances, pixels of the predetermined colorant can be tagged during segmenting using a tag plane to determine text pixels for marking by the output device.

Type: Grant

Filed: September 12, 2011

Date of Patent: September 17, 2013

Assignee: Xerox Corporation

Inventors: David Robinson, Katherine Loj

prev 1 2 3 4 5 6 7 8 … next