Segmenting Individual Characters Or Words Patents (Class 382/177)
-
Patent number: 8538086Abstract: An image inspection apparatus that compares a reference image with an inspection image obtained by scanning a printed medium on which the reference image has been printed, to determine whether the printed medium is acceptable is provided. The image inspection apparatus includes a first inspecting unit that compares the reference image exclusive of an edge in the reference image with the inspection image exclusive of an edge in the inspection image to perform inspection; a line-image detecting unit that detects a line image that contains the edge from each of the reference image and the inspection image; a second inspecting unit that compares the line image detected from the reference image with the line image detected from the inspection image to perform inspection; and a determining unit that determines whether the printed medium is acceptable based on results of these inspections.Type: GrantFiled: August 27, 2010Date of Patent: September 17, 2013Assignee: Ricoh Company, LimitedInventor: Shinji Yamakawa
-
Patent number: 8532988Abstract: A method for searching for an input symbol string, includes receiving (B) an input symbol string, proceeding (C) in a trie data structure to a calculation point indicated by the next symbol, calculating (D) distances at the calculation point, selecting (E) repeatedly the next branch to follow (C) to the next calculation point to repeat the calculation (D). After the calculation (G), selecting the symbol string having the shortest distance to the input symbol string on the basis of the performed calculations. To minimize the number of calculations, not only the distances are calculated (D) at the calculation points, but also the smallest possible length difference corresponding to each distance, and on the basis of each distance and corresponding length difference a reference value is calculated, and the branch is selected (E) in such a manner that next the routine proceeds from the calculation point producing the lowest reference value.Type: GrantFiled: July 3, 2003Date of Patent: September 10, 2013Assignee: Syslore OyInventor: Jorkki Hyvonen
-
Patent number: 8526735Abstract: Processing for a time-series analysis of keywords comprises clustering or classifying pieces of document data, each of which is description of a phenomenon in a natural language, on the basis of frequencies of occurrence of keywords in the pieces of document data, individual keywords being also clustered or classified by clustering or classifying the pieces of document data, and performing a time-series analysis of frequencies of occurrence of pieces of document data containing individual keywords in clusters or classes into which the pieces of document data are clustered or classified or a time-series analysis of frequencies of occurrence of pieces of document data containing clusters or classes into which the individual keywords are clustered or classified. Frequency distribution showing variation of the frequencies of occurrence of the pieces of document data is acquired by the time-series analysis.Type: GrantFiled: May 2, 2012Date of Patent: September 3, 2013Assignee: International Business Machines CorporationInventor: Takeshi Inagaki
-
Patent number: 8517945Abstract: A method for determining a candidate lesion region in a digital ultrasound medical image of anatomical tissue. The method includes the steps of: accessing the digital ultrasound medical image of anatomical tissue; applying an anisotropic diffusion filter to the ultrasound image to generate a filtered ultrasound image; performing a normalized cut operation on the filtered ultrasound image to partition the filtered ultrasound image into a plurality of regions; and selecting, from the plurality of regions, at least one region as a candidate lesion region.Type: GrantFiled: April 28, 2006Date of Patent: August 27, 2013Assignee: Carestream Health, Inc.Inventors: Zhimin Huo, Xu Liu
-
Patent number: 8514446Abstract: An information processing apparatus which is capable of efficiently performing color/monochrome determination of characters as a print object. A printer driver determines whether a character as the print object is a character which is necessary to be drawn or a character which is not necessary to be drawn. Then, when the character as the print object is determined to be a character which is not necessary to be drawn, the printer driver determines that the character as the print object is in monochrome.Type: GrantFiled: April 27, 2006Date of Patent: August 20, 2013Assignee: Canon Kabushiki KaishaInventor: Tetsu Oishi
-
Patent number: 8516606Abstract: Systems and methods are provided for challenge/response animation. In one implementation, a request for protected content may be received from a client, and the protected content may comprise data. A challenge phrase comprising a plurality of characters may be determined, and a computer processor may divide the challenge phrase into at least two character subsets selected from the characters comprising the challenge phrase. Each of the at least two character subsets may include less than all of the characters comprising the challenge phrase. The at least two character subsets may be sent to the client in response to the request; and an answer to the challenge phrase may be received from the client in response to the at least two character subsets. Access to the protected content may be limited based on whether the answer correctly solves the challenge phrase.Type: GrantFiled: March 18, 2010Date of Patent: August 20, 2013Assignee: AOL Inc.Inventor: Scott Dorfman
-
Patent number: 8503782Abstract: Methods, systems, and apparatus including computer program products for using extracted image text are provided. In one implementation, a computer-implemented method is provided. The method includes receiving an input of one or more image search terms and identifying keywords from the received one or more image search terms. The method also includes searching a collection of keywords including keywords extracted from image text, retrieving an image associated with extracted image text corresponding to one or more of the image search terms, and presenting the image.Type: GrantFiled: January 13, 2012Date of Patent: August 6, 2013Assignee: Google Inc.Inventors: Luc Vincent, Adrian Ulges
-
Patent number: 8498485Abstract: A system and method for creating one of a plurality of test decks to qualify and test forms processing systems, including preparing a handprint snippet data base containing labeled handprint image snippets representing a unique hand, preparing a form description file and a data content file, selecting handprint snippets from the handprint snippet data base to formulate a form using the data content file, creating a form image using the selected snippets according to the form description file and printing the form image.Type: GrantFiled: April 13, 2012Date of Patent: July 30, 2013Assignee: ADI, LLCInventors: K. Bradley Paxton, William L. DiBacco, Steven P. Spiwak, Craig A. Towne, Manuel Trevisan
-
Patent number: 8494240Abstract: A method of centerline determination for a tubular tissue in a medical image data set defined in a data space, comprising receiving at least one start point and one end point inside a tubular tissue volume; automatically determining a path between said points that remains inside said volume; automatically segmenting said tubular tissue using said path; and automatically determining a centerline for said tubular tissue from said segmentation, wherein said receiving, said determining a path and said segmenting, said determining a centerline are all performed on a same data space of said medical image data set.Type: GrantFiled: July 23, 2012Date of Patent: July 23, 2013Assignee: Algotec Systems Ltd.Inventors: Ido Milstein, Shmuel Akerman, Gad Miller, Laurent Cohen
-
Publication number: 20130170751Abstract: A method for processing data of a scanned book having a plurality of pages is disclosed. The method includes obtaining page image data from a page. The method further includes segmenting and recognizing the page image data to obtain locations of rectangular boxes corresponding to the respective characters and text codes for the respective characters. The method also includes obtaining respective aggregated character line information for each line of characters. The method further includes adjusting the rectangular boxes in accordance with the obtained aggregated character line information.Type: ApplicationFiled: December 28, 2012Publication date: July 4, 2013Applicants: BEIJING FOUNDER APABI TECHNOLOGY LTD., PEKING UNIVERSITY FOUNDER GROUP CO., LTD.Inventors: Peking University Founder Group Co., Ltd., BEIJING FOUNDER APABI TECHNOLOGY LTD.
-
Patent number: 8467608Abstract: A method and an apparatus for character string recognition may be provided that enables prevention of a decrease in recognition accuracy for a character string even when distortion of an image appears in a direction perpendicular to a medium transfer direction.Type: GrantFiled: March 31, 2008Date of Patent: June 18, 2013Assignee: Nidec Sankyo CorporationInventor: Hiroshi Nakamura
-
Patent number: 8467614Abstract: The present invention provides a method for an Optical Character Recognition (OCR) system providing recognition of characters that are partly hidden by crossing outs due to for example an imprint of a stamp, handwritten signatures, etc. The method establishes a set of template images of certainly recognized characters from the image of the text being processed by the OCR system, wherein the effect of the crossed out section is modelled into the template images before comparing these images with the image of a visually impaired crossed out character. The modelled template image having the highest similarity with the visually impaired crossed out character is the correct identification for the visually impaired character instance.Type: GrantFiled: November 21, 2008Date of Patent: June 18, 2013Assignee: Lumex ASInventors: Knut Tharald Fosseide, Hans Christian Meyer
-
Patent number: 8457404Abstract: An image processing apparatus includes: a receiver that receives an image including at least a character image; a path calculator that calculates separation paths, which are segments for separating the character images in the image received by the receiver; a feature amount calculator that calculates feature amounts of the separation paths in a plurality of directions calculated by the path calculator; a selector that determines a separation direction of the image and a state of the character image and selects a separation path among the separation paths in the plurality of directions; a separator that separates the image into a plurality of partial images; and a recursive processing determining unit that determines whether or not to perform recursive processing, wherein the path calculator calculates the separation paths, which are the segments for separating the character image in the image separated by the separator.Type: GrantFiled: March 1, 2011Date of Patent: June 4, 2013Assignee: Fuji Xerox Co., Ltd.Inventor: Eiichi Tanaka
-
Patent number: 8457443Abstract: To handle static text and logos in stabilized images without destabilizing the static text and logos, a method of handling overlay subpictures in stabilized images includes separating an existing overlay subpicture from an input image to generate a separated overlay subpicture and a separated input image. The separated input image is stabilized to form a stabilized image. The separated overlay subpicture is then merged with the stabilized image to obtain an output image.Type: GrantFiled: December 22, 2011Date of Patent: June 4, 2013Assignee: CyberLink Corp.Inventor: Chia-Chen Kuo
-
Publication number: 20130136359Abstract: An image processing apparatus segments Western and hieroglyphic portions of textual lines. The apparatus includes an input component that receives an input image having at least one textual line. The apparatus also includes an inter-character break identifier component that identifies candidate inter-character breaks along a textual line and an inter-character break classifier component. The inter-character break classifier component classifies each of the candidate inter-character breaks as an actual break, a non-break or an indeterminate break based at least in part on the geometrical properties of each respective candidate inter-character break and the bounding boxes adjacent thereto. A character recognition component recognizes the candidate characters based at least in part on a feature set extracted from each respective candidate character that can be histogram features, Gabor features or any other feature set applicable to character recognition.Type: ApplicationFiled: January 23, 2013Publication date: May 30, 2013Applicant: Microsoft CorporationInventor: Microsoft Corporation
-
Patent number: 8452100Abstract: A feature point calculating section binarizes the image data to obtain a centroid of a consecutive component in which pixels are connected as a feature point, reverses the image data, obtains a centroid as a feature point from the reversed image data similarly, and adds them as a feature point of the image data. A features calculating section calculates a predetermined invariant based on the feature point containing the feature point obtained from the reversed image data, and calculates a hash value based on the predetermined invariant. A vote process section retrieves a hash table based on the calculated hash value, votes for a document of an index stored in association with the hash value, and accumulatively adds the vote. A similarity determination process section compares the number of votes calculated by the vote process section with a predetermined threshold value to determine a similarity.Type: GrantFiled: July 23, 2008Date of Patent: May 28, 2013Assignee: Sharp Kabushiki KaishaInventors: Hiroki Yoshino, Makoto Hayasaki
-
Patent number: 8447110Abstract: Processing for a time-series analysis of keywords comprises clustering or classifying pieces of document data, each of which is description of a phenomenon in a natural language, on the basis of frequencies of occurrence of keywords in the pieces of document data, individual keywords being also clustered or classified by clustering or classifying the pieces of document data, and performing a time-series analysis of frequencies of occurrence of pieces of document data containing individual keywords in clusters or classes into which the pieces of document data are clustered or classified or a time-series analysis of frequencies of occurrence of pieces of document data containing clusters or classes into which the individual keywords are clustered or classified. Frequency distribution showing variation of the frequencies of occurrence of the pieces of document data is acquired by the time-series analysis.Type: GrantFiled: December 31, 2010Date of Patent: May 21, 2013Assignee: International Business Machines CorporationInventor: Takeshi Inagaki
-
Patent number: 8447111Abstract: A system for processing text captured from rendered documents is described. The system receives a sequence of one or more words optically or acoustically captured from a rendered document by a user. The system identifies among words of the sequence a word with which an action has been associated. The system then performs the associated action with respect to the user.Type: GrantFiled: February 21, 2011Date of Patent: May 21, 2013Assignee: Google Inc.Inventors: Martin T. King, Dale L. Grover, Clifford A. Kushler, James Q. Stafford-Fraser
-
Publication number: 20130108159Abstract: A method and apparatus for automatically identifying character segments for character recognition is provided. The method involves receiving a plurality of words and a ground truth corresponding to each word of the plurality of words. The plurality of words may be received in a cursive script. Each word of the plurality of words is segmented into one or more character segments based on the ground truth corresponding to each word. Thereafter, the segmentation of each word is refined by iteratively re-segmenting each word based on one or more similar character segments.Type: ApplicationFiled: October 27, 2011Publication date: May 2, 2013Applicant: King Abdul Aziz City for Science and TechnologyInventors: Ahmad Abdulkader, Hussein Khalid Al-Omari, Mohammad Sulaiman Khorsheed
-
Publication number: 20130108160Abstract: A character recognition device includes image input unit that receives an image, character region detection unit that detects a character region in the image, character region separation unit that separates the character region on a character-by-character basis, character recognition unit that performs character-by-character recognition on the characters present in separated regions and outputs one or more character recognition result candidates for each character, first character string transition data creation unit that receives the candidates, calculates weights for transitions to the candidates and creates first character string transition data based on a set of the candidates and the weights, and WFST processing unit that sequentially performs state transitions based on the first character string transition data, accumulates weights in each state transition and calculates a cumulative weight for each state transition, and outputs one or more state transition results based on the cumulative weight.Type: ApplicationFiled: February 24, 2012Publication date: May 2, 2013Applicant: NTT DOCOMO, INC.Inventors: Takafumi Yamazoe, Minoru Etoh, Takeshi Yoshimura, Kosuke Tsujino
-
Patent number: 8428932Abstract: A connected text data system for efficiently and accurately translating connected text. The connected text data system includes inputting or receiving connected text, transmitting the connected text to a text iterator, scanning the connected text, identifying a plurality of words in the connected text comprising a coordinate logic to help parse connected text matches into separated text by invalidating words with overlapping coordinates, and translating the connected text to separated text by adding a space between each of the plurality of words.Type: GrantFiled: July 11, 2008Date of Patent: April 23, 2013Inventor: Nathan S. Ross
-
Publication number: 20130094760Abstract: A system for identifying digital content related to a portion of a block of text receives, automatically or via input by a user, an indication of one or more words included in the block of text. The system searches a database of digital content based on the one or more words and retrieves from the database one or more digital content items or identifiers of digital content items that are related to the one or more words. The system provides the retrieved digital content items or identifiers to the user, and receives a selection of one or more of the provided items or identifiers from the user. The system associates for display or replay the one or more selected digital content items with the one or more words in the block of text. Other embodiments of the system are also disclosed.Type: ApplicationFiled: October 9, 2012Publication date: April 18, 2013Applicant: GETTY IMAGES, INC.Inventor: GETTY IMAGES, INC.
-
Patent number: 8422787Abstract: There is provided an apparatus including a model based topic segmentation section that when segments a text using a topic model representing semantic coherence, a parameter estimation section that estimates a control parameter used in segmenting the text based on detection of a change point of word distribution in the text, using the result of segmentation by the model based topic segmentation unit as training data, and a change point detection topic segmentation section that segments the text, based on detection of the change point of word distribution in the text, using the parameter estimated by the parameter estimation section.Type: GrantFiled: December 25, 2008Date of Patent: April 16, 2013Assignee: NEC CorporationInventors: Makoto Terao, Takafumi Koshinaka
-
Patent number: 8417057Abstract: A method of compensating for distortion in text recognition is provided, which includes extracting a text region from an image; estimating the form of an upper end of the extracted text region; estimating the form of a lower end of the extracted text region; estimating the form of left and right sides of the extracted text region; estimating a diagram constituted in the form of the estimated upper end, lower end, left and right sides, and including a minimum area of the text region; and transforming the text region constituting the estimated diagram into a rectangular diagram using an affine transform.Type: GrantFiled: February 12, 2010Date of Patent: April 9, 2013Assignees: Samsung Electronics Co., Ltd., Industry Foundation of Chonnam National UniversityInventors: Sang-Wook Oh, Seong-Taek Hwang, Hyun-Soo Kim, Sang-Ho Kim, Guee-Sang Lee, Soo-Hyung Kim, Hyung-Jeong Yang, Eui-Chul Kim
-
Patent number: 8416244Abstract: A graphics or image rendering system, such as a map image rendering system, receives image data from an image database in the form of vector data that defines various image objects, such as roads, geographical boundaries, etc., and textures defining text strings to be displayed on the image to provide, for example, labels for the image objects. The imaging rendering system renders the images such that the individual characters of the text strings are placed on the image following a multi-segmented or curved line.Type: GrantFiled: September 26, 2011Date of Patent: April 9, 2013Assignee: Google Inc.Inventor: Brian Cornell
-
Patent number: 8406528Abstract: Methods and apparatuses are provided which may be implemented to in various electronic devices to evaluate displayable digital images based on certain test criterion. The displayable images may represent web content and/or the like, and the test criterion may include or relate to desired user experience and/or other like content accessibility measures.Type: GrantFiled: October 5, 2009Date of Patent: March 26, 2013Assignee: Adobe Systems IncorporatedInventor: Joshua A. Hatwich
-
Patent number: 8401293Abstract: A method for identifying words in a textual image undergoing optical character recognition includes receiving a bitmap of an input image which includes textual lines that have been segmented by a plurality of chop lines. The chop lines are each associated with a confidence level reflecting a degree to which the respective chop line properly segments the textual line into individual characters. One or more words are identified in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines which have a chop line confidence level above a first threshold value. If the first word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified based at least in part on a second subset of the plurality of chop lines which have a confidence level above a second threshold value lower than the first threshold value.Type: GrantFiled: May 3, 2010Date of Patent: March 19, 2013Assignee: Microsoft CorporationInventors: Aleksandar Antonijevic, Ivan Mitic, Mircea Cimpoi, Djordje Nijemcevic
-
Patent number: 8391560Abstract: The present invention provides a method and a system for image identification and identification result output, which determines a location coordinate with respect to an image and a rotating angle based on at least one direction of the image according to features of the image. The image is compared to a plurality of sample images stored in a database according to the rotating angle so as to obtain at least one identification result. By means of the method and the system of the present invention, identification can be achieved with respect to various rotating angles and distances so as to improve the identification rate.Type: GrantFiled: July 30, 2009Date of Patent: March 5, 2013Assignee: Industrial Technology Research InstituteInventors: Ya-Hui Tsai, Yu-Ting Lin, Kuo-Tang Huang, Chun-Lung Chang, Tung-Chuan Wu
-
Patent number: 8391602Abstract: Systems and methods for character recognition by performing lateral view-based analysis on the character data and generating a feature vector based on the lateral view-based analysis.Type: GrantFiled: April 8, 2010Date of Patent: March 5, 2013Assignee: University of CalcuttaInventors: Nabendu Chaki, Soharab Hossain Shaikh
-
Patent number: 8391559Abstract: The present invention provides a method and system for image identification and identification result output, wherein a feature image under identification acquired from an image is compared with a plurality of sample images respectively stored in a database so as to obtain a plurality of similarity indexes associated with the plurality of sample images respectively. Each similarity index represents similarity between the feature image and the corresponding sample image. Thereafter, the plurality of similarity indexes are sorted and then a least one of comparison results is output. The present invention is further capable of being used for identifying identification marks with respect to a carrier. By sorting the similarity index with respect to each feature forming the identification marks, it is capable of outputting many sets of combinations corresponding to the identification marks so as to improve speed for targeting suspected carrier and enhance the identification efficiency.Type: GrantFiled: July 30, 2009Date of Patent: March 5, 2013Assignee: Industrial Technology Research InstituteInventors: Ya-Hui Tsai, Kuo-Tang Huang, Yu-Ting Lin, Chun-Lung Chang, Tung-Chuan Wu
-
Patent number: 8385652Abstract: An image processing apparatus segments Western and hieroglyphic portions of textual lines. The apparatus includes an input component that receives an input image having at least one textual line. The apparatus also includes an inter-character break identifier component that identifies candidate inter-character breaks along a textual line and an inter-character break classifier component. The inter-character break classifier component classifies each of the candidate inter-character breaks as an actual break, a non-break or an indeterminate break based at least in part on the geometrical properties of each respective candidate inter-character break and the bounding boxes adjacent thereto. A character recognition component recognizes the candidate characters based at least in part on a feature set extracted from each respective candidate character that can be histogram features, Gabor features or any other feature set applicable to character recognition.Type: GrantFiled: March 31, 2010Date of Patent: February 26, 2013Assignee: Microsoft CorporationInventor: Ivan Mitic
-
Patent number: 8384917Abstract: A method, system, and computer program product for font reproduction in electronic documents are provided. The method includes: receiving an image of a printed document; extracting pairs of consecutive characters from the image of the printed document; storing the extracted pairs as images of the characters; and reproducing the printed document as an electronic document with text of overlapping extracted character pair images. Extracting pairs of consecutive characters includes extracting adjacent horizontal characters, extracting spaced horizontal characters, and extracting spaced vertical characters. Reproducing the printed document as an electronic document includes reproducing the spacing between words and between lines using the spaced horizontal characters and the spaced vertical characters as anchors in the reproduced document.Type: GrantFiled: February 15, 2010Date of Patent: February 26, 2013Assignee: International Business Machines CorporationInventor: Asaf Tzadok
-
Patent number: 8358871Abstract: A skewed image data detecting and correcting device includes a skew angle detecting module, and an image rotating correction module. A skewed image data detecting and correcting method includes the following steps. Firstly, a binary digitizing operation is performed to obtain a binary image data. The binary image data is rotated by multiple different rotating angles, thereby obtaining multiple rotated binary image data. The pixel numbers of all horizontal rows of the rotated binary image data are totalized, thereby obtaining multiple horizontal pixel number distribution curves. A high-pass filtering procedure is performed to filter off low-frequency noise, thereby obtaining multiple high-frequency signal curves. The square sums of respective high-frequency signal curves are calculated, thereby obtaining multiple index values.Type: GrantFiled: May 28, 2009Date of Patent: January 22, 2013Assignee: AVerMedia Information, Inc.Inventors: Chien-Hui Tu, Cheng-Yueh Lo, De-Wei Huang, Yung-Hsi Wu
-
Patent number: 8351061Abstract: A printing apparatus, including a user input unit which receives a first user command to initiate a printing operation, a display unit which displays information relating to the printing operation, a printing unit which performs printing with respect to printing data, and a controller which controls the display unit to display reference information of the printing data before the printing, and which controls the printing unit to perform the printing according to a second user command.Type: GrantFiled: July 30, 2007Date of Patent: January 8, 2013Assignee: Samsung Electronics Co., Ltd.Inventor: Jin-young Lee
-
Patent number: 8345993Abstract: A multi-level data encoding system is provided that is operable on a computer. The encoding system includes a data input device adapted to input a data set and store the data set in a database. The system further includes an encoder adapted to encode the data set and separate the encoded data set into two files, wherein each character of the data set comprises a unique electronic footprint. Additionally, the system includes a data field adapted to organize the encoded data set for proper decoding, a master file comprising one file of the encoded data set and an overlay file comprising the other file of the encoded data set. The system also includes a decoder adapted to align the overlay file onto the master file to decode the encoded data set.Type: GrantFiled: October 22, 2008Date of Patent: January 1, 2013Inventor: Glenn E Weeks
-
Patent number: 8345978Abstract: Line segmentation in an OCR process is performed to detect the positions of words within an input textual line image by extracting features from the input to locate breaks and then classifying the breaks into one of two break classes which include inter-word breaks and inter-character breaks. An output including the bounding boxes of the detected words and a probability that a given break belongs to the identified class can then be provided to downstream OCR or other components for post-processing. Advantageously, by reducing line segmentation to the extraction of features, including the position of each break and the number of break features, and break classification, the task of line segmentation is made less complex but with no loss of generality.Type: GrantFiled: March 30, 2010Date of Patent: January 1, 2013Assignee: Microsoft CorporationInventors: Aleksandar Uzelac, Bodin Dresevic, Sasa Galic, Bogdan Radakovic
-
Patent number: 8340424Abstract: A visualization program, method and apparatus for determining reading order of content in a structured document. The method includes generating, for each of a plurality of elements, a directed segment; storing, in the reading order, the generated directed segments of the elements into a storage device; reading from the storage device; linking together the directed segments for the elements in accordance with the reading order; and displaying the linked directed segments overlaid on the structured document which is displayed on the screen. A computer implemented program and an apparatus for carrying out the above method are also provided.Type: GrantFiled: July 27, 2010Date of Patent: December 25, 2012Assignee: International Business Machines CorporationInventor: Daisuke Sato
-
Publication number: 20120321189Abstract: Systems and methods providing automated extraction of information contained in video data and uses thereof are described. In particular, systems and associated methods are described that provide techniques for extracting data embedded in video, for example measurement-value pairs of medical videos, for use in a variety of applications, for example video indexing, searching and decision support applications.Type: ApplicationFiled: August 27, 2012Publication date: December 20, 2012Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Arnon Amir, David James Beymer, Karen W. Brannon, Sangeeta T. Doraiswamy, Tanveer Fathima Syeda-Mahmood
-
Patent number: 8335402Abstract: Various embodiments of the present invention relate to a method, system and computer program product for detecting and recognizing text in the images captured by cameras and scanners. First, a series of image-processing techniques is applied to detect text regions in the image. Subsequently, the detected text regions pass through different processing stages that reduce blurring and the negative effects of variable lighting. This results in the creation of multiple images that are versions of the same text region. Some of these multiple versions are sent to a character-recognition system. The resulting texts from each of the versions of the image sent to the character-recognition system are then combined to a single result, wherein the single result is detected text.Type: GrantFiled: August 3, 2011Date of Patent: December 18, 2012Assignee: A9.com, Inc.Inventors: Raghavan Manmatha, Mark A Ruzon
-
Patent number: 8331736Abstract: An image processing device is provided which generates an easily reusable electronic document from an input image in which different page sizes are mixed. The image processing device generates a plurality of pieces of display information from a plurality of document images, and, depending on the size and the direction of each of the images, converts the pieces of display information into electronic documents. That is, the plurality of pieces of display information are divided into a plurality of groups, depending on the size and the direction of each of the images, and the display information included in each of the groups is converted into a separate electronic document. Further, sequence information based on the input order of the plurality of document images is stored on an electronic document.Type: GrantFiled: May 20, 2009Date of Patent: December 11, 2012Assignee: Canon Kabushiki KaishaInventors: Keiko Nakanishi, Makoto Enomoto, Taeko Yamazaki
-
Patent number: 8331680Abstract: A novel and useful method of using Incremental Connected Components to segment and isolate individual characters in a gray-scale or color image. For each pixel intensity of pixels in the image, a plurality of pixel groups are created comprising contiguous pixels of intensity equal to or less than the current pixel intensity. The pixel groups are then input to a character classifier which returns an identified character and a confidence value. Non-overlapping pixel groups (i.e. segmentation) of identified characters having the highest confidence values are then selected.Type: GrantFiled: June 23, 2008Date of Patent: December 11, 2012Assignee: International Business Machines CorporationInventors: Amir Geva, Doron Tal
-
Patent number: 8331672Abstract: Disclosed is a method and an apparatus for recognizing a character and efficiently removing a misrecognized character. The method includes detecting character regions including at least one character in an input image, converting the input image into a binary image, discriminating the characters from a non-character, re-classifying the character region including a number of characters equal to or less than a threshold into a non-character region, and outputting only the characters present in the character region.Type: GrantFiled: June 24, 2009Date of Patent: December 11, 2012Assignee: Samsung Electronics Co., LtdInventors: Sang-Wook Oh, Seong-Taek Hwang, Sang-Ho Kim, Hee-Won Jung
-
Patent number: 8331706Abstract: A first aspect of the invention relates to a method for creating a binary mask image from an a inputted digital image of a scanned document, comprising the steps of creating a binarized image by binarizing the inputted digital image, detecting first text regions representing light text on a dark background, and inverting the first text regions, such that the inverted first text regions are interpretable in the same way as dark text on a light background. A second aspect of the invention relates to a method for comparing in a binary image a first pixel blob with a second pixel blob to determine whether they represent matching symbols, comprising the steps of detecting a line in one blob not present in the other and/or determining if one of the blobs represents an italicized symbol where the other does not.Type: GrantFiled: November 17, 2011Date of Patent: December 11, 2012Assignee: I.R.I.S.Inventors: Michel Dauw, Pierre Demuelenaere
-
Publication number: 20120308135Abstract: A method comprises extracting a local identifier (130, 730a, 730b) from an image (100, 500, 700), the image (100, 500, 700) also having positional data (120) relating to the location at which the image (100, 500, 700) was captured; and associating the extracted local identifier (130, 730a, 730b) with the corresponding positional data (120) to allow for associating the extracted local identifier with a digital map (300, 600, 800).Type: ApplicationFiled: February 8, 2010Publication date: December 6, 2012Applicant: TOMTOM GERMANY GMBH & CO. KGInventors: Heiko Mund, Oleg Schmelzle
-
Patent number: 8315462Abstract: An apparatus and a method for character string recognition for correctly recognizing a character string placed on a medium, even in a recognition process system in which a plurality of formats are handled. An image processing area is set on a medium. The image processing area is divided in a placement direction of character strings so as to make up a plurality of segments. An image data projection in a direction of character strings is calculated for each segment. The number of character string lines for each segment is calculated according to the image data projection. The number of character string lines is determined for the image processing area as a whole, according to the number of character string lines for each segment, and it is judged whether or not the character strings are predetermined character strings.Type: GrantFiled: April 18, 2011Date of Patent: November 20, 2012Assignee: Nidec Sankyo CorporationInventor: Hiroshi Nakamura
-
Patent number: 8315484Abstract: The present invention provides a method and system for confirming uncertainly recognized words as reported by an Optical Character Recognition process by using spelling alternatives as search arguments for an Internet search engine. The measured number of hits for each spelling alternative is used to provide a confirmation measure for the most probable spelling alternative. Whenever the confirmation measure is inconclusive, a plurality of search strategies are used to reach a measured result comprising zero hits except for one spelling alternative that is used as the correct alternative.Type: GrantFiled: February 15, 2007Date of Patent: November 20, 2012Assignee: Lumex ASInventors: Hans Christian Meyer, Mats Stefan Carlin, Knut Tharald Fosseide
-
Patent number: 8314944Abstract: An image forming device includes a main body casing, a cover configured to be openable and closable with respect to the main body casing, a sensing unit configured to sense an opening-closing operation of the cover, a forming unit configured to form an image on a sheet, a detecting unit configured to perform a detecting operation to detect a deviation of an image forming position of the image to be formed by the forming unit, an accepting unit configured to accept a print request, and a control unit configured to control the detecting unit to perform the detecting operation in response to the print request being accepted when the sensing unit senses an opening-closing operation of the cover after execution of a previous detecting operation, and thereafter to control the forming unit to form the image in the image forming position corrected to cancel the deviation detected in the detecting operation.Type: GrantFiled: November 25, 2008Date of Patent: November 20, 2012Assignee: Brother Kogyo Kabushiki KaishaInventor: Tsuyoshi Kushida
-
Patent number: 8311332Abstract: An image processing system and a mask preparation method able to prepare a mask by simple processing and a program executed in such an image processing system are provided. To extract the edges of the image, strings of pixels corresponding to the contours of an object are extracted from the edge extracted image, and border lines for the masking are acquired based on an approximation line thereof.Type: GrantFiled: August 31, 2006Date of Patent: November 13, 2012Assignee: Sony CorporationInventor: Hiroshi Abe
-
Patent number: 8311331Abstract: An optical character recognition process characterizes text lines in a textual image by their base-line, mean-line and x-height. The base-line for at least one text line in the image is determined by finding a parametric curve that maximizes a first fitness function that depends on the values of pixels through which the parametric curve passes and pixels below the parametric curve. The base-line corresponds to the parametric curve for which the first fitness function is maximized. The first fitness function is designed so that it increases with increasing lightless or brightness of pixels immediately below the parametric curve while also increasing with decreasing lightness of pixels through which the parametric curve passes. The mean-line is determined by incrementally shifting the base-line upward by predetermined amounts (e.g., a single pixel) until a second fitness function for the shifted base-line is maximized. The second fitness function is essentially the inverse of the first fitness function.Type: GrantFiled: March 9, 2010Date of Patent: November 13, 2012Assignee: Microsoft CorporationInventors: Djordje Nijemcevic, Milan Vugdelija, Bodin Dresevic
-
Patent number: RE43894Abstract: A method for segmenting a small feature in a multidimensional digital array of intensity values in a data processor computes an edge metric along each ray of a plurality of multidimensional rays originating at a local intensity extreme (local maximum or minimum). A multidimensional point corresponding to a maximum edge metric on each said ray is identified as a ray edge point. Every point on each ray from the local extreme to the ray edge point is labeled as part of the small object. Further points on the feature are grown by labeling an unlabeled point if the unlabeled point is adjacent to a labeled point, and the unlabeled point has a more extreme intensity than the labeled point, and the unlabeled point is closer than the labeled point to the local extreme. The resulting segmentation is quick, and identifies boundaries of small features analogous to boundaries identified by human analysts, and does not require statistical parameterizations or thresholds manually determined by a user.Type: GrantFiled: December 7, 2011Date of Patent: January 1, 2013Assignee: The Johns Hopkins UniversityInventors: Isaac N. Bankman, Tanya Nizialek