Ideographic Characters (e.g., Japanese Or Chinese) Patents (Class 382/185)

CHARACTER AUTO-COMPLETION FOR ONLINE EAST ASIAN HANDWRITING INPUT

Publication number: 20090324082

Abstract: An exemplary method includes receiving stroke information for a partially written East Asian character, the East Asian character representable by one or more radicals; based on the stroke information, selecting a radical on a prefix tree wherein the prefix tree branches to East Asian characters as end states; identifying one or more East Asian characters as end states that correspond to the selected radical for the partially written East Asian character; and receiving user input to verify that one of the identified one or more East Asian characters is the end state for the partially written East Asian character. In such a method, the selection of a radical can occur using radical-based hidden Markov models. Various other exemplary methods, devices, systems, etc., are also disclosed.

Type: Application

Filed: June 26, 2008

Publication date: December 31, 2009

Applicant: Microsoft Corporation

Inventors: Peng Liu, Lei Ma, Frank Kao-Ping Soong
System and method for associating handwritten information with one or more objects

Patent number: 7639876

Abstract: A pen-enabled computing arrangement includes a handwriting capture interface and at least one processing element, such as in the form of a computing system and a digital pen that embodies a writing stylus. The handwriting capture interface can capture an electronic handwriting input. The processing element can sense an identifier associated with an object, and associate electronic handwriting input with the object. The processing element can then process the electronic handwriting input based upon the associated object. Printed paper with which the arrangement can operate to effectuate capturing and processing data may also be provided. Further, triggered verification of the sensed identifier may be provided when the identifier is sensed based upon initial electronic handwriting input corresponding to the identifier and the sensed identifier is unknown and/or improper.

Type: Grant

Filed: January 14, 2005

Date of Patent: December 29, 2009

Assignees: Advanced Digital Systems, Inc., Cardinal Brands, Inc.

Inventors: Gregory James Clary, Jason S. Priebe, Todd Andrew Eiles, Christopher M. DiPierro, Richard L. Thornburg, Michael Earl Miller
Adaptive OCR for books

Patent number: 7627177

Abstract: A system is presented for scanning entire books or document all at once using an adaptive process where the book or document has known fonts and unknown fonts. The known fonts are processed through a verification system where sure words and error words are determined. Both the sure words and error words are sent to OCR training where they are re-OCR'ed and repeatedly verified until they meet a predetermined quality criteria. Characters or words not meeting the predetermined quality criteria receive additional OCR training until all the characters and words pass the predetermined quality criteria. Unknown fonts are scanned and clustered together by shape. Outliers in the shapes are manually keyed-in. Those symbols that are manually classified go to OCR training and then to the known type optimization process.

Type: Grant

Filed: November 24, 2008

Date of Patent: December 1, 2009

Assignee: International Business Machines Corporation

Inventors: Asaf Tzadok, Eugeniusz Walach
Method of shuffling text in an Asian document image

Patent number: 7596270

Abstract: A method, system, and computer-readable medium containing computer-executable instructions are provided, for randomly relocating text character images of a scanned-in Asian character document to produce a shuffled image, wherein the meaning of text in the shuffled image is not understandable although individual characters forming the text in the shuffled image are recognizable. In one embodiment, the method includes generally four steps: (1) dividing an Asian character document image into a text image portion and a non-text image portion; (2) structuring the text image portion into a multiple resolution-level pyramid; (3) extracting shuffleable character images by analyzing the multiple-resolution-level pyramid; and (4) shuffling some or all of the extracted shuffleable character images to create a shuffled image. The shuffled (e.g., encoded) image can be reshuffled (e.g.

Type: Grant

Filed: September 23, 2005

Date of Patent: September 29, 2009

Assignee: DynaComware Taiwan Inc.

Inventor: Kuo-Young Cheng
INFORMATION PROCESSOR, INFORMATION PROCESSING METHOD, AND COMPUTER READABLE MEDIUM

Publication number: 20090234637

Abstract: An information processor includes: a character recognizing unit; a recognized character feature obtaining unit; a translation deciding unit; a translating unit; a translated result feature obtaining unit; an output deciding unit; an image receiving unit; and an output unit that, wherein the character recognizing unit recognizes a character in character image of the image data received by the image receiving unit, and the recognized character feature obtaining unit, in a case where a picture image other than the character is recognized, obtains a third feature related to a character included in the picture image.

Type: Application

Filed: September 16, 2008

Publication date: September 17, 2009

Applicant: FUJI XEROX CO., LTD.

Inventor: Masahiro KATO
STORAGE MEDIUM STORING DOCUMENT RECOGNITION PROGRAM, DOCUMENT RECOGNITION APPARATUS AND METHOD THEREOF

Publication number: 20090226089

Abstract: A program causes a computer to function as a document recognition apparatus, having an extraction unit for extracting connected components of pixels from an input image, a generation unit for generating a reference element that is connected components of pixels extracted by the extraction unit and combined elements obtained by combining the reference element and connected components of pixels adjacent to the reference element as an element to be estimated, a calculation unit for calculating a degree of certainty that indicates how much the element to be estimated generated by the generation unit seems to be a character, and a determination unit for identifying elements that seem to be characters among the elements to be estimated based on the degree of certainty calculated by the calculation unit.

Type: Application

Filed: February 25, 2009

Publication date: September 10, 2009

Applicant: FUJITSU LIMITED

Inventors: Noriaki Ozawa, Hiroaki Takebe, Yutaka Katsuyama, Katsuhito Fujimoto
Spatial motion recognition system and method using a virtual handwriting plane

Patent number: 7580572

Abstract: A spatial motion recognition system capable of recognizing motions in three-dimensional space as handwritings on a two-dimensional plane is provided. The system recognizes motions of a system body occurring in space based on position change information of the system body that is detected in a motion detection unit, displays the recognized motion information on a screen, or transmits to an external device the recognized motion information through a transmission/reception unit or a control signal corresponding to the motion information. A control unit produces a virtual handwriting plane having the shortest distances with respect to respective positions in predetermined time intervals based on three-dimensional track information obtained through tracking, and projects the respective positions in the predetermined time intervals onto the virtual handwriting plane to recover the motions in space.

Type: Grant

Filed: March 17, 2004

Date of Patent: August 25, 2009

Assignee: Samsung Electronics Co., Ltd.

Inventors: Won-chul Bang, Dong-yoon Kim, Wook Chang, Kyoung-ho Kang, Eun-seok Choi
AREA EXTRACTION PROGRAM, CHARACTER RECOGNITION PROGRAM, AND CHARACTER RECOGNITION DEVICE

Publication number: 20090202152

Abstract: An area extraction method including obtaining a character lattice showing a connection relation between unit areas, which are obtained by separating a character string pattern in an image into patterns each recognized as corresponding to a single character, judging whether or not all combinations of each of the unit areas in the obtained character lattice and each of the unit areas in a regular lattice defining a regular connection relation between the unit areas are likely to be established, generating a path coupling between nodes corresponding to the combination of the unit areas which is determined as likely to be established, determining an optimum path from the generated paths based on a degree of coincidence with the regular lattice or the character lattice, and extracting from an image the unit areas in the character lattice corresponding to the determined optimum path.

Type: Application

Filed: February 5, 2009

Publication date: August 13, 2009

Applicant: FUJITSU LIMITED

Inventors: Hiroaki TAKEBE, Katsuhito Fujimoto
Cleaning up of handwriting intra-stroke and inter-stroke overtracing

Patent number: 7567711

Abstract: A method and system for cleaning handwriting for redisplay of the handwriting or for improved recognition accuracy is provided. The cleanup system receives handwriting that has been digitized. The cleanup system then analyzes the handwriting to identify strokes that satisfy a cleanup criterion. When a stroke has been identified as satisfying some cleanup criteria, the cleanup system cleans up the handwriting based on the detected criteria. In this way, the cleanup system generates handwriting that may have a more visually pleasing appearance to the reader.

Type: Grant

Filed: August 1, 2005

Date of Patent: July 28, 2009

Assignee: Microsoft Corporation

Inventor: Zhouchen Lin
Method and apparatus for determining an orientation of a document including Korean characters

Publication number: 20090180694

Abstract: A method and apparatus for determining an orientation of a document including Korean text are presented. A binarized pixel image is created from the document image. Contiguous pixels are grouped and labeled using a bounding box. A spanning stroke may be detected from a group of the contiguous pixels. The orientation of the document is determined by comparing counts associated with spanning strokes in the left, right, top, and bottom halves of the bounding boxes.

Type: Application

Filed: January 11, 2008

Publication date: July 16, 2009

Applicant: SHARP LABORATORIES OF AMERICA, INC.

Inventor: Lawrence Shao-Hsien Chen
Mail data processing method, mail server, program for mail server, terminal device and program for terminal device

Patent number: 7538771

Abstract: A mail server extracts a character unregistered in a portable terminal from received mail data and affixes the font data of the character concerned to the mail data or inserts a reading tag indicating the reading (pronunciation) of the character concerned into the mail data. The portable terminal additionally registers the font data affixed to the mail data into a font database before the received mail data are displayed. Furthermore, in the display processing of the mail data, a character for which the corresponding font data is unregistered is replaced by a no-font symbol and then displayed. Furthermore, the font of each character constituting the reading tag is read out and this font is displayed subsequently to the no-font symbol.

Type: Grant

Filed: August 29, 2005

Date of Patent: May 26, 2009

Assignee: Omron Corporation

Inventors: Tetsuya Nakamura, Teruo Onishi
Segmenting an image via shortest cycles

Patent number: 7529407

Abstract: A method and device are provided for segmenting an image of pixels into a number of fields. First the method finds field separators using the background of the image, in particular white areas on a newspaper page. Based on the areas in the image, a graph is constructed that has edges corresponding to the white areas and vertices where vertical and horizontal white areas intersect. The segmenting starts with assigning weights to the edges, in particular a weight indicating the Euclidean distance between the vertices. Then a list of shortest cycles is constructed via the edges and vertices in the graph. The fields are defined by the vertices and edges of the shortest cycles of the list.

Type: Grant

Filed: November 21, 2003

Date of Patent: May 5, 2009

Assignee: OCE-Technologies B.V.

Inventors: Henricus A. Marquering, Dennis Peeten
INPUT METHOD TRANSFORM

Publication number: 20090103809

Abstract: Illustrative embodiments provide a computer implemented method, a data processing system and a computer program product for transforming character data input between a first writing system and a second writing system. The computer implemented method comprises receiving character data input of a first writing system and ensuring the character data input contains normalized characters. A predefined transform is selected based on the character data input of the first writing system and output to a second writing system to transform the normalized characters of the first writing system to character data output of the second writing system, and providing the character data output to a display process.

Type: Application

Filed: October 18, 2007

Publication date: April 23, 2009

Inventors: Guoyou Chen, Li Li, Su Liu, Xinhua Wu, Shunguo Yan
IMAGE PROCESSING APPARATUS

Publication number: 20090097750

Abstract: An information embedding apparatus (100) which embeds information by changing the character spacing in a document image includes a discrimination unit (101) which discriminates a text area in the document image, a circumscribed rectangle extraction unit (102) which extracts the circumscribed rectangle of each character in the text area, a determination unit (103) which determines, based on the position and size of each extracted circumscribed rectangle, whether a portion having a character spacing smaller than a threshold exists, a reduction unit (104) which, for a character determined to have a character spacing smaller than the threshold, reduces the size of the character in at least the character arrangement direction, and a character position changing unit (105) which changes, in accordance with information to be embedded, the position of a character determined to have a character spacing equal to or larger than the threshold, and that of a character reduced.

Type: Application

Filed: October 1, 2008

Publication date: April 16, 2009

Applicant: CANON KABUSHIKI KAISHA

Inventor: Jun Tamaru
Method for optical recognition of a multi-language set of letters with diacritics

Patent number: 7512272

Abstract: A method and system for recognizing alphabetic characters that contain diacritics is described. An image analysis separates the character into its constituent components. The one or more diacritic components are then distinguished and isolated from the base portion of the character. Optical recognition is performed separately on the base portion. The diacritic is recognized through a special image analysis and pattern recognition algorithms. The image analysis extracts geometric information from the one or more diacritic components. The extracted information is used as input for the pattern recognition algorithms. The output is a code that corresponds to a particular diacritic. The recognized base portion and diacritic are combined and a check is performed for acceptable combinations in a chosen language. By separately recognizing the base portion and diacritic, the character sets used by the recognizer can be narrowed, resulting in greater recognition.

Type: Grant

Filed: October 5, 2004

Date of Patent: March 31, 2009

Assignee: Cardiff Software, Inc.

Inventors: Isaac Mayzlin, Emily Ann Deere
Method of indexing Chinese characters

Publication number: 20090060338

Abstract: In practicing the present invention, in analyzing a Chinese character, a 3×3 square grid of 9 boxes is superimposed over the character. The character is analyzed based upon the shape of the stroke that is at the lowest elevation within the lower right-hand corner. A Table is consulted consisting of a plurality of elements including horizontal strokes, and the element most closely resembling the corresponding portion of the character is chosen. The user then consults a Root Table where characters all having in common the same part of the character immediately on top are displayed. From examination of the Root Table, the user narrows down the identity of the character to a smaller group. The pages to which the user is directed are carefully reviewed and the entire character may be found in a Form Block including pertinent information concerning the character.

Type: Application

Filed: September 4, 2007

Publication date: March 5, 2009

Inventors: Por-Sen Jaw, Yin-Ping Chao
Method of organizing chinese characters

Publication number: 20090060339

Abstract: A method of organizing Chinese characters includes the steps of: generating Stroke Set; generating Symbol Set; generating Stroke Code Set; generating a sequential code for each of the Chinese characters to be organized; generating a spatial code for each of the Chinese characters to be organized; generating a character code for each of the Chinese characters to be organized; and organizing said character codes together with related the Chinese characters to be organized such that a Chinese character is adapted to be located by first locating the related character code of the Chinese character, then locating the Chinese character in responsive to the related character code of the Chinese character.

Type: Application

Filed: June 5, 2008

Publication date: March 5, 2009

Inventor: Sutoyo Lim
Method of font generation for displaying the thickness of strokes of characters

Patent number: 7499055

Abstract: The present invention employs the notion of a Chinese writing brush in moving a geometric figure to produce a style of calligraphy, where the area of the geometric figure is large or small, then the strokes of a character are thick or thin. Hence the purpose is that the variance of the strokes of a character can be achieved using the present invention. The present invention only decides a moving path for the strokes of a character and the size of a geometric figure at starting points and end points, and then moves the geometric figure along the moving path, where the area the geometric figure passes is the style of calligraphy.

Type: Grant

Filed: November 7, 2002

Date of Patent: March 3, 2009

Assignee: Industrial Technology Research Institute

Inventors: Yu-Jen Lin, Cheng-Peng Kuan, Chih-Chia Chien, Yun-Ei Wu
Adaptive OCR for books

Patent number: 7480411

Abstract: A system/method is presented for scanning entire books or document all at once using an adaptive process where the book or document has known fonts and unknown fonts. The known fonts are processed through a verification system where sure words and error words are determined. Both the sure words and error words are sent to OCR training where they are re-OCR'ed and repeatedly verified until they meet a predetermined quality criteria. Characters or word not meeting the predetermined quality criteria receive additional OCR training until all the characters and words pass the predetermined quality criteria. Unknown fonts are scanned and clustered together by shape. Outliers in the shapes are manually key-in. Those symbols that are manually classified go to OCR training and then to the known type optimization process.

Type: Grant

Filed: March 3, 2008

Date of Patent: January 20, 2009

Assignee: International Business Machines Corporation

Inventors: Asaf Tzadok, Eugeniusz Walach
Text conversion apparatus capable of relieving inputting load and a method therefor

Publication number: 20080310724

Abstract: A text input device receives, in its information input circuit, a letter indicating a destination of transmission as information on the destination of transmission. The text input device stores, in its word-finder with learning function, an input text and an output text in a state correlated with the information on the destination of transmission or its attribute. The text input device in its text learning circuit controls a change in storage caused by correlating an input text matched to a text entered with the information on the destination of transmission or its attribute stored and coincident with the information on the destination of transmission or its attribute entered. When a text matched to the text entered is output, the text input device in its text converter takes out and outputs at least one output text stored.

Type: Application

Filed: June 9, 2008

Publication date: December 18, 2008

Applicant: OKI ELECTRIC INDUSTRY CO., LTD.

Inventor: Koji Okumura
Method of optical character recognition using feature recognition and baseline estimation

Patent number: 7454063

Abstract: The present invention is a method of optical character recognition. First, text is received. Next all words in the text are identified and associated with the appropriate line in the document. The directional derivative of the pixellation density function defining the text is then taken, and the highest value points for each word are identified from this equation. These highest value points are used to calculate a baseline for each word. A median anticipated baseline is also calculated and used to verify each baseline, which is corrected as necessary. Each word is then parsed into feature regions, and the features are identified through a series of complex analyses. After identifying the main features, outlying ornaments are identified and associated with appropriate features. The results are then compared to a database to identify the features and then displayed.

Type: Grant

Filed: September 22, 2005

Date of Patent: November 18, 2008

Assignee: The United States of America as represented by the Director National Security Agency

Inventors: Kyle E Kneisl, Jesse Otero
Coding systems for Chinese characters and uses thereof

Publication number: 20080232689

Abstract: User friendly coding systems are provided for Chinese characters, either complicated or simplified. Each Chinese character is assigned a code based on the shape of the character. In particular, the characters sharing the same beginning strokes are grouped together. The coding systems are useful for searching or sorting Chinese characters, as well as for typing Chinese characters on a computer or word processor.

Type: Application

Filed: February 11, 2005

Publication date: September 25, 2008

Inventor: Cheng-Fu Lee
SCALABLE STROKE FONT SYSTEM AND METHOD

Publication number: 20080218522

Abstract: A method of creating font format data from source font data includes analyzing the source font data to obtain glyph data for a plurality of glyphs, dissecting the glyph data, extracting midline data from the dissected glyph data, classifying the midline data as unique element data and common element data, associating unique element data and common element data to each glyph of the plurality of glyphs.

Type: Application

Filed: March 24, 2008

Publication date: September 11, 2008

Inventors: Vadim Fux, Denis N. Fedotenko
Radical-Based HMM Modeling for Handwritten East Asian Characters

Publication number: 20080219556

Abstract: Exemplary methods, systems, and computer-readable media for developing, training and/or using models for online handwriting recognition of characters are described. An exemplary method for building a trainable radical-based HMM for use in character recognition includes defining radical nodes, where a radical node represents a structural element of an character, and defining connection nodes, where a connection node represents a spatial relationship between two or more radicals. Such a method may include determining a number of paths in the radical-based HMM using subsequence direction histogram vector (SDHV) clustering and determining a number of states in the radical-based HMM using curvature scale space-based (CSS) corner detection.

Type: Application

Filed: March 6, 2007

Publication date: September 11, 2008

Applicant: Microsoft Corporation

Inventors: Shi Han, Yu Zou, Ming Chang, Peng Liu, Yi-Jian Wu, Lei Ma, Frank Soong, Dongmei Zhang, Jian Wang
Radical Set Determination For HMM Based East Asian Character Recognition

Publication number: 20080205761

Abstract: Exemplary techniques are described for selecting radical sets for use in probabilistic East Asian character recognition algorithms. An exemplary technique includes applying a decomposition rule to each East Asian character of the set to generate a progressive splitting graph where the progressive splitting graph comprises radicals as nodes, formulating an optimization problem to find an optimal set of radicals to represent the set of East Asian characters using maximum likelihood and minimum description length and solving the optimization problem for the optimal set of radicals. Another exemplary technique includes selecting an optimal set of radicals by using a general function that characterizes a radical with respect to other East Asian characters and a complex function that characterizes complexity of a radical.

Type: Application

Filed: February 28, 2007

Publication date: August 28, 2008

Applicant: Microsoft Corporation

Inventors: Shi Han, Yu Zou, Ming Chang, Peng Liu, Yi-Jian Wu, Lei Ma, Frank Soong, Dongmei Zhang, Jian Wang
Using a matrix input to improve stroke-entry of Chinese characters into a computer

Patent number: 7408537

Abstract: A method and system for entering Chinese characters into a computer by entering the size and shape of their strokes via a matrix, such as the 3×3 arrangement of the numbers one through nine found on a cell phone.

Type: Grant

Filed: October 25, 2004

Date of Patent: August 5, 2008

Inventor: Robert B. O'Dell
Correcting segmentation errors in OCR

Patent number: 7406201

Abstract: A method for encoding characters includes identifying one or more sequences of the character codes that are likely to be generated due a segmentation error in application of a pattern recognition process, and associating a respective extension character code with each of the sequences. The area of an image containing characters is divided into segments, such that each segment contains approximately one character. The pattern recognition process is applied to each of the segments in order to generate an input string of character codes. At least one of the identified sequences of the character codes in the input string is replaced with the respective extension character code so as to generate a modified string. The output string is determined by comparing the modified string to a directory of known strings.

Type: Grant

Filed: December 4, 2003

Date of Patent: July 29, 2008

Assignee: International Business Machines Corporation

Inventors: Andre Heilper, Eugene Walach
Chinese Character Learning System

Publication number: 20080170788

Abstract: A system of materials facilitates teaching Chinese characters to a child in progressive stages, whether or not the teacher is fluent in Chinese. Each stage associates a multi-colored object with a correspondingly multi-colored Chinese character that represents the object. For a child from birth to two years, a first stage material animates the object and morphs the object into the corresponding Chinese character. For a child two to four years, a second stage material to be read to the child by the teacher displays the object adjacent to the character, and material provides interactive means for the child to associate the object with the character. For a child four to seven years, a third stage story book displays text containing a sentence made of multiple characters and a scene corresponding to the meaning of the sentence, and for a child seven to nine years, a fourth stage group book presents multiple Chinese characters that share a common group element, whether meaning or sound.

Type: Application

Filed: January 16, 2008

Publication date: July 17, 2008

Inventor: Xiaohui Guo
Scalable stroke font system and method

Patent number: 7362898

Abstract: A method of creating font format data from source font data includes analyzing the source font data to obtain glyph data for a plurality of glyphs, dissecting the glyph data, extracting midline data from the dissected glyph data, classifying the midline data as unique element data and common element data, associating unique element data and common element data to each glyph of the plurality of glyphs.

Type: Grant

Filed: July 26, 2007

Date of Patent: April 22, 2008

Assignee: Research In Motion Limited

Inventors: Vadim Fux, Denis N. Fedotenko
Font retrieval apparatus and method

Patent number: 7346845

Abstract: Out of a lot of fonts, the desired font is quickly found. For this purpose, out of a plurality of portions constituting a character displayed in a partial image retrieval area, the desired portion is selected. A retrieval area is clicked by a user, to retrieve a font on the basis of the selected portion. A list of the results of retrieval is displayed in a retrieval result area. Even if a font name is not memorized, a desired font can be found.

Type: Grant

Filed: November 29, 2002

Date of Patent: March 18, 2008

Assignee: Fujifilm Corporation

Inventor: Atsushi Teshima
Pictographic Character Search Method

Publication number: 20080063281

Abstract: To make searching for pictographic characters, such as Chinese characters, easier for novice learners of languages using pictographic characters, a subset of pictographic character parts of the pictographic character is generated. Then, the subset of the pictographic character parts is used to generate the pictographic character based on the subset of the pictographic character parts.

Type: Application

Filed: September 7, 2006

Publication date: March 13, 2008

Inventor: Roger Dunn
Image reading apparatus

Patent number: 7327881

Abstract: A labeling process unit groups a continuous black pixel area as one group in the binary image data read by an image input device, and extracts the group bounding rectangle information about the group. A row extracting process unit extracts row rectangle information from the position information about the extracted group bounding rectangle. An overlap integrating process unit determines the overlap between the group bounding rectangles contained in the extracted row rectangle, and performs an overlap integrating process of integrating overlapping groups into one group. The ratio of the number of group bounding rectangles contained in the row rectangle before performing the overlap integrating process to the number of the group bounding rectangles contained in the row rectangle after performing the overlap integrating process is obtained, and the language of the characters written in the original is determined based on the difference in ratio.

Type: Grant

Filed: March 4, 2004

Date of Patent: February 5, 2008

Assignee: PFU Limited

Inventor: Nobuyuki Okubo
Character recognition system and method

Patent number: 7327883

Abstract: A system and method for translating a written document into a computer readable document by recognizing the character written on the document aim at recognizing typed or printed, especially hand-printed or handwritten characters, in the various fields of a form. Providing a pixel representation of the written document, the method allows translating a written document into a computer readable document by i) identifying at least one field into the pixel representation of the document; ii) segmenting each field so as to yield at least one segmented symbol; iii) applying a character recognition method on each segmented symbol; and iii) assigning a computer-readable code to each recognized character resulting from the character recognition method. The character recognition method includes doing a vector quantization on each segmented symbol, and doing a vector classification using a vector base. A learning base is also created based on the optimal elliptic separation method.

Type: Grant

Filed: March 11, 2003

Date of Patent: February 5, 2008

Assignee: IMDS Software Inc.

Inventor: Jean-Pierre Polonowski
TWO TIERED TEXT RECOGNITION

Publication number: 20080025610

Abstract: Systems and methods that exploit unique properties of a language script (e.g., condition joining rules for Arabic language) to enable a two tier text recognition. In such two tier system, one tier can recognize predetermined groups of linked letters that are connected based on joining rules of a language associated with the text, and another tier dissects (and recognizes) such linked letters to respective constituent letters that form the predetermined group of linked letters. Various classifiers and artificial intelligence components can further facilitate text recognition at each level.

Type: Application

Filed: July 31, 2006

Publication date: January 31, 2008

Applicant: MICROSOFT CORPORATION

Inventor: Ahmad A. Abdulkader
Method and apparatus for recognition of handwritten symbols

Publication number: 20080008387

Abstract: A method and apparatus for recognition of handwritten symbols. A plurality of strokes is received at a common input region of an electronic device, wherein the plurality of strokes in combination defines a plurality of symbols. Sequential combinations of the plurality of strokes are analyzed with a plurality of symbol recognition engines to determine at least one possible symbol of the plurality of symbols defined by the plurality of strokes, wherein at least one of the plurality of symbol recognition engines is configured to identify symbols comprising a particular number of strokes.

Type: Application

Filed: July 6, 2006

Publication date: January 10, 2008

Inventors: Yi-Hsun E. Cheng, Nada P. Matic, Raymond A. Trent
Method of converting a linework data format to the format of a page description language

Patent number: 7317543

Abstract: A method for converting image data coded with run lengths to the format of a page description language, such as PostScript or PDF, the run lengths identifying how many image points of one color follow one another in an image row, includes utilizing the run lengths of the same color that overlap in successive image rows to form an object, and describing the object with operators from the page description language. Objects of the same color can be combined to form one object. An object is described in the page description language by the image mask operator and an associated bitmap, or by a polygon train that connects reference points on the edge of the object.

Type: Grant

Filed: January 24, 2003

Date of Patent: January 8, 2008

Assignee: Heidelberger Druckmaschinen AG

Inventor: Frank Gnutzmann
Coordinate detection device with improved operability and method of detecting coordinates

Patent number: 7307622

Abstract: A coordinate detection device is provided, which device includes an input unit which has a surface thereof to which a coordinate value is input by an input means, a calculation unit which calculates a difference between previous and current coordinate values input by the input unit, and a setting unit which sets, in the calculation unit, a coordinate value input last before the input means is detached from the surface of said input unit as the previous coordinate value to a coordinate value input first after the input means is detached from the surface of the input unit.

Type: Grant

Filed: December 27, 2000

Date of Patent: December 11, 2007

Assignee: Fujitsu Takamisawa Component Limited

Inventor: Takuya Uchiyama
Stroke segmentation for template-based cursive handwriting recognition

Patent number: 7302099

Abstract: Ink strokes of cursive writing are segmented to make the cursive writing more like print writing, particularly with respect to the number of strokes of a character. A stroke-segmentation module first finds the local extrema points on a stroke of input ink. Then the local extrema points are stepped through, two (or three) at a time. The stroke-segmentation module may compare the three (or four) ink segments that are adjacent to the two (or three) local extrema points to a set of predefined stroke-segmentation patterns to find a closest matching pattern. Strokes are then segmented based on a stroke-segmentation rule that corresponds to the closest matching pattern. Additional stroke segmentation may be performed based on the change of curvature of the segmented ink strokes. Then, a character-recognition module performs character recognition processing by comparing the segmented ink strokes to prototype samples at least some of which have been similarly segmented.

Type: Grant

Filed: November 10, 2003

Date of Patent: November 27, 2007

Assignee: Microsoft Corporation

Inventors: Qi Zhang, Henry A. Rowley, Ahmad A. Abdulkader, Angshuman Guha
Ink input region adjustments

Patent number: 7295206

Abstract: Aspects of the present invention relate to the creation of an ink font. Based on characteristics of handwritten characters, the collection of characters may be scaled so as to adjust the size of the font to match predefined size values or relationships.

Type: Grant

Filed: January 31, 2005

Date of Patent: November 13, 2007

Assignee: Microsoft Corporation

Inventor: Zhouchen Lin
Method and apparatus for providing foreign language text display when encoding is not available

Patent number: 7260780

Abstract: A method and apparatus include referencing a phonetic language database that includes double-byte font entries and associated phonetic representations of the double-byte font entries. At least one of the double-byte font entries is used to obtain a phonetic representation of the used at least one double-byte font. The phonetic representation is displayed on a display device.

Type: Grant

Filed: January 3, 2005

Date of Patent: August 21, 2007

Assignee: Microsoft Corporation

Inventor: Ji Ma
Scalable stroke font system and method

Patent number: 7251365

Abstract: A method of creating font format data from source font data includes analyzing the source font data to obtain glyph data for a plurality of glyphs, dissecting the glyph data, extracting midline data from the dissected glyph data, classifying the midline data as unique element data and common element data, associating unique element data and common element data to each glyph of the plurality of glyphs.

Type: Grant

Filed: June 30, 2003

Date of Patent: July 31, 2007

Inventors: Vadim Fux, Denis N. Fedotenko
System and method for chinese input using a joystick

Patent number: 7218781

Abstract: A Chinese text entry system and method is provided to allow users to enter a character to a device such as a cellular phone or a PDA by adding a first few strokes required for the character using a joystick or its equivalent. By simply moving the joystick to add one or more strokes which are used to start writing a character, or in some case even before any stroke is added, a user can find a desired character from a displayed selection list. The selection list is context sensitive, varying depending on the last character entered, so that the user can be provided with the most possible candidates of the desired character.

Type: Grant

Filed: November 21, 2005

Date of Patent: May 15, 2007

Assignee: Tegic Communications, Inc.

Inventor: Pim van Meurs
System for distinguishing names in Asian writing systems

Patent number: 7212963

Abstract: A system for distinguishing names of persons in Chinese, which includes a computer. The computer includes at least an input, an output, a processor, and a memory and storage arrangement. Data is accessible by the processor, including at least names presently being used in Chinese, and name indicators and non-name indicators that respectively indicate probable presence and non-presence of a name. The system also includes software for performing computer processing including identifying names in Chinese text that has been input to the computer for names corresponding to names in the data for names presently being used in Chinese, name indicators, and non-name indicators. The processing includes comparing the location in the Chinese text of identified name indicators and non-name indicators relative to identified names in the text, and if predefined conditions are met, affirming that that an identified name is being used as a name in the text.

Type: Grant

Filed: June 11, 2002

Date of Patent: May 1, 2007

Assignee: Fuji Xerox Co., Ltd.

Inventor: Li Yang Li
ZhuYin symbol and tone mark input method, and electronic device

Patent number: 7197184

Abstract: A method for entering ZhuYin symbols and tone marks into electronic text is provided. The method comprises receiving an input symbol and providing a candidate list comprising the input symbol and the Chinese characters associated with the input symbol. After the selection of the input symbol from the candidate list, the selected input symbol is entered to the electronic text, and the tone marks of the ZhuYin symbol set are provided. After the selection of a tone mark, the selected tone mark is entered to the electronic text.

Type: Grant

Filed: September 30, 2004

Date of Patent: March 27, 2007

Assignee: Nokia Corporation

Inventor: Mikko Repka
Character recognition apparatus and method

Patent number: 7162086

Abstract: A character recognition apparatus which performs character recognition with increased accuracy on a document image including plural languages. A re-recognition range is set based on the result of recognition using a first recognition unit, and character recognition by a second recognition unit is performed within the set range. In the re-recognition range, if a similarity of the result of re-recognition is higher than that by the first recognition unit, the result of recognition by the first recognition unit is replaced with the result of recognition by the second recognition unit.

Type: Grant

Filed: July 9, 2003

Date of Patent: January 9, 2007

Assignee: Canon Kabushiki Kaisha

Inventor: Hiroaki Ikeda
Character recognition device and method for detecting erroneously read characters, and computer readable medium to implement character recognition

Patent number: 7133556

Abstract: A character recognition device to recognize characters in a text image read by an image scanner having a first recognition device to recognize the characters in the text image using a first character recognition method and a second recognition device to recognize the characters in the text image using a second character recognition method different from the first character recognition method. An extraction device extracts locations of recognized characters in the text image wherein the recognition results of the first recognition device do not coincide with the recognition results of the second recognition device. An output device outputs character recognition results designating the non-coinciding locations extracted by the extraction device.

Type: Grant

Filed: September 13, 2000

Date of Patent: November 7, 2006

Assignee: Fujitsu Limited

Inventors: Tsutomu Matsushita, Norikazu Shiiya, Toshikazu Hori, Kouji Yoshimoto
Intelligent default selection in an on-screen keyboard

Patent number: 7130846

Abstract: Systems and methods are described for intelligent default selection of characters to be entered via an on-screen keyboard. Based on one to several criteria, a character most likely to be selected for entry via the on-screen keyboard during a search request is determined and a selector is positioned at that particular character. If that character is indeed the character the user wishes to enter, the user does not have to execute any navigation steps to enter the character, but can—with a single actuation—enter that character. In many instances, the user will only have to enter the selection without first having to navigate to the selection. As a result, the number of times buttons need to be actuated by the user to enter a character string can be significantly reduced.

Type: Grant

Filed: June 10, 2003

Date of Patent: October 31, 2006

Assignee: Microsoft Corporation

Inventors: Daniel Danker, Steven Wasserman
System and method of context-based sorting of character strings for use in data base applications

Patent number: 7130470

Abstract: A method and system for context-based sorting of character strings. A first sorting weight of a current character of a character string is determined from a first table. The first sorting weight is stored. Provided the current character is a predetermined character, a second table is accessed. A second sorting weight of the current character is determined from the location of a preceding character within the second table. The first sorting weight is replaced with the second sorting weight for the current character. Embodiments of the present invention provide an efficient method of context-based sorting in languages, such as Japanese, where the sorting weight of a character can be altered by the preceding character.

Type: Grant

Filed: March 15, 2002

Date of Patent: October 31, 2006

Assignee: Oracle International Corporation

Inventor: Ching Lan Ho
System and method for chinese input using a joystick

Patent number: 7088861

Abstract: A Chinese text entry system and method is provided to allow users to enter a character to a device such as a cellular phone or a PDA by adding a first few strokes required for the character using a joystick or its equivalent. By simply moving the joystick to add one or more strokes which are used to start writing a character, or in some case even before any stroke is added, a user can find a desired character from a displayed selection list. The selection list is context sensitive, varying depending on the last character entered, so that the user can be provided with the most possible candidates of the desired character.

Type: Grant

Filed: February 9, 2004

Date of Patent: August 8, 2006

Assignee: America Online, Inc.

Inventor: Pim van Meurs
Chinese language input system based on graphic form

Patent number: 7058900

Abstract: Every Chinese character belongs to a small graphic form group which is created with respect to the radical of the character instead of character components. Every small graphic form group is incorporated into higher-level groups, i.e. medium graphic form groups, in turn every medium graphic form group is incorporated into higher-level groups, i.e. large graphic form groups. Input guidance is provided according to this hierarchy concerning graphic form. More specifically, the large groups are presented and one of them is selected by the first keystroke, the medium groups are presented and one of them is selected by the second keystroke, and the small groups are presented and one of them to which the desired character for input belongs is selected by the third keystroke. In this fashion, three keystrokes to a numeric keypad efficiently narrows down the alternative characters for conversion.

Type: Grant

Filed: July 12, 2002

Date of Patent: June 6, 2006

Assignee: Fujitsu Limited

Inventor: Jin Sugano

prev 1 2 3 4 5 6 next