Patents by Inventor Kai-Fu Lee
Kai-Fu Lee has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7424675Abstract: A language input architecture converts input strings of phonetic text to an output string of language text. The language input architecture has a search engine, one or more typing models, a language model, and one or more lexicons for different languages. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string. The language model provides probable conversion strings for each of the typing candidates based on probabilities of how likely a probable conversion output string represents the candidate string. The search engine combines the probabilities of the typing and language models to find the most probable conversion string that represents a converted form of the input string.Type: GrantFiled: September 27, 2004Date of Patent: September 9, 2008Assignee: Microsoft CorporationInventors: Kai-Fu Lee, Zheng Chen, Jian Han
-
Patent number: 7403888Abstract: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text.Type: GrantFiled: June 28, 2000Date of Patent: July 22, 2008Assignee: Microsoft CorporationInventors: Jian Wang, Gao Zhang, Jian Han, Zheng Chen, Xianoning Ling, Kai-Fu Lee
-
Patent number: 7302640Abstract: A language input architecture converts input strings of phonetic text to an output string of language text. The language input architecture has a search engine, typing models, a language model, and one or more lexicons for different languages. Each typing model is trained on real data, and learns probabilities of typing errors. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string. The language model provides probable conversion strings for each of the typing candidates based on probabilities of how likely a probable conversion output string represents the candidate string. The search engine combines the probabilities of the typing and language models to find the most probable conversion string that represents a converted form of the input string.Type: GrantFiled: October 21, 2004Date of Patent: November 27, 2007Assignee: Microsoft CorporationInventors: Kai-Fu Lee, Zheng Chen, Jian Han
-
Patent number: 7275029Abstract: A method for the joint optimization of language model performance and size is presented comprising developing a language model from a tuning set of information, segmenting at least a subset of a received textual corpus and calculating a perplexity value for each segment and refining the language model with one or more segments of the received corpus based, at least in part, on the calculated perplexity value for the one or more segments.Type: GrantFiled: June 30, 2000Date of Patent: September 25, 2007Assignee: Microsoft CorporationInventors: Jianfeng Gao, Kai-Fu Lee, Mingjing Li, Hai-Feng Wang, Dong-Feng Cai, Lee-Feng Chien
-
Patent number: 7216066Abstract: A method is presented comprising assigning each of a plurality of segments comprising a received corpus to a node in a data structure denoting dependencies between nodes, and calculating a transitional probability between each of the nodes in the data structure.Type: GrantFiled: February 22, 2006Date of Patent: May 8, 2007Assignee: Microsoft CorporationInventors: Shuo Di, Kai-Fu Lee, Lee-Feng Chien, Zheng Chen, Jianfeng Gao
-
Patent number: 7165019Abstract: A language input architecture converts input strings of phonetic text (e.g., Chinese Pinyin) to an output string of language text (e.g., Chinese Hanzi) in a manner that minimizes typographical errors and conversion errors that occur during conversion from the phonetic text to the language text. The language input architecture has a search engine, one or more typing models, a language model, and one or more lexicons for different languages. Each typing model is trained on real data, and learns probabilities of typing errors. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string.Type: GrantFiled: June 28, 2000Date of Patent: January 16, 2007Assignee: Microsoft CorporationInventors: Kai-Fu Lee, Zheng Chen, Jian Han
-
Publication number: 20060184341Abstract: A method is presented comprising assigning each of a plurality of segments comprising a received corpus to a node in a data structure denoting dependencies between nodes, and calculating a transitional probability between each of the nodes in the data structure.Type: ApplicationFiled: February 22, 2006Publication date: August 17, 2006Applicant: Microsoft CorporationInventors: Shuo Di, Kai-Fu Lee, Lee-Feng Chien, Zheng Chen, Jianfeng Gao
-
Patent number: 7020587Abstract: The generation and management of a language model data structure include assigning each segment of a received corpus to a node in a data structure that denotes dependencies between the respective nodes. A transitional probability between each of the nodes in the data structure is calculated. A frequency of occurrence is calculated for each item of the respective segments, and those nodes of the data structure associated with items that do not meet a minimum frequency of occurrence threshold are removed. The data structure may be managed across a system memory of a computer system and an extended memory of the computer system.Type: GrantFiled: June 30, 2000Date of Patent: March 28, 2006Assignee: Microsoft CorporationInventors: Shuo Di, Kai-Fu Lee, Lee-Feng Chien, Zheng Chen, Jianfeng Gao
-
Patent number: 6904402Abstract: A method for optimizing a language model is presented comprising developing an initial language model from a lexicon and segmentation derived from a received corpus using a maximum match technique, and iteratively refining the initial language model by dynamically updating the lexicon and re-segmenting the corpus according to statistical principles until a threshold of predictive capability is achieved.Type: GrantFiled: June 30, 2000Date of Patent: June 7, 2005Assignee: Microsoft CorporationInventors: Hai-Feng Wang, Chang-Ning Huang, Kai-Fu Lee, Shuo Di, Jianfeng Gao, Dong-Feng Cai, Lee-Feng Chien
-
Publication number: 20050086590Abstract: A language input architecture converts input strings of phonetic text (e.g., Chinese Pinyin) to an output string of language text (e.g., Chinese Hanzi) in a manner that minimizes typographical errors and conversion errors that occur during conversion from the phonetic text to the language text. The language input architecture has a search engine, one or more typing models, a language model, and one or more lexicons for different languages. Each typing model is trained on real data, and learns probabilities of typing errors. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string. The probable typing candidates may be stored in a database. The language model provides probable conversion strings for each of the typing candidates based on probabilities of how likely a probable conversion output string represents the candidate string.Type: ApplicationFiled: October 21, 2004Publication date: April 21, 2005Applicant: Microsoft CorporationInventors: Kai-Fu Lee, Zheng Chen, Jian Han
-
Publication number: 20050060138Abstract: A language input architecture receives input text (e.g., phonetic text of a character-based language) entered by a user from an input device (e.g., keyboard, voice recognition). The input text is converted to an output text (e.g., written language text of a character-based language). The language input architecture has a user interface that displays the output text and unconverted input text in line with one another. As the input text is converted, it is replaced in the UI with the converted output text. In addition to this in-line input feature, the UI enables in-place editing or error correction without requiring the user to switch modes from an entry mode to an edit mode. To assist with this in-place editing, the UI presents pop-up windows containing the phonetic text from which the output text was converted as well as first and second candidate lists that contain small and large sets of alternative candidates that might be used to replace the current output text.Type: ApplicationFiled: July 23, 2004Publication date: March 17, 2005Applicant: Microsoft CorporationInventors: Jian Wang, Gao Zhang, Jian Han, Zheng Chen, Xianoning Ling, Kai-Fu Lee
-
Publication number: 20050044495Abstract: A language input architecture converts input strings of phonetic text to an output string of language text. The language input architecture has a search engine, one or more typing models, a language model, and one or more lexicons for different languages. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string. The language model provides probable conversion strings for each of the typing candidates based on probabilities of how likely a probable conversion output string represents the candidate string. The search engine combines the probabilities of the typing and language models to find the most probable conversion string that represents a converted form of the input string.Type: ApplicationFiled: September 27, 2004Publication date: February 24, 2005Applicant: Microsoft CorporationInventors: Kai-Fu Lee, Zheng Chen, Jian Han
-
Patent number: 6848080Abstract: A language input architecture converts input strings of phonetic text to an output string of language text. The language input architecture has a search engine, one or more typing models, a language model, and one or more lexicons for different languages. The typing model is configured to generate a list of probable typing candidates that may be substituted for the input string based on probabilities of how likely each of the candidate strings was incorrectly entered as the input string. The language model provides probable conversion strings for each of the typing candidates based on probabilities of how likely a probable conversion output string represents the candidate string. The search engine combines the probabilities of the typing and language models to find the most probable conversion string that represents a converted form of the input string.Type: GrantFiled: June 28, 2000Date of Patent: January 25, 2005Assignee: Microsoft CorporationInventors: Kai-Fu Lee, Zheng Chen, Jian Han
-
Publication number: 20040243568Abstract: A search engine architecture is designed to handle a full range of user queries, from complex sentence-based queries to simple keyword searches. The search engine architecture includes a natural language parser that parses a user query and extracts syntactic and semantic information. The parser is robust in the sense that it not only returns fully-parsed results (e.g., a parse tree), but is also capable of returning partially-parsed fragments in those cases where more accurate or descriptive information in the user query is unavailable. A question matcher is employed to match the fully-parsed output and the partially-parsed fragments to a set of frequently asked questions (FAQs) stored in a database. The question matcher then correlates the questions with a group of possible answers arranged in standard templates that represent possible solutions to the user query. The search engine architecture also has a keyword searcher to locate other possible answers by searching on any keywords returned from the parser.Type: ApplicationFiled: March 22, 2004Publication date: December 2, 2004Inventors: Hai-Feng Wang, Kai-Fu Lee, Qiang Yang
-
Publication number: 20040210434Abstract: A method for optimizing a language model is presented comprising developing an initial language model from a lexicon and segmentation derived from a received corpus using a maximum match technique, and iteratively refining the initial language model by dynamically updating the lexicon and re-segmenting the corpus according to statistical principles until a threshold of predictive capability is achieved.Type: ApplicationFiled: May 10, 2004Publication date: October 21, 2004Applicant: Microsoft CorporationInventors: Hai-Feng Wang, Chang-Ning Huang, Kai-Fu Lee, Shuo Di, Jianfeng Gao, Dong-Feng Cai, Lee-Feng Chien
-
Patent number: 6766320Abstract: A search engine architecture is designed to handle a full range of user queries, from complex sentence-based queries to simple keyword searches. The search engine architecture includes a natural language parser that parses a user query and extracts syntactic and semantic information. The parser is robust in the sense that it not only returns fully-parsed results (e.g., a parse tree), but is also capable of returning partially-parsed fragments in those cases where more accurate or descriptive information in the user query is unavailable. A question matcher is employed to match the fully-parsed output and the partially-parsed fragments to a set of frequently asked questions (FAQs) stored in a database. The question matcher then correlates the questions with a group of possible answers arranged in standard templates that represent possible solutions to the user query. The search engine architecture also has a keyword searcher to locate other possible answers by searching on any keywords returned from the parser.Type: GrantFiled: August 24, 2000Date of Patent: July 20, 2004Assignee: Microsoft CorporationInventors: Hai-Feng Wang, Kai-Fu Lee, Qiang Yang
-
System and method for automatic subcharacter unit and lexicon generation for handwriting recognition
Patent number: 5757964Abstract: A system for automatic subcharacter unit and lexicon generation for handwriting recognition comprises a processing unit, a handwriting input device, and a memory wherein a segmentation unit, a subcharacter generation unit, a lexicon unit, and a modeling unit reside. The segmentation unit generates feature vectors corresponding to sample characters. The subcharacter generation unit clusters feature vectors and assigns each feature vector associated with a given cluster an identical label. The lexicon unit constructs a lexical graph for each character in a character set. The modeling unit generates a Hidden Markov Model for each set of identically-labeled feature vectors. After a first set of lexical graphs and Hidden Markov Models have been created, the subcharacter generation unit determines for each feature vector which Hidden Markov Model produces a highest likelihood value.Type: GrantFiled: July 29, 1997Date of Patent: May 26, 1998Assignee: Apple Computer, Inc.Inventors: Kai-Fu Lee, Yen-Lu Chow, Kamil Grajski -
Patent number: 5734791Abstract: The branching decision for each node in a vector quantization (VQ) binary tree is made by a simple comparison of a pre-selected element of the candidate vector with a stored threshold resulting in a binary decision for reaching the next lower level. Each node has a preassigned element and threshold value. Conventional centroid distance training techniques (such as LBG and k-means) are used to establish code-book indices corresponding to a set of VQ centroids. The set of training vectors are used a second time to select a vector element and threshold value at each node that approximately splits the data evenly. After processing the training vectors through the binary tree using threshold decisions, a histogram is generated for each code-book index that represents the number of times a training vector belonging to a given index set appeared at each index. The final quantization is accomplished by processing and then selecting the nearest centroid belonging to that histogram.Type: GrantFiled: December 31, 1992Date of Patent: March 31, 1998Assignee: Apple Computer, Inc.Inventors: Alejandro Acero, Kai-Fu Lee, Yen-Lu Chow
-
Patent number: 5602960Abstract: A speech recognition system for continuous Mandarin Chinese speech comprises a microphone, an A/D converter, a syllable recognition system, an integrated tone classifier, and a confidence score augmentor. The syllable recognition system generates N-best theories with initial confidence scores. The integrated tone classifier has a pitch estimator to estimate the pitch of the input once and a long-term tone analyzer to segment the estimated pitch according to the syllables of each of the N-best theories. The long-term tone analyzer performs long-term tonal analysis on the segmented, estimated pitch and generates a long-term tonal confidence signal. The confidence score augmentor receives the initial confidence scores and the long-term tonal confidence signals, modifies each initial confidence score according to the corresponding long-term tonal confidence signal, re-ranks the N-best theories according to the augmented confidence scores, and outputs the N-best theories.Type: GrantFiled: September 30, 1994Date of Patent: February 11, 1997Assignee: Apple Computer, Inc.Inventors: Hsiao-Wuen Hon, Yen-Lu Chow, Kai-Fu Lee
-
Patent number: 5577135Abstract: A handwriting signal processing front-end method and apparatus for a handwriting training and recognition system which includes non-uniform segmentation and feature extraction in combination with multiple vector quantization. In a training phase, digitized handwriting samples are partitioned into segments of unequal length. Features are extracted from the segments and are grouped to form feature vectors for each segment. Groups of adjacent from feature vectors are then combined to form input frames. Feature-specific vectors are formed by grouping features of the same type from each of the feature vectors within a frame. Multiple vector quantization is then performed on each feature-specific vector to statistically model the distributions of the vectors for each feature by identifying clusters of the vectors and determining the mean locations of the vectors in the clusters. Each mean location is represented by a codebook symbol and this information is stored in a codebook for each feature.Type: GrantFiled: March 1, 1994Date of Patent: November 19, 1996Assignee: Apple Computer, Inc.Inventors: Kamil A. Grajski, Yen-Lu Chow, Kai-Fu Lee