Patents by Inventor Jian-Lai Zhou
Jian-Lai Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8234116
Abstract: Measurement of Kullback-Leibler Divergence (KLD) between hidden Markov models (HMM) of acoustic units utilizes an unscented transform to approximate KLD between Gaussian mixtures. Dynamic programming equalizes the number of states between HMMs having a different number of states, while the total KLD of the HMMs is obtained by summing individual KLDs calculated by state pair by state pair comparisons.
Type: Grant
Filed: August 22, 2006
Date of Patent: July 31, 2012
Assignee: Microsoft Corporation
Inventors: Peng Liu, Frank Kao-Ping K. Soong, Jian-Lai Zhou
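Illustration (not taken from the patent text): the abstract above mentions approximating the KLD between Gaussian mixtures with an unscented transform. The following is a minimal sketch of one commonly used sigma-point scheme, assuming diagonal covariances and mixtures given as lists of (weight, mean, variance) triples. The function names and the data layout are hypothetical, and the dynamic-programming step that equalizes HMM state counts is not shown.

```python
import math


def log_gauss_diag(x, mean, var):
    """Log density of a diagonal-covariance Gaussian at point x."""
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


def log_gmm(x, gmm):
    """Log density of a mixture given as (weight, mean, var) triples."""
    terms = [math.log(w) + log_gauss_diag(x, m, v) for w, m, v in gmm]
    top = max(terms)  # log-sum-exp for numerical stability
    return top + math.log(sum(math.exp(t - top) for t in terms))


def sigma_points(mean, var):
    """Return 2*d points: mean +/- sqrt(d * var[k]) along each axis k."""
    d = len(mean)
    points = []
    for k in range(d):
        offset = math.sqrt(d * var[k])
        for sign in (1.0, -1.0):
            p = list(mean)
            p[k] += sign * offset
            points.append(p)
    return points


def kld_unscented(f, g):
    """Approximate KL(f || g) by averaging log(f/g) over the sigma
    points of each component of f, weighted by the component weight."""
    total = 0.0
    for w, mean, var in f:
        pts = sigma_points(mean, var)
        avg = sum(log_gmm(p, f) - log_gmm(p, g) for p in pts) / len(pts)
        total += w * avg
    return total
```

For two single Gaussians the log ratio is quadratic, so this approximation coincides with the closed-form KLD; for genuine mixtures it is only an approximation. Under the abstract's description, the per-state values would then be summed over aligned state pairs to obtain a total for the HMMs.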
-
Patent number: 7929767
Abstract: A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing part of a multi-path framework is described. The part of the multi-path framework includes a subordinate sub-expression analysis component. A method for analyzing a handwritten mathematical expression for a subordinate sub-expression includes identifying sub-expressions based on dominant symbols and determining a character for potential dominant symbols based upon sub-expression information. A determination may be made whether an expression structure candidate is valid and valid expression structure candidates may be stored in a parse tree.
Type: Grant
Filed: June 20, 2005
Date of Patent: April 19, 2011
Assignee: Microsoft Corporation
Inventors: Mingqing Xu, Yu Zou, Lei Huang, Jian-Lai Zhou, Xiaohui Hou, Zhouchen Lin, Dongmei Zhang, Jian Wang
-
Patent number: 7890325
Abstract: Speech recognition such as command and control speech recognition generally uses a context free grammar to constrain the decoding process. Word or subword background models are constructed to repopulate dynamic hypothesis space, especially when word sparseness is at issue. The background models can be later used in speech recognition. During speech recognition, background and conventional context free grammar decoding are used to measure confidence. The discussion above is merely provided for general background information and is not intended to be used as an aid in determining the scope of the claimed subject matter.
Type: Grant
Filed: March 16, 2006
Date of Patent: February 15, 2011
Assignee: Microsoft Corporation
Inventors: Peng Liu, Ye Tian, Jian-Lai Zhou, Frank Kao-Ping K. Soong
-
Patent number: 7848917
Abstract: Multiple input modalities are selectively used by a user or process to prune a word graph. Pruning initiates rescoring in order to generate a new word graph with a revised best path.
Type: Grant
Filed: March 30, 2006
Date of Patent: December 7, 2010
Assignee: Microsoft Corporation
Inventors: Frank Kao-Ping K. Soong, Jian-Lai Zhou, Peng Liu
-
Patent number: 7805301
Abstract: A reliable full covariance matrix estimation algorithm for pattern unit's state output distribution in pattern recognition system is discussed. An intermediate hierarchical tree structure is built to relate models for product units. Full covariance matrices of pattern unit's state output distribution are estimated based on all the related nodes in the tree.
Type: Grant
Filed: July 1, 2005
Date of Patent: September 28, 2010
Assignee: Microsoft Corporation
Inventors: Ye Tian, Frank Kao-Ping Soong, Jian-Lai Zhou
-
Patent number: 7761287
Abstract: An opinion system infers the opinion of a sentence of a product review based on a probability that the sentence contains certain sequences of parts of speech that are commonly used to express an opinion as indicated by the training data and the probabilities of the training data. When provided with the sentence, the opinion system identifies possible sequences of parts of speech of the sentence that are commonly used to express an opinion and the probability that the sequence is the correct sequence for the sentence. For each sequence, the opinion system then retrieves a probability derived from the training data that the sequence contains an opinion word that expresses an opinion. The opinion system then retrieves a probability from the training data that the opinion words of the sentence are used to express an opinion. The opinion system then combines the probabilities to generate an overall probability that the sentence with that sequence expresses an opinion.
Type: Grant
Filed: October 23, 2006
Date of Patent: July 20, 2010
Assignee: Microsoft Corporation
Inventors: Hua Li, Jian-Lai Zhou, Dongmei Zhang, legal representative, Zheng Chen, Jian Wang
-
Patent number: 7561737
Abstract: A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing a multi-path framework is described. The framework may include symbol grouping and recognition, tabular structure analysis, subordinate sub-expression analysis, subscript/superscript analysis and character determination, and semantic structure analysis components. A method for recognizing a handwritten mathematical expression includes receiving a plurality of input strokes corresponding to a handwritten mathematical expression and providing a candidate list of recognized candidate expressions based upon the input strokes. Input strokes are grouped into symbols, tabular structures are determined, dominant symbol candidates and subordinate symbols are determined, and subscript and superscript structures are determined.
Type: Grant
Filed: June 20, 2005
Date of Patent: July 14, 2009
Assignee: Microsoft Corporation
Inventors: Yu Zou, Lei Huang, Mingqing Xu, Jian-Lai Zhou, Yue Li, Dongmei Zhang, Xiaohui Hou, Wenli Zhu, Hongyun Yang, Xianfang Wang, Jiang Wu, Jian Wang
-
Patent number: 7561738
Abstract: A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing a part of a multi-path framework is described. The part of the multi-path framework includes a symbol grouping and recognition component that is designed to group input strokes that correspond to a handwritten mathematical expression into a symbol and to recognize the symbol based upon information associated with the grouped input strokes. A method for grouping and recognizing symbols of a handwritten mathematical expression includes receiving a plurality of input strokes corresponding to a handwritten mathematical expression, grouping the plurality of input strokes into symbols, and recognizing the symbols based upon information, such as shape and time series information, associated with the grouped input strokes. Intra-group and inter-group information associated with the plurality of input strokes may be utilized to group the input strokes.
Type: Grant
Filed: June 20, 2005
Date of Patent: July 14, 2009
Assignee: Microsoft Corporation
Inventors: Yu Zou, Lei Huang, Jian-Lai Zhou, Mingqing Xu, Yue Li, Xiaohui Hou, Dongmei Zhang, Jian Wang
-
Patent number: 7561739
Abstract: A mechanism for recognizing and inputting handwritten mathematical expressions into a computer by providing part of a multi-path framework is described. The part of the multi-path framework includes a subscript/superscript analysis and character determination component that is designed to identify subscript and superscript elements. A method for analyzing a handwritten mathematical expression includes receiving a set of symbols corresponding to handwritten mathematical expression input strokes, identifying subscript and/or superscript structures, and determining a character for each symbol of the set. A graph of vertexes and edges may be created based upon the set of symbols and the graph may be searched to determine optimized candidates.
Type: Grant
Filed: June 20, 2005
Date of Patent: July 14, 2009
Assignee: Microsoft Corporation
Inventors: Mingqing Xu, Yu Zou, Lei Huang, Jian-Lai Zhou, Xianfang Wang, Xiaohui Hou, Dongmei Zhang, Zhouchen Lin, Jian Wang
-
Patent number: 7506271
Abstract: Systems, methods, and computer-readable media for processing electronic ink receive an electronic ink input; convert the electronic ink input to a first machine-generated object using handwriting recognition; display the first machine-generated object on a display; receive speech input; convert the speech input to a second machine-generated object using speech recognition; generate a list of machine-generated objects based on the electronic ink input, the list including the first machine-generated object and alternative machine-generated objects and functioning as a dictionary for converting the speech input; and replace the first machine-generated object with the second machine-generated object. A user may confirm that the second machine-generated object should replace the first machine-generated object.
Type: Grant
Filed: December 15, 2003
Date of Patent: March 17, 2009
Assignee: Microsoft Corporation
Inventors: Jian Wang, Jian-Lai Zhou, Jiang Wu, Hongyun Yang, Xianfang Wang, Wenli Zhu
-
Patent number: 7496512
Abstract: A method and apparatus are provided for refining segmental boundaries in speech waveforms. Contextual acoustic feature similarities are used as a basis for clustering adjacent phoneme speech units, where each adjacent pair of phoneme speech units includes a segmental boundary. A refining model is trained for each cluster and used to refine boundaries of contextual phoneme speech units forming the clusters.
Type: Grant
Filed: April 13, 2004
Date of Patent: February 24, 2009
Assignee: Microsoft Corporation
Inventors: Yong Zhao, Min Chu, Jian-lai Zhou, Lijuan Wang
-
Patent number: 7496513
Abstract: Input is received from at least two different input sources. Information from these sources is combined together to provide a result. In a particular example, input from one source corresponds to potential recognition candidates, and input from another source corresponds to other potential candidates. These candidates are combined together to select a result.
Type: Grant
Filed: June 28, 2005
Date of Patent: February 24, 2009
Assignee: Microsoft Corporation
Inventors: Frank Kao-Ping Soong, Jian-Lai Zhou, Ye Tian
-
Publication number: 20080240570
Abstract: A forward pass through a sequence of strokes representing a handwritten equation is performed from the first stroke to the last stroke in the sequence. At each stroke, a path score is determined for a plurality of symbol-relation pairs that each represents a symbol and its spatial relation to a predecessor symbol. A symbol graph having nodes and links is constructed by backtracking through the strokes from the last stroke to the first stroke and assigning scores to the links based on the path scores for the symbol-relation pairs. The symbol graph is used to recognize a mathematical expression based in part on the scores for the links and the mathematical expression is stored.
Type: Application
Filed: March 29, 2007
Publication date: October 2, 2008
Applicant: Microsoft Corporation
Inventors: Yu Shi, Frank Kao-Ping Soong, Jian-lai Zhou, Dongmei Zhang
-
Publication number: 20080097758
Abstract: An opinion system infers the opinion of a sentence of a product review based on a probability that the sentence contains certain sequences of parts of speech that are commonly used to express an opinion as indicated by the training data and the probabilities of the training data. When provided with the sentence, the opinion system identifies possible sequences of parts of speech of the sentence that are commonly used to express an opinion and the probability that the sequence is the correct sequence for the sentence. For each sequence, the opinion system then retrieves a probability derived from the training data that the sequence contains an opinion word that expresses an opinion. The opinion system then retrieves a probability from the training data that the opinion words of the sentence are used to express an opinion. The opinion system then combines the probabilities to generate an overall probability that the sentence with that sequence expresses an opinion.
Type: Application
Filed: October 23, 2006
Publication date: April 24, 2008
Applicant: Microsoft Corporation
Inventors: Hua Li, Jian-Lai Zhou, Zheng Chen, Jian Wang, Dongmei Zhang
-
Publication number: 20080059184
Abstract: Measurement of Kullback-Leibler Divergence (KLD) between hidden Markov models (HMM) of acoustic units utilizes an unscented transform to approximate KLD between Gaussian mixtures. Dynamic programming equalizes the number of states between HMMs having a different number of states, while the total KLD of the HMMs is obtained by summing individual KLDs calculated by state pair by state pair comparisons.
Type: Application
Filed: August 22, 2006
Publication date: March 6, 2008
Applicant: Microsoft Corporation
Inventors: Frank Kao-Ping K. Soong, Jian-Lai Zhou, Peng Liu
-
Publication number: 20080059183
Abstract: A multi-state pattern recognition model with non-uniform kernel allocation is formed by setting a number of states for a multi-state pattern recognition model and assigning different numbers of kernels to different states. The kernels are then trained using training data to form the multi-state pattern recognition model.
Type: Application
Filed: August 16, 2006
Publication date: March 6, 2008
Applicant: Microsoft Corporation
Inventors: Peng Liu, Jian-lai Zhou, Frank Kao-ping Soong
-
Publication number: 20080059169
Abstract: Possible segmentations for an audio signal are scored based on distortions for feature vectors of the audio signal and the total number of segments in the segmentation. The scores are used to select a segmentation and the selected segmentation is used to identify a starting point and an ending point for a speech signal in the audio signal.
Type: Application
Filed: August 15, 2006
Publication date: March 6, 2008
Applicant: Microsoft Corporation
Inventors: Yu Shi, Frank Kao-ping Soong, Jian-lai Zhou
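Illustration (not taken from the application text): the abstract above describes scoring segmentations by feature distortion plus the number of segments. The sketch below shows one possible reading, assuming scalar per-frame features, squared deviation from the segment mean as the distortion, and a fixed per-segment penalty. The function name, the penalty parameter, and the dynamic-programming search are assumptions made for illustration only.

```python
import math


def best_segmentation(features, penalty):
    """Return the (start, end) frame ranges that minimize
    total within-segment distortion + penalty * number of segments."""
    n = len(features)
    # Prefix sums give each segment's distortion in constant time.
    s1, s2 = [0.0], [0.0]
    for x in features:
        s1.append(s1[-1] + x)
        s2.append(s2[-1] + x * x)

    def distortion(i, j):
        """Sum of squared deviations from the mean over frames i..j-1."""
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    best = [0.0] + [math.inf] * n  # best[j]: lowest score for frames 0..j-1
    back = [0] * (n + 1)           # back[j]: start of the last segment
    for j in range(1, n + 1):
        for i in range(j):
            score = best[i] + distortion(i, j) + penalty
            if score < best[j]:
                best[j], back[j] = score, i

    # Walk the back-pointers to recover the segment boundaries.
    segments = []
    j = n
    while j > 0:
        segments.append((back[j], j))
        j = back[j]
    segments.reverse()
    return segments
```

In this toy setting, a segment whose mean feature value (for example, frame energy) is clearly higher than its neighbours could be taken as the speech region, and its boundaries as the starting and ending points.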
-
Publication number: 20070239432
Abstract: Multiple input modalities are selectively used by a user or process to prune a word graph. Pruning initiates rescoring in order to generate a new word graph with a revised best path.
Type: Application
Filed: March 30, 2006
Publication date: October 11, 2007
Applicant: Microsoft Corporation
Inventors: Frank Soong, Jian-Lai Zhou, Peng Liu
-
Publication number: 20070219797
Abstract: Speech recognition such as command and control speech recognition generally uses a context free grammar to constrain the decoding process. Word or subword background models are constructed to repopulate dynamic hypothesis space, especially when word sparseness is at issue. The background models can be later used in speech recognition. During speech recognition, background and conventional context free grammar decoding are used to measure confidence. The discussion above is merely provided for general background information and is not intended to be used as an aid in determining the scope of the claimed subject matter.
Type: Application
Filed: March 16, 2006
Publication date: September 20, 2007
Applicant: Microsoft Corporation
Inventors: Peng Liu, Ye Tian, Jian-Lai Zhou, Frank Soong
-
Publication number: 20070219796
Abstract: A Weighted Likelihood Ratio Hidden Markov Model is utilized for speech processing. The model emphasizes spectral peaks when comparing spectra. Probability density functions for states in the model can be developed with weights based on the comparison.
Type: Application
Filed: March 20, 2006
Publication date: September 20, 2007
Applicant: Microsoft Corporation
Inventors: Chao Huang, Frank Soong, Jian-lai Zhou