Patents by Inventor Jian-Iai Zhou

Jian-Iai Zhou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Symbol graph generation in handwritten mathematical expression recognition

Patent number: 7885456

Abstract: A forward pass through a sequence of strokes representing a handwritten equation is performed from the first stroke to the last stroke in the sequence. At each stroke, a path score is determined for a plurality of symbol-relation pairs that each represents a symbol and its spatial relation to a predecessor symbol. A symbol graph having nodes and links is constructed by backtracking through the strokes from the last stroke to the first stroke and assigning scores to the links based on the path scores for the symbol-relation pairs. The symbol graph is used to recognize a mathematical expression based in part on the scores for the links and the mathematical expression is stored.

Type: Grant

Filed: March 29, 2007

Date of Patent: February 8, 2011

Assignee: Microsoft Corporation

Inventors: Yu Shi, Frank Kao-Ping Soong, Jian-Iai Zhou, Dongmei Zhang, legal representative
Auto segmentation based partitioning and clustering approach to robust endpointing

Patent number: 7680657

Abstract: Possible segmentations for an audio signal are scored based on distortions for feature vectors of the audio signal and the total number of segments in the segmentation. The scores are used to select a segmentation and the selected segmentation is used to identify a starting point and an ending point for a speech signal in the audio signal.

Type: Grant

Filed: August 15, 2006

Date of Patent: March 16, 2010

Assignee: Microsoft Corporation

Inventors: Yu Shi, Frank Kao-ping Soong, Jian-Iai Zhou
Parsimonious modeling by non-uniform kernel allocation

Patent number: 7680664

Abstract: A multi-state pattern recognition model with non-uniform kernel allocation is formed by setting a number of states for a multi-state pattern recognition model and assigning different numbers of kernels to different states. The kernels are then trained using training data to form the multi-state pattern recognition model.

Type: Grant

Filed: August 16, 2006

Date of Patent: March 16, 2010

Assignee: Microsoft Corporation

Inventors: Peng Liu, Jian-Iai Zhou, Frank Kao-ping Soong
Method of speech recognition using hidden trajectory Hidden Markov Models

Patent number: 7617104

Abstract: A method of speech recognition is provided that determines a production-related value, vocal-tract resonance frequencies in particular, for a state at a particular frame based on the production-related values associated with two preceding frames using a recursion. The production-related value is used to determine a probability distribution of the observed feature vector for the state. A probability for an observed value received for the frame is then determined from the probability distribution. Under one embodiment, the production-related value is determined using a noise-free recursive definition for the value. Use of the recursion substantially improves the decoding speed. When the decoding algorithm is applied to training data with known phonetic transcripts, forced alignment is created which improves the phone segmentation obtained from the prior art.

Type: Grant

Filed: January 21, 2003

Date of Patent: November 10, 2009

Assignee: Microsoft Corporation

Inventors: Li Deng, Jian-Iai Zhou, Frank Torsten Bernd Seide
MINIMUM DIVERGENCE BASED DISCRIMINATIVE TRAINING FOR PATTERN RECOGNITION

Publication number: 20080243503

Abstract: A method of providing discriminative training of a speech recognition unit is discussed. The method includes receiving an acoustic indication of an utterance having a hypothesis space and comparing the hypothesis space against a reference. The method measures the Kullback-Leibler Divergence (KLD) between the reference and the hypothesis space to adjust the reference and stores the adjusted reference on a tangible storage medium.

Type: Application

Filed: March 30, 2007

Publication date: October 2, 2008

Applicant: Microsoft Corporation

Inventors: Frank Kao-Ping Soong, Peng Liu, Jian-Iai Zhou, Dongmei Zhang
Method of speech recognition using time-dependent interpolation and hidden dynamic value classes

Patent number: 7050975

Abstract: A method of speech recognition is provided that identifies a production-related dynamics value by performing a linear interpolation between a production-related dynamics value at a previous time and a production-related target using a time-dependent interpolation weight. The hidden production-related dynamics value is used to compute a predicted value that is compared to an observed value of acoustics to determine the likelihood of the observed acoustics given a sequence of hidden phonological units. In some embodiments, the production-related dynamics value at the previous time is selected from a set of continuous values. In addition, the likelihood of the observed acoustics given a sequence of hidden phonological units is combined with a score associated with a discrete class of production-related dynamic values at the previous time to determine a score for a current phonological state.

Type: Grant

Filed: October 9, 2002

Date of Patent: May 23, 2006

Assignee: Microsoft Corporation

Inventors: Li Deng, Jian-Iai Zhou, Frank Torsten Bernd Seide, Asela J. R. Gunawardana, Hagai Attias, Alejandro Acero, Xuedong Huang
Method of speech recognition using time-dependent interpolation and hidden dynamic value classes

Publication number: 20060085191

Abstract: A speech signal is decoded by determining a production-related value for a current state based on an optimal production-related value at the end of a preceding state, the optimal production-related value being selected from a set of continuous values. The production-related value is used to determine a likelihood of a phone being represented by a set of observation vectors that are aligned with a path between the preceding state and the current state. The likelihood of the phone is combined with a score from the preceding state to determine a score for the current state, the score from the preceding state being associated with a discrete class of production-related values wherein the class matches the class of the optimal production-related value.

Type: Application

Filed: December 6, 2005

Publication date: April 20, 2006

Applicant: Microsoft Corporation

Inventors: Li Deng, Jian-Iai Zhou, Frank Seide, Asela Gunawardana, Hagai Attias, Alejandro Acero, Xuedong Huang
Method of speech recognition using hidden trajectory hidden markov models

Publication number: 20040143435

Abstract: A method of speech recognition is provided that determines a production-related value, vocal-tract resonance frequencies in particular, for a state at a particular frame based on the production-related values associated with two preceding frames using a recursion. The production-related value is used to determine a probability distribution of the observed feature vector for the state. A probability for an observed value received for the frame is then determined from the probability distribution. Under one embodiment, the production-related value is determined using a noise-free recursive definition for the value. Use of the recursion substantially improves the decoding speed. When the decoding algorithm is applied to training data with known phonetic transcripts, forced alignment is created which improves the phone segmentation obtained from the prior art.

Type: Application

Filed: January 21, 2003

Publication date: July 22, 2004

Inventors: Li Deng, Jian-Iai Zhou, Frank Torsten Bernd Seide
Method of speech recognition using time-dependent interpolation and hidden dynamic value classes

Publication number: 20040019483

Abstract: A method of speech recognition is provided that identifies a production-related dynamics value by performing a linear interpolation between a production-related dynamics value at a previous time and a production-related target using a time-dependent interpolation weight. The hidden production-related dynamics value is used to compute a predicted value that is compared to an observed value of acoustics to determine the likelihood of the observed acoustics given a sequence of hidden phonological units. In some embodiments, the production-related dynamics value at the previous time is selected from a set of continuous values. In addition, the likelihood of the observed acoustics given a sequence of hidden phonological units is combined with a score associated with a discrete class of production-related dynamic values at the previous time to determine a score for a current phonological state.

Type: Application

Filed: October 9, 2002

Publication date: January 29, 2004

Inventors: Li Deng, Jian-Iai Zhou, Frank Torsten Bernd Seide, Asela J.R. Gunawardana, Hagai Attias, Alejandro Acero, Xuedong Huang