Patents by Inventor Qi P. Li

Qi P. Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for interactive language instruction

Patent number: 7149690

Abstract: A method and apparatus for interactive language instruction is provided that displays text files for processing, provides key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.

Type: Grant

Filed: September 9, 1999

Date of Patent: December 12, 2006

Assignee: Lucent Technologies Inc.

Inventors: Katherine Grace August, Nadine Blackwood, Qi P. Li, Michelle McNerney, Chi-Lin Shih, Arun Chandrasekaran Surendran, Jialin Zhong, Qiru Zhou

Binding box for packaging and recycling

Publication number: 20040206875

Abstract: A binding box for packaging and paper recycling comprises a base, four trays, and four supporting parts connecting the trays and base, respectively. Each of the trays holds a corner to a quarter of the area of the papers or materials which need to be bound. The four trays work together to hold papers or any materials that need to be bound, but there is a clearance between any two of the trays. Given the clearance between the trays and the clearance between the base and trays, human hands, ropes, or binding tools can go through to bind the papers without touching any one of the trays or moving the objects that need to be bound. The ropes are placed in the clearances between trays. Once the binding is finished, a stack of bound papers or materials can be removed directly from the top of the binding box without moving the box.

Type: Application

Filed: April 21, 2003

Publication date: October 21, 2004

Inventors: Joy Y. Li, Qi P. Li

Method and apparatus for performing real-time endpoint detection in automatic speech recognition

Patent number: 6782363

Abstract: A method and apparatus for performing real-time endpoint detection for use in automatic speech recognition. A filter is applied to the input speech signal and the filter output is then evaluated with use of a state transition diagram (i.e., a finite state machine). The filter is advantageously designed in light of several criteria in order to increase the accuracy and robustness of detection. The state transition diagram advantageously has three states. The endpoints which are detected may then be advantageously applied to the problem of energy normalization of the speech portion of the signal.
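
The filter-then-state-machine idea in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the input is taken to be a list of per-frame energies, a causal moving average stands in for the patent's filter, and the three state names, the `threshold`, and the `hangover` count are hypothetical.

```python
from enum import Enum


class State(Enum):
    # Hypothetical names for the three states; the abstract does not name them.
    SILENCE = 0
    IN_SPEECH = 1
    LEAVING_SPEECH = 2


def smooth(energies, width=3):
    """Causal moving average, used here as a stand-in for the patent's filter."""
    out = []
    for i in range(len(energies)):
        window = energies[max(0, i - width + 1): i + 1]
        out.append(sum(window) / len(window))
    return out


def detect_endpoints(energies, threshold=0.5, hangover=2):
    """Return (start, end) frame index pairs for speech segments, end exclusive."""
    state = State.SILENCE
    segments = []
    start = 0
    quiet = 0  # consecutive below-threshold frames seen while leaving speech
    for i, value in enumerate(smooth(energies)):
        if state is State.SILENCE:
            if value >= threshold:
                state, start = State.IN_SPEECH, i
        elif state is State.IN_SPEECH:
            if value < threshold:
                state, quiet = State.LEAVING_SPEECH, 1
        else:  # LEAVING_SPEECH
            if value >= threshold:
                state = State.IN_SPEECH  # short dip, speech continues
            else:
                quiet += 1
                if quiet >= hangover:
                    segments.append((start, i - quiet + 1))
                    state = State.SILENCE
    # Close a segment that is still open when the input ends.
    if state is State.IN_SPEECH:
        segments.append((start, len(energies)))
    elif state is State.LEAVING_SPEECH:
        segments.append((start, len(energies) - quiet))
    return segments


endpoints = detect_endpoints([0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0, 0])
```

Because each frame is processed once and only the current state is kept, a loop of this kind can run frame by frame as audio arrives. The detected segment boundaries could then delimit the frames used for energy normalization.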

Type: Grant

Filed: May 4, 2001

Date of Patent: August 24, 2004

Assignee: Lucent Technologies Inc.

Inventors: Chin-Hui Lee, Qi P. Li, Jinsong Zheng, Qiru Zhou

Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis

Patent number: 6701291

Abstract: A method and apparatus for extracting speech features from a speech signal in which the linear frequency spectrum data, as generated, for example, by a conventional frequency transform, is first converted to logarithmic frequency spectrum data having frequency data distributed on a substantially logarithmic (rather than linear) frequency scale. Then, a plurality of digital auditory filters is applied to the resultant logarithmic frequency spectrum data, each of these filters having a substantially similar shape, but centered at different points on the logarithmic frequency scale. Because each of the filters has a similar shape, the feature extraction approach of the present invention advantageously can be easily modified or tuned by adjusting each of the filters in a coordinated manner, with the adjustment of only a handful of filter parameters.
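
A rough sketch of the two steps this abstract describes, under stated assumptions: the input is a magnitude spectrum with evenly spaced bins from 0 Hz to half the sample rate, the log-frequency conversion is done by linear interpolation, and a triangular window serves as the single shared filter shape. The function names, the triangular shape, and all parameters are hypothetical choices for illustration, not the patent's actual filters.

```python
def to_log_axis(spectrum, sample_rate, f_min, n_points):
    """Resample a linear-frequency magnitude spectrum onto log-spaced frequencies.

    Assumes len(spectrum) >= 2, n_points >= 2, and f_min > 0.
    """
    n_bins = len(spectrum)
    f_max = sample_rate / 2.0
    bin_hz = f_max / (n_bins - 1)
    out = []
    for k in range(n_points):
        # Frequencies spaced geometrically between f_min and f_max.
        f = f_min * (f_max / f_min) ** (k / (n_points - 1))
        pos = f / bin_hz
        lo = min(int(pos), n_bins - 2)
        frac = pos - lo
        out.append((1 - frac) * spectrum[lo] + frac * spectrum[lo + 1])
    return out


def triangle(half_width):
    """The one shared filter shape: a triangular window normalized to sum to 1."""
    taps = [half_width + 1 - abs(j) for j in range(-half_width, half_width + 1)]
    total = sum(taps)
    return [t / total for t in taps]


def apply_filters(log_spectrum, n_filters, half_width):
    """Apply the same shape at n_filters (>= 2) centers evenly spaced on the log axis."""
    shape = triangle(half_width)
    n = len(log_spectrum)
    features = []
    for m in range(n_filters):
        # Keep every center far enough from the edges for a full window.
        center = half_width + round(m * (n - 1 - 2 * half_width) / (n_filters - 1))
        window = log_spectrum[center - half_width: center + half_width + 1]
        features.append(sum(w * s for w, s in zip(shape, window)))
    return features


log_spec = to_log_axis([2.0] * 65, sample_rate=8000, f_min=100.0, n_points=40)
features = apply_filters(log_spec, n_filters=8, half_width=3)
```

Since every filter reuses the same `triangle` shape, changing `half_width` retunes all filters together, which mirrors the coordinated adjustment the abstract mentions.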

Type: Grant

Filed: April 2, 2001

Date of Patent: March 2, 2004

Assignee: Lucent Technologies Inc.

Inventors: Qi P. Li, Olivier Siohan, Frank Kao-Ping Soong

Methods and apparatus for fast and robust model training for object classification

Publication number: 20030225719

Abstract: Techniques for fast and robust data object classifier training are described. A process of classifier training creates a set of Gaussian mixture models, one model for each class to which data objects are to be assigned. Initial estimates of model parameters are made using training data. The model parameters are then optimized to maximize an aggregate a posteriori probability that data objects in the set of training data will be correctly classified. Optimization of parameters for each model is performed over a number of iterations. In each iteration, closed-form solutions are computed for the model parameters, the model performance is tested to determine whether the newly computed parameters improve it, and the model is updated with the newly computed parameters only if performance has improved. At each new iteration, the parameters computed in the previous iteration are used as initial estimates.

Type: Application

Filed: May 31, 2002

Publication date: December 4, 2003

Applicant: Lucent Technologies, Inc.

Inventors: Biing-Hwang Juang, Qi P. Li

Background model design for flexible and portable speaker verification systems

Patent number: 6519563

Abstract: A speaker verification method and apparatus which advantageously minimizes the constraints on the customer and simplifies the system architecture by using a speaker dependent, rather than a speaker independent, background model, thereby obtaining many of the advantages of using a background model in a speaker verification process without many of the disadvantages thereof. In particular, no training data (e.g., speech) from anyone other than the customer is required, no speaker independent models need to be produced, no a priori knowledge of acoustic rules is required, and no multi-lingual phone models, dictionaries, or letter-to-sound rules are needed. Nonetheless, in accordance with an illustrative embodiment of the present invention, the customer is free to select any password phrase in any language.

Type: Grant

Filed: November 22, 1999

Date of Patent: February 11, 2003

Assignee: Lucent Technologies Inc.

Inventors: Chin-Hui Lee, Qi P. Li, Olivier Siohan, Arun Chandrasekaran Surendran

Method and apparatus for interactive language instruction

Publication number: 20030028378

Abstract: A method and apparatus for interactive language instruction is provided that displays text files for processing, provides key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.

Type: Application

Filed: September 9, 1999

Publication date: February 6, 2003

Inventors: Katherine Grace August, Nadine Blackwood, Qi P. Li, Michelle McNerney, Chi-Lin Shih, Arun Chandrasekaran Surendran, Jialin Zhong, Qiru Zhou

Method and apparatus for performing real-time endpoint detection in automatic speech recognition

Publication number: 20020184017

Abstract: A method and apparatus for performing real-time endpoint detection for use in automatic speech recognition. A filter is applied to the input speech signal and the filter output is then evaluated with use of a state transition diagram (i.e., a finite state machine). The filter is advantageously designed in light of several criteria in order to increase the accuracy and robustness of detection. The state transition diagram advantageously has three states. The endpoints which are detected may then be advantageously applied to the problem of energy normalization of the speech portion of the signal.

Type: Application

Filed: May 4, 2001

Publication date: December 5, 2002

Inventors: Chin-Hui Lee, Qi P. Li, Jinsong Zheng, Qiru Zhou

Easily tunable auditory-based speech signal feature extraction method and apparatus for use in automatic speech recognition

Publication number: 20020062211

Abstract: A method and apparatus for extracting speech features from a speech signal in which the linear frequency spectrum data, as generated, for example, by a conventional frequency transform, is first converted to logarithmic frequency spectrum data having frequency data distributed on a substantially logarithmic (rather than linear) frequency scale. Then, a plurality of digital auditory filters is applied to the resultant logarithmic frequency spectrum data, each of these filters having a substantially similar shape, but centered at different points on the logarithmic frequency scale. Because each of the filters has a similar shape, the feature extraction approach of the present invention advantageously can be easily modified or tuned by adjusting each of the filters in a coordinated manner, with the adjustment of only a handful of filter parameters.

Type: Application

Filed: April 2, 2001

Publication date: May 23, 2002

Inventors: Qi P. Li, Olivier Siohan, Frank Kao-Ping Soong

Method for performing stochastic matching for use in speaker verification

Patent number: 5995927

Abstract: A method and an apparatus for performing stochastic matching of a set of input test speech data with a corresponding set of training speech data. In particular, a set of input test speech feature information, having been generated from an input test speech utterance, is transformed so that the stochastic characteristics thereof more closely match the stochastic characteristics of a corresponding set of training speech feature information. The corresponding set of training speech data may, for example, comprise training data which was generated from a speaker having the claimed identity of the speaker of the input test speech utterance. Specifically, in accordance with the present invention, a first covariance matrix representative of stochastic characteristics of input test speech feature information is generated based on the input test speech feature information.
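
As a small illustration of the first step named in this abstract, the sketch below computes a covariance matrix from a set of feature vectors. It assumes each frame of test speech has already been reduced to a fixed-length feature vector; the function name and the choice to divide by N rather than N - 1 are assumptions made here. The abstract does not spell out the transform that follows, so it is not shown.

```python
def covariance_matrix(frames):
    """Covariance (dividing by N) of a list of equal-length feature vectors."""
    n = len(frames)
    dim = len(frames[0])
    # Per-dimension mean across all frames.
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    cov = [[0.0] * dim for _ in range(dim)]
    for f in frames:
        centered = [f[d] - mean[d] for d in range(dim)]
        # Accumulate the outer product of the centered vector with itself.
        for i in range(dim):
            for j in range(dim):
                cov[i][j] += centered[i] * centered[j] / n
    return cov


test_cov = covariance_matrix([[1.0, 2.0], [3.0, 6.0]])
```

The same function could be applied to training feature vectors, giving a second matrix against which the test statistics are compared.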

Type: Grant

Filed: March 14, 1997

Date of Patent: November 30, 1999

Assignee: Lucent Technologies Inc.

Inventor: Qi P. Li