Patents by Inventor Chang-Qing Shu

Chang-Qing Shu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Using word confidence score, insertion and substitution thresholds for selected words in speech recognition

Patent number: 9886943

Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.

Type: Grant

Filed: January 13, 2017

Date of Patent: February 6, 2018

Assignee: Adadel Inc.

Inventor: Chang-Qing Shu
PHONETIC DISTANCE MEASUREMENT SYSTEM AND RELATED METHODS

Publication number: 20170256253

Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.

Type: Application

Filed: May 22, 2017

Publication date: September 7, 2017

Applicant: Adacel Systems, Inc.

Inventor: Chang-Qing Shu
Phonetic distance measurement system and related methods

Patent number: 9659559

Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.

Type: Grant

Filed: June 25, 2009

Date of Patent: May 23, 2017

Assignee: ADACEL SYSTEMS, INC.

Inventor: Chang-Qing Shu
USING WORD CONFIDENCE SCORE, INSERTION AND SUBSTITUTION THRESHOLDS FOR SELECTED WORDS IN SPEECH RECOGNITION

Publication number: 20170125011

Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.

Type: Application

Filed: January 13, 2017

Publication date: May 4, 2017

Applicant: Adacel, Inc.

Inventor: Chang-Qing Shu
Using word confidence score, insertion and substitution thresholds for selected words in speech recognition

Patent number: 9583094

Abstract: A method and system for improving the accuracy of a speech recognition system using were confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.

Type: Grant

Filed: September 22, 2016

Date of Patent: February 28, 2017

Assignee: ADACEL, INC.

Inventor: Chang-Qing Shu
USING WORD CONFIDENCE SCORE, INSERTION AND SUBSTITUTION THRESHOLDS FOR SELECTED WORDS IN SPEECH RECOGNITION

Publication number: 20170011737

Abstract: A method and system for improving the accuracy of a speech recognition system using were confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.

Type: Application

Filed: September 22, 2016

Publication date: January 12, 2017

Applicant: Adacel, Inc.

Inventor: Chang-Qing Shu
Using word confidence score, insertion and substitution thresholds for selected words in speech recognition

Patent number: 9478218

Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.

Type: Grant

Filed: October 24, 2008

Date of Patent: October 25, 2016

Assignee: Adacel, Inc.

Inventor: Chang-Qing Shu
INTEGRATED LANGUAGE MODEL, RELATED SYSTEMS AND METHODS

Publication number: 20140163989

Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.

Type: Application

Filed: July 30, 2013

Publication date: June 12, 2014

Applicant: ADACEL SYSTEMS, INC.

Inventors: Chang Qing Shu, Han Shu, John M. Merwin
Method and system for creating natural language understanding grammars

Patent number: 8738384

Abstract: Grammars for interactive voice response systems using natural language understanding can be created using information which is available on websites. These grammars can be created in automated manners and can have various tuning measures applied to obtain optimal results when deployed in a customer contact environment. These grammars can allow a variety of statements to be appropriately handled by the system.

Type: Grant

Filed: November 16, 2012

Date of Patent: May 27, 2014

Assignee: Convergys CMG Utah Inc.

Inventors: Dhananjay Bansal, Nancy Gardner, Chang-Qing Shu, Kristie Goss, Matthew Yuschik, Sunil Issar, Woosung Kim, Jayant M. Naik
System and method for merging audio data streams for use in speech recognition applications

Patent number: 8731923

Abstract: A system and method for merging audio data streams receive audio data streams from separate inputs, independently transform each data stream from the time to the frequency domain, and generate separate feature data sets for the transformed data streams. Feature data from each of the separate feature data sets is selected to form a merged feature data set that is output to a decoder for recognition purposes. The separate inputs can include an ear microphone and a mouth microphone.

Type: Grant

Filed: August 20, 2010

Date of Patent: May 20, 2014

Assignee: Adacel Systems, Inc.

Inventor: Chang-Qing Shu
System and method for automatic microphone volume setting

Patent number: 8559656

Abstract: Optimal microphone volumes are automatically set for computer applications based on determination of peak volume levels and noise levels from one or more digital audio captures. The peak volume levels and noise levels can be advantageously determined based on distribution curves of sample volume levels in the digital audio captures. Clipping can be automatically compensated for by estimating peak unclipped capture volume levels from the distribution curves.

Type: Grant

Filed: July 13, 2010

Date of Patent: October 15, 2013

Assignee: Adacel Systems, Inc.

Inventors: Chang-Qing Shu, Dezhi Liao
Integrated language model, related systems and methods

Patent number: 8515734

Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.

Type: Grant

Filed: February 8, 2010

Date of Patent: August 20, 2013

Assignee: Adacel Systems, Inc.

Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
System and method for training an acoustic model with reduced feature space variation

Patent number: 8301446

Abstract: Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.

Type: Grant

Filed: March 30, 2009

Date of Patent: October 30, 2012

Assignee: Adacel Systems, Inc.

Inventor: Chang-Qing Shu
SYSTEM AND METHOD FOR MERGING AUDIO DATA STREAMS FOR USE IN SPEECH RECOGNITION APPLICATIONS

Publication number: 20120046946

Abstract: A system and method for merging audio data streams receive audio data streams from separate inputs, independently transform each data stream from the time to the frequency domain, and generate separate feature data sets for the transformed data streams. Feature data from each of the separate feature data sets is selected to form a merged feature data set that is output to a decoder for recognition purposes. The separate inputs can include an ear microphone and a mouth microphone.

Type: Application

Filed: August 20, 2010

Publication date: February 23, 2012

Applicant: ADACEL SYSTEMS, INC.

Inventor: Chang-Qing Shu
System and Method for Automatic Microphone Volume Setting

Publication number: 20120014537

Abstract: Optimal microphone volumes are automatically set for computer applications based on determination of peak volume levels and noise levels from one or more digital audio captures. The peak volume levels and noise levels can be advantageously determined based on distribution curves of sample volume levels in the digital audio captures. Clipping can be automatically compensated for by estimating peak unclipped capture volume levels from the distribution curves.

Type: Application

Filed: July 13, 2010

Publication date: January 19, 2012

Applicant: ADACEL SYSTEMS, INC.

Inventors: Chang-Qing Shu, Dezhi Liao
Integrated Language Model, Related Systems and Methods

Publication number: 20110196668

Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.

Type: Application

Filed: February 8, 2010

Publication date: August 11, 2011

Applicant: ADACEL SYSTEMS, INC.

Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
PHONETIC DISTANCE MEASUREMENT SYSTEM AND RELATED METHODS

Publication number: 20100332230

Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.

Type: Application

Filed: June 25, 2009

Publication date: December 30, 2010

Applicant: ADACEL SYSTEMS, INC.

Inventor: Chang-Qing Shu
SYSTEM AND METHOD FOR TRAINING AN ACOUSTIC MODEL WITH REDUCED FEATURE SPACE VARIATION

Publication number: 20100250240

Abstract: Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.

Type: Application

Filed: March 30, 2009

Publication date: September 30, 2010

Applicant: ADACEL SYSTEMS, INC.

Inventor: Chang-Qing Shu
System and Method for Making a User Dependent Language Model

Publication number: 20100145677

Abstract: A language model for a speech recognition engine is made based on user-viewed data files. The data files are reviewed and texts are extracted therefrom. The language model is generated based on the extracted texts. Transcriptions of previous user statements are not required. Different weighting factors can be applied to elements of the extracted texts based on the nature of the data files. The weighting factors are then considered during generation of the language model. A user dependent and application independent language model can be created prior to initial use of the speech recognition engine.

Type: Application

Filed: March 3, 2009

Publication date: June 10, 2010

Applicant: ADACEL SYSTEMS, INC.

Inventor: Chang-Qing Shu
USING WORD CONFIDENCE SCORE, INSERTION AND SUBSTITUTION THRESHOLDS FOR SELECTED WORDS IN SPEECH RECOGNITION

Publication number: 20100106505

Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.

Type: Application

Filed: October 24, 2008

Publication date: April 29, 2010

Applicant: Adacel, Inc.

Inventor: Chang-Qing Shu

1 2 next