Patents by Inventor Chang-Qing Shu
Chang-Qing Shu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9886943Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.Type: GrantFiled: January 13, 2017Date of Patent: February 6, 2018Assignee: Adadel Inc.Inventor: Chang-Qing Shu
-
Publication number: 20170256253Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.Type: ApplicationFiled: May 22, 2017Publication date: September 7, 2017Applicant: Adacel Systems, Inc.Inventor: Chang-Qing Shu
-
Patent number: 9659559Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.Type: GrantFiled: June 25, 2009Date of Patent: May 23, 2017Assignee: ADACEL SYSTEMS, INC.Inventor: Chang-Qing Shu
-
Publication number: 20170125011Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.Type: ApplicationFiled: January 13, 2017Publication date: May 4, 2017Applicant: Adacel, Inc.Inventor: Chang-Qing Shu
-
Patent number: 9583094Abstract: A method and system for improving the accuracy of a speech recognition system using were confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.Type: GrantFiled: September 22, 2016Date of Patent: February 28, 2017Assignee: ADACEL, INC.Inventor: Chang-Qing Shu
-
Publication number: 20170011737Abstract: A method and system for improving the accuracy of a speech recognition system using were confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.Type: ApplicationFiled: September 22, 2016Publication date: January 12, 2017Applicant: Adacel, Inc.Inventor: Chang-Qing Shu
-
Patent number: 9478218Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.Type: GrantFiled: October 24, 2008Date of Patent: October 25, 2016Assignee: Adacel, Inc.Inventor: Chang-Qing Shu
-
Publication number: 20140163989Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.Type: ApplicationFiled: July 30, 2013Publication date: June 12, 2014Applicant: ADACEL SYSTEMS, INC.Inventors: Chang Qing Shu, Han Shu, John M. Merwin
-
Patent number: 8738384Abstract: Grammars for interactive voice response systems using natural language understanding can be created using information which is available on websites. These grammars can be created in automated manners and can have various tuning measures applied to obtain optimal results when deployed in a customer contact environment. These grammars can allow a variety of statements to be appropriately handled by the system.Type: GrantFiled: November 16, 2012Date of Patent: May 27, 2014Assignee: Convergys CMG Utah Inc.Inventors: Dhananjay Bansal, Nancy Gardner, Chang-Qing Shu, Kristie Goss, Matthew Yuschik, Sunil Issar, Woosung Kim, Jayant M. Naik
-
Patent number: 8731923Abstract: A system and method for merging audio data streams receive audio data streams from separate inputs, independently transform each data stream from the time to the frequency domain, and generate separate feature data sets for the transformed data streams. Feature data from each of the separate feature data sets is selected to form a merged feature data set that is output to a decoder for recognition purposes. The separate inputs can include an ear microphone and a mouth microphone.Type: GrantFiled: August 20, 2010Date of Patent: May 20, 2014Assignee: Adacel Systems, Inc.Inventor: Chang-Qing Shu
-
Patent number: 8559656Abstract: Optimal microphone volumes are automatically set for computer applications based on determination of peak volume levels and noise levels from one or more digital audio captures. The peak volume levels and noise levels can be advantageously determined based on distribution curves of sample volume levels in the digital audio captures. Clipping can be automatically compensated for by estimating peak unclipped capture volume levels from the distribution curves.Type: GrantFiled: July 13, 2010Date of Patent: October 15, 2013Assignee: Adacel Systems, Inc.Inventors: Chang-Qing Shu, Dezhi Liao
-
Patent number: 8515734Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.Type: GrantFiled: February 8, 2010Date of Patent: August 20, 2013Assignee: Adacel Systems, Inc.Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
-
Patent number: 8301446Abstract: Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.Type: GrantFiled: March 30, 2009Date of Patent: October 30, 2012Assignee: Adacel Systems, Inc.Inventor: Chang-Qing Shu
-
Publication number: 20120046946Abstract: A system and method for merging audio data streams receive audio data streams from separate inputs, independently transform each data stream from the time to the frequency domain, and generate separate feature data sets for the transformed data streams. Feature data from each of the separate feature data sets is selected to form a merged feature data set that is output to a decoder for recognition purposes. The separate inputs can include an ear microphone and a mouth microphone.Type: ApplicationFiled: August 20, 2010Publication date: February 23, 2012Applicant: ADACEL SYSTEMS, INC.Inventor: Chang-Qing Shu
-
Publication number: 20120014537Abstract: Optimal microphone volumes are automatically set for computer applications based on determination of peak volume levels and noise levels from one or more digital audio captures. The peak volume levels and noise levels can be advantageously determined based on distribution curves of sample volume levels in the digital audio captures. Clipping can be automatically compensated for by estimating peak unclipped capture volume levels from the distribution curves.Type: ApplicationFiled: July 13, 2010Publication date: January 19, 2012Applicant: ADACEL SYSTEMS, INC.Inventors: Chang-Qing Shu, Dezhi Liao
-
Publication number: 20110196668Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.Type: ApplicationFiled: February 8, 2010Publication date: August 11, 2011Applicant: ADACEL SYSTEMS, INC.Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
-
Publication number: 20100332230Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.Type: ApplicationFiled: June 25, 2009Publication date: December 30, 2010Applicant: ADACEL SYSTEMS, INC.Inventor: Chang-Qing Shu
-
Publication number: 20100250240Abstract: Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.Type: ApplicationFiled: March 30, 2009Publication date: September 30, 2010Applicant: ADACEL SYSTEMS, INC.Inventor: Chang-Qing Shu
-
Publication number: 20100145677Abstract: A language model for a speech recognition engine is made based on user-viewed data files. The data files are reviewed and texts are extracted therefrom. The language model is generated based on the extracted texts. Transcriptions of previous user statements are not required. Different weighting factors can be applied to elements of the extracted texts based on the nature of the data files. The weighting factors are then considered during generation of the language model. A user dependent and application independent language model can be created prior to initial use of the speech recognition engine.Type: ApplicationFiled: March 3, 2009Publication date: June 10, 2010Applicant: ADACEL SYSTEMS, INC.Inventor: Chang-Qing Shu
-
Publication number: 20100106505Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.Type: ApplicationFiled: October 24, 2008Publication date: April 29, 2010Applicant: Adacel, Inc.Inventor: Chang-Qing Shu