Patents by Inventor Chang-Qing Shu

Chang-Qing Shu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 9886943
    Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
    Type: Grant
    Filed: January 13, 2017
    Date of Patent: February 6, 2018
    Assignee: Adacel, Inc.
    Inventor: Chang-Qing Shu
  • Publication number: 20170256253
    Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.
    Type: Application
    Filed: May 22, 2017
    Publication date: September 7, 2017
    Applicant: Adacel Systems, Inc.
    Inventor: Chang-Qing Shu
  • Patent number: 9659559
    Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.
    Type: Grant
    Filed: June 25, 2009
    Date of Patent: May 23, 2017
    Assignee: ADACEL SYSTEMS, INC.
    Inventor: Chang-Qing Shu
  • Publication number: 20170125011
    Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
    Type: Application
    Filed: January 13, 2017
    Publication date: May 4, 2017
    Applicant: Adacel, Inc.
    Inventor: Chang-Qing Shu
  • Patent number: 9583094
    Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
    Type: Grant
    Filed: September 22, 2016
    Date of Patent: February 28, 2017
    Assignee: ADACEL, INC.
    Inventor: Chang-Qing Shu
  • Publication number: 20170011737
    Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
    Type: Application
    Filed: September 22, 2016
    Publication date: January 12, 2017
    Applicant: Adacel, Inc.
    Inventor: Chang-Qing Shu
  • Patent number: 9478218
    Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
    Type: Grant
    Filed: October 24, 2008
    Date of Patent: October 25, 2016
    Assignee: Adacel, Inc.
    Inventor: Chang-Qing Shu
  • Publication number: 20140163989
    Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.
    Type: Application
    Filed: July 30, 2013
    Publication date: June 12, 2014
    Applicant: ADACEL SYSTEMS, INC.
    Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
  • Patent number: 8738384
    Abstract: Grammars for interactive voice response systems using natural language understanding can be created using information which is available on websites. These grammars can be created in automated manners and can have various tuning measures applied to obtain optimal results when deployed in a customer contact environment. These grammars can allow a variety of statements to be appropriately handled by the system.
    Type: Grant
    Filed: November 16, 2012
    Date of Patent: May 27, 2014
    Assignee: Convergys CMG Utah Inc.
    Inventors: Dhananjay Bansal, Nancy Gardner, Chang-Qing Shu, Kristie Goss, Matthew Yuschik, Sunil Issar, Woosung Kim, Jayant M. Naik
  • Patent number: 8731923
    Abstract: A system and method for merging audio data streams receive audio data streams from separate inputs, independently transform each data stream from the time to the frequency domain, and generate separate feature data sets for the transformed data streams. Feature data from each of the separate feature data sets is selected to form a merged feature data set that is output to a decoder for recognition purposes. The separate inputs can include an ear microphone and a mouth microphone.
    Type: Grant
    Filed: August 20, 2010
    Date of Patent: May 20, 2014
    Assignee: Adacel Systems, Inc.
    Inventor: Chang-Qing Shu
  • Patent number: 8559656
    Abstract: Optimal microphone volumes are automatically set for computer applications based on determination of peak volume levels and noise levels from one or more digital audio captures. The peak volume levels and noise levels can be advantageously determined based on distribution curves of sample volume levels in the digital audio captures. Clipping can be automatically compensated for by estimating peak unclipped capture volume levels from the distribution curves.
    Type: Grant
    Filed: July 13, 2010
    Date of Patent: October 15, 2013
    Assignee: Adacel Systems, Inc.
    Inventors: Chang-Qing Shu, Dezhi Liao
  • Patent number: 8515734
    Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.
    Type: Grant
    Filed: February 8, 2010
    Date of Patent: August 20, 2013
    Assignee: Adacel Systems, Inc.
    Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
  • Patent number: 8301446
    Abstract: Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.
    Type: Grant
    Filed: March 30, 2009
    Date of Patent: October 30, 2012
    Assignee: Adacel Systems, Inc.
    Inventor: Chang-Qing Shu
  • Publication number: 20120046946
    Abstract: A system and method for merging audio data streams receive audio data streams from separate inputs, independently transform each data stream from the time to the frequency domain, and generate separate feature data sets for the transformed data streams. Feature data from each of the separate feature data sets is selected to form a merged feature data set that is output to a decoder for recognition purposes. The separate inputs can include an ear microphone and a mouth microphone.
    Type: Application
    Filed: August 20, 2010
    Publication date: February 23, 2012
    Applicant: ADACEL SYSTEMS, INC.
    Inventor: Chang-Qing Shu
  • Publication number: 20120014537
    Abstract: Optimal microphone volumes are automatically set for computer applications based on determination of peak volume levels and noise levels from one or more digital audio captures. The peak volume levels and noise levels can be advantageously determined based on distribution curves of sample volume levels in the digital audio captures. Clipping can be automatically compensated for by estimating peak unclipped capture volume levels from the distribution curves.
    Type: Application
    Filed: July 13, 2010
    Publication date: January 19, 2012
    Applicant: ADACEL SYSTEMS, INC.
    Inventors: Chang-Qing Shu, Dezhi Liao
  • Publication number: 20110196668
    Abstract: An integrated language model includes an upper-level language model component and a lower-level language model component, with the upper-level language model component including a non-terminal and the lower-level language model component being applied to the non-terminal. The upper-level and lower-level language model components can be of the same or different language model formats, including finite state grammar (FSG) and statistical language model (SLM) formats. Systems and methods for making integrated language models allow designation of language model formats for the upper-level and lower-level components and identification of non-terminals. Automatic non-terminal replacement and retention criteria can be used to facilitate the generation of one or both language model components, which can include the modification of existing language models.
    Type: Application
    Filed: February 8, 2010
    Publication date: August 11, 2011
    Applicant: ADACEL SYSTEMS, INC.
    Inventors: Chang-Qing Shu, Han Shu, John M. Mervin
  • Publication number: 20100332230
    Abstract: Phonetic distances are empirically measured as a function of speech recognition engine recognition error rates. The error rates are determined by comparing a recognized speech file with a reference file. The phonetic distances can be normalized to earlier measurements. The phonetic distances/error rates can also be used to improve speech recognition engine grammar selection, as an aid in language training and evaluation, and in other applications.
    Type: Application
    Filed: June 25, 2009
    Publication date: December 30, 2010
    Applicant: ADACEL SYSTEMS, INC.
    Inventor: Chang-Qing Shu
  • Publication number: 20100250240
    Abstract: Feature space variation associated with specific text elements is reduced by training an acoustic model with a phoneme set, dictionary and transcription set configured to better distinguish the specific text elements and at least some specific phonemes associated therewith. The specific text elements can include the most frequently occurring text elements from a text data set, which can include text data beyond the transcriptions of a training data set. The specific text elements can be identified using a text element distribution table sorted by occurrence within the text data set. Specific phonemes can be limited to consonant phonemes to improve speed and accuracy.
    Type: Application
    Filed: March 30, 2009
    Publication date: September 30, 2010
    Applicant: ADACEL SYSTEMS, INC.
    Inventor: Chang-Qing Shu
  • Publication number: 20100145677
    Abstract: A language model for a speech recognition engine is made based on user-viewed data files. The data files are reviewed and texts are extracted therefrom. The language model is generated based on the extracted texts. Transcriptions of previous user statements are not required. Different weighting factors can be applied to elements of the extracted texts based on the nature of the data files. The weighting factors are then considered during generation of the language model. A user dependent and application independent language model can be created prior to initial use of the speech recognition engine.
    Type: Application
    Filed: March 3, 2009
    Publication date: June 10, 2010
    Applicant: ADACEL SYSTEMS, INC.
    Inventor: Chang-Qing Shu
  • Publication number: 20100106505
    Abstract: A method and system for improving the accuracy of a speech recognition system using word confidence score (WCS) processing is introduced. Parameters in a decoder are selected to minimize a weighted total error rate, such that deletion errors are weighted more heavily than substitution and insertion errors. The occurrence distribution in WCS is different depending on whether the word was correctly identified and based on the type of error. This is used to determine thresholds in WCS for insertion and substitution errors. By processing the hypothetical word (HYP) (output of the decoder), a mHYP (modified HYP) is determined. In some circumstances, depending on the WCS's value in relation to insertion and substitution threshold values, mHYP is set equal to: null, a substituted HYP, or HYP.
    Type: Application
    Filed: October 24, 2008
    Publication date: April 29, 2010
    Applicant: Adacel, Inc.
    Inventor: Chang-Qing Shu
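
The word confidence score (WCS) abstracts listed above describe two ideas: a weighted total error rate in which deletions count more than substitutions and insertions, and a post-processing step that maps each decoder hypothesis (HYP) to a modified hypothesis (mHYP) that is null, a substituted word, or the unchanged HYP. The following is only an illustrative sketch of how such a step might look. The function names, the weight values, the direction of the threshold comparisons, and the source of the substitute word are all assumptions made for illustration; none of them is taken from the patents.

```python
from typing import Optional


def weighted_error_rate(deletions: int, substitutions: int, insertions: int,
                        n_reference_words: int,
                        w_del: float = 2.0, w_sub: float = 1.0,
                        w_ins: float = 1.0) -> float:
    """Weighted total error rate per reference word.

    The weights are hypothetical; the abstract only says that deletion
    errors are weighted more heavily than the other two error types.
    """
    if n_reference_words <= 0:
        raise ValueError("n_reference_words must be positive")
    weighted = w_del * deletions + w_sub * substitutions + w_ins * insertions
    return weighted / n_reference_words


def modify_hypothesis(hyp: str, wcs: float,
                      insertion_threshold: float,
                      substitution_threshold: float,
                      substitute: Optional[str] = None) -> Optional[str]:
    """Return mHYP for one decoded word.

    Assumed rule (not from the source): a score below the insertion
    threshold marks a likely insertion, so the word is dropped (None);
    a score below the substitution threshold marks a likely substitution,
    so a candidate replacement is used if one is available; otherwise
    the decoder output is kept as-is.
    """
    if wcs < insertion_threshold:
        return None
    if wcs < substitution_threshold and substitute is not None:
        return substitute
    return hyp


if __name__ == "__main__":
    # Hypothetical thresholds and words, chosen only to exercise each branch.
    for word, score, alt in [("left", 0.1, None), ("left", 0.3, "lift"),
                             ("left", 0.9, "lift")]:
        print(word, score, modify_hypothesis(word, score, 0.2, 0.5, alt))
```

In this sketch the thresholds are passed in as plain numbers; the abstracts indicate they would be derived from the WCS occurrence distributions for correct words and for each error type, which is not modeled here.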