Patents by Inventor Yingyi Tan

Yingyi Tan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for a multiclass approach for confidence modeling in automatic speech recognition systems

Patent number: 11195514

Abstract: A system and method are presented for a multiclass approach for confidence modeling in automatic speech recognition systems. A confidence model may be trained offline using supervised learning. A decoding module is utilized within the system that generates features for audio files in audio data. The features are used to generate a hypothesized segment of speech which is compared to a known segment of speech using edit distances. Comparisons are labeled from one of a plurality of output classes. The labels correspond to the degree to which speech is converted to text correctly or not. The trained confidence models can be applied in a variety of systems, including interactive voice response systems, keyword spotters, and open-ended dialog systems.

Type: Grant

Filed: May 17, 2019

Date of Patent: December 7, 2021

Inventors: Ramasubramanian Sundaram, Aravind Ganapathiraju, Yingyi Tan
System and method for synthesis of speech from provided text

Patent number: 10733974

Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

Type: Grant

Filed: January 18, 2018

Date of Patent: August 4, 2020

Inventors: Yingyi Tan, Aravind Ganapathiraju, Felix Immanuel Wyss
SYSTEM AND METHOD FOR A MULTICLASS APPROACH FOR CONFIDENCE MODELING IN AUTOMATIC SPEECH RECOGNITION SYSTEMS

Publication number: 20190355348

Abstract: A system and method are presented for a multiclass approach for confidence modeling in automatic speech recognition systems. A confidence model may be trained offline using supervised learning. A decoding module is utilized within the system that generates features for audio files in audio data. The features are used to generate a hypothesized segment of speech which is compared to a known segment of speech using edit distances. Comparisons are labeled from one of a plurality of output classes. The labels correspond to the degree to which speech is converted to text correctly or not. The trained confidence models can be applied in a variety of systems, including interactive voice response systems, keyword spotters, and open-ended dialog systems.

Type: Application

Filed: May 17, 2019

Publication date: November 21, 2019

Inventors: Ramasubramanian Sundaram, Aravind Ganapathiraju, Yingyi Tan
Method and system for predicting speech recognition performance using accuracy scores

Patent number: 10360898

Abstract: A system and method are presented for predicting speech recognition performance using accuracy scores in speech recognition systems within the speech analytics field. A keyword set is selected. Figure of Merit (FOM) is computed for the keyword set. Relevant features that describe the word individually and in relation to other words in the language are computed. A mapping from these features to FOM is learned. This mapping can be generalized via a suitable machine learning algorithm and be used to predict FOM for a new keyword. In at least embodiment, the predicted FOM may be used to adjust internals of speech recognition engine to achieve a consistent behavior for all inputs for various settings of confidence values.

Type: Grant

Filed: June 5, 2018

Date of Patent: July 23, 2019

Inventors: Aravind Ganapathiraju, Yingyi Tan, Felix Immanuel Wyss, Scott Allen Randal
METHOD AND SYSTEM FOR PREDICTING SPEECH RECOGNITION PERFORMANCE USING ACCURACY SCORES

Publication number: 20180286385

Abstract: A system and method are presented for predicting speech recognition performance using accuracy scores in speech recognition systems within the speech analytics field. A keyword set is selected. Figure of Merit (FOM) is computed for the keyword set. Relevant features that describe the word individually and in relation to other words in the language are computed. A mapping from these features to FOM is learned. This mapping can be generalized via a suitable machine learning algorithm and be used to predict FOM for a new keyword. In at least embodiment, the predicted FOM may be used to adjust internals of speech recognition engine to achieve a consistent behavior for all inputs for various settings of confidence values.

Type: Application

Filed: June 5, 2018

Publication date: October 4, 2018

Inventors: Aravind Ganapathiraju, Yingyi Tan, Felix Immanuel Wyss, Scott Allen Randal
Method and system for predicting speech recognition performance using accuracy scores

Patent number: 10019983

Abstract: A system and method are presented for predicting speech recognition performance using accuracy scores in speech recognition systems within the speech analytics field. A keyword set is selected. Figure of Merit (FOM) is computed for the keyword set. Relevant features that describe the word individually and in relation to other words in the language are computed. A mapping from these features to FOM is learned. This mapping can be generalized via a suitable machine learning algorithm and be used to predict FOM for a new keyword. In at least one embodiment, the predicted FOM may be used to adjust internals of speech recognition engine to achieve a consistent behavior for all inputs for various settings of confidence values.

Type: Grant

Filed: August 30, 2012

Date of Patent: July 10, 2018

Inventors: Aravind Ganapathiraju, Yingyi Tan, Felix Immanuel Wyss, Scott Allen Randal
SYSTEM AND METHOD FOR SYNTHESIS OF SPEECH FROM PROVIDED TEXT

Publication number: 20180144739

Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

Type: Application

Filed: January 18, 2018

Publication date: May 24, 2018

Inventors: Yingyi Tan, Aravind Ganapathiraju, Felix Immanuel Wyss
System and method for synthesis of speech from provided text

Patent number: 9911407

Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

Type: Grant

Filed: January 14, 2015

Date of Patent: March 6, 2018

Assignee: Interactive Intelligence Group, Inc.

Inventors: Yingyi Tan, Aravind Ganapathiraju, Felix Immanuel Wyss
SYSTEM AND METHOD FOR SYNTHESIS OF SPEECH FROM PROVIDED TEXT

Publication number: 20150199956

Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

Type: Application

Filed: January 14, 2015

Publication date: July 16, 2015

Applicant: INTERACTIVE INTELLIGENCE GROUP, INC.

Inventors: Yingyi Tan, Aravind Ganapathiraju, Felix Immanuel Wyss
Method and System for Predicting Speech Recognition Performance Using Accuracy Scores

Publication number: 20140067391

Abstract: A system and method are presented for predicting speech recognition performance using accuracy scores in speech recognition systems within the speech analytics field. A keyword set is selected. Figure of Merit (FOM) is computed for the keyword set. Relevant features that describe the word individually and in relation to other words in the language are computed. A mapping from these features to FOM is learned. This mapping can be generalized via a suitable machine learning algorithm and be used to predict FOM for a new keyword. In at least embodiment, the predicted FOM may be used to adjust internals of speech recognition engine to achieve a consistent behavior for all inputs for various settings of confidence values.

Type: Application

Filed: August 30, 2012

Publication date: March 6, 2014

Applicant: INTERACTIVE INTELLIGENCE, INC.

Inventors: Aravind Ganapathiraju, Yingyi Tan, Felix Immanuel Wyss, Scott Allen Randal