Patents by Inventor Zhenhao Ge

Zhenhao Ge has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20180039888
    Abstract: A method for training a neural network of a neural network based speaker classifier for use in speaker change detection. The method comprises: a) preprocessing input speech data; b) extracting a plurality of feature frames from the preprocessed input speech data; c) normalizing the extracted feature frames of each speaker within the preprocessed input speech data with each speaker's mean and variance; d) concatenating the normalized feature frames to form overlapped longer frames having a frame length and a hop size; e) inputting the overlapped longer frames to the neural network based speaker classifier; and f) training the neural network through forward-backward propagation.
    Type: Application
    Filed: October 6, 2017
    Publication date: February 8, 2018
    Inventors: ZHENHAO GE, ANANTH NAGARAJA IYER, SRINATH CHELUVARAJA, ARAVIND GANAPATHIRAJU
  • Patent number: 9767792
    Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.
    Type: Grant
    Filed: October 12, 2016
    Date of Patent: September 19, 2017
    Assignee: Interactive Intelligence Group, Inc.
    Inventors: Zhenhao Ge, Vivek Tyagi, Aravind Ganapathiraju, Ananth Nagaraja Iyer, Scott Allen Randal, Felix Immanuel Wyss
  • Publication number: 20170032780
    Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.
    Type: Application
    Filed: October 12, 2016
    Publication date: February 2, 2017
    Inventors: Zhenhao Ge, Vivek Tyagi, Aravind Ganapathiraju, Ananth Nagaraja Iyer, Scott Allen Randal, Felix Immanuel Wyss
  • Patent number: 9489943
    Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.
    Type: Grant
    Filed: October 16, 2014
    Date of Patent: November 8, 2016
    Assignee: Interactive Intelligence Group, Inc.
    Inventors: Zhenhao Ge, Vivek Tyagi, Aravind Ganapathiraju, Ananth Nagaraja Iyer, Scott Allen Randal, Felix Immanuel Wyss
  • Publication number: 20150106082
    Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.
    Type: Application
    Filed: October 16, 2014
    Publication date: April 16, 2015
    Inventors: Zhenhao Ge, Vivek Tyagi, Aravind Ganapathiraju, Ananth Nagaraja Iyer, Scott Allen Randal, Felix Immanuel Wyss