Patents by Inventor Sankaran Panchapagesan

Sankaran Panchapagesan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10964315
    Abstract: An approach to wakeword detection uses an explicit representation of non-wakeword speech in the form of subword (e.g., phonetic monophone) units that do not necessarily occur in the wakeword and that broadly represent general speech. These subword units are arranged in a “background” model, which at runtime essentially competes with the wakeword model such that a wakeword is less likely to be declare as occurring when the input matches that background model well. An HMM may be used with the model to locate possible occurrences of the wakeword. Features are determined from portions of the input corresponding to subword units of the wakeword detected using the HMM. A secondary classifier is then used to process the features to yield a decision of whether the wakeword occurred.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: March 30, 2021
    Assignee: Amazon Technologies, Inc.
    Inventors: Minhua Wu, Sankaran Panchapagesan, Ming Sun, Shiv Naga Prasad Vitaladevuni, Bjorn Hoffmeister, Ryan Paul Thomas, Arindam Mandal
  • Patent number: 10847137
    Abstract: An approach to speech recognition, and in particular trigger word detection, implements fixed feature extraction form waveform samples with a neural network (NN). For example, rather than computing Log Frequency Band Energies (LFBEs), a convolutional neural network is used. In some implementations, this NN waveform processing is combined with a trained secondary classification that makes use of phonetic segmentation of a possible trigger word occurrence.
    Type: Grant
    Filed: December 12, 2017
    Date of Patent: November 24, 2020
    Assignee: Amazon Technologies, Inc.
    Inventors: Arindam Mandal, Nikko Strom, Kenichi Kumatani, Sankaran Panchapagesan
  • Patent number: 10304440
    Abstract: An approach to keyword spotting makes use of acoustic parameters that are trained on a keyword spotting task as well as on a second speech recognition task, for example, a large vocabulary continuous speech recognition task. The parameters may be optimized according to a weighted measure that weighs the keyword spotting task more highly than the other task, and that weighs utterances of a keyword more highly than utterances of other speech. In some applications, a keyword spotter configured with the acoustic parameters is used for trigger or wake word detection.
    Type: Grant
    Filed: June 30, 2016
    Date of Patent: May 28, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Sankaran Panchapagesan, Bjorn Hoffmeister, Arindam Mandal, Aparna Khare, Shiv Naga Prasad Vitaladevuni, Spyridon Matsoukas, Ming Sun
  • Patent number: 10229356
    Abstract: Features are disclosed for error tolerant model compression. Such features could be used to reduce the size of a deep neural network model including several hidden node layers. The size reduction in an error tolerant fashion ensures predictive applications relying on the model do not experience performance degradation due to model compression. Such predictive applications include automatic recognition of speech, image recognition, and recommendation engines. Partially quantized models are re-trained such that any degradation of accuracy is “trained out” of the model providing improved error tolerance with compression.
    Type: Grant
    Filed: December 23, 2014
    Date of Patent: March 12, 2019
    Assignee: Amazon Technologies, Inc.
    Inventors: Baiyang Liu, Michael Reese Bastian, Bjorn Hoffmeister, Sankaran Panchapagesan, Ariya Rastrow
  • Patent number: 10147442
    Abstract: A neural network acoustic model is trained to be robust and produce accurate output when used to process speech signals having acoustic interference. The neural network acoustic model can be trained using a source-separation process by which, in addition to producing the main acoustic model output for a given input, the neural network generates predictions of the separate speech and interference portions of the input. The parameters of the neural network can be adjusted to jointly optimize all three outputs (e.g., the main acoustic model output, the speech signal prediction, and the interference signal prediction), rather than only optimizing the main acoustic model output. Once trained, output layers for the speech and interference signal predictions can be removed from the neural network or otherwise disabled.
    Type: Grant
    Filed: September 29, 2015
    Date of Patent: December 4, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Sankaran Panchapagesan, Shiva Kumar Sundaram, Arindam Mandal
  • Patent number: 10009676
    Abstract: A voice controlled medical system with improved speech recognition includes a first microphone array, a second microphone array, a controller in communication with the first and second microphone arrays, and a medical device operable by the controller. The controller includes a beam module that generates a first beamed signal using signals from the first microphone array and a second beamed signal using signals from the second microphone array. The controller also includes a comparison module that compares the first and second beamed signals and determines a correlation between the first and second beamed signals. The controller also includes a voice interpreting module that identifies commands within the first and second beamed signals if the correlation is above a correlation threshold. The controller also includes an instrument control module that executes the commands to operate said medical device.
    Type: Grant
    Filed: November 3, 2014
    Date of Patent: June 26, 2018
    Assignee: Storz Endoskop Produktions GmbH
    Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan
  • Patent number: 9865256
    Abstract: A voice controlled medical system with improved speech recognition includes a microphone in an operating environment and a medical device. The voice controlled medical system further includes a controller in communication with the microphone and the medical device that operates the medical device by executing audio commands received by the microphone. The voice controlled medical system further includes a calibration signal indicative of distortion in the operating environment which is generated by comparing an audio signal played in the operating environment with a sound signal received by the microphone. The controller uses the calibration signal to interpret audio commands received by the microphone.
    Type: Grant
    Filed: February 27, 2015
    Date of Patent: January 9, 2018
    Assignee: Storz Endoskop Produktions GmbH
    Inventors: Sankaran Panchapagesan, Matteo Contolini, Ted Applebaum
  • Patent number: 9549717
    Abstract: A voice controlled surgical system including a wireless command microphone receiving audio input, a voice control module for generating commands from the audio input received by said wireless command microphone, a detection module for generating signals indicative of a proximity of said wireless command microphone, a switch module for disabling the commands in response to one or more of the signals, and an alarm module activated in response to the one or more of the signals.
    Type: Grant
    Filed: September 16, 2009
    Date of Patent: January 24, 2017
    Assignee: Storz Endoskop Produktions GmbH
    Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan
  • Publication number: 20160253994
    Abstract: A voice controlled medical system with improved speech recognition includes a microphone in an operating environment and a medical device. The voice controlled medical system further includes a controller in communication with the microphone and the medical device that operates the medical device by executing audio commands received by the microphone. The voice controlled medical system further includes a calibration signal indicative of distortion in the operating environment which is generated by comparing an audio signal played in the operating environment with a sound signal received by the microphone. The controller uses the calibration signal to interpret audio commands received by the microphone.
    Type: Application
    Filed: February 27, 2015
    Publication date: September 1, 2016
    Inventors: Sankaran Panchapagesan, Matteo Contolini, Ted Applebaum
  • Publication number: 20160125882
    Abstract: A voice controlled medical system with improved speech recognition includes a first microphone array, a second microphone array, a controller in communication with the first and second microphone arrays, and a medical device operable by the controller. The controller includes a beam module that generates a first beamed signal using signals from the first microphone array and a second beamed signal using signals from the second microphone array. The controller also includes a comparison module that compares the first and second beamed signals and determines a correlation between the first and second beamed signals. The controller also includes a voice interpreting module that identifies commands within the first and second beamed signals if the correlation is above a correlation threshold. The controller also includes an instrument control module that executes the commands to operate said medical device.
    Type: Application
    Filed: November 3, 2014
    Publication date: May 5, 2016
    Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan
  • Publication number: 20110063429
    Abstract: A voice controlled surgical system including a wireless command microphone receiving audio input, a voice control module for generating commands from the audio input received by said wireless command microphone, a detection module for generating signals indicative of a proximity of said wireless command microphone, a switch module for disabling the commands in response to one or more of the signals, and an alarm module activated in response to the one or more of the signals.
    Type: Application
    Filed: September 16, 2009
    Publication date: March 17, 2011
    Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan