Patents by Inventor Sankaran Panchapagesan
Sankaran Panchapagesan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10964315
Abstract: An approach to wakeword detection uses an explicit representation of non-wakeword speech in the form of subword (e.g., phonetic monophone) units that do not necessarily occur in the wakeword and that broadly represent general speech. These subword units are arranged in a “background” model, which at runtime essentially competes with the wakeword model such that a wakeword is less likely to be declared as occurring when the input matches that background model well. An HMM may be used with the model to locate possible occurrences of the wakeword. Features are determined from portions of the input corresponding to subword units of the wakeword detected using the HMM. A secondary classifier is then used to process the features to yield a decision of whether the wakeword occurred.
Type: Grant
Filed: June 30, 2017
Date of Patent: March 30, 2021
Assignee: Amazon Technologies, Inc.
Inventors: Minhua Wu, Sankaran Panchapagesan, Ming Sun, Shiv Naga Prasad Vitaladevuni, Bjorn Hoffmeister, Ryan Paul Thomas, Arindam Mandal
-
Patent number: 10847137
Abstract: An approach to speech recognition, and in particular trigger word detection, implements fixed feature extraction from waveform samples with a neural network (NN). For example, rather than computing Log Frequency Band Energies (LFBEs), a convolutional neural network is used. In some implementations, this NN waveform processing is combined with a trained secondary classification that makes use of phonetic segmentation of a possible trigger word occurrence.
Type: Grant
Filed: December 12, 2017
Date of Patent: November 24, 2020
Assignee: Amazon Technologies, Inc.
Inventors: Arindam Mandal, Nikko Strom, Kenichi Kumatani, Sankaran Panchapagesan
-
Patent number: 10304440
Abstract: An approach to keyword spotting makes use of acoustic parameters that are trained on a keyword spotting task as well as on a second speech recognition task, for example, a large vocabulary continuous speech recognition task. The parameters may be optimized according to a weighted measure that weighs the keyword spotting task more highly than the other task, and that weighs utterances of a keyword more highly than utterances of other speech. In some applications, a keyword spotter configured with the acoustic parameters is used for trigger or wake word detection.
Type: Grant
Filed: June 30, 2016
Date of Patent: May 28, 2019
Assignee: Amazon Technologies, Inc.
Inventors: Sankaran Panchapagesan, Bjorn Hoffmeister, Arindam Mandal, Aparna Khare, Shiv Naga Prasad Vitaladevuni, Spyridon Matsoukas, Ming Sun
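The weighted measure described in this abstract can be illustrated with a small sketch. This is not the patented method: the function name, the per-utterance loss lists, and the default weights (`task_weight`, `keyword_weight`) are hypothetical choices made only to show how a keyword spotting term might be weighted above a second-task term, and keyword utterances above other speech.

```python
def weighted_multitask_loss(kws_losses, is_keyword, lvcsr_losses,
                            task_weight=0.75, keyword_weight=3.0):
    """Combine per-utterance losses from two tasks into one scalar.

    kws_losses   : per-utterance losses on the keyword spotting task
    is_keyword   : booleans, True where the utterance contains the keyword
    lvcsr_losses : per-utterance losses on the second (e.g. LVCSR) task
    task_weight  : weight of the keyword spotting term (assumed > 0.5)
    keyword_weight : extra weight for keyword utterances (assumed > 1.0)
    """
    if len(kws_losses) != len(is_keyword):
        raise ValueError("kws_losses and is_keyword must have equal length")
    if not kws_losses or not lvcsr_losses:
        raise ValueError("both tasks need at least one utterance")

    # Keyword utterances count more than other speech within the KWS term.
    weights = [keyword_weight if kw else 1.0 for kw in is_keyword]
    kws_term = sum(w * l for w, l in zip(weights, kws_losses)) / sum(weights)

    # The second task is a plain average over its utterances.
    lvcsr_term = sum(lvcsr_losses) / len(lvcsr_losses)

    # The keyword spotting task counts more than the second task.
    return task_weight * kws_term + (1.0 - task_weight) * lvcsr_term


loss = weighted_multitask_loss(
    kws_losses=[1.0, 3.0],
    is_keyword=[True, False],
    lvcsr_losses=[2.0, 4.0],
)
print(loss)
```

In a real training setup the combined value would be minimized with respect to the shared acoustic parameters; the sketch only shows how the two levels of weighting compose.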
-
Patent number: 10229356
Abstract: Features are disclosed for error tolerant model compression. Such features could be used to reduce the size of a deep neural network model including several hidden node layers. The size reduction in an error tolerant fashion ensures predictive applications relying on the model do not experience performance degradation due to model compression. Such predictive applications include automatic recognition of speech, image recognition, and recommendation engines. Partially quantized models are re-trained such that any degradation of accuracy is “trained out” of the model providing improved error tolerance with compression.
Type: Grant
Filed: December 23, 2014
Date of Patent: March 12, 2019
Assignee: Amazon Technologies, Inc.
Inventors: Baiyang Liu, Michael Reese Bastian, Bjorn Hoffmeister, Sankaran Panchapagesan, Ariya Rastrow
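As a rough illustration of the quantization step only, the sketch below snaps one layer's weights to a small set of evenly spaced levels. The abstract does not say which quantization scheme is used, so uniform quantization, the function name `quantize_uniform`, and the `num_levels` parameter are all assumptions. The re-training of the remaining unquantized layers, which is what the abstract credits for recovering accuracy, is not shown.

```python
def quantize_uniform(weights, num_levels=256):
    """Snap each weight to the nearest of num_levels evenly spaced values.

    The levels span the range [min(weights), max(weights)]. This is an
    assumed scheme for illustration, not the scheme from the patent.
    """
    if num_levels < 2:
        raise ValueError("num_levels must be at least 2")
    lo, hi = min(weights), max(weights)
    if hi == lo:
        # All weights are identical, so there is nothing to quantize.
        return list(weights)
    step = (hi - lo) / (num_levels - 1)
    return [lo + round((w - lo) / step) * step for w in weights]


layer_weights = [0.0, 0.4, 1.0]
quantized = quantize_uniform(layer_weights, num_levels=3)
print(quantized)
```

Under the approach the abstract outlines, a layer quantized this way would be held fixed while later layers continue training, so that they adapt to the quantization error.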
-
Patent number: 10147442
Abstract: A neural network acoustic model is trained to be robust and produce accurate output when used to process speech signals having acoustic interference. The neural network acoustic model can be trained using a source-separation process by which, in addition to producing the main acoustic model output for a given input, the neural network generates predictions of the separate speech and interference portions of the input. The parameters of the neural network can be adjusted to jointly optimize all three outputs (e.g., the main acoustic model output, the speech signal prediction, and the interference signal prediction), rather than only optimizing the main acoustic model output. Once trained, output layers for the speech and interference signal predictions can be removed from the neural network or otherwise disabled.
Type: Grant
Filed: September 29, 2015
Date of Patent: December 4, 2018
Assignee: Amazon Technologies, Inc.
Inventors: Sankaran Panchapagesan, Shiva Kumar Sundaram, Arindam Mandal
-
Patent number: 10009676
Abstract: A voice controlled medical system with improved speech recognition includes a first microphone array, a second microphone array, a controller in communication with the first and second microphone arrays, and a medical device operable by the controller. The controller includes a beam module that generates a first beamed signal using signals from the first microphone array and a second beamed signal using signals from the second microphone array. The controller also includes a comparison module that compares the first and second beamed signals and determines a correlation between the first and second beamed signals. The controller also includes a voice interpreting module that identifies commands within the first and second beamed signals if the correlation is above a correlation threshold. The controller also includes an instrument control module that executes the commands to operate said medical device.
Type: Grant
Filed: November 3, 2014
Date of Patent: June 26, 2018
Assignee: Storz Endoskop Produktions GmbH
Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan
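The correlation gate in this abstract can be sketched as follows. The abstract does not specify how the correlation is computed, so the use of the Pearson coefficient, the function names, and the 0.8 default threshold are assumptions chosen for illustration. Beamforming and command recognition are outside the sketch; the two inputs stand in for already-beamed signals.

```python
import math


def pearson_correlation(a, b):
    """Pearson correlation coefficient of two equal-length sample sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("signals must be non-empty and of equal length")
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        # A constant signal carries no shape to compare; treat as uncorrelated.
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def should_interpret_commands(beam_1, beam_2, threshold=0.8):
    """Return True only when the two beamed signals agree closely enough."""
    return pearson_correlation(beam_1, beam_2) > threshold


speech = [0.1, 0.5, -0.3, 0.8]
inverted = [-s for s in speech]
print(should_interpret_commands(speech, speech))
print(should_interpret_commands(speech, inverted))
```

Gating on agreement between the two arrays means audio picked up strongly by only one array would not be passed on for command interpretation.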
-
Patent number: 9865256
Abstract: A voice controlled medical system with improved speech recognition includes a microphone in an operating environment and a medical device. The voice controlled medical system further includes a controller in communication with the microphone and the medical device that operates the medical device by executing audio commands received by the microphone. The voice controlled medical system further includes a calibration signal indicative of distortion in the operating environment which is generated by comparing an audio signal played in the operating environment with a sound signal received by the microphone. The controller uses the calibration signal to interpret audio commands received by the microphone.
Type: Grant
Filed: February 27, 2015
Date of Patent: January 9, 2018
Assignee: Storz Endoskop Produktions GmbH
Inventors: Sankaran Panchapagesan, Matteo Contolini, Ted Applebaum
-
Patent number: 9549717
Abstract: A voice controlled surgical system including a wireless command microphone receiving audio input, a voice control module for generating commands from the audio input received by said wireless command microphone, a detection module for generating signals indicative of a proximity of said wireless command microphone, a switch module for disabling the commands in response to one or more of the signals, and an alarm module activated in response to the one or more of the signals.
Type: Grant
Filed: September 16, 2009
Date of Patent: January 24, 2017
Assignee: Storz Endoskop Produktions GmbH
Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan
-
Publication number: 20160253994
Abstract: A voice controlled medical system with improved speech recognition includes a microphone in an operating environment and a medical device. The voice controlled medical system further includes a controller in communication with the microphone and the medical device that operates the medical device by executing audio commands received by the microphone. The voice controlled medical system further includes a calibration signal indicative of distortion in the operating environment which is generated by comparing an audio signal played in the operating environment with a sound signal received by the microphone. The controller uses the calibration signal to interpret audio commands received by the microphone.
Type: Application
Filed: February 27, 2015
Publication date: September 1, 2016
Inventors: Sankaran Panchapagesan, Matteo Contolini, Ted Applebaum
-
Publication number: 20160125882
Abstract: A voice controlled medical system with improved speech recognition includes a first microphone array, a second microphone array, a controller in communication with the first and second microphone arrays, and a medical device operable by the controller. The controller includes a beam module that generates a first beamed signal using signals from the first microphone array and a second beamed signal using signals from the second microphone array. The controller also includes a comparison module that compares the first and second beamed signals and determines a correlation between the first and second beamed signals. The controller also includes a voice interpreting module that identifies commands within the first and second beamed signals if the correlation is above a correlation threshold. The controller also includes an instrument control module that executes the commands to operate said medical device.
Type: Application
Filed: November 3, 2014
Publication date: May 5, 2016
Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan
-
Publication number: 20110063429
Abstract: A voice controlled surgical system including a wireless command microphone receiving audio input, a voice control module for generating commands from the audio input received by said wireless command microphone, a detection module for generating signals indicative of a proximity of said wireless command microphone, a switch module for disabling the commands in response to one or more of the signals, and an alarm module activated in response to the one or more of the signals.
Type: Application
Filed: September 16, 2009
Publication date: March 17, 2011
Inventors: Matteo Contolini, Ted Applebaum, Sankaran Panchapagesan