Patents by Inventor Joel Pinto

Joel Pinto has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods and apparatus for hybrid speech recognition processing

Patent number: 11990135

Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
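
The on-device decision described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the abstract names only the three kinds of evidence (the input audio, the recognized text, a semantic category), so the `EmbeddedResult` fields, the signal-to-noise measure, the category names, and both thresholds below are hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class EmbeddedResult:
    """Output of the on-device recognizer (all fields are hypothetical)."""
    text: str               # recognized text
    semantic_category: str  # label attached to the recognized text
    snr_db: float           # assumed audio-quality measure for the input audio


# Assumed set of categories the device can handle without the server.
ON_DEVICE_CATEGORIES = frozenset({"call_contact", "play_music"})


def should_send_to_server(result: EmbeddedResult,
                          min_snr_db: float = 10.0,
                          max_local_words: int = 6) -> bool:
    """Return True when the controller should defer to the remote server."""
    # Analysis of the input audio: noisy audio goes to the server (assumption).
    if result.snr_db < min_snr_db:
        return True
    # Analysis of the recognized text: empty or long results go to the server.
    words = result.text.split()
    if not words or len(words) > max_local_words:
        return True
    # Semantic category: only a few categories are handled on the device.
    return result.semantic_category not in ON_DEVICE_CATEGORIES
```

For example, `should_send_to_server(EmbeddedResult("call mom", "call_contact", 20.0))` keeps processing on the device, while the same text at 3 dB is sent to the server.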

Type: Grant

Filed: February 9, 2021

Date of Patent: May 21, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III

METHODS AND APPARATUS FOR HYBRID SPEECH RECOGNITION PROCESSING

Publication number: 20210166699

Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.

Type: Application

Filed: February 9, 2021

Publication date: June 3, 2021

Applicant: Nuance Communications, Inc.

Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III

Methods and apparatus for hybrid speech recognition processing

Patent number: 10971157

Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.

Type: Grant

Filed: January 11, 2017

Date of Patent: April 6, 2021

Assignee: Nuance Communications, Inc.

Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III

Method for scoring in an automatic speech recognition system

Patent number: 10650805

Abstract: A system and method for speech recognition is provided. Embodiments may include receiving an audio signal at a first deep neural network (“DNN”) associated with a computing device. Embodiments may further include receiving the audio signal at a second deep neural network (“DNN”) associated with a computing device, wherein the second deep neural network includes fewer parameters than the first deep neural network. Embodiments may also include determining whether to select an output from the first deep neural network or the second deep neural network and providing the selected output to a decoder with an overall objective of speeding up ASR.
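
One way to picture the selection step in this abstract is the sketch below. The abstract does not say how the choice between the two networks is made, so the rule used here (keep the smaller network's output when its top posterior is confident enough, otherwise fall back to the larger network) is an assumption, as are the names `select_posteriors`, `small_dnn`, `large_dnn`, and the threshold value. The two "networks" are stand-in functions, not trained models.

```python
from typing import Callable, List, Sequence, Tuple

Posterior = List[float]
Scorer = Callable[[Sequence[float]], Posterior]


def select_posteriors(frame: Sequence[float],
                      large_dnn: Scorer,
                      small_dnn: Scorer,
                      threshold: float = 0.9) -> Tuple[Posterior, str]:
    """Pick which network's output is handed to the decoder for one frame."""
    small_out = small_dnn(frame)
    # Assumed criterion: trust the cheap network when it is confident.
    if max(small_out) >= threshold:
        return small_out, "small"
    # Otherwise pay for the larger network's scores.
    return large_dnn(frame), "large"


def small_dnn(frame: Sequence[float]) -> Posterior:
    """Stand-in for the network with fewer parameters."""
    return [0.95, 0.05] if frame[0] > 0 else [0.55, 0.45]


def large_dnn(frame: Sequence[float]) -> Posterior:
    """Stand-in for the larger network."""
    return [0.2, 0.8]
```

Under this rule the larger network runs only on frames where the smaller one is uncertain, which is one plausible route to the speed-up the abstract mentions.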

Type: Grant

Filed: September 11, 2014

Date of Patent: May 12, 2020

Assignee: Nuance Communications, Inc.

Inventors: Joel Pinto, Daniel Willett, Christian Plahl

Method for training an automatic speech recognition system

Patent number: 10049658

Abstract: A system and method for speech recognition is provided. Embodiments may include receiving, at a first computing device, a far-talk signal from a far-talk computing device, the far-talk signal transmitted using a first channel and corresponding to an audible sound. Embodiments may further include receiving, at the first computing device, a near-talk signal from a near-talk computing device, the near-talk signal transmitted using a second channel and corresponding to the audible sound, wherein the far-talk signal and the near-talk signal are received during an enrollment phase of a far-talk speech recognition system. Embodiments may also include updating, at the first computing device, one or more models associated with a far-talk speech recognition system based upon, at least in part, one or more characteristics of the far-talk signal and one or more characteristics of the near-talk signal.
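
As a rough illustration of updating a model from paired far-talk and near-talk recordings of the same sound, the sketch below estimates a per-dimension feature offset between the two channels and blends it into a stored offset. The abstract does not specify which characteristics are compared or how models are updated; the mean-difference statistic, the interpolation `rate`, and the function names are all hypothetical.

```python
from typing import List, Sequence

Frame = Sequence[float]


def mean_vector(frames: Sequence[Frame]) -> List[float]:
    """Per-dimension mean over a non-empty list of equal-length frames."""
    count = len(frames)
    dim = len(frames[0])
    return [sum(frame[d] for frame in frames) / count for d in range(dim)]


def update_channel_offset(current_offset: Sequence[float],
                          far_frames: Sequence[Frame],
                          near_frames: Sequence[Frame],
                          rate: float = 0.5) -> List[float]:
    """Blend the observed far-minus-near feature offset into the stored one."""
    far_mean = mean_vector(far_frames)
    near_mean = mean_vector(near_frames)
    observed = [f - n for f, n in zip(far_mean, near_mean)]
    # Interpolate so a single enrollment utterance cannot overwrite the model.
    return [(1.0 - rate) * c + rate * o
            for c, o in zip(current_offset, observed)]
```

A far-talk recognizer could then subtract the stored offset from incoming features, although that use is also an assumption rather than something stated in the abstract.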

Type: Grant

Filed: March 7, 2013

Date of Patent: August 14, 2018

Assignee: Nuance Communications, Inc.

Inventors: Joel Pinto, Josef Damianus Anastasiadis, Daniel Willett

METHODS AND APPARATUS FOR HYBRID SPEECH RECOGNITION PROCESSING

Publication number: 20180197545

Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.

Type: Application

Filed: January 11, 2017

Publication date: July 12, 2018

Applicant: Nuance Communications, Inc.

Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III

Meta-data inputs to front end processing for automatic speech recognition

Patent number: 9953638

Abstract: A computer-implemented method is described for front end speech processing for automatic speech recognition. A sequence of speech features which characterize an unknown speech input provided on an audio input channel and associated meta-data which characterize the audio input channel are received. The speech features are transformed with a computer process that uses a trained mapping function controlled by the meta-data, and automatic speech recognition is performed of the transformed speech features.
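
The idea of a mapping function controlled by channel meta-data can be sketched as a lookup that picks a feature transform per channel type. This is only an illustration under stated assumptions: the abstract does not describe the form of the mapping, so the affine (scale, offset) transform, the channel labels, and the hand-set values in `TRANSFORMS` are hypothetical placeholders for parameters that would be learned in training.

```python
from typing import Dict, List, Sequence, Tuple

# Hypothetical per-channel parameters; a trained system would learn these.
TRANSFORMS: Dict[str, Tuple[float, float]] = {
    "handset": (1.0, 0.0),
    "car_mic": (0.5, 1.0),
}


def transform_features(features: Sequence[Sequence[float]],
                       channel_type: str) -> List[List[float]]:
    """Apply the transform selected by the channel meta-data to every frame."""
    # Unknown channel types fall back to the identity transform (assumption).
    scale, offset = TRANSFORMS.get(channel_type, (1.0, 0.0))
    return [[scale * value + offset for value in frame] for frame in features]
```

The transformed features would then be passed to the recognizer in place of the raw ones.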

Type: Grant

Filed: June 28, 2012

Date of Patent: April 24, 2018

Assignee: Nuance Communications, Inc.

Inventors: Daniel Willett, Karl Jonas Lööf, Yue Pan, Joel Pinto, Christian Gollan

METHOD FOR SCORING IN AN AUTOMATIC SPEECH RECOGNITION SYSTEM

Publication number: 20170294186

Abstract: A system and method for speech recognition is provided. Embodiments may include receiving an audio signal at a first deep neural network (“DNN”) associated with a computing device. Embodiments may further include receiving the audio signal at a second deep neural network (“DNN”) associated with a computing device, wherein the second deep neural network includes fewer parameters than the first deep neural network. Embodiments may also include determining whether to select an output from the first deep neural network or the second deep neural network and providing the selected output to a decoder with an overall objective of speeding up ASR.

Type: Application

Filed: September 11, 2014

Publication date: October 12, 2017

Inventors: Joel Pinto, Daniel Willett, Christian Plahl

METHOD FOR TRAINING AN AUTOMATIC SPEECH RECOGNITION SYSTEM

Publication number: 20160027435

Abstract: A system and method for speech recognition is provided. Embodiments may include receiving, at a first computing device, a far-talk signal from a far-talk computing device, the far-talk signal transmitted using a first channel and corresponding to an audible sound. Embodiments may further include receiving, at the first computing device, a near-talk signal from a near-talk computing device, the near-talk signal transmitted using a second channel and corresponding to the audible sound, wherein the far-talk signal and the near-talk signal are received during an enrollment phase of a far-talk speech recognition system. Embodiments may also include updating, at the first computing device, one or more models associated with a far-talk speech recognition system based upon, at least in part, one or more characteristics of the far-talk signal and one or more characteristics of the near-talk signal.

Type: Application

Filed: March 7, 2013

Publication date: January 28, 2016

Inventors: Joel Pinto, Josef Damianus Anastasiadis, Daniel Willett

META-DATA INPUTS TO FRONT END PROCESSING FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20150262575

Abstract: A computer-implemented method is described for front end speech processing for automatic speech recognition. A sequence of speech features which characterize an unknown speech input provided on an audio input channel and associated meta-data which characterize the audio input channel are received. The speech features are transformed with a computer process that uses a trained mapping function controlled by the meta-data, and automatic speech recognition is performed of the transformed speech features.

Type: Application

Filed: June 28, 2012

Publication date: September 17, 2015

Applicant: Nuance Communications, Inc.

Inventors: Daniel Willett, Karl Jonas Lööf, Yue Pan, Joel Pinto, Christian Gollan