Patents by Inventor Joel Pinto

Joel Pinto has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11990135
    Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
    Type: Grant
    Filed: February 9, 2021
    Date of Patent: May 21, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III
  • Publication number: 20210166699
    Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
    Type: Application
    Filed: February 9, 2021
    Publication date: June 3, 2021
    Applicant: Nuance Communications, Inc
    Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III
  • Patent number: 10971157
    Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
    Type: Grant
    Filed: January 11, 2017
    Date of Patent: April 6, 2021
    Assignee: Nuance Communications, Inc.
    Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III
  • Patent number: 10650805
    Abstract: A system and method for speech recognition is provided. Embodiments may include receiving an audio signal at a first deep neural network (“DNN”) associated with a computing device. Embodiments may further include receiving the audio signal at a second deep neural network (“DNN”) associated with a computing device, wherein the second deep neural network includes fewer parameters than the first deep neural network. Embodiments may also include determining whether to select an output from the first deep neural network or the second deep neural network and providing the selected output to a decoder with an overall objective of speeding up ASR.
    Type: Grant
    Filed: September 11, 2014
    Date of Patent: May 12, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Joel Pinto, Daniel Willett, Christian Plahl
  • Patent number: 10049658
    Abstract: A system and method for speech recognition is provided. Embodiments may include receiving, at a first computing device, a far-talk signal from a far-talk computing device, the far-talk signal transmitted using a first channel and corresponding to an audible sound. Embodiments may further include receiving, at the first computing device, a near-talk signal from a near-talk computing device, the near-talk signal transmitted using a second channel and corresponding to the audible sound, wherein the far-talk signal and the near-talk signal are received during an enrollment phase of a far-talk speech recognition system. Embodiments may also include updating, at the first computing device, one or more models associated with a far-talk speech recognition system based upon, at least in part, one or more characteristics of the far-talk signal and one or more characteristics of the near-talk signal.
    Type: Grant
    Filed: March 7, 2013
    Date of Patent: August 14, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Joel Pinto, Josef Damianus Anastasiadis, Daniel Willett
  • Publication number: 20180197545
    Abstract: Methods and apparatus for selectively performing speech processing in a hybrid speech processing system. The hybrid speech processing system includes at least one mobile electronic device and a network-connected server remotely located from the at least one mobile electronic device. The mobile electronic device is configured to use an embedded speech recognizer to process at least a portion of input audio to produce recognized text. A controller on the mobile electronic device determines whether to send information from the mobile electronic device to the server for speech processing. The determination of whether to send the information is based, at least in part, on an analysis of the input audio, the recognized text, or a semantic category associated with the recognized text.
    Type: Application
    Filed: January 11, 2017
    Publication date: July 12, 2018
    Applicant: Nuance Communications, Inc.
    Inventors: Daniel Willett, Joel Pinto, William F. Ganong, III
  • Patent number: 9953638
    Abstract: A computer-implemented method is described for front end speech processing for automatic speech recognition. A sequence of speech features which characterize an unknown speech input provided on an audio input channel and associated meta-data which characterize the audio input channel are received. The speech features are transformed with a computer process that uses a trained mapping function controlled by the meta-data, and automatic speech recognition is performed of the transformed speech features.
    Type: Grant
    Filed: June 28, 2012
    Date of Patent: April 24, 2018
    Assignee: Nuance Communications, Inc.
    Inventors: Daniel Willett, Karl Jonas Lööf, Yue Pan, Joel Pinto, Christian Gollan
  • Publication number: 20170294186
    Abstract: A system and method for speech recognition is provided. Embodiments may include receiving an audio signal at a first deep neural network (“DNN”) associated with a computing device. Embodiments may further include receiving the audio signal at a second deep neural network (“DNN”) associated with a computing device, wherein the second deep neural network includes fewer parameters than the first deep neural network. Embodiments may also include determining whether to select an output from the first deep neural network or the second deep neural network and providing the selected output to a decoder with an overall objective of speeding up ASR.
    Type: Application
    Filed: September 11, 2014
    Publication date: October 12, 2017
    Inventors: Joel Pinto, Daniel Willett, Christian Plahl
  • Publication number: 20160027435
    Abstract: A system and method for speech recognition is provided. Embodiments may include receiving, at a first computing device, a far-talk signal from a far-talk computing device, the far-talk signal transmitted using a first channel and corresponding to an audible sound. Embodiments may further include receiving, at the first computing device, a near-talk signal from a near-talk computing device, the near-talk signal transmitted using a second channel and corresponding to the audible sound, wherein the far-talk signal and the near-talk signal are received during an enrollment phase of a far-talk speech recognition system. Embodiments may also include updating, at the first computing device, one or more models associated with a far-talk speech recognition system based upon, at least in part, one or more characteristics of the far-talk signal and one or more characteristics of the near-talk signal.
    Type: Application
    Filed: March 7, 2013
    Publication date: January 28, 2016
    Inventors: Joel Pinto, Josef Damianus Anastasiadis, Daniel Willett
  • Publication number: 20150262575
    Abstract: A computer-implemented method is described for front end speech processing for automatic speech recognition. A sequence of speech features which characterize an unknown speech input provided on an audio input channel and associated meta-data data which characterize the audio input channel are received. The speech features are transformed with a computer process that uses a trained mapping function controlled by the meta-data, and automatic speech recognition is performed of the transformed speech features.
    Type: Application
    Filed: June 28, 2012
    Publication date: September 17, 2015
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Daniel Willett, Karl Jonas Lööf, Yue Pan, Joel Pinto, Christian Gollan