Patents by Inventor Carl Benjamin Quillen

Carl Benjamin Quillen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11657828
    Abstract: Embodiments improve speech data quality through training a neural network for de-noising audio enhancement. One such embodiment creates simulated noisy speech data from high quality speech data. In turn, training, e.g., deep normalizing flow training, is performed on a neural network using the high quality speech data and the simulated noisy speech data to train the neural network to create de-noised speech data given noisy speech data. Performing the training includes minimizing errors in the neural network according to at least one of (i) a decoding error of an Automatic Speech Recognition (ASR) system processing current de-noised speech data results generated by the neural network during the training and (ii) spectral distance between the high quality speech data and the current de-noised speech data results generated by the neural network during the training.
    Type: Grant
    Filed: January 31, 2020
    Date of Patent: May 23, 2023
    Assignee: Nuance Communications, Inc.
    Inventor: Carl Benjamin Quillen
  • Patent number: 11545136
    Abstract: A method for removing private data from an acoustic model includes capturing speech from a large population of users, creating a text-to-speech voice from at least a portion of the large population of users, discarding speech data from a database of speech, creating text-to-speech waveforms from the text-to-speech voice and the new database of speech with the discarded speech data and generating an automatic speech recognition model using the text-to-speech waveforms.
    Type: Grant
    Filed: October 21, 2019
    Date of Patent: January 3, 2023
    Assignee: Nuance Communications, Inc.
    Inventors: Vincent Laurent Pollet, Carl Benjamin Quillen, Philip Charles Woodland, William F. Ganong, III, Steven Hoskins
  • Publication number: 20220399026
    Abstract: A method, computer program product, and computing system for receiving a plurality of signals from a plurality of microphones, thus defining a plurality of channels. A weighted multichannel representation of the plurality of channels may be generated. A plurality of weights for each channel of the plurality of channels may be generated based upon, at least in part, the weighted multichannel representation of the plurality of channels. A single channel representation of the plurality of channels may be generated based upon, at least in part, the weighted multichannel representation of the plurality of channels and the plurality of weights generated for each channel of the plurality of channels.
    Type: Application
    Filed: December 1, 2021
    Publication date: December 15, 2022
    Inventors: Rong Gong, Carl Benjamin Quillen, Dushyant Sharma, Ljubomir Milanovic
  • Publication number: 20210241780
    Abstract: Embodiments improve speech data quality through training a neural network for de-noising audio enhancement. One such embodiment creates simulated noisy speech data from high quality speech data. In turn, training, e.g., deep normalizing flow training, is performed on a neural network using the high quality speech data and the simulated noisy speech data to train the neural network to create de-noised speech data given noisy speech data. Performing the training includes minimizing errors in the neural network according to at least one of (i) a decoding error of an Automatic Speech Recognition (ASR) system processing current de-noised speech data results generated by the neural network during the training and (ii) spectral distance between the high quality speech data and the current de-noised speech data results generated by the neural network during the training.
    Type: Application
    Filed: January 31, 2020
    Publication date: August 5, 2021
    Inventor: Carl Benjamin Quillen
  • Publication number: 20210118425
    Abstract: A method for removing private data from an acoustic model includes capturing speech from a large population of users, creating a text-to-speech voice from at least a portion of the large population of users, discarding speech data from a database of speech, creating text-to-speech waveforms from the text-to-speech voice and the new database of speech with the discarded speech data and generating an automatic speech recognition model using the text-to-speech waveforms.
    Type: Application
    Filed: October 21, 2019
    Publication date: April 22, 2021
    Inventors: Vincent Laurent Pollet, Carl Benjamin Quillen, Philip Charles Woodland, William F. Ganong, III, Steven Hoskins
  • Patent number: 10803871
    Abstract: Methods described herein provide functionality for automatic speech recognition (ASR). One such embodiment performs speech recognition using received speech recognition result candidates, where the received candidates were generated by performing Statistical Language Model (SLM) based speech recognition on one or more frames of audio data. In turn, such an embodiment transmits results of the speech recognition, performed using the received speech recognition result candidates, to a user device via a communications network. Results of the speech recognition are available with lower latency than pure cloud based ASR solutions.
    Type: Grant
    Filed: November 26, 2018
    Date of Patent: October 13, 2020
    Assignee: Nuance Communications, Inc.
    Inventors: Carl Benjamin Quillen, Naveen Parihar
  • Publication number: 20190214014
    Abstract: Methods described herein provide functionality for automatic speech recognition (ASR). One such embodiment performs speech recognition using received speech recognition result candidates, where the received candidates were generated by performing Statistical Language Model (SLM) based speech recognition on one or more frames of audio data. In turn, such an embodiment transmits results of the speech recognition, performed using the received speech recognition result candidates, to a user device via a communications network. Results of the speech recognition are available with lower latency than pure cloud based ASR solutions.
    Type: Application
    Filed: November 26, 2018
    Publication date: July 11, 2019
    Inventors: Carl Benjamin Quillen, Naveen Parihar
  • Publication number: 20180211668
    Abstract: Method and apparatus for providing visual feedback on an electronic device in a client/server speech recognition system comprising the electronic device and a network device remotely located from the electronic device. The method comprises processing, by an embedded speech recognizer of the electronic device, at least a portion of input audio comprising speech to produce local recognized speech, sending at least a portion of the input audio to the network device for remote speech recognition, and displaying, on a user interface of the electronic device, visual feedback based on at least a portion of the local recognized speech prior to receiving streaming recognition results from the network device.
    Type: Application
    Filed: July 17, 2015
    Publication date: July 26, 2018
    Applicant: Nuance Communications, Inc.
    Inventors: Daniel Willett, Christian Gollan, Carl Benjamin Quillen, Stefan Hahn, Fabian Stemmer
  • Patent number: 9761227
    Abstract: Methods described herein provide functionality for automatic speech recognition (ASR). One such embodiment performs speech recognition using received speech recognition result candidates, where the received candidates were generated by performing Statistical Language Model (SLM) based speech recognition on one or more frames of audio data. In turn, such an embodiment transmits results of the speech recognition, performed using the received speech recognition result candidates, to a user device via a communications network. Results of the speech recognition are available with lower latency than pure cloud based ASR solutions.
    Type: Grant
    Filed: May 26, 2016
    Date of Patent: September 12, 2017
    Assignee: Nuance Communications, Inc.
    Inventors: Carl Benjamin Quillen, Naveen Parihar
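
Illustrative note on patent 11657828 (and its application 20210241780): the abstract says that simulated noisy speech data is created from high quality speech data, but it does not say how. One common way to do this, assumed here purely for illustration, is to add a noise signal scaled to a chosen signal-to-noise ratio (SNR). The function names `power`, `mix_at_snr`, and `measure_snr_db`, the sine-wave "speech", and the Gaussian noise are all hypothetical stand-ins and are not taken from the patent.

```python
import math
import random


def power(signal):
    """Mean squared amplitude of a sequence of samples."""
    return sum(x * x for x in signal) / len(signal)


def mix_at_snr(clean, noise, snr_db):
    """Add `noise` to `clean` after scaling it to the requested SNR in dB.

    Returns (noisy, scaled_noise). This is an assumed simulation recipe,
    not the method claimed in the patent.
    """
    if len(clean) != len(noise):
        raise ValueError("clean and noise must have the same length")
    p_clean = power(clean)
    p_noise = power(noise)
    if p_clean == 0 or p_noise == 0:
        raise ValueError("clean and noise must both have non-zero power")
    # SNR_dB = 10 * log10(P_clean / P_noise)  =>  solve for the noise power.
    target_noise_power = p_clean / (10 ** (snr_db / 10))
    gain = math.sqrt(target_noise_power / p_noise)
    scaled_noise = [gain * n for n in noise]
    noisy = [c + n for c, n in zip(clean, scaled_noise)]
    return noisy, scaled_noise


def measure_snr_db(clean, noise):
    """SNR in dB between a clean signal and the noise that was added to it."""
    return 10 * math.log10(power(clean) / power(noise))


# Hypothetical data: one second of a 220 Hz tone at 16 kHz plus Gaussian noise.
rng = random.Random(0)
sample_rate = 16000
clean = [math.sin(2 * math.pi * 220 * t / sample_rate) for t in range(sample_rate)]
noise = [rng.gauss(0.0, 1.0) for _ in range(sample_rate)]

noisy, scaled_noise = mix_at_snr(clean, noise, snr_db=10.0)
achieved_snr = measure_snr_db(clean, scaled_noise)
```

Pairs of `clean` and `noisy` produced this way could then serve as the target and input for a de-noising model of the kind the abstract describes. Real pipelines often also vary the noise type and SNR per utterance, which this sketch leaves out.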