Patents by Inventor Thibault Doutre

Thibault Doutre has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240029716
    Abstract: A method for training a streaming automatic speech recognition student model includes receiving a plurality of unlabeled student training utterances. The method also includes, for each unlabeled student training utterance, generating a transcription corresponding to the respective unlabeled student training utterance using a plurality of non-streaming automated speech recognition (ASR) teacher models. The method further includes distilling a streaming ASR student model from the plurality of non-streaming ASR teacher models by training the streaming ASR student model using the plurality of unlabeled student training utterances paired with the corresponding transcriptions generated by the plurality of non-streaming ASR teacher models.
    Type: Application
    Filed: October 4, 2023
    Publication date: January 25, 2024
    Applicant: Google LLC
    Inventors: Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao
  • Patent number: 11804212
    Abstract: A method for training a streaming automatic speech recognition student model includes receiving a plurality of unlabeled student training utterances. The method also includes, for each unlabeled student training utterance, generating a transcription corresponding to the respective unlabeled student training utterance using a plurality of non-streaming automated speech recognition (ASR) teacher models. The method further includes distilling a streaming ASR student model from the plurality of non-streaming ASR teacher models by training the streaming ASR student model using the plurality of unlabeled student training utterances paired with the corresponding transcriptions generated by the plurality of non-streaming ASR teacher models.
    Type: Grant
    Filed: June 15, 2021
    Date of Patent: October 31, 2023
    Assignee: Google LLC
    Inventors: Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao
  • Publication number: 20230103382
    Abstract: A method includes obtaining a set of training samples, wherein each training sample includes a corresponding sequence of speech segments corresponding to a training utterance and a corresponding sequence of ground-truth transcriptions for the sequence of speech segments, and wherein each ground-truth transcription includes a start time and an end time of a corresponding speech segment. For each training sample in the set of training samples, the method includes processing, using a speech recognition model, the corresponding sequence of speech segments to obtain one or more speech recognition hypotheses for the training utterance; and, for each speech recognition hypothesis obtained for the training utterance, identifying a respective number of word errors relative to the corresponding sequence of ground-truth transcriptions.
    Type: Application
    Filed: September 27, 2022
    Publication date: April 6, 2023
    Applicant: Google LLC
    Inventors: Zhiyun Lu, Thibault Doutre, Yanwei Pan, Liangliang Cao, Rohit Prabhavalkar, Trevor Strohman, Chao Zhang
  • Publication number: 20220343894
    Abstract: A method for training a streaming automatic speech recognition student model includes receiving a plurality of unlabeled student training utterances. The method also includes, for each unlabeled student training utterance, generating a transcription corresponding to the respective unlabeled student training utterance using a plurality of non-streaming automated speech recognition (ASR) teacher models. The method further includes distilling a streaming ASR student model from the plurality of non-streaming ASR teacher models by training the streaming ASR student model using the plurality of unlabeled student training utterances paired with the corresponding transcriptions generated by the plurality of non-streaming ASR teacher models.
    Type: Application
    Filed: June 15, 2021
    Publication date: October 27, 2022
    Applicant: Google LLC
    Inventors: Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao
  • Patent number: 10916003
    Abstract: An image quality scorer machine accesses a candidate image to be analyzed for visual quality. The image quality scorer machine generates a visual quality score of the candidate image by first generating a prediction of a similarity score for the candidate image. The predicted similarly score of the candidate image may be generated by a process including inputting the candidate image into a neural network that has been trained to detect a set of image features in the candidate image and then to generate a corresponding predicted similarity score based on degrees to which the image features in the set are present in the candidate image. The image quality scorer machine derives the visual quality score based on the predicted similarity score outputted by the neural network. Accordingly, the image quality score machine may provide or store the generated visual quality score of candidate image for subsequent usage.
    Type: Grant
    Filed: March 20, 2018
    Date of Patent: February 9, 2021
    Assignee: Uber Technologies, Inc.
    Inventors: Thibault Doutre, David Gregory Purdy, Jason Byron Yosinski
  • Publication number: 20190295240
    Abstract: An image quality scorer machine accesses a candidate image to be analyzed for visual quality. The image quality scorer machine generates a visual quality score of the candidate image by first generating a prediction of a similarity score for the candidate image. The predicted similarly score of the candidate image may be generated by a process including inputting the candidate image into a neural network that has been trained to detect a set of image features in the candidate image and then to generate a corresponding predicted similarity score based on degrees to which the image features in the set are present in the candidate image. The image quality scorer machine derives the visual quality score based on the predicted similarity score outputted by the neural network. Accordingly, the image quality score machine may provide or store the generated visual quality score of candidate image for subsequent usage.
    Type: Application
    Filed: March 20, 2018
    Publication date: September 26, 2019
    Inventors: Thibault Doutre, David Gregory Purdy, Jason Byron Yosinski