Patents by Inventor Norman Casagrande

Norman Casagrande has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11830475
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.
    Type: Grant
    Filed: June 1, 2022
    Date of Patent: November 28, 2023
    Assignee: DeepMind Technologies Limited
    Inventor: Norman Casagrande
  • Publication number: 20230018384
    Abstract: A method includes obtaining training data including a plurality of training audio signals and corresponding transcripts. Each training audio signal is spoken by a target speaker in a first accent/dialect. For each training audio signal of the training data, the method includes generating a training synthesized speech representation spoken by the target speaker in a second accent/dialect different than the first accent/dialect and training a text-to-speech (TTS) system based on the corresponding transcript and the training synthesized speech representation. The method also includes receiving an input text utterance to be synthesized into speech in the second accent/dialect. The method also includes obtaining conditioning inputs that include a speaker embedding and an accent/dialect identifier that identifies the second accent/dialect.
    Type: Application
    Filed: July 14, 2021
    Publication date: January 19, 2023
    Applicant: Google LLC
    Inventors: Lev Finkelstein, Chun-an Chan, Byungha Chun, Norman Casagrande, Yu Zhang, Robert Andrew James Clark, Vincent Wan
  • Publication number: 20220383851
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.
    Type: Application
    Filed: June 1, 2022
    Publication date: December 1, 2022
    Inventor: Norman Casagrande
  • Patent number: 11419197
    Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.
    Type: Grant
    Filed: August 5, 2021
    Date of Patent: August 16, 2022
    Assignee: Google LLC
    Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
  • Patent number: 11335321
    Abstract: A method of building a text-to-speech (TTS) system from a small amount of speech data includes receiving a first plurality of recorded speech samples from an assortment of speakers and a second plurality of recorded speech samples from a target speaker where the assortment of speakers does not include the target speaker. The method further includes training a TTS model using the first plurality of recorded speech samples from the assortment of speakers. Here, the trained TTS model is configured to output synthetic speech as an audible representation of a text input. The method also includes re-training the trained TTS model using the second plurality of recorded speech samples from the target speaker combined with the first plurality of recorded speech samples from the assortment of speakers. Here, the re-trained TTS model is configured to output synthetic speech resembling speaking characteristics of the target speaker.
    Type: Grant
    Filed: August 28, 2020
    Date of Patent: May 17, 2022
    Assignee: Google LLC
    Inventors: Ye Jia, Byungha Chun, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo, Russell John Wyatt Skerry-Ryan, Jonathan Shen, Yonghui Wu, Yu Zhang
  • Publication number: 20220068256
    Abstract: A method of building a text-to-speech (TTS) system from a small amount of speech data includes receiving a first plurality of recorded speech samples from an assortment of speakers and a second plurality of recorded speech samples from a target speaker where the assortment of speakers does not include the target speaker. The method further includes training a TTS model using the first plurality of recorded speech samples from the assortment of speakers. Here, the trained TTS model is configured to output synthetic speech as an audible representation of a text input. The method also includes re-training the trained TTS model using the second plurality of recorded speech samples from the target speaker combined with the first plurality of recorded speech samples from the assortment of speakers. Here, the re-trained TTS model is configured to output synthetic speech resembling speaking characteristics of the target speaker.
    Type: Application
    Filed: August 28, 2020
    Publication date: March 3, 2022
    Applicant: Google LLC
    Inventors: Ye Jia, Byungha Chun, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo, Russell John Wyatt Skerry-Ryan, Jonathan Shen, Yonghui Wu, Yu Zhang
  • Publication number: 20210368604
    Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.
    Type: Application
    Filed: August 5, 2021
    Publication date: November 25, 2021
    Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
  • Patent number: 11096259
    Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.
    Type: Grant
    Filed: December 15, 2017
    Date of Patent: August 17, 2021
    Assignee: Google LLC
    Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
  • Publication number: 20210089909
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output audio examples using a generative neural network. One of the methods includes obtaining a training conditioning text input; processing a training generative input comprising the training conditioning text input using a feedforward generative neural network to generate a training audio output; processing the training audio output using each of a plurality of discriminators, wherein the plurality of discriminators comprises one or more conditional discriminators and one or more unconditional discriminators; determining a first combined prediction by combining the respective predictions of the plurality of discriminators; and determining an update to current values of a plurality of generative parameters of the feedforward generative neural network to increase a first error in the first combined prediction.
    Type: Application
    Filed: September 25, 2020
    Publication date: March 25, 2021
    Inventors: Mikolaj Binkowski, Karen Simonyan, Jeffrey Donahue, Aidan Clark, Sander Etienne Lea Dieleman, Erich Konrad Elsen, Luis Carlos Cobo Rus, Norman Casagrande
  • Publication number: 20200314985
    Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.
    Type: Application
    Filed: December 15, 2017
    Publication date: October 1, 2020
    Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner