Patents by Inventor Norman Casagrande
Norman Casagrande has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11830475Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.Type: GrantFiled: June 1, 2022Date of Patent: November 28, 2023Assignee: DeepMind Technologies LimitedInventor: Norman Casagrande
-
Publication number: 20230018384Abstract: A method includes obtaining training data including a plurality of training audio signals and corresponding transcripts. Each training audio signal is spoken by a target speaker in a first accent/dialect. For each training audio signal of the training data, the method includes generating a training synthesized speech representation spoken by the target speaker in a second accent/dialect different than the first accent/dialect and training a text-to-speech (TTS) system based on the corresponding transcript and the training synthesized speech representation. The method also includes receiving an input text utterance to be synthesized into speech in the second accent/dialect. The method also includes obtaining conditioning inputs that include a speaker embedding and an accent/dialect identifier that identifies the second accent/dialect.Type: ApplicationFiled: July 14, 2021Publication date: January 19, 2023Applicant: Google LLCInventors: Lev Finkelstein, Chun-an Chan, Byungha Chun, Norman Casagrande, Yu Zhang, Robert Andrew James Clark, Vincent Wan
-
Publication number: 20220383851Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.Type: ApplicationFiled: June 1, 2022Publication date: December 1, 2022Inventor: Norman Casagrande
-
Patent number: 11419197Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.Type: GrantFiled: August 5, 2021Date of Patent: August 16, 2022Assignee: Google LLCInventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
-
Patent number: 11335321Abstract: A method of building a text-to-speech (TTS) system from a small amount of speech data includes receiving a first plurality of recorded speech samples from an assortment of speakers and a second plurality of recorded speech samples from a target speaker where the assortment of speakers does not include the target speaker. The method further includes training a TTS model using the first plurality of recorded speech samples from the assortment of speakers. Here, the trained TTS model is configured to output synthetic speech as an audible representation of a text input. The method also includes re-training the trained TTS model using the second plurality of recorded speech samples from the target speaker combined with the first plurality of recorded speech samples from the assortment of speakers. Here, the re-trained TTS model is configured to output synthetic speech resembling speaking characteristics of the target speaker.Type: GrantFiled: August 28, 2020Date of Patent: May 17, 2022Assignee: Google LLCInventors: Ye Jia, Byungha Chun, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo, Russell John Wyatt Skerry-Ryan, Jonathan Shen, Yonghui Wu, Yu Zhang
-
Publication number: 20220068256Abstract: A method of building a text-to-speech (TTS) system from a small amount of speech data includes receiving a first plurality of recorded speech samples from an assortment of speakers and a second plurality of recorded speech samples from a target speaker where the assortment of speakers does not include the target speaker. The method further includes training a TTS model using the first plurality of recorded speech samples from the assortment of speakers. Here, the trained TTS model is configured to output synthetic speech as an audible representation of a text input. The method also includes re-training the trained TTS model using the second plurality of recorded speech samples from the target speaker combined with the first plurality of recorded speech samples from the assortment of speakers. Here, the re-trained TTS model is configured to output synthetic speech resembling speaking characteristics of the target speaker.Type: ApplicationFiled: August 28, 2020Publication date: March 3, 2022Applicant: Google LLCInventors: Ye Jia, Byungha Chun, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo, Russell John Wyatt Skerry-Ryan, Jonathan Shen, Yonghui Wu, Yu Zhang
-
Publication number: 20210368604Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.Type: ApplicationFiled: August 5, 2021Publication date: November 25, 2021Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
-
Patent number: 11096259Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.Type: GrantFiled: December 15, 2017Date of Patent: August 17, 2021Assignee: Google LLCInventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
-
Publication number: 20210089909Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output audio examples using a generative neural network. One of the methods includes obtaining a training conditioning text input; processing a training generative input comprising the training conditioning text input using a feedforward generative neural network to generate a training audio output; processing the training audio output using each of a plurality of discriminators, wherein the plurality of discriminators comprises one or more conditional discriminators and one or more unconditional discriminators; determining a first combined prediction by combining the respective predictions of the plurality of discriminators; and determining an update to current values of a plurality of generative parameters of the feedforward generative neural network to increase a first error in the first combined prediction.Type: ApplicationFiled: September 25, 2020Publication date: March 25, 2021Inventors: Mikolaj Binkowski, Karen Simonyan, Jeffrey Donahue, Aidan Clark, Sander Etienne Lea Dieleman, Erich Konrad Elsen, Luis Carlos Cobo Rus, Norman Casagrande
-
Publication number: 20200314985Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.Type: ApplicationFiled: December 15, 2017Publication date: October 1, 2020Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner