Patents by Inventor Norman Casagrande

Norman Casagrande has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Predicting spectral representations for training speech synthesis neural networks

Patent number: 11830475

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.

Type: Grant

Filed: June 1, 2022

Date of Patent: November 28, 2023

Assignee: DeepMind Technologies Limited

Inventor: Norman Casagrande
Two-Level Text-To-Speech Systems Using Synthetic Training Data

Publication number: 20230018384

Abstract: A method includes obtaining training data including a plurality of training audio signals and corresponding transcripts. Each training audio signal is spoken by a target speaker in a first accent/dialect. For each training audio signal of the training data, the method includes generating a training synthesized speech representation spoken by the target speaker in a second accent/dialect different than the first accent/dialect and training a text-to-speech (TTS) system based on the corresponding transcript and the training synthesized speech representation. The method also includes receiving an input text utterance to be synthesized into speech in the second accent/dialect. The method also includes obtaining conditioning inputs that include a speaker embedding and an accent/dialect identifier that identifies the second accent/dialect.

Type: Application

Filed: July 14, 2021

Publication date: January 19, 2023

Applicant: Google LLC

Inventors: Lev Finkelstein, Chun-an Chan, Byungha Chun, Norman Casagrande, Yu Zhang, Robert Andrew James Clark, Vincent Wan
PREDICTING SPECTRAL REPRESENTATIONS FOR TRAINING SPEECH SYNTHESIS NEURAL NETWORKS

Publication number: 20220383851

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to perform speech synthesis. One of the methods includes obtaining a training data set for training a first neural network to process a spectral representation of an audio sample and to generate a prediction of the audio sample, wherein, after training, the first neural network obtains spectral representations of audio samples from a second neural network; for a plurality of audio samples in the training data set: generating a ground-truth spectral representation of the audio sample; and processing the ground-truth spectral representation using a third neural network to generate an updated spectral representation of the audio sample; and training the first neural network using the updated spectral representations, wherein the third neural network is configured to generate updated spectral representations that resemble spectral representations generated by the second neural network.

Type: Application

Filed: June 1, 2022

Publication date: December 1, 2022

Inventor: Norman Casagrande
Adaptive display brightness adjustment

Patent number: 11419197

Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.

Type: Grant

Filed: August 5, 2021

Date of Patent: August 16, 2022

Assignee: Google LLC

Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
Building a text-to-speech system from a small amount of speech data

Patent number: 11335321

Abstract: A method of building a text-to-speech (TTS) system from a small amount of speech data includes receiving a first plurality of recorded speech samples from an assortment of speakers and a second plurality of recorded speech samples from a target speaker where the assortment of speakers does not include the target speaker. The method further includes training a TTS model using the first plurality of recorded speech samples from the assortment of speakers. Here, the trained TTS model is configured to output synthetic speech as an audible representation of a text input. The method also includes re-training the trained TTS model using the second plurality of recorded speech samples from the target speaker combined with the first plurality of recorded speech samples from the assortment of speakers. Here, the re-trained TTS model is configured to output synthetic speech resembling speaking characteristics of the target speaker.

Type: Grant

Filed: August 28, 2020

Date of Patent: May 17, 2022

Assignee: Google LLC

Inventors: Ye Jia, Byungha Chun, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo, Russell John Wyatt Skerry-Ryan, Jonathan Shen, Yonghui Wu, Yu Zhang
Building a Text-to-Speech System from a Small Amount of Speech Data

Publication number: 20220068256

Abstract: A method of building a text-to-speech (TTS) system from a small amount of speech data includes receiving a first plurality of recorded speech samples from an assortment of speakers and a second plurality of recorded speech samples from a target speaker where the assortment of speakers does not include the target speaker. The method further includes training a TTS model using the first plurality of recorded speech samples from the assortment of speakers. Here, the trained TTS model is configured to output synthetic speech as an audible representation of a text input. The method also includes re-training the trained TTS model using the second plurality of recorded speech samples from the target speaker combined with the first plurality of recorded speech samples from the assortment of speakers. Here, the re-trained TTS model is configured to output synthetic speech resembling speaking characteristics of the target speaker.

Type: Application

Filed: August 28, 2020

Publication date: March 3, 2022

Applicant: Google LLC

Inventors: Ye Jia, Byungha Chun, Yusuke Oda, Norman Casagrande, Tejas Iyer, Fan Luo, Russell John Wyatt Skerry-Ryan, Jonathan Shen, Yonghui Wu, Yu Zhang
ADAPTIVE DISPLAY BRIGHTNESS ADJUSTMENT

Publication number: 20210368604

Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.

Type: Application

Filed: August 5, 2021

Publication date: November 25, 2021

Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
Adaptive display brightness adjustment

Patent number: 11096259

Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.

Type: Grant

Filed: December 15, 2017

Date of Patent: August 17, 2021

Assignee: Google LLC

Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner
HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS

Publication number: 20210089909

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating output audio examples using a generative neural network. One of the methods includes obtaining a training conditioning text input; processing a training generative input comprising the training conditioning text input using a feedforward generative neural network to generate a training audio output; processing the training audio output using each of a plurality of discriminators, wherein the plurality of discriminators comprises one or more conditional discriminators and one or more unconditional discriminators; determining a first combined prediction by combining the respective predictions of the plurality of discriminators; and determining an update to current values of a plurality of generative parameters of the feedforward generative neural network to increase a first error in the first combined prediction.

Type: Application

Filed: September 25, 2020

Publication date: March 25, 2021

Inventors: Mikolaj Binkowski, Karen Simonyan, Jeffrey Donahue, Aidan Clark, Sander Etienne Lea Dieleman, Erich Konrad Elsen, Luis Carlos Cobo Rus, Norman Casagrande
ADAPTIVE DISPLAY BRIGHTNESS ADJUSTMENT

Publication number: 20200314985

Abstract: A computer-implemented method for adaptive display brightness adjustment, the method comprising: obtaining current state data characterizing a current state of a device having a display with an adjustable brightness; providing the current state data as input to a brightness prediction machine learning model, wherein the model is configured to process the current state data in accordance with current values of a set of model parameters to generate as output a proposed display brightness for the display of the device; setting the brightness of the display to a brightness that is lower than the proposed display brightness in accordance with an exploration policy; determining whether a user of the device manually adjusts the display brightness; and in response to determining that the user did not manually adjust the display brightness, using the lower brightness as a target output for adjusting the current values of the set of model parameters.

Type: Application

Filed: December 15, 2017

Publication date: October 1, 2020

Inventors: Thomas Degris, Benjamin Hal Murdoch, Norman Casagrande, Peeyush Agarwal, Christopher Gamble, Christopher Sigurd Fougner