Patents by Inventor John Wyatt

John Wyatt has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

QUICK SWAPPABLE FISHING JIG

Publication number: 20250241281

Abstract: A fishing jig that allows for efficient adaptability, enabling swift transitions between different styles and weights of jig heads without the need for laborious retying. The jig incorporates a screw-on/off mechanism that facilitates the rapid and secure interchangeability of jig heads, eliminating the downtime associated with traditional methods. This expedited swapping capability not only optimizes time spent on the water but also ensures an uninterrupted pursuit of the most suitable bait for the prevailing fishing conditions. Moreover, the jig's user-friendly quick-swap feature accommodates individuals with limited dexterity, enhancing accessibility for a broader range of anglers.

Type: Application

Filed: July 9, 2024

Publication date: July 31, 2025

Applicant: Blitz Performance LLC

Inventors: John Wyatt Keating, Stelios Christos Melekos

Learning the Joint Distribution of Two Sequences Using Little or No Paired Data

Publication number: 20250131273

Abstract: Provided is a noisy channel generative model of two sequences, for example text and speech, which enables uncovering the associations between the two modalities when limited paired data is available. To address the intractability of the exact model under a realistic data set-up, example aspects of the present disclosure include a variational inference approximation. To train this variational model with categorical data, a KL encoder loss approach is proposed which has connections to the wake-sleep algorithm.

Type: Application

Filed: September 27, 2023

Publication date: April 24, 2025

Inventors: Soroosh Mariooryad, Sean Matthew Shannon, Thomas Edward Bagby, Siyuan Ma, David Teh-Hwa Kao, Daisy Antonia Stanton, Eric Dean Battenberg, Russell John Wyatt Skerry-Ryan

AUTOMATED TEXT-TO-SPEECH PRONUNCIATION EDITING FOR LONG FORM TEXT DOCUMENTS

Publication number: 20250053738

Abstract: Aspects of this disclosure are directed to techniques that enable efficient automated text-to-speech pronunciation editing for long form text documents. A computing device comprising a memory and a processor may be configured to perform the techniques. The memory may store a text document. The processor may process words in the text document to identify first candidate words that are predicted to be mispronounced during automated text-to-speech processing of the text document. The processor may next filter the first candidate words to remove one or more candidate words of the first candidate words and obtain second candidate words that have fewer candidate words than the first candidate words. The processor may then annotate the text document to obtain an annotated text document that identifies the second candidate words, and output at least a portion of the annotated text document that identifies at least one candidate word of the second candidate words.

Type: Application

Filed: December 20, 2021

Publication date: February 13, 2025

Inventors: Ryan Dingler, John Rivlin, Christopher Salvarani, Yuanlei Zhang, Nazarii Kukhar, Russell John Wyatt Skerry-Ryan, Daisy Stanton, Judy Chang, Md Enzam Hossain

REDUCED PRESSURE THERAPY APPARATUS CONSTRUCTION AND CONTROL

Publication number: 20250025349

Abstract: Embodiments of negative pressure wound therapy systems and methods for operating the systems are disclosed. In some embodiments, a system includes a pump assembly, canister, and a wound dressing configured to be positioned over a wound. The pump assembly, canister, and the wound dressing can be fluidically connected to facilitate delivery of negative pressure to a wound. The pump assembly can present graphical user interface screens for controlling and monitoring delivery of negative pressure. The system can be configured to efficiently deliver negative pressure and to detect and indicate presence of certain conditions, such as low pressure, high pressure, leak, canister full, and the like. Monitoring and detection of operating condition can be performed by measuring one or more operational parameters, such as pressure, flow rate, and the like.

Type: Application

Filed: October 7, 2024

Publication date: January 23, 2025

Inventors: Alex Fowler, William W. Gregory, William Joseph Jaecklein, Kathryn Ann Leigh, Paul N. Minor, Michael Mosholder, Felix C. Quintanar, John P. Racette, Christopher Rouseff, Matthew Smith, W. Len Smith, John Wyatt, Annaliese Yeaman

MULTILINGUAL SPEECH SYNTHESIS AND CROSS-LANGUAGE VOICE CLONING

Publication number: 20240404506

Abstract: A method includes receiving an input text sequence to be synthesized into speech in a first language and obtaining a speaker embedding, the speaker embedding specifying specific voice characteristics of a target speaker for synthesizing the input text sequence into speech that clones a voice of the target speaker. The target speaker includes a native speaker of a second language different than the first language. The method also includes generating, using a text-to-speech (TTS) model, an output audio feature representation of the input text by processing the input text sequence and the speaker embedding. The output audio feature representation includes the voice characteristics of the target speaker specified by the speaker embedding.

Type: Application

Filed: August 8, 2024

Publication date: December 5, 2024

Applicant: Google LLC

Inventors: Yu Zhang, Ron J. Weiss, Byungha Chun, Yonghui Wu, Zhifeng Chen, Russell John Wyatt Skerry-Ryan, Ye Jia, Andrew M. Rosenberg, Bhuvana Ramabhadran

Neural-Network-Based Text-to-Speech Model for Novel Speaker Generation

Publication number: 20240395239

Abstract: Systems and methods for text-to-speech with novel speakers can obtain text data and output audio data. The input text data may be input along with one or more speaker preferences. The speaker preferences can include speaker characteristics. The speaker preferences can be processed by a machine-learned model conditioned on a learned prior distribution to determine a speaker embedding. The speaker embedding can then be processed with the text data to generate an output that includes audio data descriptive of the text data spoken by a novel speaker.

Type: Application

Filed: August 6, 2024

Publication date: November 28, 2024

Inventors: Daisy Antonia Stanton, Sean Matthew Shannon, Soroosh Mariooryad, Russell John Wyatt Skerry-Ryan, Eric Dean Battenberg, Thomas Edward Bagby, David Teh-Hwa Kao

Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Publication number: 20240395238

Abstract: A method for estimating an embedding capacity includes receiving, at a deterministic reference encoder, a reference audio signal, and determining a reference embedding corresponding to the reference audio signal, the reference embedding having a corresponding embedding dimensionality. The method also includes measuring a first reconstruction loss as a function of the corresponding embedding dimensionality of the reference embedding and obtaining a variational embedding from a variational posterior. The variational embedding has a corresponding embedding dimensionality and a specified capacity. The method also includes measuring a second reconstruction loss as a function of the corresponding embedding dimensionality of the variational embedding and estimating a capacity of the reference embedding by comparing the first measured reconstruction loss for the reference embedding relative to the second measured reconstruction loss for the variational embedding having the specified capacity.

Type: Application

Filed: August 7, 2024

Publication date: November 28, 2024

Applicant: Google LLC

Inventors: Eric Dean Battenberg, Daisy Stanton, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad, David Teh-hwa Kao, Thomas Edward Bagby, Sean Matthew Shannon

LANGUAGE MODELS USING SPOKEN LANGUAGE MODELING

Publication number: 20240386885

Abstract: A method includes receiving an input sequence of speech features characterizing a spoken prompt. The method also includes generating a corresponding sequence of audio encodings using an audio encoder of a spoken language model. Without applying any intermediary cross-attention to the sequence of audio encoding between the audio encoder and a language model decoder of the spoken language model, the method includes processing the sequence of audio encodings generated by the audio encoder using the language model decoder to generate an output sequence of speech features characterizing a continuation of the spoken prompt.

Type: Application

Filed: May 13, 2024

Publication date: November 21, 2024

Applicant: Google LLC

Inventors: Michelle Dana Tadmor, Eliya Nachmani, Alon Levkovitch, Julian Salazar, Chulayuth Asawaroengchai, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad

Synthesizing speech from text using neural networks

Patent number: 12148444

Abstract: Methods, systems, and computer program products for generating, from an input character sequence, an output sequence of audio data representing the input character sequence. The output sequence of audio data includes a respective audio output sample for each of a number of time steps. One example method includes, for each of the time steps: generating a mel-frequency spectrogram for the time step by processing a representation of a respective portion of the input character sequence using a decoder neural network; generating a probability distribution over a plurality of possible audio output samples for the time step by processing the mel-frequency spectrogram for the time step using a vocoder neural network; and selecting the audio output sample for the time step from the possible audio output samples in accordance with the probability distribution.

Type: Grant

Filed: April 5, 2021

Date of Patent: November 19, 2024

Assignee: Google LLC

Inventors: Yonghui Wu, Jonathan Shen, Ruoming Pang, Ron J. Weiss, Michael Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, Russell John Wyatt Skerry-Ryan, Ryan M. Rifkin, Ioannis Agiomyrgiannakis

Reduced pressure therapy apparatus construction and control

Patent number: 12133789

Abstract: Embodiments of negative pressure wound therapy systems and methods for operating the systems are disclosed. In some embodiments, a system includes a pump assembly, canister, and a wound dressing configured to be positioned over a wound. The pump assembly, canister, and the wound dressing can be fluidically connected to facilitate delivery of negative pressure to a wound. The pump assembly can present graphical user interface screens for controlling and monitoring delivery of negative pressure. The system can be configured to efficiently deliver negative pressure and to detect and indicate presence of certain conditions, such as low pressure, high pressure, leak, canister full, and the like. Monitoring and detection of operating condition can be performed by measuring one or more operational parameters, such as pressure, flow rate, and the like.

Type: Grant

Filed: March 30, 2020

Date of Patent: November 5, 2024

Assignee: Smith & Nephew, Inc.

Inventors: William W. Gregory, William Joseph Jaecklein, Kathryn Ann Leigh, Paul N. Minor, Michael Mosholder, Felix C. Quintanar, John P. Racette, Christopher Rouseff, Matthew Smith, W. Len Smith, John Wyatt

Multilingual speech synthesis and cross-language voice cloning

Patent number: 12087273

Abstract: A method includes receiving an input text sequence to be synthesized into speech in a first language and obtaining a speaker embedding, the speaker embedding specifying specific voice characteristics of a target speaker for synthesizing the input text sequence into speech that clones a voice of the target speaker. The target speaker includes a native speaker of a second language different than the first language. The method also includes generating, using a text-to-speech (TTS) model, an output audio feature representation of the input text by processing the input text sequence and the speaker embedding. The output audio feature representation includes the voice characteristics of the target speaker specified by the speaker embedding.

Type: Grant

Filed: January 30, 2023

Date of Patent: September 10, 2024

Assignee: Google LLC

Inventors: Yu Zhang, Ron J. Weiss, Byungha Chun, Yonghui Wu, Zhifeng Chen, Russell John Wyatt Skerry-Ryan, Ye Jia, Andrew M. Rosenberg, Bhuvana Ramabhadran

Neural-network-based text-to-speech model for novel speaker generation

Patent number: 12087275

Abstract: Systems and methods for text-to-speech with novel speakers can obtain text data and output audio data. The input text data may be input along with one or more speaker preferences. The speaker preferences can include speaker characteristics. The speaker preferences can be processed by a machine-learned model conditioned on a learned prior distribution to determine a speaker embedding. The speaker embedding can then be processed with the text data to generate an output that includes audio data descriptive of the text data spoken by a novel speaker.

Type: Grant

Filed: February 16, 2022

Date of Patent: September 10, 2024

Assignee: GOOGLE LLC

Inventors: Daisy Antonia Stanton, Sean Matthew Shannon, Soroosh Mariooryad, Russell John-Wyatt Skerry-Ryan, Eric Dean Battenberg, Thomas Edward Bagby, David Teh-Hwa Kao

Variational embedding capacity in expressive end-to-end speech synthesis

Patent number: 12067969

Abstract: A method for estimating an embedding capacity includes receiving, at a deterministic reference encoder, a reference audio signal, and determining a reference embedding corresponding to the reference audio signal, the reference embedding having a corresponding embedding dimensionality. The method also includes measuring a first reconstruction loss as a function of the corresponding embedding dimensionality of the reference embedding and obtaining a variational embedding from a variational posterior. The variational embedding has a corresponding embedding dimensionality and a specified capacity. The method also includes measuring a second reconstruction loss as a function of the corresponding embedding dimensionality of the variational embedding and estimating a capacity of the reference embedding by comparing the first measured reconstruction loss for the reference embedding relative to the second measured reconstruction loss for the variational embedding having the specified capacity.

Type: Grant

Filed: April 18, 2023

Date of Patent: August 20, 2024

Assignee: Google LLC

Inventors: Eric Dean Battenberg, Daisy Stanton, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad, David Teh-Hwa Kao, Thomas Edward Bagby, Sean Matthew Shannon

Controlling Expressivity In End-to-End Speech Synthesis Systems

Publication number: 20230274728

Abstract: A system for generating an output audio signal includes a context encoder, a text-prediction network, and a text-to-speech (TTS) model. The context encoder is configured to receive one or more context features associated with current input text and process the one or more context features to generate a context embedding associated with the current input text. The text-prediction network is configured to process the current input text and the context embedding to predict, as output, a style embedding for the current input text. The style embedding specifies a specific prosody and/or style for synthesizing the current input text into expressive speech. The TTS model is configured to process the current input text and the style embedding to generate an output audio signal of expressive speech of the current input text. The output audio signal has the specific prosody and/or style specified by the style embedding.

Type: Application

Filed: May 9, 2023

Publication date: August 31, 2023

Applicant: Google LLC

Inventors: Daisy Stanton, Eric Dean Battenberg, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad, David Teh-hwa Kao, Thomas Edward Bagby, Sean Matthew Shannon

Variational Embedding Capacity in Expressive End-to-End Speech Synthesis

Publication number: 20230260504

Abstract: A method for estimating an embedding capacity includes receiving, at a deterministic reference encoder, a reference audio signal, and determining a reference embedding corresponding to the reference audio signal, the reference embedding having a corresponding embedding dimensionality. The method also includes measuring a first reconstruction loss as a function of the corresponding embedding dimensionality of the reference embedding and obtaining a variational embedding from a variational posterior. The variational embedding has a corresponding embedding dimensionality and a specified capacity. The method also includes measuring a second reconstruction loss as a function of the corresponding embedding dimensionality of the variational embedding and estimating a capacity of the reference embedding by comparing the first measured reconstruction loss for the reference embedding relative to the second measured reconstruction loss for the variational embedding having the specified capacity.

Type: Application

Filed: April 18, 2023

Publication date: August 17, 2023

Applicant: Google LLC

Inventors: Eric Dean Battenberg, Daisy Stanton, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad, David Teh-hwa Kao, Thomas Edward Bagby, Sean Matthew Shannon

Neural-Network-Based Text-to-Speech Model for Novel Speaker Generation

Publication number: 20230206898

Abstract: Systems and methods for text-to-speech with novel speakers can obtain text data and output audio data. The input text data may be input along with one or more speaker preferences. The speaker preferences can include speaker characteristics. The speaker preferences can be processed by a machine-learned model conditioned on a learned prior distribution to determine a speaker embedding. The speaker embedding can then be processed with the text data to generate an output that includes audio data descriptive of the text data spoken by a novel speaker.

Type: Application

Filed: February 16, 2022

Publication date: June 29, 2023

Inventors: Daisy Antonia Stanton, Sean Matthew Shannon, Soroosh Mariooryad, Russell John-Wyatt Skerry-Ryan, Eric Dean Battenberg, Thomas Edward Bagby, David Teh-Hwa Kao

Controlling expressivity in end-to-end speech synthesis systems

Patent number: 11676573

Abstract: A system for generating an output audio signal includes a context encoder, a text-prediction network, and a text-to-speech (TTS) model. The context encoder is configured to receive one or more context features associated with current input text and process the one or more context features to generate a context embedding associated with the current input text. The text-prediction network is configured to process the current input text and the context embedding to predict, as output, a style embedding for the current input text. The style embedding specifies a specific prosody and/or style for synthesizing the current input text into expressive speech. The TTS model is configured to process the current input text and the style embedding to generate an output audio signal of expressive speech of the current input text. The output audio signal has the specific prosody and/or style specified by the style embedding.

Type: Grant

Filed: July 16, 2020

Date of Patent: June 13, 2023

Assignee: Google LLC

Inventors: Daisy Stanton, Eric Dean Battenberg, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad, David Teh-Hwa Kao, Thomas Edward Bagby, Sean Matthew Shannon

MULTILINGUAL SPEECH SYNTHESIS AND CROSS-LANGUAGE VOICE CLONING

Publication number: 20230178068

Abstract: A method includes receiving an input text sequence to be synthesized into speech in a first language and obtaining a speaker embedding, the speaker embedding specifying specific voice characteristics of a target speaker for synthesizing the input text sequence into speech that clones a voice of the target speaker. The target speaker includes a native speaker of a second language different than the first language. The method also includes generating, using a text-to-speech (TTS) model, an output audio feature representation of the input text by processing the input text sequence and the speaker embedding. The output audio feature representation includes the voice characteristics of the target speaker specified by the speaker embedding.

Type: Application

Filed: January 30, 2023

Publication date: June 8, 2023

Applicant: Google LLC

Inventors: Yu Zhang, Ron J. Weiss, Byungha Chun, Yonghui Wu, Zhifeng Chen, Russell John Wyatt Skerry-Ryan, Ye Jia, Andrew M. Rosenberg, Bhuvana Ramabhadran

Variational embedding capacity in expressive end-to-end speech synthesis

Patent number: 11646010

Abstract: A method for estimating an embedding capacity includes receiving, at a deterministic reference encoder, a reference audio signal, and determining a reference embedding corresponding to the reference audio signal, the reference embedding having a corresponding embedding dimensionality. The method also includes measuring a first reconstruction loss as a function of the corresponding embedding dimensionality of the reference embedding and obtaining a variational embedding from a variational posterior. The variational embedding has a corresponding embedding dimensionality and a specified capacity. The method also includes measuring a second reconstruction loss as a function of the corresponding embedding dimensionality of the variational embedding and estimating a capacity of the reference embedding by comparing the first measured reconstruction loss for the reference embedding relative to the second measured reconstruction loss for the variational embedding having the specified capacity.

Type: Grant

Filed: December 9, 2021

Date of Patent: May 9, 2023

Assignee: Google LLC

Inventors: Eric Dean Battenberg, Daisy Stanton, Russell John Wyatt Skerry-Ryan, Soroosh Mariooryad, David Teh-Hwa Kao, Thomas Edward Bagby, Sean Matthew Shannon

Multilingual speech synthesis and cross-language voice cloning

Patent number: 11580952

Abstract: A method includes receiving an input text sequence to be synthesized into speech in a first language and obtaining a speaker embedding, the speaker embedding specifying specific voice characteristics of a target speaker for synthesizing the input text sequence into speech that clones a voice of the target speaker. The target speaker includes a native speaker of a second language different than the first language. The method also includes generating, using a text-to-speech (TTS) model, an output audio feature representation of the input text by processing the input text sequence and the speaker embedding. The output audio feature representation includes the voice characteristics of the target speaker specified by the speaker embedding.

Type: Grant

Filed: April 22, 2020

Date of Patent: February 14, 2023

Assignee: Google LLC

Inventors: Yu Zhang, Ron J. Weiss, Byungha Chun, Yonghui Wu, Zhifeng Chen, Russell John Wyatt Skerry-Ryan, Ye Jia, Andrew M. Rosenberg, Bhuvana Ramabhadran
