Patents by Inventor Jesse Emond

Jesse Emond has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

LANGUAGE-AGNOSTIC MULTILINGUAL MODELING USING EFFECTIVE SCRIPT NORMALIZATION

Publication number: 20260120679

Abstract: A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample.

Type: Application

Filed: December 22, 2025

Publication date: April 30, 2026

Applicant: Google LLC

Inventors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Brian Roark
Language-agnostic multilingual modeling using effective script normalization

Patent number: 12536989

Abstract: A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample.

Type: Grant

Filed: March 21, 2023

Date of Patent: January 27, 2026

Assignee: Google LLC

Inventors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Brian Roark
Supervised and Unsupervised Training with Contrastive Loss Over Sequences

Publication number: 20250166614

Abstract: A method includes receiving audio data corresponding to an utterance and generating a pair of positive audio data examples. Here, each positive audio data example includes a respective augmented copy of the received audio data. For each respective positive audio data example, the method includes generating a respective sequence of encoder outputs and projecting the respective sequence of encoder outputs for the positive data example into a contrastive loss space. The method also includes determining a L2 distance between each corresponding encoder output in the projected sequences of encoder outputs for the positive audio data examples and determining a per-utterance consistency loss by averaging the L2 distances. The method also includes generating corresponding speech recognition results for each respective positive audio data example. The method also includes updating parameters of the speech recognition model based on a respective supervised loss term and the per-utterance consistency loss.

Type: Application

Filed: January 22, 2025

Publication date: May 22, 2025

Applicant: Google LLC

Inventors: Andrew Rosenberg, Bhuvana Ramabhadran, Zhehuai Chen, Yuan Wang, Yu Zhang, Jesse Emond
Supervised and unsupervised training with contrastive loss over sequences

Patent number: 12230249

Abstract: A method includes receiving audio data corresponding to an utterance and generating a pair of positive audio data examples. Here, each positive audio data example includes a respective augmented copy of the received audio data. For each respective positive audio data example, the method includes generating a respective sequence of encoder outputs and projecting the respective sequence of encoder outputs for the positive data example into a contrastive loss space. The method also includes determining a L2 distance between each corresponding encoder output in the projected sequences of encoder outputs for the positive audio data examples and determining a per-utterance consistency loss by averaging the L2 distances. The method also includes generating corresponding speech recognition results for each respective positive audio data example. The method also includes updating parameters of the speech recognition model based on a respective supervised loss term and the per-utterance consistency loss.

Type: Grant

Filed: March 22, 2022

Date of Patent: February 18, 2025

Assignee: Google LLC

Inventors: Andrew Rosenberg, Bhuvana Ramabhadran, Zhehuai Chen, Yuan Wang, Yu Zhang, Jesse Emond
Language-agnostic Multilingual Modeling Using Effective Script Normalization

Publication number: 20230223009

Abstract: A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample.

Type: Application

Filed: March 21, 2023

Publication date: July 13, 2023

Applicant: Google LLC

Inventors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Brian Roark
Language-agnostic multilingual modeling using effective script normalization

Patent number: 11615779

Abstract: A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample.

Type: Grant

Filed: January 19, 2021

Date of Patent: March 28, 2023

Assignee: Google LLC

Inventors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Brian Roark
Supervised and Unsupervised Training with Contrastive Loss Over Sequences

Publication number: 20220310065

Abstract: A method includes receiving audio data corresponding to an utterance and generating a pair of positive audio data examples. Here, each positive audio data example includes a respective augmented copy of the received audio data. For each respective positive audio data example, the method includes generating a respective sequence of encoder outputs and projecting the respective sequence of encoder outputs for the positive data example into a contrastive loss space. The method also includes determining a L2 distance between each corresponding encoder output in the projected sequences of encoder outputs for the positive audio data examples and determining a per-utterance consistency loss by averaging the L2 distances. The method also includes generating corresponding speech recognition results for each respective positive audio data example. The method also includes updating parameters of the speech recognition model based on a respective supervised loss term and the per-utterance consistency loss.

Type: Application

Filed: March 22, 2022

Publication date: September 29, 2022

Applicant: Google LLC

Inventors: Andrew Rosenberg, Bhuvana Ramabhadran, Zhehuai Chen, Gary Wang, Yu Zhang, Jesse Emond
Transliteration for speech recognition training and scoring

Patent number: 11417322

Abstract: Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for transliteration for speech recognition training and scoring. In some implementations, language examples are accessed, some of which include words in a first script and words in one or more other scripts. At least portions of some of the language examples are transliterated to the first script to generate a training data set. A language model is generated based on occurrences of the different sequences of words in the training data set in the first script. The language model is used to perform speech recognition for an utterance.

Type: Grant

Filed: December 12, 2019

Date of Patent: August 16, 2022

Assignee: Google LLC

Inventors: Bhuvana Ramabhadran, Min Ma, Pedro J. Moreno Mengibar, Jesse Emond, Brian E. Roark
Language-agnostic Multilingual Modeling Using Effective Script Normalization

Publication number: 20210233510

Abstract: A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample.

Type: Application

Filed: January 19, 2021

Publication date: July 29, 2021

Applicant: Google LLC

Inventors: Arindrima Datta, Bhuvana Ramabhadran, Jesse Emond, Brian Roak
TRANSLITERATION FOR SPEECH RECOGNITION TRAINING AND SCORING

Publication number: 20200193977

Abstract: Methods, systems, and apparatus, including computer programs stored on a computer-readable storage medium, for transliteration for speech recognition training and scoring. In some implementations, language examples are accessed, some of which include words in a first script and words in one or more other scripts. At least portions of some of the language examples are transliterated to the first script to generate a training data set. A language model is generated based on occurrences of the different sequences of words in the training data set in the first script. The language model is used to perform speech recognition for an utterance.

Type: Application

Filed: December 12, 2019

Publication date: June 18, 2020

Inventors: Bhuvana Ramabhadran, Min Ma, Pedro J. Moreno Mengibar, Jesse Emond, Brian E. Roark