Patents by Inventor Rakesh Iyer
Rakesh Iyer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240404308
Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.
Type: Application
Filed: May 22, 2024
Publication date: December 5, 2024
Inventors: Rakesh Iyer, Lisha Ruan
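To make the flow concrete, here is a minimal sketch of the steps the abstract describes: OCR text units yield annotated values, a language- and location-restricted search yields candidate labels, those collapse to a canonical label, and the canonical label maps to a user-facing action. Every name in it, such as AnnotatedValue, candidate_labels, and the ACTIONS table, is an illustrative assumption rather than anything specified by the application.

```python
# Hedged sketch of the abstract's flow: OCR text units -> annotated values ->
# candidate labels (searched by document language and value location) ->
# canonical label -> user-facing action. All names are illustrative,
# not taken from the patent application.

from dataclasses import dataclass

@dataclass
class AnnotatedValue:
    text: str          # e.g. "$42.00"
    location: tuple    # (x, y) position of the value on the page

# Illustrative mapping from canonical labels to actions shown to the user.
ACTIONS = {
    "due_date": "Create calendar reminder",
    "total_amount": "Start a payment",
    "phone_number": "Call this number",
}

def candidate_labels(value: AnnotatedValue, language: str) -> list[str]:
    """Stand-in for the search step: collect label strings near the value's
    location, restricted to the document's language."""
    # A real system would search nearby OCR tokens; here we just pretend.
    return ["Total", "Amount due"] if language == "en" else ["Montant"]

def canonical_label(candidates: list[str]) -> str:
    """Collapse candidate label strings onto one canonical label."""
    synonyms = {"Total": "total_amount", "Amount due": "total_amount",
                "Montant": "total_amount", "Due date": "due_date"}
    for c in candidates:
        if c in synonyms:
            return synonyms[c]
    return "unknown"

def action_for(value: AnnotatedValue, language: str) -> str | None:
    label = canonical_label(candidate_labels(value, language))
    return ACTIONS.get(label)

if __name__ == "__main__":
    value = AnnotatedValue(text="$42.00", location=(310, 512))
    print(action_for(value, "en"))  # -> "Start a payment"
```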
-
Patent number: 12125469
Abstract: A method for predicting parametric vocoder parameters includes receiving a text utterance having one or more words, each word having one or more syllables, and each syllable having one or more phonemes. The method also includes receiving, as input to a vocoder model, prosodic features that represent an intended prosody for the text utterance and a linguistic specification. The prosodic features include a duration, pitch contour, and energy contour for the text utterance, while the linguistic specification includes sentence-level linguistic features, word-level linguistic features for each word, syllable-level linguistic features for each syllable, and phoneme-level linguistic features for each phoneme. The method also includes predicting vocoder parameters based on the prosodic features and the linguistic specification.
Type: Grant
Filed: October 17, 2023
Date of Patent: October 22, 2024
Assignee: Google LLC
Inventors: Rakesh Iyer, Vincent Wan
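As a rough illustration of the inputs and outputs named in the abstract, the sketch below concatenates prosodic features (duration, pitch contour, energy contour) with sentence-, word-, syllable-, and phoneme-level linguistic features and maps them through a placeholder linear layer. The function predict_vocoder_params and the feature values are assumptions; the patent's actual vocoder model is not reproduced here.

```python
# Hedged sketch of the inputs/outputs the abstract describes: prosodic
# features plus sentence-, word-, syllable-, and phoneme-level linguistic
# features go in; a vector of vocoder parameters comes out. The single
# random linear layer is a placeholder, not the patented model.

import numpy as np

def predict_vocoder_params(duration, pitch_contour, energy_contour,
                           sentence_feats, word_feats, syllable_feats,
                           phoneme_feats, n_params: int = 13):
    """Concatenate prosodic features with the linguistic specification and
    map them to vocoder parameters with an untrained linear layer."""
    features = np.concatenate([
        np.atleast_1d(duration),
        pitch_contour, energy_contour,
        sentence_feats, word_feats, syllable_feats, phoneme_feats,
    ])
    rng = np.random.default_rng(0)
    weights = rng.standard_normal((n_params, features.size))
    return weights @ features  # illustrative vocoder parameter vector

params = predict_vocoder_params(
    duration=0.45,                                 # seconds
    pitch_contour=np.linspace(120.0, 180.0, 20),   # Hz over 20 frames
    energy_contour=np.linspace(0.2, 0.8, 20),
    sentence_feats=np.array([1.0, 0.0]),           # e.g. statement vs. question
    word_feats=np.array([0.3]),                    # e.g. word prominence
    syllable_feats=np.array([0.7]),                # e.g. syllable stress
    phoneme_feats=np.array([0.1, 0.9]),            # e.g. phoneme class features
)
print(params.shape)  # (13,)
```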
-
Publication number: 20240331681
Abstract: A computer generated voice can automatically be adapted to be similar to a user's voice. Various implementations include processing audio data capturing a first language spoken utterance to identify one or more pitch characteristics. For example, the one or more pitch characteristics can include an estimated frequency range of the given user's voice. Additionally or alternatively, the system can process the audio data capturing the first language spoken utterance and a set of candidate computer generated voices using a computer generated voice selection model to select a candidate computer generated voice. Various implementations can include automatically modifying the selected candidate computer generated voice based on the one or more pitch characteristics to change the frequency range of the modified computer generated voice based on the user's voice.
Type: Application
Filed: March 29, 2023
Publication date: October 3, 2024
Inventors: Rakesh Iyer, Jeffrey Robert Pitman, Pendar Yousefi, Te I, Tiruvilwamalai Raman
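A minimal sketch of the adaptation steps the abstract outlines, with made-up pitch values and voice definitions: estimate the user's frequency range from an F0 track, pick the closest candidate voice (a simple stand-in for the voice selection model), and shift that voice's range toward the user's. The Voice class and helper names are hypothetical.

```python
# Hedged sketch: estimate the user's pitch range, select the nearest
# candidate computer generated voice, then shift its frequency range
# toward the user's. Values and voices below are invented for illustration.

from dataclasses import dataclass

@dataclass
class Voice:
    name: str
    f0_low: float   # Hz
    f0_high: float  # Hz

def pitch_range(f0_track: list[float]) -> tuple[float, float]:
    """Estimated frequency range of the user's voice (voiced frames only)."""
    voiced = [f for f in f0_track if f > 0.0]
    return min(voiced), max(voiced)

def select_voice(user_range, candidates: list[Voice]) -> Voice:
    """Pick the candidate whose midpoint is closest to the user's midpoint
    (a stand-in for a learned voice selection model)."""
    mid = sum(user_range) / 2
    return min(candidates, key=lambda v: abs((v.f0_low + v.f0_high) / 2 - mid))

def adapt_voice(voice: Voice, user_range) -> Voice:
    """Shift the selected voice's frequency range toward the user's range."""
    shift = sum(user_range) / 2 - (voice.f0_low + voice.f0_high) / 2
    return Voice(voice.name + "-adapted", voice.f0_low + shift, voice.f0_high + shift)

user_f0 = [0.0, 110.0, 118.0, 0.0, 125.0, 131.0]   # Hz per frame, 0 = unvoiced
candidates = [Voice("alto", 160.0, 260.0), Voice("bass", 90.0, 160.0)]
chosen = select_voice(pitch_range(user_f0), candidates)
print(adapt_voice(chosen, pitch_range(user_f0)))
```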
-
Patent number: 12033412
Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.
Type: Grant
Filed: January 28, 2019
Date of Patent: July 9, 2024
Assignee: GOOGLE LLC
Inventors: Rakesh Iyer, Lisha Ruan
-
Publication number: 20240046915
Abstract: A method for predicting parametric vocoder parameters includes receiving a text utterance having one or more words, each word having one or more syllables, and each syllable having one or more phonemes. The method also includes receiving, as input to a vocoder model, prosodic features that represent an intended prosody for the text utterance and a linguistic specification. The prosodic features include a duration, pitch contour, and energy contour for the text utterance, while the linguistic specification includes sentence-level linguistic features, word-level linguistic features for each word, syllable-level linguistic features for each syllable, and phoneme-level linguistic features for each phoneme. The method also includes predicting vocoder parameters based on the prosodic features and the linguistic specification.
Type: Application
Filed: October 17, 2023
Publication date: February 8, 2024
Applicant: Google LLC
Inventors: Rakesh Iyer, Vincent Wan
-
Patent number: 11830474
Abstract: A method for predicting parametric vocoder parameters includes receiving a text utterance having one or more words, each word having one or more syllables, and each syllable having one or more phonemes. The method also includes receiving, as input to a vocoder model, prosodic features that represent an intended prosody for the text utterance and a linguistic specification. The prosodic features include a duration, pitch contour, and energy contour for the text utterance, while the linguistic specification includes sentence-level linguistic features, word-level linguistic features for each word, syllable-level linguistic features for each syllable, and phoneme-level linguistic features for each phoneme. The method also includes predicting vocoder parameters based on the prosodic features and the linguistic specification.
Type: Grant
Filed: January 6, 2022
Date of Patent: November 28, 2023
Assignee: Google LLC
Inventors: Rakesh Iyer, Vincent Wan
-
Publication number: 20230097338
Abstract: Systems and methods are provided for synthesizing speech based on received text and one or more emulated speech parameters. Text is received with one or more emulated speech parameters that indicate one or more features for the synthesized speech. Synthesized speech audio is generated based on the received parameters. The synthesized speech audio data is provided to an emulated microphone component that provides the synthesized audio to an automatic speech recognizer. The automatic speech recognizer utilizes one or more speech recognition models to generate converted text based on the synthesized speech audio data.
Type: Application
Filed: November 23, 2021
Publication date: March 30, 2023
Inventors: Nnamdi Kalu, Fernando Fernandes, Uri First, Erwin Jansen, Rakesh Iyer, Lingfeng Yang
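The sketch below mirrors the pipeline the abstract describes: text plus emulated speech parameters are synthesized to audio, the audio is routed through an emulated microphone component, and a speech recognizer converts it back to text. The SpeechParams, EmulatedMicrophone, synthesize, and recognize names are stub stand-ins, not the patented components.

```python
# Hedged sketch of the described loop: TTS with emulated speech parameters ->
# emulated microphone -> ASR -> converted text. The TTS/ASR stubs below are
# placeholders for real engines.

from dataclasses import dataclass

@dataclass
class SpeechParams:
    speaking_rate: float = 1.0   # emulated speech parameter: tempo
    pitch_shift: float = 0.0     # emulated speech parameter: pitch offset

def synthesize(text: str, params: SpeechParams) -> bytes:
    """Stand-in text-to-speech engine returning raw audio bytes."""
    return f"<audio rate={params.speaking_rate}>{text}".encode()

class EmulatedMicrophone:
    """Feeds synthesized audio to whatever normally consumes microphone input."""
    def __init__(self):
        self.buffer: list[bytes] = []
    def inject(self, audio: bytes) -> None:
        self.buffer.append(audio)
    def read(self) -> bytes:
        return self.buffer.pop(0)

def recognize(audio: bytes) -> str:
    """Stand-in speech recognizer that 'converts' the audio back to text."""
    return audio.decode().split(">", 1)[1]

mic = EmulatedMicrophone()
mic.inject(synthesize("navigate home", SpeechParams(speaking_rate=1.2)))
converted = recognize(mic.read())
assert converted == "navigate home"   # the recognizer's converted text
```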
-
Publication number: 20230065823
Abstract: According to an aspect, a method includes storing messages exchanged on a messaging platform in a non-relational database, obtaining a database snapshot of the non-relational database, executing a database task on the database snapshot, and generating, in response to the database task, an update log, where the update log identifies a first record to be changed or deleted in the non-relational database. The method includes determining whether or not the first record identified in the update log has been updated in the non-relational database after a time instance associated with the database task and applying the change or deletion of the first record in the non-relational database in response to the first record being determined as not updated after the time instance associated with the database task.
Type: Application
Filed: August 24, 2021
Publication date: March 2, 2023
Inventor: Rakesh Iyer
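A small sketch of the conditional-apply logic the abstract describes, assuming an in-memory dictionary stands in for the non-relational database and that each record carries an updated_at timestamp: changes or deletions from the update log are applied only to records that have not been updated after the task's time instance. The record layout and field names are assumptions.

```python
# Hedged sketch: a task runs over a snapshot, emits an update log naming
# records to change or delete, and each entry is applied only if the live
# record has not been updated since the task's time instance.

import time

# "Non-relational database" stand-in: record id -> {"body": ..., "updated_at": ...}
live_db = {
    "msg-1": {"body": "hello", "updated_at": 100.0},
    "msg-2": {"body": "spam", "updated_at": 100.0},
}

snapshot = {k: dict(v) for k, v in live_db.items()}   # what the database task sees
task_time = 200.0                                      # time instance of the task

# Update log produced by the database task over the snapshot (hardcoded here).
update_log = [("delete", "msg-2"), ("change", "msg-1", {"body": "hello, world"})]

# A user edits msg-1 after the task's time instance, so that change must be skipped.
live_db["msg-1"].update(body="hello again", updated_at=250.0)

for entry in update_log:
    op, record_id = entry[0], entry[1]
    record = live_db.get(record_id)
    if record is None or record["updated_at"] > task_time:
        continue                      # updated after the task: do not apply
    if op == "delete":
        del live_db[record_id]
    else:
        record.update(entry[2])
        record["updated_at"] = time.time()

print(live_db)   # msg-2 deleted, msg-1 kept with the user's newer edit
```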
-
Publication number: 20220130371
Abstract: A method for predicting parametric vocoder parameters includes receiving a text utterance having one or more words, each word having one or more syllables, and each syllable having one or more phonemes. The method also includes receiving, as input to a vocoder model, prosodic features that represent an intended prosody for the text utterance and a linguistic specification. The prosodic features include a duration, pitch contour, and energy contour for the text utterance, while the linguistic specification includes sentence-level linguistic features, word-level linguistic features for each word, syllable-level linguistic features for each syllable, and phoneme-level linguistic features for each phoneme. The method also includes predicting vocoder parameters based on the prosodic features and the linguistic specification.
Type: Application
Filed: January 6, 2022
Publication date: April 28, 2022
Applicant: Google LLC
Inventors: Rakesh Iyer, Vincent Wan
-
Patent number: 11232780
Abstract: A method for predicting parametric vocoder parameters includes receiving a text utterance having one or more words, each word having one or more syllables, and each syllable having one or more phonemes. The method also includes receiving, as input to a vocoder model, prosodic features that represent an intended prosody for the text utterance and a linguistic specification. The prosodic features include a duration, pitch contour, and energy contour for the text utterance, while the linguistic specification includes sentence-level linguistic features, word-level linguistic features for each word, syllable-level linguistic features for each syllable, and phoneme-level linguistic features for each phoneme. The method also includes predicting vocoder parameters based on the prosodic features and the linguistic specification.
Type: Grant
Filed: September 26, 2020
Date of Patent: January 25, 2022
Assignee: Google LLC
Inventors: Rakesh Iyer, Vincent Wan
-
Publication number: 20210406451
Abstract: Systems and methods for extracting information from documents are provided. In one example embodiment, a computer-implemented method includes obtaining one or more units of text from an image of a document. The method includes determining one or more annotated values from the one or more units of text and determining a set of candidate labels for each annotated value. The method determines each set of candidate labels by performing a search for the candidate labels based at least in part on a language associated with the document and a location of each annotated value. The method includes determining a canonical label for each annotated value based at least in part on the associated candidate labels, and mapping at least one annotated value to an action that is presented to a user based at least in part on the canonical label associated with the annotated value.
Type: Application
Filed: January 28, 2019
Publication date: December 30, 2021
Inventors: Rakesh Iyer, Lisha Ruan