Patents by Inventor Shankar Kumar

Shankar Kumar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ON-DEVICE GRAMMAR CHECKING

Publication number: 20260203497

Abstract: A computing device may receive inputted text and perform, using one or more neural networks, on-device grammar checking of a sequence of words in the inputted text, including determining, using the one or more neural networks, a grammatically correct version of the sequence of words and determining that the sequence of words does not match the grammatically correct version of the sequence of words. The computing device may, in response to determining that the sequence of words does not match the grammatically correct version of the sequence of words, output, for display at a display device, at least a portion of the grammatically correct version of the sequence of words as a suggested replacement for at least a sequence of the sequence of words in the inputted text.

Type: Application

Filed: January 7, 2026

Publication date: July 16, 2026

Inventors: Matthew Sharifi, Sebastian Millius, Qi Wang, Yunpeng Li, Shankar Kumar, Lukas Zilka, Simon Tong, Martin Sundermeyer
Federated knowledge distillation on an encoder of a global ASR model and/or an encoder of a client ASR model

Patent number: 12664978

Abstract: Information can be distilled from a global automatic speech recognition (ASR) model to a client ASR model. Many implementations include using an RNN-T model as the ASR model, where the global ASR model includes a global encoder, a joint network, a prediction network, and where the client ASR model includes a client encoder, the joint network, and the prediction network. Various implementations include using principal component analysis (PCA) while training the global ASR model to learn a mean vector and a set of principal components corresponding to the global ASR model. Additional or alternative implementations include training the client ASR model to generate one or more predicted coefficients of the global ASR model.

Type: Grant

Filed: December 9, 2022

Date of Patent: June 23, 2026

Assignee: GOOGLE LLC

Inventors: Ehsan Amid, Rajiv Mathews, Shankar Kumar, Jared Lichtarge, Mingqing Chen, Tien-Ju Yang, Yuxin Ding
Multi-output decoders for multi-task learning of ASR and auxiliary tasks

Patent number: 12651594

Abstract: A method includes receiving a training dataset that includes one or more spoken training utterances for training an automatic speech recognition (ASR) model. Each spoken training utterance in the training dataset paired with a corresponding transcription and a corresponding target sequence of auxiliary tokens. For each spoken training utterance, the method includes generating a speech recognition hypothesis for a corresponding spoken training utterance, determining a speech recognition loss based on the speech recognition hypothesis and the corresponding transcription, generating a predicted auxiliary token for the corresponding spoken training utterance, and determining an auxiliary task loss based on the predicted auxiliary token and the corresponding target sequence of auxiliary tokens. The method also includes the ASR model jointly on the speech recognition loss and the auxiliary task loss determined for each spoken training utterance.

Type: Grant

Filed: October 26, 2023

Date of Patent: June 9, 2026

Assignee: Google LLC

Inventors: Weiran Wang, Ding Zhao, Shaojin Ding, Hao Zhang, Shuo-yiin Chang, David Johannes Rybach, Tara N. Sainath, Yanzhang He, Ian McGraw, Shankar Kumar
Knowledge distillation with domain mismatch for speech recognition

Patent number: 12586569

Abstract: A method includes receiving distillation data including a plurality of out-of-domain training utterances. For each particular out-of-domain training utterance of the distillation data, the method includes generating a corresponding augmented out-of-domain training utterance, and generating, using a teacher ASR model trained on training data corresponding to a target domain, a pseudo-label corresponding to the corresponding augmented out-of-domain training utterance. The method also includes distilling a student ASR model from the teacher ASR model by training the student ASR model using the corresponding augmented out-of-domain training utterances paired with the corresponding pseudo-labels generated by the teacher ASR model.

Type: Grant

Filed: October 17, 2023

Date of Patent: March 24, 2026

Assignee: Google LLC

Inventors: Tien-Ju Yang, You-Chi Cheng, Shankar Kumar, Jared Lichtarge, Ehsan Amid, Yuxin Ding, Rajiv Mathews, Mingqing Chen
On-device grammar checking

Patent number: 12536371

Abstract: A computing device may receive inputted text and perform, using one or more neural networks, on-device grammar checking of a sequence of words in the inputted text, including determining, using the one or more neural networks, a grammatically correct version of the sequence of words and determining that the sequence of words does not match the grammatically correct version of the sequence of words. The computing device may, in response to determining that the sequence of words does not match the grammatically correct version of the sequence of words, output, for display at a display device, at least a portion of the grammatically correct version of the sequence of words as a suggested replacement for at least a sequence of the sequence of words in the inputted text.

Type: Grant

Filed: December 18, 2020

Date of Patent: January 27, 2026

Assignee: Google LLC

Inventors: Matthew Sharifi, Sebastian Millius, Qi Wang, Yunpeng Li, Shankar Kumar, Lukas Zilka, Simon Tong, Martin Sundermeyer
Hybrid federated learning of machine learning model(s)

Patent number: 12505382

Abstract: Implementations disclosed herein are directed to a hybrid federated learning (FL) technique that utilizes both federated averaging (FA) and federated distillation (FD) during a given round of FL of a given global machine learning (ML) model. Implementations may identify a population of client devices to participate in the given round of FL, determine a corresponding quantity of instances of client data available at each of the client devices that may be utilized during the given round of FL, and select different subsets of the client devices based on the corresponding quantity of instances of client data. Further, implementations may cause a first subset of the client devices to generate a corresponding FA update and a second subset of client devices to generate a corresponding FD update. Moreover, implementations may subsequently update the given global ML model based on the corresponding FA updates and the corresponding FD updates.

Type: Grant

Filed: December 5, 2022

Date of Patent: December 23, 2025

Assignee: GOOGLE LLC

Inventors: Ehsan Amid, Rajiv Mathews, Rohan Anil, Shankar Kumar, Jared Lichtarge
Lookup-table recurrent language model

Patent number: 12412566

Abstract: A computer-implemented method includes receiving audio data that corresponds to an utterance spoken by a user and captured by a user device. The method also includes processing the audio data to determine a candidate transcription that includes a sequence of tokens for the spoken utterance. Tor each token in the sequence of tokens, the method includes determining a token embedding for corresponding token, determining a n-gram token embedding for a previous sequence of n-gram tokens, and concatenating the token embedding and the n-gram token embedding to generate a concatenated output for the corresponding token. The method also includes rescoring the candidate transcription for the spoken utterance by processing the concatenated output generated for each corresponding token in the sequence of tokens.

Type: Grant

Filed: February 10, 2022

Date of Patent: September 9, 2025

Assignee: Google LLC

Inventors: Ronny Huang, Tara N. Sainath, Trevor Strohman, Shankar Kumar
Modeling ambiguity in neural machine translation

Patent number: 12393795

Abstract: The technology addresses ambiguity in neural machine translation. An encoder module receives a given text exemplar and generates an encoded representation of it. A decoder module receives the encoded representation and a set of translation prefixes. The decoder module outputs an unbounded function corresponding to a set of tokens associated with each pair of the given text exemplar and translation prefix from the set of translation prefixes. Each token is assigned a probability between 0 and 1 in a vocabulary of the exemplar at each time step. A logits module generates, based on the unbounded function, a corresponding bounded conditional probability for each token, wherein the probabilities are not normalized over the vocabulary at each time step. A loss function module having a positive loss component and a scaled negative loss component identifies whether each target text of a set of target texts is a valid translation of the exemplar.

Type: Grant

Filed: December 28, 2022

Date of Patent: August 19, 2025

Assignee: GOOGLE LLC

Inventors: Felix Stahlberg, Shankar Kumar
METHODS, APPARATUS, AND SYSTEMS FOR PROVIDING A CHAMPION PEER JOURNEY RECOMMENDATION AND A VALUE TRACKING SYSTEM TO A QUERYING USER

Publication number: 20240311384

Abstract: Systems, apparatus, and methods are provided that identify a querying user's top peers, recommends an optimal product interaction journey, and tracks the value delivered. Peers of the querying user are identified based on the common profile attributes, product interaction, and similar user initiatives. A champion peer is recommended by sorting the peers based on the cumulative quality of product interactions which is modeled by a learnable parametric equation. A metric is formulated by measuring the value delivered to the user by computing the number of product interactions aligning to the user's initiative. Most engaging product interactions (e.g., documents read, events attended) by the champion peer aligning to querying user's initiatives are identified and are recommended to the querying user as a product interaction journey.

Type: Application

Filed: March 16, 2023

Publication date: September 19, 2024

Inventors: Priya Goel, Shankar Kumar, Upender Phogat, Mukur Gupta, Satyashiba Mohanty
Semantic Segmentation With Language Models For Long-Form Automatic Speech Recognition

Publication number: 20240290320

Abstract: A joint segmenting and ASR model includes an encoder to receive a sequence of acoustic frames and generate, at each of a plurality of output steps, a higher order feature representation for a corresponding acoustic frame. The model also includes a decoder to generate based on the higher order feature representation at each of the plurality of output steps a probability distribution over possible speech recognition hypotheses, and an indication of whether the corresponding output step corresponds to an end of segment (EOS).

Type: Application

Filed: February 22, 2024

Publication date: August 29, 2024

Applicant: Google LLC

Inventors: Wenqian Huang, Hao Zhang, Shankar Kumar, Shuo-yiin Chang, Tara N. Sainath
Heterogeneous Federated Learning Via Multi-Directional Knowledge Distillation

Publication number: 20240249193

Abstract: Generally, the present disclosure is directed to enhanced federated learning (FL) that employs a set of clients with varying amounts of computational resources (e.g., system memory, storage, and processing bandwidth). To overcome limitations of conventional FL methods that employ a set of clients with varying amounts of computational resources, the embodiments run multi-directional knowledge distillation between the server models produced by each federated averaging (FedAvg) pool, using unlabeled server data as the distillation dataset. By co-distilling the two (or more) models frequently over the course of FedAvg rounds, information is shared between the pools without sharing model parameters. This leads to increased performance and faster convergence (in fewer federated rounds).

Type: Application

Filed: January 19, 2024

Publication date: July 25, 2024

Inventors: Jared Alexander Lichtarge, Rajiv Mathews, Rohan Anil, Ehsan Amid, Shankar Kumar
Knowledge Distillation with Domain Mismatch For Speech Recognition

Publication number: 20240233707

Abstract: A method includes receiving distillation data including a plurality of out-of-domain training utterances. For each particular out-of-domain training utterance of the distillation data, the method includes generating a corresponding augmented out-of-domain training utterance, and generating, using a teacher ASR model trained on training data corresponding to a target domain, a pseudo-label corresponding to the corresponding augmented out-of-domain training utterance. The method also includes distilling a student ASR model from the teacher ASR model by training the student ASR model using the corresponding augmented out-of-domain training utterances paired with the corresponding pseudo-labels generated by the teacher ASR model.

Type: Application

Filed: October 17, 2023

Publication date: July 11, 2024

Applicant: Google LLC

Inventors: Tien-Ju Yang, You-Chi Cheng, Shankar Kumar, Jared Lichtarge, Ehsan Amid, Yuxin Ding, Rajiv Mathews, Mingqing Chen
FEDERATED KNOWLEDGE DISTILLATION ON AN ENCODER OF A GLOBAL ASR MODEL AND/OR AN ENCODER OF A CLIENT ASR MODEL

Publication number: 20240194192

Abstract: Information can be distilled from a global automatic speech recognition (ASR) model to a client ASR model. Many implementations include using an RNN-T model as the ASR model, where the global ASR model includes a global encoder, a joint network, a prediction network, and where the client ASR model includes a client encoder, the joint network, and the prediction network. Various implementations include using principal component analysis (PCA) while training the global ASR model to learn a mean vector and a set of principal components corresponding to the global ASR model. Additional or alternative implementations include training the client ASR model to generate one or more predicted coefficients of the global ASR model.

Type: Application

Filed: December 9, 2022

Publication date: June 13, 2024

Inventors: Ehsan Amid, Rajiv Mathews, Shankar Kumar, Jared Lichtarge, Mingqing Chen, Tien-Ju Yang, Yuxin Ding
Multi-Output Decoders for Multi-Task Learning of ASR and Auxiliary Tasks

Publication number: 20240153495

Abstract: A method includes receiving a training dataset that includes one or more spoken training utterances for training an automatic speech recognition (ASR) model. Each spoken training utterance in the training dataset paired with a corresponding transcription and a corresponding target sequence of auxiliary tokens. For each spoken training utterance, the method includes generating a speech recognition hypothesis for a corresponding spoken training utterance, determining a speech recognition loss based on the speech recognition hypothesis and the corresponding transcription, generating a predicted auxiliary token for the corresponding spoken training utterance, and determining an auxiliary task loss based on the predicted auxiliary token and the corresponding target sequence of auxiliary tokens. The method also includes the ASR model jointly on the speech recognition loss and the auxiliary task loss determined for each spoken training utterance.

Type: Application

Filed: October 26, 2023

Publication date: May 9, 2024

Applicant: Google LLC

Inventors: Weiran Wang, Ding Zhao, Shaojin Ding, Hao Zhang, Shuo-yiin Chang, David Johannes Rybach, Tara N. Sainath, Yanzhang He, Ian McGraw, Shankar Kumar
Knowledge Distillation with Domain Mismatch For Speech Recognition

Publication number: 20240135918

Abstract: A method includes receiving distillation data including a plurality of out-of-domain training utterances. For each particular out-of-domain training utterance of the distillation data, the method includes generating a corresponding augmented out-of-domain training utterance, and generating, using a teacher ASR model trained on training data corresponding to a target domain, a pseudo-label corresponding to the corresponding augmented out-of-domain training utterance. The method also includes distilling a student ASR model from the teacher ASR model by training the student ASR model using the corresponding augmented out-of-domain training utterances paired with the corresponding pseudo-labels generated by the teacher ASR model.

Type: Application

Filed: October 16, 2023

Publication date: April 25, 2024

Applicant: Google LLC

Inventors: Tien-Ju Yang, You-Chi Cheng, Shankar Kumar, Jared Lichtarge, Ehsan Amid, Yuxin Ding, Rajiv Mathews, Mingqing Chen
HYBRID FEDERATED LEARNING OF MACHINE LEARNING MODEL(S)

Publication number: 20240070530

Abstract: Implementations disclosed herein are directed to a hybrid federated learning (FL) technique that utilizes both federated averaging (FA) and federated distillation (FD) during a given round of FL of a given global machine learning (ML) model. Implementations may identify a population of client devices to participate in the given round of FL, determine a corresponding quantity of instances of client data available at each of the client devices that may be utilized during the given round of FL, and select different subsets of the client devices based on the corresponding quantity of instances of client data. Further, implementations may cause a first subset of the client devices to generate a corresponding FA update and a second subset of client devices to generate a corresponding FD update. Moreover, implementations may subsequently update the given global ML model based on the corresponding FA updates and the corresponding FD updates.

Type: Application

Filed: December 5, 2022

Publication date: February 29, 2024

Inventors: Ehsan Amid, Rajiv Mathews, Rohan Anil, Shankar Kumar, Jared Lichtarge
ON-DEVICE GRAMMAR CHECKING

Publication number: 20230359818

Abstract: A computing device may receive inputted text and perform, using one or more neural networks, on-device grammar checking of a sequence of words in the inputted text, including determining, using the one or more neural networks, a grammatically correct version of the sequence of words and determining that the sequence of words does not match the grammatically correct version of the sequence of words. The computing device may, in response to determining that the sequence of words does not match the grammatically correct version of the sequence of words, output, for display at a display device, at least a portion of the grammatically correct version of the sequence of words as a suggested replacement for at least a sequence of the sequence of words in the inputted text.

Type: Application

Filed: December 18, 2020

Publication date: November 9, 2023

Inventors: Matthew Sharifi, Sebastian Millius, Qi Wang, Yunpeng Li, Shankar Kumar, Lukas Zilka, Simon Tong, Martin Sundermeyer
Modeling Ambiguity in Neural Machine Translation

Publication number: 20230351125

Abstract: The technology addresses ambiguity in neural machine translation. An encoder module receives a given text exemplar and generates an encoded representation of it. A decoder module receives the encoded representation and a set of translation prefixes. The decoder module outputs an unbounded function corresponding to a set of tokens associated with each pair of the given text exemplar and translation prefix from the set of translation prefixes. Each token is assigned a probability between 0 and 1 in a vocabulary of the exemplar at each time step. A logits module generates, based on the unbounded function, a corresponding bounded conditional probability for each token, wherein the probabilities are not normalized over the vocabulary at each time step. A loss function module having a positive loss component and a scaled negative loss component identifies whether each target text of a set of target texts is a valid translation of the exemplar.

Type: Application

Filed: December 28, 2022

Publication date: November 2, 2023

Inventors: Felix Stahlberg, Shankar Kumar
METHODS AND COMPOSITIONS FOR THE GENERATION AND USE OF HUMANIZED CONFORMATION-SPECIFIC PHOSPHORYLATED TAU ANTIBODIES

Publication number: 20230287093

Abstract: The present disclosure relates generally to conformation-specific antibodies that can bind to and neutralize the activity of phosphorylated-Threonine 231-tau protein (pT231-tau). The antibodies of the present technology are useful in methods for treating a neurological disorder associated with elevated cis-pT231-tau protein expression in a subject in need thereof.

Type: Application

Filed: January 17, 2023

Publication date: September 14, 2023

Applicant: Pinteon Therapeutics Inc.

Inventors: Shankar Kumar, Naoya Tsurushita, Michael Ahlijanian, Martin Jefson
Methods and compositions for the generation and use of humanized conformation-specific phosphorylated tau antibodies

Patent number: 11591385

Abstract: The present disclosure relates generally to conformation-specific antibodies that can bind to and neutralize the activity of phosphorylated-Threonine 231-tau protein (pT231-tau). The antibodies of the present technology are useful in methods for treating a neurological disorder associated with elevated cis-pT231-tau protein expression in a subject in need thereof.

Type: Grant

Filed: November 8, 2018

Date of Patent: February 28, 2023

Assignee: Pinteon Therapeutics Inc.

Inventors: Shankar Kumar, Naoya Tsurushita, Michael Ahlijanian, Martin Jefson

1 2 3 4 5 … next