Patents by Inventor Katrin Kirchhoff

Katrin Kirchhoff has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

APPLICATION CREATION ENVIRONMENT USING MULTI-MODALITY INTERFACE OPTIONS

Publication number: 20260126962

Abstract: Systems and methods provide for a multi-modal development environment to receive inputs using a variety of different input modalities in different user interfaces (UIs). Multiple user interfaces may be linked within the development environment to maintain state information so that inputs provided to one UI are represented in the other UIs using an appropriate equivalent representation based on the UI modality. Users of the development environment may select a given UI for interaction based on a desired task and then see changes tracked and relayed through the different UIs to verify changes within the development environment. The UIs may also be contextually linked to permit the user to work between both UIs without losing the context due to the switch.

Type: Application

Filed: December 30, 2025

Publication date: May 7, 2026

Inventors: Daniele Bonadiman, Sailik Sengupta, James Gung, Arshit Gupta, John Baker, Yi-An Lai, Sebastien Jean, Saab Mansour, Santosh Kumar Ameti, Ruhaab Markas, Ganesh Kumar Gella, Katrin Kirchhoff
Joint ASR and speaker error correction post-processing

Patent number: 12614552

Abstract: Techniques for performing speaker error correction are described. In some examples, speaker error correction is a post-processing task aligned predicted words and predicted one or more speaker identities, wherein the post-processing at least includes: performing predicted speaker error correction on the aligned predicted words and predicted one or more speaker identities to generate a first corrected set of speaker error corrections of the one or more speaker identifiers.

Type: Grant

Filed: June 8, 2023

Date of Patent: April 28, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Sundararajan Srinivasan, Rohit Paturi, Katrin Kirchhoff, Marc Helbing, Sumit Kumar, Xiang Li
Reward weighting alignment of large language models

Patent number: 12602548

Abstract: This patent application relates to using framework parameters with a large language model to create beams based on a prompt. The beams can be evaluated using multiple criteria of a reward model that can be weighted for importance. The beams can be evaluated according to each of the one or more criteria and compared to determine which beams most closely align with the criteria. The beam that best aligns can be selected to generate a response.

Type: Grant

Filed: November 22, 2023

Date of Patent: April 14, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Sailik Sengupta, Daniele Bonadiman, Yi-An Lai, Arshit Gupta, Dan Roth, Katrin Kirchhoff, Saab Mansour, James Yipeng Huang
Constrained generative framework application development environment

Patent number: 12602209

Abstract: Systems and methods provide for modular, iterative framework to resolve inputs provided to an interaction environment. The input may be decomposed into different component parts and then relevant actions for each of the component parts may be predicted. An action may be selected and relevant parameters may be populated based on the input. If parameters remain unresolved, additional queries may be presented in order to resolve the remaining parameters. Multiple actions may be executed and then prepared to generate a combined response responsive to the input. Actions for a given interaction environment may be domain-specific and also may be developer-defined for a given goal or task to restrict one or more underlying language systems.

Type: Grant

Filed: September 29, 2023

Date of Patent: April 14, 2026

Assignee: Amazon Technologies, Inc.

Inventors: James Gung, Arshit Gupta, John Baker, Yi Zhang, Saab Mansour, Santosh Kumar Ameti, Ruhaab Markas, Ganesh Kumar Gella, Katrin Kirchhoff
Alignment model as decoding-time heuristic

Patent number: 12591748

Abstract: The application is directed to receiving an alignment model and a prompt to evaluate and generate a response using a large language model. The large language model populates tokens for potential responses based on framework parameters and evaluates the potential responses using an alignment model. The alignment model can be utilized or selected for a prompt at decoding-time and can be used to select a potential response to use to create an output.

Type: Grant

Filed: November 22, 2023

Date of Patent: March 31, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Sailik Sengupta, Daniele Bonadiman, Yi-An Lai, Arshit Gupta, Dan Roth, Katrin Kirchhoff, Saab Mansour, James Yipeng Huang
Multi-domain fine-tuning for contextual adapters of automatic speech recognition

Patent number: 12586567

Abstract: Multi-stage fine-tuning for customizing automatic speech recognition with contextual adapters is performed. Entities to customize automatic speech recognition are received and used in an automatic speech recognition pipeline with a fine-tuned version of a speech recognition machine learning model and a fine-tuned version of a contextual adapter. The speech recognition machine learning model and the contextual adapter are fine-tuned in stages that include an initial stage that fine-tunes the speech recognition machine learning model using training data that includes multiple domains and subsequent tuning stages that freeze the speech recognition machine learning model while tuning the contextual adapter and tune the speech recognition machine learning model again.

Type: Grant

Filed: June 30, 2023

Date of Patent: March 24, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Devang Kulshreshtha, Saket Dingliwal, Brady Houston, Sravan Babu Bodapati, Srikanth Ronanki, Jeffrey John Farris, Vivek Govindan, Katrin Kirchhoff
Augmenting automated speech recognition neural networks with scalable vocabularies

Patent number: 12562151

Abstract: Techniques for augmenting automated speech recognition neural networks with scalable vocabularies are described. A cluster is selected from a plurality of clusters of similar sounding words based on a score, the score representing a similarity between an embedding of the cluster and an audio embedding of an utterance generated with an automated speech recognition encoder neural network. A bias factor is calculated based on a similarity between an embedding of a word in the selected cluster and the audio embedding. The audio embedding of the utterance is biased by the bias factor.

Type: Grant

Filed: March 28, 2023

Date of Patent: February 24, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Devang Kulshreshtha, Saket Dingliwal, Sravan Babu Bodapati, Veera Raghavendra Elluru, Anubhav Mishra, Katrin Kirchhoff
Speaker diarization error correction

Patent number: 12555584

Abstract: Techniques for performing speaker error correction are described. In some examples, speaker error correction is a post-processing task to be performed on aligned predicted words and predicted one or more speaker identities to jointly perform speaker identities error correction and at least inverse text normalization, wherein the post-processing at least includes: predicting word and speaker contextual features from the aligned predicted words and predicted one or more speaker identities using an encoder, and predicting, from the word and speaker contextual features, inverse text normalization of the aligned predicted words and corrected speaker identities using a decoder.

Type: Grant

Filed: June 8, 2023

Date of Patent: February 17, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Sundararajan Srinivasan, Rohit Paturi, Katrin Kirchhoff, Marc Helbing, Sumit Kumar, Xiang Li
Application creation environment using multi- modality interface options

Patent number: 12547380

Abstract: Systems and methods provide for a multi-modal development environment to receive inputs using a variety of different input modalities in different user interfaces (UIs). Multiple user interfaces may be linked within the development environment to maintain state information so that inputs provided to one UI are represented in the other UIs using an appropriate equivalent representation based on the UI modality. Users of the development environment may select a given UI for interaction based on a desired task and then see changes tracked and relayed through the different UIs to verify changes within the development environment. The UIs may also be contextually linked to permit the user to work between both UIs without losing the context due to the switch.

Type: Grant

Filed: September 29, 2023

Date of Patent: February 10, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Daniele Bonadiman, Sailik Sengupta, James Gung, Arshit Gupta, John Baker, Yi-An Lai, Sebastien Jean, Saab Mansour, Santosh Kumar Ameti, Ruhaab Markas, Ganesh Kumar Gella, Katrin Kirchhoff
Medical conversation summarization style intelligence

Patent number: 12547841

Abstract: A medical audio summarization service receives a medical conversation and an indication of a user preferred summarization style selected from a plurality of available summarization styles to generate a medical summary that conforms to the user preferred summarization style. A transcript is generated via a medical audio transcription service, and the transcript is used by a natural language processing engine (including a large language model) to generate the medical summary. The large language model is trained to be used to generate medical summaries that conform to respective ones of a plurality of user preferred summarization styles. The large language model is trained using training data comprising previously generated summaries and summary interaction metadata generated from user edits and/or feedback.

Type: Grant

Filed: June 22, 2023

Date of Patent: February 10, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Aparna Elangovan, Lei Xu, Devang Kulshreshtha, Sravan Babu Bodapati, Katrin Kirchhoff, Sarthak Handa
Customizable latency for automatic speech recognition

Patent number: 12505827

Abstract: Techniques for customizable latency, from the customer's side, for automatic speech recognition (ASR) are described. In particular, the customer may specify a parameter that controls how fast or how slow the customer's media content will be streamed or processed. Slower processing means higher accuracy, with near real-time latency, while faster processing means lower accuracy, but offers much lower latency (e.g., less than 600 ms). Enabling tuning of the latency-versus-accuracy tradeoff of the ASR system offers customers the flexibility to meet varying needs for different ASR applications.

Type: Grant

Filed: March 31, 2023

Date of Patent: December 23, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Dhanush Bekal Kannangola, Goeric Sydney Huybrechts, Xilai Li, Srikanth Ronanki, Srikanth Vishnubhotla, Hadis Nosrati, Vivek Govindan, Jeffrey John Farris, Sravan Babu Bodapati, Katrin Kirchhoff
DYNAMIC LANGUAGE MODEL UPDATES WITH BOOSTING

Publication number: 20250342830

Abstract: Language models may be dynamically updated for trending entities based on tuning data for particular users. A user may provide specific tuning data associated with trending entities within a class to generate a weight map for a language model. A class based model may be trained using the weight map specific for the user for the trending entities. Additionally, weights may be further boosted using a boosting language model to emphasize the trending entities.

Type: Application

Filed: July 14, 2025

Publication date: November 6, 2025

Inventors: Sravan Babu Bodapati, Ashish Vishwanath Shenoy, Monica Lakshmi Sunkara, Varun Sembium Varadarajan, Katrin Kirchhoff
Dynamic language model updates with boosting

Patent number: 12400644

Abstract: Language models may be dynamically updated for trending entities based on tuning data for particular users. A user may provide specific tuning data associated with trending entities within a class to generate a weight map for a language model. A class based model may be trained using the weight map specific for the user for the trending entities. Additionally, weights may be further boosted using a boosting language model to emphasize the trending entities.

Type: Grant

Filed: June 24, 2021

Date of Patent: August 26, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Sravan Babu Bodapati, Ashish Vishwanath Shenoy, Monica Lakshmi Sunkara, Varun Sembium Varadarajan, Katrin Kirchhoff
Infusing knowledge graphs into automatic speech recognition

Patent number: 12400659

Abstract: Systems and methods are described for converting speech to text. In one aspect, a method for converting speech to text includes generating an initial transcript of an audio or video file. At least one named entity may be extracted from the initial transcript using entity recognition. A subset of nodes of a knowledge graph that include the at least one named entity may be selected, where the subset of nodes of the knowledge graph correspond to a set of named entities. The method may further include encoding the set of named entities to generate a set of entity embeddings. The speech in the audio or video file may then be decoded using the set of entity embeddings to produce a final transcript of the audio file.

Type: Grant

Filed: December 2, 2022

Date of Patent: August 26, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Monica Lakshmi Sunkara, Nilaksh Das, Sravan Babu Bodapati, Katrin Kirchhoff
Removing bias from automatic speech recognition models using internal language model estimates

Patent number: 12387718

Abstract: Bias may be removed from automatic speech recognition model predictions using internal language model estimates. Audio data may be received for speech recognition. The audio data may be processed both through an automatic speech recognition model to produce original word token predictions and masked in different portions of the audio data to produce other word token predictions for the masked audio. A comparison of the original word token predictions and the other word token predictions may provide an estimate of an internal language model for the automatic speech recognition model. This estimate can be used to modify the original word token predictions to remove the lexical bias and produce a speech prediction.

Type: Grant

Filed: May 3, 2023

Date of Patent: August 12, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Nilaksh Das, Monica Lakshmi Sunkara, Sravan Babu Bodapati, Jinglun Cai, Devang Kulshreshtha, Jeffrey John Farris, Nicholas G Aldridge, Srikanth Ronanki, Katrin Kirchhoff
DIALOG-DRIVEN APPLICATIONS SUPPORTING ALTERNATIVE VOCAL INPUT STYLES

Publication number: 20250111850

Abstract: A set of alternative vocal input styles for specifying a parameter of a dialog-driven application is determined. During execution of the application, an audio prompt requesting input in one of the styles is presented. A value of the parameter is determined by applying a collection of analysis tools to vocal input obtained after the prompt is presented. A task of the application is initiated using the value.

Type: Application

Filed: December 11, 2024

Publication date: April 3, 2025

Applicant: Amazon Technologies, Inc.

Inventors: John Baker, Anubhav Mishra, Bangrui Liu, Christopher Michael Hittner, Sravan Babu Bodapati, Harshal Pimpalkhute, Katrin Kirchhoff, Anuj Gautam Surana, Yilai Su, Brandon Louis Mendez, Chengshun Zhang
REDACTING PORTIONS OF TEXT TRANSCRIPTIONS GENERATED FROM INVERSE TEXT NORMALIZATION

Publication number: 20250086380

Abstract: Portions of text data generated from inverse text normalization may be redacted. Text data for redaction may be obtained. One or more inverse text normalization models may be applied to the text data to generate normalized text data. A machine learning model, trained to recognize text for redaction, may be applied to identify portions of the normalized text data for redaction. The identified portions may be redacted and the redacted normalized text provided to a destination.

Type: Application

Filed: November 22, 2024

Publication date: March 13, 2025

Applicant: Amazon Technologies, Inc.

Inventors: Monica Lakshmi Sunkara, Deepthi Devaiah Devanira, Chaitanya Shivade, Sravan Babu Bodapati, Katrin Kirchhoff, Srikanth Ronanki
Dynamically selectable automated speech recognition using a custom vocabulary

Patent number: 12250180

Abstract: Techniques for at least the generation of a chatbot built from a custom vocabulary and to use runtime hints during inference are described. In some examples, the generation of the chatbot includes receiving a request to build a chatbot using a bot definition and a custom vocabulary, wherein the chatbot is to use runtime hints during usage; building the chatbot from the bot definition and custom vocabulary by at least: generating automatic speech recognition (ASR) artifacts to be used in decoding audio input into the chatbot into text for at least one other component of the chatbot to use in determining a next act to be performed, the ASR artifacts including artifacts that use the custom vocabulary and artifacts that do not use the custom vocabulary, and storing the ASR artifacts.

Type: Grant

Filed: August 3, 2021

Date of Patent: March 11, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Sravan Babu Bodapati, Ashish Vishwanath Shenoy, Monica Lakshmi Sunkara, Katrin Kirchhoff, Anubhav Mishra, Harshal Pimpalkhute, John Baker, Ganesh Kumar Gella
Managing access to sensitive data in transcriptions

Patent number: 12223259

Abstract: Techniques for managing access to sensitive data in transcriptions are described. A method for managing access to sensitive data in transcriptions may include receiving a request to generate a redacted transcript of content, obtaining a transcript of the content, sending at least a portion of the transcript to a model endpoint to identify sensitive entities in the transcript, receiving an inference response identifying one or more sensitive entities in the transcript, and generating the redacted transcript based at least one the transcript and the inference response.

Type: Grant

Filed: September 30, 2019

Date of Patent: February 11, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Varun Sembium Varadarajan, Sravan Babu Bodapati, Deepthi Devaiah Devanira, Pu Paul Zhao, Katrin Kirchhoff, Yue Yang
DOMAIN SPECIALTY INSTRUCTION GENERATION FOR TEXT ANALYSIS TASKS

Publication number: 20250029603

Abstract: Domain specialty instructions may be generated for performing text analysis tasks. An input text may be received for performing a text analysis task. A domain specialty may be identified for the input text. Specialty domain identifiers may be inserted as part of generating instructions to perform the text analysis task using a pre-trained large language model fine-tuned to a domain that includes multiple domain specialties. The pre-trained large language model may perform the text analysis task on the input text using the generated instructions. A result of the text analysis tsk performed on the input text may be provided.

Type: Application

Filed: July 20, 2023

Publication date: January 23, 2025

Applicant: Amazon Technologies, Inc.

Inventors: Karthik Gopalakrishnan, Sravan Babu Bodapati, Katrin Kirchhoff, Sarthak Handa

1 2 next