Patents by Inventor Katrin Kirchhoff

Katrin Kirchhoff has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250111850
    Abstract: A set of alternative vocal input styles for specifying a parameter of a dialog-driven application is determined. During execution of the application, an audio prompt requesting input in one of the styles is presented. A value of the parameter is determined by applying a collection of analysis tools to vocal input obtained after the prompt is presented. A task of the application is initiated using the value.
    Type: Application
    Filed: December 11, 2024
    Publication date: April 3, 2025
    Applicant: Amazon Technologies, Inc.
    Inventors: John Baker, Anubhav Mishra, Bangrui Liu, Christopher Michael Hittner, Sravan Babu Bodapati, Harshal Pimpalkhute, Katrin Kirchhoff, Anuj Gautam Surana, Yilai Su, Brandon Louis Mendez, Chengshun Zhang
  • Publication number: 20250086380
    Abstract: Portions of text data generated from inverse text normalization may be redacted. Text data for redaction may be obtained. One or more inverse text normalization models may be applied to the text data to generate normalized text data. A machine learning model, trained to recognize text for redaction, may be applied to identify portions of the normalized text data for redaction. The identified portions may be redacted and the redacted normalized text provided to a destination.
    Type: Application
    Filed: November 22, 2024
    Publication date: March 13, 2025
    Applicant: Amazon Technologies, Inc.
    Inventors: Monica Lakshmi Sunkara, Deepthi Devaiah Devanira, Chaitanya Shivade, Sravan Babu Bodapati, Katrin Kirchhoff, Srikanth Ronanki
  • Patent number: 12250180
    Abstract: Techniques for at least the generation of a chatbot built from a custom vocabulary and to use runtime hints during inference are described. In some examples, the generation of the chatbot includes receiving a request to build a chatbot using a bot definition and a custom vocabulary, wherein the chatbot is to use runtime hints during usage; building the chatbot from the bot definition and custom vocabulary by at least: generating automatic speech recognition (ASR) artifacts to be used in decoding audio input into the chatbot into text for at least one other component of the chatbot to use in determining a next act to be performed, the ASR artifacts including artifacts that use the custom vocabulary and artifacts that do not use the custom vocabulary, and storing the ASR artifacts.
    Type: Grant
    Filed: August 3, 2021
    Date of Patent: March 11, 2025
    Assignee: Amazon Technologies, Inc.
    Inventors: Sravan Babu Bodapati, Ashish Vishwanath Shenoy, Monica Lakshmi Sunkara, Katrin Kirchhoff, Anubhav Mishra, Harshal Pimpalkhute, John Baker, Ganesh Kumar Gella
  • Patent number: 12223259
    Abstract: Techniques for managing access to sensitive data in transcriptions are described. A method for managing access to sensitive data in transcriptions may include receiving a request to generate a redacted transcript of content, obtaining a transcript of the content, sending at least a portion of the transcript to a model endpoint to identify sensitive entities in the transcript, receiving an inference response identifying one or more sensitive entities in the transcript, and generating the redacted transcript based at least one the transcript and the inference response.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: February 11, 2025
    Assignee: Amazon Technologies, Inc.
    Inventors: Varun Sembium Varadarajan, Sravan Babu Bodapati, Deepthi Devaiah Devanira, Pu Paul Zhao, Katrin Kirchhoff, Yue Yang
  • Publication number: 20250029603
    Abstract: Domain specialty instructions may be generated for performing text analysis tasks. An input text may be received for performing a text analysis task. A domain specialty may be identified for the input text. Specialty domain identifiers may be inserted as part of generating instructions to perform the text analysis task using a pre-trained large language model fine-tuned to a domain that includes multiple domain specialties. The pre-trained large language model may perform the text analysis task on the input text using the generated instructions. A result of the text analysis tsk performed on the input text may be provided.
    Type: Application
    Filed: July 20, 2023
    Publication date: January 23, 2025
    Applicant: Amazon Technologies, Inc.
    Inventors: Karthik Gopalakrishnan, Sravan Babu Bodapati, Katrin Kirchhoff, Sarthak Handa
  • Patent number: 12205584
    Abstract: A set of alternative vocal input styles for specifying a parameter of a dialog-driven application is determined. During execution of the application, an audio prompt requesting input in one of the styles is presented. A value of the parameter is determined by applying a collection of analysis tools to vocal input obtained after the prompt is presented. A task of the application is initiated using the value.
    Type: Grant
    Filed: November 22, 2021
    Date of Patent: January 21, 2025
    Assignee: Amazon Technologies, Inc.
    Inventors: John Baker, Anubhav Mishra, Bangrui Liu, Christopher Michael Hittner, Sravan Babu Bodapati, Harshal Pimpalkhute, Katrin Kirchhoff, Anuj Gautam Surana, Yilai Su, Brandon Louis Mendez, Chengshun Zhang
  • Patent number: 12198681
    Abstract: Techniques for personalized batch and streaming speech-to-text transcription of audio reduce the error rate of automatic speech recognition (ASR) systems in transcribing rare and out-of-vocabulary words. The techniques achieve personalization of connectionist temporal classification (CT) models by using adaptive boosting to perform biasing at the level of sub-words. In addition to boosting, the techniques encompass a phone alignment network to bias sub-word predictions towards rare long-tail words and out-of-vocabulary words. A technical benefit of the techniques is that the accuracy of speech-to-text transcription of rare and out-of-vocabulary words in a custom vocabulary by automatic speech recognition (ASR) system can be improved without having to train the ASR system on the custom vocabulary. Instead, the techniques allow the same ASR system trained on a base vocabulary to realize the accuracy improvements for different custom vocabularies spanning different domains.
    Type: Grant
    Filed: September 30, 2022
    Date of Patent: January 14, 2025
    Assignee: Amazon Technologies, Inc.
    Inventors: Monica Lakshmi Sunkara, Srikanth Ronanki, Sravan Babu Bodapati, Jeffrey John Farris, Katrin Kirchhoff, Vivek Govindan, Yide Zou, Mohit Narendra Gupta, Silviu Mihai Burz
  • Publication number: 20250005063
    Abstract: Pairs of text collections are obtained. An individual pair comprises (a) a source text collection which includes a first group of text sequences and (b) an annotated analysis result of the source text collection, comprising a second group of text sequences and a set of evidence mappings generated by an evidence mapping model. An evidence mapping indicates, for a particular text sequence of the second group, another text sequence of the first group which provides evidence for the particular text sequence. A quality metric of the model is obtained using an automated evaluation methodology in which a question is generated from the particular text sequence, and an analysis of a pair of answers (including 10 an answer generated using an evidence mapping) to the question is performed. The quality metric is provided via a programmatic interface.
    Type: Application
    Filed: June 29, 2023
    Publication date: January 2, 2025
    Applicant: Amazon Technologies, Inc.
    Inventors: Devang Kulshreshtha, Saket Dingliwal, Sravan Babu Bodapati, Katrin Kirchhoff, Sarthak Handa
  • Publication number: 20250005282
    Abstract: Domain specialty instructions may be generated for performing text analysis tasks. An input text may be received for performing a text analysis task. One or more domain entities may be extracted from the input text using a machine learning model trained to recognize entities of a domain in a given text. The one or more domain entities may be inserted as part of generating instructions to perform the text analysis task using a pre-trained machine learning model fine-tuned to the domain. The pre-trained machine learning model may be caused to perform the text analysis task using the generated instructions and a result of the text analysis task may be provided.
    Type: Application
    Filed: June 29, 2023
    Publication date: January 2, 2025
    Applicant: Amazon Technologies, Inc.
    Inventors: John Colton Moriarty, Saket Dingliwal, Karthik Gopalakrishnan, Sravan Babu Bodapati, Katrin Kirchhoff, Lei Xu
  • Publication number: 20250005298
    Abstract: Pairs of text collections are obtained. An individual pair comprises (a) a source text collection which includes a first group of text sequences and (b) an annotated analysis result of the source text collection, comprising a second group of text sequences and a set of evidence mappings generated by an evidence mapping model. An evidence mapping indicates, for a particular text sequence of the second group, another text sequence of the first group which provides evidence for the particular text sequence. A quality metric of the model is obtained using an automated evaluation methodology in which a question is generated from the particular text sequence, and an analysis of a pair of answers (including an answer generated using an evidence mapping) to the question is performed. The quality metric is provided via a programmatic interface.
    Type: Application
    Filed: June 29, 2023
    Publication date: January 2, 2025
    Applicant: Amazon Technologies, Inc.
    Inventors: Saket Dingliwal, Karthik Gopalakrishnan, Sravan Babu Bodapati, Sarthak Handa, Katrin Kirchhoff
  • Patent number: 12182498
    Abstract: Portions of text data generated from inverse text normalization may be redacted. Text data for redaction may be obtained. One or more inverse text normalization models may be applied to the text data to generate normalized text data. A machine learning model, trained to recognize text for redaction, may be applied to identify portions of the normalized text data for redaction. The identified portions may be redacted and the redacted normalized text provided to a destination.
    Type: Grant
    Filed: June 30, 2022
    Date of Patent: December 31, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Monica Lakshmi Sunkara, Deepthi Devaiah Devanira, Chaitanya Shivade, Sravan Babu Bodapati, Katrin Kirchhoff, Srikanth Ronanki
  • Publication number: 20240428002
    Abstract: A medical audio summarization service receives a medical conversation and an indication of a user preferred summarization style selected from a plurality of available summarization styles to generate a medical summary that conforms to the user preferred summarization style. A transcript is generated via a medical audio transcription service, and the transcript is used by a natural language processing engine (including a large language model) to generate the medical summary. The large language model is trained to be used to generate medical summaries that conform to respective ones of a plurality of user preferred summarization styles. The large language model is trained using training data comprising previously generated summaries and summary interaction metadata generated from user edits and/or feedback.
    Type: Application
    Filed: June 22, 2023
    Publication date: December 26, 2024
    Applicant: Amazon Technologies, Inc.
    Inventors: Aparna Elangovan, Lei Xu, Devang Kulshreshtha, Sravan Babu Bodapati, Katrin Kirchhoff, Sarthak Handa
  • Patent number: 12136413
    Abstract: Domain-specific parameters may be used for tuning speech processing. A pre-trained transformer-based language model may train domain-specific parameters using domain-specific unlabeled text data. This domain-specific parameters can then be appended to candidate texts produced by a speech model on received speech data and input to the transformer-based language model to score the candidate texts. The scores of the candidate texts determined using the pre-trained transformer-based language model can then be used to select a candidate text for further speech processing.
    Type: Grant
    Filed: March 31, 2022
    Date of Patent: November 5, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Saket Dingliwal, Sravan Babu Bodapati, Katrin Kirchhoff, Ankur Gandhe, Anubhav Mishra, John Baker, Ashish Vishwanath Shenoy, Ravi Teja Gadde
  • Patent number: 11710479
    Abstract: Techniques for implementing a chatbot that utilizes context embeddings are described. An exemplary method includes determining a next turn by: applying a language model to the utterance to determine a probability of a sequence of words, generating a context embedding for the utterance based at least on one or more of: a dialog act as defined by a chatbot definition of the chatbot, a topic vector identifying a domain of the chatbot, a previous chatbot response, and one or more slot options; performing neural language model rescoring using the determined probability of a sequence of words as a word embedding and the generated context embedding to predict an hypothesis; determining at least a name of a slot and type to be fulfilled based at least in part on the hypothesis and the chatbot definition; and determining a next turn based at least in part on the chatbot definition, any previous state, and the name of the slot and type to be fulfilled.
    Type: Grant
    Filed: March 31, 2021
    Date of Patent: July 25, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Ashish Vishwanath Shenoy, Sravan Babu Bodapati, Katrin Kirchhoff
  • Patent number: 11580965
    Abstract: Techniques for predicting punctuation and casing using multimodal fusion are described.
    Type: Grant
    Filed: July 24, 2020
    Date of Patent: February 14, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Monica Lakshmi Sunkara, Srikanth Ronanki, Dhanush Bekal Kannangola, Sravan Babu Bodapati, Katrin Kirchhoff
  • Patent number: 11580968
    Abstract: Techniques are described for a contextual natural language understanding (cNLU) framework that is able to incorporate contextual signals of variable history length to perform joint intent classification (IC) and slot labeling (SL) tasks. A user utterance provided by a user within a multi-turn chat dialog between the user and a conversational agent is received. The user utterance and contextual information associated with one or more previous turns of the multi-turn chat dialog is provided to a machine learning (ML) model. An intent classification and one or more slot labels for the user utterance are then obtained from the ML model. The cNLU framework described herein thus uses, in addition to a current utterance itself, various contextual signals as input to a model to generate IC and SL predictions for each utterance of a multi-turn chat dialog.
    Type: Grant
    Filed: June 27, 2019
    Date of Patent: February 14, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Arshit Gupta, Peng Zhang, Rashmi Gangadharaiah, Garima Lalwani, Roger Scott Jenke, Hassan Sawaf, Mona Diab, Katrin Kirchhoff, Adel A. Youssef, Kalpesh N. Sutaria
  • Patent number: 11562735
    Abstract: A spoken language understanding (SLU) system may include an automatic speech recognizer (ASR), an audio feature extractor, an optional synchronizer and a language understanding module. The ASR may produce a first set of input data representing transcripts of utterances. The audio feature extractor may produce a second set of input data representing audio features of the utterances, in particular, non-transcript specific characteristics of the speaker in one or more portions the utterances. The two sets of input data may be provided for the language understanding module to predict intents and slot labels for the utterances. The SLU system may use the optional synchronizer to align the two sets of input data before providing them to the language understanding module.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: January 24, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Arshit Gupta, Julian E. S. Salazar, Peng Zhang, Katrin Kirchhoff, Yi Zhang
  • Patent number: 11551695
    Abstract: A transcription service may receive a request from a developer to build a custom speech-to-text model for a specific domain of speech. The custom speech-to-text model for the specific domain may replace a general speech-to-text model or add to a set of one or more speech-to-text models available for transcribing speech. The transcription service may receive a training data and instructions representing tasks. The transcription service may determine respective schedules for executing the instructions based at least in part on dependencies between the tasks. The transcription service may execute the instructions according to the respective schedules to train a speech-to-text model for a specific domain using the training data set. The transcription service may deploy the trained speech-to-text model as part of a network-accessible service for an end user to convert audio in the specific domain into texts.
    Type: Grant
    Filed: May 13, 2020
    Date of Patent: January 10, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Vivek Govindan, Varun Sembium Varadarajan, Christian Egon Berkhoff Dossow, Himalay Mohanlal Joriwal, Sai Madhuri Bhavirisetty, Abhinav Kumar, Orestis Lykouropoulos, Akshay Nalwaya, Rahul Gupta, Sravan Babu Bodapati, Liangwei Guo, Julian E. S. Salazar, Yibin Wang, K P N V D S Siva Rama, Calvin Xuan Li, Mohit Narendra Gupta, Asem Rustum, Katrin Kirchhoff, Pu Zhao
  • Patent number: 11531846
    Abstract: Techniques for extending sensitive data tagging without reannotating training data are described. A method for extending sensitive data tagging without reannotating training data may include hosting a plurality of models at a model endpoint in a machine learning service, each model trained to identify a different sensitive data type in a transcript of content, adding a new model to the model endpoint, the new model trained to identify a new sensitive data entity in the transcript of content, identifying sensitive entities in the transcript by each of the plurality of models and the new model, merging inference responses generated by each of the plurality of models and the new model using at least one inference policy, and returning a merged inference response identifying a plurality of sensitive entities in the transcript.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: December 20, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Sravan Babu Bodapati, Rishita Rajal Anubhai, Pu Paul Zhao, Katrin Kirchhoff