Patents by Inventor Abhinav Sethy

Abhinav Sethy has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

USER-SYSTEM DIALOG EXPANSION

Publication number: 20240347050

Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.

Type: Application

Filed: May 14, 2024

Publication date: October 17, 2024

Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
VISUAL RESPONSES TO USER INPUTS

Publication number: 20240321261

Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determining a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.

Type: Application

Filed: May 22, 2024

Publication date: September 26, 2024

Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
Visual responses to user inputs

Patent number: 11996081

Abstract: Techniques for generating a visual response to a user input are described. A system may receive a natural language input and use a machine learning model to determine a first component is to determine a response to the natural language input while a second component is to determine supplemental content related to the natural language input. The system may receive, from the first component, first image data corresponding to the response. The system may also receive, from the second component, second image data corresponding to the supplemental content. The system may send, to a display, a command to present the first image data and the second image data.

Type: Grant

Filed: May 26, 2023

Date of Patent: May 28, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
User-system dialog expansion

Patent number: 11990122

Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.

Type: Grant

Filed: December 7, 2022

Date of Patent: May 21, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
VISUAL RESPONSES TO USER INPUTS

Publication number: 20240029708

Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determining a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.

Type: Application

Filed: May 26, 2023

Publication date: January 25, 2024

Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
Learning how to rewrite user-specific input for natural language understanding

Patent number: 11862149

Abstract: Techniques for decreasing (or eliminating) the possibility of a skill performing an action that is not responsive to a corresponding user input are described. A system may train one or more machine learning models with respect to user inputs, which resulted in incorrect actions being performed by skills, and corresponding user inputs, which resulted in the correct action being performed. The system may use the trained machine learning model(s) to rewrite user inputs that, if not rewritten, may result in incorrect actions being performed. The system may implement the trained machine learning model(s) with respect to ASR output text data to determine if the ASR output text data corresponds (or substantially corresponds) to previous ASR output text data that resulted in an incorrect action being performed.

Type: Grant

Filed: September 2, 2021

Date of Patent: January 2, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Bigyan Rajbhandari, Praveen Kumar Bodigutla, Zhenxiang Zhou, Karen Catelyn Stabile, Chenlei Guo, Abhinav Sethy, Alireza Roshan Ghias, Pragaash Ponnusamy, Kevin Quinn
Visual responses to user inputs

Patent number: 11705108

Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determining a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.

Type: Grant

Filed: December 10, 2021

Date of Patent: July 18, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
USER-SYSTEM DIALOG EXPANSION

Publication number: 20230215425

Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.

Type: Application

Filed: December 7, 2022

Publication date: July 6, 2023

Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
User-system dialog expansion

Patent number: 11527237

Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.

Type: Grant

Filed: September 18, 2020

Date of Patent: December 13, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
LEARNING HOW TO REWRITE USER-SPECIFIC INPUT FOR NATURAL LANGUAGE UNDERSTANDING

Publication number: 20220059086

Abstract: Techniques for decreasing (or eliminating) the possibility of a skill performing an action that is not responsive to a corresponding user input are described. A system may train one or more machine learning models with respect to user inputs, which resulted in incorrect actions being performed by skills, and corresponding user inputs, which resulted in the correct action being performed. The system may use the trained machine learning model(s) to rewrite user inputs that, if not rewritten, may result in incorrect actions being performed. The system may implement the trained machine learning model(s) with respect to ASR output text data to determine if the ASR output text data corresponds (or substantially corresponds) to previous ASR output text data that resulted in an incorrect action being performed.

Type: Application

Filed: September 2, 2021

Publication date: February 24, 2022

Inventors: Bigyan Rajbhandari, Praveen Kumar Bodigutla, Zhenxiang Zhou, Karen Catelyn Stabile, Chenlei Guo, Abhinav Sethy, Alireza Roshan Ghias, Pragaash Ponnusamy, Kevin Quinn
Learning how to rewrite user-specific input for natural language understanding

Patent number: 11151986

Abstract: Techniques for decreasing (or eliminating) the possibility of a skill performing an action that is not responsive to a corresponding user input are described. A system may train one or more machine learning models with respect to user inputs, which resulted in incorrect actions being performed by skills, and corresponding user inputs, which resulted in the correct action being performed. The system may use the trained machine learning model(s) to rewrite user inputs that, if not rewritten, may result in incorrect actions being performed. The system may implement the trained machine learning model(s) with respect to ASR output text data to determine if the ASR output text data corresponds (or substantially corresponds) to previous ASR output text data that resulted in an incorrect action being performed.

Type: Grant

Filed: September 21, 2018

Date of Patent: October 19, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Bigyan Rajbhandari, Praveen Kumar Bodigutla, Zhenxiang Zhou, Karen Catelyn Stabile, Chenlei Guo, Abhinav Sethy, Alireza Roshan Ghias, Pragaash Ponnusamy, Kevin Quinn
Symbol sequence estimation in speech

Patent number: 11145308

Abstract: Symbol sequences are estimated using a computer-implemented method including detecting one or more candidates of a target symbol sequence from a speech-to-text data, extracting a related portion of each candidate from the speech-to-text data, detecting repetition of at least a partial sequence of each candidate within the related portion of the corresponding candidate, labeling the detected repetition with a repetition indication, and estimating whether each candidate is the target symbol sequence, using the corresponding related portion including the repetition indication of each of the candidates.

Type: Grant

Filed: September 20, 2019

Date of Patent: October 12, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Kenneth W. Church, Gakuto Kurata, Bhuvana Ramabhadran, Abhinav Sethy, Masayuki Suzuki, Ryuki Tachibana
Combining installed audio-visual sensors with ad-hoc mobile audio-visual sensors for smart meeting rooms

Patent number: 11019306

Abstract: A method of combining data streams from fixed audio-visual sensors with data streams from personal mobile devices including, forming a communication link with at least one of one or more personal mobile devices; receiving at least one of an audio data stream and/or a video data stream from the at least one of the one or more personal mobile devices; determining the quality of the at least one of the audio data stream and/or the video data stream, wherein the audio data stream and/or the video data stream having a quality above a threshold quality is retained; and combining the retained audio data stream and/or the video data stream with the data streams from the fixed audio-visual sensors.

Type: Grant

Filed: January 9, 2019

Date of Patent: May 25, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Stanley Chen, Kenneth W. Church, Vaibhava Goel, Lidia L. Mangu, Etienne Marcheret, Bhuvana Ramabhadran, Laurence P. Sansone, Abhinav Sethy, Samuel Thomas
Implementing a whole sentence recurrent neural network language model for natural language processing

Patent number: 10692488

Abstract: A computer selects a test set of sentences from among sentences applied to train a whole sentence recurrent neural network language model to estimate the probability of likelihood of each whole sentence processed by natural language processing being correct. The computer generates imposter sentences from among the test set of sentences by substituting one word in each sentence of the test set of sentences. The computer generates, through the whole sentence recurrent neural network language model, a first score for each sentence of the test set of sentences and at least one additional score for each of the imposter sentences. The computer evaluates an accuracy of the natural language processing system in performing sequential classification tasks based on an accuracy value of the first score in reflecting a correct sentence and the at least one additional score in reflecting an incorrect sentence.

Type: Grant

Filed: August 23, 2019

Date of Patent: June 23, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Yinghui Huang, Abhinav Sethy, Kartik Audhkhasi, Bhuvana Ramabhadran
SYMBOL SEQUENCE ESTIMATION IN SPEECH

Publication number: 20200013408

Abstract: Symbol sequences are estimated using a computer-implemented method including detecting one or more candidates of a target symbol sequence from a speech-to-text data, extracting a related portion of each candidate from the speech-to-text data, detecting repetition of at least a partial sequence of each candidate within the related portion of the corresponding candidate, labeling the detected repetition with a repetition indication, and estimating whether each candidate is the target symbol sequence, using the corresponding related portion including the repetition indication of each of the candidates.

Type: Application

Filed: September 20, 2019

Publication date: January 9, 2020

Inventors: Kenneth W. Church, Gakuto Kurata, Bhuvana Ramabhadran, Abhinav Sethy, Masayuki Suzuki, Ryuki Tachibana
IMPLEMENTING A WHOLE SENTENCE RECURRENT NEURAL NETWORK LANGUAGE MODEL FOR NATURAL LANGUAGE PROCESSING

Publication number: 20200013393

Abstract: A computer selects a test set of sentences from among sentences applied to train a whole sentence recurrent neural network language model to estimate the probability of likelihood of each whole sentence processed by natural language processing being correct. The computer generates imposter sentences from among the test set of sentences by substituting one word in each sentence of the test set of sentences. The computer generates, through the whole sentence recurrent neural network language model, a first score for each sentence of the test set of sentences and at least one additional score for each of the imposter sentences. The computer evaluates an accuracy of the natural language processing system in performing sequential classification tasks based on an accuracy value of the first score in reflecting a correct sentence and the at least one additional score in reflecting an incorrect sentence.

Type: Application

Filed: August 23, 2019

Publication date: January 9, 2020

Inventors: YINGHUI HUANG, Abhinav Sethy, Kartik Audhkhasi, Bhuvana Ramabhadran
Symbol sequence estimation in speech

Patent number: 10529337

Abstract: Symbol sequences are estimated using a computer-implemented method including detecting one or more candidates of a target symbol sequence from a speech-to-text data, extracting a related portion of each candidate from the speech-to-text data, detecting repetition of at least a partial sequence of each candidate within the related portion of the corresponding candidate, labeling the detected repetition with a repetition indication, and estimating whether each candidate is the target symbol sequence, using the corresponding related portion including the repetition indication of each of the candidates.

Type: Grant

Filed: January 7, 2019

Date of Patent: January 7, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Kenneth W. Church, Gakuto Kurata, Bhuvana Ramabhadran, Abhinav Sethy, Masayuki Suzuki, Ryuki Tachibana
IMPLEMENTING A WHOLE SENTENCE RECURRENT NEURAL NETWORK LANGUAGE MODEL FOR NATURAL LANGUAGE PROCESSING

Publication number: 20190318732

Abstract: A whole sentence recurrent neural network (RNN) language model (LM) is provided for for estimating a probability of likelihood of each whole sentence processed by natural language processing being correct. A noise contrastive estimation sampler is applied against at least one entire sentence from a corpus of multiple sentences to generate at least one incorrect sentence. The whole sentence RNN LN is trained, using the at least one entire sentence from the corpus and the at least one incorrect sentence, to distinguish the at least one entire sentence as correct. The whole sentence recurrent neural network language model is applied to estimate the probability of likelihood of each whole sentence processed by natural language processing being correct.

Type: Application

Filed: April 16, 2018

Publication date: October 17, 2019

Inventors: Yinghui Huang, Abhinav Sethy, Kartik Audhkhasi, Bhuvana Ramabhadran
Implementing a whole sentence recurrent neural network language model for natural language processing

Patent number: 10431210

Abstract: A whole sentence recurrent neural network (RNN) language model (LM) is provided for for estimating a probability of likelihood of each whole sentence processed by natural language processing being correct. A noise contrastive estimation sampler is applied against at least one entire sentence from a corpus of multiple sentences to generate at least one incorrect sentence. The whole sentence RNN LN is trained, using the at least one entire sentence from the corpus and the at least one incorrect sentence, to distinguish the at least one entire sentence as correct. The whole sentence recurrent neural network language model is applied to estimate the probability of likelihood of each whole sentence processed by natural language processing being correct.

Type: Grant

Filed: April 16, 2018

Date of Patent: October 1, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Yinghui Huang, Abhinav Sethy, Kartik Audhkhasi, Bhuvana Ramabhadran
SYMBOL SEQUENCE ESTIMATION IN SPEECH

Publication number: 20190139550

Abstract: Symbol sequences are estimated using a computer-implemented method including detecting one or more candidates of a target symbol sequence from a speech-to-text data, extracting a related portion of each candidate from the speech-to-text data, detecting repetition of at least a partial sequence of each candidate within the related portion of the corresponding candidate, labeling the detected repetition with a repetition indication, and estimating whether each candidate is the target symbol sequence, using the corresponding related portion including the repetition indication of each of the candidates.

Type: Application

Filed: January 7, 2019

Publication date: May 9, 2019

Inventors: Kenneth W. Church, Gakuto Kurata, Bhuvana Ramabhadran, Abhinav Sethy, Masayuki Suzuki, Ryuki Tachibana

1 2 3 next