Patents by Inventor Ruhi Sarikaya

Ruhi Sarikaya has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

PROACTIVE COMMAND FRAMEWORK

Publication number: 20240153505

Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.

Type: Application

Filed: October 19, 2023

Publication date: May 9, 2024

Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
DATA DRIVEN DIALOG MANAGEMENT

Publication number: 20240153489

Abstract: Techniques for optimizing a system to improve an overall user satisfaction in a speech controlled system are described. A user speaks an utterance and the system compares an expected sum of user satisfaction values for each action to make a decision as to how best to process the utterance. As a result, the system may make a decision that decreases user satisfaction in the short term but increases user satisfaction in the long term. The system may estimate a user satisfaction value and associate the estimated user satisfaction value with a current dialog state. By tracking user satisfaction values over time, the system may train machine learning models to optimize the expected sum of user satisfaction values. This improves how the system selects an action or application to which to dispatch the dialog state and how a specific application selects an action or intent corresponding to the command.

Type: Application

Filed: January 17, 2024

Publication date: May 9, 2024

Inventors: Alborz Geramifard, Shiladitya Roy, Ruhi Sarikaya
Machine learning label generation

Patent number: 11929070

Abstract: Techniques for performing centralized unsupervised learning in a multi-domain system are described. A user may request labeled data for an ML task, where the request includes a prompt for obtaining relevant explicit user feedback. The system may use the prompt to collect explicit user feedback for relevant runtime user inputs. After a duration of time (in the user's request for labeled data) has elapsed, the system determines whether collected user feedback indicates processing of the user input was defective and, if so, determines a cause of the defective processing. The system then uses one or more label generators to generate labeled data using the collected user feedback, whether the processing was defective, and the determined defect cause.

Type: Grant

Filed: August 30, 2021

Date of Patent: March 12, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Ruhi Sarikaya, Zheng Du, Xiaohu Liu, Kai Liu, Sriharsha Venkata Chintalapati, Chenlei Guo, Hung Tuan Pham, Joe Pemberton, Zhenyu Yao, Bigyan Rajbhandari
VISUAL RESPONSES TO USER INPUTS

Publication number: 20240029708

Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determine a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.

Type: Application

Filed: May 26, 2023

Publication date: January 25, 2024

Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
Proactive command framework

Patent number: 11823678

Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.

Type: Grant

Filed: February 28, 2022

Date of Patent: November 21, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
Personalization of experiences with digital assistants in communal settings through voice and query processing

Patent number: 11810576

Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for providing personalized experiences to a computing device based on user input such as voice, text and gesture input are provided. Acoustic patterns associated with voice input, speech patterns, language patterns and natural language processing may be used to identify a specific user providing input from a plurality of users, identify user background characteristics and traits for the specific user, and topically categorize user input in a tiered hierarchical index. Topically categorized user input may be supplemented with user data and world knowledge and personalized responses and feedback for an identified specific user may be provided reactively and proactively.

Type: Grant

Filed: October 6, 2020

Date of Patent: November 7, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventor: Ruhi Sarikaya
Visual responses to user inputs

Patent number: 11705108

Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determine a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.

Type: Grant

Filed: December 10, 2021

Date of Patent: July 18, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
USER-SYSTEM DIALOG EXPANSION

Publication number: 20230215425

Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.

Type: Application

Filed: December 7, 2022

Publication date: July 6, 2023

Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
POLICY AUTHORING FOR TASK STATE TRACKING DURING DIALOGUE

Publication number: 20230142892

Abstract: Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.

Type: Application

Filed: January 6, 2023

Publication date: May 11, 2023

Applicant: Microsoft Technology Licensing, LLC

Inventors: Paul Crook, Vasiliy Radostev, Omar Zia Khan, Vipul Agarwal, Ruhi Sarikaya, Marius Alexandru Marin, Alexandre Rochette, Jean-Philippe Robichaud
Policy authoring for task state tracking during dialogue

Patent number: 11574635

Abstract: Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.

Type: Grant

Filed: December 20, 2019

Date of Patent: February 7, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Paul Crook, Vasiliy Radostev, Omar Zia Khan, Vipul Agarwal, Ruhi Sarikaya, Marius Alexandru Marin, Alexandre Rochette, Jean-Philippe Robichaud
Skill shortlister for natural language processing

Patent number: 11568863

Abstract: Devices and techniques are generally described for application determination in speech processing. Input data corresponding to a spoken utterance may be received. Speech recognition processing may be performed on the input data to generate text data. A machine learning encoder may generate a vector representation of the input data. A first binary classifier may determine a first probability that the input data corresponds to a first speech-processing application. A second binary classifier may determine a second probability that the input data corresponds to a second speech-processing application. A selection between the first speech-processing application and the second speech-processing application may be made based at least in part on the first probability and the second probability.

Type: Grant

Filed: March 23, 2018

Date of Patent: January 31, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Ruhi Sarikaya, Dongchan Kim, Young-Bum Kim
Natural language processing routing

Patent number: 11551681

Abstract: Devices and techniques are generally described for a speech processing routing architecture. In various examples, first data comprising a first feature definition is received. The first feature definition may include a first indication of first source data and first instructions for generating feature data using the first source data. In various examples, the feature data may be generated according to the first feature definition. In some examples, a speech processing system may receive a first request to process a first utterance. The feature data may be retrieved from a non-transitory computer-readable memory. The speech processing system may determine a first skill for processing the first utterance based at least in part on the feature data.

Type: Grant

Filed: December 13, 2019

Date of Patent: January 10, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Rajesh Kumar Pandey, Ruhi Sarikaya, Shubham Katiyar, Arun Kumar Thenappan, Isaac Joseph Madwed, Jihwan Lee, David Thomas, Julia Kennedy Nemer, Mohamed Farouk AbdelHady, Joe Pemberton, Young-Bum Kim, Arima Vu Ram Thayumanavar, Wangyao Ge
User-system dialog expansion

Patent number: 11527237

Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.

Type: Grant

Filed: September 18, 2020

Date of Patent: December 13, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
PROACTIVE COMMAND FRAMEWORK

Publication number: 20220246149

Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.

Type: Application

Filed: February 28, 2022

Publication date: August 4, 2022

Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
Discriminating ambiguous expressions to enhance user experience

Patent number: 11386268

Abstract: Methods and systems are provided for discriminating ambiguous expressions to enhance user experience. For example, a natural language expression may be received by a speech recognition component. The natural language expression may include at least one of words, terms, and phrases of text. A dialog hypothesis set from the natural language expression may be created by using contextual information. In some cases, the dialog hypothesis set has at least two dialog hypotheses. A plurality of dialog responses may be generated for the dialog hypothesis set. The dialog hypothesis set may be ranked based on an analysis of the plurality of the dialog responses. An action may be performed based on ranking the dialog hypothesis set.

Type: Grant

Filed: December 4, 2017

Date of Patent: July 12, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jean-Philippe Robichaud, Ruhi Sarikaya
Natural language processing

Patent number: 11380308

Abstract: Devices and techniques are generally described for using user feedback to determine routing decisions in a speech processing system. In various examples, first data representing a first utterance may be received. Second data representing a first semantic interpretation of the first utterance may be determined. A first intent data processing application may be selected for processing the second data. Feedback data may be determined related to the first intent data processing application processing the second data. Third data representing a semantic interpretation of a second utterance may be received, wherein the first semantic interpretation is the same as the second semantic interpretation. A second intent data processing application may be determined for processing the third data based at least in part on the feedback data.

Type: Grant

Filed: December 13, 2019

Date of Patent: July 5, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Rajesh Kumar Pandey, Ruhi Sarikaya, Shubham Katiyar, Arun Kumar Thenappan, Isaac Joseph Madwed, Jihwan Lee, David Thomas, Julia Kennedy Nemer, Mohamed Farouk AbdelHady, Joe Pemberton, Young-Bum Kim, Prasha Shrestha, Hao Yuan
CONFIGURABLE GENERIC LANGUAGE UNDERSTANDING MODELS

Publication number: 20220100972

Abstract: Examples of the present disclosure describe systems and methods of configuring generic language understanding models. In aspects, one or more previously configured schemas for various applications may be identified and collected. A generic schema may be generated using the collected schemas. The collected schemas may be programmatically mapped to the generic schema. The generic schema may be used to train one or more models. An interface may be provided to allow browsing the models. The interface may include a configuration mechanism that provides for selecting one or more of the models. The selected models may be bundled programmatically, such that the information and instructions needed to implement the models are configured programmatically. The bundled models may then be provided to a requestor.

Type: Application

Filed: December 14, 2021

Publication date: March 31, 2022

Applicant: Microsoft Technology Licensing, LLC

Inventors: Ruhi Sarikaya, Asli Celikyilmaz, Young-Bum Kim, Zhaleh Feizollahi, Nikhil Ramesh, Hisami Suzuki, Alexandre Rochette
Routing of natural language inputs to speech processing applications

Patent number: 11289075

Abstract: Devices and techniques are generally described for using user feedback to determine routing decisions in a speech processing system. In various examples, first data representing a first utterance may be received. Second data representing a first semantic interpretation of the first utterance may be determined. A first intent data processing application may be selected for processing the second data. Feedback data may be determined related to the first intent data processing application processing the second data. Third data representing a semantic interpretation of a second utterance may be received, wherein the first semantic interpretation is the same as the second semantic interpretation. A second intent data processing application may be determined for processing the third data based at least in part on the feedback data.

Type: Grant

Filed: December 13, 2019

Date of Patent: March 29, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Rajesh Kumar Pandey, Ruhi Sarikaya, Shubham Katiyar, Arun Kumar Thenappan, Isaac Joseph Madwed, Jihwan Lee, David Thomas, Julia Kennedy Nemer, Mohamed Farouk AbdelHady, Joe Pemberton, Young-Bum Kim, Hao Yuan, Prasha Shrestha
Natural language speech processing application selection

Patent number: 11276403

Abstract: Techniques for limiting natural language processing performed on input data are described. A system receives input data from a device. The input data corresponds to a command to be executed by the system. The system determines applications likely configured to execute the command. The system performs named entity recognition and intent classification with respect to only the applications likely configured to execute the command.

Type: Grant

Filed: November 25, 2019

Date of Patent: March 15, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Ruhi Sarikaya, Rohit Prasad, Kerry Hammil, Spyridon Matsoukas, Nikko Strom, Frédéric Johan Georges Deramat, Stephen Frederick Potter, Young-Bum Kim
Proactive command framework

Patent number: 11270698

Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system is to execute the second command.

Type: Grant

Filed: August 26, 2019

Date of Patent: March 8, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
