Patents by Inventor Ruhi Sarikaya

Ruhi Sarikaya has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240153505
    Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.
    Type: Application
    Filed: October 19, 2023
    Publication date: May 9, 2024
    Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
  • Publication number: 20240153489
    Abstract: Techniques for optimizing a system to improve an overall user satisfaction in a speech controlled system are described. A user speaks an utterance and the system compares an expected sum of user satisfaction values for each action to make a decision as to how best to process the utterance. As a result, the system may make a decision that decreases user satisfaction in the short term but increases user satisfaction in the long term. The system may estimate a user satisfaction value and associate the estimated user satisfaction value with a current dialog state. By tracking user satisfaction values over time, the system may train machine learning models to optimize the expected sum of user satisfaction values. This improves how the system selects an action or application to which to dispatch the dialog state and how a specific application selects an action or intent corresponding to the command.
    Type: Application
    Filed: January 17, 2024
    Publication date: May 9, 2024
    Inventors: Alborz Geramifard, Shiladitya Roy, Ruhi Sarikaya
  • Patent number: 11929070
    Abstract: Techniques for performing centralized unsuperivised learning in a multi-domain system are described. A user may request labeled data for an ML task, where the request includes a prompt for obtaining relevant explicit user feedback. The system may use the prompt to collect explicit user feedback for relevant runtime user inputs. After a duration of time (in the user's request for labeled data) has elapsed, the system determines whether collected user feedback indicates processing of the user input was defective and, if so, determines a cause of the defective processing. The system then uses one or more label generators to generate labeled data using the collected user feedback, whether the processing was defective, and the determined defect cause.
    Type: Grant
    Filed: August 30, 2021
    Date of Patent: March 12, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Ruhi Sarikaya, Zheng Du, Xiaohu Liu, Kai Liu, Sriharsha Venkata Chintalapati, Chenlei Guo, Hung Tuan Pham, Joe Pemberton, Zhenyu Yao, Bigyan Rajbhandari
  • Publication number: 20240029708
    Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determining a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.
    Type: Application
    Filed: May 26, 2023
    Publication date: January 25, 2024
    Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
  • Patent number: 11823678
    Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.
    Type: Grant
    Filed: February 28, 2022
    Date of Patent: November 21, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
  • Patent number: 11810576
    Abstract: In non-limiting examples of the present disclosure, systems, methods and devices for providing personalized experiences to a computing device based on user input such as voice, text and gesture input are provided. Acoustic patterns associated with voice input, speech patterns, language patterns and natural language processing may be used to identify a specific user providing input from a plurality of users, identify user background characteristics and traits for the specific user, and topically categorize user input in a tiered hierarchical index. Topically categorized user input may be supplemented with user data and world knowledge and personalized responses and feedback for an identified specific user may be provided reactively and proactively.
    Type: Grant
    Filed: October 6, 2020
    Date of Patent: November 7, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Ruhi Sarikaya
  • Patent number: 11705108
    Abstract: Techniques for generating a visual response to a user input are described. A system may receive input data corresponding to a user input, determining a first skill component is to determine a response to the user input, and determine a second skill component is to determine supplemental content related to the user input. The system may also determine a template for presenting a visual response to the user input, where the template is configured for presenting the response and the supplemental content. The system may receive, from the first skill component, first image data corresponding to the first response. The system may also receive, from the second skill component, second image data corresponding to the first supplemental content. The system may send, to a device including a display, a command to present the first image data and the second image data using the template.
    Type: Grant
    Filed: December 10, 2021
    Date of Patent: July 18, 2023
    Assignee: Amazon Technologies, Inc.
    Inventors: Vasiliy Radostev, Ruhi Sarikaya, Rekha Seshadrinathan, Abhinav Sethy, Chetan Nagaraj Naik, Anjishnu Kumar
  • Publication number: 20230215425
    Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.
    Type: Application
    Filed: December 7, 2022
    Publication date: July 6, 2023
    Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
  • Publication number: 20230142892
    Abstract: Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.
    Type: Application
    Filed: January 6, 2023
    Publication date: May 11, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Paul CROOK, Vasiliy RADOSTEV, Omar Zia KHAN, Vipul AGARWAL, Ruhi SARIKAYA, Marius Alexandru MARIN, Alexandre ROCHETTE, Jean-Philippe ROBICHAUD
  • Patent number: 11574635
    Abstract: Conversational understanding systems allow users to conversationally interface with a computing device. In examples, a query may be received that includes a request for execution of a task. A data exchange task definition may be accessed. The data exchange task definition assists a conversational understanding system in managing task state tracking for information needed for task execution. Using the data exchange task definition, a per-turn policy for interacting with the user computing device is generated based on the state of a dialogue with a computing device and an evaluation of a process flow chart provided by a task owner resource. The task owner resource may be independent from the conversational understanding system. A response to the query may be generated and output based on the per-turn policy. In examples, the per-turn policy is used to generate one or more responses during a dialogue with a user via a computing device.
    Type: Grant
    Filed: December 20, 2019
    Date of Patent: February 7, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Paul Crook, Vasiliy Radostev, Omar Zia Khan, Vipul Agarwal, Ruhi Sarikaya, Marius Alexandru Marin, Alexandre Rochette, Jean-Philippe Robichaud
  • Patent number: 11568863
    Abstract: Devices and techniques are generally described for application determination in speech processing. Input data corresponding to a spoken utterance may be received. Speech recognition processing may be performed on the input data to generate text data. A machine learning encoder may generate a vector representation of the input data. A first binary classifier may determine a first probability that the input data corresponds to a first speech-processing application. A second binary classifier may determine a second probability that the input data corresponds to a second speech-processing application. A selection between the first speech-processing application and the second speech-processing application may be made based at least in part on the first probability and the second probability.
    Type: Grant
    Filed: March 23, 2018
    Date of Patent: January 31, 2023
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Ruhi Sarikaya, Dongchan Kim, Young-Bum Kim
  • Patent number: 11551681
    Abstract: Devices and techniques are generally described for a speech processing routing architecture. In various examples, first data comprising a first feature definition is received. The first feature definition may include a first indication of first source data and first instructions for generating feature data using the first source data. In various examples, the feature data may be generated according to the first feature definition. In some examples, a speech processing system may receive a first request to process a first utterance. The feature data may be retrieved from a non-transitory computer-readable memory. The speech processing system may determine a first skill for processing the first utterance based at least in part on the feature data.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: January 10, 2023
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Rajesh Kumar Pandey, Ruhi Sarikaya, Shubham Katiyar, Arun Kumar Thenappan, Isaac Joseph Madwed, Jihwan Lee, David Thomas, Julia Kennedy Nemer, Mohamed Farouk AbdelHady, Joe Pemberton, Young-Bum Kim, Arima Vu Ram Thayumanavar, Wangyao Ge
  • Patent number: 11527237
    Abstract: Techniques for recommending a skill experience to a user after a user-system dialog session has ended are described. Upon a dialog session ending, the system uses a first machine learning model to determine potential intents to recommend to a user. The system then uses a second machine learning model to determine a particular skill and intent to recommend. The system then prompts the user to accept the recommended skill and intent. If the user accepts, the system calls the recommended skill to execute. As part of calling the skill, the system sends to the skill at least one entity provided in a natural language user input of the ended dialog session. This enables the skill to skip welcome prompts, and initiate processing to output a response based on the intent and the at least one entity of the ended dialog session.
    Type: Grant
    Filed: September 18, 2020
    Date of Patent: December 13, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Ruhi Sarikaya, Hung Tuan Pham, Savas Parastatidis, Dean Curtis, Pushpendre Rastogi, Nitin Ashok Jain, John Arland Nave, Abhinav Sethy, Arpit Gupta, Mayank Kumar, Nakul Dahiwade, Arshdeep Singh, Nikhil Reddy Kortha, Rohit Prasad
  • Publication number: 20220246149
    Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.
    Type: Application
    Filed: February 28, 2022
    Publication date: August 4, 2022
    Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya
  • Patent number: 11386268
    Abstract: Methods and systems are provided for discriminating ambiguous expressions to enhance user experience. For example, a natural language expression may be received by a speech recognition component. The natural language expression may include at least one of words, terms, and phrases of text. A dialog hypothesis set from the natural language expression may be created by using contextual information. In some cases, the dialog hypothesis set has at least two dialog hypotheses. A plurality of dialog responses may be generated for the dialog hypothesis set. The dialog hypothesis set may be ranked based on an analysis of the plurality of the dialog responses. An action may be performed based on ranking the dialog hypothesis set.
    Type: Grant
    Filed: December 4, 2017
    Date of Patent: July 12, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jean-Philippe Robichaud, Ruhi Sarikaya
  • Patent number: 11380308
    Abstract: Devices and techniques are generally described for using user feedback to determine routing decisions in a speech processing system. In various examples, first data representing a first utterance may be received. Second data representing a first semantic interpretation of the first utterance may be determined. A first intent data processing application may be selected for processing the second data. Feedback data may be determined related to the first intent data processing application processing the second data. Third data representing a semantic interpretation of a second utterance may be received, wherein the first semantic interpretation is the same as the second semantic interpretation. A second intent data processing application may be determined for processing the third data based at least in part on the feedback data.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: July 5, 2022
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Rajesh Kumar Pandey, Ruhi Sarikaya, Shubham Katiyar, Arun Kumar Thenappan, Isaac Joseph Madwed, Jihwan Lee, David Thomas, Julia Kennedy Nemer, Mohamed Farouk AbdelHady, Joe Pemberton, Young-Bum Kim, Prasha Shrestha, Hao Yuan
  • Publication number: 20220100972
    Abstract: Examples of the present disclosure describe systems and methods of configuring generic language understanding models. In aspects, one or more previously configured schemas for various applications may be identified and collected. A generic schema may be generated using the collected schemas. The collected schemas may be programmatically mapped to the generic schema. The generic schema may be used to train on ore more models. An interface may be provided to allow browsing the models. The interface may include a configuration mechanism that provides for selecting on or more of the models. The selected models may be bundled programmatically, such that the information and instructions needed to implement the models are configured programmatically. The bundled models may then be provided to a requestor.
    Type: Application
    Filed: December 14, 2021
    Publication date: March 31, 2022
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Ruhi Sarikaya, Asli Celikyilmaz, Young-Bum Kim, Zhaleh Feizollahi, Nikhil Ramesh, Hisami Suzuki, Alexandre Rochette
  • Patent number: 11289075
    Abstract: Devices and techniques are generally described for using user feedback to determine routing decisions in a speech processing system. In various examples, first data representing a first utterance may be received. Second data representing a first semantic interpretation of the first utterance may be determined. A first intent data processing application may be selected for processing the second data. Feedback data may be determined related to the first intent data processing application processing the second data. Third data representing a semantic interpretation of a second utterance may be received, wherein the first semantic interpretation is the same as the second semantic interpretation. A second intent data processing application may be determined for processing the third data based at least in part on the feedback data.
    Type: Grant
    Filed: December 13, 2019
    Date of Patent: March 29, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Rajesh Kumar Pandey, Ruhi Sarikaya, Shubham Katiyar, Arun Kumar Thenappan, Isaac Joseph Madwed, Jihwan Lee, David Thomas, Julia Kennedy Nemer, Mohamed Farouk AbdelHady, Joe Pemberton, Young-Bum Kim, Hao Yuan, Prasha Shrestha
  • Patent number: 11276403
    Abstract: Techniques for limiting natural language processing performed on input data are described. A system receives input data from a device. The input data corresponds to a command to be executed by the system. The system determines applications likely configured to execute the command. The system performs named entity recognition and intent classification with respect to only the applications likely configured to execute the command.
    Type: Grant
    Filed: November 25, 2019
    Date of Patent: March 15, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Ruhi Sarikaya, Rohit Prasad, Kerry Hammil, Spyridon Matsoukas, Nikko Strom, Frédéric Johan Georges Deramat, Stephen Frederick Potter, Young-Bum Kim
  • Patent number: 11270698
    Abstract: Techniques for determining a command or intent likely to be subsequently invoked by a user of a system are described. A user inputs a command (either via a spoken utterance or textual input) to a system. The system determines content responsive to the command. The system also determines a second command or corresponding intent likely to be invoked by the user subsequent to the previous command. Such determination may involve analyzing pairs of intents, with each pair being associated with a probability that one intent of the pair will be invoked by a user subsequent to a second intent of the pair. The system then outputs first content responsive to the first command and second content soliciting the user as to whether the system to execute the second command.
    Type: Grant
    Filed: August 26, 2019
    Date of Patent: March 8, 2022
    Assignee: Amazon Technologies, Inc.
    Inventors: Anjishnu Kumar, Xing Fan, Arpit Gupta, Ruhi Sarikaya