Patents by Inventor Kalpesh N. Sutaria
Kalpesh N. Sutaria has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11580968Abstract: Techniques are described for a contextual natural language understanding (cNLU) framework that is able to incorporate contextual signals of variable history length to perform joint intent classification (IC) and slot labeling (SL) tasks. A user utterance provided by a user within a multi-turn chat dialog between the user and a conversational agent is received. The user utterance and contextual information associated with one or more previous turns of the multi-turn chat dialog is provided to a machine learning (ML) model. An intent classification and one or more slot labels for the user utterance are then obtained from the ML model. The cNLU framework described herein thus uses, in addition to a current utterance itself, various contextual signals as input to a model to generate IC and SL predictions for each utterance of a multi-turn chat dialog.Type: GrantFiled: June 27, 2019Date of Patent: February 14, 2023Assignee: Amazon Technologies, Inc.Inventors: Arshit Gupta, Peng Zhang, Rashmi Gangadharaiah, Garima Lalwani, Roger Scott Jenke, Hassan Sawaf, Mona Diab, Katrin Kirchhoff, Adel A. Youssef, Kalpesh N. Sutaria
-
Patent number: 11562288Abstract: Techniques for hosting adding and warming a host are described. In some instances, a method of determining that at least one group of hosts is to be increased by adding an additional host to the group of hosts; sending a request to the group of hosts for a list of machine learning models loaded per host of the group of hosts; receiving, from each host, the list of loaded machine learning models; loading at least a proper subset of list of loaded machine learning models into random access memory of the at least one group; receiving a request to perform an inference; routing the request to the additional host of the group of hosts; performing an inference using the additional host of the group of hosts; and providing a result of the inference to an external entity is described.Type: GrantFiled: September 28, 2018Date of Patent: January 24, 2023Assignee: Amazon Technologies, Inc.Inventors: Enrico Sartorello, Stefano Stefani, Nikhil Kandoi, Rama Krishna Sandeep Pokkunuri, Kalpesh N. Sutaria, Navneet Sabbineni, Ganesh Kumar Gella, Cheng Ran Li
-
Patent number: 11436524Abstract: Techniques for hosting machine learning models are described. In some instances, a method of receiving a request to perform an inference using a particular machine learning model; determining a group of hosts to route the request to, the group of hosts to host a plurality of machine learning models including the particular machine learning model; determining a path to the determined group of hosts; determining a particular host of the group of hosts to perform an analysis of the request based on the determined path, the particular host having the particular machine learning model in memory; routing the request to the particular host of the group of hosts; performing inference on the request using the particular host; and providing a result of the inference to a requester is performed.Type: GrantFiled: September 28, 2018Date of Patent: September 6, 2022Assignee: Amazon Technologies, Inc.Inventors: Nikhil Kandoi, Ganesh Kumar Gella, Rama Krishna Sandeep Pokkunuri, Sudhakar Rao Puvvadi, Stefano Stefani, Kalpesh N. Sutaria, Enrico Sartorello, Tania Khattar
-
Patent number: 11281857Abstract: Techniques for resolving a label are described. For example, a method of generating one or more labels corresponding to a substring of a tokenized utterance, each of the labels including one or more slot types, resolving the one or more generated labels into machine-readable values by: generating a tree of the generated one or more labels, the graph including a plurality of leaf nodes, wherein each rule of associated with a leaf node references a resolver, and resolving the generated tree from the leaf nodes to a root of the generated tree, by at each node, calling the referenced resolver and passing a result from each resolver to a subsequent resolver of the tree; and outputting a result of the resolving to an entity as dictated by the request is described.Type: GrantFiled: November 8, 2018Date of Patent: March 22, 2022Assignee: Amazon Technologies, Inc.Inventors: Kevin Michael Craft, Kalpesh N. Sutaria, Rama Krishna Sandeep Pokkunuri, Stefano Stefani, Bangrui Liu
-
Patent number: 11138374Abstract: Techniques for building slot types and slot type artifacts are described. For example, in some implementations, a natural language understanding (NLU) service is implemented by instructions that upon execution are to cause: receiving a request to build a slot type, the request including an indication of one or more locations storing information need to build a slot type, a slot type being a list of one or more values used to train a machine learning model to recognize a value for a slot; retrieving a slot type definition bundle associated with the request, the slot type definition bundle at least including a slot type definition defining resolution activities for the slot type; building the slot type using the retrieved slot type definition bundle; and storing into the storage slot type artifacts produced from building the slot type to be used during an analysis of an utterance.Type: GrantFiled: November 8, 2018Date of Patent: October 5, 2021Assignee: Amazon Technologies, Inc.Inventors: Kevin Michael Craft, Kalpesh N. Sutaria
-
Publication number: 20200104750Abstract: Techniques for hosting machine learning models are described. In some instances, a method of receiving a request to perform an inference using a particular machine learning model; determining a group of hosts to route the request to, the group of hosts to host a plurality of machine learning models including the particular machine learning model; determining a path to the determined group of hosts; determining a particular host of the group of hosts to perform an analysis of the request based on the determined path, the particular host having the particular machine learning model in memory; routing the request to the particular host of the group of hosts; performing inference on the request using the particular host; and providing a result of the inference to a requester is performed.Type: ApplicationFiled: September 28, 2018Publication date: April 2, 2020Inventors: Nikhil KANDOI, Ganesh Kumar GELLA, Rama Krishna Sandeep POKKUNURI, Sudhakar Rao PUVVADI, Stefano STEFANI, Kalpesh N. SUTARIA, Enrico SARTORELLO, Tania KHATTAR
-
Publication number: 20200104749Abstract: Techniques for hosting adding and warming a host are described. In some instances, a method of determining that at least one group of hosts is to be increased by adding an additional host to the group of hosts; sending a request to the group of hosts for a list of machine learning models loaded per host of the group of hosts; receiving, from each host, the list of loaded machine learning models; loading at least a proper subset of list of loaded machine learning models into random access memory of the at least one group; receiving a request to perform an inference; routing the request to the additional host of the group of hosts; performing an inference using the additional host of the group of hosts; and providing a result of the inference to an external entity is described.Type: ApplicationFiled: September 28, 2018Publication date: April 2, 2020Inventors: Enrico Sartorello, Stefano Stefani, Nikhil Kandoi, Rama Krishna Sandeep Pokkunuri, Kalpesh N. Sutaria, Navneet Sabbineni, Ganesh Kumar Gella, Cheng Ran Li