Patents by Inventor Deepak Narayanan
Deepak Narayanan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250060998Abstract: Systems and methods for optimizing thread allocation in a model serving system include estimating a batch size for inference requests. An optimal configuration is then determined that defines a number of inference instances, a number of threads per inference instance, and a sub-batch size per inference instance for processing a batch of inference requests of the batch size using intra-operator parallelism that minimizes average per-batch latency. The optimal configuration is determined with reference to a plurality of predetermined model profiles that define single-inference average batch latencies for different combinations of thread counts and batch sizes, the predetermined model profiles being used as input to a dynamic programming algorithm that identifies optimal configurations that minimize the average per-batch latency.Type: ApplicationFiled: August 18, 2023Publication date: February 20, 2025Applicant: Microsoft Technology Licensing, LLCInventors: Amar PHANISHAYEE, . Ankit, Deepak NARAYANAN, Mihail Gavril TARTA
-
Publication number: 20240273397Abstract: The present disclosure relates to methods and systems that create a lineage graph that tracks provenance information across machine learning models. The methods and systems use the lineage graph to facilitate machine learning model testing, diagnostics, and updating. The methods and system also use the lineage graph to determine a storage optimization for reducing a storage footprint of the machine learning models.Type: ApplicationFiled: May 8, 2023Publication date: August 15, 2024Inventors: Deepak NARAYANAN, Amar PHANISHAYEE, Daniel Marcos MENDOZA, Wei HAO
-
Patent number: 12056604Abstract: Layers of a deep neural network (DNN) are partitioned into stages using a profile of the DNN. Each of the stages includes one or more of the layers of the DNN. The partitioning of the layers of the DNN into stages is optimized in various ways including optimizing the partitioning to minimize training time, to minimize data communication between worker computing devices used to train the DNN, or to ensure that the worker computing devices perform an approximately equal amount of the processing for training the DNN. The stages are assigned to the worker computing devices. The worker computing devices process batches of training data using a scheduling policy that causes the workers to alternate between forward processing of the batches of the DNN training data and backward processing of the batches of the DNN training data. The stages can be configured for model parallel processing or data parallel processing.Type: GrantFiled: June 29, 2018Date of Patent: August 6, 2024Assignee: MICROSOFT TECHNOLOGY LICENSING, LLCInventors: Vivek Seshadri, Amar Phanishayee, Deepak Narayanan, Aaron Harlap, Nikhil Devanur Rangarajan
-
Patent number: 11379862Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.Type: GrantFiled: September 9, 2020Date of Patent: July 5, 2022Assignee: American Express Travel Related Service Company, IncInventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
-
Patent number: 10929406Abstract: A system for generating and delivering custom data sets in a big data environment may receive a preselected schema that identifies a plurality of columns from a plurality of data sources for inclusion in an output data file. The system reads data from the data sources to generate a data file containing a big data table. The system monitors the plurality of data sources to detect that the data sources have been ingested into a data storage system. The data file is read and a column is filtered from the data file to generate the output data file in response to the preselected schema excluding the column. The output data file is transferred to a client device.Type: GrantFiled: October 27, 2016Date of Patent: February 23, 2021Assignee: American Express Travel Related Services Company, IncInventors: Jane Cook, Sachin Jadhav, Yogaraj Jayaprakasam, Deepak Narayanan, Rahul Shaurya
-
Patent number: 10776795Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.Type: GrantFiled: February 10, 2017Date of Patent: September 15, 2020Assignee: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
-
Publication number: 20190362227Abstract: Layers of a deep neural network (DNN) are partitioned into stages using a profile of the DNN. Each of the stages includes one or more of the layers of the DNN. The partitioning of the layers of the DNN into stages is optimized in various ways including optimizing the partitioning to minimize training time, to minimize data communication between worker computing devices used to train the DNN, or to ensure that the worker computing devices perform an approximately equal amount of the processing for training the DNN. The stages are assigned to the worker computing devices. The worker computing devices process batches of training data using a scheduling policy that causes the workers to alternate between forward processing of the batches of the DNN training data and backward processing of the batches of the DNN training data. The stages can be configured for model parallel processing or data parallel processing.Type: ApplicationFiled: June 29, 2018Publication date: November 28, 2019Inventors: Vivek SESHADRI, Amar PHANISHAYEE, Deepak NARAYANAN, Aaron HARLAP, Nikhil Devanur RANGARAJAN
-
Publication number: 20180232730Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.Type: ApplicationFiled: February 10, 2017Publication date: August 16, 2018Applicant: American Express Travel Related Services Company, Inc.Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
-
Publication number: 20180121519Abstract: A system for generating and delivering custom data sets in a big data environment may receive a preselected schema that identifies a plurality of columns from a plurality of data sources for inclusion in an output data file. The system reads data from the data sources to generate a data file containing a big data table. The system monitors the plurality of data sources to detect that the data sources have been ingested into a data storage system. The data file is read and a column is filtered from the data file to generate the output data file in response to the preselected schema excluding the column. The output data file is transferred to a client device.Type: ApplicationFiled: October 27, 2016Publication date: May 3, 2018Applicant: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.Inventors: Jane Cook, Sachin Jadhav, Yogaraj Jayaprakasam, Deepak Narayanan, Rahul Shaurya