Patents by Inventor Deepak Narayanan

Deepak Narayanan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250060998
    Abstract: Systems and methods for optimizing thread allocation in a model serving system include estimating a batch size for inference requests. An optimal configuration is then determined that defines a number of inference instances, a number of threads per inference instance, and a sub-batch size per inference instance for processing a batch of inference requests of the batch size using intra-operator parallelism that minimizes average per-batch latency. The optimal configuration is determined with reference to a plurality of predetermined model profiles that define single-inference average batch latencies for different combinations of thread counts and batch sizes, the predetermined model profiles being used as input to a dynamic programming algorithm that identifies optimal configurations that minimize the average per-batch latency.
    Type: Application
    Filed: August 18, 2023
    Publication date: February 20, 2025
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Amar PHANISHAYEE, . Ankit, Deepak NARAYANAN, Mihail Gavril TARTA
  • Publication number: 20240273397
    Abstract: The present disclosure relates to methods and systems that create a lineage graph that tracks provenance information across machine learning models. The methods and systems use the lineage graph to facilitate machine learning model testing, diagnostics, and updating. The methods and system also use the lineage graph to determine a storage optimization for reducing a storage footprint of the machine learning models.
    Type: Application
    Filed: May 8, 2023
    Publication date: August 15, 2024
    Inventors: Deepak NARAYANAN, Amar PHANISHAYEE, Daniel Marcos MENDOZA, Wei HAO
  • Patent number: 12056604
    Abstract: Layers of a deep neural network (DNN) are partitioned into stages using a profile of the DNN. Each of the stages includes one or more of the layers of the DNN. The partitioning of the layers of the DNN into stages is optimized in various ways including optimizing the partitioning to minimize training time, to minimize data communication between worker computing devices used to train the DNN, or to ensure that the worker computing devices perform an approximately equal amount of the processing for training the DNN. The stages are assigned to the worker computing devices. The worker computing devices process batches of training data using a scheduling policy that causes the workers to alternate between forward processing of the batches of the DNN training data and backward processing of the batches of the DNN training data. The stages can be configured for model parallel processing or data parallel processing.
    Type: Grant
    Filed: June 29, 2018
    Date of Patent: August 6, 2024
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Vivek Seshadri, Amar Phanishayee, Deepak Narayanan, Aaron Harlap, Nikhil Devanur Rangarajan
  • Patent number: 11379862
    Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.
    Type: Grant
    Filed: September 9, 2020
    Date of Patent: July 5, 2022
    Assignee: American Express Travel Related Service Company, Inc
    Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
  • Patent number: 10929406
    Abstract: A system for generating and delivering custom data sets in a big data environment may receive a preselected schema that identifies a plurality of columns from a plurality of data sources for inclusion in an output data file. The system reads data from the data sources to generate a data file containing a big data table. The system monitors the plurality of data sources to detect that the data sources have been ingested into a data storage system. The data file is read and a column is filtered from the data file to generate the output data file in response to the preselected schema excluding the column. The output data file is transferred to a client device.
    Type: Grant
    Filed: October 27, 2016
    Date of Patent: February 23, 2021
    Assignee: American Express Travel Related Services Company, Inc
    Inventors: Jane Cook, Sachin Jadhav, Yogaraj Jayaprakasam, Deepak Narayanan, Rahul Shaurya
  • Patent number: 10776795
    Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.
    Type: Grant
    Filed: February 10, 2017
    Date of Patent: September 15, 2020
    Assignee: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.
    Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
  • Publication number: 20190362227
    Abstract: Layers of a deep neural network (DNN) are partitioned into stages using a profile of the DNN. Each of the stages includes one or more of the layers of the DNN. The partitioning of the layers of the DNN into stages is optimized in various ways including optimizing the partitioning to minimize training time, to minimize data communication between worker computing devices used to train the DNN, or to ensure that the worker computing devices perform an approximately equal amount of the processing for training the DNN. The stages are assigned to the worker computing devices. The worker computing devices process batches of training data using a scheduling policy that causes the workers to alternate between forward processing of the batches of the DNN training data and backward processing of the batches of the DNN training data. The stages can be configured for model parallel processing or data parallel processing.
    Type: Application
    Filed: June 29, 2018
    Publication date: November 28, 2019
    Inventors: Vivek SESHADRI, Amar PHANISHAYEE, Deepak NARAYANAN, Aaron HARLAP, Nikhil Devanur RANGARAJAN
  • Publication number: 20180232730
    Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.
    Type: Application
    Filed: February 10, 2017
    Publication date: August 16, 2018
    Applicant: American Express Travel Related Services Company, Inc.
    Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
  • Publication number: 20180121519
    Abstract: A system for generating and delivering custom data sets in a big data environment may receive a preselected schema that identifies a plurality of columns from a plurality of data sources for inclusion in an output data file. The system reads data from the data sources to generate a data file containing a big data table. The system monitors the plurality of data sources to detect that the data sources have been ingested into a data storage system. The data file is read and a column is filtered from the data file to generate the output data file in response to the preselected schema excluding the column. The output data file is transferred to a client device.
    Type: Application
    Filed: October 27, 2016
    Publication date: May 3, 2018
    Applicant: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.
    Inventors: Jane Cook, Sachin Jadhav, Yogaraj Jayaprakasam, Deepak Narayanan, Rahul Shaurya