Patents by Inventor Deepak Narayanan

Deepak Narayanan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AUTOMATIC LATENCY OPTIMIZATION FOR CPU-BASED DNN SERVING

Publication number: 20250060998

Abstract: Systems and methods for optimizing thread allocation in a model serving system include estimating a batch size for inference requests. An optimal configuration is then determined that defines a number of inference instances, a number of threads per inference instance, and a sub-batch size per inference instance for processing a batch of inference requests of the batch size using intra-operator parallelism that minimizes average per-batch latency. The optimal configuration is determined with reference to a plurality of predetermined model profiles that define single-inference average batch latencies for different combinations of thread counts and batch sizes, the predetermined model profiles being used as input to a dynamic programming algorithm that identifies optimal configurations that minimize the average per-batch latency.

Type: Application

Filed: August 18, 2023

Publication date: February 20, 2025

Applicant: Microsoft Technology Licensing, LLC

Inventors: Amar PHANISHAYEE, . Ankit, Deepak NARAYANAN, Mihail Gavril TARTA
MACHINE LEARNING MODEL LINEAGE TRACKING

Publication number: 20240273397

Abstract: The present disclosure relates to methods and systems that create a lineage graph that tracks provenance information across machine learning models. The methods and systems use the lineage graph to facilitate machine learning model testing, diagnostics, and updating. The methods and system also use the lineage graph to determine a storage optimization for reducing a storage footprint of the machine learning models.

Type: Application

Filed: May 8, 2023

Publication date: August 15, 2024

Inventors: Deepak NARAYANAN, Amar PHANISHAYEE, Daniel Marcos MENDOZA, Wei HAO
Highly performant pipeline parallel deep neural network training

Patent number: 12056604

Abstract: Layers of a deep neural network (DNN) are partitioned into stages using a profile of the DNN. Each of the stages includes one or more of the layers of the DNN. The partitioning of the layers of the DNN into stages is optimized in various ways including optimizing the partitioning to minimize training time, to minimize data communication between worker computing devices used to train the DNN, or to ensure that the worker computing devices perform an approximately equal amount of the processing for training the DNN. The stages are assigned to the worker computing devices. The worker computing devices process batches of training data using a scheduling policy that causes the workers to alternate between forward processing of the batches of the DNN training data and backward processing of the batches of the DNN training data. The stages can be configured for model parallel processing or data parallel processing.

Type: Grant

Filed: June 29, 2018

Date of Patent: August 6, 2024

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Vivek Seshadri, Amar Phanishayee, Deepak Narayanan, Aaron Harlap, Nikhil Devanur Rangarajan
Data amelioration and reformation system

Patent number: 11379862

Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.

Type: Grant

Filed: September 9, 2020

Date of Patent: July 5, 2022

Assignee: American Express Travel Related Service Company, Inc

Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
Systems and methods for a self-services data file configuration with various data sources

Patent number: 10929406

Abstract: A system for generating and delivering custom data sets in a big data environment may receive a preselected schema that identifies a plurality of columns from a plurality of data sources for inclusion in an output data file. The system reads data from the data sources to generate a data file containing a big data table. The system monitors the plurality of data sources to detect that the data sources have been ingested into a data storage system. The data file is read and a column is filtered from the data file to generate the output data file in response to the preselected schema excluding the column. The output data file is transferred to a client device.

Type: Grant

Filed: October 27, 2016

Date of Patent: February 23, 2021

Assignee: American Express Travel Related Services Company, Inc

Inventors: Jane Cook, Sachin Jadhav, Yogaraj Jayaprakasam, Deepak Narayanan, Rahul Shaurya
Data amelioration and reformation system

Patent number: 10776795

Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.

Type: Grant

Filed: February 10, 2017

Date of Patent: September 15, 2020

Assignee: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.

Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
HIGHLY PERFORMANT PIPELINE PARALLEL DEEP NEURAL NETWORK TRAINING

Publication number: 20190362227

Abstract: Layers of a deep neural network (DNN) are partitioned into stages using a profile of the DNN. Each of the stages includes one or more of the layers of the DNN. The partitioning of the layers of the DNN into stages is optimized in various ways including optimizing the partitioning to minimize training time, to minimize data communication between worker computing devices used to train the DNN, or to ensure that the worker computing devices perform an approximately equal amount of the processing for training the DNN. The stages are assigned to the worker computing devices. The worker computing devices process batches of training data using a scheduling policy that causes the workers to alternate between forward processing of the batches of the DNN training data and backward processing of the batches of the DNN training data. The stages can be configured for model parallel processing or data parallel processing.

Type: Application

Filed: June 29, 2018

Publication date: November 28, 2019

Inventors: Vivek SESHADRI, Amar PHANISHAYEE, Deepak NARAYANAN, Aaron HARLAP, Nikhil Devanur RANGARAJAN
Data Amelioration and Reformation System

Publication number: 20180232730

Abstract: The systems may include receiving, by a processor, transaction information for a transaction, wherein the transaction information comprises a transaction amount; matching, by the processor, the transaction information with a transaction type; retrieving, by the processor, a plurality of possible charge types associated with the transaction type; comparing, by the processor, the transaction information with the plurality of possible charge types; separating, by the processor, the transaction amount of the transaction information into at least one individual charge amount; and/or identifying, by the processor, a charge type of the plurality of possible charge types associated with the at least one individual charge amount.

Type: Application

Filed: February 10, 2017

Publication date: August 16, 2018

Applicant: American Express Travel Related Services Company, Inc.

Inventors: Amy L. Harbour, Michael A. Woods, Jane E. Cook, Michael Alexander Gonzales, Sachin Jadhav, Yogaraj Jeyaprakasam, Deepak Narayanan
SYSTEMS AND METHODS FOR A SELF-SERVICES DATA FILE CONFIGURATION WITH VARIOUS DATA SOURCES

Publication number: 20180121519

Abstract: A system for generating and delivering custom data sets in a big data environment may receive a preselected schema that identifies a plurality of columns from a plurality of data sources for inclusion in an output data file. The system reads data from the data sources to generate a data file containing a big data table. The system monitors the plurality of data sources to detect that the data sources have been ingested into a data storage system. The data file is read and a column is filtered from the data file to generate the output data file in response to the preselected schema excluding the column. The output data file is transferred to a client device.

Type: Application

Filed: October 27, 2016

Publication date: May 3, 2018

Applicant: AMERICAN EXPRESS TRAVEL RELATED SERVICES COMPANY, INC.

Inventors: Jane Cook, Sachin Jadhav, Yogaraj Jayaprakasam, Deepak Narayanan, Rahul Shaurya