Patents by Inventor Chandra Kumar RAMASAMY

Chandra Kumar RAMASAMY has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ACCELERATING NEURAL NETWORKS WITH ONE SHOT SKIP LAYER PRUNING

Publication number: 20230351187

Abstract: Systems, methods, and devices for pruning a convolutional neural network (CNN). A subset of layers of the CNN is chosen, and for each layer of the subset of layers, how salient each filter in the layer is to an output of the CNN is determined, a subset of the filters in the layer is determined based on the salience of each filter in the layer, and the subset of filters in the layer is pruned. In some implementations, the layers of the subset of layers of the CNN are non-contiguous. In some implementations, the subset of layers includes odd numbered layers of the CNN and excludes even numbered layers of the CNN. In some implementations, the subset of layers includes even numbered layers of the CNN and excludes odd numbered layers of the CNN.

Type: Application

Filed: June 30, 2023

Publication date: November 2, 2023

Applicant: Advanced Micro Devices, Inc.

Inventors: Arun Coimbatore Ramachandran, Chandra Kumar Ramasamy, Prakash Sathyanath Raghavendra, Keerthan Shagrithaya
Accelerating neural networks with one shot skip layer pruning

Patent number: 11694081

Abstract: Systems, methods, and devices for pruning a convolutional neural network (CNN). A subset of layers of the CNN is chosen, and for each layer of the subset of layers, how salient each filter in the layer is to an output of the CNN is determined, a subset of the filters in the layer is determined based on the salience of each filter in the layer, and the subset of filters in the layer is pruned. In some implementations, the layers of the subset of layers of the CNN are non-contiguous. In some implementations, the subset of layers includes odd numbered layers of the CNN and excludes even numbered layers of the CNN. In some implementations, the subset of layers includes even numbered layers of the CNN and excludes odd numbered layers of the CNN.

Type: Grant

Filed: June 28, 2019

Date of Patent: July 4, 2023

Assignee: Advanced Micro Devices, Inc.

Inventors: Arun Coimbatore Ramachandran, Chandra Kumar Ramasamy, Prakash Sathyanath Raghavendra, Keerthan Subraya Shagrithaya
Method and apparatus for determining memory requirement in a network

Patent number: 11593644

Abstract: The present disclosure disclose method and apparatus for determining memory requirement for processing a DNN model on a device, a method includes receiving a DNN model for an input, wherein the DNN model includes a plurality of processing layers. The method includes generating a network graph of the DNN model. The method includes creating a colored network graph of the DNN model based on the identified execution order of the plurality of processing layers. The colored network graph indicates assignment of at least one memory buffer for storing at least one output of at least one processing layer. The method includes determining at least one buffer reuse overlap possibility across the plurality of processing layers. Based on the determined at least one buffer reuse overlap possibility, the method includes determining and assigning the memory required for processing the DNN model.

Type: Grant

Filed: August 8, 2018

Date of Patent: February 28, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Narasinga Rao Miniskar, Sirish Kumar Pasupuleti, Raj Narayana Gadde, Ashok Vishnoi, Vasanthakumar Rajagopal, Chandra Kumar Ramasamy
EFFICIENT WEIGHT CLIPPING FOR NEURAL NETWORKS

Publication number: 20210406690

Abstract: Systems, apparatuses, and methods for implementing one-sided per-kernel clipping and weight transformation for neural networks are disclosed. Various parameters of a neural network are quantized from higher-bit representations to lower-bit representations to reduce memory utilization and power consumption. To exploit the effective range of quantized representations, positively biased weights are clipped and negated before convolution. Then, the results are rescaled back after convolution. A one-sided clipping technique is used for transforming weights to exploit the quantization range effectively, with the side chosen to be clipped being the biased side. This technique uses a global strategy for clipping without requiring skilled expertise. This approach allows the system to retain as much information as possible without losing unnecessary accuracy when quantizing parameters from higher-bit representations to lower-bit representations.

Type: Application

Filed: September 25, 2020

Publication date: December 30, 2021

Inventors: Arun Coimbatore Ramachandran, Chandra Kumar Ramasamy, Keerthan S. Shagrithaya, Prakash Sathyanath Raghavendra, Vasanthakumar Rajagopal
ACCELERATING NEURAL NETWORKS WITH ONE SHOT SKIP LAYER PRUNING

Publication number: 20200364573

Abstract: Systems, methods, and devices for pruning a convolutional neural network (CNN). A subset of layers of the CNN is chosen, and for each layer of the subset of layers, how salient each filter in the layer is to an output of the CNN is determined, a subset of the filters in the layer is determined based on the salience of each filter in the layer, and the subset of filters in the layer is pruned. In some implementations, the layers of the subset of layers of the CNN are non-contiguous. In some implementations, the subset of layers includes odd numbered layers of the CNN and excludes even numbered layers of the CNN. In some implementations, the subset of layers includes even numbered layers of the CNN and excludes odd numbered layers of the CNN.

Type: Application

Filed: June 28, 2019

Publication date: November 19, 2020

Applicant: Advanced Micro Devices, Inc.

Inventors: Arun Coimbatore Ramachandran, Chandra Kumar Ramasamy, Prakash Sathyanath Raghavendra, Keerthan Subraya Shagrithaya
METHOD AND APPARATUS FOR DETERMINING MEMORY REQUIREMENT IN A NETWORK

Publication number: 20200257972

Abstract: The present disclosure disclose method and apparatus for determining memory requirement for processing a DNN model on a device, a method includes receiving a DNN model for an input, wherein the DNN model includes a plurality of processing layers. The method includes generating a network graph of the DNN model. The method includes creating a colored network graph of the DNN model based on the identified execution order of the plurality of processing layers. The colored network graph indicates assignment of at least one memory buffer for storing at least one output of at least one processing layer. The method includes determining at least one buffer reuse overlap possibility across the plurality of processing layers. Based on the determined at least one buffer reuse overlap possibility, the method includes determining and assigning the memory required for processing the DNN model.

Type: Application

Filed: August 8, 2018

Publication date: August 13, 2020

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Narasinga Rao MINISKAR, Sirish Kumar PASUPULETI, Raj Narayana GADDE, Ashok VISHNOI, Vasanthakumar RAJAGOPAL, Chandra Kumar RAMASAMY

ACCELERATING NEURAL NETWORKS WITH ONE SHOT SKIP LAYER PRUNING

Accelerating neural networks with one shot skip layer pruning

Method and apparatus for determining memory requirement in a network

EFFICIENT WEIGHT CLIPPING FOR NEURAL NETWORKS

ACCELERATING NEURAL NETWORKS WITH ONE SHOT SKIP LAYER PRUNING

METHOD AND APPARATUS FOR DETERMINING MEMORY REQUIREMENT IN A NETWORK