Patents by Inventor Dipankar Das
Dipankar Das has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12106210
Abstract: One embodiment provides for a machine-learning accelerator device comprising a multiprocessor to execute parallel threads of an instruction stream, the multiprocessor including a compute unit, the compute unit including a set of functional units, each functional unit to execute at least one of the parallel threads of the instruction stream. The compute unit includes compute logic configured to execute a single instruction to scale an input tensor associated with a layer of a neural network according to a scale factor, the input tensor stored in a floating-point data type, the compute logic to scale the input tensor to enable a data distribution of data of the input tensor to be represented by a 16-bit floating point data type.
Type: Grant
Filed: August 25, 2023
Date of Patent: October 1, 2024
Assignee: Intel Corporation
Inventors: Naveen Mellempudi, Dipankar Das
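As a rough software analogue of the scaling instruction this abstract describes, the NumPy sketch below scales a float32 tensor by a max-abs-derived factor so it survives the cast to a 16-bit float. The max-abs heuristic and the name scale_to_fp16 are illustrative assumptions, and IEEE float16 stands in for whichever 16-bit type the hardware targets.

```python
import numpy as np

def scale_to_fp16(tensor: np.ndarray):
    """Scale a float32 tensor so its largest magnitude fits the 16-bit
    float range, then downcast; return the tensor and the scale factor."""
    fp16_max = float(np.finfo(np.float16).max)    # 65504.0
    abs_max = float(np.abs(tensor).max())
    scale = 1.0 if abs_max == 0.0 else fp16_max / abs_max
    return (tensor * scale).astype(np.float16), scale

x = (np.random.randn(4, 4) * 1e6).astype(np.float32)  # overflows fp16 if cast raw
x16, s = scale_to_fp16(x)
x_back = x16.astype(np.float32) / s                   # approximate recovery
```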
-
Publication number: 20240320000
Abstract: An apparatus to facilitate utilizing structured sparsity in systolic arrays is disclosed. The apparatus includes a processor comprising a systolic array to receive data from a plurality of source registers, the data comprising unpacked source data, structured source data that is packed based on sparsity, and metadata corresponding to the structured source data; identify portions of the unpacked source data to multiply with the structured source data, the portions of the unpacked source data identified based on the metadata; and output, to a destination register, a result of multiplication of the portions of the unpacked source data and the structured source data.
Type: Application
Filed: March 29, 2024
Publication date: September 26, 2024
Applicant: Intel Corporation
Inventors: Subramaniam Maiyuran, Jorge Parra, Ashutosh Garg, Chandra Gurram, Chunhui Mei, Durgesh Borkar, Shubra Marwaha, Supratim Pal, Varghese George, Wei Xiong, Yan Li, Yongsheng Liu, Dipankar Das, Sasikanth Avancha, Dharma Teja Vooturi, Naveen K. Mellempudi
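To make the packed-data-plus-metadata idea concrete, here is a small NumPy sketch using 2:4 structured sparsity, a common structured-sparsity scheme. The 2:4 choice and the helper names are assumptions for illustration, not details taken from the filing.

```python
import numpy as np

def pack_2to4(dense_row):
    """Keep the 2 largest-magnitude values in each group of 4 (2:4
    structured sparsity); return packed values plus index metadata."""
    vals, idxs = [], []
    for g in range(0, len(dense_row), 4):
        group = dense_row[g:g + 4]
        keep = np.argsort(np.abs(group))[-2:]
        keep.sort()
        vals.extend(group[keep])
        idxs.extend(keep + g)
    return np.array(vals), np.array(idxs)

def sparse_dot(packed_vals, metadata, dense_vec):
    """Multiply only against the dense elements the metadata selects,
    as a sparsity-aware systolic array would."""
    return np.dot(packed_vals, dense_vec[metadata])

a = np.random.randn(8).astype(np.float32)
b = np.random.randn(8).astype(np.float32)
vals, meta = pack_2to4(a)
approx = sparse_dot(vals, meta, b)   # half the multiplies of np.dot(a, b)
```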
-
Patent number: 12099184
Abstract: A system for detecting and cleaning contaminants from an imaging optical path, comprising an imaging device configured to receive a slide and capture a first slide image, at least a computing device configured to determine a contaminant presence indicator associated with a contaminant within an optical path of the imaging device based on the first slide image and execute a contaminant cleaning protocol as a function of the contaminant presence indicator, a contaminant removal mechanism configured to remove the contaminant from the optical path according to the contaminant cleaning protocol, wherein the computing device is further configured to re-evaluate the contaminant presence indicator based on a second slide image of the slide captured using the imaging device and request a user input upon a positive re-evaluation of the contaminant presence indicator.
Type: Grant
Filed: October 20, 2023
Date of Patent: September 24, 2024
Inventors: Prasanth Perugupalli, Ajay Chadha, Vinothkumar Anbalagan, Rohan Prateek, Shilpa G Krishna, Dipankar Das, Somesh Singh
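The detect-clean-re-evaluate flow in this abstract amounts to a small control loop. The sketch below is a hypothetical rendering of that protocol; capture_image, detect_contaminant, and run_cleaning are stand-ins for the real imaging and cleaning hardware.

```python
def clean_optical_path(capture_image, detect_contaminant, run_cleaning,
                       max_attempts=3):
    """Hypothetical sketch of the cleaning protocol: detect on a first
    image, clean, re-evaluate on a second image, and escalate to the
    user if the contaminant indicator stays positive."""
    if not detect_contaminant(capture_image()):       # first slide image
        return "clear"
    for _ in range(max_attempts):
        run_cleaning()                                # removal mechanism
        if not detect_contaminant(capture_image()):   # second slide image
            return "cleaned"
    return "user input required"                      # positive re-evaluation

# Trivial usage with stubbed hardware callables:
result = clean_optical_path(lambda: "img", lambda img: False, lambda: None)
```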
-
Patent number: 12033237
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising a hardware processing unit having a dynamic precision fixed-point unit that is configurable to convert the elements of a floating-point tensor, converting the floating-point tensor into a fixed-point tensor.
Type: Grant
Filed: April 24, 2023
Date of Patent: July 9, 2024
Assignee: Intel Corporation
Inventors: Naveen K. Mellempudi, Dheevatsa Mudigere, Dipankar Das, Srinivas Sridharan
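A software sketch of what dynamic-precision fixed-point conversion can look like: pick the fractional-bit count from the tensor's magnitude, then quantize onto the resulting grid. The 16-bit word size and the max-abs rule are assumptions for illustration, not the unit's actual policy.

```python
import numpy as np

def to_dynamic_fixed_point(tensor, total_bits=16):
    """Choose fractional bits so the tensor's max magnitude fits a
    signed fixed-point word, then round onto that grid."""
    abs_max = float(np.abs(tensor).max())
    int_bits = max(0, int(np.ceil(np.log2(abs_max + 1e-12))) + 1)  # +sign
    frac_bits = total_bits - int_bits
    scale = 2.0 ** frac_bits
    q = np.clip(np.round(tensor * scale),
                -2 ** (total_bits - 1), 2 ** (total_bits - 1) - 1)
    return q.astype(np.int16), frac_bits

x = (np.random.randn(3, 3) * 5.0).astype(np.float32)
q, fb = to_dynamic_fixed_point(x)
x_back = q.astype(np.float32) / (2.0 ** fb)   # dequantized approximation
```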
-
Publication number: 20240160931
Abstract: One embodiment provides for a computer-readable medium storing instructions that cause one or more processors to perform operations comprising determining a per-layer scale factor to apply to tensor data associated with layers of a neural network model and converting the tensor data to converted tensor data. The tensor data may be converted from a floating point datatype to a second datatype that is an 8-bit datatype. The instructions further cause the one or more processors to generate an output tensor based on the converted tensor data and the per-layer scale factor.
Type: Application
Filed: December 7, 2023
Publication date: May 16, 2024
Applicant: Intel Corporation
Inventors: Abhisek Kundu, Naveen Mellempudi, Dheevatsa Mudigere, Dipankar Das
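A minimal sketch of per-layer symmetric quantization to an 8-bit datatype, assuming a max-abs calibration rule (one plausible choice; the abstract does not specify how the scale factor is derived):

```python
import numpy as np

def quantize_layer_int8(tensor):
    """One scale factor for the whole layer tensor: map the largest
    magnitude to 127 and round everything else onto the int8 grid."""
    abs_max = float(np.abs(tensor).max())
    scale = abs_max / 127.0 if abs_max > 0 else 1.0
    return np.round(tensor / scale).astype(np.int8), scale

def int8_layer_output(x_q, w_q, x_scale, w_scale):
    """Accumulate in int32, then apply the per-layer scales to produce
    the floating-point output tensor."""
    acc = x_q.astype(np.int32) @ w_q.astype(np.int32)
    return acc.astype(np.float32) * (x_scale * w_scale)

w_q, w_s = quantize_layer_int8(np.random.randn(64, 16).astype(np.float32))
x_q, x_s = quantize_layer_int8(np.random.randn(8, 64).astype(np.float32))
out = int8_layer_output(x_q, w_q, x_s, w_s)
```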
-
Patent number: 11977885
Abstract: An apparatus to facilitate utilizing structured sparsity in systolic arrays is disclosed. The apparatus includes a processor comprising a systolic array to receive data from a plurality of source registers, the data comprising unpacked source data, structured source data that is packed based on sparsity, and metadata corresponding to the structured source data; identify portions of the unpacked source data to multiply with the structured source data, the portions of the unpacked source data identified based on the metadata; and output, to a destination register, a result of multiplication of the portions of the unpacked source data and the structured source data.
Type: Grant
Filed: November 30, 2020
Date of Patent: May 7, 2024
Assignee: Intel Corporation
Inventors: Subramaniam Maiyuran, Jorge Parra, Ashutosh Garg, Chandra Gurram, Chunhui Mei, Durgesh Borkar, Shubra Marwaha, Supratim Pal, Varghese George, Wei Xiong, Yan Li, Yongsheng Liu, Dipankar Das, Sasikanth Avancha, Dharma Teja Vooturi, Naveen K. Mellempudi
-
Publication number: 20240126544
Abstract: Disclosed embodiments relate to instructions for fused multiply-add (FMA) operations with variable-precision inputs. In one example, a processor to execute an asymmetric FMA instruction includes fetch circuitry to fetch an FMA instruction having fields to specify an opcode, a destination, and first and second source vectors having first and second widths, respectively, decode circuitry to decode the fetched FMA instruction, and a single instruction multiple data (SIMD) execution circuit to process as many elements of the second source vector as fit into an SIMD lane width by multiplying each element by a corresponding element of the first source vector, and accumulating a resulting product with previous contents of the destination, wherein the SIMD lane width is one of 16 bits, 32 bits, and 64 bits, the first width is one of 4 bits and 8 bits, and the second width is one of 1 bit, 2 bits, and 4 bits.
Type: Application
Filed: December 28, 2023
Publication date: April 18, 2024
Inventors: Dipankar Das, Naveen K. Mellempudi, Mrinmay Dutta, Arun Kumar, Dheevatsa Mudigere, Abhisek Kundu
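To illustrate what an asymmetric multiply looks like in software, the sketch below unpacks signed 2-bit lanes from packed bytes and accumulates their products with int8 elements into a wider destination. The packing convention and helper names are assumptions, not the instruction's actual encoding.

```python
import numpy as np

def unpack_2bit(packed: np.ndarray) -> np.ndarray:
    """Unpack four signed 2-bit lanes (values -2..1) from each uint8."""
    shifts = np.array([0, 2, 4, 6], dtype=np.uint8)
    lanes = (packed[:, None] >> shifts) & 0b11
    return np.where(lanes >= 2, lanes.astype(np.int8) - 4, lanes).ravel()

def asymmetric_fma(dest, src1_int8, src2_packed):
    """dest += src1 * src2, with the 8-bit and 2-bit sources widened
    to int32 before the multiply-accumulate, as a SIMD lane would."""
    src2 = unpack_2bit(src2_packed).astype(np.int32)
    return dest + src1_int8.astype(np.int32) * src2

dest = np.zeros(8, dtype=np.int32)
src1 = np.arange(8, dtype=np.int8)
src2 = np.array([0b01_11_00_10, 0b11_01_10_00], dtype=np.uint8)
out = asymmetric_fma(dest, src1, src2)   # eight lanes of int8 x int2
```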
-
Publication number: 20240118892
Abstract: Methods and apparatuses relating to processing neural networks are described. In one embodiment, an apparatus to process a neural network includes a plurality of fully connected layer chips coupled by an interconnect; a plurality of convolutional layer chips each coupled by an interconnect to a respective fully connected layer chip of the plurality of fully connected layer chips and each of the plurality of fully connected layer chips and the plurality of convolutional layer chips including an interconnect to couple each of a forward propagation compute intensive tile, a back propagation compute intensive tile, and a weight gradient compute intensive tile of a column of compute intensive tiles between a first memory intensive tile and a second memory intensive tile.
Type: Application
Filed: December 18, 2023
Publication date: April 11, 2024
Inventors: Swagath Venkataramani, Dipankar Das, Ashish Ranjan, Subarno Banerjee, Sasikanth Avancha, Ashok Jagannathan, Ajaya V. Durg, Dheemanth Nagaraj, Bharat Kaul, Anand Raghunathan
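The chip organization reads more easily as a data structure. Below is a hypothetical sketch of the hierarchy the abstract names; the class names and tile labels are illustrative, not from the filing.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class ComputeColumn:
    """A column of compute-intensive tiles: forward propagation (FP),
    back propagation (BP), and weight gradient (WG)."""
    tiles: List[str] = field(default_factory=lambda: ["FP", "BP", "WG"])

@dataclass
class LayerChip:
    """A convolutional or fully connected layer chip; each compute
    column sits between two memory-intensive tiles on an interconnect."""
    kind: str                                   # "conv" or "fc"
    mem_tiles: List[str] = field(default_factory=lambda: ["MEM0", "MEM1"])
    columns: List[ComputeColumn] = field(
        default_factory=lambda: [ComputeColumn()])

# One fully connected chip coupled to its convolutional chips:
fc = LayerChip(kind="fc")
convs = [LayerChip(kind="conv") for _ in range(2)]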
-
Publication number: 20240070799
Abstract: One embodiment provides for a method of transmitting data between multiple compute nodes of a distributed compute system, the method comprising creating a global view of communication operations to be performed between the multiple compute nodes of the distributed compute system, the global view created using information specific to a machine learning model associated with the distributed compute system; using the global view to determine a communication cost of the communication operations; and automatically determining a number of network endpoints for use in transmitting the data between the multiple compute nodes of the distributed compute system.
Type: Application
Filed: September 5, 2023
Publication date: February 29, 2024
Applicant: Intel Corporation
Inventors: Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Srinivas Sridharan, Dipankar Das
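A hypothetical sketch of how a global view might drive the endpoint choice: sum the model's per-layer gradient traffic, model the bandwidth/overhead trade-off, and pick the endpoint count with the lowest cost. The ring-allreduce cost formula and the constants are assumptions for illustration.

```python
def communication_cost(grad_bytes, nodes, endpoints,
                       link_bw=12.5e9, endpoint_overhead=5e-5):
    """Bandwidth term: a ring allreduce moves 2*(n-1)/n of the data,
    split across endpoints. Overhead term: each endpoint adds setup cost."""
    volume = 2 * (nodes - 1) / nodes * sum(grad_bytes)
    return volume / (endpoints * link_bw) + endpoints * endpoint_overhead

def choose_endpoints(grad_bytes, nodes, candidates=(1, 2, 4, 8)):
    """Automatically pick the endpoint count with the lowest modeled cost."""
    return min(candidates,
               key=lambda e: communication_cost(grad_bytes, nodes, e))

layer_grad_bytes = [4_000_000, 16_000_000, 1_000_000]  # from the model
best = choose_endpoints(layer_grad_bytes, nodes=8)
```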
-
Patent number: 11900107
Abstract: Disclosed embodiments relate to instructions for fused multiply-add (FMA) operations with variable-precision inputs. In one example, a processor to execute an asymmetric FMA instruction includes fetch circuitry to fetch an FMA instruction having fields to specify an opcode, a destination, and first and second source vectors having first and second widths, respectively, decode circuitry to decode the fetched FMA instruction, and a single instruction multiple data (SIMD) execution circuit to process as many elements of the second source vector as fit into an SIMD lane width by multiplying each element by a corresponding element of the first source vector, and accumulating a resulting product with previous contents of the destination, wherein the SIMD lane width is one of 16 bits, 32 bits, and 64 bits, the first width is one of 4 bits and 8 bits, and the second width is one of 1 bit, 2 bits, and 4 bits.
Type: Grant
Filed: March 25, 2022
Date of Patent: February 13, 2024
Assignee: Intel Corporation
Inventors: Dipankar Das, Naveen K. Mellempudi, Mrinmay Dutta, Arun Kumar, Dheevatsa Mudigere, Abhisek Kundu
-
Patent number: 11893490
Abstract: One embodiment provides for a computer-readable medium storing instructions that cause one or more processors to perform operations comprising determining a per-layer scale factor to apply to tensor data associated with layers of a neural network model and converting the tensor data to converted tensor data. The tensor data may be converted from a floating point datatype to a second datatype that is an 8-bit datatype. The instructions further cause the one or more processors to generate an output tensor based on the converted tensor data and the per-layer scale factor.
Type: Grant
Filed: November 30, 2022
Date of Patent: February 6, 2024
Assignee: Intel Corporation
Inventors: Abhisek Kundu, Naveen Mellempudi, Dheevatsa Mudigere, Dipankar Das
-
Publication number: 20230409891
Abstract: One embodiment provides for a machine-learning accelerator device comprising a multiprocessor to execute parallel threads of an instruction stream, the multiprocessor including a compute unit, the compute unit including a set of functional units, each functional unit to execute at least one of the parallel threads of the instruction stream. The compute unit includes compute logic configured to execute a single instruction to scale an input tensor associated with a layer of a neural network according to a scale factor, the input tensor stored in a floating-point data type, the compute logic to scale the input tensor to enable a data distribution of data of the input tensor to be represented by a 16-bit floating point data type.
Type: Application
Filed: August 25, 2023
Publication date: December 21, 2023
Applicant: Intel Corporation
Inventors: Naveen Mellempudi, Dipankar Das
-
Publication number: 20230376762
Abstract: Embodiments described herein provide an apparatus comprising an interconnect switch configured to couple with a plurality of graphics processors via a plurality of point-to-point interconnects and one or more processors including a graphics processor coupled with the interconnect switch via a point-to-point interconnect of the plurality of point-to-point interconnects.
Type: Application
Filed: May 19, 2023
Publication date: November 23, 2023
Applicant: Intel Corporation
Inventors: Srinivas Sridharan, Karthikeyan Vaidyanathan, Dipankar Das, Chandrasekaran Sakthivel, Mikhail E. Smorkalov
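As a toy illustration of why a switch helps, point-to-point links through a hub grow linearly with GPU count, while direct all-to-all wiring grows quadratically. This is a general topology observation, not a detail from the filing.

```python
def switch_links(num_gpus):
    """One point-to-point link per GPU into the interconnect switch."""
    return [("switch", f"gpu{i}") for i in range(num_gpus)]

def direct_links(num_gpus):
    """Fully connected alternative: one link for every GPU pair."""
    return [(f"gpu{i}", f"gpu{j}")
            for i in range(num_gpus) for j in range(i + 1, num_gpus)]

print(len(switch_links(8)), len(direct_links(8)))   # 8 vs. 28
```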
-
Patent number: 11823034
Abstract: A graphics processor is described that includes a single instruction, multiple thread (SIMT) architecture including hardware multithreading. The multiprocessor can execute parallel threads of instructions associated with a command stream, where the multiprocessor includes a set of functional units to execute at least one of the parallel threads of the instructions. The set of functional units can include a mixed precision tensor processor to perform tensor computations to generate loss data. The loss data is stored as a first floating-point data type and scaled by a scaling factor to enable a data distribution of a gradient tensor generated based on the loss data to be represented by a second floating point data type.
Type: Grant
Filed: October 6, 2022
Date of Patent: November 21, 2023
Assignee: Intel Corporation
Inventors: Naveen Mellempudi, Dipankar Das
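The numeric effect of the scaling factor is easy to see in software. In this NumPy sketch (IEEE float16 standing in for the hardware's 16-bit type), a small gradient value underflows unless the loss-derived data is scaled up first and unscaled afterward:

```python
import numpy as np

grad = np.float32(2e-8)                  # below the float16 subnormal range
naive = np.float16(grad)                 # rounds to 0.0: information lost
scale = np.float32(1024.0)
scaled = np.float16(grad * scale)        # ~2.05e-5, representable in float16
recovered = np.float32(scaled) / scale   # ~2e-8 after unscaling
print(naive, recovered)
```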
-
Publication number: 20230351542
Abstract: One embodiment provides for a graphics processing unit to perform computations associated with a neural network, the graphics processing unit comprising a hardware processing unit having a dynamic precision fixed-point unit that is configurable to convert the elements of a floating-point tensor, converting the floating-point tensor into a fixed-point tensor.
Type: Application
Filed: April 24, 2023
Publication date: November 2, 2023
Applicant: Intel Corporation
Inventors: Naveen K. Mellempudi, Dheevatsa Mudigere, Dipankar Das, Srinivas Sridharan
-
Patent number: 11798120
Abstract: One embodiment provides for a method of transmitting data between multiple compute nodes of a distributed compute system, the method comprising creating a global view of communication operations to be performed between the multiple compute nodes of the distributed compute system, the global view created using information specific to a machine learning model associated with the distributed compute system; using the global view to determine a communication cost of the communication operations; and automatically determining a number of network endpoints for use in transmitting the data between the multiple compute nodes of the distributed compute system.
Type: Grant
Filed: August 10, 2021
Date of Patent: October 24, 2023
Assignee: Intel Corporation
Inventors: Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Srinivas Sridharan, Dipankar Das
-
Patent number: 11768681
Abstract: An apparatus and method for performing multiply-accumulate operations.
Type: Grant
Filed: January 24, 2018
Date of Patent: September 26, 2023
Assignee: Intel Corporation
Inventors: Alexander Heinecke, Dipankar Das, Robert Valentine, Mark Charney
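For context, the multiply-accumulate primitive is simply acc += a * b; a dot product is a chain of such steps, which is what dedicated FMA hardware executes per cycle. A trivial Python rendering:

```python
def multiply_accumulate(acc, a, b):
    """One MAC step: accumulate the product into the running total."""
    return acc + a * b

acc = 0
for a, b in zip([1, 2, 3], [4, 5, 6]):
    acc = multiply_accumulate(acc, a, b)   # 1*4 + 2*5 + 3*6 = 32
```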
-
Patent number: 11704565
Abstract: Embodiments described herein provide a system to configure distributed training of a neural network, the system comprising memory to store a library to facilitate data transmission during distributed training of the neural network; a network interface to enable transmission and receipt of configuration data associated with a set of worker nodes, the worker nodes configured to perform distributed training of the neural network; and a processor to execute instructions provided by the library. The instructions cause the processor to create one or more groups of the worker nodes, the one or more groups of worker nodes to be created based on a communication pattern for messages to be transmitted between the worker nodes during distributed training of the neural network. The processor can transparently adjust communication paths between worker nodes based on the communication pattern.
Type: Grant
Filed: March 3, 2022
Date of Patent: July 18, 2023
Assignee: Intel Corporation
Inventors: Srinivas Sridharan, Karthikeyan Vaidyanathan, Dipankar Das, Chandrasekaran Sakthivel, Mikhail E. Smorkalov
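A hypothetical sketch of the grouping step: partition workers so collectives run inside each group first and then across group leaders, which is one common way a communication pattern maps onto worker groups. The function names are illustrative, not the library's API.

```python
def group_workers(num_workers, group_size):
    """Partition worker ranks into contiguous groups."""
    return [list(range(i, min(i + group_size, num_workers)))
            for i in range(0, num_workers, group_size)]

groups = group_workers(num_workers=16, group_size=4)
leaders = [g[0] for g in groups]   # inter-group communication path
# e.g. allreduce inside each group, then allreduce across the leaders,
# then broadcast from each leader back into its group.
```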
-
Patent number: 11681529
Abstract: Systems, methods, and apparatuses relating to access synchronization in a shared memory are described. In one embodiment, a processor includes a decoder to decode an instruction into a decoded instruction, and an execution unit to execute the decoded instruction to: receive a first input operand of a memory address to be tracked and a second input operand of an allowed sequence of memory accesses to the memory address, and cause a block of a memory access that violates the allowed sequence of memory accesses to the memory address. In one embodiment, a circuit separate from the execution unit compares a memory address for a memory access request to one or more memory addresses in a tracking table, and blocks a memory access for the memory access request when a type of access violates a corresponding allowed sequence of memory accesses to the memory address for the memory access request.
Type: Grant
Filed: August 24, 2021
Date of Patent: June 20, 2023
Assignee: Intel Corporation
Inventors: Swagath Venkataramani, Dipankar Das, Sasikanth Avancha, Ashish Ranjan, Subarno Banerjee, Bharat Kaul, Anand Raghunathan
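A small software model of the tracking-table behavior the abstract describes, with "R"/"W" access kinds and a per-address allowed sequence. The class and its encoding are assumptions for illustration.

```python
class AccessTracker:
    def __init__(self):
        self.table = {}            # address -> remaining allowed sequence

    def track(self, addr, allowed_sequence):
        self.table[addr] = list(allowed_sequence)

    def access(self, addr, kind):
        """Allow an access only if it matches the next entry in the
        tracked sequence; block anything out of order."""
        seq = self.table.get(addr)
        if seq is None:
            return True                  # untracked address: always allowed
        if seq and seq[0] == kind:
            seq.pop(0)                   # matches the allowed order
            return True
        return False                     # violating access is blocked

t = AccessTracker()
t.track(0x1000, "WRR")                   # one write, then two reads
assert t.access(0x1000, "W") and t.access(0x1000, "R")
assert not t.access(0x1000, "W")         # out-of-order write blocked
```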
-
Publication number: 20230177328
Abstract: One embodiment provides for a graphics processing unit including a fabric interface configured to transmit gradient data stored in a memory device of the graphics processing unit according to a pre-defined communication operation. The memory device is a physical memory device shared with a compute block of the graphics processing unit and the fabric interface. The fabric interface automatically transmits the gradient data stored in memory to a second distributed training node based on an address of the gradient data in the memory device.
Type: Application
Filed: October 25, 2022
Publication date: June 8, 2023
Applicant: Intel Corporation
Inventors: Srinivas Sridharan, Karthikeyan Vaidyanathan, Dipankar Das
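A hypothetical software rendering of the address-triggered transmission: gradient buffers are registered by address, and completion of a write at that address triggers the send without further software on the critical path. send() stands in for the real transport; none of these names come from the filing.

```python
class FabricInterface:
    def __init__(self, send):
        self.send = send
        self.registered = {}     # address -> (buffer, peer_node)

    def register_gradient(self, address, buffer, peer_node):
        """Pre-define the communication: this buffer, at this address,
        goes to this peer when written."""
        self.registered[address] = (buffer, peer_node)

    def on_write_complete(self, address):
        """Compute block signals a finished gradient write; the fabric
        interface transmits automatically, keyed by the address."""
        buffer, peer = self.registered[address]
        self.send(peer, buffer)

fi = FabricInterface(send=lambda peer, buf: print(f"-> node {peer}: {len(buf)} bytes"))
fi.register_gradient(0x2000, bytes(1024), peer_node=1)
fi.on_write_complete(0x2000)
```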