Patents by Inventor Narayan Ranganathan

Narayan Ranganathan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS, METHODS, AND APPARATUSES FOR HETEROGENEOUS COMPUTING

Publication number: 20260161441

Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.

Type: Application

Filed: December 17, 2025

Publication date: June 11, 2026

Inventors: Rajesh M. SANKARAN, Gilbert NEIGER, Narayan RANGANATHAN, Stephen R. VAN DOREN, Joseph NUZMAN, Niall D. MCDONNELL, Michael A. O'HANLON, Lokpraveen B. MOSUR, Tracy Garrett DRYSDALE, Eriko NURVITADHI, Asit K. MISHRA, Ganesh VENKATESH, Deborah T. MARR, Nicholas P. CARTER, Jonathan D. PEARCE, Edward T. GROCHOWSKI, Richard J. GRECO, Robert VALENTINE, Jesus CORBAL, Thomas D. FLETCHER, Dennis R. BRADFORD, Dwight P. MANLEY, Mark J. CHARNEY, Jeffrey J. COOK, Paul CAPRIOLI, Koichi YAMADA, Kent D. GLOSSOP, David B. SHEFFIELD
Method and apparatus for dynamically adjusting pipeline depth to improve execution latency

Patent number: 12632290

Abstract: Apparatus and method for managing pipeline depth of a data processing device. For example, one embodiment of an accelerator comprises: an interface to receive a plurality of work requests from a plurality of clients; and a plurality of engines to perform the plurality of work requests; wherein the work requests are to be dispatched to the plurality of engines from work queues, the work queues to store a work descriptor per work request, each work descriptor to include information needed to perform a corresponding work request, wherein the work queues include a first and a second work queue to store work descriptors associated with first latency characteristics and second latency characteristics, respectively; engine configuration circuitry to configure a first engine to have a first pipeline depth based on the first latency characteristics and to configure a second engine to have a second pipeline depth based on the second latency characteristics.

Type: Grant

Filed: December 22, 2021

Date of Patent: May 19, 2026

Assignee: Intel Corporation

Inventors: Saurabh Gayen, Dhananjay Joshi, Philip Lantz, Rajesh Sankaran, Narayan Ranganathan
SCALABLE ACCESS CONTROL CHECKING FOR CROSS-ADDRESS-SPACE DATA MOVEMENT

Publication number: 20260104819

Abstract: Methods and apparatus relating to scalable access control checking for cross-address-space data movement are described. In an embodiment, a memory stores an Inter-Domain Permissions Table (IDPT) having a plurality of entries. At least one entry of the IDPT provides a relationship between a target address space identifier and a plurality of requester address space identifiers. A hardware accelerator device allows access to a target address space, corresponding to the target address space identifier, by one or more of requesters, corresponding to the plurality of requester address space identifiers, respectively, based at least in part on the relationship provided by the at least one entry of the IDPT. Other embodiments are also disclosed and claimed.

Type: Application

Filed: October 22, 2025

Publication date: April 16, 2026

Inventors: Narayan Ranganathan, Philip R. Lantz, Rajesh M. Sankaran, Sanjay Kumar, Saurabh Gayen, Nikhil Rao, Utkarsh Y. Kakaiya, Dhananjay A. Joshi, David Jiang, Ashok Raj
METHOD AND APPARATUS FOR HIGH-PERFORMANCE PAGE-FAULT HANDLING FOR MULTI-TENANT SCALABLE ACCELERATORS

Publication number: 20260056828

Abstract: Apparatus and method for high-performance page fault handling. For example, one embodiment of an apparatus comprises: one or more accelerator engines to process work descriptors submitted by clients to a plurality of work queues; fault processing hardware logic associated with the one or more accelerator engines, the fault processing hardware logic to implement a specified page fault handling mode for each work queue of the plurality of work queues, the page fault handling modes including a first page fault handling mode and a second page fault handling mode.

Type: Application

Filed: October 6, 2025

Publication date: February 26, 2026

Inventors: Utkarsh Y. KAKAIYA, Philip LANTZ, Sanjay KUMAR, Rajesh SANKARAN, Narayan RANGANATHAN, Saurabh GAYEN, Dhananjay JOSHI, Nikhil P. RAO
Scalable access control checking for cross-address-space data movement

Patent number: 12455701

Abstract: Methods and apparatus relating to scalable access control checking for cross-address-space data movement are described. In an embodiment, a memory stores an Inter-Domain Permissions Table (IDPT) having a plurality of entries. At least one entry of the IDPT provides a relationship between a target address space identifier and a plurality of requester address space identifiers. A hardware accelerator device allows access to a target address space, corresponding to the target address space identifier, by one or more of requesters, corresponding to the plurality of requester address space identifiers, respectively, based at least in part on the relationship provided by the at least one entry of the IDPT. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: April 1, 2022

Date of Patent: October 28, 2025

Assignee: Intel Corporation

Inventors: Narayan Ranganathan, Philip R. Lantz, Rajesh M. Sankaran, Sanjay Kumar, Saurabh Gayen, Nikhil Rao, Utkarsh Y. Kakaiya, Dhananjay A. Joshi, David Jiang, Ashok Raj
Method and apparatus for high-performance page-fault handling for multi-tenant scalable accelerators

Patent number: 12443477

Abstract: Apparatus and method for high-performance page fault handling. For example, one embodiment of an apparatus comprises: one or more accelerator engines to process work descriptors submitted by clients to a plurality of work queues; fault processing hardware logic associated with the one or more accelerator engines, the fault processing hardware logic to implement a specified page fault handling mode for each work queue of the plurality of work queues, the page fault handling modes including a first page fault handling mode and a second page fault handling mode.

Type: Grant

Filed: December 22, 2021

Date of Patent: October 14, 2025

Assignee: Intel Corporation

Inventors: Utkarsh Y. Kakaiya, Philip Lantz, Sanjay Kumar, Rajesh Sankaran, Narayan Ranganathan, Saurabh Gayen, Dhananjay Joshi, Nikhil P. Rao
Universal Core to Accelerator Communication Architecture

Publication number: 20250199890

Abstract: Methods and apparatus relating to a universal core to accelerator communication architecture for enhanced performance and/or programmability are described. In an embodiment, a sending agent is coupled to a processor core and a receiving agent is coupled to a hardware accelerator device. Memory store data corresponding to a request from the processor core. The sending agent and the receiving agent maintain a communication channel to facilitate communication between the processor core and the hardware accelerator device in response to the request. Other embodiments are also disclosed and claimed.

Type: Application

Filed: March 15, 2022

Publication date: June 19, 2025

Applicant: Intel Corporation

Inventors: Yipeng Wang, Rajesh M. Sankaran, Ren Wang, Narayan Ranganathan, Jr-Shian Tsai, Tsung-Yuan Tai, Heqing Zhu, Ilia Kurakin, Binh Pham, Halit Dogan
Technologies for offload device fetching of address translations

Patent number: 12326816

Abstract: Techniques for offload device address translation fetching are disclosed. In the illustrative embodiment, a processor of a compute device sends a translation fetch descriptor to an offload device before sending a corresponding work descriptor to the offload device. The offload device can request translations for virtual memory address and cache the corresponding physical addresses for later use. While the offload device is fetching virtual address translations, the compute device can perform other tasks before sending the corresponding work descriptor, including operations that modify the contents of the memory addresses whose translation are being cached. Even if the offload device does not cache the translations, the fetching can warm up the cache in a translation lookaside buffer. Such an approach can reduce the latency overhead that the offload device may otherwise incur in sending memory address translation requests that would be required to execute the work descriptor.

Type: Grant

Filed: December 21, 2020

Date of Patent: June 10, 2025

Assignee: Intel Corporation

Inventors: Saurabh Gayen, Philip R. Lantz, Dhananjay A. Joshi, Rupin H. Vakharwala, Rajesh M. Sankaran, Narayan Ranganathan, Sanjay Kumar
SYSTEMS, METHODS, AND APPARATUSES FOR HETEROGENEOUS COMPUTING

Publication number: 20250123881

Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.

Type: Application

Filed: October 25, 2024

Publication date: April 17, 2025

Inventors: Rajesh M. SANKARAN, Gilbert NEIGER, Narayan RANGANATHAN, Stephen R. VAN DOREN, Joseph NUZMAN, Niall D. MCDONNELL, Michael A. O'HANLON, Lokpraveen B. MOSUR, Tracy Garrett DRYSDALE, Eriko NURVITADHI, Asit K. MISHRA, Ganesh VENKATESH, Deborah T. MARR, Nicholas P. CARTER, Jonathan D. PEARCE, Edward T. GROCHOWSKI, Richard J. GRECO, Robert VALENTINE, Jesus CORBAL, Thomas D. FLETCHER, Dennis R. BRADFORD, Dwight P. MANLEY, Mark J. CHARNEY, Jeffry J. COOK, Paul CAPRIOLI, Koichi YAMADA, Kent D. GLOSSOP, David B. SHEFFIELD
Hardware offload circuitry

Patent number: 12197601

Abstract: Examples described herein relate to offload circuitry comprising one or more compute engines that are configurable to perform a workload offloaded from a process executed by a processor based on a descriptor particular to the workload. In some examples, the offload circuitry is configurable to perform the workload, among multiple different workloads. In some examples, the multiple different workloads include one or more of: data transformation (DT) for data format conversion, Locality Sensitive Hashing (LSH) for neural network (NN), similarity search, sparse general matrix-matrix multiplication (SpGEMM) acceleration of hash based sparse matrix multiplication, data encode, data decode, or embedding lookup.

Type: Grant

Filed: December 22, 2021

Date of Patent: January 14, 2025

Assignee: Intel Corporation

Inventors: Ren Wang, Sameh Gobriel, Somnath Paul, Yipeng Wang, Priya Autee, Abhirupa Layek, Shaman Narayana, Edwin Verplanke, Mrittika Ganguli, Jr-Shian Tsai, Anton Sorokin, Suvadeep Banerjee, Abhijit Davare, Desmond Kirkpatrick, Rajesh M. Sankaran, Jaykant B. Timbadiya, Sriram Kabisthalam Muthukumar, Narayan Ranganathan, Nalini Murari, Brinda Ganesh, Nilesh Jain
Systems, methods, and apparatuses for heterogeneous computing

Patent number: 12135981

Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.

Type: Grant

Filed: June 9, 2023

Date of Patent: November 5, 2024

Assignee: Intel Corporation

Inventors: Rajesh M. Sankaran, Gilbert Neiger, Narayan Ranganathan, Stephen R. Van Doren, Joseph Nuzman, Niall D. McDonnell, Michael A. O'Hanlon, Lokpraveen B. Mosur, Tracy Garrett Drysdale, Eriko Nurvitadhi, Asit K. Mishra, Ganesh Venkatesh, Deborah T. Marr, Nicholas P. Carter, Jonathan D. Pearce, Edward T. Grochowski, Richard J. Greco, Robert Valentine, Jesus Corbal, Thomas D. Fletcher, Dennis R. Bradford, Dwight P. Manley, Mark J. Charney, Jeffrey J. Cook, Paul Caprioli, Koichi Yamada, Kent D. Glossop, David B. Sheffield
Method, Computer Program and Computer System for Prefetching Data during Execution of an Application Program

Publication number: 20240281379

Abstract: Examples relate to a non-transitory, computer-readable medium, method, and computer system for prefetching data during executing of a computer program. The non-transitory, computer-readable medium comprises machine-readable instructions that, when the program code is executed on a processor, causes the processor to pre-fetch to a processor cache, using data transfer offloading circuitry of the processor, data being accessed by an application program from a main memory of the computer system, and to execute the computer program using the pre-fetched data that is stored in the processor cache.

Type: Application

Filed: November 28, 2022

Publication date: August 22, 2024

Inventors: Narayan RANGANATHAN, Vamsi SRIPATHI
System, apparatus and method for accessing multiple address spaces via a data mover

Patent number: 12045640

Abstract: In one embodiment, a data mover accelerator is to receive, from a first agent having a first address space and a first process address space identifier (PASID) to identify the first address space, a first job descriptor comprising a second PASID selector to specify a second PASID to identify a second address space. In response to the first job descriptor, the data mover accelerator is to securely access the first address space and the second address space. Other embodiments are described and claimed.

Type: Grant

Filed: June 23, 2020

Date of Patent: July 23, 2024

Assignee: Intel Corporation

Inventors: Sanjay K. Kumar, Philip Lantz, Rajesh Sankaran, Narayan Ranganathan, Saurabh Gayen, David A. Koufaty, Utkarsh Y. Kakaiya
Method, Computer Program, and Computer System for Prefetching Data During Execution of an Application Program

Publication number: 20240143507

Abstract: Examples relate to a non-transitory, computer-readable medium, method, and computer system for prefetching data during executing of a computer program. The non-transitory, computer-readable medium comprises machine-readable instructions that, when the program code is executed on a processor, causes the processor to pre-fetch to a processor cache, using data transfer offloading circuitry of the processor, data being accessed by an application program from a main memory of the computer system, and to execute the computer program using the pre-fetched data that is stored in the processor cache.

Type: Application

Filed: November 28, 2022

Publication date: May 2, 2024

Inventors: Narayan RANGANATHAN, Vamsi SRIPATHI
DATA STREAMING ACCELERATOR

Publication number: 20240054011

Abstract: Methods and apparatus relating to data streaming accelerators are described. In an embodiment, a hardware accelerator such as a Data Streaming Accelerator (DSA) logic circuitry performs data movement and/or data transformation for data to be transferred between a processor (having one or more processor cores) and a storage device. Other embodiments are also disclosed and claimed.

Type: Application

Filed: August 12, 2023

Publication date: February 15, 2024

Applicant: Intel Corporation

Inventors: Rajesh M. Sankaran, Philip R. Lantz, Narayan Ranganathan, Saurabh Gayen, Sanjay Kumar, Nikhil Rao, Dhananjay A. Joshi, Hai Ming Khor, Utkarsh Y. Kakaiya
SYSTEMS, METHODS, AND APPARATUSES FOR HETEROGENEOUS COMPUTING

Publication number: 20230418655

Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.

Type: Application

Filed: June 9, 2023

Publication date: December 28, 2023

Inventors: Rajesh M. SANKARAN, Gilbert NEIGER, Narayan RANGANATHAN, Stephen R. VAN DOREN, Joseph NUZMAN, Niall D. MCDONNELL, Michael A. O'HANLON, Lokpraveen B. MOSUR, Tracy Garrett DRYSDALE, Eriko NURVITADHI, Asit K. MISHRA, Ganesh VENKATESH, Deborah T. MARR, Nicholas P. CARTER, Jonathan D. PEARCE, Edward T. GROCHOWSKI, Richard J. GRECO, Robert VALENTINE, Jesus CORBAL, Thomas D. FLETCHER, Dennis R. BRADFORD, Dwight P. MANLEY, Mark J. CHARNEY, Jeffrey J. COOK, Paul CAPRIOLI, Koichi YAMADA, Kent D. GLOSSOP, David B. SHEFFIELD
Systems, methods, and apparatuses for heterogeneous computing

Patent number: 11693691

Abstract: Embodiments of systems, methods, and apparatuses for heterogeneous computing are described. In some embodiments, a hardware heterogeneous scheduler dispatches instructions for execution on one or more plurality of heterogeneous processing elements, the instructions corresponding to a code fragment to be processed by the one or more of the plurality of heterogeneous processing elements, wherein the instructions are native instructions to at least one of the one or more of the plurality of heterogeneous processing elements.

Type: Grant

Filed: July 21, 2021

Date of Patent: July 4, 2023

Assignee: Intel Corporation

Inventors: Rajesh M. Sankaran, Gilbert Neiger, Narayan Ranganathan, Stephen R. Van Doren, Joseph Nuzman, Niall D. McDonnell, Michael A. O'Hanlon, Lokpraveen B. Mosur, Tracy Garrett Drysdale, Eriko Nurvitadhi, Asit K. Mishra, Ganesh Venkatesh, Deborah T. Marr, Nicholas P. Carter, Jonathan D. Pearce, Edward T. Grochowski, Richard J. Greco, Robert Valentine, Jesus Corbal, Thomas D. Fletcher, Dennis R. Bradford, Dwight P. Manley, Mark J. Charney, Jeffrey J. Cook, Paul Caprioli, Koichi Yamada, Kent D. Glossop, David B. Sheffield
DYNAMIC CAPABILITY DISCOVERY AND ENFORCEMENT FOR ACCELERATORS AND DEVICES IN MULTI-TENANT SYSTEMS

Publication number: 20230185603

Abstract: Methods and apparatus relating to dynamic capability discovery and enforcement for accelerators and devices in multi-tenant systems are described. In an embodiment, a hardware accelerator device advertises one or more available operations and/or capabilities of the hardware accelerator device to one or more tenants. Logic circuitry controls access to the one or more available operations and/or capabilities of the one or more work queues on a per-tenant basis. Other embodiments are also disclosed and claimed.

Type: Application

Filed: December 14, 2021

Publication date: June 15, 2023

Applicant: Intel Corporation

Inventors: Saurabh Gayen, Philip Lantz, Narayan Ranganathan, Dhananjay Joshi, Rajesh Sankaran, Utkarsh Kakaiya
Method and apparatus for remote field programmable gate array processing

Patent number: 11675326

Abstract: In one embodiment, an apparatus comprises a fabric controller of a first computing node. The fabric controller is to receive, from a second computing node via a network fabric that couples the first computing node to the second computing node, a request to execute a kernel on a field-programmable gate array (FPGA) of the first computing node; instruct the FPGA to execute the kernel; and send a result of the execution of the kernel to the second computing node via the network fabric.

Type: Grant

Filed: May 26, 2021

Date of Patent: June 13, 2023

Assignee: Intel Corporation

Inventors: Nicolas A. Salhuana, Karthik Kumar, Thomas Willhalm, Francesc Guim Bernat, Narayan Ranganathan
LEVERAGING AN ACCELERATOR DEVICE TO ACCELERATE HASH TABLE LOOKUPS

Publication number: 20230126783

Abstract: A processor may determine, based on a length of an input key, whether to compute a hash value based on the input key or cause an accelerator device coupled to the processor to compute the hash value based on the input key. The processor may cause a hash table lookup operation to be performed based on the hash value.

Type: Application

Filed: December 27, 2022

Publication date: April 27, 2023

Applicant: Intel Corporation

Inventors: JIAYU HU, REN WANG, XUAN DING, CHENG JIANG, CUNMING LIANG, NARAYAN RANGANATHAN, RAJESH SANKARAN, XIAO WANG

1 2 3 4 5 next