Patents by Inventor Olivier Giroux

Olivier Giroux has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Techniques for efficiently synchronizing multiple program threads

Patent number: 12271765

Abstract: Various embodiments include a parallel processing computer system that enables parallel instances of a program to synchronize at disparate addresses in memory. When the parallel program instances need to exchange data, the program instances synchronize based on a mask that identifies the program instances that are synchronizing. As each program instance reaches the point of synchronization, the program instance blocks and waits for all other program instances to reach the point of synchronization. When all program instances have reached the point of synchronization, at least one program instance executes a synchronous operation to exchange data. The program instances then continue execution at respective and disparate return addresses.

Type: Grant

Filed: June 3, 2021

Date of Patent: April 8, 2025

Assignee: NVIDIA CORPORATION

Inventors: Ajay Sudarshan Tirumala, Olivier Giroux, Peter Nelson, Gary M. Tarolli, Ankita Upreti, Konstantinos Kyriakopoulos, Divya Shanmughan, Rishkul Kulkarni
Distributed shared memory

Patent number: 12248788

Abstract: Distributed shared memory (DSMEM) comprises blocks of memory that are distributed or scattered across a processor (such as a GPU). Threads executing on a processing core local to one memory block are able to access a memory block local to a different processing core. In one embodiment, shared access to these DSMEM allocations distributed across a collection of processing cores is implemented by communications between the processing cores. Such distributed shared memory provides very low latency memory access for processing cores located in proximity to the memory blocks, and also provides a way for more distant processing cores to also access the memory blocks in a manner and using interconnects that do not interfere with the processing cores' access to main or global memory such as hacked by an L2 cache.

Type: Grant

Filed: March 10, 2022

Date of Patent: March 11, 2025

Assignee: NVIDIA Corporation

Inventors: Prakash Bangalore Prabhakar, Gentaro Hirota, Ronny Krashinsky, Ze Long, Brian Pharris, Rajballav Dash, Jeff Tuckey, Jerome F. Duluk, Jr., Lacky Shah, Luke Durant, Jack Choquette, Eric Werness, Naman Govil, Manan Patel, Shayani Deb, Sandeep Navada, John Edmondson, Greg Palmer, Wish Gandhi, Ravi Manyam, Apoorv Parle, Olivier Giroux, Shirish Gadre, Steve Heinrich
SELF-SYNCHRONIZING REMOTE MEMORY OPERATIONS IN A MULTIPROCESSOR SYSTEM

Publication number: 20240393951

Abstract: Various embodiments include techniques for performing self-synchronizing remote memory operations in a multiprocessor computing system. During a remote memory operation in the multiprocessor computing system, a source processing unit transmits multiple segments of data to a destination processing. For each segment of data, the source processing unit transmits a remote memory operation to the destination processing unit that includes associated metadata that identifies the memory location of a corresponding synchronization object. The remote memory operation along with the metadata is transmitted as a single unit to the destination processing unit. The destination processing unit splits the operation into the remote memory operation and the memory synchronization operation. As a result, the source processing unit avoids the need to perform a separate memory synchronization operation, thereby reducing inter-processor communications and increasing performance of remote memory operations.

Type: Application

Filed: July 10, 2024

Publication date: November 28, 2024

Inventors: Srinivas Santosh Kumar MADUGULA, Olivier GIROUX, Wishwesh Anil GANDHI, Michael Allen PARKER, Raghuram L, Ivan TANASIC, Manan PATEL, Mark HUMMEL, Alexander L. MINKIN
SELF-SYNCHRONIZING REMOTE MEMORY OPERATIONS IN A DATA CENTER OR MULTIPROCESSOR SYSTEM

Publication number: 20240354106

Abstract: Various embodiments include techniques for performing self-synchronizing remote memory operations in a data center or multiprocessor computing system. During a remote memory operation, a source processor transmits multiple data segments to a destination processor. For each data segment, the source processor transmits a remote memory operation to the destination processor that includes associated metadata that identifies the memory location of a corresponding synchronization object representing a count of data segments to be stored or a flag for each data segment to be stored. The remote memory operation along with the metadata is transmitted as a single unit to the destination processor. The destination processor splits the operation into the remote memory operation and the memory synchronization operation.

Type: Application

Filed: June 26, 2024

Publication date: October 24, 2024

Inventors: Srinivas Santosh Kumar MADUGULA, Olivier GIROUX, Wishwesh Anil GANDHI, Michael Allen PARKER, Raghuram L, Ivan TANASIC, Manan PATEL, Mark HUMMEL, Alexander L. MINKIN, Gregory Michael THORSON
Self-synchronizing remote memory operations in a multiprocessor system

Patent number: 12105960

Abstract: Various embodiments include techniques for performing self-synchronizing remote memory operations in a multiprocessor computing system. During a remote memory operation in the multiprocessor computing system, a source processing unit transmits multiple segments of data to a destination processing. For each segment of data, the source processing unit transmits a remote memory operation to the destination processing unit that includes associated metadata that identifies the memory location of a corresponding synchronization object. The remote memory operation along with the metadata is transmitted as a single unit to the destination processing unit. The destination processing unit splits the operation into the remote memory operation and the memory synchronization operation. As a result, the source processing unit avoids the need to perform a separate memory synchronization operation, thereby reducing inter-processor communications and increasing performance of remote memory operations.

Type: Grant

Filed: August 31, 2022

Date of Patent: October 1, 2024

Assignee: NVIDIA CORPORATION

Inventors: Srinivas Santosh Kumar Madugula, Olivier Giroux, Wishwesh Anil Gandhi, Michael Allen Parker, Raghuram L, Ivan Tanasic, Manan Patel, Mark Hummel, Alexander L. Minkin
APPLICATION PROGRAMMING INTERFACE TO SHARE DATA WITH THREADS

Publication number: 20240289186

Abstract: Apparatuses, systems, and techniques to perform an application programming interface (API) to select a single thread from a group of threads to perform a set of instructions and to broadcast a result of performance of said set of instructions to said group of threads. In at least one embodiment, processors or computer systems are to perform an API to indicate instructions to be performed by a single thread and to select that thread from a group of threads to perform said instructions, and to make available to said group of threads data generated as a result of performance of said instructions.

Type: Application

Filed: February 6, 2024

Publication date: August 29, 2024

Inventors: Piotr Tomasz Ciolkosz, Kyrylo Perelygin, Harold Carter Edwards, Gonzalo Brito Gadeschi, Georgii Evtushenko, Jake Hemstad, Vishalkumar Ketankumar Mehta, Michal Dominiak, Olivier Giroux, Konstantinos Kyriakopoulos
APPLICATION PROGRAMMING INTERFACE TO MODIFY THREAD

Publication number: 20240289129

Abstract: Apparatuses, systems, and techniques to perform an application programming interface (API) to select a single thread from a group of threads to perform a set of instructions. In at least one embodiment, processors or computer systems are to perform an API to indicate instructions to be performed by a single thread and to select that thread from a group of threads to perform said instructions.

Type: Application

Filed: February 6, 2024

Publication date: August 29, 2024

Inventors: Piotr Tomasz Ciolkosz, Kyrylo Perelygin, Harold Carter Edwards, Gonzalo Brito Gadeschi, Georgii Evtushenko, Jake Hemstad, Vishalkumar Ketankumar Mehta, Michal Dominiak, Olivier Giroux, Konstantinos Kyriakopoulos
APPLICATION PROGRAMMING INTERFACE TO CHECK MEMORY TRANSACTION INFORMATION

Publication number: 20240176516

Abstract: Apparatuses, systems, and techniques to check memory transaction information. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to check for information provided in a token by one or more users about one or more memory transactions after a first amount of time indicated by one or more users.

Type: Application

Filed: December 14, 2022

Publication date: May 30, 2024

Inventors: Harold Carter Edwards, Olivier Giroux, Jack H. Choquette, Gokul Ramaswamy Hirisave Chandra Shekhara, Rui Guo, Chao Li, Vishalkumar Ketankumar Mehta, David Dastous St. Hilaire, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Subhasmita Chakraborty, Vikram Dhar
APPLICATION PROGRAMMING INTERFACE TO PROVIDE MEMORY TRANSACTION INFORMATION

Publication number: 20240176515

Abstract: Apparatuses, systems, and techniques to provide memory transaction information. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause information about one or more memory transactions to be provided to one or more users.

Type: Application

Filed: December 14, 2022

Publication date: May 30, 2024

Inventors: Harold Carter Edwards, Olivier Giroux, Jack H. Choquette, Gokul Ramaswamy Hirisave Chandra Shekhara, Rui Guo, Chao Li, Vishalkumar Ketankumar Mehta, David Dastous St. Hilaire, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Subhasmita Chakraborty, Vikram Dhar
STORAGE OF INFORMATION IN A GRAPHICS PROCESSING UNIT CACHE

Publication number: 20240169471

Abstract: Apparatuses, systems, and techniques to perform a graphics processing unit (GPU) prefetch instruction to cause a variable amount of information to be stored into one or more GPU caches. In at least one embodiment, one or more circuits of a GPU are to perform a GPU prefetch instruction to cause a variable amount of information to be stored into one or more GPU caches.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
STORAGE OF TRANSFORMED TENSOR IN A CACHE

Publication number: 20240169472

Abstract: Apparatuses, systems, and techniques to perform a tensor prefetch instruction to cause one or more tensors to be transformed and stored into one or more caches. In at least one embodiment, one or more circuits of a GPU are to perform a tensor prefetch instruction to cause one or more tensors to be transformed and stored into one or more GPU caches.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO PROVIDE INFORMATION

Publication number: 20240169468

Abstract: Apparatuses, systems, and techniques to cause information to be provided. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause an amount of information to be accessed as a result of one or more memory transactions to be provided to one or more users.

Type: Application

Filed: December 14, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Olivier Giroux, Jack H. Choquette, Gokul Ramaswamy Hirisave Chandra Shekhara, Rui Guo, Chao Li, Vishalkumar Ketankumar Mehta, David Dastous St. Hilaire, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Subhasmita Chakraborty, Vikram Dhar
APPLICATION PROGRAMMING INTERFACE TO GENERATE A TENSOR MAPPING

Publication number: 20240168829

Abstract: Apparatuses, systems, and techniques to generate a tensor mapping. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause a mapping from a first tensor to a second tensor to be generated.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Vishalkumar Ketankumar Mehta, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO TRANSFORM INFORMATION CORRESPONDING TO A MEMORY TRANSACTION

Publication number: 20240169469

Abstract: Apparatuses, systems, and techniques to transform information corresponding to one or more memory transactions. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause information corresponding to one or more memory transactions resulting from performance of the API to be transformed.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Aditya Avinash Atluri, Apoorv Parle, Chao Li, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO INDICATE STORAGE LOCATIONS

Publication number: 20240168830

Abstract: Apparatuses, systems, and techniques to indicate storage locations of information to be mapped from a first tensor to a second tensor. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to indicate one or more storage locations of information to be mapped from a first tensor to a second tensor.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO TRANSFORM AND STORE INFORMATION CORRESPONDING TO A MEMORY TRANSACTION

Publication number: 20240168659

Abstract: Apparatuses, systems, and techniques to transform and store information corresponding to one or more memory transactions. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause information corresponding to one or more memory transactions resulting from performance of the API to be transformed and stored.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Aditya Avinash Atluri, Apoorv Parle, Chao Li, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO STORE MEMORY TRANSACTION INFORMATION

Publication number: 20240168794

Abstract: Apparatuses, systems, and techniques to store memory transaction information. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause information received by the API about one or more memory transactions to be stored.

Type: Application

Filed: December 14, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Olivier Giroux, Jack H. Choquette, Gokul Ramaswamy Hirisave Chandra Shekhara, Rui Guo, Chao Li, Vishalkumar Ketankumar Mehta, David Dastous St. Hilaire, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Subhasmita Chakraborty, Vikram Dhar
APPLICATION PROGRAMMING INTERFACE TO TRANSLATE A TENSOR ACCORDING TO A TENSOR MAP

Publication number: 20240168831

Abstract: Apparatuses, systems, and techniques to cause a first tensor to be translated into a second tensor according to a tensor map. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause a first tensor to be translated into a second tensor according to a tensor map.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Aditya Avinash Atluri, Apoorv Parle, Chao Li, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO STORE INFORMATION IN A PLURALITY OF STORAGE LOCATIONS

Publication number: 20240169470

Abstract: Apparatuses, systems, and techniques to store information in a plurality of storage locations allocated to a graphics processing unit (GPU). In at least one embodiment, one or more circuits are to perform an application programming interface (API) to cause information to be stored in a plurality of storage locations allocated to a first GPU.

Type: Application

Filed: December 21, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Stephen Anthony Bernard Jones, Alexander Lev Minkin, Olivier Giroux, Gokul Ramaswamy Hirisave Chandra Shekhara, Vishalkumar Ketankumar Mehta, Aditya Avinash Atluri, Apoorv Parle, Chao Li, Ronny Meir Krashinsky, Alan Kaatz, Andrew Robert Kerr, Jack H. Choquette
APPLICATION PROGRAMMING INTERFACE TO PERFORM DELAYED MEMORY TRANSACTION INFORMATION CHECK

Publication number: 20240168795

Abstract: Apparatuses, systems, and techniques to perform delayed memory transaction information check. In at least one embodiment, one or more circuits are to perform an application programming interface (API) to check for information provided by one or more users about one or more memory transactions after a timeout event indicated by one or more users.

Type: Application

Filed: December 14, 2022

Publication date: May 23, 2024

Inventors: Harold Carter Edwards, Olivier Giroux, Jack H. Choquette, Gokul Ramaswamy Hirisave Chandra Shekhara, Rui Guo, Chao Li, Vishalkumar Ketankumar Mehta, David Dastous St. Hilaire, Aditya Avinash Atluri, Apoorv Parle, Ronny Meir Krashinsky, Subhasmita Chakraborty, Vikram Dhar

1 2 3 4 5 … next