Patents by Inventor Mark Hummel

Mark Hummel has filed for patents to protect the following inventions. This listing includes both pending patent applications and patents already granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230229524
    Abstract: In various examples, a single notification (e.g., a request for a memory access operation) that a processing element (PE) has reached a synchronization barrier may be propagated to multiple physical addresses (PAs) and/or devices associated with multiple processing elements. Thus, the notification may allow an indication that the processing element has reached the synchronization barrier to be recorded at multiple targets. Each notification may access the PAs of each PE and/or device of a barrier group to update a corresponding counter. The PEs and/or devices may poll or otherwise use the counter to determine when each PE of the group has reached the synchronization barrier. When a corresponding counter indicates synchronization at the synchronization barrier, a PE may proceed with performing a compute task asynchronously with one or more other PEs until a subsequent synchronization barrier is reached.
    Type: Application
    Filed: January 18, 2022
    Publication date: July 20, 2023
    Inventors: Glenn Alan Dearth, Mark Hummel, Daniel Joseph Lustig
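
The barrier mechanism described in the entry above lends itself to a small illustration. The C sketch below models each processing element as a thread and each target physical address as a per-PE counter; a single arrival notification is fanned out to every counter, and each PE polls only its own copy. Names such as NUM_PES, barrier_arrive, and barrier_wait are hypothetical, and the sketch only approximates the patented scheme.

```c
/* Minimal sketch (illustrative only): each "processing element" (PE) is a
 * thread, and each PE owns a barrier counter standing in for the physical
 * address a propagated notification would target. */
#include <pthread.h>
#include <stdatomic.h>
#include <stdio.h>

#define NUM_PES 4

static atomic_int counters[NUM_PES];     /* one counter per PE / target PA */

/* One logical notification, propagated to every target in the barrier group. */
static void barrier_arrive(void)
{
    for (int t = 0; t < NUM_PES; t++)
        atomic_fetch_add(&counters[t], 1);
}

/* Each PE polls only its own counter to learn that the whole group arrived. */
static void barrier_wait(int pe)
{
    while (atomic_load(&counters[pe]) < NUM_PES)
        ;                                /* spin; hardware might poll or be notified */
}

static void *pe_main(void *arg)
{
    int pe = (int)(long)arg;
    printf("PE %d: compute phase 1\n", pe);
    barrier_arrive();
    barrier_wait(pe);
    printf("PE %d: past barrier, compute phase 2\n", pe);
    return NULL;
}

int main(void)
{
    pthread_t tid[NUM_PES];
    for (int pe = 0; pe < NUM_PES; pe++)
        pthread_create(&tid[pe], NULL, pe_main, (void *)(long)pe);
    for (int pe = 0; pe < NUM_PES; pe++)
        pthread_join(tid[pe], NULL);
    return 0;
}
```
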
  • Publication number: 20230224239
    Abstract: Apparatuses, systems, and techniques to multicast a transaction to a group of targets. In at least one embodiment, a set is selected from alternate sets of directives associated with the group of targets, and the transaction is transmitted to the group of targets in accordance with the selected set.
    Type: Application
    Filed: January 13, 2022
    Publication date: July 13, 2023
    Inventors: Glenn Dearth, Nan Jiang, Mark Hummel, Richard Reeves
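
As a rough illustration of the multicast-with-directives idea above, the C sketch below keeps two alternate directive sets per multicast group, selects one per transaction by hashing the address, and forwards a copy to every target port in the chosen set. The structures and names (mcast_group, directive_set, select_set) are assumptions, not taken from the patent.

```c
/* Minimal sketch: a multicast group carries alternate "directive sets", each
 * listing the targets (here, port numbers) that should receive a copy. */
#include <stdint.h>
#include <stdio.h>

#define MAX_TARGETS   4
#define NUM_ALT_SETS  2

struct directive_set {
    int num_targets;
    int target_port[MAX_TARGETS];
};

struct mcast_group {
    struct directive_set alt[NUM_ALT_SETS];   /* alternate directive sets */
};

struct transaction {
    uint64_t addr;
    uint32_t payload;
};

/* Pick one of the alternate sets, e.g. to spread load across paths. */
static const struct directive_set *select_set(const struct mcast_group *g,
                                              const struct transaction *t)
{
    return &g->alt[(t->addr >> 6) % NUM_ALT_SETS];
}

static void send_to_port(int port, const struct transaction *t)
{
    printf("port %d <- addr 0x%llx payload 0x%x\n",
           port, (unsigned long long)t->addr, t->payload);
}

/* Transmit the transaction to the group according to the selected set. */
static void multicast(const struct mcast_group *g, const struct transaction *t)
{
    const struct directive_set *set = select_set(g, t);
    for (int i = 0; i < set->num_targets; i++)
        send_to_port(set->target_port[i], t);
}

int main(void)
{
    struct mcast_group g = {
        .alt = {
            { .num_targets = 3, .target_port = { 0, 2, 5 } },
            { .num_targets = 3, .target_port = { 1, 3, 4 } },
        },
    };
    struct transaction t1 = { .addr = 0x1000, .payload = 0xAB };
    struct transaction t2 = { .addr = 0x1040, .payload = 0xCD };
    multicast(&g, &t1);   /* uses directive set 0 */
    multicast(&g, &t2);   /* uses directive set 1 */
    return 0;
}
```
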
  • Publication number: 20220043759
    Abstract: Fabric Attached Memory (FAM) provides a pool of memory that can be accessed by one or more processors, such as graphics processing units (GPUs), over a network fabric. In one instance, a technique is disclosed for using imperfect processors as memory controllers to allow memory, which is local to the imperfect processors, to be accessed by other processors as fabric attached memory. In another instance, memory address compaction is used within the fabric elements to fully utilize the available memory space.
    Type: Application
    Filed: October 20, 2021
    Publication date: February 10, 2022
    Inventors: John Feehrer, Denis Foley, Mark Hummel, Vyas Venkataraman, Ram Gummadi, Samuel H. Duncan, Glenn Dearth, Brian Kelleher
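
The address-compaction idea in the entry above can be pictured with a small example. In the C sketch below, the fabric gives every memory target a fixed-size window, but a target may expose less usable memory than the window; compaction removes the resulting hole so the usable portions pack densely. The window sizes and function names are hypothetical assumptions, not the patent's mechanism.

```c
/* Minimal sketch: the fabric carves FAM space into fixed windows, one per
 * memory target, but a target backed by an "imperfect" processor may expose
 * less usable memory than the window size. Compaction keeps offsets dense. */
#include <stdint.h>
#include <stdio.h>

#define WINDOW_SIZE  (1ull << 30)   /* 1 GiB window per target in fabric space */
#define USABLE_SIZE  (768ull << 20) /* only 768 MiB per target is actually usable */

/* Map a fabric-visible FAM address to (target, compacted local offset). */
static int compact_fam_addr(uint64_t fabric_addr,
                            unsigned *target, uint64_t *local_off)
{
    unsigned win = (unsigned)(fabric_addr / WINDOW_SIZE);
    uint64_t off = fabric_addr % WINDOW_SIZE;

    if (off >= USABLE_SIZE)
        return -1;                  /* address falls in the hole; reject */

    *target    = win;
    *local_off = off;               /* offset is already dense within the target */
    return 0;
}

/* The reverse view: a dense allocator hands out compacted capacity only. */
static uint64_t dense_to_fabric(uint64_t dense_addr)
{
    uint64_t win = dense_addr / USABLE_SIZE;
    uint64_t off = dense_addr % USABLE_SIZE;
    return win * WINDOW_SIZE + off; /* re-expand into the windowed fabric space */
}

int main(void)
{
    uint64_t dense = 5ull * USABLE_SIZE + 123;   /* 6th target, offset 123 */
    uint64_t fab   = dense_to_fabric(dense);
    unsigned target;
    uint64_t off;

    if (compact_fam_addr(fab, &target, &off) == 0)
        printf("fabric 0x%llx -> target %u offset %llu\n",
               (unsigned long long)fab, target, (unsigned long long)off);
    return 0;
}
```
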
  • Patent number: 11233730
    Abstract: Introduced herein is a routing technique that, for example, routes a transaction to a destination port over a network that supports link aggregation and multi-port connection. In one embodiment, two tables that can be searched based on the target and supplemental routing IDs of the transaction are utilized to route the transaction to the proper port of the destination endpoint. In an embodiment, the first table provides a list of available ports at each hop/route point that can route the transaction to the destination endpoint, and the second table provides a supplemental routing ID that can select a specific group of ports from the first table that can correctly route the transaction to the proper port.
    Type: Grant
    Filed: December 2, 2019
    Date of Patent: January 25, 2022
    Assignee: NVIDIA Corporation
    Inventors: Glenn Dearth, Mark Hummel
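
A minimal two-table lookup along the lines of the abstract above is sketched in C below: table 1, indexed by the target ID, lists candidate egress ports at a route point, and table 2, indexed by the supplemental routing ID, picks the specific candidate that reaches the correct port of a multi-port endpoint. Table contents and names are illustrative assumptions.

```c
/* Minimal sketch of a two-table routing lookup at one hop/route point. */
#include <stdio.h>

#define MAX_PORTS 4

struct port_list {
    int count;
    int port[MAX_PORTS];
};

/* Table 1: target ID -> ports that can reach that endpoint from this hop. */
static const struct port_list table1[] = {
    [0] = { .count = 2, .port = { 3, 7 } },   /* endpoint 0 reachable via ports 3 or 7 */
    [1] = { .count = 1, .port = { 5 } },      /* endpoint 1 reachable via port 5 only  */
};

/* Table 2: supplemental routing ID -> index into the candidate list,
 * pinning the transaction to the specific link/port group it must use. */
static const int table2[] = { 0, 1 };

static int route(int target_id, int supp_id)
{
    const struct port_list *cands = &table1[target_id];
    int idx = table2[supp_id] % cands->count;  /* stay in range for 1-port endpoints */
    return cands->port[idx];
}

int main(void)
{
    printf("target 0, supp 0 -> egress port %d\n", route(0, 0)); /* port 3 */
    printf("target 0, supp 1 -> egress port %d\n", route(0, 1)); /* port 7 */
    printf("target 1, supp 1 -> egress port %d\n", route(1, 1)); /* port 5 */
    return 0;
}
```
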
  • Publication number: 20210406196
    Abstract: A method and system for allocating memory to a memory operation executed by a processor in a computer arrangement having a plurality of processors. The method includes receiving a memory operation from a processor that references an address in a shared memory, mapping the received memory operation to at least one virtual memory pool to produce a mapping result, and providing the mapping result to the processor.
    Type: Application
    Filed: September 10, 2021
    Publication date: December 30, 2021
    Applicants: Advanced Micro Devices, Inc., ATI Technologies ULC
    Inventors: Anthony Asaro, Kevin Normoyle, Mark Hummel
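
The pool-mapping step described above can be illustrated with a short C sketch: a memory operation carrying a shared-memory address is matched against per-pool address ranges, and the matching pool descriptor is the mapping result returned to the processor. Pool names, ranges, and the range-match policy are assumptions for illustration only.

```c
/* Minimal sketch: map a memory operation's shared-memory address to one of
 * several virtual memory pools and hand the result back to the processor. */
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

enum pool_id { POOL_LOCAL = 0, POOL_REMOTE = 1, POOL_UNCACHED = 2 };

struct vm_pool {
    enum pool_id id;
    uint64_t     base;
    uint64_t     size;
    const char  *name;
};

static const struct vm_pool pools[] = {
    { POOL_LOCAL,    0x0000000000ull, 0x40000000ull, "local"    },
    { POOL_REMOTE,   0x0040000000ull, 0x40000000ull, "remote"   },
    { POOL_UNCACHED, 0x0080000000ull, 0x10000000ull, "uncached" },
};

struct mem_op {
    uint64_t addr;      /* shared-memory address referenced by the operation */
    int      is_write;
};

/* Map a memory operation to a virtual memory pool; NULL if no pool matches. */
static const struct vm_pool *map_to_pool(const struct mem_op *op)
{
    for (size_t i = 0; i < sizeof(pools) / sizeof(pools[0]); i++)
        if (op->addr >= pools[i].base && op->addr < pools[i].base + pools[i].size)
            return &pools[i];
    return NULL;
}

int main(void)
{
    struct mem_op op = { .addr = 0x0040001000ull, .is_write = 1 };
    const struct vm_pool *p = map_to_pool(&op);
    if (p)
        printf("addr 0x%llx -> pool '%s'\n", (unsigned long long)op.addr, p->name);
    return 0;
}
```
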
  • Patent number: 11182309
    Abstract: Fabric Attached Memory (FAM) provides a pool of memory that can be accessed by one or more processors, such as graphics processing units (GPUs), over a network fabric. In one instance, a technique is disclosed for using imperfect processors as memory controllers to allow memory, which is local to the imperfect processors, to be accessed by other processors as fabric attached memory. In another instance, memory address compaction is used within the fabric elements to fully utilize the available memory space.
    Type: Grant
    Filed: November 4, 2019
    Date of Patent: November 23, 2021
    Assignee: NVIDIA Corporation
    Inventors: John Feehrer, Denis Foley, Mark Hummel, Vyas Venkataraman, Ram Gummadi, Samuel H. Duncan, Glenn Dearth, Brian Kelleher
  • Patent number: 11119944
    Abstract: A method and system for allocating memory to a memory operation executed by a processor in a computer arrangement having a plurality of processors. The method includes receiving a memory operation from a processor that references an address in a shared memory, mapping the received memory operation to at least one of a plurality of virtual memory pools to produce a mapping result, and providing the mapping result to the processor.
    Type: Grant
    Filed: June 17, 2019
    Date of Patent: September 14, 2021
    Assignees: Advanced Micro Devices, Inc., ATI Technologies ULC
    Inventors: Anthony Asaro, Kevin Normoyle, Mark Hummel
  • Patent number: 11082347
    Abstract: Multiple processors are often used in computing systems to solve very large, complex problems, such as those encountered in artificial intelligence. Such processors typically exchange data among each other via an interconnect fabric (such as a group of network connections and switches) in solving such complex problems. The amount of data injected into the interconnect fabric by the processors can at times overwhelm the interconnect fabric, preventing some of the processors from communicating with each other. To address this problem, techniques are disclosed to enable, for example, processors that are connected to an interconnect fabric to coordinate and control the amount of data injected so that the interconnect fabric does not get overwhelmed.
    Type: Grant
    Filed: February 15, 2019
    Date of Patent: August 3, 2021
    Assignee: NVIDIA Corporation
    Inventors: Glenn Dearth, Nan Jiang, John Wortman, Alex Ishii, Mark Hummel, Rich Reeves
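
One simple way to picture the injection control described above is a credit scheme, sketched in C below: a processor may inject a packet only while it holds a credit, and credits are returned as traffic drains, which bounds the data in flight on the fabric. This is an illustrative stand-in, not necessarily the patented mechanism, and all names are hypothetical.

```c
/* Minimal sketch: credit-based injection control at a fabric endpoint. */
#include <stdio.h>

struct injector {
    int credits;       /* packets this processor may still inject */
    int max_credits;   /* agreed-upon cap, coordinated across processors */
};

/* Returns 1 if the packet may be injected now, 0 if the source must wait. */
static int try_inject(struct injector *inj)
{
    if (inj->credits == 0)
        return 0;
    inj->credits--;
    return 1;
}

/* Called when the fabric/receiver signals that an earlier packet drained. */
static void credit_return(struct injector *inj)
{
    if (inj->credits < inj->max_credits)
        inj->credits++;
}

int main(void)
{
    struct injector inj = { .credits = 2, .max_credits = 2 };

    for (int pkt = 0; pkt < 4; pkt++) {
        if (try_inject(&inj)) {
            printf("packet %d injected\n", pkt);
        } else {
            printf("packet %d stalled: fabric budget exhausted\n", pkt);
            credit_return(&inj);            /* model an earlier packet draining */
            if (try_inject(&inj))
                printf("packet %d injected after credit return\n", pkt);
        }
    }
    return 0;
}
```
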
  • Patent number: 11038800
    Abstract: An endpoint in a network may make posted or non-posted write requests to another endpoint in the network. For a non-posted write request, the target endpoint provides a response to the requesting endpoint indicating that the write request has been serviced. For a posted write request, the target endpoint does not provide such an acknowledgment. Hence, posted write requests have lower overhead, but they suffer from potential synchronization and resiliency issues. While non-posted write requests do not have those issues, they cause increased load on the network because such requests require the target endpoint to acknowledge each write request. Introduced herein is a network operation technique that uses non-posted transactions while keeping the load overhead of the network at a manageable level. The introduced technique reduces the load overhead of the non-posted write requests by collapsing and reducing a number of the responses.
    Type: Grant
    Filed: August 28, 2019
    Date of Patent: June 15, 2021
    Assignee: NVIDIA Corporation
    Inventors: Glenn Dearth, Mark Hummel, Jonathan Owen, Mike Osborn, John Wortman, Rich Reeves
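
The response-collapsing idea above can be approximated in a few lines of C: the target still accounts for every non-posted write, but sends one response per batch rather than one per request, so the response traffic shrinks. The threshold and names are assumptions made for the sketch.

```c
/* Minimal sketch: collapse non-posted write responses into batched acks. */
#include <stdio.h>

#define COLLAPSE_THRESHOLD 4    /* respond once per this many serviced writes */

struct responder {
    int pending_acks;           /* serviced writes not yet acknowledged */
};

/* Service one non-posted write; emit a collapsed response when enough
 * acknowledgements have accumulated. Returns the number of writes covered
 * by the response sent (0 if no response was sent this time). */
static int service_write(struct responder *r)
{
    r->pending_acks++;
    if (r->pending_acks >= COLLAPSE_THRESHOLD) {
        int covered = r->pending_acks;
        r->pending_acks = 0;
        return covered;         /* one response stands in for 'covered' acks */
    }
    return 0;
}

int main(void)
{
    struct responder r = { 0 };
    int responses = 0;

    for (int w = 1; w <= 10; w++) {
        int covered = service_write(&r);
        if (covered) {
            responses++;
            printf("write %d: collapsed response covering %d writes\n", w, covered);
        }
    }
    printf("10 writes -> %d responses (plus %d still pending)\n",
           responses, r.pending_acks);
    return 0;
}
```
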
  • Publication number: 20210133123
    Abstract: Fabric Attached Memory (FAM) provides a pool of memory that can be accessed by one or more processors, such as graphics processing units (GPUs), over a network fabric. In one instance, a technique is disclosed for using imperfect processors as memory controllers to allow memory, which is local to the imperfect processors, to be accessed by other processors as fabric attached memory. In another instance, memory address compaction is used within the fabric elements to fully utilize the available memory space.
    Type: Application
    Filed: November 4, 2019
    Publication date: May 6, 2021
    Inventors: John Feehrer, Denis Foley, Mark Hummel, Vyas Venkataraman, Ram Gummadi, Samuel H. Duncan, Glenn Dearth, Brian Kelleher
  • Publication number: 20210067449
    Abstract: An endpoint in a network may make posted or non-posted write requests to another endpoint in the network. For a non-posted write request, the target endpoint provides a response to the requesting endpoint indicating that the write request has been serviced. For a posted write request, the target endpoint does not provide such an acknowledgment. Hence, posted write requests have lower overhead, but they suffer from potential synchronization and resiliency issues. While non-posted write requests do not have those issues, they cause increased load on the network because such requests require the target endpoint to acknowledge each write request. Introduced herein is a network operation technique that uses non-posted transactions while keeping the load overhead of the network at a manageable level. The introduced technique reduces the load overhead of the non-posted write requests by collapsing and reducing a number of the responses.
    Type: Application
    Filed: August 28, 2019
    Publication date: March 4, 2021
    Inventors: Glenn Dearth, Mark Hummel, Jonathan Owen, Mike Osborn, John Wortman, Rich Reeves
  • Publication number: 20210014156
    Abstract: Introduced herein is a routing technique that, for example, routes a transaction to a destination port over a network that supports link aggregation and multi-port connection. In one embodiment, two tables that can be searched based on the target and supplemental routing IDs of the transaction are utilized to route the transaction to the proper port of the destination endpoint. In an embodiment, the first table provides a list of available ports at each hop/route point that can route the transaction to the destination endpoint, and the second table provides a supplemental routing ID that can select a specific group of ports from the first table that can correctly route the transaction to the proper port.
    Type: Application
    Filed: December 2, 2019
    Publication date: January 14, 2021
    Inventors: Glenn Dearth, Mark Hummel
  • Patent number: 10806175
    Abstract: Systems, methods, and devices for heat management of heating implements for a water pipe.
    Type: Grant
    Filed: May 8, 2018
    Date of Patent: October 20, 2020
    Assignee: Kaloud, Inc.
    Inventors: Reza Bavar, Tylor Garland, Michael Latham, Stephen Bradford, Richard Siemer, Mark Hummel, Wilson Reniers, Stephen Harper, Andrew Castro
  • Patent number: 10789194
    Abstract: Systems and techniques for synchronizing transactions between processing devices on an interconnection network are provided. Upon receiving a stream of posted transactions followed by a flush transaction from a source processing device connected to the interconnection network, the flush transaction is trapped before it enters the interconnection network. Subsequently, based on monitoring for responses received from a destination processing device for transactions corresponding to the posted transactions, a flush response is generated and returned to the source processing device. The described techniques enable efficient synchronization of posted writes, posted atomics, and the like over complex interconnection fabrics, such that a first GPU can write data to a second GPU so that a third GPU can safely consume the data written to the second GPU.
    Type: Grant
    Filed: March 26, 2019
    Date of Patent: September 29, 2020
    Assignee: NVIDIA Corporation
    Inventors: Larry R. Dennison, Mark Hummel, Glenn Dearth
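
A minimal model of the flush-trapping behavior described above appears in the C sketch below: edge logic counts forwarded posted writes, holds an arriving flush instead of injecting it into the fabric, and returns the flush response once every outstanding write has completed. The structure and function names are hypothetical.

```c
/* Minimal sketch: trap a flush at the fabric edge and respond to it only
 * after all previously forwarded posted writes have completed. */
#include <stdbool.h>
#include <stdio.h>

struct flush_tracker {
    int  outstanding;       /* posted writes forwarded but not yet completed */
    bool flush_trapped;     /* a flush is waiting for the count to drain */
};

static void on_posted_write(struct flush_tracker *t)
{
    t->outstanding++;       /* forward the write into the fabric (not shown) */
}

static void on_flush(struct flush_tracker *t)
{
    t->flush_trapped = true;    /* do NOT forward the flush into the fabric */
}

/* Called when the destination signals completion of one posted write. */
static void on_write_complete(struct flush_tracker *t)
{
    t->outstanding--;
    if (t->flush_trapped && t->outstanding == 0) {
        t->flush_trapped = false;
        printf("all posted writes drained -> flush response sent to source\n");
    }
}

int main(void)
{
    struct flush_tracker t = { 0, false };

    on_posted_write(&t);
    on_posted_write(&t);
    on_flush(&t);               /* trapped at the edge of the fabric */
    on_write_complete(&t);      /* 1 still outstanding, keep waiting */
    on_write_complete(&t);      /* 0 outstanding -> respond to the flush */
    return 0;
}
```
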
  • Publication number: 20190303302
    Abstract: A method and system for allocating memory to a memory operation executed by a processor in a computer arrangement having a plurality of processors. The method includes receiving a memory operation from a processor that references an address in a shared memory, mapping the received memory operation to at least one of a plurality of virtual memory pools to produce a mapping result, and providing the mapping result to the processor.
    Type: Application
    Filed: June 17, 2019
    Publication date: October 3, 2019
    Applicants: Advanced Micro Devices, Inc., ATI Technologies ULC
    Inventors: Anthony Asaro, Kevin Normoyle, Mark Hummel
  • Publication number: 20190294575
    Abstract: Systems and techniques for synchronizing transactions between processing devices on an interconnection network are provided. Upon receiving a stream of posted transactions followed by a flush transaction from a source processing device connected to the interconnection network, the flush transaction is trapped before it enters the interconnection network. Subsequently, based on monitoring for responses received from a destination processing device for transactions corresponding to the posted transactions, a flush response is generated and returned to the source processing device. The described techniques enable efficient synchronization of posted writes, posted atomics, and the like over complex interconnection fabrics, such that a first GPU can write data to a second GPU so that a third GPU can safely consume the data written to the second GPU.
    Type: Application
    Filed: March 26, 2019
    Publication date: September 26, 2019
    Inventors: Larry R. Dennison, Mark Hummel, Glenn Dearth
  • Publication number: 20190297018
    Abstract: Multiple processors are often used in computing systems to solve very large, complex problems, such as those encountered in artificial intelligence. Such processors typically exchange data among each other via an interconnect fabric (such as a group of network connections and switches) in solving such complex problems. The amount of data injected into the interconnect fabric by the processors can at times overwhelm the interconnect fabric, preventing some of the processors from communicating with each other. To address this problem, techniques are disclosed to enable, for example, processors that are connected to an interconnect fabric to coordinate and control the amount of data injected so that the interconnect fabric does not get overwhelmed.
    Type: Application
    Filed: February 15, 2019
    Publication date: September 26, 2019
    Inventors: Glenn Dearth, Nan Jiang, John Wortman, Alex Ishii, Mark Hummel, Rich Reeves
  • Patent number: 10324860
    Abstract: A method and system for allocating memory to a memory operation executed by a processor in a computer arrangement having a first processor configured for unified operation with a second processor. The method includes receiving a memory operation from a processor and mapping the memory operation to one of a plurality of memory heaps. The mapping produces a mapping result. The method also includes providing the mapping result to the processor.
    Type: Grant
    Filed: September 5, 2017
    Date of Patent: June 18, 2019
    Assignees: Advanced Micro Devices, Inc., ATI Technologies ULC
    Inventors: Anthony Asaro, Kevin Normoyle, Mark Hummel
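
To make the heap-mapping step above concrete, the C sketch below picks a heap based on which processor issued the operation and whether the CPU must also be able to access the data, then returns that choice as the mapping result. The heap names and the selection policy are illustrative assumptions, not the patented method.

```c
/* Minimal sketch: map a memory operation to one of several memory heaps in a
 * unified CPU/GPU arrangement and return the choice to the requester. */
#include <stdio.h>

enum proc { PROC_CPU, PROC_GPU };
enum heap { HEAP_SYSTEM, HEAP_LOCAL_VISIBLE, HEAP_LOCAL_INVISIBLE };

struct heap_op {
    enum proc issuer;
    int       needs_cpu_access;   /* must the CPU also be able to touch it? */
};

/* Choose a heap for the operation; this stands in for the mapping step. */
static enum heap map_to_heap(const struct heap_op *op)
{
    if (op->issuer == PROC_CPU)
        return HEAP_SYSTEM;                   /* CPU ops land in system memory   */
    if (op->needs_cpu_access)
        return HEAP_LOCAL_VISIBLE;            /* GPU memory the CPU can also see */
    return HEAP_LOCAL_INVISIBLE;              /* GPU-private memory              */
}

int main(void)
{
    static const char *names[] = { "system", "local-visible", "local-invisible" };
    struct heap_op a = { PROC_CPU, 1 };
    struct heap_op b = { PROC_GPU, 1 };
    struct heap_op c = { PROC_GPU, 0 };

    printf("CPU op           -> %s heap\n", names[map_to_heap(&a)]);
    printf("GPU op (shared)  -> %s heap\n", names[map_to_heap(&b)]);
    printf("GPU op (private) -> %s heap\n", names[map_to_heap(&c)]);
    return 0;
}
```
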
  • Patent number: 10200154
    Abstract: A receiver, transmitter and method for early packet header verification are provided. In one embodiment, the method includes: (1) receiving a payload flit of a preceding packet and a header flit of a current packet; and (2) using a Cyclic Redundancy Check (CRC) in the header flit to verify the payload flit of the preceding packet and the header flit of the current packet.
    Type: Grant
    Filed: June 23, 2017
    Date of Patent: February 5, 2019
    Assignee: NVIDIA Corporation
    Inventors: Stephen D. Glaser, Eric Tyson, Mark Hummel, Michael Osborn, Jonathan Owen, Marvin Denman, Dennis Ma, Denis Foley
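
The early-verification scheme above can be sketched in C: the CRC carried in a packet's header flit is computed over the preceding packet's payload flit plus the header flit itself (with the CRC field zeroed), so a receiver can check both flits as soon as the header arrives. The flit layout and the use of CRC-32 here are assumptions, not the patent's wire format.

```c
/* Minimal sketch: one header-resident CRC covers the previous packet's
 * payload flit and the current packet's header flit. */
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

#define FLIT_BYTES 16

struct header_flit {
    uint8_t  fields[FLIT_BYTES - 4];   /* routing/control fields */
    uint32_t crc;                      /* covers prev payload + this header */
};

/* Bitwise CRC-32 (zlib polynomial), chainable across buffers. */
static uint32_t crc32_update(uint32_t crc, const void *buf, size_t len)
{
    const uint8_t *p = buf;
    crc = ~crc;
    while (len--) {
        crc ^= *p++;
        for (int k = 0; k < 8; k++)
            crc = (crc >> 1) ^ (0xEDB88320u & (-(crc & 1)));
    }
    return ~crc;
}

/* CRC over the preceding payload flit and the header flit with crc zeroed. */
static uint32_t span_crc(const uint8_t prev_payload[FLIT_BYTES],
                         const struct header_flit *hdr)
{
    struct header_flit tmp = *hdr;
    tmp.crc = 0;
    uint32_t crc = crc32_update(0, prev_payload, FLIT_BYTES);
    return crc32_update(crc, &tmp, sizeof tmp);
}

int main(void)
{
    uint8_t prev_payload[FLIT_BYTES] = { 0xde, 0xad, 0xbe, 0xef };
    struct header_flit hdr = { .fields = { 0x01, 0x02 } };

    /* Transmitter stamps the CRC into the header flit... */
    hdr.crc = span_crc(prev_payload, &hdr);

    /* ...and the receiver verifies both flits as soon as the header arrives. */
    int ok = (span_crc(prev_payload, &hdr) == hdr.crc);
    printf("early header/payload check: %s\n", ok ? "PASS" : "FAIL");

    prev_payload[3] ^= 0x40;   /* corrupt the previous payload flit in flight */
    ok = (span_crc(prev_payload, &hdr) == hdr.crc);
    printf("after corruption:           %s\n", ok ? "PASS" : "FAIL");
    return 0;
}
```
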
  • Publication number: 20180317545
    Abstract: Systems, methods, and devices for heat management of heating implements for a water pipe.
    Type: Application
    Filed: May 8, 2018
    Publication date: November 8, 2018
    Inventors: Reza Bavar, Tylor Garland, Michael Latham, Stephen Bradford, Richard Siemer, Mark Hummel, Wilson Reniers, Stephen Harper, Andrew Castro