Patents Assigned to NVidia

Watertight ray triangle intersection

Patent number: 10825230

Abstract: A hardware-based traversal coprocessor provides acceleration of tree traversal operations searching for intersections between primitives represented in a tree data structure and a ray. The primitives may include triangles used in generating a virtual scene. The hardware-based traversal coprocessor is configured to properly handle numerically challenging computations at or near edges and/or vertices of primitives and/or ensure that a single intersection is reported when a ray intersects a surface formed by primitives at or near edges and/or vertices of the primitives.

Type: Grant

Filed: August 10, 2018

Date of Patent: November 3, 2020

Assignee: NVIDIA Corporation

Inventors: Samuli Laine, Tero Karras, Timo Aila, Robert Ohannessian, William Parsons Newhall, Jr., Greg Muthler, Ian Kwong, Peter Nelson, John Burgess
Method for continued bounding volume hierarchy traversal on intersection without shader intervention

Patent number: 10825232

Abstract: A hardware-based traversal coprocessor provides acceleration of tree traversal operations searching for intersections between primitives represented in a tree data structure and a ray. The primitives may include opaque and alpha triangles used in generating a virtual scene. The hardware-based traversal coprocessor is configured to determine primitives intersected by the ray, and return intersection information to a streaming multiprocessor for further processing. The hardware-based traversal coprocessor is configured to omit reporting of one or more primitives the ray is determined to intersect. The omitted primitives include primitives which are provably capable of being omitted without a functional impact on visualizing the virtual scene.

Type: Grant

Filed: January 22, 2020

Date of Patent: November 3, 2020

Assignee: NVIDIA Corporation

Inventors: Greg Muthler, Tero Karras, Samuli Laine, William Parsons Newhall, Jr., Ronald Charles Babich, Jr., John Burgess, Ignacio Llamas
Optimizing software-directed instruction replication for GPU error detection

Patent number: 10817289

Abstract: Software-only and software-hardware optimizations to reduce the overhead of intra-thread instruction duplication on a GPU or other instruction processor are disclosed. The optimizations trade off error containment for performance and include ISA extensions with limited hardware changes and area costs.

Type: Grant

Filed: October 3, 2018

Date of Patent: October 27, 2020

Assignee: NVIDIA Corp.

Inventors: Siva Hari, Michael Sullivan, Timothy Tsai, Stephen W. Keckler, Abdulrahman Mahmoud
Thread-level sleep in a multithreaded architecture

Patent number: 10817295

Abstract: A streaming multiprocessor (SM) includes a nanosleep (NS) unit configured to cause individual threads executing on the SM to sleep for a programmer-specified interval of time. For a given thread, the NS unit parses a NANOSLEEP instruction and extracts a sleep time. The NS unit then maps the sleep time to a single bit of a timer and causes the thread to sleep. When the timer bit changes, the sleep time expires, and the NS unit awakens the thread. The thread may then continue executing. The SM also includes a nanotrap (NT) unit configured to issue traps using a similar timing mechanism to that described above. For a given thread, the NT unit parses a NANOTRAP instruction and extracts a trap time. The NT unit then maps the trap time to a single bit of a timer. When the timer bit changes, the NT unit issues a trap.

Type: Grant

Filed: April 28, 2017

Date of Patent: October 27, 2020

Assignee: NVIDIA Corporation

Inventors: Olivier Giroux, Peter Nelson, Jack Choquette, Ajay Sudarshan Tirumala
Dynamic partitioning of execution resources

Patent number: 10817338

Abstract: Embodiments of the present invention set forth techniques for allocating execution resources to groups of threads within a graphics processing unit. A compute work distributor included in the graphics processing unit receives an indication from a process that a first group of threads is to be launched. The compute work distributor determines that a first subcontext associated with the process has at least one processor credit. In some embodiments, CTAs may be launched even when there are no processor credits, if one of the TPCs that was already acquired has sufficient space. The compute work distributor identifies a first processor included in a plurality of processors that has a processing load that is less than or equal to the processor loads associated with all other processors included in the plurality of processors. The compute work distributor launches the first group of threads to execute on the first processor.

Type: Grant

Filed: January 31, 2018

Date of Patent: October 27, 2020

Assignee: NVIDIA Corporation

Inventors: Jerome F. Duluk, Jr., Luke Durant, Ramon Matas Navarro, Alan Menezes, Jeffrey Tuckey, Gentaro Hirota, Brian Pharris
Scalable light-weight protocols for wire-speed packet ordering

Patent number: 10820057

Abstract: A communication method between a source device and a target device utilizes speculative connection setup between the source device and the target device, target-device-side packet ordering, and fine-grained ordering to remove packet dependencies.

Type: Grant

Filed: April 5, 2019

Date of Patent: October 27, 2020

Assignee: NVIDIA Corp.

Inventors: Hans Eberle, Larry Robert Dennison
Secure reconfiguration of hardware device operating features

Patent number: 10817609

Abstract: A secure reconfigurable operating mode system includes a hardware device having multiple operating modes and an operating mode selector that is coupled to the hardware device. The operating mode selector has a virtual fusing register that selects an operating mode for the hardware device and a security processor that enables a secure virtual fusing based on documented security files authorizing selection of the operating mode. A method of secure hardware device operating mode reconfiguration is also provided.

Type: Grant

Filed: September 30, 2015

Date of Patent: October 27, 2020

Assignee: Nvidia Corporation

Inventors: Apoorv Gupta, Ryan Speiser, Varun Kumar, Tony Cheng, Erik Zuroski
System and method for entering and exiting sleep mode in a graphics subsystem

Patent number: 10817043

Abstract: A technique is disclosed for a graphics processing unit (GPU) to enter and exit a power saving deep sleep mode. The technique involves preserving processing state within local memory by configuring the local memory to operate in a self-refresh mode while the GPU is powered off for deep sleep. An interface circuit coupled to the local memory is configured to prevent spurious GPU signals from disrupting proper self-refresh of the local memory. Spurious GPU signals may result from GPU power down and GPU power up events associated with the GPU entering and exiting the deep sleep mode.

Type: Grant

Filed: July 26, 2011

Date of Patent: October 27, 2020

Assignee: NVIDIA Corporation

Inventors: Rajeev Jayavant, Thomas E. Dewey, David Wyatt
CLOCK DATA RECOVERY CONVERGENCE USING SIGNED TIMING INJECTION

Publication number: 20200336286

Abstract: A system for data and clock recovery includes a timing error detector, a phase detector, and a phase increment injector. The phase increment injector may be used to determine an increment to affect an output of the phase detector or a clocking element. A sign of the increment is determined from a sign or direction of an accumulated version of a clock and data recovery gradient value.

Type: Application

Filed: February 26, 2020

Publication date: October 22, 2020

Applicant: NVIDIA Corp.

Inventors: Pervez Mirza Aziz, Vishnu Balan, Viswanath Annampedu
Spatio-temporal image metric for rendered animations

Patent number: 10810455

Abstract: An image processing method transforms image sequences into luminances, filters the luminances, determines the temporal differences between the luminances, performs a frequency domain transformation on the temporal differences, and applies a temporal contrast sensitivity function envelope integral to the frequency transform output to generate a temporal image metric. The temporal image metric may be applied for example to train a neural network or to configure a display device to depict a visual indication of the temporal image metric.

Type: Grant

Filed: March 4, 2019

Date of Patent: October 20, 2020

Assignee: NVIDIA Corp.

Inventors: Jim Nilsson, Tomas Akenine-Moller
Techniques for preloading textures in rendering graphics

Patent number: 10810784

Abstract: Systems and methods for improved texture mapping and graphics processing are described. According to an example implementation, whole or parts of texture blocks are prefetched to an intermediate cache by a processing unit so that the same processing unit or another processing unit can subsequently obtain the prefetched texture block from the intermediate cache. Moreover, in some example implementations, control circuitry associated with the intermediate cache may throttle prefetch requests in order to avoid the memory system and/or the interconnect system receiving excessive amounts of prefetch requests. Additionally, in some implementations, deduplication of prefetch requests can be performed at the intermediate cache and/or the processing unit. Some implementations also include an efficient technique for calculating the address of the next texture block to be prefetched.

Type: Grant

Filed: July 25, 2019

Date of Patent: October 20, 2020

Assignee: NVIDIA Corporation

Inventors: Pranava Ajith Rai, Amit Jain
Method for forward progress tree traversal mechanisms in hardware

Patent number: 10810785

Abstract: In a ray tracer, to prevent any long-running query from hanging the graphics processing unit, a traversal coprocessor provides a preemption mechanism that will allow rays to stop processing or time out early. The example non-limiting implementations described herein provide such a preemption mechanism, including a forward progress guarantee, and additional programmable timeout options that can be time or cycle based. Those programmable options provide a means for quality of service timing guarantees for applications such as virtual reality (VR) that have strict timing requirements.

Type: Grant

Filed: August 10, 2018

Date of Patent: October 20, 2020

Assignee: NVIDIA CORPORATION

Inventors: Greg Muthler, Ronald Charles Babich, Jr., William Parsons Newhall, Jr., Peter Nelson, James Robertson, John Burgess
IR DROP PREDICTION WITH MAXIMUM CONVOLUTIONAL NEURAL NETWORK

Publication number: 20200327417

Abstract: IR drop predictions are obtained using a maximum convolutional neural network. A circuit structure is partitioned into a grid. For cells of the circuit structure in sub-intervals of a clock period, power consumption of the cell is amortized into a set of grid tiles that include portions of the cell, thus forming a set of power maps. The power maps are applied to a neural network to generate IR drop predictions for the circuit structure.

Type: Application

Filed: March 17, 2020

Publication date: October 15, 2020

Applicant: NVIDIA Corp.

Inventors: Zhiyao Xie, Haoxing Ren, Brucek Khailany, Sheng Ye
Compute task state encapsulation

Patent number: 10795722

Abstract: One embodiment of the present invention sets forth a technique for encapsulating compute task state that enables out-of-order scheduling and execution of the compute tasks. The scheduling circuitry organizes the compute tasks into groups based on priority levels. The compute tasks may then be selected for execution using different scheduling schemes. Each group is maintained as a linked list of pointers to compute tasks that are encoded as task metadata (TMD) stored in memory. A TMD encapsulates the state and parameters needed to initialize, schedule, and execute a compute task.

Type: Grant

Filed: November 9, 2011

Date of Patent: October 6, 2020

Assignee: NVIDIA Corporation

Inventors: Jerome F. Duluk, Jr., Lacky V. Shah, Sean J. Treichler
System, method, and computer program product for simultaneously determining settings for a plurality of parameter variations

Patent number: 10795691

Abstract: A system, method, and computer program product are provided for simultaneously determining settings for a plurality of parameter variations. In use, a plurality of parameter variations associated with a device is identified. Additionally, settings for each of the plurality of parameter variations are determined simultaneously.

Type: Grant

Filed: November 12, 2019

Date of Patent: October 6, 2020

Assignee: NVIDIA CORPORATION

Inventors: John F. Spitzer, Rev Lebaredian, Yury Uralsky
Start-up performance improvement for remote video gaming

Patent number: 10798457

Abstract: A gaming system includes a network server and a gaming manager communicatively coupled to the network server. The gaming manager having a video control unit that starts a video game running remotely with a static video portion and a user interactive video portion and a video receiving unit, coupled to the video control unit, that receives the static video portion for local display while the user interactive video portion is being initialized remotely for subsequent local game play. The gaming system further includes a local user device, coupled to the gaming manager, that initially displays the static video portion and provides a user interface for the subsequent local game play following completion of remote initialization of the user interactive video portion. A method of managing a remote game is also provided.

Type: Grant

Filed: May 27, 2016

Date of Patent: October 6, 2020

Assignee: Nvidia Corporation

Inventors: Bojan Vukojevic, Franck Diard
RECONSTRUCTION FOR TEMPORALLY DENSE RAY TRACE RENDERING

Publication number: 20200312010

Abstract: A method dynamically selects one of a first sampling order and a second sampling order for a ray trace of pixels in a tile where the selection is based on a motion vector for the tile. The sampling order may be a bowtie pattern or an hourglass pattern. Subframes generated based on the sampling order are communicated over a bus along with motion vectors for tiles of the subframes.

Type: Application

Filed: March 27, 2020

Publication date: October 1, 2020

Applicant: NVIDIA Corp.

Inventors: Johan Pontus Andersson, Tomas Guy Akenine-Möller, Jim Nilsson, Marco Salvi, Josef Spjut
Adaptive Pixel Sampling Order for Temporally Dense Rendering

Publication number: 20200314442

Abstract: A method dynamically selects one of a first sampling order and a second sampling order for a ray trace of pixels in a tile where the selection is based on a motion vector for the tile. The sampling order may be a bowtie pattern or an hourglass pattern.

Type: Application

Filed: March 26, 2020

Publication date: October 1, 2020

Applicant: NVIDIA Corp.

Inventors: Johan Pontus Andersson, Jim Nilsson, Tomas Guy Akenine-Möller
Electronically actuated retaining latch for AC-DC adapter removable plug assembly

Patent number: 10790628

Abstract: A power adapter has a solenoid actuated retaining latch controlled by an electronic circuit that detects the presence or absence of AC mains voltage. When the assembled AC-DC adapter and plug assembly are removed from the wall, the latch detects removal and unlocks the plug assembly for easy removal without undue force required by the user. The circuit is designed for minimal power consumption, and the solenoid only consumes power when it is engaging or disengaging the latch.

Type: Grant

Filed: May 18, 2018

Date of Patent: September 29, 2020

Assignee: NVIDIA Corporation

Inventors: Boris Landwehr, James Lee, Craig Crawford, Andrew Bell, Samuel Duell
Techniques for efficiently synchronizing data transmissions on a network

Patent number: 10789194

Abstract: Systems and techniques for synchronizing transactions between processing devices on an interconnection network are provided. Upon receiving a stream of posted transactions followed by a flush transaction from a source processing device connected to the interconnection network, the flush transaction is trapped before it enters the interconnecting network. Subsequently, based on monitoring for responses received from a destination processing device for transactions corresponding to the posted transactions, a flush response is generated and returned to the source processing device. The described techniques enable efficient synchronizing posted writes, posted atomics and the like over complex interconnection fabrics such that a first GPU can write data to a second GPU so that a third GPU can safely consume the data written to the second GPU.

Type: Grant

Filed: March 26, 2019

Date of Patent: September 29, 2020

Assignee: NVIDIA Corporation

Inventors: Larry R. Dennison, Mark Hummel, Glenn Dearth

prev … 75 76 77 78 79 80 81 82 83 … next