Patents Assigned to NVidia

Method and system for bin coalescing for parallel divide-and-conquer sorting algorithms

Patent number: 9619204

Abstract: A system and method for performing sorting. The method includes partitioning a plurality of keys needing sorting into a first plurality of bins, wherein the bins are sequentially sorted. The plurality of keys is capable of being sorted into a sequence of keys using a corresponding ordering system. The method includes coalescing a first pair of consecutive bins, such that when coalesced the first pair of bins falls below a threshold. The method also includes ordering keys in the first coalesced pair to generate a first sub-sequence of keys in the sequence of keys.

Type: Grant

Filed: June 14, 2013

Date of Patent: April 11, 2017

Assignee: Nvidia Corporation

Inventor: Duane Merrill
Grouping and analysis of data access hazard reports

Patent number: 9619364

Abstract: A method for analyzing race conditions between multiple threads of an application is disclosed. The method comprises accessing hazard records for an application under test. It further comprises creating a graph comprising a plurality of vertices and a plurality of edges using the hazard records, wherein each vertex of the graph comprises information about a code location of a hazard and wherein each edge of the graph comprises hazard information between one or more vertices. Additionally, it comprises assigning each edge with a weight, wherein the weight depends on a number and relative priority of hazards associated with a respective edge. Finally, it comprises traversing the graph to report an analysis record for each hazard represented in the graph.

Type: Grant

Filed: August 28, 2013

Date of Patent: April 11, 2017

Assignee: NVIDIA CORPORATION

Inventor: Vyas Venkataraman
Method and system of curve fitting for common focus measures

Patent number: 9621780

Abstract: An efficient method and system for estimating an optimal focus position for capturing an image are presented. Embodiments of the present invention initially determine an initial lens position dataset. Then, scores are calculated for each value of the initial lens position dataset producing a plurality of scores. Embodiments of the present invention then determine an optimum focus position through interpolation and extrapolation by relating the initial lens position dataset to the score dataset, in which the score dataset comprises of the plurality of scores.

Type: Grant

Filed: October 4, 2012

Date of Patent: April 11, 2017

Assignee: NVIDIA CORPORATION

Inventor: Hugh Phu Nguyen
Method and apparatus for simulating stiff stacks

Patent number: 9613449

Abstract: A computer implemented method of simulating a stack of objects represented as data within memory of a computer system is disclosed. The method comprises modeling the stack within a computer simulation as a set of associated primitives with associated constraints thereto in the memory, wherein the stack comprises a plurality of layers and wherein each layer comprises at least one primitive. The method further comprises estimating a height for each of the primitives in the stack and determining a respective scaling factor for each of the primitives in parallel, wherein each scaling factor is operable to adjust a mass value of each of the primitives. Also, the method comprises scaling a mass value of each of the primitives in accordance with a respective scaling factor in parallel. Finally, the method comprises solving over a plurality of constraints iteratively using a scaled mass value for each of the primitives.

Type: Grant

Filed: December 31, 2013

Date of Patent: April 4, 2017

Assignee: NVIDIA CORPORATION

Inventors: Matthias Mueller-Fischer, Miles Macklin
Confluence analysis and loop fast-forwarding for improving SIMD execution efficiency

Patent number: 9612811

Abstract: One embodiment of the present invention sets forth a method for causing thread convergence. The method includes determining that a control flow graph representing a first section of a program includes at least two non-overlapping paths that extend from a first divergent node to a candidate node. The method also includes determining that the first divergent node is not a dominator of the candidate node or that the candidate node is not a post-dominator of the first divergent node. The method further includes identifying an external node and inserting a first instruction configured to cause a predicate variable to be set to true for a first set of threads that is to execute the external node. The method additionally includes inserting into the program a second divergent node configured to cause various threads to execute or not execute a first control flow path associated with the external node.

Type: Grant

Filed: January 21, 2014

Date of Patent: April 4, 2017

Assignee: NVIDIA Corporation

Inventors: Amit Jayant Sabne, Yuan Lin, Vinod Grover
Jittered coverage accumulation path rendering

Patent number: 9613451

Abstract: One embodiment of the present invention sets forth a technique for rendering anti-aliased paths by first generating an alpha buffer representing coverage data. To generate the alpha buffer, jittered versions of the rendered path are rendered and corresponding stencil buffers indicating sub-pixel samples of the path that should be covered are generated. After each stencil buffer is generated, the jittered path is rasterized to convert the sub-pixel coverage into coverage weights that are stored in the alpha component of a frame buffer. As each jittered path is rasterized, the coverage weights are accumulated. Finally, geometry representing the union of the jittered versions of the path is rendered to shade pixels based on the accumulated coverage weights. The anti-aliased rendered paths may be filled or stroked without tessellating the paths.

Type: Grant

Filed: May 17, 2011

Date of Patent: April 4, 2017

Assignee: NVIDIA Corporation

Inventor: Mark J. Kilgard
Method and system for implementing a secure chain of trust

Patent number: 9613215

Abstract: A method, an integrated circuit and a system for implementing a secure chain of trust is disclosed. While executing secure boot code in a secure boot mode, less-secure boot code may be authenticated using a secret key. A secure key may also be calculated or generated during the secure boot mode. After control is turned over to the authenticated less-secure boot code, at least one application may be authenticated using the secure key. Once authenticated in the less-secure boot mode, the application may be executed by the programmable integrated circuit. In this manner, a secure chain of trust may be implemented for the programmable integrated circuit.

Type: Grant

Filed: April 10, 2008

Date of Patent: April 4, 2017

Assignee: NVIDIA CORPORATION

Inventors: Michael Cox, Phillip Smith
Higher accuracy Z-culling in a tile-based architecture

Patent number: 9612839

Abstract: A graphics processing pipeline configured for z-cull operations. The graphics processing pipeline comprising a screen-space pipeline and a tiling unit. The screen-space pipeline includes a z-cull unit configured to perform z-culling operations. The tiling unit is configured to determine that a first set of primitives overlaps a first cache tile. The tiling unit is also configured to transmit the first set of primitives to the screen-space pipeline for processing. The tiling unit is further configured to select between processing the first set of primitives in a full-surface z-cull mode or processing the first set of primitives in a partial-surface z-cull mode. The tiling unit is also configured to cause the z-cull unit to process the first set of primitives in the full-surface z-cull mode or to process the first set of primitives in the partial-surface z-cull mode.

Type: Grant

Filed: October 23, 2013

Date of Patent: April 4, 2017

Assignee: NVIDIA Corporation

Inventors: Ziyad S. Hakura, Jerome F. Duluk, Jr.
Power supply for ring-oscillator based true random number generator and method of generating true random numbers

Patent number: 9612801

Abstract: A true random number generator, a method of generating a true random number and a system incorporating the generator or the method. In one embodiment, the generator includes: (1) a ring oscillator including inverting gates having power inputs and (2) a time-varying power supply coupled to the power inputs to provide power thereto and including power perturbation circuitry operable to perturb the power provided to at least one of the power inputs.

Type: Grant

Filed: October 22, 2015

Date of Patent: April 4, 2017

Assignee: Nvidia Corporation

Inventor: Sachin Idgunji
System, method, and computer program product for implementing software-based scoreboarding

Patent number: 9612836

Abstract: A system, method, and computer program product are provided for implementing a software-based scoreboarding mechanism. The method includes the steps of receiving a dependency barrier instruction that includes an immediate value and an identifier corresponding to a first register and, based on a comparison of the immediate value to the value stored in the first register, dispatching a subsequent instruction to at least a first processing unit of two or more processing units.

Type: Grant

Filed: February 3, 2014

Date of Patent: April 4, 2017

Assignee: NVIDIA Corporation

Inventors: Robert Ohannessian, Jr., Michael Alan Fetterman, Olivier Giroux, Jack H. Choquette, Xiaogang Qiu, Shirish Gadre, Meenaradchagan Vishnu
Host context techniques for server based graphics processing

Patent number: 9613390

Abstract: The server based graphics processing techniques, describer herein, include receiving function calls by a three dimension graphics application programming interface host-guest communication manager (D3D HGCM) service module from one or more given instances of a guest shin layer through a communication channel of a host-guest communication manager (HGCM). The one or more given instances of the guest shim layer are executing under control of a respective given instance of a guest operating system. The HGCM and D3D HGCM service module are executing under control of a host operating system. The rendering context for each function call received from the each instance of the guest shim layer is determined by the D3D HGCM service module. Each function call of a given rendering context is sent by the D3D HGCM service module to a corresponding device specific kernel mode driver of a given graphics processing unit for scheduling execution with the given graphics processing unit of the given rendering context.

Type: Grant

Filed: October 29, 2013

Date of Patent: April 4, 2017

Assignee: NVIDIA CORPORATION

Inventor: Franck Diard
Snoop and replay for completing bus transaction

Patent number: 9612994

Abstract: Systems and devices configured to implement techniques for ensuring the completion of transactions while minimizing latency and power consumption are described. A device may be operably coupled to a bidirectional communications bus. A bidirectional communications bus may include a clock line and a data line. The device may be configured to determine if an initiated transaction corresponds to a device in a low power state. The device may pause the transaction. The device may replay portions of the transaction when the device is in an appropriate power state. The device may replay portions of the transaction using an override interface.

Type: Grant

Filed: September 18, 2013

Date of Patent: April 4, 2017

Assignee: NVIDIA Corporation

Inventors: Kevin Wong, Craig Ross, Thomas Dewey
Audio channel mapping in a portable electronic device

Patent number: 9615176

Abstract: A portable electronic device is provided having an audio subsystem with a plurality of audio devices, each of which is coupled to a logic subsystem via its own audio path. The portable electronic device may also include a display configured to present visual content, with the display being fixed in position relative to the plurality of audio devices. The portable electronic device further includes an orientation sensor electronically coupled to the logic subsystem, the logic subsystem being configured, using data received from the orientation sensor, (i) to determine whether the portable electronic device has been reoriented; and (ii) in response to such determination, vary operation of one or more of the audio paths.

Type: Grant

Filed: December 28, 2012

Date of Patent: April 4, 2017

Assignee: NVIDIA CORPORATION

Inventor: Mark Pereira
Variable-width differential memory compression

Patent number: 9607407

Abstract: A method, in one embodiment, can include performing difference transformation of image samples. In addition, the method can also include performing length selection. Furthermore; the method can include performing packing that includes utilizing varying sized bit fields to produce a compressed representation.

Type: Grant

Filed: December 31, 2012

Date of Patent: March 28, 2017

Assignee: NVIDIA CORPORATION

Inventors: Jonathan Dunaisky, David Kirk McAllister, William Craig McKnight
Method and apparatus for watermarking binary computer code

Patent number: 9607133

Abstract: A method and apparatus for inserting a watermark into a compiled computer program. A location process specifies an insertion point in the compiled program and a watermark generating process inserts a watermark, based on data to be encoded, into the program at the insertion point. The location process is also utilized to specify the location of watermark data to be decoded.

Type: Grant

Filed: November 9, 2007

Date of Patent: March 28, 2017

Assignee: NVIDIA Corporation

Inventors: Robert Rubin, Eric Murray
Hardware command training for memory using write leveling mechanism

Patent number: 9607714

Abstract: A method of training a command signal for a memory module. The method includes programming a memory controller into a mode where a single bit of an address signal is active for a single clock cycle. The method then programs a programmable delay line of the address signal with a delay value and performs initialization of the memory module. The memory module is then placed in a write leveling mode. A write leveling procedure is then performed and a response to the write leveling procedure is determined from the memory module. A determination is made whether the memory module is in a pass state or an error state based on the response.

Type: Grant

Filed: December 27, 2012

Date of Patent: March 28, 2017

Assignee: NVIDIA CORPORATION

Inventors: Venkata Ramana Malladi, Tony Yuhsiang Cheng, Sharath Raghava, Ambuj Kumar, Arunjit Sahni, Paul Lam
Method and system for resolving thread divergences

Patent number: 9606808

Abstract: A computing device detects divergences between threads in a thread group executing on a parallel processing unit. The computing device includes an address divergence unit that identifies a subset of non-divergent threads included in the thread group. The address divergence unit stores instructions related to the subset of non-divergent threads in a multi-issue queue. The address divergence unit causes the instructions related to the subset of non-divergent threads to be retrieved from the multi-issue queue when the parallel processing unit is available. The address divergence unit causes the subset of non-divergent threads to be issued for execution on the parallel processing unit. The address divergence unit repeats the identifying, storing, and causing steps for the remaining threads in the thread group that are not included in the subset of non-divergent threads.

Type: Grant

Filed: January 11, 2012

Date of Patent: March 28, 2017

Assignee: NVIDIA Corporation

Inventors: Jack Choquette, Xiaogang Qiu, Jeff Tuckey, Michael (Ming Yiu) Siu, Robert J. Stoll, Olivier Giroux
Estimating channel information

Patent number: 9602230

Abstract: Disclosed is a method of providing channel state information for a desired downlink channel of a wireless communication system. In a configuration phase, the method comprises receiving on a signaling channel configuration information comprising an identifier of an interference source and an association which associates the identifier with at least one resource element not used for transmission on the desired downlink channel. In an estimation phase, the method comprises estimating channel state information for an expected transmission on the desired downlink channel accounting for an incoming interference transmission from the identified interference source as observed from the at least one resource element. In a reporting phase, the method comprises reporting the channel state information.

Type: Grant

Filed: March 13, 2015

Date of Patent: March 21, 2017

Assignee: Nvidia Corporation

Inventors: Timo Roman, Tommi Koivisto, Tero Kuosmanen, Pekka Janis
Parallel multicolor incomplete LU factorization preconditioning processor and method of use thereof

Patent number: 9600446

Abstract: A preconditioner processor and a method of computing a preconditioning matrix. In one embodiment, the preconditioner processor has parallel computing pipelines including: (1) a graph coloring circuit operable to identify parallelisms in a sparse linear system, (2) an incomplete lower triangle, upper triangle factorization (ILU) computer configured to employ the parallel computing pipelines according to the parallelisms to: (2a) determine a sparsity pattern for an ILU preconditioning matrix, and (2b) compute non-zero elements of the ILU preconditioning matrix according to the sparsity pattern, and (3) a memory communicably couplable to the parallel computing pipelines and configured to store the ILU preconditioning matrix.

Type: Grant

Filed: August 8, 2013

Date of Patent: March 21, 2017

Assignee: Nvidia Corporation

Inventors: Robert Strzodka, Julien Demouth, Patrice Castonguay
Slice ordering for video encoding

Patent number: 9602821

Abstract: For encoding, a frame of video data can be segregated into macroblocks, which can be segregated into slices, which in turn can be segregated into slice groups. A macroblock identifier (ID) can be associated with each of the macroblocks. When at least one slice from each of the slice groups has been encoded, the macroblock IDs associated with the encoded slices can be compared to determine an order in which the encoded slices are to be placed in an access unit for the frame. Of the encoded slices, the slice that includes the macroblock with the lowest macroblock ID will be placed in the access unit before the other encoded slices.

Type: Grant

Filed: October 1, 2008

Date of Patent: March 21, 2017

Assignee: Nvidia Corporation

Inventors: Harikrishna M. Reddy, Yiu Cheong Ho, Cheng-Chiang Chen

prev … 111 112 113 114 115 116 117 118 119 … next