Patents by Inventor NVIDIA Corporation

NVIDIA Corporation has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20130169651
    Abstract: Circuits, methods, and apparatus that perform a context switch quickly while not wasting a significant amount of in-progress work. A texture pipeline includes a cutoff point or stage. After receipt of a context switch instruction, texture requests and state updates above the cutoff point are stored in a memory, while those below the cutoff point are processed before the context switch is completed. After this processing is complete, global states in the texture pipeline are stored in the memory. A previous context may then be restored by reading its texture requests and global states from the memory and loading them into the texture pipeline. The location of the cutoff point can be a point in the pipeline where a texture request can no longer result in a page fault in the memory.
    Type: Application
    Filed: February 25, 2013
    Publication date: July 4, 2013
    Applicant: NVIDIA Corporation
    Inventor: NVIDIA Corporation
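
    A toy host-side model of the cutoff policy this abstract describes: requests already past the cutoff are drained to completion, while requests before it are saved for replay when the context is restored. The `TexRequest` structure, the stage numbering, and the cutoff value are assumptions made purely for illustration, not details from the publication.

    ```cuda
    #include <cstdio>
    #include <deque>
    #include <vector>

    // Hypothetical model: each in-flight texture request sits at some pipeline stage.
    // Requests past the cutoff (where they can no longer page-fault) are drained;
    // the rest are saved so they can be replayed when the context is restored.
    struct TexRequest { int id; int stage; };
    const int kCutoffStage = 3;  // illustrative cutoff, not from the publication

    int main() {
        std::deque<TexRequest> pipeline = {{0, 5}, {1, 4}, {2, 2}, {3, 1}};
        std::vector<TexRequest> saved;   // stands in for the backing memory

        for (const TexRequest& r : pipeline) {
            if (r.stage > kCutoffStage)
                std::printf("draining request %d (past cutoff)\n", r.id);
            else
                saved.push_back(r);      // replayed on context restore
        }
        std::printf("%zu requests saved for the next restore\n", saved.size());
        return 0;
    }
    ```
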
  • Publication number: 20130152109
    Abstract: A method of executing a physics simulation is performed in a system comprising a computational platform, a main application stored in the computational platform, a secondary application stored in the computational platform, and a cloth application programming interface (API) implemented in the computational platform. The method defines a cloth simulation call in the cloth API, and by operation of the main application, invokes a software routine using the cloth simulation call. Additionally, by operation of the secondary application, a state of the physics simulation is updated in response to the software routine.
    Type: Application
    Filed: November 30, 2012
    Publication date: June 13, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA CORPORATION
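
    A minimal host-side sketch of the calling pattern described above: the main application invokes a cloth simulation call defined by the API, and a routine supplied by a secondary application updates the physics state in response. `ClothAPI`, `ClothState`, and `cloth_simulate` are hypothetical names, not taken from the publication.

    ```cuda
    #include <cstdio>
    #include <functional>
    #include <vector>

    struct ClothState { std::vector<float> positions; };

    // Hypothetical cloth API exposing a single simulation call.
    struct ClothAPI {
        std::function<void(ClothState&, float)> cloth_simulate;
    };

    int main() {
        ClothState state{std::vector<float>(3 * 1024, 0.0f)};
        ClothAPI api;

        // Secondary application registers the routine that updates simulation state.
        api.cloth_simulate = [](ClothState& s, float dt) {
            for (float& p : s.positions) p += 0.5f * dt;  // placeholder integration step
        };

        // Main application drives the simulation through the cloth simulation call.
        for (int frame = 0; frame < 3; ++frame) api.cloth_simulate(state, 1.0f / 60.0f);
        std::printf("first coordinate after 3 steps: %f\n", state.positions[0]);
        return 0;
    }
    ```
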
  • Publication number: 20130152035
    Abstract: Embodiments of the claimed subject matter are directed to methods and a system that use a standardized grid of clock buffers to automatically route clocks according to a uniform clock grid throughout an ASIC having a non-uniform arrangement of non-uniformly sized logic partitions. According to one embodiment, clock sources and sinks are mapped to grid point locations and a novel grid routing process is performed to link them together. A clock routing macro is assigned to a corresponding partition and associated with that partition or logic unit according to a partition hierarchy. The underlying routing structure and resources of a clock routing macro are automatically renamed to correspond to the local partition in a script, a schedule of programmed instructions, or a routing map. The positions of blockages within a partition may also be detected, and alternate routes for traversing a blockage may be determined preemptively.
    Type: Application
    Filed: February 8, 2013
    Publication date: June 13, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
  • Publication number: 20130138926
    Abstract: An indirect branch instruction takes an address register as an argument in order to provide indirect function call capability for single-instruction multiple-thread (SIMT) processor architectures. The indirect branch instruction is used to implement indirect function calls, virtual function calls, and switch statements to improve processing performance compared with using sequential chains of tests and branches.
    Type: Application
    Filed: November 12, 2012
    Publication date: May 30, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA CORPORATION
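
    A short CUDA sketch of the kind of construct such an instruction supports: each thread calls through a device function pointer, which is lowered to an indirect call/branch rather than a sequential chain of tests and branches. The kernel and function names are illustrative only.

    ```cuda
    #include <cstdio>

    __device__ float op_add(float a, float b) { return a + b; }
    __device__ float op_mul(float a, float b) { return a * b; }

    typedef float (*binop_t)(float, float);

    // Each thread selects a target through a function pointer; the call site is
    // compiled to an indirect branch instead of a test-and-branch chain.
    __global__ void apply(const int* sel, float* out, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i >= n) return;
        binop_t fn = (sel[i] == 0) ? op_add : op_mul;
        out[i] = fn(2.0f, 3.0f);
    }

    int main() {
        const int n = 8;
        int h_sel[n] = {0, 1, 0, 1, 0, 1, 0, 1};
        int* d_sel; float* d_out;
        cudaMalloc(&d_sel, n * sizeof(int));
        cudaMalloc(&d_out, n * sizeof(float));
        cudaMemcpy(d_sel, h_sel, n * sizeof(int), cudaMemcpyHostToDevice);
        apply<<<1, n>>>(d_sel, d_out, n);
        float h_out[n];
        cudaMemcpy(h_out, d_out, n * sizeof(float), cudaMemcpyDeviceToHost);
        for (int i = 0; i < n; ++i) std::printf("%g ", h_out[i]);
        std::printf("\n");
        cudaFree(d_sel); cudaFree(d_out);
        return 0;
    }
    ```
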
  • Publication number: 20130132707
    Abstract: A system, method, and computer program product are provided for assigning elements of a matrix to processing threads. In use, a matrix is received to be processed by a parallel processing architecture. Such parallel processing architecture includes a plurality of processors, each capable of processing a plurality of threads. Elements of the matrix are assigned to each of the threads for processing, utilizing an algorithm that increases the contiguity of the elements processed by each thread.
    Type: Application
    Filed: January 15, 2013
    Publication date: May 23, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA CORPORATION
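
    One simple way to increase the contiguousness of each thread's work is to hand every thread a contiguous slice of the matrix, as in the sketch below; this slicing scheme is an assumption for illustration, not necessarily the algorithm claimed.

    ```cuda
    // Each thread scales a contiguous run of matrix elements. Assigning consecutive
    // elements to one thread (rather than striding elements across threads) keeps
    // each thread's accesses contiguous in memory.
    __global__ void scale_contiguous(float* m, int n_elems, int per_thread, float s) {
        int t = blockIdx.x * blockDim.x + threadIdx.x;
        int begin = t * per_thread;                    // this thread's contiguous slice
        int end = min(begin + per_thread, n_elems);
        for (int i = begin; i < end; ++i) m[i] *= s;
    }
    ```
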
  • Publication number: 20130120408
    Abstract: A general-purpose graphics processing unit (GPU) module, a system containing the general-purpose GPU module, and a method for driving the system are provided in accordance with various embodiments of the invention. In an embodiment, a general-purpose GPU module comprises a GPU, a data transfer input/output (I/O) port, a power supply I/O port, a control/SYNC module, and a power supply module. When a new general-purpose GPU module is detected as being coupled to the transfer link bus, graphics processing tasks are allocated across all of the coupled general-purpose GPU modules. In accordance with various embodiments of the invention, the costs of designing and using GPUs are reduced.
    Type: Application
    Filed: November 5, 2012
    Publication date: May 16, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
  • Publication number: 20130124772
    Abstract: In one embodiment, a computer system comprises two or more graphics cards, each graphics card comprising a graphics processing unit and an interface. An interface of the first graphics card is coupled to an interface of the second graphics card for enabling communication between the first and second graphics cards. A cable couples the interface of the first graphics card with the interface of the second graphics card. The speed of data exchange between the graphics cards of the computer system is increased, while the layout of each graphics card's PCB (printed circuit board) remains simple and its cost low.
    Type: Application
    Filed: November 15, 2012
    Publication date: May 16, 2013
    Applicant: NVIDIA Corporation
    Inventor: NVIDIA Corporation
  • Publication number: 20130120524
    Abstract: In one embodiment, a car navigation device is provided. The device comprises: at least one wide-angle camera; a video correction unit for acquiring video data from the wide-angle camera and correcting the video data; a video merging unit for acquiring corrected video data from the video correction unit and merging the corrected video data; an image recognition unit for acquiring video from the video merging unit and performing image recognition on the video; and a driving assistant unit for acquiring data from the image recognition unit and assisting driving in accordance with the recognized content. The navigation device provided by various embodiments in accordance with the present invention can correct and recognize the images taken by the fisheye lens in real time so as to assist the driver or drive the car automatically without human intervention.
    Type: Application
    Filed: November 14, 2012
    Publication date: May 16, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
  • Publication number: 20130121563
    Abstract: In one embodiment, a method of prioritized compression for 3D video wireless display is provided, the method comprising: inputting video data; extracting scene depth from the video data; estimating the foreground and background of each image of the video data; applying different kinds of compression to the foreground and background in each image; and outputting the processed video data. Thus, image quality is not affected by data loss during the wireless transmission.
    Type: Application
    Filed: November 13, 2012
    Publication date: May 16, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
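
    As a hedged illustration of applying different compression to foreground and background, the CUDA kernel below quantizes background pixels more coarsely than foreground pixels based on a depth threshold; the bit depths and threshold are arbitrary and not taken from the publication.

    ```cuda
    // Foreground (near) pixels keep more precision than background (far) pixels,
    // a minimal stand-in for applying different compression per region.
    __global__ void prioritized_quantize(unsigned char* img, const float* depth,
                                         int n, float depth_threshold) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i >= n) return;
        int drop_bits = (depth[i] < depth_threshold) ? 2 : 4;  // coarser in background
        img[i] = (unsigned char)((img[i] >> drop_bits) << drop_bits);
    }
    ```
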
  • Publication number: 20130119783
    Abstract: In one embodiment, a power adapter is provided, comprising: an input for coupling to an external power supply; a transformer module for converting the external power supply voltage to a voltage for an electronic device; a relay coupled between the input and the transformer module; a reset switch coupled to the relay for resetting the relay; a detection input for receiving a control signal from the electronic device to control operation of the relay; and an output for outputting the voltage for charging or supplying power to the electronic device, the output coupled to the transformer module and the relay.
    Type: Application
    Filed: November 13, 2012
    Publication date: May 16, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
  • Publication number: 20130117631
    Abstract: One embodiment of the present invention sets forth a technique for protecting data with an error correction code (ECC). The data is accessed by a processing unit and stored in an external memory, such as dynamic random access memory (DRAM). Application data and related ECC data are advantageously stored in a common page within a common DRAM device. Application data and ECC data are transmitted between the processor and the external common DRAM device over a common set of input/output (I/O) pins. Eliminating I/O pins and DRAM devices conventionally associated with transmitting and storing ECC data advantageously reduces system complexity and cost.
    Type: Application
    Filed: October 25, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA CORPORATION
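
    A small host-side sketch of the addressing idea: application data and its ECC bytes share a DRAM page, so both are reachable over the same I/O pins without a dedicated ECC device. The 4 KB page size and the 3968/128 data-to-ECC split (31 data bytes per ECC byte) are assumptions for illustration only.

    ```cuda
    #include <cstdint>
    #include <cstdio>

    // Hypothetical layout: each 4 KB page = 3968 bytes of application data
    // followed by 128 bytes of ECC covering that data, all in the same page.
    constexpr uint64_t kPageSize    = 4096;
    constexpr uint64_t kDataPerPage = 3968;

    struct PageAddr { uint64_t data_addr; uint64_t ecc_addr; };

    PageAddr locate(uint64_t app_offset) {
        uint64_t page   = app_offset / kDataPerPage;
        uint64_t within = app_offset % kDataPerPage;
        PageAddr a;
        a.data_addr = page * kPageSize + within;                       // data byte
        a.ecc_addr  = page * kPageSize + kDataPerPage + within / 31;   // ECC byte, same page
        return a;
    }

    int main() {
        PageAddr a = locate(5000);
        std::printf("data byte at %llu, ECC byte at %llu\n",
                    (unsigned long long)a.data_addr, (unsigned long long)a.ecc_addr);
        return 0;
    }
    ```
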
  • Publication number: 20130117548
    Abstract: One embodiment of the present invention sets forth a technique for reducing the number of assembly instructions included in a computer program. The technique involves receiving a directed acyclic graph (DAG) that includes a plurality of nodes, where each node includes an assembly instruction of the computer program, hierarchically parsing the plurality of nodes to identify at least two assembly instructions that are vectorizable and can be replaced by a single vectorized assembly instruction, and replacing the at least two assembly instructions with the single vectorized assembly instruction.
    Type: Application
    Filed: October 25, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
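
    The comments in the sketch below show the flavor of the replacement at the assembly level (two scalar PTX loads folded into one vector load), and the kernel shows the same effect expressed at the CUDA source level. This illustrates the kind of transformation described, not the publication's DAG-parsing algorithm itself.

    ```cuda
    // Before vectorization: two scalar loads from adjacent addresses, e.g.
    //   ld.global.f32    %f0, [%rd1];
    //   ld.global.f32    %f1, [%rd1+4];
    // After vectorization: a single vectorized load replaces both, e.g.
    //   ld.global.v2.f32 {%f0, %f1}, [%rd1];
    //
    // The same effect expressed explicitly at the source level:
    __global__ void sum_pairs(const float2* in, float* out, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i >= n) return;
        float2 v = in[i];        // one 64-bit vectorized load
        out[i] = v.x + v.y;      // consumes both lanes
    }
    ```
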
  • Publication number: 20130117734
    Abstract: A device compiler and linker within a parallel processing unit (PPU) is configured to optimize program code of a co-processor enabled application by rematerializing a subset of live-in variables for a particular block in a control flow graph generated for that program code. The device compiler and linker identifies the block of the control flow graph that has the greatest number of live-in variables, then selects a subset of the live-in variables associated with the identified block for which rematerializing confers the greatest estimated profitability. The profitability of rematerializing a given subset of live-in variables is determined based on the number of live-in variables reduced, the cost of rematerialization, and the potential risk of rematerialization.
    Type: Application
    Filed: November 5, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
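
    A hedged host-side sketch of the selection step: each candidate subset of live-in variables is scored from the number of live-ins it removes, its recomputation cost, and its risk, and the highest-scoring subset wins. The `Candidate` fields and the linear scoring are assumptions, not the publication's actual heuristic.

    ```cuda
    #include <algorithm>
    #include <vector>

    // Hypothetical candidate: one subset of live-in variables considered for
    // rematerialization in the block with the most live-ins.
    struct Candidate {
        int   live_ins_reduced;  // register-pressure relief if rematerialized
        float remat_cost;        // extra instructions needed to recompute the values
        float remat_risk;        // penalty, e.g. recomputation landing on a hot path
    };

    float profitability(const Candidate& c) {
        return static_cast<float>(c.live_ins_reduced) - c.remat_cost - c.remat_risk;
    }

    // Select the subset with the greatest estimated profitability.
    const Candidate* most_profitable(const std::vector<Candidate>& cands) {
        auto it = std::max_element(cands.begin(), cands.end(),
            [](const Candidate& a, const Candidate& b) {
                return profitability(a) < profitability(b);
            });
        return it == cands.end() ? nullptr : &*it;
    }
    ```
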
  • Publication number: 20130117737
    Abstract: One embodiment of the present invention sets forth a technique for reducing sign-extension instructions (SEIs) included in a computer program. The technique involves receiving intermediate code that is associated with the computer program and includes a first SEI located in a loop structure within the computer program, determining that the first SEI is eligible to be moved outside of the loop structure, inserting into a preheader of the loop a second SEI that, when executed by a processor, promotes an original value targeted by the first SEI from a smaller type to a larger type, and replacing the first SEI with one or more intermediate instructions that are eligible for additional compiler optimizations.
    Type: Application
    Filed: October 26, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
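
    A before/after sketch of the loop shape involved, assuming the compiler has proven the 32-bit addition cannot overflow: the per-iteration sign extension disappears once the promotion is hoisted into the loop preheader. The function names are illustrative.

    ```cuda
    // Before: converting (base + i) from 32-bit to 64-bit inside the loop emits a
    // sign-extension instruction on every iteration.
    __device__ void copy_before(float* out, const float* in, int base, int n) {
        for (int i = 0; i < n; ++i)
            out[(long long)(base + i)] = in[i];   // sign-extend each iteration
    }

    // After: the promotion happens once in the loop preheader, and the body uses
    // 64-bit arithmetic directly (legal when base + i cannot overflow 32 bits).
    __device__ void copy_after(float* out, const float* in, int base, int n) {
        long long wide_base = (long long)base;    // single, hoisted sign extension
        for (long long i = 0; i < n; ++i)
            out[wide_base + i] = in[i];
    }
    ```
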
  • Publication number: 20130113803
    Abstract: One embodiment of the invention sets forth a mechanism for interleaving consecutive display frames rendered at complementary reduced resolutions. The GPU driver configures a command stream associated with a frame received from a graphics application for reduced frame rendering. The command stream specifies a nominal resolution at which the frame should be rendered. The reduced resolution associated with the frame is determined based on the reduced resolution of an immediately preceding frame (i.e., the complementary reduced resolution), if one exists, or on GPU configuration information. The GPU driver then modifies the command stream to specify the reduced resolution. The GPU driver also inserts an upscale command sequence specifying the nominal resolution into the command stream. Once the command stream is configured in such a manner, the GPU driver transmits the command stream to the GPU for reduced rendering.
    Type: Application
    Filed: October 22, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
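
    A hedged host-side sketch of the driver's resolution choice: alternate frames drop half the rows or half the columns, so consecutive frames carry complementary detail before being upscaled to the nominal resolution. The half-height/half-width pairing is an assumption; the abstract only requires that the reductions be complementary.

    ```cuda
    #include <cstdio>

    struct Res { int w, h; };

    // Pick a reduced resolution that complements the one used for the previous frame.
    Res reduced_resolution_for(int frame_index, Res nominal) {
        if (frame_index % 2 == 0) return {nominal.w, nominal.h / 2};  // keep columns
        return {nominal.w / 2, nominal.h};                            // keep rows
    }

    int main() {
        Res nominal{1920, 1080};
        for (int f = 0; f < 4; ++f) {
            Res r = reduced_resolution_for(f, nominal);
            std::printf("frame %d: render at %dx%d, upscale command targets %dx%d\n",
                        f, r.w, r.h, nominal.w, nominal.h);
        }
        return 0;
    }
    ```
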
  • Publication number: 20130113809
    Abstract: A device compiler and linker is configured to optimize program code of a co-processor enabled application by resolving generic memory access operations within that program code to target specific memory spaces. In situations where a generic memory access operation cannot be resolved and may target constant memory, constant variables associated with those generic memory access operations are transferred to reside in global memory.
    Type: Application
    Filed: October 24, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA CORPORATION
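
    A CUDA sketch of the situation the pass addresses: a helper reads through a generic (unqualified) pointer, so the targeted memory space is unknown at the call site; when that access might target constant memory and cannot be resolved, the variable would instead be placed in global memory. The variable and function names are illustrative.

    ```cuda
    // Candidate variable: if the generic access below cannot be resolved, the
    // strategy described would move it from __constant__ to __device__ (global) memory.
    __constant__ float c_scale = 2.0f;

    // Generic load: the address space of 'p' is not known here.
    __device__ float load_generic(const float* p) { return *p; }

    __global__ void kernel(float* out, const float* extra) {
        // Call sites with pointers into different spaces force the generic form.
        out[threadIdx.x] = load_generic(&c_scale) + load_generic(extra);
    }
    ```
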
  • Publication number: 20130117735
    Abstract: One embodiment of the present invention sets forth a technique for extracting a memory address offset from a 64-bit type-conversion expression included in high-level source code of a computer program. The technique involves receiving the 64-bit type-conversion expression, where the 64-bit type-conversion expression includes one or more 32-bit expressions, determining a range for each of the one or more 32-bit expressions, calculating a total range by summing the ranges of the 32-bit expressions, determining that the total range is a subset of a range for a 32-bit unsigned integer, calculating the memory address offset based on the ranges for the one or more 32-bit expressions, and generating at least one assembly-level instruction that references the memory address offset.
    Type: Application
    Filed: October 24, 2012
    Publication date: May 9, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
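
    A CUDA sketch of the source-level shape the technique targets: a 64-bit conversion wrapping a sum of 32-bit expressions whose combined range provably fits an unsigned 32-bit value, letting the constant term become an immediate byte offset on the load. The kernel is illustrative, not from the publication.

    ```cuda
    __global__ void gather(float* out, const float* base, const int* idx, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i >= n) return;
        int j = idx[i];   // assume range analysis shows 0 <= j < n
        // 64-bit type-conversion expression over 32-bit terms: (long long)(j + 4).
        // If the summed ranges stay within a 32-bit unsigned integer, the "+ 4"
        // can be emitted as a constant +16-byte offset on the generated load
        // instead of extra 64-bit address arithmetic.
        out[i] = base[(long long)(j + 4)];
    }
    ```
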
  • Publication number: 20130093766
    Abstract: Vertex data can be accessed for a graphics primitive. The vertex data includes homogeneous coordinates for each vertex of the primitive. The homogeneous coordinates can be used to determine perspective-correct barycentric coordinates that are normalized by the area of the primitive. The normalized perspective-correct barycentric coordinates can be used to determine an interpolated value of an attribute for the pixel. These operations can be performed using adders and multipliers implemented in hardware.
    Type: Application
    Filed: December 5, 2012
    Publication date: April 18, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
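
    A small device function showing the arithmetic the abstract describes, in software form: sub-triangle areas are divided by the per-vertex homogeneous w, normalized by their sum, and used to blend the vertex attribute. The parameter names are illustrative.

    ```cuda
    // Perspective-correct, area-normalized barycentric interpolation of one attribute.
    // a0,a1,a2: screen-space sub-triangle areas at the pixel (they sum to the primitive area)
    // w0,w1,w2: homogeneous w coordinates of the three vertices
    // v0,v1,v2: the attribute value at each vertex
    __device__ float interp_perspective(float a0, float a1, float a2,
                                        float w0, float w1, float w2,
                                        float v0, float v1, float v2) {
        float b0 = a0 / w0, b1 = a1 / w1, b2 = a2 / w2;   // perspective weights
        float norm = 1.0f / (b0 + b1 + b2);               // normalize by the weighted area
        return (b0 * v0 + b1 * v1 + b2 * v2) * norm;      // interpolated attribute
    }
    ```
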
  • Publication number: 20130070880
    Abstract: A system and method are provided for determining a time for safely sampling a signal of a clock domain. In one embodiment, a phase estimate of a first clock domain is calculated based on a relative frequency estimate between a second clock domain and the first clock domain. Based on the phase estimate, a first time is determined during which a signal from the first clock domain is unchanging, such that the signal can be safely sampled by the second clock domain to generate a first sampled signal in the second clock domain. Additionally, an updated phase estimate is calculated and, based on the updated phase estimate, a second time is determined during which the signal from the first clock domain is changing, such that the signal cannot be safely sampled by the second clock domain. During the second time, the first sampled signal in the second clock domain is maintained.
    Type: Application
    Filed: November 12, 2012
    Publication date: March 21, 2013
    Applicant: NVIDIA CORPORATION
    Inventor: NVIDIA Corporation
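
    A host-side sketch of the phase-tracking idea: the source clock's phase, as seen from the sampling clock, is advanced by a relative-frequency estimate each sampling cycle, and a sample is taken only when the estimated phase sits away from the source clock's transition; otherwise the previous sample is held. The frequency ratio and keep-out margin are illustrative values, not from the publication.

    ```cuda
    #include <cmath>
    #include <cstdio>

    int main() {
        const double rel_freq = 0.37;  // source-clock cycles per sampling-clock cycle (assumed)
        const double margin   = 0.10;  // keep-out window around the source-clock edge (assumed)
        double phase = 0.0;            // estimated fractional phase of the source clock

        for (int cycle = 0; cycle < 10; ++cycle) {
            phase = std::fmod(phase + rel_freq, 1.0);
            // Near phase 0 (or 1) the source signal may be changing; elsewhere it is
            // stable and can be sampled safely.
            bool safe = (phase > margin) && (phase < 1.0 - margin);
            std::printf("cycle %d: phase %.2f -> %s\n",
                        cycle, phase, safe ? "sample signal" : "hold previous sample");
        }
        return 0;
    }
    ```
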