Patents by Inventor Ashish Karandikar

Ashish Karandikar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Enhanced in-system test coverage based on detecting component degradation

Patent number: 11693753

Abstract: In various examples, permanent faults in hardware component(s) and/or connections to the hardware component(s) of a computing platform may be predicted before they occur using in-system testing. As a result of this prediction, one or more remedial actions may be determined to enhance the safety of the computing platform (e.g., an autonomous vehicle). A degradation rate of a performance characteristic associated with the hardware component may be determined, detected, and/or computed by monitoring values of performance characteristics over time using fault testing.

Type: Grant

Filed: October 15, 2019

Date of Patent: July 4, 2023

Assignee: NVIDIA Corporation

Inventors: Gunaseelan Ponnuvel, Ashish Karandikar
ENHANCED IN-SYSTEM TEST COVERAGE BASED ON DETECTING COMPONENT DEGRADATION

Publication number: 20200117565

Abstract: In various examples, permanent faults in hardware component(s) and/or connections to the hardware component(s) of a computing platform may be predicted before they occur using in-system testing. As a result of this prediction, one or more remedial actions may be determined to enhance the safety of the computing platform (e.g., an autonomous vehicle). A degradation rate of a performance characteristic associated with the hardware component may be determined, detected, and/or computed by monitoring values of performance characteristics over time using fault testing.

Type: Application

Filed: October 15, 2019

Publication date: April 16, 2020

Inventors: Gunaseelan Ponnuvel, Ashish Karandikar
Pipelined L2 cache for memory transfers for a video processor

Patent number: 9111368

Abstract: A method for using a pipelined L2 cache to implement memory transfers for a video processor. The method includes accessing a queue of read requests from a video processor. For each of the read requests, a determination is made as to whether there is a cache line hit corresponding to the request. For each cache line miss, a cache line slot is allocated to store a new cache line responsive to the cache line miss. An in-order set of cache lines is output to the video processor responsive to the queue of read requests.

Type: Grant

Filed: November 4, 2005

Date of Patent: August 18, 2015

Assignee: NVIDIA CORPORATION

Inventors: Ashish Karandikar, Shirish Gadre, Franciscus W. Sijstermans, Zhiqiang Jonathan Su
Methods and systems for command acceleration in a video processor via translation of scalar instructions into vector instructions

Patent number: 8738891

Abstract: A method for implementing command acceleration. The method includes receiving a first set of instructions from a first processor, wherein the first set of instructions are formatted in accordance with a microarchitecture of the first processor. The first set of instructions are translated into a second set of instructions, wherein the second set of instructions are formatted in accordance with a microarchitecture of a second processor. The second set instructions are then transmitted to the second processor for execution by the second processor.

Type: Grant

Filed: November 4, 2005

Date of Patent: May 27, 2014

Assignee: NVIDIA Corporation

Inventors: Ashish Karandikar, Shirish Gadre, Amir H. Salek
Programmable DMA engine for implementing memory transfers and video processing for a video processor

Patent number: 8736623

Abstract: A method for using a programmable DMA engine to implement memory transfers and video processing for a video processor. A DMA control program is configured for controlling DMA memory transfers between a frame buffer memory and a video processor. The DMA control program is stored in the DMA engine. A DMA request can be received from the video processor. The DMA control program is executable to implement the DMA request for the video processor. The DMA engine is operable to execute low-level command for accessing the frame buffer memory to implement a high-level command.

Type: Grant

Filed: November 4, 2005

Date of Patent: May 27, 2014

Assignee: NVIDIA Corporation

Inventors: Stephen D. Lew, Shirish Gadre, Ashish Karandikar, Franciscus W. Sijstermans
Configurable SIMD engine with high, low and mixed precision modes

Patent number: 8725990

Abstract: A configurable SIMD engine in a video processor for executing video processing operations. The engine includes a SIMD component having a plurality of inputs for receiving input data and a plurality of outputs for providing output data. A plurality of execution units are included in the SIMD component. Each of the execution units comprise a first and a second data path, and are configured for selectively implementing arithmetic operations on a set of low precision or high precision inputs. Each of the execution units have a first configuration and a second configuration, such that the first data path and the second data path are combined to produce a single high precision output in the first configuration, and such that the first data path and the second data path are partitioned to produce a respective first low precision output and second low precision output in the second configuration.

Type: Grant

Filed: November 4, 2005

Date of Patent: May 13, 2014

Assignee: Nvidia Corporation

Inventors: Ashish Karandikar, Pooja Agarwal
Video processor having scalar and vector components

Patent number: 8698817

Abstract: A video processor for executing video processing operations. The video processor includes a host interface for implementing communication between the video processor and a host CPU. A memory interface is included for implementing communication between the video processor and a frame buffer memory. A scalar execution unit is coupled to the host interface and the memory interface and is configured to execute scalar video processing operations. A vector execution unit is coupled to the host interface and the memory interface and is configured to execute vector video processing operations.

Type: Grant

Filed: November 4, 2005

Date of Patent: April 15, 2014

Assignee: Nvidia Corporation

Inventors: Shirish Gadre, Ashish Karandikar, Stephen D. Lew, Christopher T. Cheng
Latency tolerant system for executing video processing operations

Patent number: 8687008

Abstract: A latency tolerant system for executing video processing operations. The system includes a host interface for implementing communication between the video processor and a host CPU, a scalar execution unit coupled to the host interface and configured to execute scalar video processing operations, and a vector execution unit coupled to the host interface and configured to execute vector video processing operations. A command FIFO is included for enabling the vector execution unit to operate on a demand driven basis by accessing the memory command FIFO. A memory interface is included for implementing communication between the video processor and a frame buffer memory. A DMA engine is built into the memory interface for implementing DMA transfers between a plurality of different memory locations and for loading the command FIFO with data and instructions for the vector execution unit.

Type: Grant

Filed: November 4, 2005

Date of Patent: April 1, 2014

Assignee: NVIDIA Corporation

Inventors: Ashish Karandikar, Shirish Gadre, Stephen D. Lew
Multi context execution on a video processor

Patent number: 8683184

Abstract: A method for implementing multi context execution on a video processor having a scalar execution unit and a vector execution unit. The method includes allocating a first task to a vector execution unit and allocating a second task to the vector execution unit. The first task is from a first context in the second task is from a second context. The method further includes interleaving a plurality of work packages comprising the first task and the second task to generate a combined work package stream. The combined work package stream is subsequently executed on the vector execution unit.

Type: Grant

Filed: November 4, 2005

Date of Patent: March 25, 2014

Assignee: Nvidia Corporation

Inventors: Stephen D. Lew, Ashish Karandikar, Shirish Gadre, Franciscus W. Sijstermans
State machine control for a pipelined L2 cache to implement memory transfers for a video processor

Patent number: 8493397

Abstract: A method for using a state machine to control a pipelined L2 cache to implement memory transfers for a video processor. The method includes accessing a queue of read requests from a video processor, and tracking each of a plurality of cache lines stored within the cache using a least recently used variable. For each a cache line hit out of the plurality of cache lines and corresponding to one of the read requests, the least recently used variable is adjusted for a remainder of the plurality of cache lines. A replacement cache line is determined by examining the least recently used variables for each of the plurality of cache lines. For each cache line miss, a cache line slot corresponding to the replacement cache line is allocated to store a new cache line responsive to the cache line miss.

Type: Grant

Filed: November 4, 2005

Date of Patent: July 23, 2013

Assignee: Nvidia Corporation

Inventors: Zhiqiang Jonathan Su, Ashish Karandikar
Multidimensional datapath processing in a video processor

Patent number: 8493396

Abstract: A multidimensional datapath processing system for a video processor for executing video processing operations. The video processor includes a scalar execution unit configured to execute scalar video processing operations and a vector execution unit configured to execute vector video processing operations. A data store memory is included for storing data for the vector execution unit. The data store memory includes a plurality of tiles having symmetrical bank data structures arranged in an array. The bank data structures are configured to support accesses to different tiles of each bank.

Type: Grant

Filed: November 4, 2005

Date of Patent: July 23, 2013

Assignee: Nvidia Corporation

Inventors: Ashish Karandikar, Shirish Gadre, Stephen D. Lew, Christopher T. Cheng
Increasing memory capacity of a frame buffer via a memory splitter chip

Patent number: 8489839

Abstract: The memory splitter chip couples multiple DRAM units to the PPU, thereby expanding the memory capacity available to the PPU for storing data and increasing the overall performance of the graphics processing system. The memory splitter chip includes logic for managing the transmission of data between the PPU and the DRAM units when the transmission frequencies and the burst lengths of the PPU interface and the DRAM interfaces differ. Specifically, the memory splitter chip implements an overlapping transmission mode, a pairing transmission mode or a combination of the two modes when the transmission frequencies or the burst lengths differ.

Type: Grant

Filed: December 16, 2009

Date of Patent: July 16, 2013

Assignee: Nvidia Corporation

Inventors: Ashish Karandikar, Kaustubh Sanghani, Jonah M. Alben, Shane Keil
Context switching on a video processor having a scalar execution unit and a vector execution unit

Patent number: 8424012

Abstract: A method for context switching on a video processor having a scalar execution unit and a vector execution unit. The method includes executing a first task and a second task on a vector execution unit. The first task in the second task can be from different respective contexts. The first task and the second task are each allocated to the vector execution unit from a scalar execution unit. The first task and the second task each comprise a plurality of work packages. In response to a switch notification, a work package boundary of the first task is designated. A context switch from the first task to the second task is then executed on the work package boundary.

Type: Grant

Filed: November 4, 2005

Date of Patent: April 16, 2013

Assignee: Nvidia Corporation

Inventors: Ashish Karandikar, Shirish Gadre, Frederick R. Gruner, Franciscus W. Sijstermans
Stream processing in a video processor

Patent number: 8416251

Abstract: A stream based memory access system for a video processor for executing video processing operations. The video processor includes a scalar execution unit configured to execute scalar video processing operations and a vector execution unit configured to execute vector video processing operations. A frame buffer memory is included for storing data for the scalar execution unit and the vector execution unit. A memory interface is included for establishing communication between the scalar execution unit and the vector execution unit and the frame buffer memory. The frame buffer memory comprises a plurality of tiles. The memory interface implements a first sequential access of tiles and implements a second stream comprising a second sequential access of tiles for the vector execution unit or the scalar execution unit.

Type: Grant

Filed: November 4, 2005

Date of Patent: April 9, 2013

Assignee: Nvidia Corporation

Inventors: Shirish Gadre, Ashish Karandikar, Stephen D. Lew
Dynamic detection of block boundaries on memory reads

Patent number: 7380084

Abstract: In some embodiments a processing device is disclosed. The processing device is configured to read data from a memory device. The processing device transmits a read request to the memory device for a data block at a certain address and reads the data block for the certain address from the memory device. The processing device dynamically detects boundaries for the data block read by detecting an alignment pattern in data received from the memory device. Other embodiments are otherwise disclosed herein.

Type: Grant

Filed: September 30, 2005

Date of Patent: May 27, 2008

Assignee: Intel Corporation

Inventors: Mark Heuchert, Anujan Varma, Ashish Karandikar
Dynamic detection of block boundaries on memory reads

Publication number: 20070079104

Abstract: In some embodiments a processing device is disclosed. The processing device is configured to read data from a memory device. The processing device transmits a read request to the memory device for a data block at a certain address and reads the data block for the certain address from the memory device. The processing device dynamically detects boundaries for the data block read by detecting an alignment pattern in data received from the memory device. Other embodiments are otherwise disclosed herein.

Type: Application

Filed: September 30, 2005

Publication date: April 5, 2007

Inventors: Mark Heuchert, Anujan Varma, Ashish Karandikar
Multidimensional datapath processing in a video processor

Publication number: 20060176308

Abstract: A multidimensional datapath processing system for a video processor for executing video processing operations. The video processor includes a scalar execution unit configured to execute scalar video processing operations and a vector execution unit configured to execute vector video processing operations. A data store memory is included for storing data for the vector execution unit. The data store memory includes a plurality of tiles having symmetrical bank data structures arranged in an array. The bank data structures are configured to support accesses to different tiles of each bank.

Type: Application

Filed: November 4, 2005

Publication date: August 10, 2006

Inventors: Ashish Karandikar, Shirish Gadre, Stephen Lew, Christopher Cheng
Video processor having scalar and vector components

Publication number: 20060176309

Abstract: A video processor for executing video processing operations. The video processor includes a host interface for implementing communication between the video processor and a host CPU. A memory interface is included for implementing communication between the video processor and a frame buffer memory. A scalar execution unit is coupled to the host interface and the memory interface and is configured to execute scalar video processing operations. A vector execution unit is coupled to the host interface and the memory interface and is configured to execute vector video processing operations.

Type: Application

Filed: November 4, 2005

Publication date: August 10, 2006

Inventors: Shirish Gadre, Ashish Karandikar, Stephen Lew, Christopher Cheng
Stream processing in a video processor

Publication number: 20060152520

Abstract: A stream based memory access system for a video processor for executing video processing operations. The video processor includes a scalar execution unit configured to execute scalar video processing operations and a vector execution unit configured to execute vector video processing operations. A frame buffer memory is included for storing data for the scalar execution unit and the vector execution unit. A memory interface is included for establishing communication between the scalar execution unit and the vector execution unit and the frame buffer memory. The frame buffer memory comprises a plurality of tiles. The memory interface implements a first sequential access of tiles and implements a second stream comprising a second sequential access of tiles for the vector execution unit or the scalar execution unit.

Type: Application

Filed: November 4, 2005

Publication date: July 13, 2006

Inventors: Shirish Gadre, Ashish Karandikar, Stephen Lew
Latency tolerant system for executing video processing operations

Publication number: 20060103659

Abstract: A latency tolerant system for executing video processing operations. The system includes a host interface for implementing communication between the video processor and a host CPU, a scalar execution unit coupled to the host interface and configured to execute scalar video processing operations, and a vector execution unit coupled to the host interface and configured to execute vector video processing operations. A command FIFO is included for enabling the vector execution unit to operate on a demand driven basis by accessing the memory command FIFO. A memory interface is included for implementing communication between the video processor and a frame buffer memory. A DMA engine is built into the memory interface for implementing DMA transfers between a plurality of different memory locations and for loading the command FIFO with data and instructions for the vector execution unit.

Type: Application

Filed: November 4, 2005

Publication date: May 18, 2006

Inventors: Ashish Karandikar, Shirish Gadre, Stephen Lew