Patents by Inventor Satyaki Koneru

Satyaki Koneru has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Accelerated Operation of a Graph Streaming Processor

Publication number: 20190171497

Abstract: Methods, systems and apparatuses for graph processing are disclosed. One graph streaming processor includes a thread manager, wherein the thread manager is operative to dispatch operation of the plurality of threads of a plurality of thread processors before dependencies of the dependent threads have been resolved, maintain a scorecard of operation of the plurality of threads of the plurality of thread processors, and provide an indication to at least one of the plurality of thread processors when a dependency between the at least one of the plurality of threads that a request has or has not been satisfied. Further, a producer thread provides a response to the dependency when the dependency has been satisfied, and each of the plurality of thread processors is operative to provide processing updates to the thread manager, and provide queries to the thread manager upon reaching a dependency.

Type: Application

Filed: February 8, 2019

Publication date: June 6, 2019

Applicant: ThinCI, Inc.

Inventors: Lokesh Agarwal, Sarvendra Govindammagari, Venkata Ganapathi Puppala, Satyaki Koneru
Node topology employing recirculating ring command and data buffers for executing thread scheduling

Patent number: 10311542

Abstract: The claimed invention discloses system comprising a plurality of logical nodes comprised in a single or plurality of stages, with defined properties and resources associated with each node, for reducing compute resources, said system further comprising: at least a recirculating ring buffer holding only any one of a control information, input, and, or out data necessary to stream a temporary data between node and, or nodes in an execution graph, thereby reducing size of said recirculating ring buffer; said recirculating ring buffer being sufficiently reduced in size to reside in an on-chip cache, such that any one of the control information, input, and, or out data between node and, or nodes need not be stored in memory; wherein the control information further comprises a command related to invalidating any one of the input and, or out data held in a recirculating ring data buffer, clearing the buffer of tasked data; and wherein a producer is stalled from writing any more control information into a recirculati

Type: Grant

Filed: March 6, 2017

Date of Patent: June 4, 2019

Assignee: THINCI, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
NODE TOPOLOGY EMPLOYING RECIRCULATING RING COMMAND AND DATA BUFFERS FOR EXECUTING THREAD SCHEDULING

Publication number: 20180253890

Abstract: The claimed invention discloses system comprising a plurality of logical nodes comprised in a single or plurality of stages, with defined properties and resources associated with each node, for reducing compute resources, said system further comprising: at least a recirculating ring buffer holding only any one of a control information, input, and, or out data necessary to stream a temporary data between node and, or nodes in an execution graph, thereby reducing size of said recirculating ring buffer; said recirculating ring buffer being sufficiently reduced in size to reside in an on-chip cache, such that any one of the control information, input, and, or out data between node and, or nodes need not be stored in memory; wherein the control information further comprises a command related to invalidating any one of the input and, or out data held in a recirculating ring data buffer, clearing the buffer of tasked data; and wherein a producer is stalled from writing any more control information into a recirculati

Type: Application

Filed: March 6, 2017

Publication date: September 6, 2018

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Processing a Plurality of Threads of a Single Instruction Multiple Data Group

Publication number: 20170365237

Abstract: Methods, systems and apparatuses for processing a plurality of threads of a single-instruction multiple data (SIMD) group are disclosed. One method includes initializing a current instruction pointer of the SIMD group, initializing a thread instruction pointer for each of the plurality of threads of the SIMD group including setting a flag for each of the plurality of threads, determining whether a current instruction of the processing includes a conditional branch, resetting a flag of each thread of the plurality of threads that fails a condition of the conditional branch, and setting the thread instruction pointer for each of the plurality of threads that fails the condition of the conditional branch to a jump instruction pointer, and incrementing the current instruction pointer and each thread instruction pointer of the threads that do not fail, if at least one of the threads do not fail the condition.

Type: Application

Filed: August 17, 2017

Publication date: December 21, 2017

Applicant: ThinCI, Inc.

Inventors: Satyaki Koneru, Ke Yin
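
The conditional-branch step described in the abstract above can be sketched roughly as follows. This is only an illustrative reading of the abstract, not the claimed method: the function name `step_conditional_branch`, the list-based representation of flags and instruction pointers, and the choice to leave the current instruction pointer unchanged when every thread fails are all assumptions.

```python
def step_conditional_branch(current_ip, thread_ips, flags, passes, jump_ip):
    """Apply one conditional branch to a SIMD group (hypothetical sketch).

    current_ip : instruction pointer shared by the SIMD group
    thread_ips : per-thread instruction pointers (mutated in place)
    flags      : per-thread active flags (mutated in place)
    passes     : per-thread result of the branch condition
    jump_ip    : target taken by threads that fail the condition
    """
    for t in range(len(thread_ips)):
        if flags[t] and not passes[t]:
            # Failing threads are flagged off and parked at the jump target.
            flags[t] = False
            thread_ips[t] = jump_ip

    if any(flags):
        # At least one thread did not fail: advance the group pointer and
        # the pointer of every thread that is still flagged.
        current_ip += 1
        for t in range(len(thread_ips)):
            if flags[t]:
                thread_ips[t] = current_ip
    # Assumption: the abstract does not say what happens when all threads
    # fail, so this sketch leaves current_ip unchanged in that case.
    return current_ip


ips = [4, 4, 4, 4]
active = [True, True, True, True]
new_ip = step_conditional_branch(4, ips, active, [True, False, True, False], 9)
print(new_ip, ips, active)
```

With the inputs shown, threads 1 and 3 fail the condition and are parked at instruction 9, while threads 0 and 2 advance with the group to instruction 5.
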
SELECTING DATA OF A SERVER SYSTEM FOR TRANSMISSION

Publication number: 20170193630

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes reading data from memory of the server system, checking if the data is being read for the first time, checking if the data was written by a processor of the server system during processing, comprising checking if the data is available on a client system or present in a transmit buffer, placing the data in the transmit buffer if the data is being read for the first time and was not written by the processor during the processing as determined by the checking if the data was written by the processor of the server system during processing, wherein if the data is being read for the first time and was written by the processor of the server system during processing the data is not placed in the transmit buffer.

Type: Application

Filed: March 22, 2017

Publication date: July 6, 2017

Applicant: ThinCI, Inc.

Inventors: Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Selecting data of a server system for transmission

Patent number: 9640150

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes reading data from memory of the server system, checking if the data is being read for the first time, checking if the data was written by a processor of the server system during processing, comprising checking if the data is available on a client system or present in a transmit buffer, placing the data in the transmit buffer if the data is being read for the first time and was not written by the processor during the processing as determined by the checking if the data was written by the processor of the server system during processing, wherein if the data is being read for the first time and was written by the processor of the server system during processing the data is not placed in the transmit buffer.

Type: Grant

Filed: May 19, 2016

Date of Patent: May 2, 2017

Assignee: ThinCI, Inc.

Inventors: Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Mechanism for minimal computation and power consumption for rendering synthetic 3D images, containing pixel overdraw and dynamically generated intermediate images

Patent number: 9589388

Abstract: Embodiments disclosed include a mechanism in a system and method for significantly reducing power consumption by reducing computation and bandwidth. This mechanism is particularly applicable for modern 3D synthetic images which contain high pixel overdraw and dynamically generated intermediates images. Only blocks of computation which contribute to the final image are performed. This is accomplished by rendering in reverse order and by performing multiple visibility sort in a streaming fashion through the pipeline. Rendering of dynamically generated intermediate images is performed sparsely by projecting texture coordinates from a current image back into one or more dependent images in a recursive manner. The newly computed pixel values are then filtered and control is returned to the sampling shader of the current image. When only visible pixels are projected optimal computation is performed. Several implementations are presented with increasing efficiency.

Type: Grant

Filed: July 9, 2014

Date of Patent: March 7, 2017

Assignee: ThinCI, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
SELECTING DATA OF A SERVER SYSTEM FOR TRANSMISSION

Publication number: 20160267889

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes reading data from memory of the server system, checking if the data is being read for the first time, checking if the data was written by a processor of the server system during processing, comprising checking if the data is available on a client system or present in a transmit buffer, placing the data in the transmit buffer if the data is being read for the first time and was not written by the processor during the processing as determined by the checking if the data was written by the processor of the server system during processing, wherein if the data is being read for the first time and was written by the processor of the server system during processing the data is not placed in the transmit buffer.

Type: Application

Filed: May 19, 2016

Publication date: September 15, 2016

Applicant: ThinCI, Inc.

Inventors: Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Processing of graphics data of a server system for transmission including multiple rendering passes

Patent number: 9373152

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes a plurality of graphic render passes, wherein one or more of the graphics render passes includes reading data from graphics memory of the server system. The data read from the graphics memory is placed in a transmit buffer if the data is being read for the first time, and was not written by a processor of the server system. One system includes a server system including graphics memory, a frame buffer and a processor. The server system is operable to read data from the graphics memory. The server system is operable to place the data in a transmit buffer if the data is being read for the first time, and was not written by the processor during rendering.

Type: Grant

Filed: May 25, 2014

Date of Patent: June 21, 2016

Assignee: ThinCI, Inc.

Inventors: Satyaki Koneru, Ke Yin, Dinakar C. Munagala
PROCESSING OF GRAPHICS DATA OF A SERVER SYSTEM FOR TRANSMISSION INCLUDING MULTIPLE RENDERING PASSES

Publication number: 20140253563

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes a plurality of graphic render passes, wherein one or more of the graphics render passes includes reading data from graphics memory of the server system. The data read from the graphics memory is placed in a transmit buffer if the data is being read for the first time, and was not written by a processor of the server system. One system includes a server system including graphics memory, a frame buffer and a processor. The server system is operable to read data from the graphics memory. The server system is operable to place the data in a transmit buffer if the data is being read for the first time, and was not written by the processor during rendering.

Type: Application

Filed: May 25, 2014

Publication date: September 11, 2014

Applicant: THINCI, INC.

Inventors: Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Configurable motion estimation

Patent number: 8804757

Abstract: In some embodiments, a motion estimation method and engine are provided. A motion estimation engine may, for example, compare source blocks from a source frame against reference blocks in a reference frame to find a suitable match for the source block. According to some embodiments, groups of reference blocks are compared at the same time against the source block, with search units within each group being selected for comparison simultaneously using both a fixed path navigation and an adaptive path navigation.

Type: Grant

Filed: December 26, 2007

Date of Patent: August 12, 2014

Assignee: Intel Corporation

Inventors: Ning Lu, Hong Jiang, Satyaki Koneru
Processing of graphics data of a server system for transmission

Patent number: 8754900

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes reading data from graphics memory of the server system. The data read from the graphics memory is placed in a transmit buffer if the data is being read for the first time, and was not written by a processor of the server system. One system includes a server system including graphics memory, a frame buffer and a processor. The server system is operable to read data from the graphics memory. The server system is operable to place the data in a transmit buffer if the data is being read for the first time, and was not written by the processor during rendering.

Type: Grant

Filed: June 16, 2011

Date of Patent: June 17, 2014

Inventors: Satyaki Koneru, Ke Yin, Dinakar Munagala
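
The selection rule stated in the abstract above (place data in the transmit buffer only if it is being read for the first time and was not written by the server's processor) might be sketched as below. The function name `select_for_transmit`, the use of integer addresses to identify data, and the set-based bookkeeping are assumptions made for illustration; the patent itself may track this state quite differently.

```python
def select_for_transmit(reads, written_by_processor):
    """Return the addresses that would be placed in the transmit buffer.

    reads                : addresses in the order they are read from memory
    written_by_processor : set of addresses the server processor wrote
                           during rendering (hypothetical representation)
    """
    seen = set()           # addresses that have already been read once
    transmit_buffer = []
    for addr in reads:
        first_read = addr not in seen
        seen.add(addr)
        # Only first-time reads of data the processor did not produce
        # itself need to be sent.
        if first_read and addr not in written_by_processor:
            transmit_buffer.append(addr)
    return transmit_buffer


print(select_for_transmit([0x10, 0x20, 0x10, 0x30], {0x20}))
```

Here address 0x20 is skipped because the processor wrote it, and the second read of 0x10 is skipped because it is not a first read.
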
Processing of Graphics Data of a Server System for Transmission

Publication number: 20110310105

Abstract: Methods, systems and apparatuses for selecting graphics data of a server system for transmission are disclosed. One method includes reading data from graphics memory of the server system. The data read from the graphics memory is placed in a transmit buffer if the data is being read for the first time, and was not written by a processor of the server system. One system includes a server system including graphics memory, a frame buffer and a processor. The server system is operable to read data from the graphics memory. The server system is operable to place the data in a transmit buffer if the data is being read for the first time, and was not written by the processor during rendering.

Type: Application

Filed: June 16, 2011

Publication date: December 22, 2011

Applicant: THINCI INC.

Inventors: Satyaki Koneru, Ke Yin, Dinakar Munagala
CONFIGURABLE MOTION ESTIMATION

Publication number: 20090168881

Abstract: In some embodiments, a motion estimation method and engine are provided.

Type: Application

Filed: December 26, 2007

Publication date: July 2, 2009

Inventors: Ning Lu, Hong Jiang, Satyaki Koneru
Method and apparatus for reading texture data from a cache

Publication number: 20050225557

Abstract: A texture data reading apparatus includes a cache memory including a plurality of read ports and a plurality of regions to store pixel texture data. An address comparator includes a plurality of input ports to receive incoming pixels, wherein the address comparator compares the memory addresses associated with the incoming pixels to determine which regions of cache memory are accessed. A cache lookup device accesses new texture data from the cache memory for the incoming pixels in the same clock cycle in response to the number of memory regions accessed being less than or equal to the number of cache memory read ports.

Type: Application

Filed: June 7, 2005

Publication date: October 13, 2005

Inventors: Satyaki Koneru, Steven Spangler, Val Cook
Method and apparatus for reading texture data from a cache

Patent number: 6924812

Abstract: A texture data reading apparatus includes a cache memory including a plurality of read ports and a plurality of regions to store pixel texture data. An address comparator includes a plurality of input ports to receive incoming pixels, wherein the address comparator compares the memory addresses associated with the incoming pixels to determine which regions of cache memory are accessed. A cache lookup device accesses new texture data from the cache memory for the incoming pixels in the same clock cycle in response to the number of memory regions accessed being less than or equal to the number of cache memory read ports.

Type: Grant

Filed: December 24, 2002

Date of Patent: August 2, 2005

Assignee: Intel Corporation

Inventors: Satyaki Koneru, Steven J. Spangler, Val G. Cook
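
The same-cycle condition in the abstract above (the number of cache regions touched by the incoming pixels must not exceed the number of read ports) can be illustrated with a short sketch. The mapping from address to region by integer division, the `region_size` parameter, and the function name are assumptions; the abstract does not specify how regions are derived from addresses.

```python
def can_serve_in_one_cycle(addresses, region_size, num_read_ports):
    """Check whether all incoming pixel addresses can be read together.

    addresses      : memory addresses associated with the incoming pixels
    region_size    : bytes per cache region (assumed uniform)
    num_read_ports : number of read ports on the cache memory
    """
    # Distinct regions touched by this group of pixels.
    regions = {addr // region_size for addr in addresses}
    return len(regions) <= num_read_ports


print(can_serve_in_one_cycle([0, 4, 64, 68], region_size=64, num_read_ports=2))
print(can_serve_in_one_cycle([0, 64, 128], region_size=64, num_read_ports=2))
```

In the first call the four addresses fall into two regions, so two read ports suffice; in the second, three regions are touched and the lookup would need more than one cycle.
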
Method and apparatus for reading texture data from a cache

Publication number: 20040119719

Abstract: A texture data reading apparatus includes a cache memory including a plurality of read ports and a plurality of regions to store pixel texture data. An address comparator includes a plurality of input ports to receive incoming pixels, wherein the address comparator compares the memory addresses associated with the incoming pixels to determine which regions of cache memory are accessed. A cache lookup device accesses new texture data from the cache memory for the incoming pixels in the same clock cycle in response to the number of memory regions accessed being less than or equal to the number of cache memory read ports.

Type: Application

Filed: December 24, 2002

Publication date: June 24, 2004

Inventors: Satyaki Koneru, Steven J. Spangler, Val G. Cook
Method and apparatus for determining bins to be updated for polygons, including lines

Patent number: 6693637

Abstract: Embodiments of the present invention provide a method and apparatus for determining the zones that a polygon overlaps to minimize the bins that are updated during binning and reduce the number of polygons to be set up in the render phase. The polygons supported include, but are not limited to, points, lines, triangles and rectangles. Each zone has associated with it a batch buffer to which data is written back for the polygons that overlap that zone. This data includes the setting up of state and the indices for those polygons. Since the zones that a polygon overlaps is precisely determined, the data written back is minimized and the polygons to be set up during the render phase is optimally reduced.

Type: Grant

Filed: December 31, 2001

Date of Patent: February 17, 2004

Assignee: Intel Corporation

Inventors: Satyaki Koneru, Sajjad A. Zaidi
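
To illustrate why determining overlapped zones precisely can reduce the bins updated, the sketch below finds the zones a line segment actually crosses, rather than every zone in its bounding box. It uses a Liang-Barsky style segment/rectangle test, which is a substituted textbook technique and not necessarily the method the patent claims; the function names, the square `zone_size`, and the closed-rectangle treatment of zone edges are assumptions.

```python
import math


def segment_hits_rect(x0, y0, x1, y1, xmin, ymin, xmax, ymax):
    """Liang-Barsky test: does the segment touch the closed rectangle?"""
    dx, dy = x1 - x0, y1 - y0
    t0, t1 = 0.0, 1.0
    for p, q in ((-dx, x0 - xmin), (dx, xmax - x0),
                 (-dy, y0 - ymin), (dy, ymax - y0)):
        if p == 0:
            if q < 0:
                return False  # parallel to this edge and outside it
        else:
            t = q / p
            if p < 0:
                if t > t1:
                    return False
                t0 = max(t0, t)
            else:
                if t < t0:
                    return False
                t1 = min(t1, t)
    return True


def zones_for_line(x0, y0, x1, y1, zone_size):
    """Return (zx, zy) indices of the zones the segment overlaps."""
    zx_lo = math.floor(min(x0, x1) / zone_size)
    zx_hi = math.floor(max(x0, x1) / zone_size)
    zy_lo = math.floor(min(y0, y1) / zone_size)
    zy_hi = math.floor(max(y0, y1) / zone_size)
    hits = []
    # Only zones inside the bounding box can overlap; test each exactly.
    for zy in range(zy_lo, zy_hi + 1):
        for zx in range(zx_lo, zx_hi + 1):
            if segment_hits_rect(x0, y0, x1, y1,
                                 zx * zone_size, zy * zone_size,
                                 (zx + 1) * zone_size, (zy + 1) * zone_size):
                hits.append((zx, zy))
    return hits


print(zones_for_line(0.5, 0.5, 2.5, 1.5, zone_size=1.0))
```

For this segment the bounding box covers six zones, but only four of them are crossed, so only four bins would receive the line's data under this sketch.
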
Method and apparatus for determining bins to be updated for polygons, including lines

Publication number: 20030122819

Abstract: Embodiments of the present invention provide a method and apparatus for determining the zones that a polygon overlaps to minimize the bins that are updated during binning and reduce the number of polygons to be set up in the render phase. The polygons supported include, but are not limited to, points, lines, triangles and rectangles. Each zone has associated with it a batch buffer to which data is written back for the polygons that overlap that zone. This data includes the setting up of state and the indices for those polygons. Since the zones that a polygon overlaps is precisely determined, the data written back is minimized and the polygons to be set up during the render phase is optimally reduced.

Type: Application

Filed: December 31, 2001

Publication date: July 3, 2003

Inventors: Satyaki Koneru, Sajjad A. Zaidi
Method and apparatus for determining bins to be updated for polygons, including triangles

Publication number: 20030122850

Abstract: Embodiments of the present invention provide a method and apparatus for determining the zones that a polygon overlaps to minimize the bins that are updated during binning and reduce the number of polygons to be set up in the render phase. The polygons supported include, but are not limited to, points, lines, triangles and rectangles. Each zone has associated with it a batch buffer to which data is written back for the polygons that overlap that zone. This data includes the setting up of state and the indices for those polygons. Since the zones that a polygon overlaps is precisely determined, the data written back is minimized and the polygons to be set up during the render phase is optimally reduced.

Type: Application

Filed: December 28, 2001

Publication date: July 3, 2003

Inventors: Satyaki Koneru, Sajjad A. Zaidi
