Patents by Inventor Val G. Cook

Val G. Cook has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Unsupervised Data Drift Detection for Classification Neural Networks

Publication number: 20240037923

Abstract: Methods, systems, and apparatuses for unsupervised data drift detection for classification neural networks are disclosed.

Type: Application

Filed: September 28, 2022

Publication date: February 1, 2024

Applicant: Blaize, Inc.

Inventors: Adam P. Geringer, Val G. Cook
GRAPH STREAMING NEURAL NETWORK PROCESSING SYSTEM AND METHOD THEREOF

Publication number: 20230418666

Abstract: Disclosed herein is a graph streaming neural network processing system comprising a first processor array, a second processor, and a thread scheduler. The thread scheduler dispatches a thread of a first node to the first processor array or the second processor, wherein the thread is executed to generate output data comprising a data unit stored in a private data buffer of the second processor. The thread scheduler determines that the data unit is sufficient for executing a thread of a second node. The second node is dependent on the output data generated by execution of a plurality of threads of the first node. Upon determining that the data unit is sufficient, the thread scheduler dispatches the thread of the second node. The thread scheduler determines to dispatch a subsequent thread of the first node for execution when a predefined threshold buffer size is available on the private data buffer.

Type: Application

Filed: June 16, 2023

Publication date: December 28, 2023

Applicant: Blaize Inc.

Inventors: Venkata Ganapathi Puppala, Val G. Cook, Srinivasulu Nagisetty
Configurable scheduler for graph processing on multi-processor computing systems

Patent number: 11755368

Abstract: Systems and methods are disclosures for scheduling code in a multiprocessor system. Code is portioned into code blocks by a compiler. The compiler schedules execution of code blocks in nodes. The nodes are connected in a directed acyclical graph with a top node, terminal node and a plurality of intermediate nodes. Execution of the top node is initiated by the compiler. After executing at least one instance of the top node, an instruction in the code block indicates to the scheduler to initiate at least one intermediary node. The scheduler schedules a thread for execution of the intermediary node. The data for the nodes resides in a plurality of data buffers; the index to the data buffer is stored in a command buffer.

Type: Grant

Filed: August 8, 2021

Date of Patent: September 12, 2023

Assignee: Blaize , Inc.

Inventors: Satyaki Koneru, Val G. Cook, Ke Yin
Node topology employing command and data buffers for executing thread scheduling

Patent number: 11593911

Abstract: The described embodiments include systems, methods, and apparatuses for increased efficiency processing flow. One method includes a plurality of stages configured to process an execution graph that includes a plurality of logical nodes with defined properties and resources associated with each logical node of the plurality of logical nodes, a recirculating ring buffer, wherein the recirculating ring buffer is configured to holding only any one of a control information, input, and, or out data necessary to stream a temporary data between each logical node of the execution graph, and a data producer, wherein the data producer is configured to stall from writing control information into a command buffer upon the command buffer being full, preventing command buffer over-writing.

Type: Grant

Filed: August 9, 2021

Date of Patent: February 28, 2023

Assignee: Blaze, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Configurable scheduler in a graph streaming processing system

Patent number: 11416282

Abstract: Systems, apparatuses and methods are disclosed for scheduling threads comprising of code blocks in a graph streaming processor (GSP) system. One system includes a scheduler for scheduling plurality of threads, the plurality of threads includes a set of instructions operating on the graph streaming processors of GSP system. The scheduler comprises a plurality of stages where each stage is coupled to an input command buffer and an output command buffer. A portion of the scheduler is implemented in hardware and comprises of a command parser operative to interpret commands within a corresponding input command buffer, a thread generator coupled to the command parser operate to generate the plurality of threads, and a thread scheduler coupled to the thread generator for dispatching the plurality of threads for operating on the plurality of graph streaming processors.

Type: Grant

Filed: April 14, 2019

Date of Patent: August 16, 2022

Assignee: Blaize, Inc.

Inventors: Satyaki Koneru, Val G. Cook, Ke Yin
CONFIGURABLE SCHEDULER FOR GRAPH PROCESSING ON MULTI-PROCESSOR COMPUTING SYSTEMS

Publication number: 20210406069

Abstract: Systems and methods are disclosures for scheduling code in a multiprocessor system. Code is portioned into code blocks by a compiler. The compiler schedules execution of code blocks in nodes. The nodes are connected in a directed acyclical graph with a top node, terminal node and a plurality of intermediate nodes. Execution of the top node is initiated by the compiler. After executing at least one instance of the top node, an instruction in the code block indicates to the scheduler to initiate at least one intermediary node. The scheduler schedules a thread for execution of the intermediary node. The data for the nodes resides in a plurality of data buffers; the index to the data buffer is stored in a command buffer.

Type: Application

Filed: August 8, 2021

Publication date: December 30, 2021

Applicant: Blaize, Inc.

Inventors: Satyaki Koneru, Val G. Cook, Ke Yin
NODE TOPOLOGY EMPLOYING COMMAND AND DATA BUFFERS FOR EXECUTING THREAD SCHEDULING

Publication number: 20210374901

Abstract: The described embodiments include systems, methods, and apparatuses for increased efficiency processing flow. One method includes a plurality of stages configured to process an execution graph that includes a plurality of logical nodes with defined properties and resources associated with each logical node of the plurality of logical nodes, a recirculating ring buffer, wherein the recirculating ring buffer is configured to holding only any one of a control information, input, and, or out data necessary to stream a temporary data between each logical node of the execution graph, and a data producer, wherein the data producer is configured to stall from writing control information into a command buffer upon the command buffer being full, preventing command buffer over-writing.

Type: Application

Filed: August 9, 2021

Publication date: December 2, 2021

Applicant: Blaize, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Node topology employing command and data buffers for executing thread scheduling

Patent number: 11151684

Abstract: The described embodiments include systems, methods, and apparatuses for increased efficiency processing flow. One method includes a plurality of stages configured to process an execution graph that includes a plurality of logical nodes with defined properties and resources associated with each logical node of the plurality of logical nodes, a recirculating ring buffer, wherein the recirculating ring buffer is configured to holding only any one of a control information, input, and, or out data necessary to stream a temporary data between each logical node of the execution graph, and a data producer, wherein the data producer is configured to stall from writing control information into a command buffer upon the command buffer being full, preventing command buffer over-writing.

Type: Grant

Filed: June 22, 2020

Date of Patent: October 19, 2021

Assignee: Blaize, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Configurable scheduler in a graph streaming processing system

Patent number: 11126462

Abstract: Systems and methods are disclosures for scheduling code in a multiprocessor system. Code is portioned into code blocks by a compiler. The compiler schedules execution of code blocks in nodes. The nodes are connected in a directed acyclical graph with a top node, terminal node and a plurality of intermediate nodes. Execution of the top node is initiated by the compiler. After executing at least one instance of the top node, an instruction in the code block indicates to the scheduler to initiate at least one intermediary node. The scheduler schedules a thread for execution of the intermediary node. The data for the nodes resides in a plurality of data buffers; the index to the data buffer is stored in a command buffer.

Type: Grant

Filed: July 8, 2019

Date of Patent: September 21, 2021

Assignee: Blaize, Inc.

Inventors: Satyaki Koneru, Val G. Cook, Ke Yin
Node Topology Employing Command and Data Buffers for Executing Thread Scheduling

Publication number: 20200320663

Abstract: The described embodiments include systems, methods, and apparatuses for increased efficiency processing flow. One method includes a plurality of stages configured to process an execution graph that includes a plurality of logical nodes with defined properties and resources associated with each logical node of the plurality of logical nodes, a recirculating ring buffer, wherein the recirculating ring buffer is configured to holding only any one of a control information, input, and, or out data necessary to stream a temporary data between each logical node of the execution graph, and a data producer, wherein the data producer is configured to stall from writing control information into a command buffer upon the command buffer being full, preventing command buffer over-writing.

Type: Application

Filed: June 22, 2020

Publication date: October 8, 2020

Applicant: Blaize, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Node topology employing recirculating ring command and data buffers for executing thread scheduling

Patent number: 10740868

Abstract: The described embodiments include systems, methods, and apparatuses for increased efficiency processing flow. One method includes a plurality of stages configured to process an execution graph that includes a plurality of logical nodes with defined properties and resources associated with each logical node of the plurality of logical nodes, a recirculating ring buffer, wherein the recirculating ring buffer is configured to holding only any one of a control information, input, and, or out data necessary to stream a temporary data between each logical node of the execution graph, and a data producer, wherein the data producer is configured to stall from writing control information into a command buffer upon the command buffer being full, preventing command buffer over-writing.

Type: Grant

Filed: November 28, 2019

Date of Patent: August 11, 2020

Assignee: Blaize, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Node Topology Employing Recirculating Ring Command and Data Buffers for Executing Thread Scheduling

Publication number: 20200098084

Abstract: The described embodiments include systems, methods, and apparatuses for increased efficiency processing flow. One method includes a plurality of stages configured to process an execution graph that includes a plurality of logical nodes with defined properties and resources associated with each logical node of the plurality of logical nodes, a recirculating ring buffer, wherein the recirculating ring buffer is configured to holding only any one of a control information, input, and, or out data necessary to stream a temporary data between each logical node of the execution graph, and a data producer, wherein the data producer is configured to stall from writing control information into a command buffer upon the command buffer being full, preventing command buffer over-writing.

Type: Application

Filed: November 28, 2019

Publication date: March 26, 2020

Applicant: ThinCI, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Node topology employing recirculating ring command and data buffers for executing thread scheduling

Patent number: 10540740

Abstract: The claimed invention discloses system comprising a plurality of logical nodes comprised in a single or plurality of stages, with defined properties and resources associated with each node, for reducing compute resources, said system further comprising: at least a recirculating ring buffer holding only any one of a control information, input, and, or out data necessary to stream a temporary data between node and, or nodes in an execution graph, thereby reducing size of said recirculating ring buffer; said recirculating ring buffer being sufficiently reduced in size to reside in an on-chip cache, such that any one of the control information, input, and, or out data between node and, or nodes need not be stored in memory; wherein the control information further comprises a command related to invalidating any one of the input and, or out data held in a recirculating ring data buffer, clearing the buffer of tasked data; and wherein a producer is stalled from writing any more control information into a recirculati

Type: Grant

Filed: May 18, 2019

Date of Patent: January 21, 2020

Assignee: Blaize, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
CONFIGURABLE SCHEDULER IN A GRAPH STREAMING PROCESSING SYSTEM

Publication number: 20190332429

Abstract: Systems and methods are disclosures for scheduling code in a multiprocessor system. Code is portioned into code blocks by a compiler. The compiler schedules execution of code blocks in nodes. The nodes are connected in a directed acyclical graph with a top node, terminal node and a plurality of intermediate nodes. Execution of the top node is initiated by the compiler. After executing at least one instance of the top node, an instruction in the code block indicates to the scheduler to initiate at least one intermediary node. The scheduler schedules a thread for execution of the intermediary node. The data for the nodes resides in a plurality of data buffers; the index to the data buffer is stored in a command buffer.

Type: Application

Filed: July 8, 2019

Publication date: October 31, 2019

Applicant: ThinCl, Inc.

Inventors: Satyaki Koneru, Val G. Cook, Ke Yin
Node Topology Employing Recirculating Ring Command and Data Buffers for Executing Thread Scheduling

Publication number: 20190325551

Abstract: The claimed invention discloses system comprising a plurality of logical nodes comprised in a single or plurality of stages, with defined properties and resources associated with each node, for reducing compute resources, said system further comprising: at least a recirculating ring buffer holding only any one of a control information, input, and, or out data necessary to stream a temporary data between node and, or nodes in an execution graph, thereby reducing size of said recirculating ring buffer; said recirculating ring buffer being sufficiently reduced in size to reside in an on-chip cache, such that any one of the control information, input, and, or out data between node and, or nodes need not be stored in memory; wherein the control information further comprises a command related to invalidating any one of the input and, or out data held in a recirculating ring data buffer, clearing the buffer of tasked data; and wherein a producer is stalled from writing any more control information into a recirculati

Type: Application

Filed: May 18, 2019

Publication date: October 24, 2019

Applicant: Thinci, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Configurable scheduler for graph processing on multi-processor computing systems

Patent number: 10437637

Abstract: Systems and methods are disclosures for scheduling code in a multiprocessor system. Code is portioned into code blocks by a compiler. The compiler schedules execution of code blocks in nodes. The nodes are connected in a directed acyclical graph with a top node, terminal node and a plurality of intermediate nodes. Execution of the top node is initiated by the compiler. After executing at least one instance of the top node, an instruction in the code block indicates to the scheduler to initiate at least one intermediary node. The scheduler schedules a thread for execution of the intermediary node. The data for the nodes resides in a plurality of data buffers; the index to the data buffer is stored in a command buffer.

Type: Grant

Filed: May 25, 2016

Date of Patent: October 8, 2019

Assignee: Thin CI, Inc.

Inventors: Satyaki Koneru, Val G Cook, Ke Yin
CONFIGURABLE SCHEDULER IN A GRAPH STREAMING PROCESSING SYSTEM

Publication number: 20190235917

Abstract: Systems, apparatuses and methods are disclosed for scheduling threads comprising of code blocks in a graph streaming processor (GSP) system. One system includes a scheduler for scheduling plurality of threads, the plurality of threads includes a set of instructions operating on the graph streaming processors of GSP system. The scheduler comprises a plurality of stages where each stage is coupled to an input command buffer and an output command buffer. A portion of the scheduler is implemented in hardware and comprises of a command parser operative to interpret commands within a corresponding input command buffer, a thread generator coupled to the command parser operate to generate the plurality of threads, and a thread scheduler coupled to the thread generator for dispatching the plurality of threads for operating on the plurality of graph streaming processors.

Type: Application

Filed: April 14, 2019

Publication date: August 1, 2019

Applicant: ThinCl, Inc.

Inventors: Satyaki Koneru, Val G. Cook, Ke Yin
Node topology employing recirculating ring command and data buffers for executing thread scheduling

Patent number: 10311542

Abstract: The claimed invention discloses system comprising a plurality of logical nodes comprised in a single or plurality of stages, with defined properties and resources associated with each node, for reducing compute resources, said system further comprising: at least a recirculating ring buffer holding only any one of a control information, input, and, or out data necessary to stream a temporary data between node and, or nodes in an execution graph, thereby reducing size of said recirculating ring buffer; said recirculating ring buffer being sufficiently reduced in size to reside in an on-chip cache, such that any one of the control information, input, and, or out data between node and, or nodes need not be stored in memory; wherein the control information further comprises a command related to invalidating any one of the input and, or out data held in a recirculating ring data buffer, clearing the buffer of tasked data; and wherein a producer is stalled from writing any more control information into a recirculati

Type: Grant

Filed: March 6, 2017

Date of Patent: June 4, 2019

Assignee: THINCI, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
NODE TOPOLOGY EMPLOYING RECIRCULATING RING COMMAND AND DATA BUFFERS FOR EXECUTING THREAD SCHEDULING

Publication number: 20180253890

Abstract: The claimed invention discloses system comprising a plurality of logical nodes comprised in a single or plurality of stages, with defined properties and resources associated with each node, for reducing compute resources, said system further comprising: at least a recirculating ring buffer holding only any one of a control information, input, and, or out data necessary to stream a temporary data between node and, or nodes in an execution graph, thereby reducing size of said recirculating ring buffer; said recirculating ring buffer being sufficiently reduced in size to reside in an on-chip cache, such that any one of the control information, input, and, or out data between node and, or nodes need not be stored in memory; wherein the control information further comprises a command related to invalidating any one of the input and, or out data held in a recirculating ring data buffer, clearing the buffer of tasked data; and wherein a producer is stalled from writing any more control information into a recirculati

Type: Application

Filed: March 6, 2017

Publication date: September 6, 2018

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala
Mechanism for minimal computation and power consumption for rendering synthetic 3D images, containing pixel overdraw and dynamically generated intermediate images

Patent number: 9589388

Abstract: Embodiments disclosed include a mechanism in a system and method for significantly reducing power consumption by reducing computation and bandwidth. This mechanism is particularly applicable for modern 3D synthetic images which contain high pixel overdraw and dynamically generated intermediates images. Only blocks of computation which contribute to the final image are performed. This is accomplished by rendering in reverse order and by performing multiple visibility sort in a streaming fashion through the pipeline. Rendering of dynamically generated intermediate images is performed sparsely by projecting texture coordinates from a current image back into one or more dependent images in a recursive manner. The newly computed pixel values are then filtered and control is returned to the sampling shader of the current image. When only visible pixels are projected optimal computation is performed. Several implementations are presented with increasing efficiency.

Type: Grant

Filed: July 9, 2014

Date of Patent: March 7, 2017

Assignee: ThinCI, Inc.

Inventors: Val G. Cook, Satyaki Koneru, Ke Yin, Dinakar C. Munagala

1 2 next