Patents by Inventor Soren T. Soe

Soren T. Soe has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DYNAMIC PROVISIONING OF PORTIONS OF A DATA PROCESSING ARRAY FOR SPATIAL AND TEMPORAL SHARING

Publication number: 20240211302

Abstract: Dynamic provisioning of portions of a data processing array includes receiving, from an executing application, a context request. The context request specifies a requested task to be performed by a data processing array. A configuration for the data processing array is selected from a plurality of configurations for the data processing array. The selected configuration conforms with the context request and is capable of performing the requested task. A determination is made whether the selected configuration is implementable in the data processing array based, at least in part, on a space requirement of the selected configuration and a current status of the data processing array. The selected configuration is selectively implemented in the data processing array based on the determination.

Type: Application

Filed: December 22, 2022

Publication date: June 27, 2024

Applicant: Xilinx, Inc.

Inventors: Sonal Santan, Yu Liu, Akila Subramaniam, Vinod K. Kathail, King Chiu Tam, Tung Chuen Kwong, Pranjal Joshi, Soren T. Soe
Unified container for hardware and software binaries

Patent number: 11720422

Abstract: A unified container file can be selected using computer hardware. The unified container file can include a plurality of files embedded therein used to configure a programmable integrated circuit (IC). The plurality of files can include a first partial configuration bitstream and a second partial configuration bitstream. The unified container file also includes metadata specifying a defined relationship between the first partial configuration bitstream and the second partial configuration bitstream for programming the programmable IC. The defined relationship can be determined using computer hardware by reading the metadata from the unified container file. The programmable IC can be configured, using the computer hardware, based on the defined relationship specified by the metadata using the first partial configuration bitstream and the second partial configuration bitstream.

Type: Grant

Filed: March 11, 2021

Date of Patent: August 8, 2023

Assignee: Xilinx, Inc.

Inventors: Hem C. Neema, Sonal Santan, Soren T. Soe, Stephen P. Rozum, Nik Cimino
Machine learning runtime library for neural network acceleration

Patent number: 11694066

Abstract: Embodiments herein describe techniques for interfacing a neural network application with a neural network accelerator using a library. The neural network application may execute on a host computing system while the neural network accelerator executes on a massively parallel hardware system, e.g., a FPGA. The library operates a pipeline for submitting the tasks received from the neural network application to the neural network accelerator. In one embodiment, the pipeline includes a pre-processing stage, an FPGA execution stage, and a post-processing stage which each correspond to different threads. When receiving a task from the neural network application, the library generates a packet that includes the information required for the different stages in the pipeline to perform the tasks. Because the stages correspond to different threads, the library can process multiple packets in parallel which can increase the utilization of the neural network accelerator on the hardware system.

Type: Grant

Filed: October 17, 2017

Date of Patent: July 4, 2023

Assignee: XILINX, INC.

Inventors: Aaron Ng, Jindrich Zejda, Elliott Delaye, Xiao Teng, Sonal Santan, Soren T. Soe, Ashish Sirasao, Ehsan Ghasemi, Sean Settle
Supporting access to accelerators on a programmable integrated circuit by multiple host processes

Patent number: 11086815

Abstract: Supporting multiple clients on a single programmable integrated circuit (IC) can include implementing a first image within the programmable IC in response to a first request for processing to be performed by the programmable IC, wherein the request is from a first process executing in a host data processing system coupled to the programmable IC, receiving, using a processor of the host data processing system, a second request for processing to be performed on the programmable IC from a second and different process executing in the host data processing system while the programmable IC still implements the first image, comparing, using the processor, a second image specified by the second request to the first image, and, in response to determining that the second image matches the first image based on the comparing, granting, using the processor, the second request for processing to be performed by the programmable IC.

Type: Grant

Filed: April 15, 2019

Date of Patent: August 10, 2021

Assignee: Xilinx, Inc.

Inventors: Sonal Santan, Soren T. Soe, Cheng Zhen
Unified container for hardware and software binaries

Patent number: 10956241

Abstract: A computer program product can include a non-transitory computer readable storage medium storing a unified container. The unified container can include a header structure, wherein the header structure has a fixed length and specifies a number of section headers included in the unified container. The unified container can include a plurality of section headers equivalent to the number of section headers specified in the header structure. The unified container can include a plurality of data sections corresponding to the plurality of section headers on a one-to-one basis. The plurality of data sections includes a first data section including a hardware binary and a second data section including a software binary. The hardware binary and the software binary are configured to program a programmable integrated circuit. Each section header specifies a type of data stored in the corresponding data section and specifies a mapping for the corresponding data section.

Type: Grant

Filed: December 20, 2017

Date of Patent: March 23, 2021

Assignee: Xilinx, Inc.

Inventors: Hem C. Neema, Sonal Santan, Soren T. Soe, Stephen P. Rozum, Nik Cimino
Embedded scheduling of hardware resources for hardware acceleration

Patent number: 10877766

Abstract: An integrated circuit (IC) may include a scheduler for hardware acceleration. The scheduler may include a command queue having a plurality of slots and configured to store commands offloaded from a host processor for execution by compute units of the IC. The scheduler may include a status register having bit locations corresponding to the slots of the command queue. The scheduler may also include a controller coupled to the command queue and the status register. The controller may be configured to schedule the compute units of the IC to execute the commands stored in the slots of the command queue and update the bit locations of the status register to indicate which commands from the command queue are finished executing.

Type: Grant

Filed: May 24, 2018

Date of Patent: December 29, 2020

Assignee: Xilinx, Inc.

Inventors: Soren T. Soe, Idris I. Tarwala, Umang Parekh, Sonal Santan, Hem C. Neema
Programming and controlling compute units in an integrated circuit

Patent number: 10705993

Abstract: An integrated circuit (IC) can include a command queue having a plurality of slots corresponding to commands from a host processor for execution by a plurality of compute units of the IC and a command request register having a plurality of locations corresponding to the plurality of slots in the command queue. The command request register is configured to generate an interrupt indicating a new command stored within the command queue. The IC can include a controller configured to, in response to the interrupt from the command request register, determine a selected compute unit that is idle from the plurality of compute units to execute the new command. The IC can also include a compute unit direct memory access circuit configured to provide the new command to the available compute unit.

Type: Grant

Filed: November 19, 2018

Date of Patent: July 7, 2020

Assignee: Xilinx, Inc.

Inventors: Soren T. Soe, Idris I. Tarwala, Ellery Cochell
PROGRAMMING AND CONTROLLING COMPUTE UNITS IN AN INTEGRATED CIRCUIT

Publication number: 20200159680

Abstract: An integrated circuit (IC) can include a command queue having a plurality of slots corresponding to commands from a host processor for execution by a plurality of compute units of the IC and a command request register having a plurality of locations corresponding to the plurality of slots in the command queue. The command request register is configured to generate an interrupt indicating a new command stored within the command queue. The IC can include a controller configured to, in response to the interrupt from the command request register, determine a selected compute unit that is idle from the plurality of compute units to execute the new command. The IC can also include a compute unit direct memory access circuit configured to provide the new command to the available compute unit.

Type: Application

Filed: November 19, 2018

Publication date: May 21, 2020

Applicant: Xilinx, Inc.

Inventors: Soren T. Soe, Idris I. Tarwala, Ellery Cochell
EMBEDDED SCHEDULING OF HARDWARE RESOURCES FOR HARDWARE ACCELERATION

Publication number: 20190361708

Abstract: An integrated circuit (IC) may include a scheduler for hardware acceleration. The scheduler may include a command queue having a plurality of slots and configured to store commands offloaded from a host processor for execution by compute units of the IC. The scheduler may include a status register having bit locations corresponding to the slots of the command queue. The scheduler may also include a controller coupled to the command queue and the status register. The controller may be configured to schedule the compute units of the IC to execute the commands stored in the slots of the command queue and update the bit locations of the status register to indicate which commands from the command queue are finished executing.

Type: Application

Filed: May 24, 2018

Publication date: November 28, 2019

Applicant: Xilinx, Inc.

Inventors: Soren T. Soe, Idris I. Tarwala, Umang Parekh, Sonal Santan, Hem C. Neema
Scheduling hardware resources for offloading functions in a heterogeneous computing system

Patent number: 10402223

Abstract: A heterogeneous computing system can include a host memory and a host processor. The host memory is configured to maintain a write task queue and a read task queue. The host processor is coupled to the host memory and a processing device. The host processor is adapted to store write tasks in the write task queue. The write tasks cause transfer of input data to the processing device. The processing device is adapted to perform offloaded functions. The host processor is adapted to store read tasks in the read task queue. The read tasks cause transfer of results from the offloaded functions from the processing device. The host processor is further adapted to maintain a number of direct memory access (DMA) worker threads corresponding to concurrent data transfer capability of the processing device. Each DMA worker thread is preconfigured to execute tasks from the write task queue or the read task queue.

Type: Grant

Filed: April 26, 2017

Date of Patent: September 3, 2019

Assignee: XILINX, INC.

Inventors: Sonal Santan, Soren T. Soe
MACHINE LEARNING RUNTIME LIBRARY FOR NEURAL NETWORK ACCELERATION

Publication number: 20190114533

Abstract: Embodiments herein describe techniques for interfacing a neural network application with a neural network accelerator using a library. The neural network application may execute on a host computing system while the neural network accelerator executes on a massively parallel hardware system, e.g., a FPGA. The library operates a pipeline for submitting the tasks received from the neural network application to the neural network accelerator. In one embodiment, the pipeline includes a pre-processing stage, an FPGA execution stage, and a post-processing stage which each correspond to different threads. When receiving a task from the neural network application, the library generates a packet that includes the information required for the different stages in the pipeline to perform the tasks. Because the stages correspond to different threads, the library can process multiple packets in parallel which can increase the utilization of the neural network accelerator on the hardware system.

Type: Application

Filed: October 17, 2017

Publication date: April 18, 2019

Applicant: Xilinx, Inc.

Inventors: Aaron Ng, Jindrich Zejda, Elliott Delaye, Xiao Teng, Sonal Santan, Soren T. Soe, Ashish Sirasao, Ehsan Ghasemi, Sean Settle
Programmable logic device having a programmable selector circuit

Patent number: 7620929

Abstract: A PLD is configurable to efficiently implement a wide variety of user functions. The PLD includes a programmable interconnect circuit, programmable logic circuits, one-bit registers, selector circuits, and input/output blocks. The programmable interconnect circuit is configurable to connect the signal lines of its output ports to the signal lines of its input ports. The programmable logic circuits are configurable to implement a programmable function generating one-bit signal values from a respective output port of the programmable interconnect circuit. The one-bit registers store a respective one of these one-bit signal values. The programmable selector circuits are each coupled to output ports of a plurality of the one-bit registers, with each of these one-bit registers coupled to a respective one of the programmable logic circuits.

Type: Grant

Filed: January 28, 2008

Date of Patent: November 17, 2009

Assignee: Xilinx, Inc.

Inventors: Tetse Jang, Soren T. Soe, Scott Te-Sheng Lien
Programmable logic device having a programmable selector circuit

Patent number: 7345508

Abstract: A PLD is configurable to efficiently implement a wide variety of user functions. The PLD includes a programmable interconnect circuit, programmable logic circuits, one-bit registers, selector circuits, and input/output blocks. The programmable interconnect circuit is configurable to connect the signal lines of its output ports to the signal lines of its input ports. The programmable logic circuits are configurable to implement a programmable function generating one-bit signal values from a respective output port of the programmable interconnect circuit. The one-bit registers store a respective one of these one-bit signal values. The programmable selector circuits are each coupled to output ports of a plurality of the one-bit registers, with each of these one-bit registers coupled to a respective one of the programmable logic circuits.

Type: Grant

Filed: January 24, 2006

Date of Patent: March 18, 2008

Assignee: Xilinx, Inc.

Inventors: Tetse Jang, Soren T. Soe, Scott Te-Sheng Lien
CPLD with fast logic sharing between function blocks

Patent number: 7129747

Abstract: Fast logic sharing is created using a feedback path from the output logic macrocell of one functional block to the product term inputs of another function block without going through an advanced interconnect matrix (AIM). The fast feedback path may be provided from the macrocell after the product terms XOR gate without registering, and/or after the register in the macrocell. The fast logic sharing avoids the slow AIM for feedback logic, and allows additional resources to be borrowed from other function blocks with a limited delay penalty. In particular, delay penalties resulting from dividing wide operations requiring multiple product terms between the product terms of multiple functional blocks are significantly reduced.

Type: Grant

Filed: October 15, 2004

Date of Patent: October 31, 2006

Assignee: Xilinx, Inc.

Inventors: Tetse Jang, Soren T. Soe, Scott Te-Sheng Lien
Incremental logic synthesis system for revisions of logic circuit designs

Patent number: 6484292

Abstract: A method and apparatus for implementing incremental design changes. In various embodiments, primary outputs of a new design are compared for logical equivalence to corresponding primary outputs of a prior implementation. If the logic is equivalent, the implementation of the primary outputs from the prior implementation is reused to implement the corresponding primary outputs of the new design.

Type: Grant

Filed: February 7, 2000

Date of Patent: November 19, 2002

Assignee: Xilinx, Inc.

Inventors: Gitu Jain, Soren T. Soe
Method and apparatus for implementing type-safe heterogeneous property lists

Patent number: 6336211

Abstract: A method and apparatus for implementing type-safe, heterogeneous object-oriented property lists. A library of classes and template classes implementing a heterogeneous property list is defined, wherein each object of the property list includes a property name and an associated property value of a selected property type. Instantiation of objects having equal property names and unequal property types is prevented with a constructor, and template functions are provided each of which accesses the objects of the property list according to a type of data passed to the function.

Type: Grant

Filed: May 20, 1999

Date of Patent: January 1, 2002

Assignee: Xilinx, Inc.

Inventor: Soren T. Soe
Method for programming complex PLD having more than one function block type

Patent number: 5963048

Abstract: A method for programming programmable logic devices (PLDs) having multiple function block types to implement a logic function, whereby the logic function is mapped into one of the function block types before being mapped into the remaining function block types. In one embodiment, a PLD containing both "fast" function blocks (FFBs) and "high density" function blocks (HDFBs) are programmed such that the FFBs are programmed prior to the HDFBs. This method maximizes the overall speed of an implemented logic function.

Type: Grant

Filed: March 12, 1997

Date of Patent: October 5, 1999

Assignee: Xilinx, Inc.

Inventors: David A. Harrison, Joshua M. Silver, Soren T. Soe
Method for programming complex PLD having more than one function block type

Patent number: 5636368

Abstract: A method for programming programmable logic devices (PLDs) having multiple function block types to implement a logic function, whereby the logic function is mapped into one of the function block types before being mapped into the remaining function block types. In one embodiment, a PLD containing both "fast" function blocks (FFBs) and "high density" function blocks (HDFBs) are programmed such that the FFBs are programmed prior to the HDFBs. This method maximizes the overall speed of an implemented logic function.

Type: Grant

Filed: December 23, 1994

Date of Patent: June 3, 1997

Assignee: Xilinx, Inc.

Inventors: David A. Harrison, Joshua M. Silver, Soren T. Soe