Patents by Inventor Peng OUYANG

Peng OUYANG has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11977894
    Abstract: The disclosure provides a method for distributing instructions in a reconfigurable processor. The reconfigurable processor includes an instruction fetch module, an instruction sync control module and an instruction queue module. The method includes: configuring a format of a Memory Sync ID Table of each instruction type, obtaining a first memory identification field and a second memory identification field of each instruction, obtaining one-hot encodings of first and second memory identification fields, obtaining a sync table and executing each instruction of a plurality of to-be-run instructions.
    Type: Grant
    Filed: May 7, 2021
    Date of Patent: May 7, 2024
    Assignee: BEIJING TSINGMICRO INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Baochuan Fei, Peng Ouyang, Shibin Tang, Liwei Deng
  • Patent number: 11954061
    Abstract: A mapping method for a reconfigurable array, including: S1: obtaining and analyzing a DDG; providing an initial interval; obtaining a reconfigurable architecture; copying the first adjacency matrix and the second adjacency matrix to form a mapping space; establishing an integer linear programming model, and mapping, with the integer linear programming model, a processing vertex, an intra-cycle edge, and an inter-cycle edge in the DDG, to the mapping space, respectively; obtaining a mapping relationship from the processing vertex and the edge in the DDG to the processing element and the link of extended TS_max layers; and generating configuration information by the mapping relationship modulo the initial interval.
    Type: Grant
    Filed: September 23, 2021
    Date of Patent: April 9, 2024
    Assignee: BEIJING TSINGMICRO INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Chongyang Wang, Zhen Zhang, Peng Ouyang
  • Patent number: 11928473
    Abstract: An instruction scheduling method and an instruction scheduling system for a reconfigurable array processor. The method includes: determining whether a fan-out of a vertex in a data flow graph (DFG) is less than an actual interconnection number of a processing unit in a reconfigurable array; establishing a corresponding relationship between the vertex and a correlation operator of the processing unit; introducing a register to a directed edge, acquiring a retiming value of each vertex; arranging instructions in such a manner that retiming values of the instruction vertexes are in ascending order, and acquiring transmission time and scheduling order of the instructions; folding the DFG, placing an instruction to an instruction vertex; inserting a register and acquiring a current DFG; and acquiring a common maximum subset of the current DFG and the reconfigurable array by a maximum clique algorithm, and distributing the instructions.
    Type: Grant
    Filed: March 22, 2022
    Date of Patent: March 12, 2024
    Assignee: BEIJING TSINGMICRO INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Kejia Zhu, Zhen Zhang, Peng Ouyang
  • Patent number: 11921668
    Abstract: The present disclosure provides a processor array and a multiple-core processor. The processor array includes a plurality of processing elements arranged in a two-dimensional array, a plurality of first load units correspondingly arranged and connected to the processing elements of the first edge row, respectively, a plurality of second load units correspondingly arranged and connected to the processing elements of the first edge column, respectively, a plurality of first store units correspondingly arranged and connected to the processing elements of the second edge column, respectively, a plurality of second store units correspondingly arranged and connected to the processing elements of the second edge row, respectively.
    Type: Grant
    Filed: July 15, 2021
    Date of Patent: March 5, 2024
    Assignee: BEIJING TSINGMICRO INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Peng Ouyang, Guozhi Song
  • Patent number: 11740832
    Abstract: A data storage method includes: obtaining memory banks of arithmetic data; generating undetermined memory bank numbers of the memory banks sequentially; scanning storage dimensions of the arithmetic data to obtain the undetermined memory bank numbers, and filling elements to make the undetermined memory bank numbers continuous if the undetermined memory bank numbers of two adjacent dimensions are not continuous; selecting, through a greedy algorithm, the determined transformation vector with the least conflict and the smallest number of filling elements as the current transformation vector; generating current memory bank numbers of the memory banks according to the current transformation vector; converting each of the current memory bank numbers into a physical storage bank address through an offset function to obtain a corresponding internal offset address; and storing the arithmetic data into the memory banks according to the current memory bank numbers and the internal offset addresses.
    Type: Grant
    Filed: September 24, 2021
    Date of Patent: August 29, 2023
    Assignee: BEIJING TSINGMICRO INTELLIGENT TECHNOLOGY CO., LTD.
    Inventors: Cheng Li, Peng Ouyang, Zhen Zhang
  • Publication number: 20230068450
    Abstract: The disclosure provides a method and apparatus for processing sparse data. The method is applied to a reconfigurable processor that includes a PE array, and the PE array includes P×Q PE units. The method includes: dividing a sparse weight matrix to be calculated into at least one unit block; grouping a plurality of unit blocks into a computing group; and obtaining an effective weight address corresponding to each effective weight in the computing group.
    Type: Application
    Filed: May 27, 2021
    Publication date: March 2, 2023
    Inventors: Shibin TANG, Peng OUYANG
  • Publication number: 20230068463
    Abstract: The disclosure provides a method for distributing instructions in a reconfigurable processor. The reconfigurable processor includes an instruction fetch module, an instruction sync control module and an instruction queue module. The method includes: configuring a format of a Memory Sync ID Table of each instruction type, obtaining a first memory identification field and a second memory identification field of each instruction, obtaining one-hot encodings of first and second memory identification fields, obtaining a sync table and executing each instruction of a plurality of to-be-run instructions.
    Type: Application
    Filed: May 7, 2021
    Publication date: March 2, 2023
    Inventors: Baochuan FEI, Peng OUYANG, Shibin TANG, Liwei DENG
  • Publication number: 20220214883
    Abstract: An instruction scheduling method and an instruction scheduling system for a reconfigurable array processor. The method includes: determining whether a fan-out of a vertex in a data flow graph (DFG) is less than an actual interconnection number of a processing unit in a reconfigurable array; establishing a corresponding relationship between the vertex and a correlation operator of the processing unit; introducing a register to a directed edge, acquiring a retiming value of each vertex; arranging instructions in such a manner that retiming values of the instruction vertexes are in ascending order, and acquiring transmission time and scheduling order of the instructions; folding the DFG, placing an instruction to an instruction vertex; inserting a register and acquiring a current DFG; and acquiring a common maximum subset of the current DFG and the reconfigurable array by a maximum clique algorithm, and distributing the instructions.
    Type: Application
    Filed: March 22, 2022
    Publication date: July 7, 2022
    Inventors: Kejia ZHU, Zhen ZHANG, Peng OUYANG
  • Publication number: 20220206697
    Abstract: Provided are a memory coupled compiling method and system of a reconfigurable chip. The memory coupled compiling method includes: acquiring a cycle number of a data flow graph (DFG); acquiring a linear transformation vector of the cycle number through a mapping time difference; determining whether a linear array of the linear transformation vector is acquired by a heuristic algorithm; acquiring a memory mapping result through a current DFG or acquiring a cycle number of the current DFG until the linear array is acquired, depending on the determination result.
    Type: Application
    Filed: September 24, 2021
    Publication date: June 30, 2022
    Inventors: Zhen ZHANG, Peng OUYANG, Junbao HU
  • Publication number: 20220100698
    Abstract: The present disclosure provides a processor array and a multiple-core processor. The processor array includes a plurality of processing elements arranged in a two-dimensional array, a plurality of first load units correspondingly arranged and connected to the processing elements of the first edge row, respectively, a plurality of second load units correspondingly arranged and connected to the processing elements of the first edge column, respectively, a plurality of first store units correspondingly arranged and connected to the processing elements of the second edge column, respectively, a plurality of second store units correspondingly arranged and connected to the processing elements of the second edge row, respectively.
    Type: Application
    Filed: July 15, 2021
    Publication date: March 31, 2022
    Inventors: Peng OUYANG, Guozhi SONG
  • Publication number: 20220100699
    Abstract: A computing array includes a plurality of process element groups, and each of the plurality of the process element groups includes four process elements arranged in two rows and two columns and a merging unit. Each of the four process elements includes an input subunit; a fetch and decode subunit configured to obtain and compile the instruction to output a logic computing type; an operation subunit configured to obtain computing result data according to the logic computing type and the operation data; an output subunit configured to output the computing result data. The merging unit is connected to the output subunit of each of the four process elements, and configured to receive the computing result data output by the output subunit of each of the four process elements, merge the computing result data and output the merged computing result data.
    Type: Application
    Filed: September 23, 2021
    Publication date: March 31, 2022
    Inventors: Peng OUYANG, Yaxue ZHANG
  • Publication number: 20220100414
    Abstract: A data storage method includes: obtaining memory banks of arithmetic data; generating undetermined memory bank numbers of the memory banks sequentially; scanning storage dimensions of the arithmetic data to obtain the undetermined memory bank numbers, and filling elements to make the undetermined memory bank numbers continuous if the undetermined memory bank numbers of two adjacent dimensions are not continuous; selecting, through a greedy algorithm, the determined transformation vector with the least conflict and the smallest number of filling elements as the current transformation vector; generating current memory bank numbers of the memory banks according to the current transformation vector; converting each of the current memory bank numbers into a physical storage bank address through an offset function to obtain a corresponding internal offset address; and storing the arithmetic data into the memory banks according to the current memory bank numbers and the internal offset addresses.
    Type: Application
    Filed: September 24, 2021
    Publication date: March 31, 2022
    Inventors: Cheng LI, Peng OUYANG, Zhen ZHANG
  • Publication number: 20220100521
    Abstract: A data loading and storage system includes a storage module, a buffering module, a control module, a plurality of data loading modules, a plurality of data storage modules and a multi-core processor array module. The data is stored contiguously in a DDR, and the data computed by the multi-core processor may be arranged contiguously or according to a certain rule. After the DMA reads the data into the DATA_BUF module in BURST mode, the data loading modules (i.e., load modules) are designed to support fast loading of the data into the multi-core processor array. The data storage modules (i.e., store modules) are designed to quickly store the computed results of the multi-core processor array into the DATA_BUF module according to a certain rule.
    Type: Application
    Filed: September 24, 2021
    Publication date: March 31, 2022
    Inventors: Pengpeng ZHANG, Peng OUYANG
  • Publication number: 20220083495
    Abstract: A mapping method for a reconfigurable array, including: S1: obtaining and analyzing a DDG; providing an initial interval; obtaining a reconfigurable architecture; copying the first adjacency matrix and the second adjacency matrix to form a mapping space; establishing an integer linear programming model, and mapping, with the integer linear programming model, a processing vertex, an intra-cycle edge, and an inter-cycle edge in the DDG, to the mapping space, respectively; obtaining a mapping relationship from the processing vertex and the edge in the DDG to the processing element and the link of extended TS_max layers; and generating configuration information by the mapping relationship modulo the initial interval.
    Type: Application
    Filed: September 23, 2021
    Publication date: March 17, 2022
    Inventors: Chongyang WANG, Zhen ZHANG, Peng OUYANG
  • Publication number: 20220036521
    Abstract: An image correction method includes: capturing speckle patterns on two planes at different distances to obtain a first image of speckle projected on a first plane and a second image of speckle projected on a second plane; matching the first image with the second image to obtain sub-pixel matching points; obtaining, based on first physical coordinates of the sub-pixel matching points on the first image and second physical coordinates of the sub-pixel matching points on the second image, a mapping matrix between the first and second physical coordinates; obtaining a direction vector of a center of the speckle projector in a camera reference frame according to the mapping matrix; adjusting coordinate axis directions of the camera reference frame to align a horizontal axis direction with the direction vector, updating an imaging matrix of the camera; and mapping a target scene image through the imaging matrix to obtain a corrected image.
    Type: Application
    Filed: September 29, 2021
    Publication date: February 3, 2022
    Inventors: Kai ZHOU, Shouyi YIN, Shibin TANG, Peng OUYANG, Xiudong LI, Bo WANG
  • Patent number: 11151439
    Abstract: A computing in-memory system and computing in-memory method based on a skyrmion racetrack memory are provided. The system comprises a circuit architecture of SRM-CIM. The circuit architecture of the SRM-CIM comprises a row decoder, a column decoder, a voltage driver, a storage array, a modified sensor circuit, a counter (Bit-counter) and a mode controller. The voltage driver includes two NMOSs, and the two NMOSs are each connected with a selector MUX. The modified sensor circuit compares the resistance from a first node to a second node with that from a third node to a fourth node by using a pre-charge sense amplifier. The storage array is composed of the skyrmion racetrack memories. The computing in-memory architecture is designed by utilizing the skyrmion racetrack memory, so that storage is realized in the memory and computing operations can be carried out in the memory.
    Type: Grant
    Filed: April 25, 2019
    Date of Patent: October 19, 2021
    Assignees: HEFEI INNOVATION RESEARCH INSTITUTE, BEIHANG UNIVERSITY, BEIHANG UNIVERSITY
    Inventors: Peng Ouyang, Yu Pan, Youguang Zhang, Weisheng Zhao
  • Publication number: 20210019596
    Abstract: A computing in-memory system and computing in-memory method based on a skyrmion racetrack memory are provided. The system comprises a circuit architecture of SRM-CIM. The circuit architecture of the SRM-CIM comprises a row decoder, a column decoder, a voltage driver, a storage array, a modified sensor circuit, a counter (Bit-counter) and a mode controller. The voltage driver includes two NMOSs, and the two NMOSs are each connected with a selector MUX. The modified sensor circuit compares the resistance from a first node to a second node with that from a third node to a fourth node by using a pre-charge sense amplifier. The storage array is composed of the skyrmion racetrack memories. The computing in-memory architecture is designed by utilizing the skyrmion racetrack memory, so that storage is realized in the memory and computing operations can be carried out in the memory.
    Type: Application
    Filed: April 25, 2019
    Publication date: January 21, 2021
    Applicants: HEFEI INNOVATION RESEARCH INSTITUTE, BEIHANG UNIVERSITY, BEIHANG UNIVERSITY
    Inventors: Peng OUYANG, Yu PAN, Youguang ZHANG, Weisheng ZHAO
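The instruction-distribution abstract (patent 11977894) describes obtaining one-hot encodings of an instruction's two memory identification fields and combining them into a sync table. The sketch below illustrates that one-hot-and-merge step in isolation; the function names, the 8-bit width, and the OR-combination are illustrative assumptions, not details from the patent text:

```python
def one_hot(mem_id: int, width: int = 8) -> int:
    """Encode a memory identification field as a one-hot bitmask:
    bit `mem_id` of a `width`-bit word is set, all others clear."""
    assert 0 <= mem_id < width
    return 1 << mem_id

def sync_entry(first_mem_id: int, second_mem_id: int, width: int = 8) -> int:
    """Combine the one-hot encodings of an instruction's first and second
    memory identification fields into a single sync-table entry."""
    return one_hot(first_mem_id, width) | one_hot(second_mem_id, width)

def build_sync_table(instructions, width: int = 8):
    """Build a sync table: one combined bitmask per to-be-run instruction,
    recording which memories that instruction touches."""
    return [sync_entry(a, b, width) for a, b in instructions]
```

A bitmask per instruction makes the later dependency check a single AND: two instructions can conflict only if their masks overlap.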
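The instruction-scheduling abstract (patent 11928473) arranges instructions so that the retiming values of their vertices are in ascending order. That ordering step alone can be sketched as a stable sort keyed on the retiming value; the dictionary representation and function name are assumptions for illustration:

```python
def order_by_retiming(retiming: dict) -> list:
    """Arrange instruction vertices so their retiming values are in
    ascending order. Python's sort is stable, so vertices with equal
    retiming values keep their original (insertion) order."""
    return sorted(retiming, key=retiming.get)
```

The stable tie-break matters in practice: vertices at the same retiming level are independent in this respect, and keeping their prior order avoids gratuitously reshuffling the transmission sequence.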
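The mapping abstract (patent 11954061) generates configuration information "by the mapping relationship modulo the initial interval" — the standard modulo-scheduling fold, where an operation placed at time step t executes in configuration context t mod II. A minimal sketch of that fold, with the mapping representation chosen for illustration only:

```python
def fold_mapping(mapping: dict, ii: int):
    """Fold a time-extended placement into `ii` configuration contexts:
    an operation placed on processing element `pe` at time step `t`
    executes in context t % ii on that PE.

    `mapping` maps op name -> (pe, t); returns a list of `ii` contexts,
    each a list of (op, pe) pairs.
    """
    contexts = [[] for _ in range(ii)]
    for op, (pe, t) in mapping.items():
        contexts[t % ii].append((op, pe))
    return contexts
```

With II contexts cycling forever, successive loop iterations overlap: iteration k occupies the same contexts as iteration k+1, shifted by II time steps.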
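The data-storage abstracts (patent 11740832 / publication 20220100414) select, via a greedy algorithm, the transformation vector with the least bank conflict. The toy sketch below shows only that selection step, under two assumptions not taken from the patent text: banks are numbered by a linear transform (dot product of index and vector, modulo the bank count), and "conflict" means simultaneous accesses colliding in one bank:

```python
from itertools import product

def bank_of(index, vector, num_banks):
    """Assumed linear bank-numbering: dot(index, vector) mod num_banks."""
    return sum(i * v for i, v in zip(index, vector)) % num_banks

def conflict_count(accesses, vector, num_banks):
    """Number of simultaneous accesses that collide in the same bank."""
    banks = [bank_of(a, vector, num_banks) for a in accesses]
    return len(banks) - len(set(banks))

def greedy_vector(accesses, num_banks, dims):
    """Greedily pick the transformation vector with the fewest conflicts
    over one set of simultaneous accesses (ties broken by enumeration
    order). Exhaustive over a small vector space for clarity."""
    return min(product(range(num_banks), repeat=dims),
               key=lambda v: conflict_count(accesses, v, num_banks))
```

For a 2-D access pattern walking along the second dimension, any vector with a second component coprime to the bank count spreads the accesses across distinct banks, so the search finds a zero-conflict vector immediately.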