Patents Assigned to T-Head (Shanghai) Semiconductor Co., Ltd.
-
Patent number: 11960437Abstract: A system includes a high-bandwidth inter-chip network (ICN) that allows communication between parallel processing units (PPUs) in the system. For example, the ICN allows a PPU to communicate with other PPUs on the same compute node or server and also with PPUs on other compute nodes or servers. In embodiments, communication may be at the command level (e.g., at the direct memory access level) and at the instruction level (e.g., the finer-grained load/store instruction level). The ICN allows PPUs in the system to communicate without using a PCIe bus, thereby avoiding its bandwidth limitations and relative lack of speed. The respective routing tables comprise information of multiple paths to any given other PPU.Type: GrantFiled: July 15, 2022Date of Patent: April 16, 2024Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.Inventors: Liang Han, Yunxiao Zou
-
Patent number: 11886352Abstract: This specification describes methods and systems for accelerating attribute data access for graph neural network (GNN) processing. An example method includes: receiving a root node identifier corresponding to a node in a graph for GNN processing; determining one or more candidate node identifiers according to the root node identifier, wherein attribute data corresponding to the one or more candidate node identifiers are sequentially stored in a memory; and sampling one or more graph node identifiers at least from the one or more candidate node identifiers for the GNN processing.Type: GrantFiled: January 21, 2022Date of Patent: January 30, 2024Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.Inventors: Heng Liu, Tianchan Guan, Shuangchen Li, Hongzhong Zheng
-
Publication number: 20230401060Abstract: The embodiments of the present application provide a processing unit. The processing unit comprises: an instruction fetching unit configured for fusing instruction of vector configuration instruction and vector operation instruction that are adjacent in order to obtain fusion instruction; an instruction decoding unit configured to decode the fusion instruction to obtain first execution information and second execution information; a vector configuration unit configured to execute the vector configuration instruction according to the first execution information, modify vector control register, and bypass the value of the modified vector control register to the vector operation unit; the vector operation unit configured to execute the vector operation instruction according to the second execution information and the value of the modified vector control register.Type: ApplicationFiled: December 30, 2022Publication date: December 14, 2023Applicant: T-HEAD (SHANGHAI) SEMICONDUCTOR CO., LTD.Inventors: Dongqi LIU, Haowen CHEN, Zhao JIANG, Chang LIU, Dingyan WEI, Wenjian XU, Tao JIANG
-
Publication number: 20230401062Abstract: The embodiments of the present application provide an instruction retirement unit, an instruction execution unit, and a related apparatus and method. The instruction retirement unit includes: a reception subunit, configured to receive a completion request signal; an arbitration subunit, configured to send a completion permission signal to an instruction execution unit after determining that instructions to be completed can be completed, and store instruction completion information of the instructions to be completed into instruction buffer table entries in a buffer; and a retirement subunit, configured to perform, based on the instruction completion information stored in each of the instruction buffer table entries, retirement processing on instructions that have already been completed and have not been retired, and delete the instruction completion information of retired instructions from the instruction buffer table entries.Type: ApplicationFiled: December 30, 2022Publication date: December 14, 2023Applicant: T-HEAD (SHANGHAI) SEMICONDUCTOR CO., LTD.Inventors: Chang LIU, Haowen CHEN, Tao JIANG
-
Patent number: 11841799Abstract: This application describes a hardware accelerator, a computer system, and a method for accelerating Graph Neural Network (GNN) node attribute fetching. The hardware accelerator comprises a GNN attribute processor; and a first memory, wherein the GNN attribute processor is configured to: receive a graph node identifier; determine a target memory address within the first memory based on the graph node identifier; determine, based on the received graph node identifier, whether attribute data corresponding to the received graph node identifier is cached in the first memory at the target memory address; and in response to determining that the attribute data is not cached in the first memory: fetch the attribute data from a second memory, and write the fetched attribute data into the first memory at the target memory address.Type: GrantFiled: January 21, 2022Date of Patent: December 12, 2023Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.Inventors: Tianchan Guan, Heng Liu, Shuangchen Li, Hongzhong Zheng
-
Patent number: 11803224Abstract: The present disclosure provides a power management method, a multi-processing unit system, and a power management module thereof. The multi-processing unit system comprises a plurality of local power management units and a global power management unit, each of the local power management units corresponds to a processing unit of the multi-processing unit system. The power management method comprises: obtaining, using the global power management unit, a global power budget for the multi-processing unit system; allocating, using the global power management unit, local power budget for each of the local power management units according to the global power budget and power management parameters of the processing units; managing, using the local power management unit, local power resources of corresponding processing unit based on the allocated local power budget; reporting, using the local power management unit, the power management parameters of the processing unit to the global power management unit.Type: GrantFiled: January 6, 2022Date of Patent: October 31, 2023Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.Inventors: Haoran Li, Fei Sun
-
Patent number: 11762776Abstract: The present application discloses a cache access method and an associated graph neural network system. The graph neural network processor is used for performing computation upon a graph neural network. The graph neural network is stored in the memory in compressed sparse row format. The method includes: receiving an address corresponding to a node of the graph neural network and a type of the address; in response to the type is one of a first type or a second type, performing lookup by comparing the address with a tag field of a degree lookup table to at least obtain a degree of the node; determining whether the degree is greater than a predetermined value to obtain a determination result; and determining whether to perform lookup on a region of the cache corresponding to the type according to the determination result.Type: GrantFiled: January 25, 2022Date of Patent: September 19, 2023Assignee: T-HEAD (SHANGHAI) SEMICONDUCTOR CO., LTD.Inventors: Zhe Zhang, Shuangchen Li, Hongzhong Zheng
-
Patent number: 11550572Abstract: This disclosure provides an instruction transmitting unit, an instruction execution unit, and a related apparatus and method. The instruction transmitting unit includes: an instruction splitter adapted to split a to-be-executed vector instruction into microinstructions; a microinstruction index fetcher adapted to acquire a number-of-effective-elements index of the microinstructions resulting from the splitting based on an element range involved in the microinstructions; an index comparison subunit adapted to compare the acquired number-of-effective-elements index with a first index, where the first index is a number-of-effective-elements index of a fault-only-first microinstruction whose processing has not been completed; and a microinstruction transmission controller adapted to transmit the microinstructions resulting from the splitting to a vector execution unit for execution when the number-of-effective-elements index is less than the first index.Type: GrantFiled: October 25, 2021Date of Patent: January 10, 2023Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.Inventors: Jiahui Luo, Taotao Zhu, Chang Liu