Patents Assigned to T-Head (Shanghai) Semiconductor Co., Ltd.
  • Patent number: 11960437
    Abstract: A system includes a high-bandwidth inter-chip network (ICN) that allows communication between parallel processing units (PPUs) in the system. For example, the ICN allows a PPU to communicate with other PPUs on the same compute node or server and also with PPUs on other compute nodes or servers. In embodiments, communication may be at the command level (e.g., at the direct memory access level) and at the instruction level (e.g., the finer-grained load/store instruction level). The ICN allows PPUs in the system to communicate without using a PCIe bus, thereby avoiding its bandwidth limitations and relative lack of speed. The respective routing tables comprise information of multiple paths to any given other PPU.
    Type: Grant
    Filed: July 15, 2022
    Date of Patent: April 16, 2024
    Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.
    Inventors: Liang Han, Yunxiao Zou
  • Patent number: 11886352
    Abstract: This specification describes methods and systems for accelerating attribute data access for graph neural network (GNN) processing. An example method includes: receiving a root node identifier corresponding to a node in a graph for GNN processing; determining one or more candidate node identifiers according to the root node identifier, wherein attribute data corresponding to the one or more candidate node identifiers are sequentially stored in a memory; and sampling one or more graph node identifiers at least from the one or more candidate node identifiers for the GNN processing.
    Type: Grant
    Filed: January 21, 2022
    Date of Patent: January 30, 2024
    Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.
    Inventors: Heng Liu, Tianchan Guan, Shuangchen Li, Hongzhong Zheng
  • Publication number: 20230401060
    Abstract: The embodiments of the present application provide a processing unit. The processing unit comprises: an instruction fetching unit configured for fusing instruction of vector configuration instruction and vector operation instruction that are adjacent in order to obtain fusion instruction; an instruction decoding unit configured to decode the fusion instruction to obtain first execution information and second execution information; a vector configuration unit configured to execute the vector configuration instruction according to the first execution information, modify vector control register, and bypass the value of the modified vector control register to the vector operation unit; the vector operation unit configured to execute the vector operation instruction according to the second execution information and the value of the modified vector control register.
    Type: Application
    Filed: December 30, 2022
    Publication date: December 14, 2023
    Applicant: T-HEAD (SHANGHAI) SEMICONDUCTOR CO., LTD.
    Inventors: Dongqi LIU, Haowen CHEN, Zhao JIANG, Chang LIU, Dingyan WEI, Wenjian XU, Tao JIANG
  • Publication number: 20230401062
    Abstract: The embodiments of the present application provide an instruction retirement unit, an instruction execution unit, and a related apparatus and method. The instruction retirement unit includes: a reception subunit, configured to receive a completion request signal; an arbitration subunit, configured to send a completion permission signal to an instruction execution unit after determining that instructions to be completed can be completed, and store instruction completion information of the instructions to be completed into instruction buffer table entries in a buffer; and a retirement subunit, configured to perform, based on the instruction completion information stored in each of the instruction buffer table entries, retirement processing on instructions that have already been completed and have not been retired, and delete the instruction completion information of retired instructions from the instruction buffer table entries.
    Type: Application
    Filed: December 30, 2022
    Publication date: December 14, 2023
    Applicant: T-HEAD (SHANGHAI) SEMICONDUCTOR CO., LTD.
    Inventors: Chang LIU, Haowen CHEN, Tao JIANG
  • Patent number: 11841799
    Abstract: This application describes a hardware accelerator, a computer system, and a method for accelerating Graph Neural Network (GNN) node attribute fetching. The hardware accelerator comprises a GNN attribute processor; and a first memory, wherein the GNN attribute processor is configured to: receive a graph node identifier; determine a target memory address within the first memory based on the graph node identifier; determine, based on the received graph node identifier, whether attribute data corresponding to the received graph node identifier is cached in the first memory at the target memory address; and in response to determining that the attribute data is not cached in the first memory: fetch the attribute data from a second memory, and write the fetched attribute data into the first memory at the target memory address.
    Type: Grant
    Filed: January 21, 2022
    Date of Patent: December 12, 2023
    Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.
    Inventors: Tianchan Guan, Heng Liu, Shuangchen Li, Hongzhong Zheng
  • Patent number: 11803224
    Abstract: The present disclosure provides a power management method, a multi-processing unit system, and a power management module thereof. The multi-processing unit system comprises a plurality of local power management units and a global power management unit, each of the local power management units corresponds to a processing unit of the multi-processing unit system. The power management method comprises: obtaining, using the global power management unit, a global power budget for the multi-processing unit system; allocating, using the global power management unit, local power budget for each of the local power management units according to the global power budget and power management parameters of the processing units; managing, using the local power management unit, local power resources of corresponding processing unit based on the allocated local power budget; reporting, using the local power management unit, the power management parameters of the processing unit to the global power management unit.
    Type: Grant
    Filed: January 6, 2022
    Date of Patent: October 31, 2023
    Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.
    Inventors: Haoran Li, Fei Sun
  • Patent number: 11762776
    Abstract: The present application discloses a cache access method and an associated graph neural network system. The graph neural network processor is used for performing computation upon a graph neural network. The graph neural network is stored in the memory in compressed sparse row format. The method includes: receiving an address corresponding to a node of the graph neural network and a type of the address; in response to the type is one of a first type or a second type, performing lookup by comparing the address with a tag field of a degree lookup table to at least obtain a degree of the node; determining whether the degree is greater than a predetermined value to obtain a determination result; and determining whether to perform lookup on a region of the cache corresponding to the type according to the determination result.
    Type: Grant
    Filed: January 25, 2022
    Date of Patent: September 19, 2023
    Assignee: T-HEAD (SHANGHAI) SEMICONDUCTOR CO., LTD.
    Inventors: Zhe Zhang, Shuangchen Li, Hongzhong Zheng
  • Patent number: 11550572
    Abstract: This disclosure provides an instruction transmitting unit, an instruction execution unit, and a related apparatus and method. The instruction transmitting unit includes: an instruction splitter adapted to split a to-be-executed vector instruction into microinstructions; a microinstruction index fetcher adapted to acquire a number-of-effective-elements index of the microinstructions resulting from the splitting based on an element range involved in the microinstructions; an index comparison subunit adapted to compare the acquired number-of-effective-elements index with a first index, where the first index is a number-of-effective-elements index of a fault-only-first microinstruction whose processing has not been completed; and a microinstruction transmission controller adapted to transmit the microinstructions resulting from the splitting to a vector execution unit for execution when the number-of-effective-elements index is less than the first index.
    Type: Grant
    Filed: October 25, 2021
    Date of Patent: January 10, 2023
    Assignee: T-Head (Shanghai) Semiconductor Co., Ltd.
    Inventors: Jiahui Luo, Taotao Zhu, Chang Liu