Patents by Inventor Xiaozhang Gong

Xiaozhang Gong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11892970
    Abstract: A method for data processing, a processor chip. The method includes: acquiring a first relationship instruction; executing at least one first computing instruction acquired before the first relationship instruction based on the first relationship instruction; and sending acknowledgment information based on the first relationship instruction in response to completing executing the at least one first computing instruction, to cause a second coprocessor receiving the acknowledgment information to revert to a state of acquiring a second computing instruction after the second relationship instruction acquired by a second coprocessor based on the acknowledgment information.
    Type: Grant
    Filed: July 19, 2022
    Date of Patent: February 6, 2024
    Assignee: KUNLUNXIN TECHNOLOGY (BEIJING) COMPANY
    Inventors: Jing Wang, Jiaxin Shi, Hanlin Xie, Xiaozhang Gong
  • Patent number: 11615295
    Abstract: A data processing system includes a central processing unit (CPU) and accelerator cards coupled to the CPU over a bus, each of the accelerator cards having a plurality of data processing (DP) accelerators to receive DP tasks from the CPU and to perform the received DP tasks. At least two of the accelerator cards are coupled to each other via an inter-card connection, and at least two of the DP accelerators are coupled to each other via an inter-chip connection. Each of the inter-card connection and the inter-chip connection is capable of being dynamically activated or deactivated, such that in response to a request received from the CPU, any one of the accelerator cards or any one of the DP accelerators within any one of the accelerator cards can be enabled or disabled to process any one of the DP tasks received from the CPU.
    Type: Grant
    Filed: November 15, 2019
    Date of Patent: March 28, 2023
    Assignees: BAIDU USA LLC, BAIDU.COM TIMES TECHNOLOGY (BEIJING) CO., LTD., KUNLUNXIN TECHNOLOGY (BEIJING) COMPANY LIMITED
    Inventors: Hefei Zhu, Jian Ouyang, Zhibiao Zhao, Xiaozhang Gong, Qingshu Chen
  • Patent number: 11520563
    Abstract: Disclosed are an apparatus and method for transforming a matrix, and a data processing system. The apparatus may include: a first shift unit, configured to receive matrix data and perform first cyclic shift on the matrix data to generate first data; a cache unit, configured to write each row of data into the cache unit in the first data thereto in an order different from the order of respective data in the row of data to store the first data as second data; and a second shift unit, configured to read the second data from the cache unit and perform second cyclic shift on the second data to generate transformed matrix data.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: December 6, 2022
    Inventor: Xiaozhang Gong
  • Publication number: 20220350774
    Abstract: A method for data processing, a processor chip. The method includes: acquiring a first relationship instruction; executing at least one first computing instruction acquired before the first relationship instruction based on the first relationship instruction; and sending acknowledgment information based on the first relationship instruction in response to completing executing the at least one first computing instruction, to cause a second coprocessor receiving the acknowledgment information to revert to a state of acquiring a second computing instruction after the second relationship instruction acquired by a second coprocessor based on the acknowledgment information.
    Type: Application
    Filed: July 19, 2022
    Publication date: November 3, 2022
    Inventors: Jing WANG, Jiaxin SHI, Hanlin XIE, Xiaozhang GONG
  • Patent number: 11422817
    Abstract: A method and apparatus for executing an instruction are provided. In the method, an instruction queue is first generated, and an instruction from the instruction queue in preset order is acquired. Then, a sending step including: determining a type of the acquired instruction; determining, in response to determining that the acquired instruction is an arithmetic instruction, an executing component for executing the arithmetic instruction from an executing component set; and sending the arithmetic instruction to the determined executing component is executed. Last, in response to determining that the acquired instruction is a blocking instruction, a next instruction is acquired after receiving a signal for instructing an instruction associated with the blocking instruction being completely executed.
    Type: Grant
    Filed: July 1, 2019
    Date of Patent: August 23, 2022
    Assignee: Kunlunxin Technology (Beijing) Company Limited
    Inventors: Jing Wang, Wei Qi, Yupeng Li, Xiaozhang Gong
  • Patent number: 11301255
    Abstract: Methods, apparatuses, devices, and storage media for performing a processing task are provided. A portion of portions of the processing task can include a group of operations that are to be performed at a processing unit of processing units. The group of operations can include operations of a first type and operations of a second type. In the method, a first queue for performing the operations of the first type and a second queue for performing the operations of the second type can be built, respectively. Based on a definition of the processing task, a dependency relationship between a group of operations to be performed at the processing unit and a group of operations to be performed at other processing units in the plurality of processing units can be obtained. Operations in the first queue and operations in the second queue can be performed respectively based on the dependency relationship.
    Type: Grant
    Filed: December 30, 2019
    Date of Patent: April 12, 2022
    Assignee: Kunlunxin Technology (Beijing) Company Limited
    Inventors: Qingshu Chen, Zhibiao Zhao, Hefei Zhu, Xiaozhang Gong, Yong Wang, Jian Ouyang
  • Patent number: 11163714
    Abstract: Embodiments of the present disclosure relate to a method, an apparatus, an electronic device and a computer readable storage medium for determining connection relationships among a plurality of chips. The method includes determining identity information of a plurality of chips managed by a host, the plurality of chips being connected by respective inter-chip communication interfaces for inter-chip communication. The method further includes allowing one or more of the plurality of chips to acquire identity information of other chips connected to the inter-chip communication interface of the one or more chips. The method further includes reading identity information of the other chips by means of a management interface of the one or more chips with regard to communicating with the host, so as to determine connection relationships among the plurality of chips.
    Type: Grant
    Filed: December 11, 2019
    Date of Patent: November 2, 2021
    Inventors: Xianglun Leng, Hefei Zhu, Qingshu Chen, Zhibiao Zhao, Xiaozhang Gong
  • Patent number: 11093388
    Abstract: The present disclosure relates to a method, an apparatus, an electronic device and a computer readable storage medium for accessing static random access memories. The method includes: receiving an access request for data associated with the static random access memories; writing a plurality of sections of the data into a plurality of different static random access memories in an interleaved manner in response to the access request being a write request for the data, each of the plurality of sections having its respective predetermined size; and reading the plurality of sections of the data from the plurality of static random access memories in an interleaved manner in response to the access request being a read request for the data, each of the plurality of sections having its respective predetermined size.
    Type: Grant
    Filed: November 13, 2019
    Date of Patent: August 17, 2021
    Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.
    Inventors: Xiaozhang Gong, Jing Wang
  • Publication number: 20210241095
    Abstract: Embodiments of the present disclosure propose a deep learning processing apparatus and method, device and storage medium, relating to the field of artificial intelligence.
    Type: Application
    Filed: September 10, 2020
    Publication date: August 5, 2021
    Inventors: Xiaozhang Gong, Jian Ouyang, Jing Wang, Wei Qi
  • Publication number: 20210174174
    Abstract: A data processing system includes a central processing unit (CPU) and accelerator cards coupled to the CPU over a bus, each of the accelerator cards having a plurality of data processing (DP) accelerators to receive DP tasks from the CPU and to perform the received DP tasks. At least two of the accelerator cards are coupled to each other via an inter-card connection, and at least two of the DP accelerators are coupled to each other via an inter-chip connection. Each of the inter-card connection and the inter-chip connection is capable of being dynamically activated or deactivated, such that in response to a request received from the CPU, any one of the accelerator cards or any one of the DP accelerators within any one of the accelerator cards can be enabled or disabled to process any one of the DP tasks received from the CPU.
    Type: Application
    Filed: November 15, 2019
    Publication date: June 10, 2021
    Inventors: Hefei ZHU, Jian OUYANG, Zhibiao ZHAO, Xiaozhang GONG, Qingshu CHEN
  • Publication number: 20210072996
    Abstract: Methods, apparatuses, devices, and storage media for performing a processing task are provided. A portion of portions of the processing task can include a group of operations that are to be performed at a processing unit of processing units. The group of operations can include operations of a first type and operations of a second type. In the method, a first queue for performing the operations of the first type and a second queue for performing the operations of the second type can be built, respectively. Based on a definition of the processing task, a dependency relationship between a group of operations to be performed at the processing unit and a group of operations to be performed at other processing units in the plurality of processing units can be obtained. Operations in the first queue and operations in the second queue can be performed respectively based on the dependency relationship.
    Type: Application
    Filed: December 30, 2019
    Publication date: March 11, 2021
    Inventors: Qingshu CHEN, Zhibiao ZHAO, Hefei ZHU, Xiaozhang GONG, Yong WANG, Jian OUYANG
  • Publication number: 20200409895
    Abstract: Embodiments of the present disclosure relate to a method, an apparatus, an electronic device and a computer readable storage medium for determining connection relationships among a plurality of chips. The method includes determining identity information of a plurality of chips managed by a host, the plurality of chips being connected by respective inter-chip communication interfaces for inter-chip communication. The method further includes allowing one or more of the plurality of chips to acquire identity information of other chips connected to the inter-chip communication interface of the one or more chips. The method further includes reading identity information of the other chips by means of a management interface of the one or more chips with regard to communicating with the host, so as to determine connection relationships among the plurality of chips.
    Type: Application
    Filed: December 11, 2019
    Publication date: December 31, 2020
    Inventors: Xianglun Leng, Hefei Zhu, Qingshu Chen, Zhibiao Zhao, Xiaozhang Gong
  • Publication number: 20200412382
    Abstract: Disclosed are an apparatus and method for transforming a matrix, and a data processing system. The apparatus may include: a first shift unit, configured to receive matrix data and perform first cyclic shift on the matrix data to generate first data; a cache unit, configured to write each row of data into the cache unit in the first data thereto in an order different from the order of respective data in the row of data to store the first data as second data; and a second shift unit, configured to read the second data from the cache unit and perform second cyclic shift on the second data to generate transformed matrix data.
    Type: Application
    Filed: December 11, 2019
    Publication date: December 31, 2020
    Inventor: Xiaozhang Gong
  • Publication number: 20200159658
    Abstract: The present disclosure relates to a method, an apparatus, an electronic device and a computer readable storage medium for accessing static random access memories. The method includes: receiving an access request for data associated with the static random access memories; writing a plurality of sections of the data into a plurality of different static random access memories in an interleaved manner in response to the access request being a write request for the data, each of the plurality of sections having its respective predetermined size; and reading the plurality of sections of the data from the plurality of static random access memories in an interleaved manner in response to the access request being a read request for the data, each of the plurality of sections having its respective predetermined size.
    Type: Application
    Filed: November 13, 2019
    Publication date: May 21, 2020
    Inventors: Xiaozhang GONG, Jing WANG
  • Publication number: 20200050450
    Abstract: Embodiments of the present disclosure relate to a method and apparatus for executing an instruction. A method may include: acquiring an instruction queue; acquiring a to-be-sent instruction from the instruction queue in preset order, and executing following sending: determining a type of the to-be-sent instruction; determining, in response to determining that the to-be-sent instruction is an arithmetic instruction, an executing component executing the to-be-sent instruction from an executing component set, and sending the to-be-sent instruction to the determined executing component; and acquiring, in response to determining that the to-be-sent instruction is a blocking instruction, a next to-be-sent instruction after receiving a signal for instructing an instruction associated with the to-be-sent instruction being completely executed.
    Type: Application
    Filed: July 1, 2019
    Publication date: February 13, 2020
    Inventors: Jing Wang, Wei Qi, Yupeng Li, Xiaozhang Gong