Patents by Inventor Jiaxin Shi

Jiaxin Shi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Asymmetric quantization for compression and for acceleration of inference for neural networks

Patent number: 12008467

Abstract: Presented herein are embodiments of an improved asymmetric quantization, which may generally be referred to as improved asymmetric quantization (IAQ) embodiments. IAQ embodiments combine the benefits of conventional asymmetric quantization and symmetric quantization but also provide additional computation efficiencies. Embodiments of IAQ adopt an asymmetric range of the weights of a neural network layer, so they circumvent the limitation of symmetric range of symmetric quantization. Moreover, the inference process of a neural network quantized by an IAQ embodiment is much faster than that of the neural network quantized by conventional asymmetric quantization by quantizing an offset value of each layer.

Type: Grant

Filed: May 19, 2020

Date of Patent: June 11, 2024

Assignee: Baidu USA LLC

Inventors: Yingzhen Yang, Zhibiao Zhao, Baoxin Zhao, Jun Huan, Jian Ouyang, Yong Wang, Jiaxin Shi
Synchronizing coprocessors using synchronization instructions to force a second coprocessor to wait until receiving an acknowledgement signal from a first coprocessor

Patent number: 11892970

Abstract: A method for data processing, a processor chip. The method includes: acquiring a first relationship instruction; executing at least one first computing instruction acquired before the first relationship instruction based on the first relationship instruction; and sending acknowledgment information based on the first relationship instruction in response to completing executing the at least one first computing instruction, to cause a second coprocessor receiving the acknowledgment information to revert to a state of acquiring a second computing instruction after the second relationship instruction acquired by a second coprocessor based on the acknowledgment information.

Type: Grant

Filed: July 19, 2022

Date of Patent: February 6, 2024

Assignee: KUNLUNXIN TECHNOLOGY (BEIJING) COMPANY

Inventors: Jing Wang, Jiaxin Shi, Hanlin Xie, Xiaozhang Gong
Data processing method and apparatus for neural network

Patent number: 11651198

Abstract: Embodiments of the present disclosure relate to a data processing method and apparatus for a neural network. The neural network is provided with at least one activation function. A method may include: converting, in response to that an activation function acquiring current data is a target function, based on a conversion relationship between the target function and a preset function, the current data into input data of the preset function; finding out first output data of the preset function with the input data as an input in a lookup table corresponding to the preset function; obtaining second output data of the target function with the current data as an input by conversion based on the conversion relationship and the first output data; and outputting the second output data.

Type: Grant

Filed: July 3, 2019

Date of Patent: May 16, 2023

Assignee: Beijing Baidu Netcom Science and Technology Co., Ltd.

Inventors: Jiaxin Shi, Huimin Li, Yong Wang
Method and apparatus for generating chip-based computing function, device, and storage medium

Patent number: 11507348

Abstract: Embodiments of the present disclosure provide a method and apparatus for generating a chip-based computing function, a device, and a storage medium. The method includes: acquiring an input parameter value associated with a computing function supported by a chip; determining, based on the input parameter value, at least one candidate computing function template corresponding to the computing function, the candidate computing function template having a configurable parameter associated with performance of the candidate computing function template, and the configurable parameter having at least one candidate value; and determining, according to the input parameter value and candidate values of the configurable parameter of the candidate computing function template, a target computing function template and a target value of a configurable parameter of the target computing function template.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 22, 2022

Inventors: Yong Wang, Jiaxin Shi
METHOD FOR DATA PROCESSING, PROCESSOR CHIP

Publication number: 20220350774

Abstract: A method for data processing, a processor chip. The method includes: acquiring a first relationship instruction; executing at least one first computing instruction acquired before the first relationship instruction based on the first relationship instruction; and sending acknowledgment information based on the first relationship instruction in response to completing executing the at least one first computing instruction, to cause a second coprocessor receiving the acknowledgment information to revert to a state of acquiring a second computing instruction after the second relationship instruction acquired by a second coprocessor based on the acknowledgment information.

Type: Application

Filed: July 19, 2022

Publication date: November 3, 2022

Inventors: Jing WANG, Jiaxin SHI, Hanlin XIE, Xiaozhang GONG
Artificial intelligence chip and instruction execution method for artificial intelligence chip

Patent number: 11372673

Abstract: Embodiments of the present disclosure disclose an artificial intelligence chip and an instruction execution method for an artificial intelligence chip. A specific embodiment of the artificial intelligence chip includes: an instruction memory, a data memory, at least one general execution unit, and at least one dedicated execution unit. The instruction memory is configured to: receive a kernel code including at least one code block. The general execution unit is configured to: receive the code block, lock the dedicated execution unit associated with the received code block, and send an instruction in the received code block to the locked dedicated execution unit. The dedicated execution unit is configured to: execute the received instruction, and store an execution result in the data memory. The data memory is configured to: store the execution result sent by the dedicated execution unit.

Type: Grant

Filed: July 8, 2019

Date of Patent: June 28, 2022

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Ningyi Xu, Yan Huang, Jinchen Han, Peng Wu, Jiaxin Shi
Method and apparatus for executing instruction for artificial intelligence chip

Patent number: 10891134

Abstract: Embodiments of the present disclosure disclose a method and apparatus for executing an instruction for an artificial intelligence chip. A specific embodiment of the method comprises: receiving descriptive information for describing a neural network model sent by a central processing unit, the descriptive information including at least one operation instruction; analyzing the descriptive information to acquire the at least one operation instruction; determining, for an operation instruction of the at least one operation instruction, a special-purpose execution component executing the operation instruction, and locking the determined special-purpose execution component; sending the operation instruction to the determined special-purpose execution component; and unlocking the determined special-purpose execution component in response to receiving a notification for instructing the operation instruction being completely executed.

Type: Grant

Filed: July 9, 2019

Date of Patent: January 12, 2021

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Yong Wang, Jiaxin Shi, Rong Chen, Jinchen Han
ASYMMETRIC QUANTIZATION FOR COMPRESSION AND FOR ACCELERATION OF INFERENCE FOR NEURAL NETWORKS

Publication number: 20210004679

Abstract: Presented herein are embodiments of an improved asymmetric quantization, which may generally be referred to as improved asymmetric quantization (IAQ) embodiments. IAQ embodiments combine the benefits of conventional asymmetric quantization and symmetric quantization but also provide additional computation efficiencies. Embodiments of IAQ adopt an asymmetric range of the weights of a neural network layer, so they circumvent the limitation of symmetric range of symmetric quantization. Moreover, the inference process of a neural network quantized by an IAQ embodiment is much faster than that of the neural network quantized by conventional asymmetric quantization by quantizing an offset value of each layer.

Type: Application

Filed: May 19, 2020

Publication date: January 7, 2021

Applicant: Baidu USA LLC

Inventors: Yingzhen YANG, Zhibiao ZHAO, Baoxin ZHAO, Jun HUAN, Jian OUYANG, Yong WANG, Jiaxin SHI
METHOD AND APPARATUS FOR GENERATING CHIP-BASED COMPUTING FUNCTION, DEVICE, AND STORAGE MEDIUM

Publication number: 20200409662

Abstract: Embodiments of the present disclosure provide a method and apparatus for generating a chip-based computing function, a device, and a storage medium. The method includes: acquiring an input parameter value associated with a computing function supported by a chip; determining, based on the input parameter value, at least one candidate computing function template corresponding to the computing function, the candidate computing function template having a configurable parameter associated with performance of the candidate computing function template, and the configurable parameter having at least one candidate value; and determining, according to the input parameter value and candidate values of the configurable parameter of the candidate computing function template, a target computing function template and a target value of a configurable parameter of the target computing function template to implement the chip-based computing function.

Type: Application

Filed: December 11, 2019

Publication date: December 31, 2020

Inventors: Yong Wang, Jiaxin Shi
Data Processing Method and Apparatus for Neural Network

Publication number: 20200050924

Abstract: Embodiments of the present disclosure relate to a data processing method and apparatus for a neural network. The neural network is provided with at least one activation function. A method may include: converting, in response to that an activation function acquiring current data is a target function, based on a conversion relationship between the target function and a preset function, the current data into input data of the preset function; finding out first output data of the preset function with the input data as an input in a lookup table corresponding to the preset function; obtaining second output data of the target function with the current data as an input by conversion based on the conversion relationship and the first output data; and outputting the second output data.

Type: Application

Filed: July 3, 2019

Publication date: February 13, 2020

Inventors: Jiaxin Shi, Huimin Li, Yong Wang
Artificial Intelligence Chip And Instruction Execution Method For Artificial Intelligence Chip

Publication number: 20200050476

Abstract: Embodiments of the present disclosure disclose an artificial intelligence chip and an instruction execution method for an artificial intelligence chip. A specific embodiment of the artificial intelligence chip includes: an instruction memory, a data memory, at least one general execution unit, and at least one dedicated execution unit. The instruction memory is configured to: receive a kernel code including at least one code block. The general execution unit is configured to: receive the code block, lock the dedicated execution unit associated with the received code block, and send an instruction in the received code block to the locked dedicated execution unit. The dedicated execution unit is configured to: execute the received instruction, and store an execution result in the data memory. The data memory is configured to: store the execution result sent by the dedicated execution unit.

Type: Application

Filed: July 8, 2019

Publication date: February 13, 2020

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Ningyi Xu, Yan Huang, Jinchen Han, Peng Wu, Jiaxin Shi
METHOD AND APPARATUS FOR EXECUTING INSTRUCTION FOR ARTIFICIAL INTELLIGENCE CHIP

Publication number: 20200050457

Abstract: Embodiments of the present disclosure disclose a method and apparatus for executing an instruction for an artificial intelligence chip. A specific embodiment of the method comprises: receiving descriptive information for describing a neural network model sent by a central processing unit, the descriptive information including at least one operation instruction; analyzing the descriptive information to acquire the at least one operation instruction; determining, for an operation instruction of the at least one operation instruction, a special-purpose execution component executing the operation instruction, and locking the determined special-purpose execution component; sending the operation instruction to the determined special-purpose execution component; and unlocking the determined special-purpose execution component in response to receiving a notification for instructing the operation instruction being completely executed.

Type: Application

Filed: July 9, 2019

Publication date: February 13, 2020

Applicant: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Yong Wang, Jiaxin Shi, Rong Chen, Jinchen Han