Patents by Inventor Yequn Zhang

Yequn Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Greedy approach for obtaining an artificial intelligence model in a parallel configuration

Patent number: 11507829

Abstract: A system may include multiple client devices and a processing device communicatively coupled to the client devices. One or more client devices may implement a greedy approach in searching for an optimal artificial intelligence (AI) model. For example, a client device may use a training dataset to perform an AI task, and update its AI model. The client device may verify the performance of the AI task and determine whether to accept or reject its updated AI model. Upon rejection, the client device may repeat updating its AI model until the updated AI model is accepted, or until a stopping criteria is met. The processing device may be configured to update the initial AI models based on the accepted updated AI models obtained in the multiple client device. Training data for each of the client devices may contain a subset shuffled from a larger training dataset.

Type: Grant

Filed: December 3, 2019

Date of Patent: November 22, 2022

Assignee: Gyrfalcon Technology Inc.

Inventors: Yinbo Shi, Yequn Zhang, Xiaochun Li, Bowei Liu
Using quantization in training an artificial intelligence model in a semiconductor solution

Patent number: 11475298

Abstract: A system for training an artificial intelligence (AI) model for an AI chip to implement an AI task may include an AI training unit to train weights of an AI model in floating point, a convolution quantization unit for quantizing the trained weights to a number of quantization levels, and an activation quantization unit for updating the weights of the AI model so that output of the AI model based at least on the updated weights are within a range of activation layers of the AI chip. The updated weights may be stored in fixed point and uploadable to the AI chip. The various units may be configured to account for the hardware constraints in the AI chip to minimize performance degradation when the trained weights are uploaded to the AI chip and expedite training convergence. Forward propagation and backward propagation may be combined in training the AI model.

Type: Grant

Filed: September 27, 2019

Date of Patent: October 18, 2022

Assignee: Gyrfalcon Technology Inc.

Inventors: Yongxiong Ren, Yi Fan, Yequn Zhang, Baohua Sun, Bin Yang, Xiaochun Li, Lin Yang
Systems and methods for determining an artificial intelligence model in a communication system

Patent number: 11429853

Abstract: A system may include multiple client devices and a processing device communicatively coupled to the client devices. Each client device includes an artificial intelligence (AI) chip and is configured to generate an AI model. The processing device may be configured to (i) receive a respective AI model and an associated performance value of the respective AI model from each of the plurality of client devices; (ii) determine an optimal AI model based on the performance values associated with the respective AI models from the plurality of client devices; and (iii) determine a global AI model based on the optimal AI model. The system may load the global AI model into an AI chip of a client device to cause the client device to perform an AI task based on the global AI model in the AI chip. The AI model may include a convolutional neural network.

Type: Grant

Filed: November 13, 2018

Date of Patent: August 30, 2022

Assignee: Gyrfalcon Technology Inc.

Inventors: Yequn Zhang, Yongxiong Ren, Baohua Sun, Lin Yang, Qi Dong
Systems and methods for determining an artificial intelligence model in a communication system

Patent number: 11334801

Abstract: A device for obtaining a local optimal AI model may include an artificial intelligence (AI) chip and a processing device configured to receive a first initial AI model from the host device. The device may load the initial AI model into the AI chip to determine a performance value of the AI model based on a dataset, and determine a probability that a current AI model should be replaced by the initial AI model. The device may determine, based on the probability, whether to replace the current AI model with the initial AI model. If it is determined that the current AI model be replaced, the device may replace the current AI model with the initial AI model. The device may repeat the above processes and obtain a final current AI model. The device may transmit the final current AI model to the host device.

Type: Grant

Filed: November 13, 2018

Date of Patent: May 17, 2022

Assignee: Gyrfalcon Technology Inc.

Inventors: Yequn Zhang, Yongxiong Ren, Baohua Sun, Lin Yang, Qi Dong
Combining feature maps in an artificial intelligence semiconductor solution

Patent number: 11335045

Abstract: In some embodiments, a system includes an artificial intelligence (AI) chip and a processor coupled to the AI chip and configured to receive an input image, crop the input image into a plurality of cropped images, and execute the AI chip to produce a plurality of feature maps based on at least a subset of the plurality of cropped images. The system may further merge at least a subset of the plurality of feature maps to form a merged feature map, and produce an output image based on the merged feature map. The cropping and merging operations may be performed according to a same pattern. The system may also include a training network configured to train weights of the CNN model in the AI chip in a gradient descent network. Cropping and merging may be performed over the training sample images in the training work in a similar manner.

Type: Grant

Filed: January 3, 2020

Date of Patent: May 17, 2022

Assignee: Gyrfalcon Technology Inc.

Inventors: Bin Yang, Lin Yang, Xiaochun Li, Yequn Zhang, Yongxiong Ren, Yinbo Shi, Patrick Dong
COMBINING FEATURE MAPS IN AN ARTIFICIAL INTELLIGENCE SEMICONDUCTOR SOLUTION

Publication number: 20210209822

Abstract: In some embodiments, a system includes an artificial intelligence (AI) chip and a processor coupled to the AI chip and configured to receive an input image, crop the input image into a plurality of cropped images, and execute the AI chip to produce a plurality of feature maps based on at least a subset of the plurality of cropped images. The system may further merge at least a subset of the plurality of feature maps to form a merged feature map, and produce an output image based on the merged feature map. The cropping and merging operations may be performed according to a same pattern. The system may also include a training network configured to train weights of the CNN model in the AI chip in a gradient descent network. Cropping and merging may be performed over the training sample images in the training work in a similar manner.

Type: Application

Filed: January 3, 2020

Publication date: July 8, 2021

Applicant: Gyrfalcon Technology Inc.

Inventors: Bin Yang, Lin Yang, Xiaochun Li, Yequn Zhang, Yongxiong Ren, Yinbo Shi, Patrick Dong
VIDEO RETRIEVAL IN FEATURE DESCRIPTOR DOMAIN IN AN ARTIFICIAL INTELLIGENCE SEMICONDUCTOR SOLUTION

Publication number: 20210097290

Abstract: A video retrieval system may include a feature extractor configured to extract first feature descriptors for multiple image frames in the query video. The system may also include a feature extractor to extract second feature descriptors for multiple image frames in a candidate video in a video database. The system may include a comparator to compare the first and second feature descriptors to determine a subset of image frames in the candidate video that are similar to the first video. The system may output die query output by displaying the subset of image frames in a slide show. The system may also output the query by displaying a video formed by at least the subset of image frames. The feature extractor may be implemented in a convolution neural network (CNN) in an artificial intelligence (AI) chip. The system may include key frame extractor to detect key frames in the video.

Type: Application

Filed: September 27, 2019

Publication date: April 1, 2021

Applicant: Gyrfalcon Technology Inc.

Inventors: Lin Yang, Bin Yang, Qi Dong, Xiaochun Li, Wenhan Zhang, Yequn Zhang, Hua Zhou, Patrick Dong
System and method for determining an artificial intelligence model in a decentralized network

Patent number: 10943168

Abstract: A system may include a decentralized communication network and multiple processing devices on the network. Each processing device may have an artificial intelligence (AI) chip, the device may be configured to generate an AI model, determine the performance value of the AI model on the AI chip, receive a chain from the network where the chain contains a performance measure. If the performance value of the AI model is better than the performance measure, then the processing device may broadcast the AI model to the network for verification. If the AI model is verified by the network, the device may update the chain with the performance value so that the chain can be shared by the multiple processing devices on the network. Any processing device on the network may also verify an AI model broadcasted by any other device. Methods for generating the AI model are also provided.

Type: Grant

Filed: April 10, 2018

Date of Patent: March 9, 2021

Assignee: Gyrfalcon Technology Inc.

Inventors: Lin Yang, Charles Jin Young, Jason Zeng Dong, Patrick Zeng Dong, Baohua Sun, Yequn Zhang
System and method for determining an artificial intelligence model in a decentralized network

Patent number: 10902313

Abstract: A system may include a decentralized communication network and multiple processing devices on the network. Each processing device may have an artificial intelligence (AI) chip, the device may be configured to generate an AI model, determine the performance value of the AI model on the AI chip, receive a chain from the network where the chain contains a performance measure. If the performance value of the AI model is better than the performance measure, then the processing device may broadcast the AI model to the network for verification. If the AI model is verified by the network, the device may update the chain with the performance value so that the chain can be shared by the multiple processing devices on the network. Any processing device on the network may also verify an AI model broadcasted by any other device. Methods for generating the AI model are also provided.

Type: Grant

Filed: April 10, 2018

Date of Patent: January 26, 2021

Assignee: Gyrfalcon Technology Inc.

Inventors: Lin Yang, Charles Jin Young, Jason Zeng Dong, Patrick Zeng Dong, Baohua Sun, Yequn Zhang
DETECTING KEY FRAMES IN VIDEO COMPRESSION IN AN ARTIFICIAL INTELLIGENCE SEMICONDUCTOR SOLUTION

Publication number: 20200380263

Abstract: A system for detecting key frames in a video may include a feature extractor configured to extract feature descriptors for each of the multiple image frames in the video. The feature extractor may be an embedded cellular neural network of an artificial intelligence (AI) chip. The system may also include a key frame extractor configured to determine one or more key frames in the multiple image frames based on the corresponding feature descriptors of the image frames. The key frame extractor may determine the key frames based on distance values between a first set of feature descriptors corresponding to a first subset of image frames and a second set of feature descriptors corresponding to a second subset of image frames. The system may output an alert based on determining the key frames and/or display the key frames. The system may also compress the video by removing the non-key frames.

Type: Application

Filed: May 29, 2019

Publication date: December 3, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Lin Yang, Bin Yang, Qi Dong, Xiaochun Li, Wenhan Zhang, Yinbo Shi, Yequn Zhang
ARTIFICIAL INTELLIGENCE SEMICONDUCTOR CHIP HAVING WEIGHTS OF VARIABLE COMPRESSION RATIO

Publication number: 20200302276

Abstract: An artificial intelligence (AI) semiconductor having an embedded convolution neural network (CNN) may include a first convolution layer and a second convolution layer, in which the weights of the first layer and the weights of the second layer are quantized in different bit-widths, thus at different compression ratios. In a VGG neural network, the weights of a first group of convolution layers may have a different compression ratio than the weights of a second group of convolution layers. The weights of the CNN may be obtained in a training system including convolution quantization and/or activation quantization. Depending on the compression ratio, the weights of a convolution layer may be trained with or without re-training. An AI task, such as image retrieval, may be implemented in the AI semiconductor having the CNN described above.

Type: Application

Filed: September 27, 2019

Publication date: September 24, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Lin Yang, Bin Yang, Hua Zhou, Xiaochun Li, Wenhan Zhang, Qi Dong, Yequn Zhang, Yongxiong Ren, Patrick Dong
USING OUTPUT EQUALIZATION IN TRAINING AN ARTIFICIAL INTELLIGENCE MODEL IN A SEMICONDUCTOR SOLUTION

Publication number: 20200302288

Abstract: A system for training an artificial intelligence (AI) model for an AI chip may include an AI training unit to train weights of an AI model in floating point, and one or more quantization units for updating the weights of the AI model while accounting for the hardware constraints in the AI chip. The system may also include customization unit for performing one or more linear transformations on the updated weights. The system may also perform output equalization for one or more convolution layers of the AI model to equalize the inputs and/or outputs of each layer of the AI model to within the range allowed in the physical AI chip. The system may further update the weights by performing shift-based quantization that mimics the characteristics of a hardware chip. The updated weights may be stored in fixed point and uploadable to an AI chip implementing an AI task.

Type: Application

Filed: September 27, 2019

Publication date: September 24, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yongxiong Ren, Yi Fan, Yequn Zhang, Tianran Chen, Yinbo Shi, Xiaochun Li, Lin Yang
USING QUANTIZATION IN TRAINING AN ARTIFICIAL INTELLIGENCE MODEL IN A SEMICONDUCTOR SOLUTION

Publication number: 20200302289

Abstract: A system for training an artificial intelligence (AI) model for an AI chip to implement an AI task may include an AI training unit to train weights of an AI model in floating point, a convolution quantization unit for quantizing the trained weights to a number of quantization levels, and an activation quantization unit for updating the weights of the AI model so that output of the AI model based at least on the updated weights are within a range of activation layers of the AI chip. The updated weights may be stored in fixed point and uploadable to the AI chip. The various units may be configured to account for the hardware constraints in the AI chip to minimize performance degradation when the trained weights are uploaded to the AI chip and expedite training convergence. Forward propagation and backward propagation may be combined in training the AI model.

Type: Application

Filed: September 27, 2019

Publication date: September 24, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yongxiong Ren, Yi Fan, Yequn Zhang, Baohua Sun, Bin Yang, Xiaochun Li, Lin Yang
USING IDENTITY LAYER IN A CELLULAR NEURAL NETWORK ARCHITECTURE

Publication number: 20200293865

Abstract: A cellular neural network architecture may include a processor and embedded cellular, neural network (CeNN) executable in an artificial intelligence (AI) integrated circuit and configured to perform certain AI functions. The CeNN may include multiple convolution layers, each having multiple binary weights. In some examples, a method may configure a given layer of the CeNN and one or more additional layers of the CeNN to retrieve the output of the given layer for debugging or training the CeNN. In configuring the one or more additional layers, the method may use an identity layer.

Type: Application

Filed: March 14, 2019

Publication date: September 17, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Bowei Liu, Yinbo Shi, Yequn Zhang, Xiaochun Li
IMPLEMENTING RESIDUAL CONNECTION IN A CELLULAR NEURAL NETWORK ARCHITECTURE

Publication number: 20200293856

Abstract: A cellular neural network architecture may include a processor and an embedded cellular neural network (CeNN) executable in an artificial intelligence (AI) integrated circuit and configured to perform certain AI functions. The CeNN may include multiple convolution layers, such as first, second, and third layers, each layer having multiple binary weights. In some examples, a method may configure the multiple layers in the CeNN to produce a residual connection. In configuring the second and third layers, the method may use an identity matrix.

Type: Application

Filed: March 14, 2019

Publication date: September 17, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Bowei Liu, Yinbo Shi, Yequn Zhang, Xiaochun Li
SYSTEMS AND METHODS FOR OPTIMIZING AN ARTIFICIAL INTELLIGENCE MODEL IN A SEMICONDUCTOR SOLUTION

Publication number: 20200250523

Abstract: In some examples, given an AI model in floating point, a system may use one or more artificial intelligence (AI) chips to train a global gain vector for use to convert the AI model in floating point to an AI model in fixed point for uploading to a physical AI chip. The system may determine initial gain vectors, and in each of multiple iterations, obtain the performance values of the AI chips based on the gain vectors and update the gam vectors for the next iteration. The gain vectors are updated based on a velocity of gain. The performance value may be based on feature maps of an AI model before and after the converting. The performance value may also be based on interference over a test dataset. Upon completion of the iterations, the system determines the global gain vector that resulted in the best performance value during the iterations.

Type: Application

Filed: February 5, 2019

Publication date: August 6, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yongxiong Ren, Yequn Zhang, Baohua Sun, Xiaochun Li, Qi Dong, Lin Yang
GREEDY APPROACH FOR OBTAINING AN ARTIFICIAL INTELLIGENCE MODEL IN A PARALLEL CONFIGURATION

Publication number: 20200234118

Abstract: A system may include multiple client devices and a processing device communicatively coupled to the client devices. One or more client devices may implement a greedy approach in searching for an optimal artificial intelligence (AI) model. For example, a client device may use a training dataset to perform an AI task, and update its AI model. The client device may verify the performance of the AI task and determine whether to accept or reject its updated AI model. Upon rejection, the client device may repeat updating its AI model until the updated AI model is accepted, or until a stopping criteria is met. The processing device may be configured to update the initial AI models based on the accepted updated AI models obtained in the multiple client device. Training data for each of the client devices may contain a subset shuffled from a larger training dataset.

Type: Application

Filed: December 3, 2019

Publication date: July 23, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yinbo Shi, Yequn Zhang, Xiaochun Li, Bowei Liu
SYSTEMS AND METHODS FOR OBTAINING AN ARTIFICIAL INTELLIGENCE MODEL IN A PARALLEL CONFIGURATION

Publication number: 20200234119

Abstract: A system may include multiple client devices and a processing device communicatively coupled to the client devices. A client device may receive an initial artificial intelligence (AI) model, use a training dataset to perform an AI task, and update its AI model. The client device may verify the performance of the AI task to determine whether to accept or reject its updated AI model. Upon rejection, the client device may repeat updating its AI model until the updated AI model is accepted, or until a stopping criteria is met. The processing device may be configured to update the initial AI models based on the accepted updated AI models obtained in the multiple client devices, and repeat the process for each client device using the updated initial AI models. Training data for each of the client devices may contain a subset shuffled from a larger training dataset.

Type: Application

Filed: December 3, 2019

Publication date: July 23, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yinbo Shi, Yequn Zhang, Xiaochun Li, Bowei Liu
SYSTEMS AND METHODS FOR DETERMINING AN ARTIFICIAL INTELLIGENCE MODEL IN A COMMUNICATION SYSTEM

Publication number: 20200151551

Abstract: A system may include multiple client devices and a processing device communicatively coupled to the client devices. Each client device includes an artificial intelligence (AI) chip and is configured to generate an AI model. The processing device may be configured to (i) receive a respective AI model and an associated performance value of the respective AI model from each of the plurality of client devices; (ii) determine an optimal AI model based on the performance values associated with the respective AI models from the plurality of client devices; and (iii) determine a global AI model based on the optimal AI model. The system may load the global AI model into an AI chip of a client device to cause the client device to perform an AI task based on the global AI model in the AI chip. The AI model may include a convolutional neural network.

Type: Application

Filed: November 13, 2018

Publication date: May 14, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yequn Zhang, Yongxiong Ren, Baohua Sun, Lin Yang, Qi Dong
SYSTEMS AND METHODS FOR UPDATING AN ARTIFICIAL INTELLIGENCE MODEL BY A SUBSET OF PARAMETERS IN A COMMUNICATION SYSTEM

Publication number: 20200151558

Abstract: A system may be configured to obtain a global artificial intelligence (AI) model for uploading into an AI chip to perform AI tasks. The system may implement a training process including receiving updated AI models from one or more client devices, determining a global AI model based on the received AI models from the client devices, and updating initial AI models for the client devices. Each client device may receive an initial AI model and train an updated AI model by training the entire parameters of the AI model together, by training a subset of the parameters of the AI model in a layer by layer fashion, or by training a subset of the parameters by parameter types. Each client device may include one or more AI chips configured to run an AI task to measure performance of an AI model. The AI model may include a convolutional neural network.

Type: Application

Filed: February 11, 2019

Publication date: May 14, 2020

Applicant: Gyrfalcon Technology Inc.

Inventors: Yongxiong Ren, Yequn Zhang, Baohua Sun, Xiaochun Li, Qi Dong, Lin Yang

1 2 next