Patents by Inventor Baohua Sun

Baohua Sun has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210019602
    Abstract: An integrated circuit may include multiple cellular neural networks (CNN) processing engines coupled in a loop circuit and configured to perform an AI task. Each CNN processing engine includes multiple convolution layers, a first memory buffer to store imagery data and a second memory buffer to store filter coefficients. The CNN processing engines are configured to perform convolution operations over an input image simultaneously in one or more iterations. In each iteration, various sub-images of the input image are loaded to the first memory buffer circularly. A portion of the filter coefficients corresponding to the sub-image are loaded to the second memory buffer in a cyclic order. Data may be arranged in the second memory buffer to facilitate loading of duplicate filter coefficients among at least two convolution layers without requiring duplicate memory space. Methods of training a CNN model having duplicate weights are also provided.
    Type: Application
    Filed: July 18, 2019
    Publication date: January 21, 2021
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Lin Yang, Baohua Sun, Yongxiong Ren, Wenhan Zhang
  • Publication number: 20210019606
    Abstract: An integrated circuit may include multiple cellular neural networks (CNN) processing engines coupled to at least one input/output data bus and a clock-skew circuit in a loop circuit. Each CNN processing engine includes multiple convolution layers, a first memory buffer to store imagery data and a second memory buffer to store filter coefficients. Each of the CNN processing engines is configured to perform convolution operations over an input image simultaneously in a first clock cycle to generate output to be fed to an immediate neighbor CNN processing engine for performing convolution operations in a next clock cycle. The second memory buffer may store a first subset of filter coefficients for a first convolution layer of the CNN processing engine and store a reference location to the first subset of filter coefficients for a second convolution layer, where the filter coefficients for the first and second convolution layers are duplicate.
    Type: Application
    Filed: July 18, 2019
    Publication date: January 21, 2021
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Lin Yang, Baohua Sun, Yongxiong Ren, Wenhan Zhang
  • Publication number: 20210012177
    Abstract: A pixel feature vector extraction system for extracting multi-scale features contains a cellular neural networks (CNN) based integrated circuit (IC) for extracting pixel feature vector out of input imagery data by performing convolution operations using pre-trained filter coefficients of ordered convolutional layers in a deep learning model. The ordered convolutional layers are organized in a number of groups with each group followed by a pooling layer. Each group is configured for a different size of feature map. Pixel feature vector contains a combination of feature maps from at least two groups, for example, concatenation of the feature maps. The first group of the at least two groups contains the largest size of the feature maps amongst all of the at least two groups. Feature maps of the remaining of the at least two groups are modified to match the size of the feature map of the first group.
    Type: Application
    Filed: July 9, 2019
    Publication date: January 14, 2021
    Inventors: Lin Yang, Baohua Sun
  • Publication number: 20200342232
    Abstract: P feature encoding values are obtained for each of the Q frames in a video clip by image transformations of each frame along with performing computations of a specific succession of convolution and pooling layers of a CNN based deep learning model followed with operations of a nested invariance pooling layer. Each feature encoding value is then converted from real number to a corresponding integer value within a range designated for color display intensity according to a quantization scheme. A 2-D graphical symbol that contains N×N pixels is formed by placing respective color display intensities into the N×N pixels according to a data arrangement pattern for representing all frames of the video clip in form of P×Q feature encoding values, such that the 2-D graphical symbol possesses a semantic meaning of the video clip that can be recognized via image classification task using another trained CNN based deep learning model.
    Type: Application
    Filed: November 5, 2019
    Publication date: October 29, 2020
    Inventors: Lin Yang, Baohua Sun, Hao Sha
  • Publication number: 20200320385
    Abstract: A system for training an artificial intelligence (AI) model for an AI chip may include a forward network and a backward propagation network. The AI model may be a convolution neural network (CNN). The forward network may infer the output of the AI chip based on the training data. The backward network may use the output of the AI chip and the ground truth data to train the weights of the AI model. In some examples, the system may train the AI model using a gradient descent method. The system may quantize the weights and update the weights during the training. In some examples, the system may perform a uniform quantization over the weights. The system may also determine the distribution of the weights. If the weight distribution is not symmetric, the system may group the weights and quantize the weights based on the grouping.
    Type: Application
    Filed: April 30, 2019
    Publication date: October 8, 2020
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Lin Yang, Baohua Sun, Yongxiong Ren, Wenhan Zhang, Patrick Zeng Dong
  • Publication number: 20200302289
    Abstract: A system for training an artificial intelligence (AI) model for an AI chip to implement an AI task may include an AI training unit to train weights of an AI model in floating point, a convolution quantization unit for quantizing the trained weights to a number of quantization levels, and an activation quantization unit for updating the weights of the AI model so that output of the AI model based at least on the updated weights are within a range of activation layers of the AI chip. The updated weights may be stored in fixed point and uploadable to the AI chip. The various units may be configured to account for the hardware constraints in the AI chip to minimize performance degradation when the trained weights are uploaded to the AI chip and expedite training convergence. Forward propagation and backward propagation may be combined in training the AI model.
    Type: Application
    Filed: September 27, 2019
    Publication date: September 24, 2020
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Yongxiong Ren, Yi Fan, Yequn Zhang, Baohua Sun, Bin Yang, Xiaochun Li, Lin Yang
  • Publication number: 20200304831
    Abstract: Methods and systems for using feature encoding for storing a video stream without redundant frames are disclosed. A video stream containing a plurality of frames is received in a computing system. Each frame is divided to one or more sub-frames with each sub-frame containing a resolution suitable as an input image to a deep learning model based on VGG-16 model, ResNet or MobilNet. Respective vectors of feature encoding values of all sub-frames of current and immediately prior frames are obtained by performing computations of the deep learning model. A difference metric between the current frame and the immediately prior frame is obtained by comparing the respective vectors using a difference measurement technique. The current frame is stored in a to-be-kept video file only when the difference metric indicates that the current frame and the immediately prior frame are different in accordance with a predefined criterion.
    Type: Application
    Filed: April 9, 2019
    Publication date: September 24, 2020
    Inventors: Lin Yang, Patrick Z. Dong, Baohua Sun
  • Publication number: 20200250523
    Abstract: In some examples, given an AI model in floating point, a system may use one or more artificial intelligence (AI) chips to train a global gain vector for use to convert the AI model in floating point to an AI model in fixed point for uploading to a physical AI chip. The system may determine initial gain vectors, and in each of multiple iterations, obtain the performance values of the AI chips based on the gain vectors and update the gam vectors for the next iteration. The gain vectors are updated based on a velocity of gain. The performance value may be based on feature maps of an AI model before and after the converting. The performance value may also be based on interference over a test dataset. Upon completion of the iterations, the system determines the global gain vector that resulted in the best performance value during the iterations.
    Type: Application
    Filed: February 5, 2019
    Publication date: August 6, 2020
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Yongxiong Ren, Yequn Zhang, Baohua Sun, Xiaochun Li, Qi Dong, Lin Yang
  • Patent number: 10713830
    Abstract: An image and the maximum number of tokens for a to-be-created image caption are received in a computing system. Font size of graphical image of the token is calculated from the maximum number of tokens and the dimension of desired input image for prediction-style image classification technique. Desired input image is divided into first and second portions. A 2-D symbol is formed by placing a resized image derived from the received image with substantially similar contents in the first portion and by initializing the second portion with blank images. Next token of the image caption is predicted by classifying the 2-D symbol using the prediction-style image classification technique. 2-D symbol is modified by appending the graphical image of just-predicted token to the existing image caption in the second portion, if termination condition for image caption creation is false. Next token is repeatedly predicted until termination condition becomes true.
    Type: Grant
    Filed: May 13, 2019
    Date of Patent: July 14, 2020
    Assignee: Gyrfalcon Technology Inc.
    Inventors: Lin Yang, Baohua Sun
  • Publication number: 20200218970
    Abstract: A list of keywords in a category of interest is defined and a list of to-be-excluded items is derived therefrom. A first set of general texts is obtained. A second set of texts is created by inserting or replacing a randomly selected item from the list of keywords into each of the first set at a randomly chosen location. A third set of texts is created by inserting or replacing a randomly selected item from the list of to-be-excluded into each of the first set at a randomly chosen location. First and second groups of 2-D symbols are formed to graphically represent the second set and the third set, respectively. The first group is associated with the category of interest while the second group is associated with the category of uninterested. Keyword detection training dataset is created by combining first and second groups of 2-D symbols.
    Type: Application
    Filed: March 11, 2019
    Publication date: July 9, 2020
    Inventors: Lin Yang, Baohua Sun
  • Publication number: 20200151558
    Abstract: A system may be configured to obtain a global artificial intelligence (AI) model for uploading into an AI chip to perform AI tasks. The system may implement a training process including receiving updated AI models from one or more client devices, determining a global AI model based on the received AI models from the client devices, and updating initial AI models for the client devices. Each client device may receive an initial AI model and train an updated AI model by training the entire parameters of the AI model together, by training a subset of the parameters of the AI model in a layer by layer fashion, or by training a subset of the parameters by parameter types. Each client device may include one or more AI chips configured to run an AI task to measure performance of an AI model. The AI model may include a convolutional neural network.
    Type: Application
    Filed: February 11, 2019
    Publication date: May 14, 2020
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Yongxiong Ren, Yequn Zhang, Baohua Sun, Xiaochun Li, Qi Dong, Lin Yang
  • Publication number: 20200151551
    Abstract: A system may include multiple client devices and a processing device communicatively coupled to the client devices. Each client device includes an artificial intelligence (AI) chip and is configured to generate an AI model. The processing device may be configured to (i) receive a respective AI model and an associated performance value of the respective AI model from each of the plurality of client devices; (ii) determine an optimal AI model based on the performance values associated with the respective AI models from the plurality of client devices; and (iii) determine a global AI model based on the optimal AI model. The system may load the global AI model into an AI chip of a client device to cause the client device to perform an AI task based on the global AI model in the AI chip. The AI model may include a convolutional neural network.
    Type: Application
    Filed: November 13, 2018
    Publication date: May 14, 2020
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Yequn Zhang, Yongxiong Ren, Baohua Sun, Lin Yang, Qi Dong
  • Publication number: 20200151584
    Abstract: A device for obtaining a local optimal AI model may include an artificial intelligence (AI) chip and a processing device configured to receive a first initial AI model from the host device. The device may load the initial AI model into the AI chip to determine a performance value of the AI model based on a dataset, and determine a probability that a current AI model should be replaced by the initial AI model. The device may determine, based on the probability, whether to replace the current AI model with the initial AI model. If it is determined that the current AI model be replaced, the device may replace the current AI model with the initial AI model. The device may repeat the above processes and obtain a final current AI model. The device may transmit the final current AI model to the host device.
    Type: Application
    Filed: November 13, 2018
    Publication date: May 14, 2020
    Applicant: Gyrfalcon Technology Inc.
    Inventors: Yequn Zhang, Yongxiong Ren, Baohua Sun, Lin Yang, Qi Dong
  • Patent number: 10596557
    Abstract: A preparation method for a molecular sieve-multiple oxide composite integral extrusion type denitration catalyst includes constructing an organic structure coating on the surface of a metal ion-exchanged molecular sieves and synchronously adding multiple oxide components, thus obtaining an ion-exchanged molecular sieve-multiple oxide composite denitration catalyst active component; and then mixing, kneading into paste, staling, carrying out integral extrusion forming, drying, and calcining, thus obtaining the integral extrusion type denitration catalyst. The molecular sieve-multiple oxide composite integral extraction type denitration catalyst has a denitration efficiency more than 80% at the temperature ranging from 250° C. to 420° C. in the presence of 10% steam and 500 ppm sulfuric dioxide.
    Type: Grant
    Filed: December 27, 2016
    Date of Patent: March 24, 2020
    Assignee: Valiant Co., Ltd.
    Inventors: Li Sun, Yongzhen Xu, Quansheng Li, Xiaoling Liu, Baohua Hu, Ming Cui, Yingjie Tang, Yuchang Wang, Zhenlei Zhou
  • Patent number: 10559694
    Abstract: A device including a biopolymer membrane, a passivation layer on the biopolymer membrane, a graphene layer on the passivation layer, a source electrode on the graphene layer, and a drain electrode on the graphene layer, wherein the graphene layer extends between the source electrode and the drain electrode.
    Type: Grant
    Filed: May 9, 2017
    Date of Patent: February 11, 2020
    Assignee: HRL Laboratories, LLC
    Inventors: Kyung-Ah Son, Baohua Yang, Hwa Chang Seo, Danny Wong, Jeong-Sun Moon
  • Patent number: 10482374
    Abstract: An ensemble learning based image classification system contains multiple cellular neural networks (CNN) based integrated circuits (ICs) operatively coupling together as a set of base learners of an ensemble for an image classification task. Each CNN based IC is configured with at least one distinct deep learning model in form of filter coefficients. The ensemble learning based image classification system further contains a controller configured as a meta learner of the ensemble and a memory based data buffer for holding various data used in the ensemble by the controller and the CNN based ICs. Various data may include input imagery data to be classified. Various data may also include extracted feature vectors or image classification outputs out of the set of base learners. The extracted feature vectors or image classification outputs are then used by the meta learner to further perform the image classification task.
    Type: Grant
    Filed: July 3, 2019
    Date of Patent: November 19, 2019
    Assignee: Gyrfalcon Technology Inc.
    Inventors: Lin Yang, Patrick Z. Dong, Charles Jin Young, Jason Z. Dong, Michael Lin, Baohua Sun
  • Publication number: 20190318226
    Abstract: A deep learning image processing system contains at least first and second groups of cellular neural networks (CNN) based integrated circuits (ICs). The first group and the second group are operatively connected in parallel via a network bus. CNN based ICs within each of the first and second groups are operatively connected in series via the network bus. The first group is configured for performing convolutional operations in respective portions of a deep learning model for extracting features out of a first subsection of input data. The second group is configured for performing convolutional operations in respective portions of the deep learning model for extracting features out of a second subsection of the input data. The deep learning model is divided into a plurality of consecutive portions being handled by the respective CNN based ICs. The input data is partitioned into at least first and second subsections.
    Type: Application
    Filed: April 12, 2018
    Publication date: October 17, 2019
    Inventors: Lin Yang, Baohua Sun, Jason Z. Dong, Charles Jin Young
  • Patent number: 10445568
    Abstract: Two-dimensional symbols with each containing multiple ideograms for facilitating machine learning are disclosed. Two-dimensional symbol comprises a matrix of N×N pixels of data representing a “super-character”. The matrix is divided into M×M sub-matrices with each of the sub-matrices containing (N/M)×(N/M) pixels. N and M are positive integers or whole numbers, and N is preferably a multiple of M. Each of the sub-matrices represents one ideogram defined in an ideogram collection set. “Super-character” represents at least one meaning each formed with a specific combination of a plurality of ideograms. Ideogram collection set includes, but is not limited to, pictograms, logosyllabic characters, Japanese characters, Korean characters, punctuation marks, numerals, special characters. Logosyllabic characters may contain one or more of Chinese characters, Japanese characters, Korean characters. Features of each ideogram can be represented by more than one layer of two-dimensional symbol.
    Type: Grant
    Filed: April 4, 2019
    Date of Patent: October 15, 2019
    Assignee: Gyrfalcon Technology Inc.
    Inventors: Lin Yang, Patrick Z. Dong, Baohua Sun
  • Publication number: 20190311246
    Abstract: A system may include a decentralized communication network and multiple processing devices on the network. Each processing device may have an artificial intelligence (AI) chip, the device may be configured to generate an AI model, determine the performance value of the AI model on the AI chip, receive a chain from the network where the chain contains a performance measure. If the performance value of the AI model is better than the performance measure, then the processing device may broadcast the AI model to the network for verification. If the AI model is verified by the network, the device may update the chain with the performance value so that the chain can be shared by the multiple processing devices on the network. Any processing device on the network may also verify an AI model broadcasted by any other device. Methods for generating the AI model are also provided.
    Type: Application
    Filed: April 10, 2018
    Publication date: October 10, 2019
    Applicant: GYRFALCON TECHNOLOGY INC.
    Inventors: Lin Yang, Charles Jin Young, Jason Zeng Dong, Patrick Zeng Dong, Baohua Sun, Yequn Zhang
  • Publication number: 20190311247
    Abstract: A system may include a decentralized communication network and multiple processing devices on the network. Each processing device may have an artificial intelligence (AI) chip, the device may be configured to generate an AI model, determine the performance value of the AI model on the AI chip, receive a chain from the network where the chain contains a performance measure. If the performance value of the AI model is better than the performance measure, then the processing device may broadcast the AI model to the network for verification. If the AI model is verified by the network, the device may update the chain with the performance value so that the chain can be shared by the multiple processing devices on the network. Any processing device on the network may also verify an AI model broadcasted by any other device. Methods for generating the AI model are also provided.
    Type: Application
    Filed: April 10, 2018
    Publication date: October 10, 2019
    Applicant: GYRFALCON TECHNOLOGY INC.
    Inventors: Lin Yang, Charles Jin Young, Jason Zeng Dong, Patrick Zeng Dong, Baohua Sun, Yequn Zhang