Patents by Inventor Ming-Kai Hsu

Ming-Kai Hsu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Methods and apparatuses for high performance and accuracy fixed-point scale implementation

Patent number: 12293275

Abstract: A method to implement a fixed-point scale layer in a neural network for data processing is provided in the present disclosure. The method includes: receiving fixed-point input data over a channel of a standalone floating-point scale layer, and converting the floating-point input data into fixed-point input data of the standalone floating-point scale layer; obtaining fixed-point quantization parameters in each channel based on the input data and floating-point parameters ?i, ?i in each channel; converting the standalone floating-point scale layer based on the fixed-point quantization parameters into a fixed-point scale layer for processing the fixed-point input data to generate fixed-point output data; and mapping the fixed-point scale layer to a fixed-point convolution layer and the computation of convolution is done by matrix multiplication that can be executed on a GEMM engine.

Type: Grant

Filed: July 6, 2021

Date of Patent: May 6, 2025

Assignee: BEIJING DAJIA INTERNET INFORMATION TECHNOLOGY CO., LTD.

Inventors: Ming Kai Hsu, Sitong Feng
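
The abstract above describes turning a per-channel floating-point scale layer into a fixed-point layer that runs as a matrix multiplication. The following is a minimal sketch of that idea, not the patented method. It assumes the scale layer computes y = a * x + b per channel (the parameter symbols are garbled in this listing, so `a` and `b` are assumed names), and it assumes a single shared number of fractional bits, `frac_bits`, which is a hypothetical simplification. The diagonal weight matrix stands in for a 1x1 convolution with no cross-channel mixing.

```python
def to_fixed(value, frac_bits):
    """Round a float to an integer carrying `frac_bits` fractional bits."""
    return int(round(value * (1 << frac_bits)))


def fixed_scale_layer(x_q, a, b, frac_bits=8):
    """Apply y = a * x + b per channel using integer arithmetic only.

    x_q : list of integers, one per channel, already in fixed point
          with `frac_bits` fractional bits.
    a, b: per-channel floating-point parameters (assumed names).
    Returns integers with `frac_bits` fractional bits.
    """
    channels = len(x_q)
    a_q = [to_fixed(v, frac_bits) for v in a]
    b_q = [to_fixed(v, frac_bits) for v in b]

    # Diagonal weight matrix: equivalent to a 1x1 convolution in which
    # each output channel depends only on the matching input channel.
    weight = [[a_q[i] if i == j else 0 for j in range(channels)]
              for i in range(channels)]

    out = []
    for i in range(channels):
        # Matrix-vector product row; the result has 2 * frac_bits fractional bits.
        acc = sum(weight[i][j] * x_q[j] for j in range(channels))
        # Add half an LSB, then shift back down to frac_bits fractional bits.
        acc = (acc + (1 << (frac_bits - 1))) >> frac_bits
        out.append(acc + b_q[i])
    return out
```

For example, with `frac_bits=8` the inputs 1.5 and -2.0 are represented as 384 and -512, and dividing each returned integer by 256 recovers the floating-point result.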

Generative adversarial network for video compression

Patent number: 12058312

Abstract: A method and an apparatus for video processing are provided. The method includes that a decoding terminal receives a plurality of coded video frames coded using one or more generative adversarial networks (GANs), receives network parameters related to the one or more GANs, and decodes the plurality of coded video frames using GANs based on the network parameters. Further, the one or more GANs respectively implement one or more video coding functions including reference-frame coding, motion-compensated frame prediction, and residue-frame coding.

Type: Grant

Filed: October 6, 2021

Date of Patent: August 6, 2024

Assignees: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Pengli Du, Ying Liu, Nam Ling, Lingzhi Liu, Yongxiong Ren, Ming Kai Hsu

Quantized softmax layer for neural networks

Patent number: 11861452

Abstract: Quantized softmax layers in neural networks are described. Some embodiments involve receiving, at an input to a softmax layer of a neural network from an intermediate layer of the neural network, a non-normalized output comprising a plurality of intermediate network decision values. Then for each intermediate network decision value of the plurality of intermediate network decision values, the embodiment involves: calculating a difference between the intermediate network decision value and a maximum network decision value; requesting, from a lookup table, a corresponding lookup table value using the difference between the intermediate network decision value and the maximum network decision value; and selecting the corresponding lookup table value as a corresponding decision value. A normalized output is then generated comprising the corresponding lookup table value for said each intermediate network decision value of the plurality of intermediate network decision values.

Type: Grant

Filed: June 17, 2019

Date of Patent: January 2, 2024

Assignee: Cadence Design Systems, Inc.

Inventor: Ming Kai Hsu
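
The lookup-table procedure in the abstract above can be sketched as follows. This is an illustration under stated assumptions rather than the patented design: the table contents (a scaled `exp(-d * step)`), the `step` and `out_bits` parameters, and the clamping of large differences to the last table entry are all hypothetical choices. The difference is taken as maximum minus value so that it can index the table directly.

```python
import math


def build_exp_lut(max_diff, step, out_bits=8):
    """Build a table whose entry d holds exp(-d * step) as an unsigned integer.

    Entry 0 corresponds to the maximum input and maps to full scale.
    """
    top = (1 << out_bits) - 1
    return [int(round(top * math.exp(-d * step))) for d in range(max_diff + 1)]


def lut_softmax(values, lut):
    """Map each integer decision value to a table entry.

    For every value, the difference to the maximum value is computed
    and used as the table index; the table entry is the output.
    """
    peak = max(values)
    last = len(lut) - 1
    out = []
    for v in values:
        diff = peak - v                   # non-negative by construction
        out.append(lut[min(diff, last)])  # clamp very small inputs
    return out
```

Note that no exponentiation or division happens per input at inference time; the only per-value work is a subtraction and a table read.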

Quantizing trained neural networks with removal of normalization

Patent number: 11861492

Abstract: Various embodiments provide for quantizing a trained neural network with removal of normalization with respect to at least one layer of the quantized neural network, such as a quantized multiple fan-in layer (e.g., element-wise add or sum layer).

Type: Grant

Filed: December 26, 2019

Date of Patent: January 2, 2024

Assignee: Cadence Design Systems, Inc.

Inventor: Ming Kai Hsu

Constraint-based dynamic quantization adjustment for fixed-point processing

Patent number: 11630982

Abstract: Aspects of the present disclosure address systems and methods for fixed-point quantization using a dynamic quantization level adjustment scheme. Consistent with some embodiments, a method comprises accessing a neural network comprising floating-point representations of filter weights corresponding to one or more convolution layers. The method further includes determining a peak value of interest from the filter weights and determining a quantization level for the filter weights based on a number of bits in a quantization scheme. The method further includes dynamically adjusting the quantization level based on one or more constraints. The method further includes determining a quantization scale of the filter weights based on the peak value of interest and the adjusted quantization level. The method further includes quantizing the floating-point representations of the filter weights using the quantization scale to generate fixed-point representations of the filter weights.

Type: Grant

Filed: September 14, 2018

Date of Patent: April 18, 2023

Assignee: Cadence Design Systems, Inc.

Inventors: Ming Kai Hsu, Sandip Parikh
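
The steps in the abstract above (peak value, quantization level from the bit width, constrained adjustment of the level, scale, then quantization) can be sketched as follows. The abstract does not say what the constraints are, so the `max_level` cap used here is a hypothetical stand-in, and the symmetric signed scheme is likewise an assumption.

```python
def quantize_weights(weights, bits=8, max_level=None):
    """Quantize floating-point filter weights to signed integers.

    max_level is a hypothetical constraint: an upper bound on the
    quantization level (the abstract does not specify its constraints).
    Returns (quantized_weights, scale).
    """
    peak = max(abs(w) for w in weights)   # peak value of interest
    if peak == 0:
        return [0] * len(weights), 1.0

    level = (1 << (bits - 1)) - 1         # e.g. 127 for 8 bits
    if max_level is not None:
        level = min(level, max_level)     # adjust the level under the constraint

    scale = level / peak                  # scale from peak and adjusted level
    quantized = [max(-level, min(level, int(round(w * scale))))
                 for w in weights]
    return quantized, scale
```

Dividing a quantized weight by the returned scale gives back an approximation of the original floating-point weight.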

GENERATIVE ADVERSARIAL NETWORK FOR VIDEO COMPRESSION

Publication number: 20230105436

Abstract: A method and an apparatus for video processing are provided. The method includes that a decoding terminal receives a plurality of coded video frames coded using one or more generative adversarial networks (GANs), receives network parameters related to the one or more GANs, and decodes the plurality of coded video frames using GANs based on the network parameters. Further, the one or more GANs respectively implement one or more video coding functions including reference-frame coding, motion-compensated frame prediction, and residue-frame coding.

Type: Application

Filed: October 6, 2021

Publication date: April 6, 2023

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Pengli DU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU

METHODS AND DEVICES FOR NEURAL NETWORK QUANTIZATION USING TEMPORAL PROFILING

Publication number: 20230084000

Abstract: Methods and apparatuses are provided for temporal profiling for neural network quantization. The method includes: obtaining a neural network that comprises a node connected to different paths at different time periods; obtaining node outputs for the node at the different time periods; determining statistic properties of the node outputs at the different time periods; and determining activation ranges of the node outputs based on the statistic properties.

Type: Application

Filed: September 15, 2021

Publication date: March 16, 2023

Applicant: KWAI INC.

Inventors: Ming Kai HSU, Chao YANG, Yue MA, Sikai WANG, Sitong FENG, Wenhui CAO, Danqing LI, Hui ZHONG, Lingzhi LIU
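
One way to picture the temporal profiling described in the abstract above is to keep separate statistics for each time period and derive one activation range per period. The sketch below is an assumption-laden illustration: the choice of statistics (mean and population standard deviation), the `k`-sigma rule, and the clipping to the observed minimum and maximum are hypothetical and are not taken from the application.

```python
import statistics


def profile_node(outputs_by_period, k=3.0):
    """Derive one activation range per time period for a single node.

    outputs_by_period maps a period label to the node outputs observed
    in that period. The range is mean +/- k standard deviations,
    clipped to the observed minimum and maximum (an assumed rule).
    """
    ranges = {}
    for period, outputs in outputs_by_period.items():
        mean = statistics.mean(outputs)
        std = statistics.pstdev(outputs)
        low = max(min(outputs), mean - k * std)
        high = min(max(outputs), mean + k * std)
        ranges[period] = (low, high)
    return ranges
```

Keeping the ranges separate per period means a node that sees small values on one path and large values on another is not forced into a single, overly wide range.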

METHODS AND DEVICES FOR EFFICIENT GENERAL DECONVOLUTION IMPLEMENTATION ON HARDWARE ACCELERATOR

Publication number: 20230075264

Abstract: Methods and devices are provided for implementing efficient general deconvolution implementation on hardware accelerator.

Type: Application

Filed: September 7, 2021

Publication date: March 9, 2023

Applicant: KWAI INC.

Inventors: Shiya LIU, Ming Kai HSU, Quan LIN, Lingzhi LIU

Methods and apparatuses for high performance and accuracy fixed-point scale implementation

Publication number: 20230010981

Abstract: A method to implement a fixed-point scale layer in a neural network for data processing is provided in the present disclosure. The method includes: receiving fixed-point input data over a channel of a standalone floating-point scale layer, and converting the floating-point input data into fixed-point input data of the standalone floating-point scale layer; obtaining fixed-point quantization parameters in each channel based on the input data and floating-point parameters ?i, ?i in each channel; converting the standalone floating-point scale layer based on the fixed-point quantization parameters into a fixed-point scale layer for processing the fixed-point input data to generate fixed-point output data; and mapping the fixed-point scale layer to a fixed-point convolution layer and the computation of convolution is done by matrix multiplication that can be executed on a GEMM engine.

Type: Application

Filed: July 6, 2021

Publication date: January 12, 2023

Applicant: KWAI INC.

Inventors: Ming Kai HSU, Sitong FENG

METHODS AND APPARATUSES FOR HIGH PERFORMANCE AND ACCURACY FIXED-POINT BATCHNORM IMPLEMENTATION

Publication number: 20230010197

Abstract: A method to implement a fixed-point batchnorm layer in a neural network for data processing is provided in the present disclosure. The method includes: receiving fixed-point input data over a channel of a standalone floating-point batchnorm layer, and converting the floating-point input data into fixed-point input data of the standalone floating-point batchnorm layer; obtaining fixed-point quantization parameters in each channel based on the input data and floating-point parameters ?i, ?i, ?i in each channel; converting the standalone floating-point batchnorm layer based on the fixed-point quantization parameters into a fixed-point batchnorm layer for processing the fixed-point input data to generate fixed-point output data; and mapping the fixed-point batchnorm layer to a fixed-point convolution layer and the computation of convolution is done by matrix multiplication that can be executed on a GEMM engine.

Type: Application

Filed: July 6, 2021

Publication date: January 12, 2023

Applicant: KWAI INC.

Inventors: Ming Kai HSU, Sikai WANG

CLASS-SPECIFIC NEURAL NETWORK FOR VIDEO COMPRESSED SENSING

Publication number: 20220292727

Abstract: A class-specific neural network for video compressed sensing and methods for training and testing the class-specific neural network are provided. The class-specific neural network includes a Gaussian-mixture model (GMM) and a plurality of encoders, where the GMM classifies video frame blocks with a plurality of clusters and assigns the video frame blocks to the plurality of clusters. Further, the plurality of encoders receive the video frame blocks and generate a plurality of compressed-sensed frame block vectors, where the plurality of encoders correspond to the plurality of clusters.

Type: Application

Filed: March 15, 2022

Publication date: September 15, 2022

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Yifei PEI, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU

3D SEPARABLE DEEP CONVOLUTIONAL NEURAL NETWORK FOR MOVING OBJECT DETECTION

Publication number: 20220164630

Abstract: A method for detecting moving objects in video frames, an apparatus and a non-transitory computer-readable storage medium thereof are provided. The method includes that: an encoder in a 3-dimensional (3D) separable convolutional neural network with multi-input multi-output (3DS_MM) receives a first input including multiple video frames, where the encoder includes a plurality of encoder layers including 3D separable convolutional neural network (CNN) layers; the encoder generates a first encoder output; and a decoder in the 3DS_MM receives the first encoder output and generates a first output including multiple first binary masks related to the first input, where the decoder includes a plurality of decoder layers comprising 3D separable transposed CNN layers.

Type: Application

Filed: November 22, 2021

Publication date: May 26, 2022

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Bingxin HOU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU

System and method for evaluating a classifier implemented within an image signal processor

Patent number: 9842280

Abstract: A system for evaluating a classifier of an image signal processor (ISP) includes (i) a microprocessor and (ii) memory storing training images, the microprocessor being capable of sending each training image to the ISP. The system includes machine-readable instructions stored within the memory and executed by the microprocessor capable of: (i) selecting a subset of images based upon a divider position, (ii) controlling the ISP to classify each image as belonging or not belonging to an object class, (iii) determining a positive-match count, (iv) determining an error count based upon the positive-match count and total number of training images belonging to the object class, (v) repeating, for other divider positions, steps of selecting, controlling, and determining to identify an optimal divider position and a minimum-error count; and (vi) determining the classifier's optimality by comparing the optimal divider position to a predetermined optimal divider position and a predetermined minimum-error count.

Type: Grant

Filed: November 4, 2015

Date of Patent: December 12, 2017

Assignee: OmniVision Technologies, Inc.

Inventor: Ming Kai Hsu

Non-visible illumination scheme

Patent number: 9674465

Abstract: Embodiments of an apparatus and process are described. The process includes capturing a first image frame using an image sensor, the image sensor including a pixel array comprising a plurality of pixels arranged in rows and columns and a color filter array optically coupled to the pixel array. A region of interest within the first image frame is determined, and the exposure time of the image sensor is adjusted to eliminate a substantial fraction of the visible light captured by the image sensor. A rolling shutter procedure is used with the pixel array to capture at least one subsequent frame using the adjusted exposure time, and a source of invisible radiation is activated while the rolling shutter enters the region of interest and deactivated when the rolling shutter exits the region of interest. Finally, an image of the region of interest is output. Other embodiments are disclosed and claimed.

Type: Grant

Filed: June 3, 2015

Date of Patent: June 6, 2017

Assignee: OmniVision Technologies, Inc.

Inventors: Jizhang Shan, Ming-Kai Hsu

System And Method For Evaluating A Classifier Implemented Within An Image Signal Processor

Publication number: 20170124429

Abstract: A system for evaluating a classifier of an image signal processor (ISP) includes (i) a microprocessor and (ii) memory storing training images, the microprocessor being capable of sending each training image to the ISP. The system includes machine-readable instructions stored within the memory and executed by the microprocessor capable of: (i) selecting a subset of images based upon a divider position, (ii) controlling the ISP to classify each image as belonging or not belonging to an object class, (iii) determining a positive-match count, (iv) determining an error count based upon the positive-match count and total number of training images belonging to the object class, (v) repeating, for other divider positions, steps of selecting, controlling, and determining to identify an optimal divider position and a minimum-error count; and (vi) determining the classifier's optimality by comparing the optimal divider position to a predetermined optimal divider position and a predetermined minimum-error count.

Type: Application

Filed: November 4, 2015

Publication date: May 4, 2017

Inventor: Ming Kai Hsu

Imaging systems with single-photon-avalanche-diodes and sensor translation, and associated methods

Patent number: 9628735

Abstract: An imaging system with single-photon-avalanche-diodes (SPADs) and sensor translation for capturing a plurality of first images to enable generation of an enhanced-resolution image includes (a) an image sensor with SPAD pixels for capturing the plurality of first images at a plurality of spatially shifted positions of the image sensor, respectively, and (b) an actuator for translating the image sensor, parallel to its light receiving surface, to place the image sensor at the plurality of spatially shifted positions. A method for capturing a plurality of first images that enable composition of an enhanced-resolution image includes (a) translating an image sensor parallel to its light receiving surface to place the image sensor at a plurality of spatially shifted positions, and (b) capturing, using SPAD pixels implemented in pixel array of the image sensor, the plurality of first images at the plurality of spatially shifted positions, respectively.

Type: Grant

Filed: June 22, 2015

Date of Patent: April 18, 2017

Assignee: OmniVision Technologies, Inc.

Inventors: Bowei Zhang, Ming-Kai Hsu

Imaging Systems With Single-Photon-Avalanche-Diodes And Sensor Translation, And Associated Methods

Publication number: 20160373676

Abstract: An imaging system with single-photon-avalanche-diodes (SPADs) and sensor translation for capturing a plurality of first images to enable generation of an enhanced-resolution image includes (a) an image sensor with SPAD pixels for capturing the plurality of first images at a plurality of spatially shifted positions of the image sensor, respectively, and (b) an actuator for translating the image sensor, parallel to its light receiving surface, to place the image sensor at the plurality of spatially shifted positions. A method for capturing a plurality of first images that enable composition of an enhanced-resolution image includes (a) translating an image sensor parallel to its light receiving surface to place the image sensor at a plurality of spatially shifted positions, and (b) capturing, using SPAD pixels implemented in pixel array of the image sensor, the plurality of first images at the plurality of spatially shifted positions, respectively.

Type: Application

Filed: June 22, 2015

Publication date: December 22, 2016

Inventors: Bowei Zhang, Ming-Kai Hsu

NON-VISIBLE ILLUMINATION SCHEME

Publication number: 20160360124

Abstract: Embodiments of an apparatus and process are described. The process includes capturing a first image frame using an image sensor, the image sensor including a pixel array comprising a plurality of pixels arranged in rows and columns and a color filter array optically coupled to the pixel array. A region of interest within the first image frame is determined, and the exposure time of the image sensor is adjusted to eliminate a substantial fraction of the visible light captured by the image sensor. A rolling shutter procedure is used with the pixel array to capture at least one subsequent frame using the adjusted exposure time, and a source of invisible radiation is activated while the rolling shutter enters the region of interest and deactivated when the rolling shutter exits the region of interest. Finally, an image of the region of interest is output. Other embodiments are disclosed and claimed.

Type: Application

Filed: June 3, 2015

Publication date: December 8, 2016

Inventors: Jizhang Shan, Ming-Kai Hsu

Feature detection in image capture

Patent number: 9444999

Abstract: A feature detection process includes identifying an approximate location of a feature in a preliminary image. A gradient phase map of image pixel intensities within the approximate location is computed. A projection result is determined by applying a projection function to the gradient phase map. The projection result is analyzed to determine a state of the feature.

Type: Grant

Filed: August 5, 2014

Date of Patent: September 13, 2016

Assignee: OmniVision Technologies, Inc.

Inventor: Ming-Kai Hsu
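
The first two computational steps in the abstract above, a gradient phase map followed by a projection, can be sketched as follows. This is only an illustration under assumptions: central differences for the gradient, `atan2` for the phase, and a per-row mean as the projection function are hypothetical choices, and the final step of analyzing the projection to decide the feature's state is not shown.

```python
import math


def gradient_phase_map(patch):
    """Return the gradient phase (radians) at each interior pixel.

    patch is a 2D list of pixel intensities covering the approximate
    feature location. Central differences are an assumed gradient operator.
    """
    height, width = len(patch), len(patch[0])
    phase = []
    for y in range(1, height - 1):
        row = []
        for x in range(1, width - 1):
            gx = (patch[y][x + 1] - patch[y][x - 1]) / 2.0
            gy = (patch[y + 1][x] - patch[y - 1][x]) / 2.0
            row.append(math.atan2(gy, gx))
        phase.append(row)
    return phase


def row_projection(phase):
    """Project the phase map onto the vertical axis by averaging each row."""
    return [sum(row) / len(row) for row in phase]
```

A patch whose intensity increases only from top to bottom yields a phase of pi/2 at every interior pixel, while a patch that increases only from left to right yields a phase of 0.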

FEATURE DETECTION IN IMAGE CAPTURE

Publication number: 20160044237

Abstract: A feature detection process includes identifying an approximate location of a feature in a preliminary image. A gradient phase map of image pixel intensities within the approximate location is computed. A projection result is determined by applying a projection function to the gradient phase map. The projection result is analyzed to determine a state of the feature.

Type: Application

Filed: August 5, 2014

Publication date: February 11, 2016

Inventor: Ming-Kai Hsu
