Patents Assigned to KWAI INC.

Methods and devices for joint sensor and pixel domain denoising

Patent number: 11967047

Abstract: A method, apparatus, and a non-transitory computer-readable storage medium for image denoising. The method may include obtaining a raw image captured by a camera. The method may also include obtaining a color modeled image based on the raw image. The method may further include obtaining a subsampled raw image based on the raw image. The method may also include obtaining a denoised image based on a neural network processing the color modeled image and the subsampled raw image.

Type: Grant

Filed: September 30, 2021

Date of Patent: April 23, 2024

Assignee: KWAI INC.

Inventors: Paras Maharjan, Ning Xu, Xuan Xu, Yuyan Song
Multi-level intermediate representation decoder for heterogeneous platforms

Patent number: 11928446

Abstract: A method, apparatus, and a non-transitory computer-readable storage medium for generating heterogenous platform code. The method may obtain a neural network model. The neural network model may be programed to run on at least one platform. The method may also obtain an initial intermediate representation (IR) code by encoding the neural network model, and obtain a target IR code by adding decorations to the initial IR code based on a target platform. The method may also output an executable code optimized to run on the target platform by decoding the target IR code.

Type: Grant

Filed: November 11, 2021

Date of Patent: March 12, 2024

Assignee: KWAI INC.

Inventors: Zhen Peng, Yang Liu, Hanxian Huang, Yongxiong Ren, Jishen Yang, Lingzhi Liu, Xin Chen
Systems and methods for accelerating automatic speech recognition based on compression and decompression

Patent number: 11830480

Abstract: Systems and methods are provided for automatic speech recognition. In the method, the system obtains a padded sequence by processing a plurality of acoustic signals. The system compresses the padded sequence by reducing the size of the padded sequence to obtain a compressed sequence. The system inputs the compressed sequence into a pre-trained encoder neural network to obtain an encoded sequence and then decompresses the encoded sequence by recovering the encoded sequence to an original sequential ordering. The system inputs the encoded sequence to a decoding module to obtain recognition texts.

Type: Grant

Filed: February 17, 2021

Date of Patent: November 28, 2023

Assignee: KWAI INC.

Inventors: Yongxiong Ren, Yang Liu, Heng Liu, Lingzhi Liu
Systems and methods for automatic speech recognition based on graphics processing units

Patent number: 11741967

Abstract: An automatic speech recognition system and a method thereof are provided. The system includes an encoder and a decoder. The encoder comprises a plurality of encoder layers. At least one encoder layer includes a plurality of encoder sublayers fused into one or more encoder kernels. The system further comprises a first pair of ping-pong buffers communicating with the one or more encoder kernels. The decoder comprises a plurality of decoder layers. At least one decoder layer includes a plurality of decoder sublayers fused into one or more decoder kernels. The decoder receives a decoder output related to the encoder output and generates a decoder output. The encoder sends the decoder output to a beam search kernel.

Type: Grant

Filed: January 4, 2021

Date of Patent: August 29, 2023

Assignee: KWAI INC.

Inventors: Yongxiong Ren, Heng Liu, Yang Liu, Lingzhi Liu, Jie Li, Yuanyuan Zhao, Xiaorui Wang
Methods and device for video data analysis

Patent number: 11682210

Abstract: Methods and apparatuses are provided for movie and television series video data analysis. The method includes: gathering and reading, by a processor, a plurality of input movies; removing a video border of each input movie; splitting the input movie into short clips, based on accuracy and efficiency requirements of different analyzing models; assessing attributes of each input movie by analyzing, with the different analyzing models, the input movie, the short clips cut from the input movie, and the frame images extracted from the input movie; and summarizing the plurality of input movies based on matching and integrating the attributes assessed for each input movie.

Type: Grant

Filed: November 30, 2021

Date of Patent: June 20, 2023

Assignee: Kwai Inc.

Inventors: Jiayi Liu, Huayan Wang, Xin Miao
METHOD AND APPARATUS FOR GENERATING PAIRED LOW RESOLUTION AND HIGH RESOLUTION IMAGES USING A GENERATIVE ADVERSARIAL NETWORK

Publication number: 20230169326

Abstract: A method for training a neural network system for generating paired low resolution (LR) and high resolution (HR) images, the neural network system, an apparatus, and a non-transitory computer-readable storage medium thereof are provided. The method includes that a first generator in the neural network system generates a LR image based on a random vector; a second generator in the neural network system generates a HR image based on the random vector, where the HR image is paired with the LR image; obtaining a plurality of losses based on the LR image and the HR image; and updating the first generator based on the plurality of losses.

Type: Application

Filed: November 30, 2021

Publication date: June 1, 2023

Applicant: KWAI INC.

Inventors: Ahmed Cheikh SIDIYA, Xuan XU, Ning XU
NEURAL NETWORK SYSTEM AND METHOD FOR RESTORING IMAGES USING TRANSFORMER AND GENERATIVE ADVERSARIAL NETWORK

Publication number: 20230169626

Abstract: A neural network system for restoring images, a method and a non-transitory computer-readable storage medium thereof are provided. The neural network system includes an encoder and a generative adversarial network (GAN) prior network. The encoder includes a plurality of encoder blocks, where each encoder block includes at least one transformer block and one convolution layer, where the encoder receives an input image and generates a plurality of encoder features and a plurality of latent vectors. Additionally, the GAN prior network includes a plurality of pre-trained generative prior layers, where the GAN prior network receives the plurality of encoder features and the plurality of latent vectors from the encoder and generates an output image with super-resolution.

Type: Application

Filed: November 30, 2021

Publication date: June 1, 2023

Applicant: KWAI INC.

Inventors: Ahmed Cheikh SIDIYA, Xuan XU, Ning XU
METHODS AND DEVICE FOR VIDEO DATA ANALYSIS

Publication number: 20230169770

Abstract: Methods and apparatuses are provided for movie and television series video data analysis. The method includes: gathering and reading, by a processor, a plurality of input movies; removing a video border of each input movie; splitting the input movie into short clips, based on accuracy and efficiency requirements of different analyzing models; assessing attributes of each input movie by analyzing, with the different analyzing models, the input movie, the short clips cut from the input movie, and the frame images extracted from the input movie; and summarizing the plurality of input movies based on matching and integrating the attributes assessed for each input movie.

Type: Application

Filed: November 30, 2021

Publication date: June 1, 2023

Applicant: KWAI INC.

Inventors: Jiayi LIU, Huayan WANG, Xin MIAO
METHOD AND APPARATUS FOR LENGTH-AWARE LOCAL TILING IN A SPARSE ATTENTION MODULE IN A TRANSFORMER

Publication number: 20230153381

Abstract: A method and an apparatus for length-aware local tiling in a sparse attention module in a transformer in heterogeneous devices are provided. The method includes that a heterogeneous device including one or more GPUs: divides a transformed sparsity mask into a plurality of first tiles and obtaining one or more effective first tiles from the plurality of first tiles, where each effective first tile includes at least one non-zero element; loads the one or more effective first tiles into a shared memory in the one or more GPUs and loads a plurality of elements in a first matrix corresponding to the one or more effective first tiles into the shared memory; and performs multiplication by a first sampled dense-dense matrix multiplication (SDDMM) kernel in the sparse attention module in the transformer by fetching the one or more effective first tiles and the plurality of elements from the shared memory.

Type: Application

Filed: November 17, 2021

Publication date: May 18, 2023

Applicant: KWAI INC.

Inventors: Zhendong WANG, Yongxiong REN, Yang LIU, Lingzhi LIU
MULTI-LEVEL INTERMEDIATE REPRESENTATION DECODER FOR HETEROGENEOUS PLATFORMS

Publication number: 20230143291

Abstract: A method, apparatus, and a non-transitory computer-readable storage medium for generating heterogenous platform code. The method may obtain a neural network model. The neural network model may be programed to run on at least one platform. The method may also obtain an initial intermediate representation (IR) code by encoding the neural network model, and obtain a target IR code by adding decorations to the initial IR code based on a target platform. The method may also output an executable code optimized to run on the target platform by decoding the target IR code.

Type: Application

Filed: November 11, 2021

Publication date: May 11, 2023

Applicant: KWAI INC.

Inventors: Zhen PENG, Yang LIU, Hanxian HUANG, Yongxiong REN, Jishen YANG, Lingzhi LIU, Xin CHEN
METHODS AND DEVICES FOR ACCELERATING A TRANSFORMER WITH A SPARSE ATTENTION PATTERN

Publication number: 20230133305

Abstract: A method and an apparatus for accelerating a transformer with a sparse attention pattern are provided. The method includes that a heterogeneous device including one or more GPUs loads a first matrix, a second matrix, and a transformed sparsity mask into a first sampled dense-dense matrix multiplication (SDDMM) kernel in a sparse attention module in the transformer and generates a first output based on the first matrix, the second matrix, and the transformed sparsity mask by the first SDDMM kernel, generates a second output by a softmax kernel in the sparse attention module based on the first output, loads the second output, a third matrix, and the transformed sparsity mask into a matrix multiplication kernel in the sparse attention module, and generates an output of the sparse attention module.

Type: Application

Filed: October 28, 2021

Publication date: May 4, 2023

Applicant: KWAI INC.

Inventors: Zhendong WANG, Yongxiong REN, Yang LIU, Lingzhi LIU
METHODS AND DEVICES FOR JOINT SENSOR AND PIXEL DOMAIN DENOISING

Publication number: 20230109090

Abstract: A method, apparatus, and a non-transitory computer-readable storage medium for image denoising. The method may include obtaining a raw image captured by a camera. The method may also include obtaining a color modeled image based on the raw image. The method may further include obtaining a subsampled raw image based on the raw image. The method may also include obtaining a denoised image based on a neural network processing the color modeled image and the subsampled raw image.

Type: Application

Filed: September 30, 2021

Publication date: April 6, 2023

Applicant: KWAI INC.

Inventors: Paras MAHARJAN, Ning XU, Xuan XU, Yuyan SONG
GENERATIVE ADVERSARIAL NETWORK FOR VIDEO COMPRESSION

Publication number: 20230105436

Abstract: A method and an apparatus for video processing are provided. The method includes that a decoding terminal receives a plurality of coded video frames coded using one or more generative adversarial networks (GANs), receives network parameters related to the one or more GANs, and decodes the plurality of coded video frames using GANs based on the network parameters. Further, the one or more GANs respectively implement one or more video coding functions including reference-frame coding, motion-compensated frame prediction, and residue-frame coding.

Type: Application

Filed: October 6, 2021

Publication date: April 6, 2023

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Pengli DU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU
Methods and devices for editing and generating videos

Patent number: 11620828

Abstract: Methods and apparatuses are provided for editing and generating a short video based upon a long video. The method includes: obtaining a plurality of short source video clips as candidate video clips; obtaining attributes of each short source video clip; obtaining a plurality of target base videos according to a target genre, processing the plurality of target base videos by splitting each target base video into a plurality of short target base video clips, and obtaining attributes of each short target base video clip; selecting short target video clips from the plurality of short source video clips, based on distribution of the attributes obtained for the plurality of the short source video clips and the plurality of short target base video clips; and editing and combining the short target video clips selected from the plurality of short source video clips, to obtain a target video.

Type: Grant

Filed: November 29, 2021

Date of Patent: April 4, 2023

Assignee: KWAI INC.

Inventors: Jiayi Liu, Huayan Wang
METHODS AND DEVICES FOR IMAGE RESTORATION USING SUB-BAND SPECIFIC TRANSFORM DOMAIN LEARNING

Publication number: 20230099539

Abstract: A method, apparatus, and a non-transitory computer-readable storage medium for sub-band image reconstruction. The method may include obtaining an image captured by a camera. The method may also obtain a transform image based on the image captured by the camera. The transform image may be in a transform domain. The method may further obtain decomposed image components of the transform image. The decomposed image components may include a low frequency component and at least one high frequency component.

Type: Application

Filed: September 30, 2021

Publication date: March 30, 2023

Applicant: KWAI INC.

Inventors: Paras MAHARJAN, Ning XU, Xuan XU, Yuyan SONG
METHODS AND APPARATUSES FOR PHOTOREALISTIC RENDERING OF IMAGES USING MACHINE LEARNING

Publication number: 20230087476

Abstract: A neural network training method, an image processing method, and apparatuses thereof are provided. The neural network training method includes obtaining a first domain image and a second domain image, where the first domain image and the second domain image are unpaired images in different domains; obtaining a scaled first domain image by scaling, at an iteration, the first domain image; obtaining a training patch by cropping the scaled first domain image, where each training patch has a same number of pixels with different contents; inputting the training patch into the neural network at the iteration, and outputting an output patch; calculating a contrastive loss based on a query sub-patch and negative sub-patches selected from the training patch and a corresponding positive sub-patch selected from the output patch; and updating model parameters of the neural network based on the contrastive loss and a generative adversarial network loss.

Type: Application

Filed: September 17, 2021

Publication date: March 23, 2023

Applicant: KWAI INC.

Inventors: Oliver Dayun LIU, Mengtian LI, Yi ZHENG, Haibin HUANG, Chongyang MA
METHODS AND DEVICES FOR NEURAL NETWORK QUANTIZATION USING TEMPORAL PROFILING

Publication number: 20230084000

Abstract: Methods and apparatuses are provided for temporal profiling for neural network quantization. The method includes: obtaining a neural network that comprises anode connected to different paths at different time periods; obtaining node outputs for the node at the different time periods; determining statistic properties of the node outputs at the different time periods; and determining activation ranges of the node outputs based on the statistic properties.

Type: Application

Filed: September 15, 2021

Publication date: March 16, 2023

Applicant: KWAI INC.

Inventors: Ming Kai HSU, Chao YANG, Yue MA, Sikai WANG, Sitong FENG, Wenhui CAO, Danqing LI, Hui ZHONG, Lingzhi LIU
METHODS AND DEVICES FOR EFFICIENT GENERAL DECONVOLUTION IMPLEMENTATION ON HARDWARE ACCELERATOR

Publication number: 20230075264

Abstract: Methods and devices are provided for implementing efficient general deconvolution Implementation on hardware accelerator.

Type: Application

Filed: September 7, 2021

Publication date: March 9, 2023

Applicant: KWAI INC.

Inventors: Shiya LIU, Ming Kai HSU, Quan LIN, Lingzhi LIU
TRANSFERABLE VISION TRANSFORMER FOR UNSUPERVISED DOMAIN ADAPTATION

Publication number: 20230062151

Abstract: A method and an apparatus for training a transferable vision transformer (TVT) for unsupervised domain adaption (UDA) in heterogeneous devices are provided. The method includes that a heterogeneous device including one or more graphic processing units (GPUs) loads multiple patches into the TVT which includes a transferability adaption module (TAM). Furthermore, a patch-level domain discriminator in the TAM assigns weights to the multiple patches and determines one or more transferable patches based on the weights. Moreover, the heterogeneous device generates a transferable attention output for an attention module in the TAM based on the one or more transferable patches.

Type: Application

Filed: September 24, 2021

Publication date: March 2, 2023

Applicant: KWAI INC.

Inventors: Ning XU, Jingjing LIU, Jinyu YANG
METHODS AND APPARATUSES FOR GENERATING STYLE PICTURES

Publication number: 20230054283

Abstract: A style picture generating method, an apparatus and a non-transitory computer readable storage medium thereof are provided. The method includes: obtaining one or more models by training a neural network; obtaining a plurality of interpolated models based on the one or more models; generating a plurality of pictures by the plurality of interpolated models; and generating the style picture by combining two or more pictures in the plurality of pictures using one or more model-specific alpha masks.

Type: Application

Filed: August 20, 2021

Publication date: February 23, 2023

Applicant: KWAI INC.

Inventors: Jiayi LIU, Shen WANG, Zhenyu LIAO, Huayan WANG

1 2 next