Patents by Inventor Sergey Tulyakov

Sergey Tulyakov has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Video compression system

Patent number: 11736717

Abstract: Systems and methods herein describe a video compression system. The described systems and methods access a sequence of image frames from a first computing device, the sequence of image frames comprising a first image frame and a second image frame; detect a first set of keypoints for the first image frame; transmit the first image frame and the first set of keypoints to a second computing device; detect a second set of keypoints for the second image frame; transmit the second set of keypoints to the second computing device; and cause an animated image to be displayed on the second computing device.

Type: Grant

Filed: September 30, 2021

Date of Patent: August 22, 2023

Assignee: Snap Inc.

Inventors: Sergey Demyanov, Andrew Cheng-min Lin, Walton Lin, Aleksei Podkin, Aleksei Stoliar, Sergey Tulyakov

VIDEO SYNTHESIS VIA MULTIMODAL CONDITIONING

Publication number: 20230262293

Abstract: A multimodal video generation framework (MMVID) that benefits from text and images provided jointly or separately as input. Quantized representations of videos are utilized with a bidirectional transformer with multiple modalities as inputs to predict a discrete video representation. A new video token trained with self-learning and an improved mask-prediction algorithm for sampling video tokens is used to improve video quality and consistency. Text augmentation is utilized to improve the robustness of the textual representation and diversity of generated videos. The framework incorporates various visual modalities, such as segmentation masks, drawings, and partially occluded images. In addition, the MMVID extracts visual information as suggested by a textual prompt.

Type: Application

Filed: September 30, 2022

Publication date: August 17, 2023

Inventors: Francesco Barbieri, Ligong Han, Hsin-Ying Lee, Shervin Minaee, Kyle Olszewski, Jian Ren, Sergey Tulyakov

Generative neural network distillation

Patent number: 11727280

Abstract: A compact generative neural network can be distilled from a teacher generative neural network using a training network. The compact network can be trained on the input data and output data of the teacher network. The training network trains the student network using a discrimination layer and one or more types of losses, such as perception loss and adversarial loss.

Type: Grant

Filed: March 2, 2021

Date of Patent: August 15, 2023

Assignee: Snap Inc.

Inventors: Sergey Tulyakov, Sergei Korolev, Aleksei Stoliar, Maksim Gusarov, Sergei Kotcur, Christopher Yale Crutchfield, Andrew Wan

FACIAL ANIMATION TRANSFER

Publication number: 20230252704

Abstract: Systems and methods are disclosed for generating a source image sequence using an image sensor of the computing device, the source image sequence comprising a plurality of source images depicting a head and face, identifying driving image sequence data to modify face image feature data in the source image sequence, generating, using an image transformation neural network, a modified source image sequence comprising a plurality of modified source images depicting modified versions of the head and face, and storing the modified source image sequence on the computing device.

Type: Application

Filed: April 19, 2023

Publication date: August 10, 2023

Inventors: Sergey Demyanov, Aleksei Podkin, Aliaksandr Siarohin, Aleksei Stoliar, Sergey Tulyakov

OBJECT-CENTRIC NEURAL DECOMPOSITION FOR IMAGE RE-RENDERING

Publication number: 20230215085

Abstract: Three-dimensional object representation and re-rendering systems and methods for producing a 3D representation of an object from 2D images including the object that enables object-centric rendering. A modular approach is used that optimizes a Neural Radiance Field (NeRF) model to estimate object geometry and refine camera parameters and, then, infer surface material properties and per-image lighting conditions that fit the 2D images.

Type: Application

Filed: December 28, 2022

Publication date: July 6, 2023

Inventors: Kyle Olszewski, Sergey Tulyakov, Zhengfei Kuang, Menglei Chai

FIXED-POINT MULTIPLICATION FOR NETWORK QUANTIZATION

Publication number: 20230214639

Abstract: Techniques for training a neural network having a plurality of computational layers with associated weights and activations for computational layers in fixed-point formats include determining an optimal fractional length for weights and activations for the computational layers; training a learned clipping-level with fixed-point quantization using a PACT process for the computational layers; and quantizing on effective weights that fuses a weight of a convolution layer with a weight and running variance from a batch normalization layer. A fractional length for weights of the computational layers is determined from current values of weights using the determined optimal fractional length for the weights of the computational layers. A fixed-point activation between adjacent computational layers is related using PACT quantization of the clipping-level and an activation fractional length from a node in a following computational layer.

Type: Application

Filed: December 31, 2021

Publication date: July 6, 2023

Inventors: Sumant Milind Hanumante, Qing Jin, Sergei Korolev, Denys Makoviichuk, Jian Ren, Dhritiman Sagar, Patrick Timothy McSweeney Simons, Sergey Tulyakov, Yang Wen, Richard Zhuang
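
The fixed-point idea in the abstract above can be illustrated with a small sketch. It is not the patented method: the 8-bit signed word, the squared-error criterion for picking the fractional length, and the names `quantize` and `best_frac_len` are all assumptions made for illustration, and the PACT clipping and batch-normalization fusion steps are not covered.

```python
def quantize(x, frac_len, word_len=8):
    """Round x to a signed fixed-point value with frac_len fractional bits."""
    scale = 2 ** frac_len
    lo, hi = -(2 ** (word_len - 1)), 2 ** (word_len - 1) - 1
    # Python's round() uses round-half-to-even; the integer code is then
    # clamped to the representable range of the word.
    code = min(max(round(x * scale), lo), hi)
    return code / scale


def best_frac_len(values, word_len=8):
    """Pick the fractional length with the lowest total squared error.

    Assumption: "optimal" is taken to mean minimum squared quantization
    error over the current values; ties go to the smallest length.
    """
    def error(frac_len):
        return sum((v - quantize(v, frac_len, word_len)) ** 2 for v in values)

    return min(range(word_len), key=error)


weights = [0.5, -0.25, 1.75, -1.0]
fl = best_frac_len(weights)
quantized = [quantize(w, fl) for w in weights]
```

With too few fractional bits small weights round away, and with too many the largest weight overflows the word and is clamped, which is why the search over fractional lengths has an interior optimum.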
OPTIMIZER BASED PRUNNER FOR NEURAL NETWORKS

Publication number: 20230146865

Abstract: A neural network pruning system can sparsely prune neural network models using an optimizer based approach that is agnostic to the model architecture being pruned. The neural network pruning system can prune by operating on the parameter vector of the full model and the gradient vector of the loss function with respect to the model parameters. The neural network pruning system can iteratively update parameters based on the gradients, while zeroing out as many parameters as possible based on a preconfigured penalty.

Type: Application

Filed: January 12, 2023

Publication date: May 11, 2023

Inventors: Enxu Yan, Sergey Tulyakov, Aleksei Podkin, Aleksei Stoliar
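
One standard way to update a parameter vector from its gradient while driving entries to exactly zero under a fixed penalty is proximal gradient descent with an L1 term. The sketch below uses that technique as a stand-in; the abstract does not say which optimizer the patent uses, and the toy quadratic loss, `prune_step`, and the chosen learning rate and penalty are illustrative assumptions.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero; anything within the threshold becomes 0.0."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def prune_step(params, grads, lr, penalty):
    """One proximal-gradient step on a flat parameter vector.

    Only the parameter and gradient vectors are used, so the step does not
    depend on the architecture that produced them.
    """
    return [soft_threshold(p - lr * g, lr * penalty)
            for p, g in zip(params, grads)]


def loss_grad(params, targets):
    """Gradient of the toy loss 0.5 * sum((p - t) ** 2) (an assumption)."""
    return [p - t for p, t in zip(params, targets)]


targets = [3.0, 0.05, -2.0, 0.01]
params = [1.0, 1.0, 1.0, 1.0]
for _ in range(200):
    params = prune_step(params, loss_grad(params, targets), lr=0.1, penalty=0.5)

# Fraction of parameters that were driven to exactly zero.
sparsity = sum(1 for p in params if p == 0.0) / len(params)
```

In this toy problem the two parameters whose targets are smaller than the penalty end at exactly zero, while the other two settle at their targets shrunk by the penalty.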
Facial animation transfer

Patent number: 11645798

Abstract: Systems and methods are disclosed for generating a source image sequence using an image sensor of the computing device, the source image sequence comprising a plurality of source images depicting a head and face, identifying driving image sequence data to modify face image feature data in the source image sequence, generating, using an image transformation neural network, a modified source image sequence comprising a plurality of modified source images depicting modified versions of the head and face, and storing the modified source image sequence on the computing device.

Type: Grant

Filed: June 1, 2021

Date of Patent: May 9, 2023

Assignee: Snap Inc.

Inventors: Sergey Demyanov, Aleksei Podkin, Aliaksandr Siarohin, Aleksei Stoliar, Sergey Tulyakov

MESSAGING SYSTEM WITH NEURAL HAIR RENDERING

Publication number: 20230079136

Abstract: A messaging system performs neural network hair rendering for images provided by users of the messaging system. A method of neural network hair rendering includes processing a three-dimensional (3D) model of fake hair and a first real hair image depicting a first person to generate a fake hair structure, and encoding, using a fake hair encoder neural subnetwork, the fake hair structure to generate a coded fake hair structure. The method further includes processing, using a cross-domain structure embedding neural subnetwork, the coded fake hair structure to generate a fake and real hair structure, and encoding, using an appearance encoder neural subnetwork, a second real hair image depicting a second person having a second head to generate an appearance map. The method further includes processing, using a real appearance renderer neural subnetwork, the appearance map and the fake and real hair structure to generate a synthesized real image.

Type: Application

Filed: November 15, 2022

Publication date: March 16, 2023

Inventors: Artem Bondich, Menglei Chai, Oleksandr Pyshchenko, Jian Ren, Sergey Tulyakov

Optimizer based prunner for neural networks

Patent number: 11580400

Abstract: A neural network pruning system can sparsely prune neural network models using an optimizer based approach that is agnostic to the model architecture being pruned. The neural network pruning system can prune by operating on the parameter vector of the full model and the gradient vector of the loss function with respect to the model parameters. The neural network pruning system can iteratively update parameters based on the gradients, while zeroing out as many parameters as possible based on a preconfigured penalty.

Type: Grant

Filed: September 27, 2019

Date of Patent: February 14, 2023

Assignee: Snap Inc.

Inventors: Enxu Yan, Sergey Tulyakov, Aleksei Podkin, Aleksei Stoliar

CLOUD BASED MACHINE LEARNING

Publication number: 20220405637

Abstract: Disclosed are various embodiments for implementing computational tasks in a cloud environment in one or more operating system level virtualized containers. A parameter file can specify different parameters including hardware parameters, library parameters, user code parameters, and job parameters (e.g., sets of hyperparameters). The parameter file can be converted via a mapping and implemented in a cloud-based container platform.

Type: Application

Filed: May 11, 2022

Publication date: December 22, 2022

Inventors: Eric Buehl, Jordan Hurwitz, Sergey Tulyakov, Shubham Vij

Messaging system with neural hair rendering

Patent number: 11521362

Abstract: A messaging system performs neural network hair rendering for images provided by users of the messaging system. A method of neural network hair rendering includes processing a three-dimensional (3D) model of fake hair and a first real hair image depicting a first person to generate a fake hair structure, and encoding, using a fake hair encoder neural subnetwork, the fake hair structure to generate a coded fake hair structure. The method further includes processing, using a cross-domain structure embedding neural subnetwork, the coded fake hair structure to generate a fake and real hair structure, and encoding, using an appearance encoder neural subnetwork, a second real hair image depicting a second person having a second head to generate an appearance map. The method further includes processing, using a real appearance renderer neural subnetwork, the appearance map and the fake and real hair structure to generate a synthesized real image.

Type: Grant

Filed: August 20, 2021

Date of Patent: December 6, 2022

Assignee: Snap Inc.

Inventors: Artem Bondich, Menglei Chai, Oleksandr Pyshchenko, Jian Ren, Sergey Tulyakov

IMAGE LANDMARK DETECTION

Publication number: 20220292866

Abstract: A landmark detection system can more accurately detect landmarks in images using a detection scheme that penalizes for dispersion parameters, such as variance or scale. The landmark detection system can be trained using both labeled and unlabeled training data in a semi-supervised approach. The landmark detection system can further implement tracking of an object across multiple images using landmark data.

Type: Application

Filed: June 1, 2022

Publication date: September 15, 2022

Inventors: Sergey Tulyakov, Roman Furko, Aleksei Stoliar

COMPRESSING IMAGE-TO-IMAGE MODELS WITH AVERAGE SMOOTHING

Publication number: 20220292724

Abstract: Systems and methods for compressing image-to-image models. Generative Adversarial Networks (GANs) have achieved success in generating high-fidelity images. An image compression system and method adds a novel variant to class-dependent parameters (CLADE), referred to as CLADE-Avg, which recovers the image quality without introducing extra computational cost. An extra layer of average smoothing is performed between the parameter and normalization layers. Compared to CLADE, this image compression system and method smooths abrupt boundaries, and introduces more possible values for the scaling and shift. In addition, the kernel size for the average smoothing can be selected as a hyperparameter, such as a 3×3 kernel size. This method does not introduce extra multiplications but only addition, and thus does not introduce much computational overhead, as the division can be absorbed into the parameters after training.

Type: Application

Filed: March 4, 2021

Publication date: September 15, 2022

Inventors: Jian Ren, Menglei Chai, Sergey Tulyakov, Qing Jin
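
The average-smoothing step described in the abstract above can be sketched on a tiny segmentation map. This is an illustration under assumptions, not the patented implementation: the function name `clade_avg_map`, the dictionary of per-class scale values, and the edge handling (border pixels are replicated) are all hypothetical choices.

```python
def clade_avg_map(seg, gamma, k=3):
    """Look up a per-class parameter at each pixel, then box-average it."""
    height, width = len(seg), len(seg[0])
    radius = k // 2
    # Absorb the 1 / (k * k) division into the per-class parameters up
    # front, so the smoothing loop below performs additions only.
    scaled = {cls: value / (k * k) for cls, value in gamma.items()}
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            total = 0.0
            for dy in range(-radius, radius + 1):
                for dx in range(-radius, radius + 1):
                    # Assumption: replicate border pixels at the edges.
                    yy = min(max(y + dy, 0), height - 1)
                    xx = min(max(x + dx, 0), width - 1)
                    total += scaled[seg[yy][xx]]
            out[y][x] = total
    return out


# Two classes side by side: class 0 on the left, class 1 on the right.
seg = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
gamma = {0: 0.0, 1: 9.0}
smoothed = clade_avg_map(seg, gamma)
```

Without smoothing, each row of the scale map would jump straight from 0.0 to 9.0 at the class boundary. With the 3×3 average, intermediate values appear on either side of the boundary, which is the "more possible values for the scaling and shift" effect the abstract mentions.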
MACHINE LEARNING-BASED SELECTION OF A REPRESENTATIVE VIDEO FRAME WITHIN A MESSAGING APPLICATION

Publication number: 20220207875

Abstract: Aspects of the present disclosure involve a system comprising a medium storing a program and method for machine-learning based selection of a representative video frame. The program and method provide for receiving a set of video frames; determining a first subset of frames by removing frames outside of an image quality threshold; determining a second subset by removing frames outside of an image stillness threshold; computing feature data for each frame in the second subset; providing, for each frame in the second subset, the feature data to a machine learning model (MLM), the MLM being configured to output a score for each frame in the second subset of frames based on the feature data, the MLM having been trained with a first set of images labeled based on aesthetics, and with a second set of images labeled based on image quality; and selecting a frame based on output scores.

Type: Application

Filed: December 14, 2021

Publication date: June 30, 2022

Inventors: Kavya Venkata Kota Kopparapu, Benjamin Dodson, Francesc Xavier Drudis Rius, Angus Kong, Richard Leider, Jian Ren, Sergey Tulyakov, Jiayao Yu
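
The selection pipeline in the abstract above (filter by image quality, filter by stillness, score the survivors, pick one) can be sketched as follows. Everything concrete here is an assumption for illustration: the `Frame` fields, the threshold values, and `toy_score`, which stands in for the trained machine learning model. Feature data is treated as precomputed rather than computed per frame.

```python
from dataclasses import dataclass


@dataclass
class Frame:
    index: int
    quality: float   # hypothetical image-quality measure, higher is better
    motion: float    # hypothetical motion measure, lower means more still
    features: tuple  # hypothetical precomputed feature data


def select_representative(frames, min_quality, max_motion, score_fn):
    """Return the highest-scoring frame that passes both filters, or None."""
    # First subset: drop frames outside the image quality threshold.
    first_subset = [f for f in frames if f.quality >= min_quality]
    # Second subset: drop frames outside the image stillness threshold.
    second_subset = [f for f in first_subset if f.motion <= max_motion]
    if not second_subset:
        return None
    return max(second_subset, key=lambda f: score_fn(f.features))


def toy_score(features):
    """Stand-in for the trained model: a fixed weighted sum of features."""
    weights = (0.7, 0.3)
    return sum(w * x for w, x in zip(weights, features))


frames = [
    Frame(0, quality=0.9, motion=0.1, features=(0.2, 0.9)),
    Frame(1, quality=0.4, motion=0.0, features=(1.0, 1.0)),  # fails quality
    Frame(2, quality=0.8, motion=0.6, features=(0.9, 0.9)),  # fails stillness
    Frame(3, quality=0.7, motion=0.2, features=(0.8, 0.5)),
]
best = select_representative(frames, min_quality=0.5, max_motion=0.3,
                             score_fn=toy_score)
```

Running the cheap threshold filters first means the scoring function is only evaluated on the frames in the second subset.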
COMPRESSING IMAGE-TO-IMAGE MODELS

Publication number: 20220207329

Abstract: Systems and methods herein describe an image compression system. The image compression system generates a first generative adversarial network (GAN), identifies a threshold, based on the threshold, generates a second GAN by pruning channels of the first GAN, trains the second GAN using similarity-based knowledge distillation from the first GAN, and stores the trained second GAN.

Type: Application

Filed: December 21, 2021

Publication date: June 30, 2022

Inventors: Jian Ren, Oliver Woodford, Sergey Tulyakov, Jiazhuo Wang, Qing Jin

FLOW-GUIDED MOTION RETARGETING

Publication number: 20220207786

Abstract: Systems and methods herein describe a motion retargeting system. The motion retargeting system accesses a plurality of two-dimensional images comprising a person performing a plurality of body poses, extracts a plurality of implicit volumetric representations from the plurality of body poses, generates a three-dimensional warping field, the three-dimensional warping field configured to warp the plurality of implicit volumetric representations from a canonical pose to a target pose, and based on the three-dimensional warping field, generates a two-dimensional image of an artificial person performing the target pose.

Type: Application

Filed: December 21, 2021

Publication date: June 30, 2022

Inventors: Jian Ren, Menglei Chai, Oliver Woodford, Kyle Olszewski, Sergey Tulyakov

Image landmark detection

Patent number: 11354922

Abstract: A landmark detection system can more accurately detect landmarks in images using a detection scheme that penalizes for dispersion parameters, such as variance or scale. The landmark detection system can be trained using both labeled and unlabeled training data in a semi-supervised approach. The landmark detection system can further implement tracking of an object across multiple images using landmark data.

Type: Grant

Filed: December 30, 2020

Date of Patent: June 7, 2022

Assignee: Snap Inc.

Inventors: Sergey Tulyakov, Roman Furko, Aleksei Stoliar

Cloud based machine learning

Patent number: 11334815

Abstract: Disclosed are various embodiments for implementing computational tasks in a cloud environment in one or more operating system level virtualized containers. A parameter file can specify different parameters including hardware parameters, library parameters, user code parameters, and job parameters (e.g., sets of hyperparameters). The parameter file can be converted via a mapping and implemented in a cloud-based container platform.

Type: Grant

Filed: September 28, 2018

Date of Patent: May 17, 2022

Assignee: Snap Inc.

Inventors: Eric Buehl, Jordan Hurwitz, Sergey Tulyakov, Shubham Vij

VIDEO SYNTHESIS WITHIN A MESSAGING SYSTEM

Publication number: 20220101104

Abstract: Aspects of the present disclosure involve a system comprising a computer-readable storage medium storing a program and method for video synthesis. The program and method provide for accessing a primary generative adversarial network (GAN) comprising a pre-trained image generator, a motion generator comprising a plurality of neural networks, and a video discriminator; generating an updated GAN based on the primary GAN, by performing operations comprising identifying input data of the updated GAN, the input data comprising an initial latent code and a motion domain dataset, training the motion generator based on the input data, and adjusting weights of the plurality of neural networks of the primary GAN based on an output of the video discriminator; and generating a synthesized video based on the primary GAN and the input data.

Type: Application

Filed: September 30, 2021

Publication date: March 31, 2022

Inventors: Menglei Chai, Kyle Olszewski, Jian Ren, Yu Tian, Sergey Tulyakov
