Patents by Inventor Yuanyi XUE

Yuanyi XUE has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Microdosing for low bitrate video compression

Patent number: 12382069

Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.

Type: Grant

Filed: May 2, 2024

Date of Patent: August 5, 2025

Assignees: Disney Enterprises, Inc., ETH ZÜRICH (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
OPTIMIZING INSERTION POINTS FOR CONTENT BASED ON AUDIO CHARACTERISTICS

Publication number: 20250240502

Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.

Type: Application

Filed: April 8, 2025

Publication date: July 24, 2025

Inventors: Yuanyi Xue, Michael John Bracco, Scott Christopher Labrozzi, Christopher Richard Schroers, Wenhao Zhang
Codec Rate Distortion Compensating Downsampler

Publication number: 20250211758

Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.

Type: Application

Filed: March 12, 2025

Publication date: June 26, 2025

Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
LOSSY IMAGE COMPRESSION WITH DIFFUSION MODELS

Publication number: 20250157087

Abstract: In some embodiments, a method receives a quantized latent representation of an image in a latent space. The image is encoded into a representation in the latent space and quantized to generate the quantized latent representation. A time step parameter is received that is generated based on the representation. The method performs an inverse quantization process to generate a reconstructed representation. A diffusion model performs a denoising process for a number of iterations based on the time step parameter to remove noise from the reconstructed representation to generate a denoised reconstructed representation. The denoised reconstructed representation is decoded into a reconstructed image.

Type: Application

Filed: October 18, 2024

Publication date: May 15, 2025

Applicants: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)

Inventors: Lucas Relic, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, Yuanyi Xue, Scott Labrozzi
Performance optimization of pre-processor using video proxy codec

Patent number: 12284360

Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.

Type: Grant

Filed: October 19, 2023

Date of Patent: April 22, 2025

Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
DYNAMIC SELECTION OF CANDIDATE BITRATES FOR VIDEO ENCODING

Publication number: 20250126309

Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. The first representation is analyzed to determine a first list of potential bitrates for the first portion of video. The method analyzes potential bitrates and quality associated with the respective potential bitrates to refine the first list of potential bitrates to a second list of bitrates. The second list of bitrates includes a different list of bitrates than the first list of potential bitrates. The method outputs the second list of bitrates for encoding the first portion of video.

Type: Application

Filed: December 20, 2024

Publication date: April 17, 2025

Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.

Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
Codec rate distortion compensating downsampler

Patent number: 12278969

Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.

Type: Grant

Filed: August 4, 2023

Date of Patent: April 15, 2025

Assignees: Disney Enterprises, Inc., ETH Zurich (Eidgenossische Technische Hochschule Zurich)

Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
FILM GRAIN MEASUREMENT BASED ON SUBBAND ANALYSIS IN FREQUENCY DOMAIN

Publication number: 20250117909

Abstract: In some embodiments, a method receives a first image and a second image for a comparison of film grain. The first image and the second image are converted from a spatial domain to a frequency domain to generate a first frequency domain representation for the first image and a second frequency domain representation of the second image. The method compares a first distribution of frequency components from the first frequency domain representation to a second distribution of frequency components from the second frequency domain representation. A score for an assessment of differences of the film grain in the first image and the second image is generated based on the comparing.

Type: Application

Filed: September 18, 2024

Publication date: April 10, 2025

Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.

Inventors: Xuewei Meng, Wenhao Zhang, Chen Liu, Xuchang Huangfu, Yuanyi Xue
CONTENT ADAPTIVE MICRO ENCODING OPTIMIZATION FOR VIDEO

Publication number: 20250106408

Abstract: In some embodiments, a method analyzes flagged locations from a plurality of locations in an encoding of a video to form a cluster of locations. Draft micro-chunk boundaries for the cluster are determined based on searching for a first start location and a first end location in the encoding. The method searches in a first search range before the first start location and a second search range after the first end location for a second start location in the first search range and a second end location in the second search range. The second start location and the second end location form a micro-chunk. An encoding parameter set is determined for the micro-chunk formed by the second start location and the second end location based on content characteristics of the micro-chunk. The method uses the encoding parameter set to encode the micro-chunk for insertion in the encoding of the video.

Type: Application

Filed: September 25, 2023

Publication date: March 27, 2025

Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.

Inventors: YUANYI XUE, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, SCOTT LABROZZI, Wenhao Zhang
FILM GRAIN ANALYSIS, SYNTHESIS, AND REMOVAL

Publication number: 20250095115

Abstract: In some embodiments, a grain analysis system is configured for analyzing a first video frame and outputting respective first film grain information for film grain that is included in the first video frame or configured for analyzing a second video frame and outputting second film grain information. At least one of a grain removal system and a grain synthesis system is included. The grain removal system is configured for removing the film grain from the first video frame using the first film grain information to generate a third video frame corresponding to the first video frame with film grain removed. The grain analysis system is separate from the grain removal system. The grain synthesis system is configured for synthesizing film grain for the third video frame using the first film grain information or the second film grain information. The grain analysis system is separate from the grain synthesis system.

Type: Application

Filed: September 20, 2023

Publication date: March 20, 2025

Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd., ETH Zürich (Eidgenössische Technische Hochschule Zürich)

Inventors: Abdelaziz Djelouah, Yang Zhang, Roberto Gerson De Albuquerque Azevedo, Elham Amin Mansour, Mingyang Song, Christopher Richard Schroers, Yuanyi Xue, Scott Labrozzi, Wenhao Zhang, Xuewei Meng, Jeroen Schulte
OPTIMIZING INSERTION POINTS FOR CONTENT BASED ON AUDIO AND VIDEO CHARACTERISTICS

Publication number: 20250080797

Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.

Type: Application

Filed: December 13, 2022

Publication date: March 6, 2025

Inventors: Yuanyi XUE, Michael John BRACCO, Scott Christopher LABROZZI, Christopher Richard SCHROERS, Wenhao ZHANG
Automation of media content playback

Patent number: 12236979

Abstract: A system includes processing hardware and a memory storing software code. The processing hardware executes the software code to receive automation data for media content having a default playback experience, analyze, using the automation data, at least one parameter of the media content, and generate, based on the analyzing, one or more automation instruction(s) for at least one portion(s) of the media content. The automation instruction(s) include at least one of: one or more bounding timestamps of the media content portion(s), an increased or reduced playback speed for the media content portion(s) relative to the default playback experience, or a variable playback speed for the media content portion(s). The software code is further executed to outputs the automation instruction(s) to a media delivery platform configured to distribute and control the quality of the media content or to a media player configured to automate playback of the media content.

Type: Grant

Filed: July 14, 2022

Date of Patent: February 25, 2025

Assignee: Disney Enterprises, Inc.

Inventors: Manuel Briand, Yuanyi Xue, Nathan Crowe, Scott Labrozzi, Michael Bracco, Mugdha Oltikar
Optimizing insertion points for content based on audio and video characteristics

Patent number: 12225272

Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.

Type: Grant

Filed: February 24, 2023

Date of Patent: February 11, 2025

Assignees: Disney Enterprises, Inc., BEIJING YOJAJA SOFTWARE TECHNOLOGY DEVELOPMENT CO., LTD.

Inventors: Yuanyi Xue, Michael John Bracco, Scott Christopher Labrozzi, Christopher Richard Schroers, Wenhao Zhang
Dynamic selection of candidate bitrates for video encoding

Patent number: 12225252

Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.

Type: Grant

Filed: March 6, 2023

Date of Patent: February 11, 2025

Assignees: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.

Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
Machine Learning Model-Based Video Compression

Publication number: 20250016382

Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.

Type: Application

Filed: September 13, 2024

Publication date: January 9, 2025

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
PERFORMANCE OPTIMIZATION OF PRE-PROCESSOR USING VIDEO PROXY CODEC

Publication number: 20240430440

Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.

Type: Application

Filed: October 19, 2023

Publication date: December 26, 2024

Applicants: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)

Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
SUBJECTIVE QUALITY ASSESSMENT TOOL FOR IMAGE/VIDEO ARTIFACTS

Publication number: 20240362896

Abstract: In some embodiments, a method sends information for a sample of content, a first question, and a second question for output on an interface. The first question receives, from a subject, a first response for a sample level rating for an artifact that is perceived to be visible in the sample and the second question receives, from the subject, a second response for regions in the sample that are perceived to contain the artifact. The method receives the first response for the sample level rating and the second response for regions that are perceived to contain the artifact. First responses are combined from multiple subjects to generate an opinion score for the sample and second responses are combined to generate region scores for regions. The method generates training data from the opinion score and the region scores to train a process to perform an action based on the artifacts.

Type: Application

Filed: April 11, 2024

Publication date: October 31, 2024

Applicants: Disney Enterprises, Inc., Beijing Hulu Software Technology Development Co., Ltd.

Inventors: Yuanyi XUE, Scott LABROZZI, Wenhao ZHANG, Christopher Richard SCHROERS, Roberto Gerson DE ALBUQUERQUE AZEVEDO, Xuchang HUANGFU, Lemei HUANG, Yang ZHANG
Machine learning model-based video compression

Patent number: 12120359

Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.

Type: Grant

Filed: March 25, 2022

Date of Patent: October 15, 2024

Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
DYNAMIC SELECTION OF CANDIDATE BITRATES FOR VIDEO ENCODING

Publication number: 20240305842

Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.

Type: Application

Filed: March 6, 2023

Publication date: September 12, 2024

Applicants: Beijing Hulu Software Technology Development Co., Ltd., Disney Enterprises, Inc.

Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi XUE, Xuchang Huangfu, Xiaobo Liu
Image compression using normalizing flows

Patent number: 12087024

Abstract: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.

Type: Grant

Filed: March 6, 2020

Date of Patent: September 10, 2024

Assignees: Disney Enterprises, Inc., ETH Zurich

Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Scott Labrozzi, Yuanyi Xue, Erika Varis Doggett, Jared McPhillen, Christopher Richard Schroers

1 2 3 next