Patents by Inventor Yuanyi XUE
Yuanyi XUE has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12284360Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.Type: GrantFiled: October 19, 2023Date of Patent: April 22, 2025Assignees: DISNEY ENTERPRISES, INC., ETH ZÜRICH (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
-
Publication number: 20250126309Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. The first representation is analyzed to determine a first list of potential bitrates for the first portion of video. The method analyzes potential bitrates and quality associated with the respective potential bitrates to refine the first list of potential bitrates to a second list of bitrates. The second list of bitrates includes a different list of bitrates than the first list of potential bitrates. The method outputs the second list of bitrates for encoding the first portion of video.Type: ApplicationFiled: December 20, 2024Publication date: April 17, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
-
Patent number: 12278969Abstract: A system includes a machine learning (ML) model-based video downsampler configured to receive an input video sequence having a first display resolution, and to map the input video sequence to a lower resolution video sequence having a second display resolution lower than the first display resolution. The system also includes a neural network-based (NN-based) proxy video codec configured to transform the lower resolution video sequence into a decoded proxy bitstream. In addition, the system includes an upsampler configured to produce an output video sequence using the decoded proxy bitstream.Type: GrantFiled: August 4, 2023Date of Patent: April 15, 2025Assignees: Disney Enterprises, Inc., ETH Zurich (Eidgenossische Technische Hochschule Zurich)Inventors: Christopher Richard Schroers, Roberto Gerson de Albuquerque Azevedo, Nicholas David Gregory, Yuanyi Xue, Scott Labrozzi, Abdelaziz Djelouah
-
Publication number: 20250117909Abstract: In some embodiments, a method receives a first image and a second image for a comparison of film grain. The first image and the second image are converted from a spatial domain to a frequency domain to generate a first frequency domain representation for the first image and a second frequency domain representation of the second image. The method compares a first distribution of frequency components from the first frequency domain representation to a second distribution of frequency components from the second frequency domain representation. A score for an assessment of differences of the film grain in the first image and the second image is generated based on the comparing.Type: ApplicationFiled: September 18, 2024Publication date: April 10, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: Xuewei Meng, Wenhao Zhang, Chen Liu, Xuchang Huangfu, Yuanyi Xue
-
Publication number: 20250106408Abstract: In some embodiments, a method analyzes flagged locations from a plurality of locations in an encoding of a video to form a cluster of locations. Draft micro-chunk boundaries for the cluster are determined based on searching for a first start location and a first end location in the encoding. The method searches in a first search range before the first start location and a second search range after the first end location for a second start location in the first search range and a second end location in the second search range. The second start location and the second end location form a micro-chunk. An encoding parameter set is determined for the micro-chunk formed by the second start location and the second end location based on content characteristics of the micro-chunk. The method uses the encoding parameter set to encode the micro-chunk for insertion in the encoding of the video.Type: ApplicationFiled: September 25, 2023Publication date: March 27, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: YUANYI XUE, Roberto Gerson De Albuquerque Azevedo, Christopher Richard Schroers, SCOTT LABROZZI, Wenhao Zhang
-
Publication number: 20250095115Abstract: In some embodiments, a grain analysis system is configured for analyzing a first video frame and outputting respective first film grain information for film grain that is included in the first video frame or configured for analyzing a second video frame and outputting second film grain information. At least one of a grain removal system and a grain synthesis system is included. The grain removal system is configured for removing the film grain from the first video frame using the first film grain information to generate a third video frame corresponding to the first video frame with film grain removed. The grain analysis system is separate from the grain removal system. The grain synthesis system is configured for synthesizing film grain for the third video frame using the first film grain information or the second film grain information. The grain analysis system is separate from the grain synthesis system.Type: ApplicationFiled: September 20, 2023Publication date: March 20, 2025Applicants: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd., ETH Zürich (Eidgenössische Technische Hochschule Zürich)Inventors: Abdelaziz Djelouah, Yang Zhang, Roberto Gerson De Albuquerque Azevedo, Elham Amin Mansour, Mingyang Song, Christopher Richard Schroers, Yuanyi Xue, Scott Labrozzi, Wenhao Zhang, Xuewei Meng, Jeroen Schulte
-
Publication number: 20250080797Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.Type: ApplicationFiled: December 13, 2022Publication date: March 6, 2025Inventors: Yuanyi XUE, Michael John BRACCO, Scott Christopher LABROZZI, Christopher Richard SCHROERS, Wenhao ZHANG
-
Patent number: 12236979Abstract: A system includes processing hardware and a memory storing software code. The processing hardware executes the software code to receive automation data for media content having a default playback experience, analyze, using the automation data, at least one parameter of the media content, and generate, based on the analyzing, one or more automation instruction(s) for at least one portion(s) of the media content. The automation instruction(s) include at least one of: one or more bounding timestamps of the media content portion(s), an increased or reduced playback speed for the media content portion(s) relative to the default playback experience, or a variable playback speed for the media content portion(s). The software code is further executed to outputs the automation instruction(s) to a media delivery platform configured to distribute and control the quality of the media content or to a media player configured to automate playback of the media content.Type: GrantFiled: July 14, 2022Date of Patent: February 25, 2025Assignee: Disney Enterprises, Inc.Inventors: Manuel Briand, Yuanyi Xue, Nathan Crowe, Scott Labrozzi, Michael Bracco, Mugdha Oltikar
-
Patent number: 12225272Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.Type: GrantFiled: February 24, 2023Date of Patent: February 11, 2025Assignees: Disney Enterprises, Inc., BEIJING YOJAJA SOFTWARE TECHNOLOGY DEVELOPMENT CO., LTD.Inventors: Yuanyi Xue, Michael John Bracco, Scott Christopher Labrozzi, Christopher Richard Schroers, Wenhao Zhang
-
Patent number: 12225252Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.Type: GrantFiled: March 6, 2023Date of Patent: February 11, 2025Assignees: Disney Enterprises, Inc., Beijing YoJaJa Software Technology Development Co., Ltd.Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi Xue, Xuchang Huangfu, Xiaobo Liu
-
Publication number: 20250016382Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.Type: ApplicationFiled: September 13, 2024Publication date: January 9, 2025Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
-
Publication number: 20240430440Abstract: In some embodiments, a method trains a first parameter of a differentiable proxy codec to encode source content based on a first loss between first compressed source content and second compressed source content that is output by a target codec. A pre-processor pre-processes a source image to output a pre-processed source image, the pre-processing being based on a second parameter. The differentiable proxy codec encodes the pre-processed source image into a compressed pre-processed source image based on the first parameter. The method determines a second loss between the source image and the compressed pre-processed source image and determines an adjustment to the first parameter based on the second loss. The adjustment is used to adjust the second parameter of the pre-processor based on the second loss.Type: ApplicationFiled: October 19, 2023Publication date: December 26, 2024Applicants: Disney Enterprises, Inc., ETH Zürich (Eidgenössische Technische Hochschule Zürich)Inventors: Yang Zhang, Mingyang Song, Christopher Richard Schroers, Tunc Ozan Aydin, Yuanyi Xue, Scott Labrozzi
-
Publication number: 20240362896Abstract: In some embodiments, a method sends information for a sample of content, a first question, and a second question for output on an interface. The first question receives, from a subject, a first response for a sample level rating for an artifact that is perceived to be visible in the sample and the second question receives, from the subject, a second response for regions in the sample that are perceived to contain the artifact. The method receives the first response for the sample level rating and the second response for regions that are perceived to contain the artifact. First responses are combined from multiple subjects to generate an opinion score for the sample and second responses are combined to generate region scores for regions. The method generates training data from the opinion score and the region scores to train a process to perform an action based on the artifacts.Type: ApplicationFiled: April 11, 2024Publication date: October 31, 2024Applicants: Disney Enterprises, Inc., Beijing Hulu Software Technology Development Co., Ltd.Inventors: Yuanyi XUE, Scott LABROZZI, Wenhao ZHANG, Christopher Richard SCHROERS, Roberto Gerson DE ALBUQUERQUE AZEVEDO, Xuchang HUANGFU, Lemei HUANG, Yang ZHANG
-
Patent number: 12120359Abstract: A system processing hardware executes a machine learning (ML) model-based video compression encoder to receive uncompressed video content and corresponding motion compensated video content, compare the uncompressed and motion compensated video content to identify an image space residual, transform the image space residual to a latent space representation of the uncompressed video content, and transform, using a trained image compression ML model, the motion compensated video content to a latent space representation of the motion compensated video content.Type: GrantFiled: March 25, 2022Date of Patent: October 15, 2024Assignees: Disney Enterprises, Inc., ETH Zürich (EIDGENÖSSISCHE TECHNISCHE HOCHSCHULE ZÜRICH)Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson De Albuquerque Azevedo, Scott Labrozzi, Christopher Richard Schroers, Yuanyi Xue
-
Publication number: 20240305842Abstract: In some embodiments, a method generates a first representation of a first relationship between bitrate and quality based on first features of a first portion of a video. Also, the method generates a second representation of a second relationship between bitrate and quality based on second features of a second portion of a video. The first representation is analyzed to determine a first list of bitrates for the first portion of video and the second representation is analyzed to determine a second list of bitrates for the second portion of video. The first list of bitrates is different from the second list of bitrates. The method outputs the first list of bitrates for use encoding the first portion of video and the second list of bitrates for use encoding the second portion of video.Type: ApplicationFiled: March 6, 2023Publication date: September 12, 2024Applicants: Beijing Hulu Software Technology Development Co., Ltd., Disney Enterprises, Inc.Inventors: Chen Liu, Wenhao Zhang, Scott Labrozzi, Yuanyi XUE, Xuchang Huangfu, Xiaobo Liu
-
Patent number: 12087024Abstract: According to one implementation, an image compression system includes a computing platform having a hardware processor and a system memory storing a software code. The hardware processor executes the software code to receive an input image, transform the input image to a latent space representation of the input image, and quantize the latent space representation of the input image to produce multiple quantized latents. The hardware processor further executes the software code to encode the quantized latents using a probability density function of the latent space representation of the input image, to generate a bitstream, and convert the bitstream into an output image corresponding to the input image. The probability density function of the latent space representation of the input image is obtained based on a normalizing flow mapping of one of the input image or the latent space representation of the input image.Type: GrantFiled: March 6, 2020Date of Patent: September 10, 2024Assignees: Disney Enterprises, Inc., ETH ZurichInventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Scott Labrozzi, Yuanyi Xue, Erika Varis Doggett, Jared McPhillen, Christopher Richard Schroers
-
Publication number: 20240283957Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.Type: ApplicationFiled: May 2, 2024Publication date: August 22, 2024Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue
-
Publication number: 20240196066Abstract: One embodiment of the present invention sets forth a technique for inserting content into a media program. The technique includes determining a plurality of markers corresponding to a plurality of locations within a media program. The technique also includes for each marker included in the plurality of markers, automatically analyzing a first set of intervals within the media program that lead up to the marker and a second set of intervals within the media program that immediately follow the marker and determine a set of audio characteristics associated with the first set of intervals and the second set of intervals. The technique further includes generating a plurality of scores for the plurality of markers based on the set of audio characteristics for each marker and inserting additional content at one or more markers included in the plurality of markers based on the plurality of scores.Type: ApplicationFiled: February 24, 2023Publication date: June 13, 2024Inventors: Yuanyi XUE, Michael John BRACCO, Scott Christopher LABROZZI, Christopher Richard SCHROERS, Wenhao ZHANG
-
Patent number: 12010369Abstract: Embodiments provide for improved stream generation. A first encoded segment is generated by encoding a first segment, of a plurality of segments in a media asset, using a first bitrate of a plurality of bitrates specified in an encoding ladder. A second encoded segment is generated by encoding the first segment using a second bitrate, where the second bitrate is lower than the first bitrate. Upon receiving a request for the first segment at the first bitrate, the second encoded segment is output based at least in part on determining that a first quality of the second encoded segment is within a tolerance of a second quality of the first encoded segment.Type: GrantFiled: January 31, 2023Date of Patent: June 11, 2024Assignee: Disney Enterprises, Inc.Inventors: Scott C. Labrozzi, Chetan K. Mathur, Yuanyi Xue, Michael J. Bracco
-
Patent number: 12010335Abstract: A system includes a machine learning (ML) model-based video encoder configured to receive an uncompressed video sequence including multiple video frames, determine, from among the multiple video frames, a first video frame subset and a second video frame subset, encode the first video frame subset to produce a first compressed video frame subset, and identify a first decompression data for the first compressed video frame subset. The ML model-based video encoder is further configured to encode the second video frame subset to produce a second compressed video frame subset, and identify a second decompression data for the second compressed video frame subset. The first decompression data is specific to decoding the first compressed video frame subset but not the second compressed video frame subset, and the second decompression data is specific to decoding the second compressed video frame subset but not the first compressed video frame subset.Type: GrantFiled: March 25, 2022Date of Patent: June 11, 2024Assignee: Disney Enterprises, Inc.Inventors: Abdelaziz Djelouah, Leonhard Markus Helminger, Roberto Gerson de Albuquerque Azevedo, Christopher Richard Schroers, Scott Labrozzi, Yuanyi Xue