Patents by Inventor Tobias Hinz

Tobias Hinz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multi-view coding with effective handling of renderable portions

Patent number: 12356002

Abstract: A proposed intermediate way of handling the renderable portion of the first view results in more efficient coding. Instead of omitting the coding of the renderable portion completely, even more efficient coding of multi-view signals entails merely suppressing the coding of the residual signal within the renderable portion, whereas the prediction parameter coding still takes place from the non-renderable portion of the multi-view signal across the renderable portion so that prediction parameters for the renderable portion may be exploited for predicting parameters for the non-renderable portion. The additional coding rate for transmitting the prediction parameters for the renderable portion may be kept low as this merely aims at forming a continuation of the parameter history across the renderable portion to serve as a basis for prediction parameters of other portions of the multi-view signal.

Type: Grant

Filed: December 21, 2023

Date of Patent: July 8, 2025

Assignee: Dolby Video Compression, LLC

Inventors: Sebastian Bosse, Heiko Schwarz, Thomas Wiegand, Tobias Hinz
GENERATING A MODIFIED DIGITAL IMAGE UTILIZING A HUMAN INPAINTING MODEL

Publication number: 20250217946

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 7, 2025

Publication date: July 3, 2025

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
Human inpainting utilizing a segmentation branch for generating an infill segmentation map

Patent number: 12347080

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Grant

Filed: March 27, 2023

Date of Patent: July 1, 2025

Assignee: Adobe Inc.

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
ENCODING AND DECODING A PICTURE USING FILTERING BLOCKS

Publication number: 20250193390

Abstract: A picture of a sequence of pictures is decoded by deriving partitioning information from the data stream, subdividing the picture into coding blocks according to the partitioning information, and decoding the picture in units of the coding blocks. Filtering information is derived from the data stream, which indicates a subdivision of the picture into filtering blocks. The subdivision into filtering blocks is used for filtering the picture.

Type: Application

Filed: February 21, 2025

Publication date: June 12, 2025

Inventors: Valeri GEORGE, Adam WIECKOWSKI, Tobias HINZ, Jens BRANDENBURG, Benjamin BROSS, Yago SÁNCHEZ DE LA FUENTE, Robert SKUPIN, Thomas SCHIERL, Detlev MARPE, Thomas WIEGAND
SCALABLE VIDEO CODING USING INTER-LAYER PREDICTION CONTRIBUTION TO ENHANCEMENT LAYER PREDICTION

Publication number: 20250168408

Abstract: A scalable video decoder is described which is configured to reconstruct a base layer signal from a coded data stream to obtain a reconstructed base layer signal; and reconstruct an enhancement layer signal including spatially or temporally predicting a portion of an enhancement layer signal, currently to be reconstructed, from an already reconstructed portion of the enhancement layer signal to obtain an enhancement layer internal prediction signal; forming, at the portion currently to be reconstructed, a weighted average of an inter-layer prediction signal obtained from the reconstructed base layer signal, and the enhancement layer internal prediction signal to obtain an enhancement layer prediction signal such that a weighting between the inter-layer prediction signal and the enhancement layer internal prediction signal varies over different spatial frequency components; and predictively reconstructing the enhancement layer signal using the enhancement layer prediction signal.

Type: Application

Filed: November 22, 2024

Publication date: May 22, 2025

Inventors: Tobias HINZ, Haricharan LAKSHMAN, Jan STEGEMANN, Philipp HELLE, Mischa SIEKMANN, Karsten SUEHRING, Detlev MARPE, Heiko SCHWARZ, Christian BARTNIK, Ali Atef Ibrahim KHAIRAT ABDELHAMID, Heiner KIRCHHOFFER, Thomas WIEGAND
EFFICIENT IMPLEMENTATION OF MATRIX-BASED INTRA-PREDICTION

Publication number: 20250159257

Abstract: A method for decoding a predetermined block of a picture using intra-prediction by reading, for each of predetermined intra-prediction blocks, from a data stream, a mode index identifying a matrix-based intra-prediction mode is disclosed. Samples of each predetermined intra-prediction block are predicted by computing a matrix-vector product between an input vector derived from reference samples and a prediction matrix associated with the identified matrix-based intra-prediction mode (k) and associating components of an output vector obtained by the matrix-vector product onto sample positions of the predetermined block. Further predetermined intra-predicted blocks of the picture are predicted to obtain a prediction signal. A transformation flag is decoded from the data stream using context adaptive binary arithmetic coding, and a prediction residual is decoded and re-transformed using a reverse transformation based on the transformation flag to obtain a prediction residual signal.

Type: Application

Filed: January 16, 2025

Publication date: May 15, 2025

Inventors: Jonathan PFAFF, Björn STALLENBERGER, Michael SCHAFER, Philipp MERKLE, Tobias HINZ, Philipp HELLE, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND, Benjamin BROSS, Martin WINKEN, Mischa SIEKMANN
MIP FOR ALL CHANNELS IN THE CASE OF 4:4:4-CHROMA FORMAT AND OF SINGLE TREE

Publication number: 20250150593

Abstract: A video decoder includes one or more processors that are configured to: determine that a picture is in a 4:4:4 color sampling format, determine, based at least in part on an index signaled in a data stream, a matrix-based intra prediction (MIP) mode, decode a luma block of the picture using the MIP mode, determine, for a chroma block of the picture, whether a coding tree of the chroma block is a single tree, select an intra prediction mode for decoding the chroma block, and decode the chroma block using the selected intra prediction mode. The selected intra prediction mode is the MIP mode in response to the coding tree of the chroma block being the single tree, and the selected intra prediction mode is a planar intra prediction mode in response to the coding tree of the chroma block not being the single tree.

Type: Application

Filed: January 9, 2025

Publication date: May 8, 2025

Inventors: Jonathan PFAFF, Tobias HINZ, Philipp HELLE, Philipp MERKLE, Björn STALLENBERGER, Michael SCHÄFER, Benjamin BROSS, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND
HIGH-RESOLUTION IMAGE GENERATION USING DIFFUSION MODELS

Publication number: 20250117968

Abstract: Methods, non-transitory computer readable media, apparatuses, and systems for high-resolution image generation using diffusion models include obtaining a prompt and generating, using a first diffusion model, a predicted denoised image at a first resolution based on the prompt. The predicted denoised image is generated at a first intermediate diffusion step of the first diffusion model. The predicted denoised image is upsampled to obtain an upsampled denoised image at a second resolution that is higher than the first resolution. A second diffusion model then generates an output image at the second resolution based on the prompt and the upsampled denoised image.

Type: Application

Filed: October 5, 2023

Publication date: April 10, 2025

Inventors: Tobias Hinz, Siddharth Iyer
VIDEO DIFFUSION USING SUPERPOSITION NETWORK ARCHITECTURE SEARCH

Publication number: 20250117971

Abstract: A method, apparatus, non-transitory computer readable medium, apparatus, and system for video generation include first obtaining a training set including a training video. Then, embodiments initialize a video generation model, sample a subnet architecture from an architecture search space, and a identify a subset of the weights of the video generation model based on the sampled subnet architecture. Subsequently, embodiments train, based on the training video, a subnet of the video generation model to generate synthetic video data. The subnet includes a subset of the weights of the video generation model.

Type: Application

Filed: August 27, 2024

Publication date: April 10, 2025

Inventors: Feng Liu, Zhengang Li, Yan Kang, Yuchen Liu, Difan Liu, Tobias Hinz
APPARATUS AND METHOD FOR ENCODING AND DECODING A PICTURE USING PICTURE BOUNDARY HANDLING

Publication number: 20250106397

Abstract: The present invention concerns an apparatus configured to partition a picture into leaf blocks using recursive multi-tree partitioning, block-based encode the picture into a data stream using the partitioning of the picture into the leaf blocks, wherein the apparatus is configured to, in partitioning the picture into the leaf blocks, for a predetermined block which extends beyond a boundary of the picture, reduce an available set of split modes depending on a position at which the boundary of the picture crosses the predetermined block in order to obtain a reduced set of one or more split modes, wherein the apparatus is configured to signal a selected split mode in the data stream.

Type: Application

Filed: December 9, 2024

Publication date: March 27, 2025

Inventors: Adam WIECKOWSKI, Valeri GEORGE, Tobias HINZ, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND, Jackie MA, Jens BRANDENBURG
Generating a modified digital image utilizing a human inpainting model

Patent number: 12260530

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Grant

Filed: March 27, 2023

Date of Patent: March 25, 2025

Assignee: Adobe Inc.

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
Efficient implementation of matrix-based intra-prediction

Patent number: 12244864

Abstract: A method for decoding a predetermined block of a picture using intra-prediction by reading, for each of predetermined intra-prediction blocks, from a data stream, a mode index identifying a matrix-based intra-prediction mode is disclosed. Samples of each predetermined intra-prediction block are predicted by computing a matrix-vector product between an input vector derived from reference samples and a prediction matrix associated with the identified matrix-based intra-prediction mode (k)—and associating components of an output vector obtained by the matrix-vector product onto sample positions of the predetermined block. Further predetermined intra-predicted blocks of the picture are predicted to obtain a prediction signal. A transformation flag is decoded from the data stream using context adaptive binary arithmetic coding, and a prediction residual is decoded and re-transformed using a reverse transformation based on the transformation flag to obtain a prediction residual signal.

Type: Grant

Filed: May 12, 2024

Date of Patent: March 4, 2025

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Jonathan Pfaff, Björn Stallenberger, Michael Schafer, Philipp Merkle, Tobias Hinz, Philipp Helle, Heiko Schwarz, Detlev Marpe, Thomas Wiegand, Benjamin Bross, Martin Winken, Mischa Siekmann
MIP for all channels in the case of 4:4:4-chroma format and of single tree

Patent number: 12225198

Abstract: A block-based decoder is configured to partition a picture of more than one color component and of a color sampling format, according to which each color component is equally sampled, into blocks using a partitioning scheme in which the picture is equally partitioned with respect to each color component. A first color component is decoded in units of the blocks with selecting, for each of intra-predicted first color component blocks, one out of a first set of intra-prediction modes. The first set comprises matrix-based intra prediction modes according to each of which a block inner is predicted by deriving a sample value vector out of neighboring reference samples, computing a matrix-vector product between the sample value vector and a prediction matrix associated with the respective matrix-based intra prediction mode to obtain a prediction vector, and predicting samples in the block inner on the basis of the prediction vector.

Type: Grant

Filed: April 1, 2021

Date of Patent: February 11, 2025

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Jonathan Pfaff, Tobias Hinz, Philipp Helle, Philipp Merkle, Björn Stallenberger, Michael Schäfer, Benjamin Bross, Heiko Schwarz, Detlev Marpe, Thomas Wiegand
APPARATUS FOR SELECTING AN INTRA-PREDICTION MODE FOR PADDING

Publication number: 20250047843

Abstract: Video codec for supporting temporal inter-prediction, configured to perform padding of an area of a referenced portion of a reference picture which extends beyond a border of the reference picture, which referenced portion is referenced by an inter predicted block of a current picture by selecting one of a plurality of intra-prediction modes, and padding the area using the selected intra-prediction mode.

Type: Application

Filed: October 22, 2024

Publication date: February 6, 2025

Inventors: Jens BRANDENBURG, Tobias HINZ, Adam WIECKOWSKI, Jackie MA, Valeri GEORGE, Christian LEHMANN, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND, Robert SKUPIN, Yago SÁNCHEZ DE LA FUENTE, Thomas SCHIERL
Transferring faces between digital images by combining latent codes utilizing a blending network

Patent number: 12211178

Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for combining digital images. In particular, in one or more embodiments, the disclosed systems combine latent codes of a source digital image and a target digital image utilizing a blending network to determine a combined latent encoding and generate a combined digital image from the combined latent encoding utilizing a generative neural network. In some embodiments, the disclosed systems determine an intersection face mask between the source digital image and the combined digital image utilizing a face segmentation network and combine the source digital image and the combined digital image utilizing the intersection face mask to generate a blended digital image.

Type: Grant

Filed: April 21, 2022

Date of Patent: January 28, 2025

Assignee: Adobe Inc.

Inventors: Tobias Hinz, Shabnam Ghadar, Richard Zhang, Ratheesh Kalarot, Jingwan Lu, Elya Shechtman
AFFINE LINEAR WEIGHTED INTRA PREDICTIONS

Publication number: 20250030847

Abstract: Techniques for block wise encoding and decoding may be applied to encoders, decoders, and methods for encoding or decoding and include assigning, based on a first signalization in the data stream, the predetermined block to the first set or the second set, sorting the assigned set of intra-prediction modes according to intra-prediction modes used for neighboring blocks, neighboring the predetermined block, to obtain a list of intra prediction modes, deriving, for the predetermined block, from the data stream, an index into the list of intra prediction modes, predicting the predetermined block using an intra prediction mode onto which the index points. The decoder or encoder, if the assigned set is the first set of intra-prediction modes, in sorting the assigned set, uses a second mapping which maps each intra-prediction mode of the second set of prediction modes onto a representative one in the first set of intra-prediction modes.

Type: Application

Filed: October 2, 2024

Publication date: January 23, 2025

Inventors: Jonathan PFAFF, Heiko SCHWARZ, Philipp HELLE, Michael SCHÄFER, Roman RISCHKE, Tobias HINZ, Philipp MERKLE, Björn STALLENBERGER, Martin WINKEN, Mischa SIEKMANN, Detlev MARPE, Thomas WIEGAND
ENCODER AND DECODER, ENCODING METHOD AND DECODING METHOD FOR VERSATILE SPATIAL PARTITIONING OF CODED PICTURES

Publication number: 20250030853

Abstract: A video decoder for decoding an encoded video signal including encoded picture data and indication data of a picture of a video to reconstruct the picture of the video is provided. The video decoder includes an interface configured for receiving the encoded video signal, and a data decoder configured for reconstructing the picture of the video by decoding the encoded picture data using the indication data. The picture is partitioned into a plurality of coding areas. One or more coding areas of the plurality of coding areas include two or more coding tree units of the plurality of coding tree units, wherein each coding area of the one or more coding areas which includes two or more coding tree units exhibits a coding order for the two or more coding tree units of the coding area.

Type: Application

Filed: October 4, 2024

Publication date: January 23, 2025

Inventors: Valeri GEORGE, Tobias HINZ, Jackie MA, Yago SÁNCHEZ DE LA FUENTE, Robert SKUPIN, Thomas SCHIERL, Jens BRANDENBURG, Christian LEHMANN, Adam WIECKOWSKI, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND
Apparatus and method for encoding and decoding a picture using picture boundary handling

Patent number: 12192461

Abstract: The present invention concerns an apparatus configured to partition a picture into leaf blocks using recursive multi-tree partitioning, block-based encode the picture into a data stream using the partitioning of the picture into the leaf blocks, wherein the apparatus is configured to, in partitioning the picture into the leaf blocks, for a predetermined block which extends beyond a boundary of the picture, reduce an available set of split modes depending on a position at which the boundary of the picture crosses the predetermined block in order to obtain a reduced set of one or more split modes, wherein the apparatus is configured to signal a selected split mode in the data stream.

Type: Grant

Filed: November 1, 2023

Date of Patent: January 7, 2025

Assignee: Fraunhofer-Gesellschaft zur Förderung derangewandten Forschung e.V.

Inventors: Adam Wieckowski, Valeri George, Tobias Hinz, Heiko Schwarz, Detlev Marpe, Thomas Wiegand, Jackie Ma, Jens Brandenburg
REFINED BLOCK-BASED PREDICTIVE CODING AND DECODING OF A PICTURE

Publication number: 20240430452

Abstract: An apparatus for block-based predictive decoding of a picture comprises a combiner configured to combine a residual signal a predetermined block of the picture and a reference signal for the predetermined block so as to obtain a first spectrum, the residual signal correcting a prediction error of a prediction of the predetermined block of the picture; a reducer configured to perform thresholding on the first spectrum to obtain a second spectrum so that coefficients below a threshold value are set to a predefined value; an extractor configured to obtain from the second spectrum a modified version of the residual signal; and a reconstructor block configured to decode the predetermined block of the picture from the data stream on the basis of the modified version of the residual signal.

Type: Application

Filed: September 10, 2024

Publication date: December 26, 2024

Inventors: Phan Hoang Tung NGUYEN, Gerhard TECH, Jonathan PFAFF, Michael SCHÄFER, Jennifer RASCH, Tobias HINZ, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND
Affine linear weighted intra predictions

Patent number: 12160571

Abstract: Techniques for block wise encoding and decoding may be applied to encoders, decoders, and methods for encoding or decoding. In one example, there is provided a technique to decode or encode a predetermined block of the picture by assigning, based on a first signalization in the data stream, the predetermined block to a first set or a second set; sorting the assigned set of intra-prediction modes according to intra-prediction modes used for neighboring blocks, neighboring the predetermined block, to obtain a list of intra prediction modes; deriving, for the predetermined block, from the data stream, an index into the list of intra prediction modes; and predicting the predetermined block using an intra prediction mode onto which the index points.

Type: Grant

Filed: May 4, 2023

Date of Patent: December 3, 2024

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Jonathan Pfaff, Heiko Schwarz, Philipp Helle, Michael Schäfer, Roman Rischke, Tobias Hinz, Philipp Merkle, Björn Stallenberger, Martin Winken, Mischa Siekmann, Detlev Marpe, Thomas Wiegand

1 2 3 4 5 … next