Patents by Inventor Tobias Hinz

Tobias Hinz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12356002
    Abstract: A proposed intermediate way of handling the renderable portion of the first view results in more efficient coding. Instead of omitting the coding of the renderable portion completely, even more efficient coding of multi-view signals entails merely suppressing the coding of the residual signal within the renderable portion, whereas the prediction parameter coding still takes place from the non-renderable portion of the multi-view signal across the renderable portion so that prediction parameters for the renderable portion may be exploited for predicting parameters for the non-renderable portion. The additional coding rate for transmitting the prediction parameters for the renderable portion may be kept low as this merely aims at forming a continuation of the parameter history across the renderable portion to serve as a basis for prediction parameters of other portions of the multi-view signal.
    Type: Grant
    Filed: December 21, 2023
    Date of Patent: July 8, 2025
    Assignee: Dolby Video Compression, LLC
    Inventors: Sebastian Bosse, Heiko Schwarz, Thomas Wiegand, Tobias Hinz
  • Publication number: 20250217946
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.
    Type: Application
    Filed: March 7, 2025
    Publication date: July 3, 2025
    Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
  • Patent number: 12347080
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.
    Type: Grant
    Filed: March 27, 2023
    Date of Patent: July 1, 2025
    Assignee: Adobe Inc.
    Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
  • Publication number: 20250193390
    Abstract: A picture of a sequence of pictures is decoded by deriving partitioning information from the data stream, subdividing the picture into coding blocks according to the partitioning information, and decoding the picture in units of the coding blocks. Filtering information is derived from the data stream, which indicates a subdivision of the picture into filtering blocks. The subdivision into filtering blocks is used for filtering the picture.
    Type: Application
    Filed: February 21, 2025
    Publication date: June 12, 2025
    Inventors: Valeri GEORGE, Adam WIECKOWSKI, Tobias HINZ, Jens BRANDENBURG, Benjamin BROSS, Yago SÁNCHEZ DE LA FUENTE, Robert SKUPIN, Thomas SCHIERL, Detlev MARPE, Thomas WIEGAND
  • Publication number: 20250168408
    Abstract: A scalable video decoder is described which is configured to reconstruct a base layer signal from a coded data stream to obtain a reconstructed base layer signal; and reconstruct an enhancement layer signal including spatially or temporally predicting a portion of an enhancement layer signal, currently to be reconstructed, from an already reconstructed portion of the enhancement layer signal to obtain an enhancement layer internal prediction signal; forming, at the portion currently to be reconstructed, a weighted average of an inter-layer prediction signal obtained from the reconstructed base layer signal, and the enhancement layer internal prediction signal to obtain an enhancement layer prediction signal such that a weighting between the inter-layer prediction signal and the enhancement layer internal prediction signal varies over different spatial frequency components; and predictively reconstructing the enhancement layer signal using the enhancement layer prediction signal.
    Type: Application
    Filed: November 22, 2024
    Publication date: May 22, 2025
    Inventors: Tobias HINZ, Haricharan LAKSHMAN, Jan STEGEMANN, Philipp HELLE, Mischa SIEKMANN, Karsten SUEHRING, Detlev MARPE, Heiko SCHWARZ, Christian BARTNIK, Ali Atef Ibrahim KHAIRAT ABDELHAMID, Heiner KIRCHHOFFER, Thomas WIEGAND
  • Publication number: 20250159257
    Abstract: A method for decoding a predetermined block of a picture using intra-prediction by reading, for each of predetermined intra-prediction blocks, from a data stream, a mode index identifying a matrix-based intra-prediction mode is disclosed. Samples of each predetermined intra-prediction block are predicted by computing a matrix-vector product between an input vector derived from reference samples and a prediction matrix associated with the identified matrix-based intra-prediction mode (k) and associating components of an output vector obtained by the matrix-vector product onto sample positions of the predetermined block. Further predetermined intra-predicted blocks of the picture are predicted to obtain a prediction signal. A transformation flag is decoded from the data stream using context adaptive binary arithmetic coding, and a prediction residual is decoded and re-transformed using a reverse transformation based on the transformation flag to obtain a prediction residual signal.
    Type: Application
    Filed: January 16, 2025
    Publication date: May 15, 2025
    Inventors: Jonathan PFAFF, Björn STALLENBERGER, Michael SCHAFER, Philipp MERKLE, Tobias HINZ, Philipp HELLE, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND, Benjamin BROSS, Martin WINKEN, Mischa SIEKMANN
  • Publication number: 20250150593
    Abstract: A video decoder includes one or more processors that are configured to: determine that a picture is in a 4:4:4 color sampling format, determine, based at least in part on an index signaled in a data stream, a matrix-based intra prediction (MIP) mode, decode a luma block of the picture using the MIP mode, determine, for a chroma block of the picture, whether a coding tree of the chroma block is a single tree, select an intra prediction mode for decoding the chroma block, and decode the chroma block using the selected intra prediction mode. The selected intra prediction mode is the MIP mode in response to the coding tree of the chroma block being the single tree, and the selected intra prediction mode is a planar intra prediction mode in response to the coding tree of the chroma block not being the single tree.
    Type: Application
    Filed: January 9, 2025
    Publication date: May 8, 2025
    Inventors: Jonathan PFAFF, Tobias HINZ, Philipp HELLE, Philipp MERKLE, Björn STALLENBERGER, Michael SCHÄFER, Benjamin BROSS, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND
  • Publication number: 20250117968
    Abstract: Methods, non-transitory computer readable media, apparatuses, and systems for high-resolution image generation using diffusion models include obtaining a prompt and generating, using a first diffusion model, a predicted denoised image at a first resolution based on the prompt. The predicted denoised image is generated at a first intermediate diffusion step of the first diffusion model. The predicted denoised image is upsampled to obtain an upsampled denoised image at a second resolution that is higher than the first resolution. A second diffusion model then generates an output image at the second resolution based on the prompt and the upsampled denoised image.
    Type: Application
    Filed: October 5, 2023
    Publication date: April 10, 2025
    Inventors: Tobias Hinz, Siddharth Iyer
  • Publication number: 20250117971
    Abstract: A method, apparatus, non-transitory computer readable medium, apparatus, and system for video generation include first obtaining a training set including a training video. Then, embodiments initialize a video generation model, sample a subnet architecture from an architecture search space, and a identify a subset of the weights of the video generation model based on the sampled subnet architecture. Subsequently, embodiments train, based on the training video, a subnet of the video generation model to generate synthetic video data. The subnet includes a subset of the weights of the video generation model.
    Type: Application
    Filed: August 27, 2024
    Publication date: April 10, 2025
    Inventors: Feng Liu, Zhengang Li, Yan Kang, Yuchen Liu, Difan Liu, Tobias Hinz
  • Publication number: 20250106397
    Abstract: The present invention concerns an apparatus configured to partition a picture into leaf blocks using recursive multi-tree partitioning, block-based encode the picture into a data stream using the partitioning of the picture into the leaf blocks, wherein the apparatus is configured to, in partitioning the picture into the leaf blocks, for a predetermined block which extends beyond a boundary of the picture, reduce an available set of split modes depending on a position at which the boundary of the picture crosses the predetermined block in order to obtain a reduced set of one or more split modes, wherein the apparatus is configured to signal a selected split mode in the data stream.
    Type: Application
    Filed: December 9, 2024
    Publication date: March 27, 2025
    Inventors: Adam WIECKOWSKI, Valeri GEORGE, Tobias HINZ, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND, Jackie MA, Jens BRANDENBURG
  • Patent number: 12260530
    Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.
    Type: Grant
    Filed: March 27, 2023
    Date of Patent: March 25, 2025
    Assignee: Adobe Inc.
    Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
  • Patent number: 12244864
    Abstract: A method for decoding a predetermined block of a picture using intra-prediction by reading, for each of predetermined intra-prediction blocks, from a data stream, a mode index identifying a matrix-based intra-prediction mode is disclosed. Samples of each predetermined intra-prediction block are predicted by computing a matrix-vector product between an input vector derived from reference samples and a prediction matrix associated with the identified matrix-based intra-prediction mode (k)—and associating components of an output vector obtained by the matrix-vector product onto sample positions of the predetermined block. Further predetermined intra-predicted blocks of the picture are predicted to obtain a prediction signal. A transformation flag is decoded from the data stream using context adaptive binary arithmetic coding, and a prediction residual is decoded and re-transformed using a reverse transformation based on the transformation flag to obtain a prediction residual signal.
    Type: Grant
    Filed: May 12, 2024
    Date of Patent: March 4, 2025
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Jonathan Pfaff, Björn Stallenberger, Michael Schafer, Philipp Merkle, Tobias Hinz, Philipp Helle, Heiko Schwarz, Detlev Marpe, Thomas Wiegand, Benjamin Bross, Martin Winken, Mischa Siekmann
  • Patent number: 12225198
    Abstract: A block-based decoder is configured to partition a picture of more than one color component and of a color sampling format, according to which each color component is equally sampled, into blocks using a partitioning scheme in which the picture is equally partitioned with respect to each color component. A first color component is decoded in units of the blocks with selecting, for each of intra-predicted first color component blocks, one out of a first set of intra-prediction modes. The first set comprises matrix-based intra prediction modes according to each of which a block inner is predicted by deriving a sample value vector out of neighboring reference samples, computing a matrix-vector product between the sample value vector and a prediction matrix associated with the respective matrix-based intra prediction mode to obtain a prediction vector, and predicting samples in the block inner on the basis of the prediction vector.
    Type: Grant
    Filed: April 1, 2021
    Date of Patent: February 11, 2025
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Jonathan Pfaff, Tobias Hinz, Philipp Helle, Philipp Merkle, Björn Stallenberger, Michael Schäfer, Benjamin Bross, Heiko Schwarz, Detlev Marpe, Thomas Wiegand
  • Publication number: 20250047843
    Abstract: Video codec for supporting temporal inter-prediction, configured to perform padding of an area of a referenced portion of a reference picture which extends beyond a border of the reference picture, which referenced portion is referenced by an inter predicted block of a current picture by selecting one of a plurality of intra-prediction modes, and padding the area using the selected intra-prediction mode.
    Type: Application
    Filed: October 22, 2024
    Publication date: February 6, 2025
    Inventors: Jens BRANDENBURG, Tobias HINZ, Adam WIECKOWSKI, Jackie MA, Valeri GEORGE, Christian LEHMANN, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND, Robert SKUPIN, Yago SÁNCHEZ DE LA FUENTE, Thomas SCHIERL
  • Patent number: 12211178
    Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for combining digital images. In particular, in one or more embodiments, the disclosed systems combine latent codes of a source digital image and a target digital image utilizing a blending network to determine a combined latent encoding and generate a combined digital image from the combined latent encoding utilizing a generative neural network. In some embodiments, the disclosed systems determine an intersection face mask between the source digital image and the combined digital image utilizing a face segmentation network and combine the source digital image and the combined digital image utilizing the intersection face mask to generate a blended digital image.
    Type: Grant
    Filed: April 21, 2022
    Date of Patent: January 28, 2025
    Assignee: Adobe Inc.
    Inventors: Tobias Hinz, Shabnam Ghadar, Richard Zhang, Ratheesh Kalarot, Jingwan Lu, Elya Shechtman
  • Publication number: 20250030847
    Abstract: Techniques for block wise encoding and decoding may be applied to encoders, decoders, and methods for encoding or decoding and include assigning, based on a first signalization in the data stream, the predetermined block to the first set or the second set, sorting the assigned set of intra-prediction modes according to intra-prediction modes used for neighboring blocks, neighboring the predetermined block, to obtain a list of intra prediction modes, deriving, for the predetermined block, from the data stream, an index into the list of intra prediction modes, predicting the predetermined block using an intra prediction mode onto which the index points. The decoder or encoder, if the assigned set is the first set of intra-prediction modes, in sorting the assigned set, uses a second mapping which maps each intra-prediction mode of the second set of prediction modes onto a representative one in the first set of intra-prediction modes.
    Type: Application
    Filed: October 2, 2024
    Publication date: January 23, 2025
    Inventors: Jonathan PFAFF, Heiko SCHWARZ, Philipp HELLE, Michael SCHÄFER, Roman RISCHKE, Tobias HINZ, Philipp MERKLE, Björn STALLENBERGER, Martin WINKEN, Mischa SIEKMANN, Detlev MARPE, Thomas WIEGAND
  • Publication number: 20250030853
    Abstract: A video decoder for decoding an encoded video signal including encoded picture data and indication data of a picture of a video to reconstruct the picture of the video is provided. The video decoder includes an interface configured for receiving the encoded video signal, and a data decoder configured for reconstructing the picture of the video by decoding the encoded picture data using the indication data. The picture is partitioned into a plurality of coding areas. One or more coding areas of the plurality of coding areas include two or more coding tree units of the plurality of coding tree units, wherein each coding area of the one or more coding areas which includes two or more coding tree units exhibits a coding order for the two or more coding tree units of the coding area.
    Type: Application
    Filed: October 4, 2024
    Publication date: January 23, 2025
    Inventors: Valeri GEORGE, Tobias HINZ, Jackie MA, Yago SÁNCHEZ DE LA FUENTE, Robert SKUPIN, Thomas SCHIERL, Jens BRANDENBURG, Christian LEHMANN, Adam WIECKOWSKI, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND
  • Patent number: 12192461
    Abstract: The present invention concerns an apparatus configured to partition a picture into leaf blocks using recursive multi-tree partitioning, block-based encode the picture into a data stream using the partitioning of the picture into the leaf blocks, wherein the apparatus is configured to, in partitioning the picture into the leaf blocks, for a predetermined block which extends beyond a boundary of the picture, reduce an available set of split modes depending on a position at which the boundary of the picture crosses the predetermined block in order to obtain a reduced set of one or more split modes, wherein the apparatus is configured to signal a selected split mode in the data stream.
    Type: Grant
    Filed: November 1, 2023
    Date of Patent: January 7, 2025
    Assignee: Fraunhofer-Gesellschaft zur Förderung derangewandten Forschung e.V.
    Inventors: Adam Wieckowski, Valeri George, Tobias Hinz, Heiko Schwarz, Detlev Marpe, Thomas Wiegand, Jackie Ma, Jens Brandenburg
  • Publication number: 20240430452
    Abstract: An apparatus for block-based predictive decoding of a picture comprises a combiner configured to combine a residual signal a predetermined block of the picture and a reference signal for the predetermined block so as to obtain a first spectrum, the residual signal correcting a prediction error of a prediction of the predetermined block of the picture; a reducer configured to perform thresholding on the first spectrum to obtain a second spectrum so that coefficients below a threshold value are set to a predefined value; an extractor configured to obtain from the second spectrum a modified version of the residual signal; and a reconstructor block configured to decode the predetermined block of the picture from the data stream on the basis of the modified version of the residual signal.
    Type: Application
    Filed: September 10, 2024
    Publication date: December 26, 2024
    Inventors: Phan Hoang Tung NGUYEN, Gerhard TECH, Jonathan PFAFF, Michael SCHÄFER, Jennifer RASCH, Tobias HINZ, Heiko SCHWARZ, Detlev MARPE, Thomas WIEGAND
  • Patent number: 12160571
    Abstract: Techniques for block wise encoding and decoding may be applied to encoders, decoders, and methods for encoding or decoding. In one example, there is provided a technique to decode or encode a predetermined block of the picture by assigning, based on a first signalization in the data stream, the predetermined block to a first set or a second set; sorting the assigned set of intra-prediction modes according to intra-prediction modes used for neighboring blocks, neighboring the predetermined block, to obtain a list of intra prediction modes; deriving, for the predetermined block, from the data stream, an index into the list of intra prediction modes; and predicting the predetermined block using an intra prediction mode onto which the index points.
    Type: Grant
    Filed: May 4, 2023
    Date of Patent: December 3, 2024
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Jonathan Pfaff, Heiko Schwarz, Philipp Helle, Michael Schäfer, Roman Rischke, Tobias Hinz, Philipp Merkle, Björn Stallenberger, Martin Winken, Mischa Siekmann, Detlev Marpe, Thomas Wiegand