Abstract: With adaptive multiple quantization, a video or other digital media codec can adaptively select among multiple quantizers to apply to transform coefficients. The switch in quantizers can be signaled at the sequence level or frame level of the bitstream syntax, or can be implicitly specified in the syntax.
Type:
Grant
Filed:
December 15, 2021
Date of Patent:
January 31, 2023
Assignee:
Microsoft Technology Licensing, LLC
Inventors:
Thomas W. Holcomb, Chih-Lung Lin, Pohsiang Hsu, Sridhar Srinivasan
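As a rough illustration of the quantizer switching described in the abstract above, the sketch below selects between two quantizers per frame and records the choice alongside the quantized levels. The two quantizer types (uniform and dead-zone), the `quantizer_id` field, and all function names are assumptions made for illustration; the abstract does not specify which quantizers are used or how the syntax element is coded.

```python
import math
from typing import Callable, Dict, List

def quantize_uniform(coeff: float, step: float) -> int:
    # Round-to-nearest quantizer (illustrative choice, not from the source).
    return int(math.copysign(math.floor(abs(coeff) / step + 0.5), coeff))

def quantize_deadzone(coeff: float, step: float) -> int:
    # Dead-zone quantizer: truncates toward zero, widening the zero bin.
    return int(math.copysign(math.floor(abs(coeff) / step), coeff))

# Hypothetical mapping from a signaled identifier to a quantizer.
QUANTIZERS: Dict[int, Callable[[float, float], int]] = {
    0: quantize_uniform,
    1: quantize_deadzone,
}

def encode_frame(coeffs: List[float], step: float, quantizer_id: int) -> dict:
    # The frame-level "header" carries the quantizer choice explicitly,
    # standing in for frame-level signaling in a bitstream syntax.
    quantize = QUANTIZERS[quantizer_id]
    return {
        "quantizer_id": quantizer_id,
        "levels": [quantize(c, step) for c in coeffs],
    }

frame_a = encode_frame([7.0, -7.0, 3.0], step=4.0, quantizer_id=0)
frame_b = encode_frame([7.0, -7.0, 3.0], step=4.0, quantizer_id=1)
```

With the same coefficients and step size, the dead-zone quantizer maps small coefficients to zero where the uniform quantizer does not, which is the kind of trade-off a per-frame switch would let an encoder exploit.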
Abstract: An image processing method includes receiving image data and outputting an image stream including an initial group of pictures and a next group of pictures according to the image data. The initial group of pictures includes a first initial frame and at least one first non-initial frame. The next group of pictures includes a second initial frame and at least one second non-initial frame. The image processing method further includes obtaining a first ideal frame count of the next group of pictures according to a ratio relationship of frame sizes of the first initial frame and the at least one first non-initial frame of the initial group of pictures in a predetermined frame segment or a predetermined time segment and setting a frame count of the next group of pictures according to the first ideal frame count.
Abstract: Techniques for transmitting and sharing a video sequence over an ultra-low bandwidth channel, such as a short message service (SMS) channel, are disclosed herein. A video is segmented into regions of various interest levels. A set of parameters is developed from a video region of a high interest, wherein the parameters represent a mapping function of a database to model the video region. The set of parameters is transmitted over the ultra-low bandwidth channel to a remote device, wherein the remote device also has access to an instance of the database. The remote device synthesizes the video by using the mapping function of the database, which is represented by the transmitted set of parameters.
Abstract: A selected color component (R, G, B) for each of a plurality of image pixels (32, P) of an image cell (42, C) is encoded to form a corresponding color component (ER, EG, EB) of a corresponding plurality of encoded pixels (40, PE) in one-to-one correspondence with the image pixels (32, P). A bit length of each color component (ER, EG, EB) of an encoded pixel (40, PE) is less than that of a corresponding image pixel (32, P).
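The abstract above states only that each encoded color component has a shorter bit length than the original. One simple way to obtain that property, shown here purely as an assumed stand-in for the patented encoding, is to keep the most significant bits of each component. The 8-bit source depth, the 5-bit target depth, and the function names are hypothetical.

```python
from typing import List, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B)

def encode_component(value: int, src_bits: int = 8, dst_bits: int = 5) -> int:
    # Keep only the top dst_bits of a src_bits-wide component.
    if not 0 <= value < (1 << src_bits):
        raise ValueError("component out of range for source bit depth")
    return value >> (src_bits - dst_bits)

def encode_cell(pixels: List[Pixel], dst_bits: int = 5) -> List[Pixel]:
    # One encoded pixel per image pixel, preserving one-to-one correspondence.
    return [
        tuple(encode_component(c, 8, dst_bits) for c in pixel)
        for pixel in pixels
    ]

encoded = encode_cell([(255, 128, 0), (8, 7, 64)])
```

Each output pixel lines up with exactly one input pixel, and every component fits in 5 bits instead of 8.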
Abstract: Provided is a method of aligning a camera using correspondence information between multi-images. The camera alignment method using correspondences between multi-images includes defining a correspondence relation between images photographed in the multi-camera system, estimating an initial position of the camera using the correspondence relation between the images and a Structure From Motion (SFM) algorithm, redefining a changed correspondence relation between the images as a result of the estimation of the initial position of the camera using a bundle edge to generate an optimal edge, and correcting the position of the camera based on the optimal edge.
Type:
Grant
Filed:
July 29, 2014
Date of Patent:
February 21, 2017
Assignee:
ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
Abstract: A method for encoding pictures within a group of pictures using prediction, where a first reference picture from a group of pictures and a second reference picture from the subsequent group of pictures are used in predicting pictures in the group of pictures associated with the first reference picture. A plurality of anchor pictures in the group of pictures associated with the first reference picture may be predicted using both the first and second reference pictures to ensure a smooth transition between different groups of pictures within a video frame.
Abstract: An apparatus and a method of coding/decoding an image are provided. The apparatus includes: a low resolution image coding unit that codes a low resolution image corresponding to an input image using motion estimation and compensation; and a residual image coding unit that performs intra mode coding with respect to a residual image which indicates a difference between a reconstructed image of the coded low resolution image and the input image by considering temporal correlation between neighboring residual images.
Abstract: The present invention provides a method and apparatus for encoding video content into a novel temporal layered coding structure. In one embodiment of the present invention, a plurality of sub-sequence layers are defined in a layer hierarchy in accordance with the present invention. A plurality of images within a sub-sequence of a group of images are selectively assigned to a respective one of the sub-sequence layers. A grouping of images contained in any sequential combination of sub-sequence layers that includes a hierarchically lowest of the sub-sequence layers results in a group of images that temporally are substantially evenly spaced. A related method and apparatus for decoding the temporally layered video content are also disclosed.
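The even-spacing property described in the abstract above can be illustrated with a dyadic layer assignment, a common arrangement for temporal scalability. Whether the patent uses this exact assignment is not stated, so the scheme below and the names `temporal_layer` and `frames_up_to_layer` are assumptions.

```python
from typing import List

def temporal_layer(index: int, num_layers: int) -> int:
    # Assign a frame to a layer within a group of 2**(num_layers - 1) frames.
    # Layer 0 is the hierarchically lowest layer.
    group_size = 1 << (num_layers - 1)
    pos = index % group_size
    if pos == 0:
        return 0
    # Number of trailing zero bits in pos decides how low the layer is.
    trailing_zeros = (pos & -pos).bit_length() - 1
    return num_layers - 1 - trailing_zeros

def frames_up_to_layer(n_frames: int, num_layers: int, max_layer: int) -> List[int]:
    # Frames kept when decoding layers 0..max_layer only.
    return [i for i in range(n_frames)
            if temporal_layer(i, num_layers) <= max_layer]

base_only = frames_up_to_layer(8, num_layers=3, max_layer=0)
two_layers = frames_up_to_layer(8, num_layers=3, max_layer=1)
all_layers = frames_up_to_layer(8, num_layers=3, max_layer=2)
```

In this sketch, every selection that starts from layer 0 yields frame indices with a constant stride (4, 2, then 1), so dropping higher layers lowers the frame rate without producing uneven gaps.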
Abstract: A block transform-based digital media codec has a signaling scheme and bitstream syntax to flexibly signal that truncation of less significant information bits of transform coefficients coded as an optional layer of the bitstream has been performed adaptively per region or tile of the image.