Abstract: Disclosed is a method of decoding a plurality of coding units from a bitstream of video data. The method determines a coding unit structure of the plurality of coding units from a first data block of the bitstream. The coding unit structure describes a division of a coding unit into the plurality of coding units. The method decodes, according to the determined coding unit structure, bypass encoded data for the plurality of coding units from a second data block of the bitstream. The method decodes, according to the determined coding unit structure, residual data for the plurality of coding units from a third data block of the bitstream. The method then forms the plurality of coding units from the bitstream using the residual data and the bypass encoded data. A method of encoding is also disclosed.
Abstract: An image decoding method, according to the present invention, can comprise the steps of: deriving a spatial merge candidate of a current block; generating a merge candidate list for the current block on the basis of the spatial merge candidate; acquiring motion information on the current block on the basis of the merge candidate list; and performing motion compensation for the current block by using the motion information.
Abstract: A system and method for tracking the sun with a heliostat mirror is disclosed. The solar tracking system comprises: a camera configured to capture high dynamic range images of the sky, a plurality of cameras configured to capture images of the heliostat mirror, and a tracking controller. The images of the heliostat mirror include reflections of the sky. The tracking controller is configured to generate a circumsolar radiance map characterizing the brightness of at least a portion of the sky with the high dynamic range images.
Abstract: A digital loupe system is provided which can include a number of features. In one embodiment, the digital loupe system can include a stereo camera pair and a distance sensor. The system can further include a processor configured to apply a transformation to image signals from the stereo camera pair based on a distance measurement from the distance sensor and on camera calibration information. In some examples, the system can use the depth information and the calibration information to correct for parallax between the cameras to provide a multi-channel image. Ergonomic head mounting systems are also provided. In some implementations, the head mounting systems can be configurable to support the weight of a digital loupe system, including placing one or two oculars in a line of sight with an eye of a user, while improving overall ergonomics, including peripheral vision, comfort, stability, and adjustability. Methods of use are also provided.
Abstract: A method of detecting a substance includes, for each position of a plurality of positions in a field of view of an environment: emitting from a light source a light beam including a predetermined wavelength that is absorbable by a constituent of the substance, controlling at least one mirror to direct the emitted light beam to the position in the field of view, detecting by a detector light resulting from the emitted light beam, and determining whether the constituent of the substance is present at the position in the field of view based on characteristics of the detected light; generating a detection map indicating a presence or an absence of the substance at the plurality of positions in the field of view of the environment; capturing, by an image capture device, an image of the environment; and identifying a portion of the captured image having the substance based on the detection map.
Abstract: A method of controlling residual coding for decoding or encoding of a video sequence is performed by at least one processor and includes, based on a multiple transform selection (MTS) index indicating that a transform skip mode is enabled for a coded block of the video sequence, identifying an identity transform as each of a horizontal transform and a vertical transform. The method further includes, based on the MTS index indicating that the transform skip mode is not enabled for the coded block, identifying one of a discrete cosine transform (DCT), a discrete sine transform (DST), a Hadamard transform and a Haar transform, as either one or both of the horizontal transform and the vertical transform. The method further includes performing the residual coding of the coded block, using the identified horizontal transform and the identified vertical transform.
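The selection rule in the abstract above can be illustrated with a short sketch. It is not the patented method: the index value that stands for "transform skip enabled" and the table that maps the remaining index values to transform pairs are hypothetical, chosen only to show the branching.

```python
from enum import Enum


class Transform(Enum):
    IDENTITY = "identity"
    DCT = "dct"
    DST = "dst"
    HADAMARD = "hadamard"
    HAAR = "haar"


# Hypothetical: the index value that signals "transform skip enabled".
TRANSFORM_SKIP_INDEX = 0

# Hypothetical mapping from the other index values to
# (horizontal, vertical) transform pairs.
MTS_TABLE = {
    1: (Transform.DCT, Transform.DCT),
    2: (Transform.DST, Transform.DST),
    3: (Transform.DST, Transform.DCT),
    4: (Transform.DCT, Transform.DST),
}


def select_transforms(mts_index):
    """Return the (horizontal, vertical) transforms for an MTS index."""
    if mts_index == TRANSFORM_SKIP_INDEX:
        # Transform skip: the identity transform is used in both directions.
        return (Transform.IDENTITY, Transform.IDENTITY)
    return MTS_TABLE[mts_index]
```

Folding transform skip into the same index means a single lookup decides both whether a transform is applied and which one.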
Abstract: In a particular implementation, a video decoder may decode an initial motion vector predictor from the bitstream for a current block, and perform motion search in a small search window to refine the initial motion vector predictor. The initial motion vector may be rounded before being refined. The motion refinement can be based on a discontinuity measure between pixels of a current block and pixels of an adjacent block, and can also be based on gradients between the current block and adjacent blocks. The motion vector for the current block is then decoded based on the refined motion vector predictor, and the motion vector difference if there is any. The motion refinement can also be performed on the motion vector for the current block directly. Through motion refinement, the decoder may increase the motion resolution. A corresponding video encoder may choose whether to use motion refinement based on encoder decisions.
Type: Grant
Filed: June 27, 2017
Date of Patent: April 20, 2021
Assignee: InterDigital VC Holdings, Inc.
Inventors: Philippe Bordes, Fabien Racape, Tangi Poirier, Franck Galpin
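The decoder-side refinement described in this record can be sketched as a small search around the decoded predictor. The sketch below is an illustration under stated assumptions, not the patented procedure: frames are plain lists of rows, the search uses integer offsets only (the abstract also covers finer motion resolution), the discontinuity measure is a sum of absolute differences across the top and left block boundaries, and ties go to the candidate nearest the predictor. All function names are hypothetical.

```python
def boundary_discontinuity(ref, cur, x, y, size, mv):
    """Sum of absolute differences across the top and left block edges.

    Compares the first row and first column of the motion-compensated
    prediction (taken from `ref`) with the already reconstructed pixels
    just above and just left of the block in `cur`.
    """
    dx, dy = mv
    cost = 0
    for i in range(size):
        cost += abs(ref[y + dy][x + dx + i] - cur[y - 1][x + i])
        cost += abs(ref[y + dy + i][x + dx] - cur[y + i][x - 1])
    return cost


def refine_predictor(ref, cur, x, y, size, mvp, radius=1):
    """Search a (2*radius+1)^2 window around mvp for the smoothest boundary."""
    candidates = []
    for oy in range(-radius, radius + 1):
        for ox in range(-radius, radius + 1):
            mv = (mvp[0] + ox, mvp[1] + oy)
            cost = boundary_discontinuity(ref, cur, x, y, size, mv)
            # Tie-break on distance from the predictor (an assumption).
            candidates.append((cost, abs(ox) + abs(oy), mv))
    return min(candidates)[2]


def reconstruct_mv(refined_mvp, mvd=(0, 0)):
    """Final motion vector: refined predictor plus any decoded difference."""
    return (refined_mvp[0] + mvd[0], refined_mvp[1] + mvd[1])
```

Because the search uses only pixels the decoder already has, an encoder running the same search arrives at the same refined predictor without extra signaling.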
Abstract: A server comprises a communications module, a processor coupled to the communications module, and a memory coupled to the processor. The memory stores processor-executable instructions which, when executed by the processor, configure the processor to receive, via the communications module and from a remote computing device, a signal representing an indication of a damage location of a vehicle, send, via the communications module and to the remote computing device, instructions for obtaining video data of the vehicle based on the damage location, receive, via the communications module and from the remote computing device, video data of the vehicle, and process the video data to confirm an identity of the vehicle and to numerically quantify an amount of damage to the vehicle.
Type: Grant
Filed: November 26, 2019
Date of Patent: April 13, 2021
Assignee: The Toronto-Dominion Bank
Inventors: Denny Devasia Kuruvilla, Esli Gjini, Sarah Reeve, Harjot Singh Panag, Justin Seto, Naomi Sarah Feth, Anurag Thakur
Abstract: The present application discloses an iris identification system and an iris identification method thereof. The iris identification system includes: a camera module including at least two cameras, the cameras having different depths of field and being configured to capture iris images; a distance detection apparatus configured to detect a distance between a user and the camera module; a processing chip configured to find, according to the detected distance, a depth of field corresponding to the distance and control a camera having the depth of field to be turned on, and further configured to control multiple cameras in the camera module to be turned on simultaneously until one camera has captured an ideal image.
Abstract: In a method to improve the coding efficiency of high-dynamic range (HDR) images, a decoder parses sequence parameter set (SPS) data from an input coded bitstream to detect that an HDR extension syntax structure is present in the parsed SPS data. It extracts from the HDR extension syntax structure post-processing information that includes one or more of a color space enabled flag, a color enhancement enabled flag, an adaptive reshaping enabled flag, a dynamic range conversion flag, a color correction enabled flag, or an SDR viewable flag. It decodes the input bitstream to generate a preliminary output decoded signal, and generates a second output signal based on the preliminary output signal and the post-processing information.
Abstract: A multi-camera system for a component inspection comprising a table having a table top or, alternatively, another sufficiently rigid surface; a first camera having a narrow field-of-view lens; a second camera having a wide field-of-view lens linked to said first camera, wherein said first camera and said second camera are configured to move identical distances along a common axis relative to said table top or surface; and a pre-defined pattern defined on said table top or surface.
Type: Grant
Filed: May 4, 2018
Date of Patent: March 23, 2021
Assignee: Raytheon Technologies Corporation
Inventors: Ziyou Xiong, Alan Matthew Finn, Richard W. Osborne, III, Jose Miguel Pasini, Edgar A. Bernal, Ozgur Erdinc
Abstract: An image decoding method performed by a decoding apparatus according to the present disclosure includes receiving a bitstream including residual information; deriving quantized transform coefficients for a current block based on the residual information included in the bitstream; deriving residual samples for the current block based on the quantized transform coefficients; and generating a reconstructed picture based on the residual samples for the current block.
Type: Grant
Filed: April 6, 2020
Date of Patent: March 23, 2021
Assignee: LG ELECTRONICS, INC.
Inventors: Jungah Choi, Sunmi Yoo, Jin Heo, Ling Li, Jangwon Choi, Seunghwan Kim
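The chain in this record (quantized coefficients, then residual samples, then reconstruction) can be shown on a single block. This is a generic sketch, not the claimed method: it assumes a uniform quantization step, substitutes a floating-point orthonormal inverse DCT for the integer transforms real codecs use, and takes the prediction block as given. The function names and the `qstep` parameter are hypothetical.

```python
import math


def idct_1d(coeffs):
    """Orthonormal inverse DCT-II of a 1-D coefficient list."""
    n = len(coeffs)
    samples = []
    for x in range(n):
        total = 0.0
        for k, c in enumerate(coeffs):
            scale = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
            total += scale * c * math.cos(math.pi * (2 * x + 1) * k / (2 * n))
        samples.append(total)
    return samples


def idct_2d(block):
    """Separable 2-D inverse DCT: rows first, then columns."""
    rows = [idct_1d(row) for row in block]
    cols = [idct_1d(list(col)) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]


def reconstruct_block(quantized, prediction, qstep, bit_depth=8):
    """Dequantize, inverse-transform, add the prediction, and clip."""
    coeffs = [[q * qstep for q in row] for row in quantized]
    residual = idct_2d(coeffs)
    max_val = (1 << bit_depth) - 1
    return [
        [min(max_val, max(0, round(p + r))) for p, r in zip(prow, rrow)]
        for prow, rrow in zip(prediction, residual)
    ]
```

For example, a 2x2 block whose only nonzero quantized coefficient is a DC value of 4, with `qstep=2`, yields a flat residual of 4 that is added to every predicted sample.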
Abstract: Methods to switch between renditions of a video stream are generally described. In some examples, the methods may include encoding a video stream at a first image quality in a first rendition and a second, lower image quality in a second rendition. The methods may further include sending the first rendition to a recipient computing device. The methods may include receiving a request to switch from the first rendition to the second rendition. The methods may include determining that first indicator data of a first inter-coded frame indicates that the video stream can be switched to a lower image quality rendition at the first inter-coded frame. In some examples, the methods may further include sending the second rendition to the recipient computing device.
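The switching rule in the abstract above can be sketched as a scan for the first frame that permits a downward switch. The `Frame` fields, including the `switchable_down` flag standing in for the "indicator data", are hypothetical, as are the function names and the "high"/"low" labels.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Frame:
    index: int
    is_inter_coded: bool
    switchable_down: bool  # hypothetical stand-in for the indicator data


def switch_point(frames, request_index):
    """First inter-coded frame at or after the request that allows a switch."""
    for frame in frames:
        if (frame.index >= request_index
                and frame.is_inter_coded
                and frame.switchable_down):
            return frame.index
    return None


def plan_stream(frames, request_index):
    """Label each frame with the rendition it would be sent from."""
    point = switch_point(frames, request_index)
    return [
        "high" if point is None or frame.index < point else "low"
        for frame in frames
    ]
```

In this sketch the sender keeps serving the first rendition after the request arrives and changes over only at a marked inter-coded frame, so it does not have to wait for the next intra-coded frame.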
Abstract: In an embodiment described herein, a method for face-speech bridging by cycle video/audio reconstruction is described. The method comprises encoding audio data and video data via mutual autoencoders that comprise an audio autoencoder and a video autoencoder, wherein the mutual autoencoders share a common space with corresponding embeddings derived by each of the audio autoencoder and the video autoencoder. Additionally, the method comprises substituting embeddings from a non-corrupted modality for corresponding corrupted embeddings in a corrupted modality in real-time based at least in part on corrupted audio data or corrupted video data. The method also comprises synthesizing reconstructed audio data and reconstructed video data based on, at least in part, the substituted embeddings.
Abstract: Display devices and electronic apparatuses with 3D camera modules are provided. An exemplary device comprises a display and a 3D camera module, wherein the 3D camera module comprises a depth camera module disposed at a backlight side of the display; the depth camera module comprises an edge-emitting laser and an imaging module; the edge-emitting laser is configured for emitting laser light, for the emitted laser light to penetrate the display to reach an object; and the imaging module is configured for receiving laser light reflected by the object that penetrates the display, and obtaining a depth image of the object based on the reflected laser light.
Abstract: A depth measurement assembly (DMA) includes an illumination source that projects pulses of light (e.g., structured light) at a temporal pulsing frequency into a local area. The DMA includes a sensor that captures images of the pulses of light reflected from the local area and determines, using one or more of the captured images, one or more TOF phase shifts for the pulses of light. The DMA includes a controller coupled to the sensor and configured to determine a first set of estimated radial distances to an object in the local area based on the one or more TOF phase shifts. The controller determines a second estimated radial distance to the object based on an encoding of structured light and at least one of the captured images. The controller selects an estimated radial distance from the first set of radial distances.
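A single time-of-flight phase shift is ambiguous because the phase wraps once per unambiguous range c / (2f), which is why the abstract above speaks of a set of estimated distances. The sketch below enumerates those candidates and picks one. The selection rule (the candidate closest to the structured-light estimate) is an assumption, since the abstract does not state the criterion, and the function names and `max_range_m` parameter are hypothetical.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # metres per second


def tof_candidates(phase_shift, freq_hz, max_range_m):
    """All radial distances consistent with one wrapped phase shift."""
    unambiguous = SPEED_OF_LIGHT / (2 * freq_hz)
    base = unambiguous * phase_shift / (2 * math.pi)
    candidates = []
    k = 0
    while base + k * unambiguous <= max_range_m:
        candidates.append(base + k * unambiguous)
        k += 1
    return candidates


def select_distance(candidates, structured_light_estimate):
    """Assumed rule: keep the candidate nearest the structured-light estimate."""
    return min(candidates, key=lambda d: abs(d - structured_light_estimate))
```

With a pulsing frequency that gives a 1.5 m unambiguous range and a phase shift of pi, the candidates are about 0.75 m, 2.25 m, and 3.75 m within a 5 m limit; a structured-light estimate of 2.4 m then selects the 2.25 m candidate.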
Abstract: An information processing apparatus according to this invention includes a generating unit configured to generate a virtual viewpoint image in accordance with a position and/or line-of-sight direction of a viewpoint, and a notifying unit configured to send a notification of information about quality of the virtual viewpoint image generated by the generating unit.
Abstract: Methods, processes, and systems are presented for adaptive loop filtering in coding and decoding high dynamic range (HDR) video. Given an input image block, its luminance information may be used to adapt one or more parameters of adaptive loop filtering and compute gradient and directionality information, activity information, a classification index, and adaptive-loop-filtering coefficients.
Abstract: A method of video decoding performed in a video decoder includes receiving a coded video bitstream including signaling information for a current block. The method further includes determining block reconstruction information for the current block based on the signaling information. The method further includes reconstructing the current block using the determined block reconstruction information.
Abstract: The present disclosure relates to deblocking filtering, which may be advantageously applied for block-wise encoding and decoding of images or video signals. In particular, the present disclosure relates to an improved memory management in an automated decision on whether to apply or skip deblocking filtering for a block and to selection of the deblocking filter. The decision is performed on the basis of a segmentation of blocks in such a manner that memory usage is optimized. Preferably, the selection of appropriate deblocking filters is improved so as to reduce computational expense.
Type: Grant
Filed: July 27, 2020
Date of Patent: January 19, 2021
Assignee: SUN PATENT TRUST
Inventors: Matthias Narroschke, Semih Esenlik, Thomas Wedi