Patents Assigned to Dolby Laboratories Licensing Corporation

Blind detection of binauralized stereo content

Patent number: 11929091

Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.

Type: Grant

Filed: March 1, 2022

Date of Patent: March 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu
Adaptive loudness normalization for audio object clustering

Patent number: 11930347

Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.

Type: Grant

Filed: February 12, 2020

Date of Patent: March 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu
CODING AND DECODING OF INTERLEAVED IMAGE DATA

Publication number: 20240080479

Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.

Type: Application

Filed: November 7, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
PERCEPTUALLY-BASED LOSS FUNCTIONS FOR AUDIO ENCODING AND DECODING BASED ON MACHINE LEARNING

Publication number: 20240079019

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

Type: Application

Filed: November 13, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
FRAME-RATE SCALABLE VIDEO CODING

Publication number: 20240080465

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Application

Filed: November 13, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
QUANTIZATION PARAMETER SIGNALING

Publication number: 20240080489

Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.

Type: Application

Filed: November 10, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Fangjun PU, Taoran LU, Peng YIN, Sean Thomas MCCARTHY
METHODS AND SYSTEMS FOR RENDERING OBJECT BASED AUDIO

Publication number: 20240079015

Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Type: Application

Filed: September 19, 2023

Publication date: March 7, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
PERCEPTUAL ENHANCEMENT FOR BINAURAL AUDIO RECORDING

Publication number: 20240080608

Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.

Type: Application

Filed: December 14, 2021

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
Rendering binaural audio over multiple near field transducers

Patent number: 11924619

Abstract: An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.

Type: Grant

Filed: September 12, 2022

Date of Patent: March 5, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Mark F. Davis, Nicolas R. Tsingos, C. Phillip Brown
Signal reshaping for high dynamic range signals

Patent number: 11924477

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Grant

Filed: June 27, 2023

Date of Patent: March 5, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
HDR image generation from single-shot HDR color image sensors

Patent number: 11922639

Abstract: A method for generating an high-dynamic-range (HDR) color image from a dual-exposure-time single-shot HDR color image sensor includes obtaining pixel values generated by a local region of sensor pixels of the image sensor, determining a motion parameter for the local region from pixel values associated with a first color, and demosaicing the pixel values of the local region to determine, for each of three colors, an output value of the images pixel, wherein relative contributions of short-exposure-time pixels and long-exposure-time pixels to the output value are weighted according to the motion parameter.

Type: Grant

Filed: June 4, 2019

Date of Patent: March 5, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Yuelong Li, Jon Scott McElvain, Mohammad Tofighi
DETERMINING DIALOG QUALITY METRICS OF A MIXED AUDIO SIGNAL

Publication number: 20240071411

Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.

Type: Application

Filed: January 4, 2022

Publication date: February 29, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai SUN, Lie LU, Shaofan YANG, Rhonda J. WILSON, Dirk Jeroen BREEBAART
MULTIPLE STAGE MODULATION PROJECTOR DISPLAY SYSTEMS HAVING EFFICIENT LIGHT UTILIZATION

Publication number: 20240073357

Abstract: Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.

Type: Application

Filed: January 31, 2022

Publication date: February 29, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Martin J. Richards
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Publication number: 20240073459

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Application

Filed: October 31, 2023

Publication date: February 29, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

Publication number: 20240073444

Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.

Type: Application

Filed: November 8, 2023

Publication date: February 29, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jong Ki HAN, Jae Yung LEE
Estimating user location in a system including smart audio devices

Patent number: 11917386

Abstract: Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may include estimating in which of the zones the user is located. The audio activity may be performed using microphones and loudspeakers which are implemented in or coupled to smart audio devices.

Type: Grant

Filed: March 7, 2023

Date of Patent: February 27, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Carlos Eduardo Medaglia Dyonisio, David Gunawan
Scalable systems for controlling color management comprising varying levels of metadata

Patent number: 11917171

Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

Type: Grant

Filed: July 7, 2023

Date of Patent: February 27, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
Signal reshaping for high dynamic range signals

Patent number: 11910025

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Grant

Filed: October 31, 2023

Date of Patent: February 20, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS

Publication number: 20240056755

Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.

Type: Application

Filed: August 28, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Florian KEILER, Johannes Boehm
SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20240056610

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: October 13, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK

prev 1 2 3 4 5 6 7 8 … next