Dolby Labs Patents

Dolby Laboratories, Inc. licenses its audio technologies, including its noise-reduction systems, to the media industry. Its product portfolio includes Dolby Digital Plus (DD+), Dolby Digital (DD), AAC and HE-AAC, Dolby TrueHD, Dolby Atmos, Dolby AC-4, Dolby Voice and Dolby Vision. Products that incorporate Dolby technologies include televisions, set-top boxes, computers, DVD and Blu-ray devices, soundbars, smartphones, tablets, video game consoles, and automobile entertainment systems.

Dolby Labs Patents by Type

Dolby Labs Patents Granted: Dolby Labs patents that have been granted by the United States Patent and Trademark Office (USPTO).
Dolby Labs Patent Applications: Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

Adaptive loudness normalization for audio object clustering

Patent number: 11930347

Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.

Type: Grant

Filed: February 12, 2020

Date of Patent: March 12, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Lianwu Chen, Lie Lu
Method and apparatus for controlling enhancement of low-bitrate coded audio

Patent number: 11929085

Abstract: Described herein is a method of low-bitrate coding of audio data and generating enhancement metadata for controlling audio enhancement of the low-bitrate coded audio data at a decoder side, including the steps of: (a) core encoding original audio data at a low bitrate to obtain encoded audio data; (b) generating enhancement metadata to be used for controlling a type and/or amount of audio enhancement at the decoder side after core decoding the encoded audio data; and (c) outputting the encoded audio data and the enhancement metadata. Described is further an encoder configured to perform said method. Described is moreover a method for generating enhanced audio data from low-bitrate coded audio data based on enhancement metadata and a decoder configured to perform said method.

Type: Grant

Filed: August 29, 2019

Date of Patent: March 12, 2024

Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Arijit Biswas, Jia Dai, Aaron Steven Master
QUANTIZATION PARAMETER SIGNALING

Publication number: 20240080489

Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.

Type: Application

Filed: November 10, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Fangjun PU, Taoran LU, Peng YIN, Sean Thomas MCCARTHY
METHODS AND SYSTEMS FOR RENDERING OBJECT BASED AUDIO

Publication number: 20240079015

Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Type: Application

Filed: September 19, 2023

Publication date: March 7, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
PERCEPTUAL ENHANCEMENT FOR BINAURAL AUDIO RECORDING

Publication number: 20240080608

Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.

Type: Application

Filed: December 14, 2021

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
CODING AND DECODING OF INTERLEAVED IMAGE DATA

Publication number: 20240080479

Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.

Type: Application

Filed: November 7, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
PERCEPTUALLY-BASED LOSS FUNCTIONS FOR AUDIO ENCODING AND DECODING BASED ON MACHINE LEARNING

Publication number: 20240079019

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

Type: Application

Filed: November 13, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
FRAME-RATE SCALABLE VIDEO CODING

Publication number: 20240080465

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Application

Filed: November 13, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
HDR image generation from single-shot HDR color image sensors

Patent number: 11922639

Abstract: A method for generating an high-dynamic-range (HDR) color image from a dual-exposure-time single-shot HDR color image sensor includes obtaining pixel values generated by a local region of sensor pixels of the image sensor, determining a motion parameter for the local region from pixel values associated with a first color, and demosaicing the pixel values of the local region to determine, for each of three colors, an output value of the images pixel, wherein relative contributions of short-exposure-time pixels and long-exposure-time pixels to the output value are weighted according to the motion parameter.

Type: Grant

Filed: June 4, 2019

Date of Patent: March 5, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Yuelong Li, Jon Scott McElvain, Mohammad Tofighi
Rendering binaural audio over multiple near field transducers

Patent number: 11924619

Abstract: An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.

Type: Grant

Filed: September 12, 2022

Date of Patent: March 5, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Mark F. Davis, Nicolas R. Tsingos, C. Phillip Brown
Image decoding device and image coding device

Patent number: 11924451

Abstract: A reference picture information decoding unit (13) omits decoding of a reference list sorting presence or absence flag and/or a reference list sorting order based on the number of current picture referable pictures.

Type: Grant

Filed: March 21, 2022

Date of Patent: March 5, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventors: Tomoyuki Yamamoto, Tomohiro Ikai, Yukinobu Yasugi, Takeshi Tsukuba
Signal reshaping for high dynamic range signals

Patent number: 11924477

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Grant

Filed: June 27, 2023

Date of Patent: March 5, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Publication number: 20240073459

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Application

Filed: October 31, 2023

Publication date: February 29, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

Publication number: 20240073444

Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.

Type: Application

Filed: November 8, 2023

Publication date: February 29, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jong Ki HAN, Jae Yung LEE
DETERMINING DIALOG QUALITY METRICS OF A MIXED AUDIO SIGNAL

Publication number: 20240071411

Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.

Type: Application

Filed: January 4, 2022

Publication date: February 29, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai SUN, Lie LU, Shaofan YANG, Rhonda J. WILSON, Dirk Jeroen BREEBAART
MULTIPLE STAGE MODULATION PROJECTOR DISPLAY SYSTEMS HAVING EFFICIENT LIGHT UTILIZATION

Publication number: 20240073357

Abstract: Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.

Type: Application

Filed: January 31, 2022

Publication date: February 29, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Martin J. Richards
Model based prediction in a critically sampled filterbank

Patent number: 11915713

Abstract: The present document relates to audio source coding systems. In particular, the present document relates to audio source coding systems which make use of linear prediction in combination with a filterbank. A method for estimating a first sample (615) of a first subband signal in a first subband of an audio signal is described. The first subband signal of the audio signal is determined using an analysis filterbank (612) comprising a plurality of analysis filters which provide a plurality of subband signals in a plurality of subbands from the audio signal, respectively.

Type: Grant

Filed: March 30, 2023

Date of Patent: February 27, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventor: Lars Villemoes
Scalable systems for controlling color management comprising varying levels of metadata

Patent number: 11917171

Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

Type: Grant

Filed: July 7, 2023

Date of Patent: February 27, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
Estimating user location in a system including smart audio devices

Patent number: 11917386

Abstract: Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may include estimating in which of the zones the user is located. The audio activity may be performed using microphones and loudspeakers which are implemented in or coupled to smart audio devices.

Type: Grant

Filed: March 7, 2023

Date of Patent: February 27, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Carlos Eduardo Medaglia Dyonisio, David Gunawan
METHODS AND DEVICES FOR JOINT MULTICHANNEL CODING

Publication number: 20240062765

Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component which each performs stereo decoding on one channel resulting from the first stereo decoding component, and one channel resulting from the second stereo decoding component.

Type: Application

Filed: September 1, 2023

Publication date: February 22, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Kristofer KJOERLING, Harald MUNDT, Heiko PURNHAGEN
Signal reshaping for high dynamic range signals

Patent number: 11910025

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Grant

Filed: October 31, 2023

Date of Patent: February 20, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
Integration of high frequency reconstruction techniques with reduced post-processing delay

Patent number: 11908486

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: January 20, 2023

Date of Patent: February 20, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
BINAURAL SIGNAL POST-PROCESSING

Publication number: 20240056760

Abstract: A method of audio processing includes performing spatial analysis on a binaural signal to estimate level differences and phase differences characteristic of a binaural filter of the binaural signal, performing object extraction on the binaural audio signal using the estimated level and phase differences to generate a left/right main component signal and a left/right residual component signal. The system may process the left/right main and left/right residual components differently using different object processing parameters for e.g. repositioning, equalization, compression, upmixing, channel remapping or storage to generate a processed binaural signal that provides an improved listening experience. Repositioning may be based on head tracking sensor data.

Type: Application

Filed: December 16, 2021

Publication date: February 15, 2024

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Dirk Jeroen BREEBAART, Giulio CENGARLE, C. Phillip BROWN
METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS

Publication number: 20240056755

Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.

Type: Application

Filed: August 28, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Florian KEILER, Johannes Boehm
SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20240056610

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: October 13, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
ORCHESTRATION OF ACOUSTIC DIRECT SEQUENCE SPREAD SPECTRUM SIGNALS FOR ESTIMATION OF ACOUSTIC SCENE METRICS

Publication number: 20240056757

Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.

Type: Application

Filed: December 2, 2021

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
METHOD AND APPARATUS FOR PROCESSING OF AUDIO DATA USING A PRE-CONFIGURED GENERATOR

Publication number: 20240055006

Abstract: Described herein is a method for setting up a decoder for generating processed audio data from an audio bitstream, the decoder comprising a Generator of a Generative Adversarial Network, GAN, for processing of the audio data, wherein the method includes the steps of (a) pre-configuring the Generator for processing of audio data with a set of parameters for the Generator, the parameters being determined by training, at training time, the Generator using the full concatenated distribution; and (b) pre-configuring the decoder to determine, at decoding time, a truncation mode for modifying the concatenated distribution and to apply the determined truncation mode to the concatenated distribution. Described are further a method of generating processed audio data from an audio bitstream using a Generator of a Generative Adversarial Network, GAN, for processing of the audio data and a respective apparatus. Moreover, described are also respective systems and computer program products.

Type: Application

Filed: December 15, 2021

Publication date: February 15, 2024

Applicant: Dolby International AB

Inventor: Arijit BISWAS
DIGITAL FILTERBANK FOR SPECTRAL ENVELOPE ADJUSTMENT

Publication number: 20240055010

Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.

Type: Application

Filed: August 21, 2023

Publication date: February 15, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventor: Per EKSTRAND
MULTISOURCE MEDIA DELIVERY SYSTEMS AND METHODS

Publication number: 20240056649

Abstract: A method for delivering media content to one or more clients over a distributed system is disclosed. The method may include generating a plurality of network-coded symbols from a plurality of original symbols representing a first media asset. The method may further include generating an original plurality of coded variants of the first media asset. The method may further include distributing a first coded variant of the original plurality of coded variants to a first cache on a first server device for storage in the first cache. The method may further include distributing a second coded variant of the original plurality of coded variants to a second cache on a second server device for storage in the second cache.

Type: Application

Filed: December 16, 2021

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jeffrey RIEDMILLER, Mingchao YU, Jason Michael CLOUD
METHOD FOR ENCODING AND DECODING IMAGE USING ADAPTIVE DEBLOCKING FILTERING, AND APPARATUS THEREFOR

Publication number: 20240056613

Abstract: Disclosed is an encoding/decoding method and apparatus related to adaptive deblocking filtering. There is provided an image decoding method performing adaptive filtering in inter-prediction, the method including: reconstructing, from a bitstream, an image signal including a reference block on which block matching is performed in inter-prediction of a current block to be encoded; obtaining, from the bitstream, a flag indicating whether the reference block exists within a current picture where the current block is positioned; reconstructing the current block by using the reference block; adaptively applying an in-loop filter for the reconstructed current block based on the obtained flag; and storing the current block to which the in-loop filter is or is not applied in a decoded picture buffer (DPB).

Type: Application

Filed: October 24, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Je Chang JEONG, Ki Baek KIM
Orientation-aware surround sound playback

Patent number: 11902762

Abstract: Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent component of the rendering component, processing the rendering component by updating the orientation dependent component according to an orientation of the loudspeakers and dispatching the received audio streams to the plurality of loudspeakers for playback based on the processed rendering component. Corresponding system and computer program products are also disclosed.

Type: Grant

Filed: May 4, 2022

Date of Patent: February 13, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Xuejing Sun, Guilin Ma, Xiguang Zheng
Methods, apparatus and systems for representation, encoding, and decoding of discrete directivity data

Patent number: 11902769

Abstract: The present disclosure relates to a method of processing audio content including directivity information for at least one sound source, the directivity information comprising a first set of first directivity unit vectors representing directivity directions and associated first directivity gains. The disclosure further relates to corresponding methods of encoding and decoding audio content including directivity information for at least one sound source.

Type: Grant

Filed: June 30, 2020

Date of Patent: February 13, 2024

Assignee: Dolby International AB

Inventors: Leon Terentiv, Christof Fersch, Daniel Fischer
PERSONALIZED HRTFS VIA OPTICAL CAPTURE

Publication number: 20240048932

Abstract: An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.

Type: Application

Filed: August 24, 2023

Publication date: February 8, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: McGregor Steele JOYNER, Alex BRANDMEYER, Scott DALY, Jeffrey Ross BAKER, Andrea FANELLI, Poppy Anne Carrie CRUM
ORCHESTRATION OF ACOUSTIC DIRECT SEQUENCE SPREAD SPECTRUM SIGNALS FOR ESTIMATION OF ACOUSTIC SCENE METRICS

Publication number: 20240048931

Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.

Type: Application

Filed: December 2, 2021

Publication date: February 8, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
CROSS PRODUCT ENHANCED SUBBAND BLOCK BASED HARMONIC TRANSPOSITION

Publication number: 20240046940

Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

Type: Application

Filed: October 5, 2023

Publication date: February 8, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventor: Lars Villemoes
Methods and apparatus for compressing and decompressing a higher order ambisonics representation

Patent number: 11895477

Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.

Type: Grant

Filed: March 21, 2022

Date of Patent: February 6, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Sven Kordon, Alexander Krueger
Media-aware navigation metadata

Patent number: 11895369

Abstract: The present disclosure relates to methods and apparatus for processing media content having video content and associated audio content. A method of processing media content having video content and associated audio content comprises the method includes receiving the video content and the associated audio content, analyzing the associated audio content, determining one or more navigation points for enabling navigation of the media content based on the analysis, wherein the one or more navigation points indicate points of interest in the associated audio content for short-term rewinding and/or fast forwarding, embedding the one or more navigation points into metadata for the media content, and outputting the video content, the associated audio content, and the metadata.

Type: Grant

Filed: August 22, 2018

Date of Patent: February 6, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Christopher Graham Hines
Steering of binauralization of audio

Patent number: 11895479

Abstract: A method for steering binauralization of audio is provided. The method comprises steps of: receiving (410) an audio input signal, calculating (430) a confidence value indicating a likelihood that a current audio frame of the audio input signal comprises binauralized audio; determining (450) a state signal based on the confidence value; determining (460) a steering signal, based on the first confidence value, the state signal and an energy value of the audio frame; and generating (470) an audio output signal with steered binauralization by processing the audio input signal according to the steering signal.

Type: Grant

Filed: August 19, 2020

Date of Patent: February 6, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Qingyuan Bin, Libin Luo, Ziyu Yang, Zhiwei Shuang, Xuemei Yu, Guiping Wang
Layered augmented entertainment experiences

Patent number: 11893700

Abstract: Spatial information that describes spatial locations of visual objects as in a three-dimensional (3D) image space as represented in one or more multi-view unlayered images is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.

Type: Grant

Filed: April 28, 2022

Date of Patent: February 6, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Ajit Ninan, Neil Mammen, Tyrome Y. Brown
Electro-optical transfer function conversion and signal legalization

Patent number: 11895416

Abstract: A device includes an electronic processor configured to define a first set of sample pixels from a set of sample pixels determined from received video data according to a first electro-optical transfer function (EOTF) in a first color representation of a first color space; convert the first set of sample pixels to a second EOTF via a mapping function, producing a second set of sample pixels according to the second EOTF; convert the first and second set of sample pixels from the first color representation to a second color representation of the first color space; determine a backward reshaping function by repeatedly applying and adjusting a sample backward reshaping function so as to minimize a difference between predicted pixel values obtained by applying the sample backward reshaping function to the pixels of the converted first set and the pixels of the converted second set.

Type: Grant

Filed: July 27, 2020

Date of Patent: February 6, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming Su, Harshad Kadu, Neeraj J. Gadgil, Qing Song, Yoon Yung Lee
Reconstruction of audio scenes from a downmix

Patent number: 11894003

Abstract: Audio objects are associated with positional metadata. A received downmix signal comprises downmix channels that are linear combinations of one or more audio objects and are associated with respective positional locators. In a first aspect, the downmix signal, the positional metadata and frequency-dependent object gains are received. An audio object is reconstructed by applying the object gain to an upmix of the downmix signal in accordance with coefficients based on the positional metadata and the positional locators. In a second aspect, audio objects have been encoded together with at least one bed channel positioned at a positional locator of a corresponding downmix channel. The decoding system receives the downmix signal and the positional metadata of the audio objects. A bed channel is reconstructed by suppressing the content representing audio objects from the corresponding downmix channel on the basis of the positional locator of the corresponding downmix channel.

Type: Grant

Filed: February 10, 2023

Date of Patent: February 6, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventors: Toni Hirvonen, Heiko Purnhagen, Leif Jonas Samuelsson, Lars Villemoes
Method and apparatus for screen related adaptation of a Higher-Order Ambisonics audio signal

Patent number: 11895482

Abstract: A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound field and a second set of decoded higher order ambisonics signals representing ambient components of the sound field. The method also includes combining the first set of decoded higher order ambisonics signals and the second set of decoded higher order ambisonics signals to produce a combined set of decoded higher order ambisonics signals.

Type: Grant

Filed: January 25, 2023

Date of Patent: February 6, 2024

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Peter Jax, Johannes Boehm, William G. Redmann
Apparatus and method for processing an input audio signal using cascaded filterbanks

Patent number: 11894002

Abstract: An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.

Type: Grant

Filed: October 21, 2022

Date of Patent: February 6, 2024

Assignees: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung, Dolby International AB

Inventors: Lars Villemoes, Per Ekstrand, Sascha Disch, Frederik Nagel, Stephan Wilde
Compressor target curve to avoid boosting noise

Patent number: 11894006

Abstract: The processing of audio signals during playback is provided, so that audio signals that fall below a specified threshold loudness level are processed to avoid making unwanted background noise audible. N-channel audio is received from a playback volume controller/leveler (101). The level of the audio is compared with a threshold level. If the level is greater than the threshold level, the audio is processed with a first amount of gain in accordance with a first dynamic range control (DRC) compression curve that is tuned for professionally produced audio. If the level is less than or equal to the threshold level, the audio is processed with a second amount of gain in accordance with a second DRC compression curve that is designed to avoid boosting unwanted background noise. After applying the gain to the audio, the audio is sent to a downstream device.

Type: Grant

Filed: July 18, 2019

Date of Patent: February 6, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Zhongjin Wang, Andrew Peter Reilly, Michael William Mason
Enabling sampling rate diversity in a voice communication system

Patent number: 11894005

Abstract: An audio communication endpoint receives a bitstream containing spectral components representing spectral content of an audio signal, wherein the spectral components relate to a first range extending up to a first break frequency, above which any spectral components are unassigned. The endpoint adapts the received bitstream in accordance with a second range extending up to a second break frequency by removing spectral components or adding neutral-valued spectral components relating to a range between the first and second break frequencies. The endpoint then attenuates spectral content in a neighbourhood of the least of the first and second break frequencies for thereby achieving a gradual spectral decay. After this, reconstructing the audio signal is reconstructed by an inverse transform operating on spectral components relating to said second range in the adapted and attenuated received bitstream. At small computational expense, the endpoint may to adapt to different sample rates in received bitstreams.

Type: Grant

Filed: November 15, 2019

Date of Patent: February 6, 2024

Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Heiko Purnhagen, Leif Sehlstrom, Lars Villemoes, Glenn N. Dickins, Mark S Vinton
VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Publication number: 20240039499

Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

Type: Application

Filed: July 20, 2023

Publication date: February 1, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
ACOUSTIC ENVIRONMENT SIMULATION

Publication number: 20240038248

Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (?2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (?) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (?2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.

Type: Application

Filed: August 7, 2023

Publication date: February 1, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Dirk Jeroen BREEBAART
AUDIO CONTENT IDENTIFICATION

Publication number: 20240038258

Abstract: A method of audio content identification includes using a two-stage classifier. The first stage includes previously-existing classifiers and the second stage includes a new classifier. The outputs of the first and second stages calculated over different time periods are combined to generate a steering signal. The final classification results from a combination of the steering signal and the outputs of the first and second stages. In this manner, a new classifier may be added without disrupting existing classifiers.

Type: Application

Filed: August 18, 2021

Publication date: February 1, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guiping Wang, Lie Lu
METHOD AND DEVICE FOR DECODING A HIGHER-ORDER AMBISONICS (HOA) REPRESENTATION OF AN AUDIO SOUNDFIELD

Publication number: 20240040327

Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (?)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (?)} was determined based on the spherical modelling grid and the HOA order N.

Type: Application

Filed: July 26, 2023

Publication date: February 1, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Johannes BOEHM, Florian KEILER
ACOUSTIC FEEDBACK MANAGEMENT IN REAL-TIME AUDIO COMMUNICATION

Publication number: 20240040043

Abstract: Disclosed is a method for managing acoustic feedback in real-time audio communications in a communications system, the method comprising determining, by means of a detection module, whether a first communication device is in loudspeaker mode, whether the first communication device is in real-time audio communications with a second communication, and whether the first communication device and the second communication device are in a same acoustic space. Upon determining that this is the case a request signal for requesting one or more measures against acoustic feedback is provided to a mitigation module. Further disclosed are a device and a system configured to perform the method, a non-transitory computer-readable medium, an encoder and a decoder.

Type: Application

Filed: December 22, 2021

Publication date: February 1, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Qianqian FANG, Kai LI, Yanmeng GUO, Wei HUANG, Yang LIU

prev 1 2 3 4 5 6 7 … next