Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

METHOD OF CODING AND DECODING IMAGES, CODING AND DECODING DEVICE AND COMPUTER PROGRAMS CORRESPONDING THERETO

Publication number: 20240121420

Abstract: A method is provided for coding at least one image split up into partitions, a current partition to be coded containing data, at least one data item of which is allotted a sign. The coding method includes, for the current partition, the following steps: calculating the value of a function representative of the data of the current partition with the exclusion of the sign; comparing the calculated value with a predetermined value of the sign; as a function of the result of the comparison, modifying or not modifying at least one of the data items of the current partition, in the case of modification, coding the at least one modified data item.

Type: Application

Filed: December 19, 2023

Publication date: April 11, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Felix Henry, Gordon Clare
CANVAS SIZE SCALABLE VIDEO CODING

Publication number: 20240121424

Abstract: Methods and systems for canvas size scalability across the same or different bitstream layers of a video coded bitstream are described. Offset parameters for a conformance window, a reference region of interest (ROI) in a reference layer, and a current ROI in a current layer are received. The width and height of a current ROI and a reference ROI are computed based on the offset parameters and they are used to generate a width and height scaling factor to be used by a reference picture resampling unit to generate an output picture based on the current ROI and the reference ROI.

Type: Application

Filed: December 18, 2023

Publication date: April 11, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Taoran Lu, Fangjun Pu, Peng Yin, Sean Thomas McCarthy, Tao Chen
CROSS-ASSET GUIDE CHROMA REFORMATTING FOR MULTI-ASSET IMAGING FORMAT

Publication number: 20240114153

Abstract: A first image and a second image of different dynamic ranges are derived from the same source image. Based on a chroma sampling format of the first image, it is determined whether edge preserving filtering is to be used to generate chroma upsampled image data in a reconstructed image. If so, image metadata for performing the edge preserving filtering is generated. The first image, the second image and the image metadata are encoded into an image data container to enable a recipient device to generate the reconstructed image.

Type: Application

Filed: September 1, 2023

Publication date: April 4, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Anustup Kumar Atanu CHOUDHURY, Guan-Ming SU
REPRESENTING SPATIAL AUDIO BY MEANS OF AN AUDIO SIGNAL AND ASSOCIATED METADATA

Publication number: 20240114307

Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.

Type: Application

Filed: September 12, 2023

Publication date: April 4, 2024

Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Stefan BRUHN
PROGRESSIVE CALCULATION AND APPLICATION OF RENDERING CONFIGURATIONS FOR DYNAMIC APPLICATIONS

Publication number: 20240114309

Abstract: Some examples involve rendering received audio data by determining a first relative activation of a set of loudspeakers in an environment according to a first rendering configuration corresponding to a first set of speaker activations, receiving a first rendering transition indication indicating a transition from the first rendering configuration to a second rendering configuration and determining a second set of speaker activations corresponding to a simplified version of the second rendering configuration. Some examples involve performing a first transition from the first set of speaker activations to the second set of speaker activations, determining a third set of speaker activations corresponding to a complete version of the second rendering configuration and performing a second transition to the third set of speaker activations without requiring completion of the first transition.

Type: Application

Filed: December 2, 2021

Publication date: April 4, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Joshua B. LANDO, Alan J. SEEFELDT
AUDIO FILTERBANK WITH DECORRELATING COMPONENTS

Publication number: 20240114306

Abstract: An multi-input, multi-output audio process is implemented as a linear system for use in an audio filterbank to convert a set of frequency-domain input audio signals into a set of frequency-domain output audio signals. A transfer function from one input to one output is defined as a frequency dependent gain function. In some implementations, the transfer function includes a direct component that is substantially defined as a frequency dependent gain, and one or more decorrelated components that have frequency-varying group phase response. The transfer function is formed from a set of sub-band functions, with each sub-band function being formed from a set of corresponding component transfer functions including direct component and one or more decorrelated components.

Type: Application

Filed: September 2, 2020

Publication date: April 4, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: David S. MCGRATH
INTRA-PREDICTION FOR HEXAGONALLY-SAMPLED VIDEO AND IMAGE COMPRESSION

Publication number: 20240114127

Abstract: Methods, systems, and devices implement intra-prediction for hexagonally-sampled compression and decompression of videos and images having a regular grid of hexagonally-shaped pixels. For encoding, a prediction unit (PU) shape is selected at a sequence level from the group consisting of parallelogram, zigzag-square, hexagonal super-pixel, a rectangular zigzag and an arrow, and the hexagonally-sampled image is divided into regions based on the PU shape. For each region: a prediction mode and a PU size are determined; reference pixels are determined for each predicted pixel in the PU shape based on the prediction mode; a weighted factor is determined for each of the reference pixels based on a distance between the reference pixel and the predicted pixel; and a predicted value of each of the predicted pixels in the PU shape is determined using the corresponding reference pixels and the weighted factors.

Type: Application

Filed: February 10, 2022

Publication date: April 4, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Zhaobin ZHANG, Neeraj J. GADGIL, Guan-Ming SU
FREQUENCY DOMAIN MULTIPLEXING OF SPATIAL AUDIO FOR MULTIPLE LISTENER SWEET SPOTS

Publication number: 20240114308

Abstract: Some methods involve receiving, by a control system that is configured for implementing a plurality of renderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each renderer and according to the listening configuration data, the received audio data to obtain a set of renderer-specific loudspeaker feed signals for a corresponding listening configuration. Each renderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a renderer-specific set of frequency bands and combining the renderer-specific frequency bands of each renderer to produce an output set of loudspeaker feed signals.

Type: Application

Filed: December 2, 2021

Publication date: April 4, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Alan J. SEEFELDT, C. Phillip BROWN
INSERTION OF FORCED GAPS FOR PERVASIVE LISTENING

Publication number: 20240107252

Abstract: An attenuation or “gap” may be inserted into at least a first frequency range of at least first and second audio playback signals of a content stream during at least a first time interval to generate at least first and second modified audio playback signals. Corresponding audio device playback sound may be provided by at least first and second audio devices. At least one microphone may detect at least the first audio device playback sound and the second audio device playback sound and may generate corresponding microphone signals. Audio data may be extracted from the microphone signals in at least the first frequency range, to produce extracted audio data. A far-field audio environment impulse response and/or audio environment noise may be estimated based, at least in part, on the extracted audio data.

Type: Application

Filed: December 2, 2021

Publication date: March 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Christopher Graham HINES, Benjamin John SOUTHWELL
SPATIAL NOISE FILLING IN MULTI-CHANNEL CODEC

Publication number: 20240105192

Abstract: Embodiments are disclosed for spatial noise filling in multi-channel codecs. In an embodiment, a method of regenerating background noise ambience in a multi-channel codec by generating spatial hole filling noise comprises: computing noise estimates based on a primary downmix channel generated from an input audio signal representing a spatial audio scene with background noise ambience; computing spectral shaping filter coefficients based on the noise estimates; spectrally shaping the multi-channel noise signal using the spectral shaping filter coefficients and a noise distribution, the spectral shaping resulting in a diffused, multi-channel noise signal with uncorrelated channels; spatially shaping the diffused, uncorrelated multi-channel noise signal with uncorrelated channels based on a noise ambience of the spatial audio scene; and adding the spatially and spectrally shaped multi-channel noise to a multi-channel codec output to synthesize the background noise ambience of the spatial audio scene.

Type: Application

Filed: December 1, 2021

Publication date: March 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Rishabh TYAGI, Michael ECKERT
FREQUENCY DOMAIN MULTIPLEXING OF SPATIAL AUDIO FOR MULTIPLE LISTENER SWEET SPOTS

Publication number: 20240107255

Abstract: Some methods involve receiving, by a control system configured for implementing a plurality of Tenderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each Tenderer and according to the listening configuration data, the received audio data to obtain a set of Tenderer-specific loudspeaker feed signals for a corresponding listening configuration. Each Tenderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a Tenderer-specific set of frequency bands and combining the renderer-specific frequency bands of each Tenderer to produce an output set of loudspeaker feed signals.

Type: Application

Filed: December 2, 2021

Publication date: March 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Alan J. SEEFELDT, C. Phillip BROWN
SYSTEM AND METHOD FOR OPTIMIZING LOUDNESS AND DYNAMIC RANGE ACROSS DIFFERENT PLAYBACK DEVICES

Publication number: 20240103801

Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.

Type: Application

Filed: October 9, 2023

Publication date: March 28, 2024

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Karl Jonas ROEDEN
HARMONIC TRANSPOSITION IN AN AUDIO CODING METHOD AND SYSTEM

Publication number: 20240105191

Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

Type: Application

Filed: November 29, 2023

Publication date: March 28, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Per EKSTRAND, Lars VILLEMOES
Audio Encoding and Decoding Using Presentation Transform Parameters

Publication number: 20240105186

Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.

Type: Application

Filed: October 16, 2023

Publication date: March 28, 2024

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson, Jeroen Koppens, Rhonda J. Wilson, Heiko Purnhagen, Alexander Stahlmann
HEAD TRACKED SPATIAL AUDIO AND/OR VIDEO RENDERING

Publication number: 20240098446

Abstract: Images are acquired through image sensors operating in conjunction with a media consumption system. The acquired images are used to determine a user's movement in a plurality of degrees of freedom. Sound images depicted in spatial audio rendered by audio speakers operating in conjunction with the media consumption system are adapted based at least in part on the user's movement in the plurality of degrees of freedom.

Type: Application

Filed: November 27, 2023

Publication date: March 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Ajit NINAN, William Anthony ROZZI
IMAGE ENHANCEMENT VIA GLOBAL AND LOCAL RESHAPING

Publication number: 20240095893

Abstract: A first reshaping mapping is performed on a first image represented in a first domain to generate a second image represented in a second domain. The first domain is of a first dynamic range different from a second dynamic range of which the second domain is. A second reshaping mapping is performed on the second image represented in the second domain to generate a third image represented in the first domain. The third image is perceptually different from the first image in at least one of: global contrast, global saturation, local contrast, local saturation, etc. A display image is derived from the third image and rendered on a display device.

Type: Application

Filed: January 26, 2022

Publication date: March 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Harshad KADU, Per Jonas Andreas KLITTMARK, Tao CHEN
METHOD AND DEVICE FOR APPLYING DYNAMIC RANGE COMPRESSION TO A HIGHER ORDER AMBISONICS SIGNAL

Publication number: 20240098436

Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.

Type: Application

Filed: November 9, 2023

Publication date: March 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Johannes BOEHM, Florian KEILER
AUDIO CHANNEL SPATIAL TRANSLATION

Publication number: 20240098438

Abstract: The present invention is directed to methods and apparatus for translating a first plurality of audio input channels to a second plurality of audio output channels. This includes determining that there is pair-wise coding among any of the first plurality of audio input channels, determining an input/output-mapping matrix for mapping at least a first set of the first plurality of audio input channels to at least a second set of the second plurality of audio output channels; and deriving the second plurality of audio output channels based on first plurality of audio input channels, the input/output-mapping matrix and the determined pair-wise coding. The first plurality of audio input channels represent the same soundfield represented by the second plurality of audio output channels.

Type: Application

Filed: September 25, 2023

Publication date: March 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Mark F. DAVIS
VIDEO CODING METHOD AND APPARATUS USING ANY TYPES OF BLOCK PARTITIONING

Publication number: 20240098264

Abstract: The present invention relates to a block partitioning structure in video coding technology, and a video encoding and decoding method and apparatus using the same, wherein the video encoding and decoding method includes the steps of: acquiring quad-partitioning information of a block; acquiring bi-partitioning information of the block when the acquired quad-partitioning information of the block does not indicate four partitions; acquiring partitioning direction information for bi-partitioning of the block when the acquired bi-partitioning information of the block indicates two partitions; acquiring information on whether to perform any other type of partitioning, when the acquired bi-partitioning information of the block does not indicate two partitions; and acquiring additional information required for the any other type of partitioning, when the acquired information on whether to perform any other type of partitioning indicates that the any other type of partitioning is performed.

Type: Application

Filed: November 29, 2023

Publication date: March 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Ho Chan RYU, Yong Jo AHN
METHOD OF RENDERING ONE OR MORE CAPTURED AUDIO SOUNDFIELDS TO A LISTENER

Publication number: 20240098435

Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.

Type: Application

Filed: September 18, 2023

Publication date: March 21, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Richard J. CARTWRIGHT, David S. MCGRATH, Glenn N. DICKINS
METHOD FOR SIGNALING A STEP-WISE TEMPORAL SUB-LAYER ACCESS SAMPLE

Publication number: 20240098286

Abstract: An electronic device for encoding a picture is described. The electronic device includes a processor and instructions stored in memory that are in electronic communication with the processor. The instructions are executable to encode a step-wise temporal sub-layer access (STSA) sample grouping. The instructions are further executable to send and/or store the STSA sample grouping.

Type: Application

Filed: November 21, 2023

Publication date: March 21, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventor: Sachin G. Deshpande
SYSTEMS AND METHODS FOR LOCAL DIMMING IN MULTI-MODULATION DISPLAYS

Publication number: 20240098229

Abstract: Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller, a first modulator, receiving light from the light source and rendering a halftone image of said the input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from input image, correcting for misalignment between the first and second modulators and calibrating the projector system—e.g. over time—for continuous image improvement are also disclosed.

Type: Application

Filed: November 22, 2023

Publication date: March 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jerome SHIELDS, Martin J. RICHARDS, Juan P. PERTIERRA
PARAMETRIC RECONSTRUCTION OF AUDIO SIGNALS

Publication number: 20240087584

Abstract: An encoding system encodes an N-channel audio signal (X), wherein N?3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters ({tilde over (C)}, {tilde over (P)}). In a decoding system, a decorrelating section outputs, based on the downmix signal, an (N?1)-channel decorrelated signal (Z); a dry upmix section maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section combines outputs from the upmix sections to obtain a reconstructed signal ({circumflex over (X)}) corresponding to the signal to be reconstructed.

Type: Application

Filed: September 25, 2023

Publication date: March 14, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Lars Villemoes, Heidi-Maria Lehtonen, Heiko Purnhagen, Toni Hirvonen
FRAME-RATE SCALABLE VIDEO CODING

Publication number: 20240089474

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Application

Filed: November 10, 2023

Publication date: March 14, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
INTEGRATION OF HIGH FREQUENCY AUDIO RECONSTRUCTION TECHNIQUES

Publication number: 20240087590

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Application

Filed: November 14, 2023

Publication date: March 14, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Kristofer KJOERLING, Lars VILLEMOES, Heiko PURNHAGEN, Per EKSTRAND
IMAGE ENCODING AND DECODING APPARATUS, AND IMAGE ENCODING AND DECODING METHOD

Publication number: 20240089438

Abstract: According to the present invention, an adaptive scheme is applied to an image encoding apparatus that includes an inter-predictor, an intra-predictor, a transformer, a quantizer, an inverse quantizer, and an inverse transformer, wherein input images are classified into two or more different categories, and two or more modules from among the inter-predictor, the intra-predictor, the transformer, the quantizer, and the inverse quantizer are implemented to perform respective operations in different schemes according to the category to which an input image belongs. Thus, the invention has the advantage of efficiently encoding an image without the loss of important information as compared to a conventional image encoding apparatus which adopts a packaged scheme.

Type: Application

Filed: November 21, 2023

Publication date: March 14, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jong Ki HAN, Chan Won SEO, Kwang Hyun CHOI
QUANTIZATION PARAMETER SIGNALING

Publication number: 20240080489

Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.

Type: Application

Filed: November 10, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Fangjun PU, Taoran LU, Peng YIN, Sean Thomas MCCARTHY
METHODS AND SYSTEMS FOR RENDERING OBJECT BASED AUDIO

Publication number: 20240079015

Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Type: Application

Filed: September 19, 2023

Publication date: March 7, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
PERCEPTUAL ENHANCEMENT FOR BINAURAL AUDIO RECORDING

Publication number: 20240080608

Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.

Type: Application

Filed: December 14, 2021

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
FRAME-RATE SCALABLE VIDEO CODING

Publication number: 20240080465

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Application

Filed: November 13, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
CODING AND DECODING OF INTERLEAVED IMAGE DATA

Publication number: 20240080479

Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.

Type: Application

Filed: November 7, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
PERCEPTUALLY-BASED LOSS FUNCTIONS FOR AUDIO ENCODING AND DECODING BASED ON MACHINE LEARNING

Publication number: 20240079019

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

Type: Application

Filed: November 13, 2023

Publication date: March 7, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Publication number: 20240073459

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Application

Filed: October 31, 2023

Publication date: February 29, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

Publication number: 20240073444

Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.

Type: Application

Filed: November 8, 2023

Publication date: February 29, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jong Ki HAN, Jae Yung LEE
MULTIPLE STAGE MODULATION PROJECTOR DISPLAY SYSTEMS HAVING EFFICIENT LIGHT UTILIZATION

Publication number: 20240073357

Abstract: Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.

Type: Application

Filed: January 31, 2022

Publication date: February 29, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Martin J. Richards
DETERMINING DIALOG QUALITY METRICS OF A MIXED AUDIO SIGNAL

Publication number: 20240071411

Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.

Type: Application

Filed: January 4, 2022

Publication date: February 29, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai SUN, Lie LU, Shaofan YANG, Rhonda J. WILSON, Dirk Jeroen BREEBAART
METHODS AND DEVICES FOR JOINT MULTICHANNEL CODING

Publication number: 20240062765

Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component which each performs stereo decoding on one channel resulting from the first stereo decoding component, and one channel resulting from the second stereo decoding component.

Type: Application

Filed: September 1, 2023

Publication date: February 22, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Kristofer KJOERLING, Harald MUNDT, Heiko PURNHAGEN
SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20240056610

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: October 13, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS

Publication number: 20240056755

Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.

Type: Application

Filed: August 28, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Florian KEILER, Johannes Boehm
BINAURAL SIGNAL POST-PROCESSING

Publication number: 20240056760

Abstract: A method of audio processing includes performing spatial analysis on a binaural signal to estimate level differences and phase differences characteristic of a binaural filter of the binaural signal, performing object extraction on the binaural audio signal using the estimated level and phase differences to generate a left/right main component signal and a left/right residual component signal. The system may process the left/right main and left/right residual components differently using different object processing parameters for e.g. repositioning, equalization, compression, upmixing, channel remapping or storage to generate a processed binaural signal that provides an improved listening experience. Repositioning may be based on head tracking sensor data.

Type: Application

Filed: December 16, 2021

Publication date: February 15, 2024

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Dirk Jeroen BREEBAART, Giulio CENGARLE, C. Phillip BROWN
METHOD AND APPARATUS FOR PROCESSING OF AUDIO DATA USING A PRE-CONFIGURED GENERATOR

Publication number: 20240055006

Abstract: Described herein is a method for setting up a decoder for generating processed audio data from an audio bitstream, the decoder comprising a Generator of a Generative Adversarial Network, GAN, for processing of the audio data, wherein the method includes the steps of (a) pre-configuring the Generator for processing of audio data with a set of parameters for the Generator, the parameters being determined by training, at training time, the Generator using the full concatenated distribution; and (b) pre-configuring the decoder to determine, at decoding time, a truncation mode for modifying the concatenated distribution and to apply the determined truncation mode to the concatenated distribution. Described are further a method of generating processed audio data from an audio bitstream using a Generator of a Generative Adversarial Network, GAN, for processing of the audio data and a respective apparatus. Moreover, described are also respective systems and computer program products.

Type: Application

Filed: December 15, 2021

Publication date: February 15, 2024

Applicant: Dolby International AB

Inventor: Arijit BISWAS
ORCHESTRATION OF ACOUSTIC DIRECT SEQUENCE SPREAD SPECTRUM SIGNALS FOR ESTIMATION OF ACOUSTIC SCENE METRICS

Publication number: 20240056757

Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.

Type: Application

Filed: December 2, 2021

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
METHOD FOR ENCODING AND DECODING IMAGE USING ADAPTIVE DEBLOCKING FILTERING, AND APPARATUS THEREFOR

Publication number: 20240056613

Abstract: Disclosed is an encoding/decoding method and apparatus related to adaptive deblocking filtering. There is provided an image decoding method performing adaptive filtering in inter-prediction, the method including: reconstructing, from a bitstream, an image signal including a reference block on which block matching is performed in inter-prediction of a current block to be encoded; obtaining, from the bitstream, a flag indicating whether the reference block exists within a current picture where the current block is positioned; reconstructing the current block by using the reference block; adaptively applying an in-loop filter for the reconstructed current block based on the obtained flag; and storing the current block to which the in-loop filter is or is not applied in a decoded picture buffer (DPB).

Type: Application

Filed: October 24, 2023

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Je Chang JEONG, Ki Baek KIM
DIGITAL FILTERBANK FOR SPECTRAL ENVELOPE ADJUSTMENT

Publication number: 20240055010

Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.

Type: Application

Filed: August 21, 2023

Publication date: February 15, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventor: Per EKSTRAND
MULTISOURCE MEDIA DELIVERY SYSTEMS AND METHODS

Publication number: 20240056649

Abstract: A method for delivering media content to one or more clients over a distributed system is disclosed. The method may include generating a plurality of network-coded symbols from a plurality of original symbols representing a first media asset. The method may further include generating an original plurality of coded variants of the first media asset. The method may further include distributing a first coded variant of the original plurality of coded variants to a first cache on a first server device for storage in the first cache. The method may further include distributing a second coded variant of the original plurality of coded variants to a second cache on a second server device for storage in the second cache.

Type: Application

Filed: December 16, 2021

Publication date: February 15, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jeffrey RIEDMILLER, Mingchao YU, Jason Michael CLOUD
ORCHESTRATION OF ACOUSTIC DIRECT SEQUENCE SPREAD SPECTRUM SIGNALS FOR ESTIMATION OF ACOUSTIC SCENE METRICS

Publication number: 20240048931

Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.

Type: Application

Filed: December 2, 2021

Publication date: February 8, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
PERSONALIZED HRTFS VIA OPTICAL CAPTURE

Publication number: 20240048932

Abstract: An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.

Type: Application

Filed: August 24, 2023

Publication date: February 8, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: McGregor Steele JOYNER, Alex BRANDMEYER, Scott DALY, Jeffrey Ross BAKER, Andrea FANELLI, Poppy Anne Carrie CRUM
CROSS PRODUCT ENHANCED SUBBAND BLOCK BASED HARMONIC TRANSPOSITION

Publication number: 20240046940

Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

Type: Application

Filed: October 5, 2023

Publication date: February 8, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventor: Lars Villemoes
VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Publication number: 20240039499

Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

Type: Application

Filed: July 20, 2023

Publication date: February 1, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
METHOD AND DEVICE FOR DECODING A HIGHER-ORDER AMBISONICS (HOA) REPRESENTATION OF AN AUDIO SOUNDFIELD

Publication number: 20240040327

Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (?)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (?)} was determined based on the spherical modelling grid and the HOA order N.

Type: Application

Filed: July 26, 2023

Publication date: February 1, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Johannes BOEHM, Florian KEILER

prev 1 2 3 4 5 6 … next