Dolby Labs Patents

Dolby Laboratories, Inc. licenses its audio technologies, including its noise-reduction systems, to the media industry. Its product portfolio includes Dolby Digital Plus (DD+), Dolby Digital (DD), AAC and HE-AAC, Dolby TrueHD, Dolby Atmos, Dolby AC-4, Dolby Voice and Dolby Vision. Products that incorporate Dolby technologies include televisions, set-top boxes, computers, DVD and Blu-ray devices, soundbars, smartphones, tablets, video game consoles, and automobile entertainment systems.

Dolby Labs Patents by Type

Dolby Labs Patents Granted: Dolby Labs patents that have been granted by the United States Patent and Trademark Office (USPTO).
Dolby Labs Patent Applications: Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

Method and device for applying Dynamic Range Compression to a Higher Order Ambisonics signal

Patent number: 11838738

Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.

Type: Grant

Filed: January 8, 2021

Date of Patent: December 5, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Johannes Boehm, Florian Keiler
METHOD AND APPARTUS FOR AUDIO PROCESSING USING A NESTED CONVOLUTIONAL NEURAL NETWORK ARCHITECHTURE

Publication number: 20230386500

Abstract: Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. The CNN architecture may comprise a multi-scale input block and a multi-scale nested block. The multi-scale input block may be configured to receive input data and to generate a first downsampled input data set by downsampling the input data. The multi-scale nested block may comprise a first encoding layer configured to generate a first encoded data set by performing a convolution based on the input data. The multi-scale nested block may comprise a second encoding layer configured to generate a second encoded data set by performing a convolution based on the first downsampled input data set. Furthermore, the multi-scale nested block may comprise a first convolutional layer configured to generate a first output data set by upsampling the second encoded data set, concatenating the first encoded data set and the upsampled second encoded data set, and performing a convolution.

Type: Application

Filed: October 19, 2021

Publication date: November 30, 2023

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jundai Sun, Lie Lu, Zhiwei Shuang
BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING

Publication number: 20230385013

Abstract: Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

Type: Application

Filed: April 24, 2023

Publication date: November 30, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Nicolas R. TSINGOS, Rhonda WILSON, Sunil BHARITKAR, C. Phillip BROWN, Alan J. SEEFELDT, Remi AUDFRAY
ASYMMETRICAL HIGH-FREQUENCY WAVEGUIDE, 3-AXIS RIGGING, AND SPHERICAL ENCLOSURE FOR SURROUND SPEAKERS

Publication number: 20230388702

Abstract: Embodiments are described for a high-frequency waveguide that improves the performance of large-scale surround sound and immersive audio environments. A horn waveguide is configured to be asymmetric about one of a vertical axis and horizontal axis of the waveguide to form an asymmetric horn waveguide. A spherical enclosure surrounds the asymmetric horn waveguide to form a horn speaker, and a three-axis mounting system is configured to fix the horn speaker to one of a wall or ceiling surface of the venue, wherein the mounting system facilitates rotating the horn speaker to a location that provides maximum coverage of the venue within the passband of the asymmetric horn waveguide.

Type: Application

Filed: April 18, 2023

Publication date: November 30, 2023

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Garth Norman SHOWALTER, Mario DI COLA, John Michael GOTT, Patrick Ross SPURLOCK, Gregory Lynn CARNEY, Bryce Joseph GOTT
SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

Publication number: 20230388738

Abstract: Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.

Type: Application

Filed: May 1, 2023

Publication date: November 30, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Nicolas R. Tsingos, Charles Q. Robinson, Jurgen W. Scharpf
PROJECTION SYSTEM AND METHOD WITH ADJUSTABLE ANGLE ILLUMINATION USING LENS DECENTRATION

Publication number: 20230384656

Abstract: A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a first lens group and a second lens group, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as off-state light to a light dump; determining a deviation between an actual angle of orientation and an expected angle of orientation of a respective micromirror of the plurality of micromirrors; calculating a first amount of lateral adjustment corresponding to the first lens group and a second amount of lateral adjustment corresponding to the second lens group, and actuating the first and second lens groups according to the corresponding first and second amount.

Type: Application

Filed: October 21, 2021

Publication date: November 30, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: John David JACKSON, Darren HENNIGAN, Nathan Shawn WAINWRIGHT
TRIM-PASS CORRECTION FOR CLOUD-BASED CODING OF HDR VIDEO

Publication number: 20230388555

Abstract: In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment. When a parent scene of a secondary scene is processed by two or more neighboring nodes, initial forward reshaping functions and trim-pass correction parameters are adjusted using reference tone-mapping functions and updated scene-based trim-pass correction parameters.

Type: Application

Filed: September 17, 2021

Publication date: November 30, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Harshad Kadu, Guan-Ming Su
ADAPTIVE BLOCK SWITCHING WITH DEEP NEURAL NETWORKS

Publication number: 20230386486

Abstract: The present invention relates to a method for predicting transform coefficients representing frequency content of an adaptive block length media signal, by receiving a frame and receiving block length information indicating a number of quantized transform coefficients for each block in the frame, the number of quantized transform coefficients being one of a first or second number, wherein the first number is greater than the second number, determining a first block has the second number of quantized transform coefficients, converting the first block into a converted block having the first number of quantized transform coefficients, conditioning a main neural network trained to predict at least one output variable given at least one conditioning variable, the at least one conditioning variable being based on information regarding the converted block and block length information for the first block, providing at least one predicted transform coefficients from an output stage of the main neural network.

Type: Application

Filed: October 15, 2021

Publication date: November 30, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Cong ZHOU, Grant A. DAVIDSON, Mark S. VINTON
Audio decoder for interleaving signals

Patent number: 11830510

Abstract: A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal comprising spectral coefficients corresponding to frequencies up to a first cross-over frequency for a time frame and performing parametric decoding at a second cross-over frequency for the time frame to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method also includes extracting from the encoded audio bitstream a second waveform-coded signal comprising spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency for the time frame and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal for the time frame.

Type: Grant

Filed: August 31, 2021

Date of Patent: November 28, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Heiko Purnhagen, Harald Mundt, Karl Jonas Roeden, Leif Sehlstrom
Methods and apparatus for decoding a compressed HOA signal

Patent number: 11830504

Abstract: Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components.

Type: Grant

Filed: September 30, 2022

Date of Patent: November 28, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
Coding dense transient events with companding

Patent number: 11830507

Abstract: Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A method of processing an audio signal includes the following operations. A system receives an audio signal. The system determines that a first frame of the audio signal includes a sparse transient signal. The system determines that a second frame of the audio signal includes a dense transient signal. The system compresses/expands (compands) the audio signal using a companding rule that applies a first companding exponent to the first frame of the audio signal and applies a second companding exponent to the second frame of the audio signal, each companding exponent being used to derive a respective degree of dynamic range compression and expansion for a corresponding frame. The system then provides the companded audio signal to a downstream device.

Type: Grant

Filed: August 21, 2019

Date of Patent: November 28, 2023

Assignee: Dolby International AB

Inventors: Arijit Biswas, Harald Mundt
Integration of high frequency reconstruction techniques with reduced post-processing delay

Patent number: 11830509

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: March 3, 2023

Date of Patent: November 28, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
Method and apparatus for processing of auxiliary media streams embedded in a MPEGH 3D audio stream

Patent number: 11830508

Abstract: The disclosure relates to methods, apparatus and systems for side load processing of packetized media streams. In an embodiment, the apparatus comprises: a receiver for receiving a bitstream, and a splitter for identifying a packet type in the bitstream and splitting, based on the identification of a value of the packet type in the bit stream into a main stream and an auxiliary stream.

Type: Grant

Filed: December 8, 2021

Date of Patent: November 28, 2023

Assignee: DOLBY INTERNATIONAL AB

Inventors: Stephan Schreiner, Christof Fersch
BACKWARD-COMPATIBLE INTEGRATION OF HARMONIC TRANSPOSER FOR HIGH FREQUENCY RECONSTRUCTION OF AUDIO SIGNALS

Publication number: 20230377589

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.

Type: Application

Filed: July 31, 2023

Publication date: November 23, 2023

Applicant: DOLBY INTERNATIONAL AB

Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
REAL-TIME PACKET LOSS CONCEALMENT USING DEEP GENERATIVE NETWORKS

Publication number: 20230377584

Abstract: The present disclosure relates to a method and system for performing packet loss concealment using a neural network system. The method comprises obtaining a representation of an incomplete audio signal, inputting the representation of the incomplete audio signal to an encoder neural network and outputting a latent representation of a predicted complete audio signal. The latent representation is input to a decoder neural network which outputs a representation of a predicted complete audio signal comprising a reconstruction of the original portion of the complete audio signal, wherein said encoder neural network and said decoder neural network have been trained with an adversarial neural network.

Type: Application

Filed: October 14, 2021

Publication date: November 23, 2023

Applicant: DOLBY INTERNATIONAL AB

Inventors: Santiago PASCUAL, Joan SERRA, Jordi PONS PUIG
Integration of high frequency reconstruction techniques with reduced post-processing delay

Patent number: 11823695

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: March 3, 2023

Date of Patent: November 21, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
Audio encoder and decoder with dynamic range compression metadata

Patent number: 11823693

Abstract: An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.

Type: Grant

Filed: August 1, 2022

Date of Patent: November 21, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Jeffrey Riedmiller, Michael Ward
Integration of high frequency reconstruction techniques with reduced post-processing delay

Patent number: 11823694

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: March 3, 2023

Date of Patent: November 21, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
Integration of high frequency reconstruction techniques with reduced post-processing delay

Patent number: 11823696

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: March 3, 2023

Date of Patent: November 21, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
DEEP-LEARNING BASED SPEECH ENHANCEMENT

Publication number: 20230368807

Abstract: A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain forming a contracting path. The encoder is followed by a corresponding decoder with steady up-sampling along the frequency domain forming an expanding path. The decoder receives scaled output feature maps from the encoder at a corresponding level. The decoder is followed by a classification block that generates a speech value indicating an amount of speech present for each frequency band of the plurality of frequency bands at each frame of the plurality of frames.

Type: Application

Filed: October 29, 2021

Publication date: November 16, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Xiaoyu LIU, Michael Getty HORGAN, Roy M. FEJGIN, Paul HOLMBERG
DECODING AUDIO BITSTREAMS WITH ENHANCED SPECTRAL BAND REPLICATION METADATA IN AT LEAST ONE FILL ELEMENT

Publication number: 20230368805

Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.

Type: Application

Filed: May 16, 2023

Publication date: November 16, 2023

Applicant: Dolby International AB

Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
COLOR TRANSFORMATION FOR HDR VIDEO WITH A CODING-EFFICIENCY CONSTRAINT

Publication number: 20230368344

Abstract: Using a standard-based RGB to YCbCr color transform a new RGB to YCC 3×3 transformation matrix and a 3×1 offset vector are derived under a set of coding-efficiency constraints. The new RGB to YCC 3×3 transform comprises a luminance scaling factor and a 2×2 chroma sub-matrix that preserves the energy of the standard-based RGB to YCbCr transform while maintaining or improving coding efficiency. It also adds support for an authorization or watermarking mechanism in streaming video applications. Examples of using the new color transform using image reshaping are also provided.

Type: Application

Filed: October 14, 2021

Publication date: November 16, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Guan-Ming SU
ADAPTIVE LOCAL RESHAPING FOR SDR-TO-HDR UP-CONVERSION

Publication number: 20230370646

Abstract: A global index value is generated for selecting a global reshaping function for an input image of a relatively low dynamic range using luma codewords in the input image. Image filtering is applied to the input image to generate a filtered image. The filtered values of the filtered image provide a measure of local brightness levels in the input image. Local index values are generated for selecting specific local reshaping functions for the input image using the global index value and the filtered values of the filtered image. A reshaped image of a relatively high dynamic range is generated by reshaping the input image with the specific local reshaping functions selected using the local index values.

Type: Application

Filed: October 1, 2021

Publication date: November 16, 2023

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Tsung-Wei Huang, Guan-Ming Su, Neeraj J. Gadgil
Methods for parametric multi-channel encoding

Patent number: 11817108

Abstract: The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal.

Type: Grant

Filed: October 28, 2022

Date of Patent: November 14, 2023

Assignee: DOLBY INTERNATIONAL AB

Inventors: Tobias Friedrich, Alexander Mueller, Karsten Linzmeier, Claus-Christian Spenger, Tobias R. Wagenblass
Perceptually-based loss functions for audio encoding and decoding based on machine learning

Patent number: 11817111

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.

Type: Grant

Filed: April 10, 2019

Date of Patent: November 14, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Roy M. Fejgin, Grant A. Davidson, Chih-Wei Wu, Vivek Kumar
Content and environmentally aware environmental noise compensation

Patent number: 11817114

Abstract: Some implementations involve receiving a content stream that includes audio data, determining a content type corresponding to the content stream and determining, based at least in part on the content type, a noise compensation method. Some examples involve performing the noise compensation method on the audio data to produce noise-compensated audio data, rendering the noise-compensated audio data for reproduction via a set of audio reproduction transducers of the audio environment, to produce rendered audio signals, and providing the rendered audio signals to at least some audio reproduction transducers of the audio environment.

Type: Grant

Filed: December 9, 2019

Date of Patent: November 14, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Timothy Alan Port, Daniel Steven Templeton, Jack Gregory Hays
Adjustable trade-off between quality and computation complexity in video codecs

Patent number: 11818400

Abstract: A backward reshaping mapping table is initially generated as an inverse of a forward reshaping mapping table. The backward reshaping mapping table is updated by replacing the content-mapped luminance codewords with forward reshaped luminance codewords generated by applying a luminance forward mapping to the sampled luminance codewords. The luminance forward mapping is constructed from the forward reshaping mapping table. The backward reshaping mapping table and the luminance forward mapping are used to generate backward reshaping mappings for creating a reconstructed image from a forward reshaped image. The forward reshaped image is encoded, in a video signal, along with image metadata specifying the backward reshaping mappings. A recipient device of the video signal applies the backward reshaping mappings to the forward reshaped image to create the reconstructed image of the second dynamic range.

Type: Grant

Filed: October 16, 2020

Date of Patent: November 14, 2023

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Qing Song, Arun Raj, Guan-Ming Su
Cross product enhanced subband block based harmonic transposition

Patent number: 11817110

Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

Type: Grant

Filed: June 1, 2022

Date of Patent: November 14, 2023

Assignee: DOLBY INTERNATIONAL AB

Inventor: Lars Villemoes
Frame-rate scalable video coding

Patent number: 11818372

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Grant

Filed: January 12, 2023

Date of Patent: November 14, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
MEDIA-COMPENSATED PASS-THROUGH AND MODE-SWITCHING

Publication number: 20230359430

Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.

Type: Application

Filed: July 12, 2023

Publication date: November 9, 2023

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Mark Alexander, Chunjian Li, Joshua Brandon Lando, Alan J. Seefeldt, C. Phillip Brown, Dirk Jeroen Breebaart
AUDIO DECODER AND DECODING METHOD

Publication number: 20230360659

Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.

Type: Application

Filed: July 13, 2023

Publication date: November 9, 2023

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson
METHOD AND DEVICE FOR PROCESSING A BINAURAL RECORDING

Publication number: 20230360662

Abstract: The present invention relates to a method and device for processing a first and a second audio signal representing an input binaural audio signal acquired by a binaural recording device. The present invention further relates to a method for rendering a binaural audio signal on a speaker system. The method for processing a binaural signal comprising extracting audio information from the first audio signal, computing a band gain for reducing noise in the first audio signal and applying the band gains to respective frequency bands of the first audio signal in accordance with a dynamic scaling factor, to provide a first output audio signal. Wherein the dynamic scaling factor has a value between zero and one and is selected so as to reduce quality degradation for the first audio signal.

Type: Application

Filed: September 15, 2021

Publication date: November 9, 2023

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Zhiwei Shuang, Yuanxing Ma, Yang Liu, Ziyu Yang, Giulio Cengarle
METHOD AND SYSTEM FOR HANDLING LOCAL TRANSITIONS BETWEEN LISTENING POSITIONS IN A VIRTUAL REALITY ENVIRONMENT

Publication number: 20230362575

Abstract: A method (910) for rendering an audio signal in a virtual reality rendering environment (180) is described. The method (910) comprises rendering (911) an origin audio signal of an audio source (311, 312, 313) from an origin source position on an origin sphere (114) around an origin listening position (301) of a listener (181). Furthermore, the method (900) comprises determining (912) that the listener (181) moves from the origin listening position (301) to a destination listening position (302). In addition, the method (900) comprises determining (913) a destination source position of the audio source (311, 312, 313) on a destination sphere (114) around the destination listening position (302) based on the origin source position, and determining (914) a destination audio signal of the audio source (311, 312, 313) based on the origin audio signal.

Type: Application

Filed: July 13, 2023

Publication date: November 9, 2023

Applicant: DOLBY INTERNATIONAL AB

Inventors: Leon TERENTIV, Christof FERSCH, Daniel FISCHER
Quantization control for variable bit depth

Patent number: 11812020

Abstract: The quantization parameter QP is well-known in digital video compression as an indication of picture quality. Digital symbols representing a moving image are quantized with a quantizing step that is a function QSN of the quantization parameter QP, which function QSN has been normalized to the most significant bit of the bit depth of the digital symbols. As a result, the effect of a given QP is essentially independent of bit depth a particular QP value has a standard effect on image quality, regardless of bit depth. The invention is useful, for example, in encoding and decoding at different bit depths, to generate compatible, bitstreams having different bit depths, and to allow different bit depths for different components of a video signal by compressing each with the same fidelity (i.e., the same QP).

Type: Grant

Filed: March 15, 2021

Date of Patent: November 7, 2023

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Walter C. Gish, Christopher J. Vogt
Integration of high frequency audio reconstruction techniques

Patent number: 11810592

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: February 23, 2023

Date of Patent: November 7, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
Integration of high frequency audio reconstruction techniques

Patent number: 11810589

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: November 15, 2022

Date of Patent: November 7, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
Integration of high frequency audio reconstruction techniques

Patent number: 11810590

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: February 23, 2023

Date of Patent: November 7, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
Integration of high frequency audio reconstruction techniques

Patent number: 11810591

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.

Type: Grant

Filed: February 23, 2023

Date of Patent: November 7, 2023

Assignee: Dolby International AB

Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
MDCT-based complex prediction stereo coding

Patent number: 11810582

Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.

Type: Grant

Filed: December 23, 2021

Date of Patent: November 7, 2023

Assignee: DOLBY INTERNATIONAL AB

Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
SYSTEM FOR AUTOMATED MULTITRACK MIXING

Publication number: 20230352058

Abstract: A deep-learning-based system for performing automated multitrack mixing based on a plurality of input audio tracks is described herein. The system comprises one or more instances of a deep-learning-based first network and one or more instances of a deep-learning-based second network. Particularly, the first network is configured to, based on the 5 input audio tracks, generate parameters for use in the automated multitrack mixing. The second network is configured to, based on the parameters, apply signal processing and at least one mixing gain to the input audio tracks, for generating an output mix of the audio tracks.

Type: Application

Filed: June 16, 2021

Publication date: November 2, 2023

Applicant: Dolby International AB

Inventors: Christian James Steinmetz, Joan Serra
Method of Coding and Decoding Images, Coding and Decoding Device and Computer Programs Corresponding Thereto

Publication number: 20230353740

Abstract: A method for coding includes; segmenting an image into blocks; grouping blocks into a number of subsets; coding, using an entropy coding module, each subset, by associating digital information with symbols of each block of a subset, including, for the first block of the image, initializing state variables of the coding module; and generating a data sub-stream representative of at least one of the coded subsets of blocks. Where a current block is the first block to be coded of a subset, symbol occurrence probabilities for the first current block are determined based on those for a coded and decoded predetermined block of at least one other subset. Where the current block is the last coded block of the subset: writing, in the sub-stream representative of the subset, the entire the digital information associated with the symbols during coding of the blocks of the subset, and implementing the initializing sub-step.

Type: Application

Filed: July 5, 2023

Publication date: November 2, 2023

Applicant: DOLBY INTERNATIONAL AB

Inventors: Felix Henry, Stephane Pateux, Gordon Clare
Method of Coding and Decoding Images, Coding and Decoding Device and Computer Programs Corresponding Thereto

Publication number: 20230353781

Abstract: A method of coding at least one image comprising the steps of splitting the image into a plurality of blocks, of grouping said blocks into a predetermined number of subsets of blocks, of coding each of said subsets of blocks in parallel, the blocks of a subset considered being coded according to a predetermined sequential order of traversal. The coding step comprises, for a current block of a subset considered, the sub-step of predictive coding of said current block with respect to at least one previously coded and decoded block, and the sub-step of entropy coding of said current block on the basis of at least one probability of appearance of a symbol.

Type: Application

Filed: July 6, 2023

Publication date: November 2, 2023

Applicant: DOLBY INTERNATIONAL AB

Inventors: Felix Henry, Stephane Pateux
METHOD, APPARATUS OR SYSTEMS FOR PROCESSING AUDIO OBJECTS

Publication number: 20230353970

Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

Type: Application

Filed: July 10, 2023

Publication date: November 2, 2023

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Dirk Jeroen BREEBAART, Lie LU, Nicolas R. TSINGOS, Antonio MATEOS SOLE
SCALABLE SYSTEMS FOR CONTROLLING COLOR MANAGEMENT COMPRISING VARYING LEVELS OF METADATA

Publication number: 20230353762

Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

Type: Application

Filed: July 7, 2023

Publication date: November 2, 2023

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
Audio channel spatial translation

Patent number: 11805379

Abstract: The present invention is directed to methods and apparatus for translating a first plurality of audio input channels to a second plurality of audio output channels. This includes determining that there is pair-wise coding among any of the first plurality of audio input channels, determining an input/output-mapping matrix for mapping at least a first set of the first plurality of audio input channels to at least a second set of the second plurality of audio output channels; and deriving the second plurality of audio output channels based on first plurality of audio input channels, the input/output-mapping matrix and the determined pair-wise coding. The first plurality of audio input channels represent the same soundfield represented by the second plurality of audio output channels.

Type: Grant

Filed: July 8, 2022

Date of Patent: October 31, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventor: Mark F. Davis
Scalable voice scene media server

Patent number: 11803351

Abstract: A communication system, method, and computer-readable medium therefor comprise a media server configured to receive a plurality of audio streams from a corresponding plurality of client devices, the media server including circuitry configured to rank the plurality of audio streams based on a predetermined metric, group a first portion of the plurality of audio streams into a first set, the first portion of the plurality of audio streams being the N highest-ranked audio streams, group a second portion of the plurality of audio streams into a second set, the second portion of the plurality of audio streams being the M lowest-ranked audio streams, forward respective audio streams of the first set to a receiver device, and discard respective audio streams of the second set, wherein N and M are independent integers.

Type: Grant

Filed: April 3, 2020

Date of Patent: October 31, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Glenn N. Dickins, Feng Deng, Michael Eckert, Craig Johnston, Paul Holmberg
Display management for high dynamic range images

Patent number: 11803948

Abstract: Methods and systems for the display management of HDR video signals are presented. The mapping is based on tone mapping and color volume mapping which map an input signal with an input dynamic range and color volume to a target display with a target dynamic range and color volume. Both a global tone-mapping and precision-mapping methods using pyramid filtering are presented.

Type: Grant

Filed: April 16, 2020

Date of Patent: October 31, 2023

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Jaclyn Anne Pytlarz, Elizabeth G. Pieri
AUDIO DECODER FOR AUDIO CHANNEL RECONSTRUCTION

Publication number: 20230345176

Abstract: A method and apparatus for reconstructing N audio channels from M audio channels is disclosed. The method includes receiving a bitstream containing an encoded audio signal representing the M audio channels and decoding the encoded audio signal to obtain a frequency domain representation of the M audio channels. The method further includes extracting a parameter from the bitstream and reconstructing at least one of the N audio channels using the parameter. The parameter represents an angle between two signals, at least one of which is included in the M audio channels.

Type: Application

Filed: May 3, 2023

Publication date: October 26, 2023

Applicant: DOLBY INTERNATIONAL AB

Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Jonas ENGDEGARD, Jonas ROEDEN, Kristofer KJOERLING
Speaker

Patent number: D1004572

Type: Grant

Filed: July 19, 2021

Date of Patent: November 14, 2023

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Drew Alexander Walcott, Grayson H. Byrd, Peter Michaelian, Brian Edward Renz, John Carson Stewart, Cody Michael Proksa, Vincent Voron, Sripal S. Mehta, Alan J. Seefeldt
Speaker

Patent number: D1004573

Type: Grant

Filed: July 19, 2021

Date of Patent: November 14, 2023

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Drew Alexander Walcott, Grayson H. Byrd, Peter Michaelian, Brian Edward Renz, John Carson Stewart, Cody Michael Proksa, Vincent Voron, Sripal S. Mehta, Alan J. Seefeldt

prev 1 2 3 4 5 6 7 8 9 … next