Patents Assigned to Dolby Laboratories Licensing Corporation

AUDIO PROCESSING IN IMMERSIVE AUDIO SERVICES

Publication number: 20250088816

Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.

Type: Application

Filed: November 26, 2024

Publication date: March 13, 2025

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee
DETECTION AND ENHANCEMENT OF SPEECH IN BINAURAL RECORDINGS

Publication number: 20250078858

Abstract: Disclosed herein are method, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts.

Type: Application

Filed: January 12, 2022

Publication date: March 6, 2025

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Giulio CENGARLE, Yuanxing MA
SYSTEM AND METHOD FOR NON-DESTRUCTIVELY NORMALIZING LOUDNESS OF AUDIO SIGNALS WITHIN PORTABLE DEVICES

Publication number: 20250078849

Abstract: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.

Type: Application

Filed: November 18, 2024

Publication date: March 6, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Jeffrey RIEDMILLER, Harald MUNDT, Michael SCHUG, Martin WOLTERS
METHOD AND DEVICE FOR DECODING A HIGHER-ORDER AMBISONICS (HOA) REPRESENTATION OF AN AUDIO SOUNDFIELD

Publication number: 20250080937

Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (?)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (?)} was determined based on the spherical modelling grid and the HOA order N.

Type: Application

Filed: September 18, 2024

Publication date: March 6, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Johannes BOEHM, Florian KEILER
PREDICTIVE MOTION VECTOR CODING

Publication number: 20250080749

Abstract: Overlapped block disparity estimation and compensation is described. Compensating for images with overlapped block disparity compensation (OBDC) involves determining if OBDC is enabled in a video bit stream, and determining if OBDC is enabled for one or more macroblocks that neighbor a first macroblock within the video bit stream. The neighboring macroblocks may be transform coded. If OBDC is enabled in the video bit stream and for the one or more neighboring macroblocks, predictions may be made for a region of the first macroblock that has an edge adjacent with the neighboring macroblocks. OBDC can be causally applied. Disparity compensation parameters or modes may be shared amongst views or layers. A variety of predictions may be used with causally-applied OBDC.

Type: Application

Filed: November 20, 2024

Publication date: March 6, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Alexandros Tourapis, Athanasios Leontaris
Generating Binaural Audio in Response to Multi-Channel Audio Using at Least One Feedback Delay Network

Publication number: 20250080943

Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.

Type: Application

Filed: September 9, 2024

Publication date: March 6, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Kuan-Chieh YEN, Dirk Jeroen BREEBAART, Grant A. DAVIDSON, Rhonda WILSON, David M. COOPER, Zhiwei SHUANG
METHODS, APPARATUS AND SYSTEMS FOR DECOMPRESSING A HIGHER ORDER AMBISONICS (HOA) SIGNAL

Publication number: 20250080934

Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k?1)) and a frame of an ambient HOA component ({tilde over (C)}AMB(k?1)). The ambient HOA component ({tilde over (C)}AMB(k?1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (cn(k?1)) in lower positions and second HOA coefficient sequences (cAMB,n(k?1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.

Type: Application

Filed: August 2, 2024

Publication date: March 6, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
Method and apparatus for compressing and decompressing a higher order ambisonics signal representation

Patent number: 12245012

Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.

Type: Grant

Filed: October 16, 2023

Date of Patent: March 4, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Alexander Krueger, Sven Kordon, Johannes Boehm, Johann-Markus Batke
Methods and apparatus for decoding encoded HOA signals

Patent number: 12245013

Abstract: There are two representations for Higher Order Ambisonics denoted HOA: spatial domain and coefficient domain. The invention generates from a coefficient domain representation a mixed spatial/coefficient domain representation, wherein the number of said HOA signals can be variable. An aspect of the invention further relates to methods and apparatus decoding multiplexed and perceptually encoded HOA signals, including transforming a vector of PCM encoded spatial domain signals of the HOA representation to a corresponding vector of coefficient domain signals by multiplying the vector of PCM encoded spatial domain signals with a transform matrix and de-normalizing the vector of PCM encoded and normalized coefficient domain signals, wherein said de-normalizing comprises.

Type: Grant

Filed: November 22, 2023

Date of Patent: March 4, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Sven Kordon, Alexander Krueger
Method and device for deriving inter-view motion merging candidate

Patent number: 12244859

Abstract: The present invention provides a method and a device for deriving an inter-view motion merging candidate. A method for deriving an inter-view motion merging candidate, according to an embodiment of the present invention, can comprise the steps of: on the basis of encoding information of an inter-view reference block derived by means of a variation vector of a current block, determining whether or not inter-view motion merging of the current block is possible; and, if inter-view motion merging of the current block is not possible, generating an inter-view motion merging candidate of the current block by using encoding information of an adjacent block that is spatially adjacent to the inter-view reference block.

Type: Grant

Filed: May 17, 2023

Date of Patent: March 4, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Gwang Hoon Park, Young Su Heo
Methods for reducing error in environmental noise compensation systems

Patent number: 12243548

Abstract: Some noise compensation methods involve receiving microphone signals corresponding to ambient noise from a noise source location in or near an audio environment, determining or estimating a listener position in the audio environment and estimating at least one critical distance, which is a distance from the noise source location at which directly propagated sound pressure is equal to diffuse field sound pressure. Some examples involve estimating whether the listener position is within the at least one critical distance and implementing a noise compensation method for the ambient noise based, at least in part, on an estimate of whether the listener position is within the critical distance.

Type: Grant

Filed: December 8, 2020

Date of Patent: March 4, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Benjamin Alexander Jancovich, Timothy Alan Port, Andrew P. Reilly, Richard J. Cartwright
Quantization parameter signaling

Patent number: 12244867

Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.

Type: Grant

Filed: November 10, 2023

Date of Patent: March 4, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Fangjun Pu, Taoran Lu, Peng Yin, Sean Thomas McCarthy
Wrapped reshaping for codeword augmentation with neighborhood consistency

Patent number: 12244872

Abstract: An input image of a first bit depth in an input domain is received. Forward reshaping operations are performed on the input image to generate a forward reshaped image of a second bit depth in a reshaping domain. An image container containing image data derived from the forward reshaped image is encoded into an output video signal of the second bit depth.

Type: Grant

Filed: November 10, 2021

Date of Patent: March 4, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Janos Horvath, Harshad Kadu, Guan-Ming Su
Method for and apparatus for decoding/rendering an Ambisonics audio soundfield representation for audio playback using 2D setups

Patent number: 12245014

Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.

Type: Grant

Filed: August 28, 2023

Date of Patent: March 4, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Florian Keiler, Johannes Boehm
Image reshaping in video coding using rate distortion optimization

Patent number: 12244869

Abstract: Given a sequence of images in a first codeword representation, methods, processes, and systems are presented for image reshaping using rate distortion optimization, wherein reshaping allows the images to be coded in a second codeword representation which allows more efficient compression than using the first codeword representation. Syntax methods for signaling reshaping parameters are also presented.

Type: Grant

Filed: March 29, 2022

Date of Patent: March 4, 2025

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Peng Yin, Fangjun Pu, Taoran Lu, Tao Chen, Walter J. Husak, Sean Thomas McCarthy
PROCESSING OF MICROPHONE SIGNALS FOR SPATIAL PLAYBACK

Publication number: 20250071479

Abstract: Disclosed are methods and systems which convert a multi-microphone input signal to a multichannel output signal making use of a time-and frequency-varying matrix. For each time and frequency tile, the matrix is derived as a function of a dominant direction of arrival and a steering strength parameter. Likewise, the dominant direction and steering strength parameter are derived from characteristics of the multi-microphone signals, where those characteristics include values representative of the inter-channel amplitude and group-delay differences.

Type: Application

Filed: September 5, 2024

Publication date: February 27, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: David S. MCGRATH
AUDIO SIGNAL LOUDNESS CONTROL

Publication number: 20250070737

Abstract: Example embodiments disclosed herein relate to audio signal loudness control. A method for controlling loudness of an audio signal is disclosed. The method includes responsive to determining presence of a noise signal, deriving a target partial loudness adjustment based, at least in part, on at least one of a first factor related to the noise signal and a second factor related to the audio signal. The method further includes determining a target partial loudness of the audio signal based, at least in part, on the target partial loudness adjustment. Corresponding system, apparatus and computer program product are also disclosed.

Type: Application

Filed: September 5, 2024

Publication date: February 27, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Guilin MA, Xiguang ZHENG
ADAPTIVE PROCESSING WITH MULTIPLE MEDIA PROCESSING NODES

Publication number: 20250069607

Abstract: Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.

Type: Application

Filed: May 24, 2024

Publication date: February 27, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jeffrey RIEDMILLER, Regunathan RADHAKRISHNAN, Marvin PRIBADI, Farhad FARAHANI, Michael SMITHERS
SYSTEMS, METHODS AND APPARATUS FOR CONVERSION FROM CHANNEL-BASED AUDIO TO OBJECT-BASED AUDIO

Publication number: 20250069611

Abstract: Embodiments are disclosed for channel-based audio (CBA) (e.g., 22.2-ch audio) to object-based audio (OBA) conversion. The conversion includes converting CBA metadata to object audio metadata (OAMD) and reordering the CBA channels based on channel shuffle information derived in accordance with channel ordering constraints of the OAMD. The OBA with reordered channels is rendered in a playback device using the OAMD or in a source device, such as a set-top box or audio/video recorder. In an embodiment, the CBA metadata includes signaling that indicates a specific OAMD representation to be used in the conversion of the metadata. In an embodiment, pre-computed OAMD is transmitted in a native audio bitstream (e.g., AAC) for transmission (e.g., over HDMI) or for rendering in a source device. In an embodiment, pre-computed OAMD is transmitted in a transport layer bitstream (e.g., ISO BMFF, MPEG4 audio bitstream) to a playback device or source device.

Type: Application

Filed: September 10, 2024

Publication date: February 27, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Michael C. WARD, Freddie SANCHEZ, Christof FERSCH
INTERACTIVE MOTION BLUR ON MOBILE DEVICES

Publication number: 20250069200

Abstract: Novel methods and systems are described for providing interactive motion blur on an image by motion inputs from movements of the mobile device displaying the image. The device can process the motion blur by modules providing motion blur parameter estimation, blur application, and image composition based on metadata and a baseline image from the encoder. A pre-loaded filter bank can provide blur kernels for blur application.

Type: Application

Filed: December 7, 2022

Publication date: February 27, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Dae Yeol Lee, Neeraj J. Gadgil, Guan-Ming Su

prev 1 2 3 4 5 6 7 8 … next