Patents Assigned to Dolby Laboratories

VOLUMETRIC IMMERSIVE EXPERIENCE WITH MULTIPLE VIEWS

Publication number: 20250148699

Abstract: A multi-view input image covering multiple sampled views is received. A multi-view layered image stack is generated from the multi-view input image. A target view of a viewer to an image space depicted by the multi-view input image is determined based on user pose data. The target view is used to select user pose selected sampled views from among the multiple sampled views. Layered images for the user pose selected sampled views, along with alpha maps and beta scale maps for the user pose selected sampled views are encoded into a video signal to cause a recipient device of the video signal to generate a display image for rendering on the image display.

Type: Application

Filed: January 30, 2023

Publication date: May 8, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Ajit NINAN, Gregory John WARD
Luminance adjustment based on viewer adaptation state

Patent number: 12293498

Abstract: A video delivery system for luminance adjustment based upon a viewer adaptation state comprises a processor configured to: receive a source image including a current image frame including metadata corresponding to a mean luminance value of the current image frame, and the source image including an upcoming image frame including metadata corresponding to a mean luminance value of the upcoming image frame. The processor is configured to determine an ambient luminance value based on an ambient luminance, determine an incident luminance value based on the ambient luminance value and the mean luminance value, determine a difference between a current pupil size and a target pupil size, and generate an output image by modifying the source image based on a luminance adjustment factor, the luminance adjustment factor being a function of the difference between the current pupil size and the target pupil size.

Type: Grant

Filed: March 2, 2022

Date of Patent: May 6, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Jaclyn A. Pytlarz, Jake W. Zuena, Per Jonas A. Klittmark
METHODS AND DEVICES FOR RENDERING AN AMBISONICS AUDIO SIGNAL

Publication number: 20250142276

Abstract: The present document describes a method (400) for rendering an ambisonics signal using a loudspeaker arrangement comprising S loudspeakers. The method (400) comprises converting (401) a set of N ambisonics channel signals (111) into a set of unfiltered pre-rendered signals (211), with N>1 and S>1. Furthermore, the method (400) comprises performing (402) near field compensation, referred to as NFC, filtering of M unfiltered pre-rendered signals (211) of the set of unfiltered pre-rendered signals (211) to provide a set of S filtered loudspeaker channel signals (114) for rendering using the corresponding S loudspeakers.

Type: Application

Filed: February 3, 2023

Publication date: May 1, 2025

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Saketh SATHUVALLI, Christof Joseph FERSCH, Panji SETIAWAN, Tripti TIWARI, Reshma RAI
METHOD, APPARATUS OR SYSTEMS FOR PROCESSING AUDIO OBJECTS

Publication number: 20250142285

Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.

Type: Application

Filed: January 7, 2025

Publication date: May 1, 2025

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Dirk Jeroen BREEBAART, Lie LU, Nicolas R. TSINGOS, Antonio MATEOS SOLE
EFFICIENT ORIENTATION TRACKING WITH FUTURE ORIENTATION PREDICTION

Publication number: 20250142282

Abstract: The present disclosure relates to a method and system for predicting a future orientation of an orientation tracker (100). The method comprising obtaining a sequence of angular velocity samples, each angular velocity sample indicating an angular velocity at a point in time and obtaining a sequence of angular acceleration samples, each angular acceleration sample indicating an acceleration or deceleration of the angular velocity at each point in time. Wherein said method further comprises determining (S5a), for each point in time where the angular velocity is accelerating, a predicted orientation of the orientation tracker (100) based on a first order prediction of an accumulated rotation of the orientation tracker (100) and determining (S5c), for each point in time where the angular velocity is decelerating, a predicted orientation of the orientation tracker (100) based on a second order prediction of the accumulated rotation of the orientation tracker (100).

Type: Application

Filed: September 15, 2022

Publication date: May 1, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: David S. MCGRATH, Jeremy Grant STODDARD
Audio filterbank with decorrelating components

Patent number: 12289594

Abstract: A multi-input, multi-output audio process is implemented as a linear system for use in an audio filterbank to convert a set of frequency-domain input audio signals into a set of frequency-domain output signals. A transfer function from one input to one output is defined as a frequency dependent gain function. In some implementations, the transfer function includes a direct component that is substantially defined as a frequency dependent gain, and one or more decorrelated components that have frequency-varying group phase response. The transfer function is formed from a set of sub-band functions, with each sub-band function being formed from a set of corresponding component transfer functions including direct component and one or more decorrelated components.

Type: Grant

Filed: September 2, 2020

Date of Patent: April 29, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventor: David S. McGrath
TENSOR-PRODUCT B-SPLINE PREDICTION FOR HDR VIDEO IN MOBILE APPLICATIONS

Publication number: 20250133241

Abstract: Tensor-Product B-splines (TPB) have been shown to improve video quality when used to represent reshaping functions to map reshaped standard dynamic range content into high dynamic range (HDR) content; however, TPB prediction is computationally intensive and may not be supported by legacy devices. Methods and systems for backwards-compatible signaling of TPB-related metadata and a fast TPB prediction method are presented to overcome both of these limitations. Computation overhead for a TPB-based 3D look-up table is reduced by using temporary two-dimensional arrays. A remapping of the most significant bits of a legacy bit-depth parameter allows for backwards compatibility.

Type: Application

Filed: September 8, 2022

Publication date: April 24, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Harshad Kadu, Guan-Ming Su, Sheng Qu, Per Jonas Andreas Klittmark, Hariharan Ganapathy Kathirvelu
MANAGEMENT OF PROFESSIONALLY GENERATED AND USER-GENERATED AUDIO CONTENT

Publication number: 20250130756

Abstract: A system for managing user-generated content (UGC) and professionally generated content (PGC) is disclosed. The system is programmed to receive digital audio data having two channels from a social media platform. The system is programmed to extract spatial features that capture differences in the two channels from the digital audio data. The system is programmed to also extract temporal features, spectral features, and background features from the digital audio data. The system is programmed to then use the extracted features to determine whether to process the digital audio data as UGC or PGC before playback.

Type: Application

Filed: August 11, 2022

Publication date: April 24, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Shaofan YANG, Kai LI
REVERB AND NOISE ROBUST VOICE ACTIVITY DETECTION BASED ON MODULATION DOMAIN ATTENTION

Publication number: 20250131941

Abstract: A system for detecting speech from reverberant signals is disclosed. The system is programmed to receive spectral temporal amplitude data in the modulation frequency domain. The system is programmed to then enhance the spectral temporal amplitude data by reducing reverberation and other noise as well as smoothing based on certain properties of the spectral temporal spectrogram associated with the spectral temporal amplitude data. Next, the system is programmed to compute various features related to the presence of speech based on the enhanced spectral temporal amplitude data and other data in the modulation frequency domain or in the (acoustic) frequency domain. The system is programmed to then determine an extent of speech present in the audio data corresponding to the received spectral temporal amplitude data based on the various features. The system can be programmed to transmit the extent of speech present to an output device.

Type: Application

Filed: August 11, 2022

Publication date: April 24, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Shaofan YANG, Kai LI
Neural networks for high dynamic range video super-resolution

Patent number: 12283023

Abstract: Methods and systems for the super resolution of high dynamic range (HDR) video are described. Given a sequence of video frames, a current frame and two or more neighboring frames are processed by a neural-network (NN) feature extraction module, followed by a NN upscaling module, and a NN reconstruction module. In parallel, the current frame is upscaled using traditional up-sampling to generate an intermediate up-sampled frame. The output of the reconstruction module is added to the intermediate up-sampled frame to generate an output frame. Additional traditional up-sampling may be performed on the output frame to match the desired up-scaling factor, beyond the up-scaling factor for which the neural network was trained.

Type: Grant

Filed: March 17, 2023

Date of Patent: April 22, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Shruthi Suresh Rotti, Anustup Kumar Atanu Choudhury
Method and device for decoding an audio soundfield representation

Patent number: 12283279

Abstract: Soundfield signals such as e.g. Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics (HOA) uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design. A method for improved decoding an audio soundfield representation for audio playback comprises calculating a panning function (W) using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix (?) from the loudspeaker positions, calculating a pseudo-inverse mode matrix (?+) and decoding the audio soundfield representation. The decoding is based on a decode matrix (D) that is obtained from the panning function (W) and the pseudo-inverse mode matrix (?+).

Type: Grant

Filed: March 15, 2024

Date of Patent: April 22, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Johann-Markus Batke, Florian Keiler, Johannes Boehm
Bitrate distribution in immersive voice and audio services

Patent number: 12283281

Abstract: Embodiments are disclosed for bitrate distribution in immersive voice and audio services. In an embodiment, a method of encoding an IVAS bitstream comprises: receiving an input audio signal; downmixing the input audio signal into one or more downmix channels and spatial metadata; reading a set of one or more bitrates for the downmix channels and a set of quantization levels for the spatial metadata from a bitrate distribution control table; determining a combination of the one or more bitrates for the downmix channels; determining a metadata quantization level from the set of metadata quantization levels using a bitrate distribution process; quantizing and coding the spatial metadata using the metadata quantization level; generating, using the combination of one or more bitrates, a downmix bitstream for the one or more downmix channels; combining the downmix bitstream, the quantized and coded spatial metadata and the set of quantization levels into the IVAS bitstream.

Type: Grant

Filed: October 28, 2020

Date of Patent: April 22, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Rishabh Tyagi, Juan Felix Torres, Stefanie Brown
Audio control using auditory event detection

Patent number: 12283931

Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.

Type: Grant

Filed: May 23, 2024

Date of Patent: April 22, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Brett G. Crockett, Alan J. Seefeldt
GENERATING CHANNEL AND OBJECT-BASED AUDIO FROM CHANNEL-BASED AUDIO

Publication number: 20250126428

Abstract: A method of audio processing includes generating a detection score based on the partial loudnesses of a reference audio signal, extracted audio objects, extracted bed channels, a rendered audio signal and a channel-based audio signal. The detection score is indicative of an audio artifact in one or more of the audio objects and the bed channels. The extracted audio objects and extracted bed channels may be modified, in accordance with the detection score, to reduce the audio artifact.

Type: Application

Filed: October 14, 2022

Publication date: April 17, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Xu Li, Giulio Cengarle, Qingyuan Bin, Michael Getty Horgan
METHODS AND SYSTEMS FOR RENDERING OBJECT BASED AUDIO

Publication number: 20250124933

Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Type: Application

Filed: December 23, 2024

Publication date: April 17, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Sripal S. Mehta, Thomas Ziegler, Giles Baker, Jeffrey Riedmiller, Prinyar Saungsomboon
LOUDNESS ADJUSTMENT FOR DOWNMIXED AUDIO CONTENT

Publication number: 20250126427

Abstract: Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.

Type: Application

Filed: December 17, 2024

Publication date: April 17, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Michael C. Ward, Jeffrey Riedmiller, Scott Gregory Norcross, Alexander Stahlmann
Methods and systems for interactive rendering of object based audio

Patent number: 12277942

Abstract: Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive, perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects, and typically also a default set of objects which will be rendered in the absence of a selection by a user) and a bed of speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Type: Grant

Filed: July 3, 2023

Date of Patent: April 15, 2025

Assignees: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Robert Andrew France, Thomas Ziegler, Sripal S. Mehta, Andrew Jonathan Dowell, Prinyar Saungsomboon, Michael David Dwyer, Farhad Farahani, Nicolas R. Tsingos, Freddie Sanchez
Passive multi-wearable-devices tracking

Patent number: 12277724

Abstract: At a first time point, a first light capturing device at a first spatial location in a three-dimensional (3D) space captures first light rays from light sources located at designated spatial locations on a viewer device in the 3D space. At the first time point, a second light capturing device at a second spatial location in the 3D space captures second light rays from the light sources located at the designated spatial locations on the viewer device in the 3D space. Based on the first light rays captured by the first light capturing device and the second light rays captured by the second light capturing device, at least one of a spatial position and a spatial direction, at the first time point, of the viewer device is determined.

Type: Grant

Filed: May 9, 2023

Date of Patent: April 15, 2025

Assignee: Dolby Laboratories Licensing Corporation

Inventors: Ajit Ninan, Neil Mammen
Loudness adjustment for downmixed audio content

Patent number: 12279104

Abstract: Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.

Type: Grant

Filed: December 17, 2024

Date of Patent: April 15, 2025

Assignees: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Michael C. Ward, Jeffrey Riedmiller, Scott Gregory Norcross, Alexander Stahlmann
Methods and systems for rendering object based audio

Patent number: 12277943

Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.

Type: Grant

Filed: September 19, 2023

Date of Patent: April 15, 2025

Assignees: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Sripal S. Mehta, Thomas Ziegler, Giles Baker, Jeffrey Riedmiller, Prinyar Saungsomboon

1 2 3 4 5 … next