Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20250063201

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: November 7, 2024

Publication date: February 20, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
PRE-CONDITIONING AUDIO FOR MACHINE PERCEPTION

Publication number: 20250061914

Abstract: An apparatus and method of pre-conditioning audio for machine perception. Machine perception differs from human perception, and different processing parameters are used for machine perception applications (e.g., speech to text processing) as compared to those used for human perception applications (e.g., voice communications). These different parameters may result in pre-conditioned audio that is worsened for human perception yet improved for machine perception.

Type: Application

Filed: August 30, 2024

Publication date: February 20, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Hadis Nosrati, Glenn N. Dickins, Nicholas Luke Appleton
FILM GRAIN PARAMETERS ADAPTATION BASED ON VIEWING ENVIRONMENT

Publication number: 20250063203

Abstract: Methods, systems, and bitstream syntax are described for metadata signaling and film-grain parameter adaptation based on a viewing environment which may differ from a reference environment. Example adaptation models are provided for viewing parameters that include: ambient room illumination, viewing distance, and pixels per inch in a target display. Example systems include a single reference viewing environment model and a multi-reference viewing environment model supporting adaptation of film-grain model parameters via adaptation functions or interpolation.

Type: Application

Filed: December 19, 2022

Publication date: February 20, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Harshad KADU, Peng YIN
QUANTIZATION PARAMETER SIGNALING

Publication number: 20250063199

Abstract: A quantization parameter signaling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.

Type: Application

Filed: November 4, 2024

Publication date: February 20, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Fangjun Pu, Taoran Lu, Peng Yin, Sean Thomas McCarthy
SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20250063202

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: November 7, 2024

Publication date: February 20, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Publication number: 20250062736

Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

Type: Application

Filed: November 4, 2024

Publication date: February 20, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
ENCODING AND DECODING MULTIPLE-INTENT IMAGES AND VIDEO USING METADATA

Publication number: 20250054204

Abstract: Systems and methods for encoding and decoding multiple-intent images and video using metadata. When encoding an image as a multiple-intent image, at least one appearance adjustment may be made to the image. Metadata characterizing the at least one appearance adjustment may be included in, or transmitted along with, the encoded multiple-intent image. When decoding a multiple-intent image, a system may obtain a selection of a desired rendering intent and, based on that selection, either render the multiple-intent image with the applied appearance adjustments or may use the metadata to invert the appearance adjustments and recover the image pre-appearance adjustments.

Type: Application

Filed: September 27, 2022

Publication date: February 13, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Jaclyn Anne Pytlarz, Robert Wanat, Jake William Zuena
IMPROVING NOISE COMPENSATION IN MASK-BASED SPEECH ENHANCEMENT

Publication number: 20250054508

Abstract: Methods and apparatus for improving noise compensation in mask-based speech enhancement are described. A method of processing an audio signal, which includes one or more speech segments, includes obtaining a mask for mask-based speech enhancement of the audio signal and obtaining a magnitude of the audio signal. An estimate of residual noise is determined in the audio signal after mask-based speech enhancement, based on the mask and the magnitude of the audio signal. A modified mask is determined based on the estimate of the residual noise. Further described are corresponding programs and computer-readable storage media.

Type: Application

Filed: November 7, 2022

Publication date: February 13, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai SUN, Zhiwei SHUANG
HARMONIC TRANSPOSITION IN AN AUDIO CODING METHOD AND SYSTEM

Publication number: 20250054505

Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

Type: Application

Filed: October 3, 2024

Publication date: February 13, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventors: Per EKSTRAND, Lars VILLEMOES
CROSS PRODUCT ENHANCED HARMONIC TRANSPOSITION

Publication number: 20250054507

Abstract: The present invention relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR). A system and a method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank providing a plurality of analysis subband signals of the low frequency component of the signal. It also comprises a non-linear processing unit to generate a synthesis subband signal with a synthesis frequency by modifying the phase of a first and a second of the plurality of analysis subband signals and by combining the phase-modified analysis subband signals. Finally, it comprises a synthesis filter bank for generating the high frequency component of the signal from the synthesis subband signal.

Type: Application

Filed: October 31, 2024

Publication date: February 13, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventors: Lars VILLEMOES, Per HEDELIN
STEREO AUDIO ENCODER AND DECODER

Publication number: 20250054506

Abstract: The present disclosure provides methods, devices and computer program products for encoding and decoding a stereo audio signal based on an input signal. According to the disclosure, a hybrid approach of using both parametric stereo coding and a discrete representation of the stereo audio signal is used which may improve the quality of the encoded and decoded audio for certain bitrates.

Type: Application

Filed: August 19, 2024

Publication date: February 13, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventors: Heiko PURNHAGEN, Kristofer KJOERLING
IVAS SPAR FILTER BANK IN QMF DOMAIN

Publication number: 20250054503

Abstract: A method of processing a representation of a multichannel audio signal is provided. The representation includes a first channel and metadata relating to a second channel. The metadata includes, for each of a plurality of first bands of a first filter bank, a respective prediction parameter. The method includes: applying a second filterbank with a plurality of second bands to the first channel to obtain, for each second band, a banded version of the first channel; for each second band, generating a respective time-domain filter based on the prediction parameters and first filters corresponding to the first bands; and for each second band, generating a prediction for the second channel based on a filtered version of the first channel, the filtered version being obtained by applying the respective time-domain filter in that second band to the banded version of the first channel. Also provided are corresponding apparatus, programs, and computer-readable storage media.

Type: Application

Filed: December 20, 2022

Publication date: February 13, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventors: Harald MUNDT, Lars VILLEMOES
IMAGE FILE FORMAT FOR MULTIPLANE IMAGES

Publication number: 20250056043

Abstract: Methods, systems, and bitstream syntax are described for a file container that supports the storage and transmission of multi-plane images. Examples are provided for coding texture and opacity information using HEVC or VVC coding and the HEIF container. Examples of carrying coded MPI images according to V3C and an example HEIF-based player are also presented.

Type: Application

Filed: October 16, 2024

Publication date: February 13, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: SEJIN OH
DETERMINATION OF MOVEMENT DIRECTION

Publication number: 20250044100

Abstract: Methods, systems, and media for determining user movement direction are provided. In some embodiments, a method involves obtaining, using a control system, user acceleration data associated with a user. The method involves determining, using the control system, a movement period associated with a movement activity of the user using the user acceleration data, wherein the movement period indicates a duration between two sequential movements by the user. The method involves determining, using the control system, a movement direction corresponding to the movement activity using the user acceleration data based on a direction of acceleration orthogonal to the movement direction in which at least a portion of the user acceleration data is anti-periodic over a period of time corresponding to the movement period.

Type: Application

Filed: December 22, 2022

Publication date: February 6, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: David S. McGrath
SOURCE SEPARATION AND REMIXING IN SIGNAL PROCESSING

Publication number: 20250046328

Abstract: The present disclosure relates to a method and audio processing system (1) for performing source separation. The method comprises obtaining (S1) an audio signal (Sin) including a mixture of speech content and noise content, determining (S2a, S2b, S2c), from the audio signal, speech content (formula A), stationary noise content (formula C) and non-speech content (formula B). The stationary noise content (formula C) is a true subset of the non-speech content (formula B) and the method further comprises determining (S3), based on a difference between the stationary noise content (formula C) and the non-speech content (formula B) a non-stationary noise content formula D), obtaining (S5) a set of weighting factors and forming (S6) a processed audio signal based on a combination of the speech content (formula A), the stationary noise content (formula C), and the non-stationary noise content (formula D) weighted with their respective weighting factor.

Type: Application

Filed: October 26, 2022

Publication date: February 6, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jundai SUN, Zhiwei SHUANG, Yuanxing MA
METHOD FOR NEURAL NETWORK TRAINING WITH MULTIPLE SUPERVISORS

Publication number: 20250045585

Abstract: The present disclosure relates to a method for designing a processor (20) and a computer implemented neural network. The method comprises obtaining input data and corresponding ground truth target data and providing the input data to a processor (20) for outputting a first prediction of target data given the input data. The method further comprises providing the latent variables output by a processor module (21: 1, 21: 2, . . . 21: n?1) to a supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n?1) which outputs a second prediction of target data based on latent variables and determining a first and second loss measure by comparing the predictions of target data with the ground truth target data. The method further comprises training the processor (20) and the supervisor module (22: 1, 22: 2, 22: 3, . . . 22: n?1) based on the first and second loss measure and adjusting the processor by at least one of removing, replacing and adding a processor module.

Type: Application

Filed: December 8, 2022

Publication date: February 6, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai SUN, Lie LU, Zhiwei SHUANG, Yuanxing MA
BINAURAL RENDERING FOR HEADPHONES USING METADATA PROCESSING

Publication number: 20250045010

Abstract: Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.

Type: Application

Filed: August 7, 2024

Publication date: February 6, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Nicolas R. TSINGOS, Rhonda WILSON, Sunil BHARITKAR, C. Phillip BROWN, Alan J. SEEFELDT, Remi AUDFRAY
A METHOD OF PROCESSING AUDIO FOR PLAYBACK OF IMMERSIVE AUDIO

Publication number: 20250048051

Abstract: A method (1000) of generating a plurality of audio channels from audio containing height and non-height audio channels for playing back them with an immersive loudspeaker system of with at least one height loudspeaker (5) inside a listening environment, comprising: applying (1500) a virtual height filter (1300) to a portion of each height channel (1010) for, when playing back the height channel by one of the loudspeakers, attenuating spectral components of the height channel directly emanating from said loudspeaker (1;2;3;4) and for amplifying spectral components of the height channel reflected from a roof or an area close to the roof inside the listening environment, to generate a plurality of virtual height filtered audio signals which are added to the corresponding non-height audio channels for playback by corresponding loudspeakers; and playing back the remaining portions of each height audio channel with the at least one height loudspeaker (5).

Type: Application

Filed: December 19, 2022

Publication date: February 6, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: C. Phillip BROWN, Eytan RUBIN, Jacobo GIRALT, Michal Stanislaw RYBCZYNSKI, Thomas ZIEGLER
METHOD AND APPARATUS FOR PROCESSING OF AUDIO DATA

Publication number: 20250046318

Abstract: Described herein is a method of processing audio data for playback, the method including: receiving, by a decoder, a bitstream including encoded audio data and metadata, wherein the metadata includes one or more dynamic range control (DRC) sets, and for each DRC set, an indication of whether the DRC set is configured for providing a dynamic loudness compensation effect; parsing the metadata, by the decoder, to identify DRC sets that are configured for providing the dynamic range compensation effect; decoding, by the decoder, the encoded audio data to obtain decoded audio data; selecting, by the decoder, one of the identified DRC sets configured for providing the dynamic loudness compensation effect: extracting from the bitstream, by the decoder, one or more DRC gains corresponding to the selected DRC set; applying to the decoded audio data, by the decoder, the one or more DRC gains corresponding to the selected DRC set to obtain dynamic loudness compensated audio data; and outputting the dynamic loudness comp

Type: Application

Filed: August 24, 2022

Publication date: February 6, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventor: Christof FERSCH
EFFICIENT COMBINED HARMONIC TRANSPOSITION

Publication number: 20250037688

Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of ?f.

Type: Application

Filed: October 10, 2024

Publication date: January 30, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventors: Per EKSTRAND, Lars VILLEMOES, Per HEDELIN
CONTENT-AWARE AUDIO LEVEL MANAGEMENT

Publication number: 20250038726

Abstract: Described herein is a method of performing content-aware audio processing for an audio signal comprising a plurality of audio components of different types. The method includes source separating the audio signal into at least a voice-related audio component and a residual audio component. The method further includes determining a dynamic audio gain based on the voice-related audio component and the residual audio component. The method also includes performing audio level adjustment for the audio signal based on the determined audio gain. Further described are corresponding apparatus, programs, and computer-readable storage media.

Type: Application

Filed: November 3, 2022

Publication date: January 30, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Shaofan YANG, Kai LI, Qianqian FANG
SCREEN INTERACTION USING EOG COORDINATES

Publication number: 20250036195

Abstract: A method comprising acquiring a set of voltage signals from a set of electrodes arranged in proximity to the ears of a user, based on the set of voltage signals, determining an EOG gaze vector in ego-centric coordinates, determining a head pose of the user in display coordinates, using a sensor device worn by the user, combining the EOG gaze vector and head pose to obtain a gaze vector in display coordinates, and determining a gaze point by calculating an intersection of the gaze vector and an imaging surface having a known position in display coordinates.

Type: Application

Filed: July 21, 2022

Publication date: January 30, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Andrea Fanelli, Evan David Gitterman, Nathan Carl Swedlow, Alex Brandmeyer, McGregor Steele Joyner, Scott Daly, Poppy Anne Carrie Crum
CONTROL OF SPEECH PRESERVATION IN SPEECH ENHANCEMENT

Publication number: 20250037729

Abstract: A method for performing denoising on audio signals is provided. In some implementations, the method involves determining an aggressiveness control parameter value that modulates a degree of speech preservation to be applied. In some implementations, the method involves obtaining a training set of training samples, a training sample having a noisy audio signal and a target denoising mask. In some implementations, the method involves training a machine learning model, wherein the trained machine learning model is usable to take, as an input, a noisy test audio signal and generate a corresponding denoised test audio signal, and wherein the aggressiveness control parameter value is used for: 1) generating a frequency domain representation of the noisy audio signals included in the training set: 2) modifying the target denoising masks: 3) determining an architecture of the machine learning model: or determining a loss during training of the machine learning model.

Type: Application

Filed: November 8, 2022

Publication date: January 30, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jundai Sun, Lie Lu
HARMONIC TRANSPOSITION IN AN AUDIO CODING METHOD AND SYSTEM

Publication number: 20250029621

Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

Type: Application

Filed: October 3, 2024

Publication date: January 23, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventors: Per EKSTRAND, Lars VILLEMOES
SYSTEM AND TOOLS FOR ENHANCED 3D AUDIO AUTHORING AND RENDERING

Publication number: 20250024222

Abstract: Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.

Type: Application

Filed: July 22, 2024

Publication date: January 16, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Nicolas R. Tsingos, Charles Q. Robinson, Jurgen W. Scharpf
ACOUSTIC ZONING WITH DISTRIBUTED MICROPHONES

Publication number: 20250022465

Abstract: A method for estimating a user's location in an environment may involve receiving output signals from each microphone of a plurality of microphones in the environment. At least two microphones of the plurality of microphones may be included in separate devices at separate locations in the environment and the output signals may correspond to a current utterance of a user. The method may involve determining multiple current acoustic features from the output signals of each microphone and applying a classifier to the multiple current acoustic features. Applying the classifier may involve applying a model trained on previously-determined acoustic features derived from a plurality of previous utterances made by the user in a plurality of user zones in the environment. The method may involve determining, based at least in part on output from the classifier, an estimate of the user zone in which the user is currently located.

Type: Application

Filed: September 30, 2024

Publication date: January 16, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Mark R. P. THOMAS, Richard J. CARTWRIGHT
AUDIO ENCODING AND DECODING USING PRESENTATION TRANSFORM PARAMETERS

Publication number: 20250022475

Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.

Type: Application

Filed: October 1, 2024

Publication date: January 16, 2025

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Dirk Jeroen Breebart, David Matthew Cooper, Leif Jonas Samuelsson, Jeroen Koppens, Rhonda J. Wilson, Heiko Purnhagen, Alexander Stahlmann
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Publication number: 20250024086

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Application

Filed: September 27, 2024

Publication date: January 16, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
METHOD AND DEVICE FOR DESIGNING AND OVERSAMPLED LOW DELAY FILTER BANK

Publication number: 20250015785

Abstract: The present document describes a method (200) for determining N coefficients of an asymmetric prototype filter p0 for use in a low delay M-channel analysis and/or synthesis filter bank (101, 102) comprising M analysis filters hk (103) and/or M synthesis filters fk(106), k=0, . . . , M?1, wherein M is greater than 1, and wherein subband signals which are processed by the analysis and/or synthesis filter bank (101, 102) are decimated by a decimation factor S, with S<M. The method (200) comprises determining (201) a target transfer function of the analysis and/or synthesis filler bank (101, 102) comprising a target delay D; wherein D is smaller or equal to N. Furthermore.

Type: Application

Filed: November 29, 2022

Publication date: January 9, 2025

Applicant: DOLBY INTERNATIONAL AB

Inventor: Per EKSTRAND
AUDIO CONTENT GENERATION AND CLASSIFICATION

Publication number: 20250006208

Abstract: Some disclosed methods involve receiving audio data of at least a first audio data type and a second audio data type, including audio signals and associated spatial data indicating intended perceived spatial positions for the audio signals, determining at least a first feature type from the audio data and applying a positional encoding process to the audio data, to produce encoded audio data. The encoded audio data may include representations of at least the spatial data and the first feature type in first embedding vectors of an embedding dimension. Some methods may involve training a neural network, based on the encoded audio data, to transform audio data from an input audio data type having an input spatial data type to a transformed audio data type having a transformed spatial data type. Some methods may involve training a neural network to identify an input audio data type.

Type: Application

Filed: November 3, 2022

Publication date: January 2, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Brenton James POTTER, Hadis NOSRATI
ESTIMATION OF AUDIO DEVICE AND SOUND SOURCE LOCATIONS

Publication number: 20250008262

Abstract: Some disclosed methods involve receiving, by a control system, location control data from a sound source as the sound source emits sound in a plurality of sound source locations within an audio environment. Some such methods involve receiving, by the control system, direction of arrival data from each audio device of a plurality of audio devices in the audio environment. In some examples, each audio device of the plurality of audio devices includes a microphone array and the direction of arrival data corresponding to microphone signals from microphone arrays responsive to sound emitted by the sound source in the plurality of sound source locations. Some such methods involve estimating, by the control system, sound source locations and audio device locations based, at least in part, on the location control data and the direction of arrival data.

Type: Application

Filed: November 7, 2022

Publication date: January 2, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Avery BRUNI, Mark R. P. THOMAS
PROJECTION SYSTEM AND METHOD WITH BLENDED COLOR GAMUT

Publication number: 20250008080

Abstract: A projection system and method therefore related to a first projection device; a second projection device; at least one spatial modulator; and an electronic processor configured to: receive a two-dimensional video data, generate a first plurality of intensity values of a first color gamut and a second plurality of intensity values of a second color gamut, subtract a luminance threshold from a plurality of pixel values of the second color gamut to yield a plurality of positive pixel values and a plurality of negative pixel values, set respective ones of the plurality of pixel values to predetermined values to obtain a residual of the second color gamut, add the residual to the first color gamut, add maximized pixel values to the first color gamut and the second color gamut to create an output gamut, and blend the output gamut with a native color gamut to generate a blended gamut.

Type: Application

Filed: August 26, 2024

Publication date: January 2, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Trevor DAVIES, Clement Luc Carol LE BARBENCHON, Elizabeth DoVale, Martin J. RICHARDS
SYSTEMS AND METHODS TO GENERATE COPIES OF DATA FOR TRANSMISSION OVER MULTIPLE COMMUNICATION CHANNELS

Publication number: 20250007645

Abstract: Systems and methods to transmit data over multiple communication channels in parallel with forward error correction. Original packets are evenly distributed to the channels as the initial systematically channel-encoded packets. Subsequent channel-encoded packets are configured to be linearly independent of their base sets of channel-encoded packets, where a base set for a subsequent channel-encoded packet includes those scheduled to be transmitted before the subsequent packet in the same channel as the subsequent packet, and optionally one or more initial packets from other channels. The compositions of the sequences of the encoded packets can be predetermined without the content of the packets; and the channel-encoded packets can be generated from the original packets on-the-fly by the transmitters of the channels during transmission. When a sufficient number of packets have been received via the channels, a recipient may terminate their transmissions.

Type: Application

Filed: September 12, 2024

Publication date: January 2, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Mingchao YU, Mark Craig REED
LEARNABLE HEURISTICS TO OPTIMIZE A MULTI-HYPOTHESIS FILTERING SYSTEM

Publication number: 20250006170

Abstract: Some disclosed methods involve receiving microphone signals from a microphone system, including signals corresponding to one or more sounds detected by the microphone system. Some methods may involve determining, via a trained neural network, a filtering scheme for the microphone signals, the filtering scheme including one or more filtering processes. The trained neural network may be configured to implement one or more subband-domain adaptive filter management modules. Some methods may involve applying the filtering scheme to the microphone signals, to produce enhanced microphone signals.

Type: Application

Filed: November 1, 2022

Publication date: January 2, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Hadis NOSRATI, Benjamin John SOUTHWELL
AUDIO ENHANCEMENT FOR MOBILE CAPTURE

Publication number: 20250008284

Abstract: A system for real-time monitoring of user-generated audio content for audio anomaly and a related method are disclosed. In some embodiments, the system is programmed to receive, in real time, audio data generated by a first mobile device, such as a smartphone. The system is programed to determine, in real time, whether an audio anomaly has occurred from the audio data. The system is programmed to cause, in real time, a presentation of an alert to a second mobile device, which could be the same smartphone, in response to detecting an occurrence of audio anomaly.

Type: Application

Filed: September 7, 2022

Publication date: January 2, 2025

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Kai Li, Hao Luo, Lei Gan, Xu Li, Weiwei Wen, Yuanxing Ma
LIGHT PROJECTION SYSTEM USING WHITE LIGHT ILLUMINATION

Publication number: 20250004357

Abstract: Light projection systems using white light illumination. One embodiment provides a projection system using white light illumination. The projection system includes an illumination assembly configured to receive a white light input. A prism is configured to separate the white light input into color light inputs, redirect the color light inputs to respective modulators, and combine modulated color light inputs from the respective modulators into a white light output. An optical filter is configured to spatially Fourier transform the white light output to generate a filtered white light output. A projection lens assembly is configured to project the filtered white light output.

Type: Application

Filed: October 14, 2022

Publication date: January 2, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: John David Jackson, Nathan Shawn Wainwright, Frank Joseph Poradish, Darren Hennigan, Duane Scott Dewald, Juan Pablo Pertierra, Martin J. Richards, Barret Lippey, Jon Scott Miller, Trevor Davies, Peter Francis Van Kessel, Douglas Reid Boyd Campbell
DATA STRUCTURE FOR MULTIMEDIA APPLICATONS

Publication number: 20250005068

Abstract: Embodiments described herein provide a unified container format for delivering different multimedia applications. One embodiment provides a data structure utilized for implementing a plurality of multimedia applications. The data structure includes a first metadata level including low-level metadata used to perform operations associated with media data in a bitstream. The data structure includes a second metadata level including mid-level metadata used to apply operation metadata to render the media data. The data structure includes a third metadata level including upper-level metadata used to utilize the low-level metadata and the mid-level metadata to deliver the plurality of multimedia applications.

Type: Application

Filed: December 19, 2022

Publication date: January 2, 2025

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Tao CHEN, Sheng QU, Samir N. HULYALKAR
METHODS, APPARATUS AND SYSTEMS FOR CONTROLLING DOPPLER EFFECT MODELLING

Publication number: 20240430639

Abstract: Described is a method of modelling a Doppler effect when rendering audio content for a 6 degrees of freedom (6DoF) environment on a user side. In particular, the method may comprise obtaining first parameter values of one or more first parameters indicative of an allowable range of pitch factor modification values. The method may further comprise obtaining a second parameter value of a second parameter indicative of a desired strength of the to-be-modelled Doppler effect. The method may yet further comprise determining a pitch factor modification value based on a relative velocity between a listener and an audio source in the audio content, and the first and second parameter values, using a predefined pitch factor modification function. Particularly, the predefined pitch factor modification function may have the first and second parameters and may be a function for mapping relative velocities to pitch factor modification values.

Type: Application

Filed: October 27, 2022

Publication date: December 26, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Rodrigo Andres Gutierrez, Leon Terentiv, Panji Setiawan, Daniel Fischer, Christof Fersch
LUMINANCE BASED CODING TOOLS FOR VIDEO COMPRESSION

Publication number: 20240430455

Abstract: Sample data and metadata related to spatial regions in images may be received from a coded video signal. It is determined whether specific spatial regions in the images correspond to a specific region of luminance levels. In response to determining the specific spatial regions correspond to the specific region of luminance levels, signal processing and video compression operations are performed on sets of samples in the specific spatial regions. The signal processing and video compression operations are at least partially dependent on the specific region of luminance levels.

Type: Application

Filed: June 26, 2023

Publication date: December 26, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Peng YIN, Guan-Ming SU, Taoran LU, Tao CHEN, Walter J. HUSAK
METHODS AND SYSTEMS FOR DESIGNING AND APPLYING NUMERICALLY OPTIMIZED BINAURAL ROOM IMPULSE RESPONSES

Publication number: 20240430637

Abstract: Methods and systems for designing binaural room impulse responses (BRIRs) for use in headphone virtualizers, and methods and systems for generating a binaural signal in response to a set of channels of a multi-channel audio signal, including by applying a BRIR to each channel of the set, thereby generating filtered signals, and combining the filtered signals to generate the binaural signal, where each BRIR has been designed in accordance with an embodiment of the design method. Other aspects are audio processing units configured to perform any embodiment of the inventive method. In accordance with some embodiments, BRIR design is formulated as a numerical optimization problem based on a simulation model (which generates candidate BRIRs) and at least one objective function (which evaluates each candidate BRIR), and includes identification of a best one of the candidate BRIRs as indicated by performance metrics determined for the candidate BRIRs by each objective function.

Type: Application

Filed: June 28, 2024

Publication date: December 26, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Grant A. DAVIDSON, Kuan-Chieh YEN, Dirk Jeroen BREEBAART
METHOD AND DEVICE FOR ENCODING AND DECODING IMAGE USING MOTION VECTOR RESOLUTION SCALING

Publication number: 20240430475

Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.

Type: Application

Filed: September 10, 2024

Publication date: December 26, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jong Ki HAN, Jae Yung LEE
FACE REGION DETECTION AND LOCAL RESHAPING ENHANCEMENT

Publication number: 20240428612

Abstract: Methods and corresponding systems to process face regions are disclosed. The described methods include providing face bounding boxes and confidence levels for the faces, generating a histogram of the pixels and the faces, generating a probability of face, and generating a face probability map. A face contrast adjustment and a face saturation adjustment can be applied to the face probability map.

Type: Application

Filed: July 25, 2022

Publication date: December 26, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tsung-Wei Huang, Guan-Ming Su
TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

Publication number: 20240428815

Abstract: This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

Type: Application

Filed: August 30, 2024

Publication date: December 26, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Jeroen KOPPENS, Scott Gregory NORCROSS
RESHAPER FOR LEARNING BASED IMAGE/VIDEO CODING

Publication number: 20240422345

Abstract: An input image represented in an input domain is received from an input video signal. Forward reshaping is performed on the input image to generate a forward reshaped image represented in a reshaped image domain. Non-reshaping encoding operations are performed to encode the reshaped image into an encoded video signal. At least one of the non-reshaping encoding operations is implemented with an ML model that has been previously trained with training images in one or more training datasets in a preceding training stage. A recipient device of the encoded video signal is caused to generate a reconstructed image from the forward reshaped image.

Type: Application

Filed: August 5, 2022

Publication date: December 19, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Peng YIN, Fangjun PU, Taoran LU, Arjun ARORA, Guan-Ming SU, Tao CHEN, Sean Thomas MCCARTHY, Walter J. HUSAK
RENDERING BASED ON LOUDSPEAKER ORIENTATION

Publication number: 20240422503

Abstract: An audio processing method may involve receiving audio signals and associated spatial data, listener position data, loudspeaker position data and loudspeaker orientation data, and rendering the audio data for reproduction, based, at least in part, on the spatial data, the listener position data, the loudspeaker position data and the loudspeaker orientation data, to produce rendered audio signals. The rendering may involve applying a loudspeaker orientation factor that tends to reduce a relative activation of a loudspeaker based, at least in part, on an increased loudspeaker orientation angle. In some examples, the rendering may involve modifying an effect of the loudspeaker orientation factor based, at least in part, on a loudspeaker importance metric. The loudspeaker importance metric may correspond to a loudspeaker's importance for rendering an audio signal at the audio signal's intended perceived spatial position.

Type: Application

Filed: November 7, 2022

Publication date: December 19, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Kimberly Jean KAWCZINSKI, Alan Jeffrey SEEFELDT, Timothy Alan PORT
TRANSMISSION-AGNOSTIC PRESENTATION-BASED PROGRAM LOUDNESS

Publication number: 20240420717

Abstract: This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.

Type: Application

Filed: August 30, 2024

Publication date: December 19, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Jeroen KOPPENS, Scott Gregory NORCROSS
APPLYING MINIMUM AND AVERAGE DISTANCE CONSTRAINT IN VIDEO STREAMING

Publication number: 20240406461

Abstract: Input images are received as input to a multi-node system. The input images are divided into segments assigned to respective nodes of the multi-node system. Primary and secondary scenes are identified in the segments to ensure compliance with minimum and average distance constraints. Scene-level forward reshaping mappings are generated for the scenes by a respective node for an assigned segment. Forward reshaped images in the segment are generated by the node using the forward reshaping mappings and encoded into an output video signal, which enables a recipient device to generate reconstructed images and to render display images derived from the reconstructed images on an image display.

Type: Application

Filed: August 14, 2022

Publication date: December 5, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Harshad KADU, Guan-Ming SU
BINAURAL DIALOGUE ENHANCEMENT

Publication number: 20240406650

Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.

Type: Application

Filed: March 15, 2024

Publication date: December 5, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Leif Jonas Samuelsson, Dirk Jeroen Breebaart, David Matthew Cooper, Jeroen Koppens
VIDEO DECODER WITH REDUCED DYNAMIC RANGE TRANSFORM WITH INVERSE TRANSFORM SHIFTING MEMORY

Publication number: 20240406399

Abstract: A method for decoding video includes receiving quantized coefficients representative of a block of video representative of a plurality of pixels. The quantized coefficients are dequantized based upon a function of a remainder. The dequantized coefficients are inverse transformed to determine a decoded residue.

Type: Application

Filed: April 29, 2024

Publication date: December 5, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Louis J. Kerofsky, Kiran Misra, Christopher A. Segall
METHOD FOR DATA RATE AND BUFFER ESTIMATION FOR MULTI-SOURCE DELIVERY

Publication number: 20240406243

Abstract: The present disclosure relates to a method and variable quality playback system for selecting a quality of media content. The method comprising receiving (S4001) media content of a data segment (1010) over at least one network path (1031a, 1031b, 1031c), the media content being encoded with network or application-layer code and storing (S4002) the media content in a network or application-layer decoder (1050). The network or application-layer decoder (1050) is configured to decode the media content and provide decoded media content to a buffer (5061) associated with a media renderer (1060). The method further comprises obtaining a decoding metric of the network or application-layer decoder (1050), the decoding metric indicating a property of the decoding process and selecting the quality of the media content of subsequent data segments based on the decoding metric.

Type: Application

Filed: September 21, 2022

Publication date: December 5, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jason Michael CLOUD, Elliot OSBORNE

prev 1 2 3 4 5 6 … next