Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

AUDIO SPEAKER COORDINATION SYSTEM

Publication number: 20240406629

Abstract: Novel methods and systems for coordinating sound to both internal and external speakers for a device is disclosed. The audio signal is distributed among the internal and external speakers and aligned so that signals going to the internal speakers are aligned with signals going to the external speakers.

Type: Application

Filed: September 29, 2022

Publication date: December 5, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Pengfei ZHOU, Pengfeng ZHANG, Mingyi Huang
CONTEXT-DEPENDENT COLOR-MAPPING OF IMAGE AND VIDEO DATA

Publication number: 20240404030

Abstract: Systems and methods for performing color mapping operations. One system includes a processor to perform post-production editing of image data. The processor is configured to identify a first region of an image and identify a second region of the image. The first region includes a first white point having a first tone, and the second region includes a second white point having a second tone. The processor is further configured to determine a color mapping function based on the first tone, apply the color mapping function to the second region of the image, and generate an output image.

Type: Application

Filed: September 28, 2022

Publication date: December 5, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Timo KUNKEL
METHOD AND DEVICE FOR ENCODING AND DECODING INTRA-FRAME PREDICTION

Publication number: 20240397039

Abstract: A method and a device for encoding and decoding intra prediction are disclosed. An image decoding method for performing intra prediction comprises the steps of: receiving a bitstream including data on prediction modes of a current block and a block adjacent to the current block; extracting the data from the received bitstream so as to confirm the prediction mode of the adjacent block; determining whether a boundary pixel within the adjacent block can be used as a reference pixel for the current block in consideration of the prediction mode of the adjacent block; obtaining the reference pixel of the current block according to the determined result; generating a prediction block predicted in the frame on the basis of the obtained reference pixel; and decoding the current block by using the generated prediction block.

Type: Application

Filed: August 8, 2024

Publication date: November 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Je Chang JEONG, Ki Baek KIM, Ung HWANG
EFFICIENT DRC PROFILE TRANSMISSION

Publication number: 20240395270

Abstract: A method for decoding an encoded audio signal is described. The encoded audio signal may comprise a sequence of frames and may be indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles may be comprised within different frames. The method may comprise determining a first rendering mode from the plurality of different rendering modes; determining one or more DRC profiles from a subset of DRC profiles comprised within a current frame; determining whether at least one of the DRC profiles is applicable to the first rendering mode; selecting a default DRC profile as a current DRC profile, if none of the DRC profiles is applicable to the first rendering mode; wherein definition data of the default DRC profile is known at a decoder; and decoding the current frame using the current DRC profile.

Type: Application

Filed: July 23, 2024

Publication date: November 28, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Holger HOERICH, Jeroen KOPPENS
UNIVERSAL SPEECH ENHANCEMENT USING GENERATIVE NEURAL NETWORKS

Publication number: 20240395278

Abstract: The disclosure relates to a neural network based system for speech enhancement, comprising a generative network for generating an enhanced audio signal and a conditioning network for generating conditioning information for the generative network. The conditioning network comprises a plurality of layers and is configured to receive an audio signal as input: propagate the audio signal through the plurality of layers; and provide one or more first internal representations of the audio signal or processed versions thereof as the conditioning information, wherein the one or more first internal representations of the audio signal are extracted at respective layers of the conditioning network. The generative network is configured to receive a noise vector and the conditioning information as input; and generate the enhanced audio signal based on the noise vector and the conditioning information. The disclosure further relates to a method of training the system.

Type: Application

Filed: September 29, 2022

Publication date: November 28, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Joan SERRA, Santiago PASCUAL, Jordi PONS PUIG
SELFIE VOLUMETRIC VIDEO

Publication number: 20240394987

Abstract: Camera tracking data with respect to a camera operating in a 3D physical space is received. An image portion depicting one or more visual objects not physically present in the 3D physical space is generated using a camera perspective derived from the camera tracking data. The one or more visual objects is caused to be visually combined with the camera perspective into a personal image taken by the camera.

Type: Application

Filed: September 23, 2022

Publication date: November 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Ajit NINAN
DYNAMIC SPATIAL METADATA FOR IMAGE AND VIDEO PROCESSING

Publication number: 20240397088

Abstract: Methods and systems for generating and using dynamic spatial metadata in image and video processing are described. In an encoder, in addition to global metadata, local, spatial metadata for two or more image regions or image objects are generated, smoothed, and embedded as spatial metadata values. In a decoder, the decoder can reconstruct the spatial metadata and use interpolation techniques to generate metadata for specific regions of interest. Examples of generating spatial metadata related to min, mid, and max luminance values in an image are provided.

Type: Application

Filed: September 20, 2022

Publication date: November 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Robin ATKINS, Per Jonas Andreas KLITTMARK
METHODS FOR PARAMETRIC MULTI-CHANNEL ENCODING

Publication number: 20240395264

Abstract: The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal.

Type: Application

Filed: August 5, 2024

Publication date: November 28, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Tobias FRIEDRICH, Alexander MUELLER, Karsten LINZMEIER, Claus-Christian SPENGER, Tobias R. WAGENBLASS
SYSTEM AND METHOD FOR ENHANCEMENT OF A DEGRADED AUDIO SIGNAL

Publication number: 20240395267

Abstract: The present disclosure relates to the field of audio enhancement, and in particular to methods, devices and software for supervised training of a machine learning model, MLM, the MLM trained to enhance a degraded audio signal by calculating gains to be applied to frequency bands of the degraded audio signal. The present disclosure further relates to methods, devices and software for use of such a trained MLM.

Type: Application

Filed: May 24, 2024

Publication date: November 28, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jia Dai, Kai Li, Richard J. Cartwright
SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20240388738

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: July 29, 2024

Publication date: November 21, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
METHODS AND DEVICES FOR CONTROLLING AUDIO PARAMETERS

Publication number: 20240378014

Abstract: A method of controlling headphones having external microphone signal pass-through functionality may involve controlling a display to present a geometric shape on the display and receiving an indication of digit motion from a sensor system associated with the display. The sensor system may include a touch sensor system or a gesture sensor system. The indication may be an indication of a direction of digit motion relative to the display. The method may involve controlling the display to present a sequence of images indicating that the geometric shape either enlarges or contracts, depending on the direction of digit motion and changing a headphone transparency setting according to a current size of the geometric shape. The headphone transparency setting may correspond to an external microphone signal gain setting and/or a media signal gain setting of the headphones.

Type: Application

Filed: July 22, 2024

Publication date: November 14, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Lucas E. SAULE, Eugene CHEN, Julien Guy Pierre Derreveaux, Jakub Siwak, Daniel Christian Brinkley
PERVASIVE ACOUSTIC MAPPING

Publication number: 20240381046

Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first calibration signals, generating first modified audio playback signals by inserting the first calibration signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth calibration signals) played back by second through Nth audio devices, extracting second through Nth calibration signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth calibration signals.

Type: Application

Filed: December 2, 2021

Publication date: November 14, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Mark R.P. THOMAS, Benjamin John SOUTHWELL, Avery BRUNI, Olha Michelle TOWNSEND, Daniel ARTEAGA, Davide SCAINI, Christopher Graham HINES, Alan J. SEEFELDT, David GUNAWAN, C. Phillip Brown
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Publication number: 20240380928

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Application

Filed: July 22, 2024

Publication date: November 14, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
SYSTEM AND METHOD OF DIGITAL WATERMARKING

Publication number: 20240370964

Abstract: A digital watermarking method including receiving, by an electronic processor, an original image signal containing a series of original visual images, where the original image signal encoded uses a perceptual quantizer (PQ) luminance level encoding transfer function resulting in PQ luminance steps within the original image signal, and where the PQ luminance steps have varying sizes across a luminance range. The method further includes receiving, by the electronic processor, a watermark image signal including a watermark, and adjusting the strength of the watermark by at least a first weighting factor that is a predetermined first number of PQ luminance steps.

Type: Application

Filed: July 25, 2022

Publication date: November 7, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Jerome D. Shields
A METHOD OF PROCESSING AUDIO FOR PLAYBACK OF IMMERSIVE AUDIO

Publication number: 20240373184

Abstract: A method (200) of processing audio in an immersive audio format comprising at least one height audio channel, comprising: obtaining (250) two height audio signals from at least a portion of the at least one height audio channel: modifying (270) a relative phase between the two height audio signals in frequency bands in which phase differences are predominantly out of phase to obtain two phase modified height audio signals: and playing back (290) the processed audio comprising the two phase modified height audio signals with at least two audio loudspeakers. The phase differences occur as a result of having monaural signals emanated from two audio loudspeakers at one or more listening positions symmetrically off-center with respect to the at least two loudspeakers. laterally spaced with respect to each of said one or more listening positions. The method allows perception of sound height/elevation without using overhead speakers.

Type: Application

Filed: July 21, 2022

Publication date: November 7, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: C. Phillip Brown, Michael J. Smithers
METHOD AND APPARATUS FOR DECODING STEREO LOUDSPEAKER SIGNALS FROM A HIGHER-ORDER AMBISONICS AUDIO SIGNAL

Publication number: 20240373186

Abstract: Decoding of Ambisonics representations for a stereo loudspeaker setup is known for first-order Ambisonics audio signals. But such first-order Ambisonics approaches have either high negative side lobes or poor localisation in the frontal region. The invention deals with the processing for stereo decoders for higher-order Ambisonics HOA.

Type: Application

Filed: May 10, 2024

Publication date: November 7, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Johannes Boehm, Florian Keiler
SYSTEM FOR NESTED ENTROPY ENCODING

Publication number: 20240373056

Abstract: Methods and systems for improving coding efficiency of video.

Type: Application

Filed: July 18, 2024

Publication date: November 7, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Yeping Su, Christopher A. Segall
METHOD AND SYSTEM FOR PICTURE SEGMENTATION USING COLUMNS

Publication number: 20240364894

Abstract: Described is picture segmentation through columns and slices in video encoding and decoding. A video picture is divided into a plurality of columns, each column covering only a part of the video picture in a horizontal dimension. All coded tree blocks (“CTBs”) belonging to a slice may belong to one or more columns. The columns may be used to break the same or different prediction or in-loop filtering mechanisms of the video coding, and the CTB scan order used for encoding and/or decoding may be local to a column. Column widths may be indicated in a parameter set and/or may be adjusted at the slice level. At the decoder, column width may be parsed from the bitstream, and slice decoding may occur in one or more columns.

Type: Application

Filed: March 13, 2024

Publication date: October 31, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Michael Horowitz
SPEECH ENHANCEMENT

Publication number: 20240363131

Abstract: A method for dereverberating audio signals is provided. In some implementations, the method involves obtaining a real acoustic impulse response (AIR); identifying a first portion of the real AIR corresponding to early reflections of a direct sound and a second portion of the real AIR that corresponding to late reflections of the direct sound; generating one or more synthesized AIRs by modifying the first portion of the real AIR and/or the second portion of the real AIR; and using the real AIR and the one or more synthesized AIRs to generate a plurality of training samples, each training sample comprising an input audio signal and a reverberated audio signal, wherein the reverberated audio signal is generated based on the input audio signal and one of the real AIR or one of the one or more synthesized AIRs, which plurality of training samples are used to train a machine learning model.

Type: Application

Filed: July 12, 2022

Publication date: October 31, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jia DAI, Kai LI, Xiaoyu LIU, Richard J. CARTWRIGHT, Shaofan YANG
DETECTING ENVIRONMENTAL NOISE IN USER-GENERATED CONTENT

Publication number: 20240355348

Abstract: A method of audio processing includes classifying an audio signal as noise or as non-noise using a first model. For a noise signal. the audio signal is classified as user-generated content (UGC) noise or as professionally-generated content (PGC) noise using a second model. For a non-noise signal or PGC noise. the audio signal is processed using a first audio processing process. For UGC noise. the audio signal is processed using a second audio processing process.

Type: Application

Filed: August 23, 2022

Publication date: October 24, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Ziyu Yang, Zhiwei Shuang, Lie Lu
MULTI-LEVEL LATENT FUSION IN NEURAL NETWORKS FOR IMAGE AND VIDEO CODING

Publication number: 20240357112

Abstract: Methods, systems, and bitstream syntax are described for the fusion of latent features in multi-level, end-to-end, neural networks used in image and video compression. The fused architecture may be static or dynamic based on image characteristics (e.g., natural images versus screen content images) or other coding parameters, such as bitrate constrains or rate-distortion optimization. A variety of multi-level fusion architectures are discussed.

Type: Application

Filed: August 3, 2022

Publication date: October 24, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Arunkumar MOHANANCHETTIAR, Jay Nitin SHINGALA, Pankaj SHARMA, Nijil KOLLERI, Peng YIN, Arjun ARORA, Fangjun PU, Taoran LU, Sean Thomas MCCARTHY, Walter J. HUSAK
SOURCE COLOR VOLUME INFORMATION MESSAGING

Publication number: 20240357183

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: June 28, 2024

Publication date: October 24, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
METHOD AND APPARATUS FOR METADATA-BASED DYNAMIC PROCESSING OF AUDIO DATA

Publication number: 20240355338

Abstract: Described herein is a method of metadata-based dynamic processing of audio data for playback, the method including: receiving, by a decoder, a bitstream including audio data and metadata for dynamic loudness adjustment; decoding, by the decoder, the audio data and the metadata to obtain decoded audio data and the metadata; determining, by the decoder, from the metadata, one or more processing parameters for dynamic loudness adjustment based on a playback condition; applying the determined one or more processing parameters to the decoded audio data to obtain processed audio data; and outputting the processed audio data for playback. Described is further a method of encoding audio data and metadata for dynamic loudness adjustment into a bitstream. Moreover, described are a respective decoder and encoder, a respective system and computer program products.

Type: Application

Filed: August 24, 2022

Publication date: October 24, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Christof FERSCH, Scott Gregory NORCROSS
TRANSMISSION OF VOLUMETRIC IMAGES IN MULTIPLANE IMAGING FORMAT

Publication number: 20240357181

Abstract: Methods and apparatus for transmission of volumetric images in the MPI format. According to an example embodiment, texture and alpha layers of multiplane images are packed, as tiles, into a sequence of video frames. The sequence of video frames is then compressed to generate a video bitstream, which is transmitted together with a metadata bitstream specifying at least the parameters of the packing arrangement for the tiles in the sequence of video frames. Example packing arrangements include various selectable spatial and temporal arrangements for texture layers, alpha layers, and camera views. In some examples, the metadata bitstream is implemented using a SEI message and includes parameters selected from the group consisting of a size of the reference view, the number of layers in the multiplane image, the number of simultaneous views, one or more characteristics of the packing arrangement, layer merging information, dynamic range adjustment information, and reference view information.

Type: Application

Filed: May 22, 2024

Publication date: October 24, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Taoran Lu, Peng YIN, Guan-Ming Su, Dae Yeol Lee, Sean Thomas McCarthy, Tsung-Wei Huang, Sejin Oh
DECODING AUDIO BITSTREAMS WITH ENHANCED SPECTRAL BAND REPLICATION METADATA IN AT LEAST ONE FILL ELEMENT

Publication number: 20240355345

Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.

Type: Application

Filed: April 11, 2024

Publication date: October 24, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
NEURAL NETWORKS FOR PRECISION RENDERING IN DISPLAY MANAGEMENT

Publication number: 20240354914

Abstract: Methods and systems for precision rendering in display mapping using neural networks are described. Given an intensity input image, a sequence of neural networks comprising a pyramid-halving sub-network, a pyramid down-sampling sub-network, a pyramid-up-sampling sub-network, and a final-layer generation sub-network generate a base layer image and a detail layer image to be used in display mapping.

Type: Application

Filed: August 23, 2022

Publication date: October 24, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Anustup Kumar Atanu CHOUDHURY, Robin ATKINS
METHODS, APPARATUS AND SYSTEMS FOR GENERATION, TRANSPORTATION AND PROCESSING OF IMMEDIATE PLAYOUT FRAMES (IPFS)

Publication number: 20240347068

Abstract: Described herein is an audio decoder for decoding a bitstream of encoded audio data, wherein the bitstream of encoded audio data represents a sequence of audio sample values and comprises a plurality of frames, wherein each frame comprises associated encoded audio sample values, the audio decoder comprising: a determiner configured to determine whether a frame of the bitstream of encoded audio data is an immediate playout frame comprising encoded audio sample values associated with a current frame and additional information; and an initializer configured to initialize the decoder if the determiner determines that the frame is an immediate playout frame, wherein initializing the decoder comprises decoding the encoded audio sample values comprised by the additional information before decoding the encoded audio sample values associated with the current frame.

Type: Application

Filed: March 18, 2024

Publication date: October 17, 2024

Applicant: Dolby International AB

Inventors: Christof Joseph FERSCH, Daniel FISCHER
METHODS AND DEVICES FOR GENERATING OR DECODING A BITSTREAM COMPRISING IMMERSIVE AUDIO SIGNALS

Publication number: 20240347069

Abstract: The present document describes a method for generating a bitstream, wherein the bitstream comprises a sequence of superframes for a sequence of frames of an immersive audio signal. The method comprises, repeatedly for the sequence of superframes, inserting coded audio data for one or more frames of one or more downmix channel signals derived from the immersive audio signal, into data fields of a superframe; and inserting metadata for reconstructing one or more frames of the immersive audio signal from the coded audio data, into a metadata field of the superframe.

Type: Application

Filed: June 21, 2024

Publication date: October 17, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Stefan BRUHN, Juan Felix TORRES
SYSTEM AND METHOD FOR ADAPTIVE AUDIO SIGNAL GENERATION, CODING AND RENDERING

Publication number: 20240349010

Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent.

Type: Application

Filed: April 12, 2024

Publication date: October 17, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Charles Q. ROBINSON, Nicolas R. TSINGOS, Christophe CHABANNE
MULTISOURCE MEDIA DELIVERY SYSTEMS AND METHODS

Publication number: 20240348891

Abstract: A method for delivering media content to one or more clients over a distributed system is disclosed. The method may include generating a plurality of network-coded symbols from a plurality of original symbols representing a first media asset. The method may further include generating an original plurality of coded variants of the first media asset. The method may further include distributing a first coded variant of the original plurality of coded variants to a first cache on a first server device for storage in the first cache. The method may further include distributing a second coded variant of the original plurality of coded variants to a second cache on a second server device for storage in the second cache.

Type: Application

Filed: June 21, 2024

Publication date: October 17, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jeffrey Riedmiller, Mingchao Yu, Jason Michael Cloud
DIRECTED INTERPOLATION AND DATA POST-PROCESSING

Publication number: 20240340465

Abstract: An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.

Type: Application

Filed: June 21, 2024

Publication date: October 10, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Alexandros TOURAPIS, Athanasios LEONTARIS, Peshala V. PAHALAWATTA, Kevin J. STEC
METHODS AND DEVICES FOR CONTROLLING ACCESS TO A SOFTWARE ASSET

Publication number: 20240338426

Abstract: The present document describes a method (900) for controlling access to a software asset of a software program, which is executed on an electronic device (110). The method (900) comprises, on one or more software provider servers (120), receiving (902) a request for authentication from a service provider server (210); upon successful authentication, providing (904) an authentication token to the service provider server (210); receiving (906) a request for a feature access token from the service provider server (210); and in reaction to the request, providing (908) the feature access token to the service provider server (210).

Type: Application

Filed: June 27, 2022

Publication date: October 10, 2024

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Aleksandar BORKOVAC, Malte SCHMIDT, Paul F. GRIEPENTROG
METHOD AND SYSTEM FOR PROVIDING MEDIA CONTENT TO A CLIENT

Publication number: 20240340356

Abstract: A method for providing media content within a media distribution network. The method comprises transforming source media content into an interim format, thereby providing transformed content. Furthermore, the method comprises storing the transformed content on at least one core storage unit. In addition, the method comprises receiving a request for the source media content from a client. The method further comprises encoding the transformed content or intermediate coded content derived therefrom into encoded content suitable for transmission over a core network and/or an edge network, as well as sending the encoded content via the core network and/or the edge network to the client.

Type: Application

Filed: February 29, 2024

Publication date: October 10, 2024

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Jason Michael CLOUD, Jeffrey RIEDMILLER, Kristofer KJOERLING, Janusz KLEJSA
TRANSFORMING AUDIO SIGNALS CAPTURED IN DIFFERENT FORMATS INTO A REDUCED NUMBER OF FORMATS FOR SIMPLIFYING ENCODING AND DECODING OPERATIONS

Publication number: 20240331708

Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.

Type: Application

Filed: May 8, 2024

Publication date: October 3, 2024

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Stefan BRUHN, Michael ECKERT, Juan Felix TORRES, Stefanie BROWN, David S. MCGRATH
HEADPHONE RENDERING METADATA-PRESERVING SPATIAL CODING

Publication number: 20240334146

Abstract: Systems and methods for preserving headphone rendering mode (HRM) in object clustering are described. In an embodiment, an object-based audio data processing system includes a processor configured to receive a plurality of audio objects, wherein an audio object of the plurality of audio objects is associated with respective object metadata that indicates respective spatial position information and an HRM; determine a plurality of cluster positions by applying an extended hybrid distance metric to a spatial coding algorithm to calculate a partial loudness for each of the audio objects; render the audio objects to the cluster positions to form a plurality of clusters by applying the extended hybrid distance metric to the spatial coding algorithm to calculate object-to-cluster gains; and transmit the clusters to a spatial reproduction system.

Type: Application

Filed: September 8, 2022

Publication date: October 3, 2024

Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB

Inventors: Ziyu Yang, Lie Lu, Heiko Purnhagen, Jeremy Grant Stoddard, Dirk Jeroen Breebaart
HIGH PERFORMANCE MULTI SPECTRAL COATED FREEFORM OPTICAL ELEMENTS

Publication number: 20240326363

Abstract: A glass part to be made into a two-part freeform optical element is received. The glass part includes a precision freeform surface in a first specific spatial shape and a to-be-corrected surface. A non-glass material layer is added onto the to-be-corrected surface of the glass part to produce an accurate second surface in a second specific spatial shape. An optical coating is applied to the precision freeform surface of the glass part in the two-part freeform optical element.

Type: Application

Filed: July 27, 2022

Publication date: October 3, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Titus Marc DEVINE
Methods and Apparatus for Rendering Audio Objects

Publication number: 20240334145

Abstract: Multiple virtual source locations may be defined for a volume within which audio objects can move. A set-up process for rendering audio data may involve receiving reproduction speaker location data and pre-computing gain values for each of the virtual sources according to the reproduction speaker location data and each virtual source location. The gain values may be stored and used during “run time,” during which audio reproduction data are rendered for the speakers of the reproduction environment. During run time, for each audio object, contributions from virtual source locations within an area or volume defined by the audio object position data and the audio object size data may be computed. A set of gain values for each output channel of the reproduction environment may be computed based, at least in part, on the computed contributions. Each output channel may correspond to at least one reproduction speaker of the reproduction environment.

Type: Application

Filed: April 1, 2024

Publication date: October 3, 2024

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Antonio MATEOS SOLE, Nicolas R. TSINGOS
SCALABLE SYSTEMS FOR CONTROLLING COLOR MANAGEMENT COMPRISING VARYING LEVELS OF METADATA

Publication number: 20240323413

Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

Type: Application

Filed: June 3, 2024

Publication date: September 26, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
DYNAMICS PROCESSING ACROSS DEVICES WITH DIFFERING PLAYBACK CAPABILITIES

Publication number: 20240323608

Abstract: Individual loudspeaker dynamics processing configuration data, for each of a plurality of loudspeakers of a listening environment, may be obtained. Listening environment dynamics processing configuration data may be determined, based on the individual loudspeaker dynamics processing configuration data. Dynamics processing may be performed on received audio data based on the listening environment dynamics processing configuration data, to generate processed audio data. The processed audio data may be rendered for reproduction via a set of loudspeakers that includes at least some of the plurality of loudspeakers, to produce rendered audio signals. The rendered audio signals may be provided to, and reproduced by, the set of loudspeakers.

Type: Application

Filed: June 3, 2024

Publication date: September 26, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Alan J. Seefeldt, Joshua B. Lando, Daniel Arteaga
SCALABLE SYSTEMS FOR CONTROLLING COLOR MANAGEMENT COMPRISING VARYING LEVELS OF METADATA

Publication number: 20240323414

Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.

Type: Application

Filed: June 3, 2024

Publication date: September 26, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
SIGNAL RESHAPING FOR HIGH DYNAMIC RANGE SIGNALS

Publication number: 20240314364

Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.

Type: Application

Filed: May 30, 2024

Publication date: September 19, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
BACKWARD-COMPATIBLE INTEGRATION OF HARMONIC TRANSPOSER FOR HIGH FREQUENCY RECONSTRUCTION OF AUDIO SIGNALS

Publication number: 20240312471

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.

Type: Application

Filed: May 29, 2024

Publication date: September 19, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
NARROW APERTURE WAVEGUIDE LOUDSPEAKER FOR USE WITH FLAT PANEL DISPLAY DEVICES

Publication number: 20240314494

Abstract: A speaker having a narrow aperture waveguide for transmitting sound from the rear side of a flat display panel. A transducer of the speaker radiates sound through the waveguide and outwards from the rear of the display panel and around an edge of the display panel to form soundwaves radiating directly or nearly directly to a listener positioned in front of the display panel. The waveguide includes fins that control the directivity of soundwaves exiting the waveguide and around the edge of the display panel.

Type: Application

Filed: April 14, 2022

Publication date: September 19, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Benjamin Alexander JANCOVICH
AUDIO CONTROL USING AUDITORY EVENT DETECTION

Publication number: 20240313730

Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.

Type: Application

Filed: May 23, 2024

Publication date: September 19, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Brett G. CROCKETT, Alan J. SEEFELDT
AUDIO CONTROL USING AUDITORY EVENT DETECTION

Publication number: 20240313731

Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.

Type: Application

Filed: May 23, 2024

Publication date: September 19, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Brett G. CROCKETT, Alan J. SEEFELDT
AUDIO CONTROL USING AUDITORY EVENT DETECTION

Publication number: 20240313729

Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.

Type: Application

Filed: May 23, 2024

Publication date: September 19, 2024

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Brett G. CROCKETT, Alan J. SEEFELDT
CROSS PRODUCT ENHANCED SUBBAND BLOCK BASED HARMONIC TRANSPOSITION

Publication number: 20240312470

Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

Type: Application

Filed: May 28, 2024

Publication date: September 19, 2024

Applicant: DOLBY INTERNATIONAL AB

Inventor: Lars Villemoes
MULTI-BAND DUCKING OF AUDIO SIGNALS

Publication number: 20240304196

Abstract: A method for multi-band ducking of audio signals is provided. In some implementations, the method involves receiving, at a decoder, an input audio signal, wherein the input audio signal is a downmixed audio signal. In some implementations, the method involves separating the input audio signal into a first set of frequency bands. In some implementations, the method involves determining a set of ducking gains, a ducking gain corresponding to a frequency band of the first set of frequency bands. In some implementations, the method involves generating a broadband decorrelated audio signal, wherein ducking gains of the set of ducking gains are applied to at least one of: 1) a second set of frequency bands prior to generating the at least one broadband decorrelated audio signal: or 2) a third set of frequency bands that separates the at least one broadband decorrelated audio signal.

Type: Application

Filed: April 1, 2022

Publication date: September 12, 2024

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Rishabh TYAGI, Heiko PURNHAGEN
VIDEO COMPRESSION AND TRANSMISSION TECHNIQUES

Publication number: 20240305783

Abstract: Embodiments feature families of rate allocation and rate control methods that utilize advanced processing of past and future frame/field picture statistics and are designed to operate with one or more coding passes. At least two method families include: a family of methods for a rate allocation with picture look-ahead; and a family of methods for average bit rate (ABR) control methods. At least two other methods for each method family are described. For the first family of methods, some methods may involve intra rate control. For the second family of methods, some methods may involve high complexity ABR control and/or low complexity ABR control. These and other embodiments can involve any of the following: spatial coding parameter adaptation, coding prediction, complexity processing, complexity estimation, complexity filtering, bit rate considerations, quality considerations, coding parameter allocation, and/or hierarchical prediction structures, among others.

Type: Application

Filed: May 17, 2024

Publication date: September 12, 2024

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Athanasios Leontaris, Alexandros Tourapis
METHODS, APPARATUS AND SYSTEM FOR RENDERING AN AUDIO PROGRAM

Publication number: 20240304194

Abstract: A method for generating a bitstream indicative of an object based audio program is described. The bitstream comprises a sequence of containers. A first container of the sequence of containers comprises a plurality of substream entities for a plurality of substreams of the object based audio program and a presentation section. The method comprises determining a set of object channels. The method further comprises providing a set of object related metadata for the set of object channels. In addition, the method comprises inserting a first set of object channel frames and a first set of object related metadata frames into a respective set of substream entities of the first container. Furthermore, the method comprises inserting presentation data into the presentation section.

Type: Application

Filed: March 14, 2024

Publication date: September 12, 2024

Applicant: Dolby International AB

Inventors: Christof FERSCH, Alexander STAHLMANN

prev 1 2 3 4 5 6 7 … next