Dolby Labs Patents

Dolby Laboratories, Inc. licenses its audio technologies, including its noise-reduction systems, to the media industry. Its product portfolio includes Dolby Digital Plus (DD+), Dolby Digital (DD), AAC and HE-AAC, Dolby TrueHD, Dolby Atmos, Dolby AC-4, Dolby Voice and Dolby Vision. Products that incorporate Dolby technologies include televisions, set-top boxes, computers, DVD and Blu-ray devices, soundbars, smartphones, tablets, video game consoles, and automobile entertainment systems.

Dolby Labs Patents by Type
  • Patent number: 11930347
    Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.
    Type: Grant
    Filed: February 12, 2020
    Date of Patent: March 12, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu
  • Patent number: 11929085
    Abstract: Described herein is a method of low-bitrate coding of audio data and generating enhancement metadata for controlling audio enhancement of the low-bitrate coded audio data at a decoder side, including the steps of: (a) core encoding original audio data at a low bitrate to obtain encoded audio data; (b) generating enhancement metadata to be used for controlling a type and/or amount of audio enhancement at the decoder side after core decoding the encoded audio data; and (c) outputting the encoded audio data and the enhancement metadata. Described is further an encoder configured to perform said method. Described is moreover a method for generating enhanced audio data from low-bitrate coded audio data based on enhancement metadata and a decoder configured to perform said method.
    Type: Grant
    Filed: August 29, 2019
    Date of Patent: March 12, 2024
    Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Arijit Biswas, Jia Dai, Aaron Steven Master
  • Publication number: 20240080489
    Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Fangjun PU, Taoran LU, Peng YIN, Sean Thomas MCCARTHY
  • Publication number: 20240079015
    Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
    Type: Application
    Filed: September 19, 2023
    Publication date: March 7, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
  • Publication number: 20240080608
    Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.
    Type: Application
    Filed: December 14, 2021
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
  • Publication number: 20240080479
    Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.
    Type: Application
    Filed: November 7, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
  • Publication number: 20240079019
    Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.
    Type: Application
    Filed: November 13, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
  • Publication number: 20240080465
    Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.
    Type: Application
    Filed: November 13, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
  • Patent number: 11922639
    Abstract: A method for generating an high-dynamic-range (HDR) color image from a dual-exposure-time single-shot HDR color image sensor includes obtaining pixel values generated by a local region of sensor pixels of the image sensor, determining a motion parameter for the local region from pixel values associated with a first color, and demosaicing the pixel values of the local region to determine, for each of three colors, an output value of the images pixel, wherein relative contributions of short-exposure-time pixels and long-exposure-time pixels to the output value are weighted according to the motion parameter.
    Type: Grant
    Filed: June 4, 2019
    Date of Patent: March 5, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Yuelong Li, Jon Scott McElvain, Mohammad Tofighi
  • Patent number: 11924619
    Abstract: An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.
    Type: Grant
    Filed: September 12, 2022
    Date of Patent: March 5, 2024
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Mark F. Davis, Nicolas R. Tsingos, C. Phillip Brown
  • Patent number: 11924451
    Abstract: A reference picture information decoding unit (13) omits decoding of a reference list sorting presence or absence flag and/or a reference list sorting order based on the number of current picture referable pictures.
    Type: Grant
    Filed: March 21, 2022
    Date of Patent: March 5, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Tomoyuki Yamamoto, Tomohiro Ikai, Yukinobu Yasugi, Takeshi Tsukuba
  • Patent number: 11924477
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Grant
    Filed: June 27, 2023
    Date of Patent: March 5, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
  • Publication number: 20240073459
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Application
    Filed: October 31, 2023
    Publication date: February 29, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
  • Publication number: 20240073444
    Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.
    Type: Application
    Filed: November 8, 2023
    Publication date: February 29, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jong Ki HAN, Jae Yung LEE
  • Publication number: 20240071411
    Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.
    Type: Application
    Filed: January 4, 2022
    Publication date: February 29, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jundai SUN, Lie LU, Shaofan YANG, Rhonda J. WILSON, Dirk Jeroen BREEBAART
  • Publication number: 20240073357
    Abstract: Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.
    Type: Application
    Filed: January 31, 2022
    Publication date: February 29, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Martin J. Richards
  • Patent number: 11915713
    Abstract: The present document relates to audio source coding systems. In particular, the present document relates to audio source coding systems which make use of linear prediction in combination with a filterbank. A method for estimating a first sample (615) of a first subband signal in a first subband of an audio signal is described. The first subband signal of the audio signal is determined using an analysis filterbank (612) comprising a plurality of analysis filters which provide a plurality of subband signals in a plurality of subbands from the audio signal, respectively.
    Type: Grant
    Filed: March 30, 2023
    Date of Patent: February 27, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventor: Lars Villemoes
  • Patent number: 11917171
    Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
    Type: Grant
    Filed: July 7, 2023
    Date of Patent: February 27, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
  • Patent number: 11917386
    Abstract: Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may include estimating in which of the zones the user is located. The audio activity may be performed using microphones and loudspeakers which are implemented in or coupled to smart audio devices.
    Type: Grant
    Filed: March 7, 2023
    Date of Patent: February 27, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Carlos Eduardo Medaglia Dyonisio, David Gunawan
  • Publication number: 20240062765
    Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component which each performs stereo decoding on one channel resulting from the first stereo decoding component, and one channel resulting from the second stereo decoding component.
    Type: Application
    Filed: September 1, 2023
    Publication date: February 22, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Harald MUNDT, Heiko PURNHAGEN
  • Patent number: 11910025
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Grant
    Filed: October 31, 2023
    Date of Patent: February 20, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
  • Patent number: 11908486
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
    Type: Grant
    Filed: January 20, 2023
    Date of Patent: February 20, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Kristofer Kjoerling, Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Publication number: 20240056760
    Abstract: A method of audio processing includes performing spatial analysis on a binaural signal to estimate level differences and phase differences characteristic of a binaural filter of the binaural signal, performing object extraction on the binaural audio signal using the estimated level and phase differences to generate a left/right main component signal and a left/right residual component signal. The system may process the left/right main and left/right residual components differently using different object processing parameters for e.g. repositioning, equalization, compression, upmixing, channel remapping or storage to generate a processed binaural signal that provides an improved listening experience. Repositioning may be based on head tracking sensor data.
    Type: Application
    Filed: December 16, 2021
    Publication date: February 15, 2024
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Dirk Jeroen BREEBAART, Giulio CENGARLE, C. Phillip BROWN
  • Publication number: 20240056755
    Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.
    Type: Application
    Filed: August 28, 2023
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Florian KEILER, Johannes Boehm
  • Publication number: 20240056610
    Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.
    Type: Application
    Filed: October 13, 2023
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
  • Publication number: 20240056757
    Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
  • Publication number: 20240055006
    Abstract: Described herein is a method for setting up a decoder for generating processed audio data from an audio bitstream, the decoder comprising a Generator of a Generative Adversarial Network, GAN, for processing of the audio data, wherein the method includes the steps of (a) pre-configuring the Generator for processing of audio data with a set of parameters for the Generator, the parameters being determined by training, at training time, the Generator using the full concatenated distribution; and (b) pre-configuring the decoder to determine, at decoding time, a truncation mode for modifying the concatenated distribution and to apply the determined truncation mode to the concatenated distribution. Described are further a method of generating processed audio data from an audio bitstream using a Generator of a Generative Adversarial Network, GAN, for processing of the audio data and a respective apparatus. Moreover, described are also respective systems and computer program products.
    Type: Application
    Filed: December 15, 2021
    Publication date: February 15, 2024
    Applicant: Dolby International AB
    Inventor: Arijit BISWAS
  • Publication number: 20240055010
    Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
    Type: Application
    Filed: August 21, 2023
    Publication date: February 15, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Per EKSTRAND
  • Publication number: 20240056649
    Abstract: A method for delivering media content to one or more clients over a distributed system is disclosed. The method may include generating a plurality of network-coded symbols from a plurality of original symbols representing a first media asset. The method may further include generating an original plurality of coded variants of the first media asset. The method may further include distributing a first coded variant of the original plurality of coded variants to a first cache on a first server device for storage in the first cache. The method may further include distributing a second coded variant of the original plurality of coded variants to a second cache on a second server device for storage in the second cache.
    Type: Application
    Filed: December 16, 2021
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jeffrey RIEDMILLER, Mingchao YU, Jason Michael CLOUD
  • Publication number: 20240056613
    Abstract: Disclosed is an encoding/decoding method and apparatus related to adaptive deblocking filtering. There is provided an image decoding method performing adaptive filtering in inter-prediction, the method including: reconstructing, from a bitstream, an image signal including a reference block on which block matching is performed in inter-prediction of a current block to be encoded; obtaining, from the bitstream, a flag indicating whether the reference block exists within a current picture where the current block is positioned; reconstructing the current block by using the reference block; adaptively applying an in-loop filter for the reconstructed current block based on the obtained flag; and storing the current block to which the in-loop filter is or is not applied in a decoded picture buffer (DPB).
    Type: Application
    Filed: October 24, 2023
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Je Chang JEONG, Ki Baek KIM
  • Patent number: 11902762
    Abstract: Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent component of the rendering component, processing the rendering component by updating the orientation dependent component according to an orientation of the loudspeakers and dispatching the received audio streams to the plurality of loudspeakers for playback based on the processed rendering component. Corresponding system and computer program products are also disclosed.
    Type: Grant
    Filed: May 4, 2022
    Date of Patent: February 13, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, Guilin Ma, Xiguang Zheng
  • Patent number: 11902769
    Abstract: The present disclosure relates to a method of processing audio content including directivity information for at least one sound source, the directivity information comprising a first set of first directivity unit vectors representing directivity directions and associated first directivity gains. The disclosure further relates to corresponding methods of encoding and decoding audio content including directivity information for at least one sound source.
    Type: Grant
    Filed: June 30, 2020
    Date of Patent: February 13, 2024
    Assignee: Dolby International AB
    Inventors: Leon Terentiv, Christof Fersch, Daniel Fischer
  • Publication number: 20240048932
    Abstract: An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.
    Type: Application
    Filed: August 24, 2023
    Publication date: February 8, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: McGregor Steele JOYNER, Alex BRANDMEYER, Scott DALY, Jeffrey Ross BAKER, Andrea FANELLI, Poppy Anne Carrie CRUM
  • Publication number: 20240048931
    Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: February 8, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
  • Publication number: 20240046940
    Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.
    Type: Application
    Filed: October 5, 2023
    Publication date: February 8, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Lars Villemoes
  • Patent number: 11895477
    Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore, compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. The ambient HOA component is represented by a minimum number of HOA coefficient sequences. The remaining channels contain either directional signals or additional coefficient sequences of the ambient HOA component, depending on what will result in optimum perceptual quality. This processing can change on a frame-by-frame basis.
    Type: Grant
    Filed: March 21, 2022
    Date of Patent: February 6, 2024
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 11895369
    Abstract: The present disclosure relates to methods and apparatus for processing media content having video content and associated audio content. A method of processing media content having video content and associated audio content comprises the method includes receiving the video content and the associated audio content, analyzing the associated audio content, determining one or more navigation points for enabling navigation of the media content based on the analysis, wherein the one or more navigation points indicate points of interest in the associated audio content for short-term rewinding and/or fast forwarding, embedding the one or more navigation points into metadata for the media content, and outputting the video content, the associated audio content, and the metadata.
    Type: Grant
    Filed: August 22, 2018
    Date of Patent: February 6, 2024
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Christopher Graham Hines
  • Patent number: 11895479
    Abstract: A method for steering binauralization of audio is provided. The method comprises steps of: receiving (410) an audio input signal, calculating (430) a confidence value indicating a likelihood that a current audio frame of the audio input signal comprises binauralized audio; determining (450) a state signal based on the confidence value; determining (460) a steering signal, based on the first confidence value, the state signal and an energy value of the audio frame; and generating (470) an audio output signal with steered binauralization by processing the audio input signal according to the steering signal.
    Type: Grant
    Filed: August 19, 2020
    Date of Patent: February 6, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Qingyuan Bin, Libin Luo, Ziyu Yang, Zhiwei Shuang, Xuemei Yu, Guiping Wang
  • Patent number: 11893700
    Abstract: Spatial information that describes spatial locations of visual objects as in a three-dimensional (3D) image space as represented in one or more multi-view unlayered images is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.
    Type: Grant
    Filed: April 28, 2022
    Date of Patent: February 6, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Ajit Ninan, Neil Mammen, Tyrome Y. Brown
  • Patent number: 11895416
    Abstract: A device includes an electronic processor configured to define a first set of sample pixels from a set of sample pixels determined from received video data according to a first electro-optical transfer function (EOTF) in a first color representation of a first color space; convert the first set of sample pixels to a second EOTF via a mapping function, producing a second set of sample pixels according to the second EOTF; convert the first and second set of sample pixels from the first color representation to a second color representation of the first color space; determine a backward reshaping function by repeatedly applying and adjusting a sample backward reshaping function so as to minimize a difference between predicted pixel values obtained by applying the sample backward reshaping function to the pixels of the converted first set and the pixels of the converted second set.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: February 6, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming Su, Harshad Kadu, Neeraj J. Gadgil, Qing Song, Yoon Yung Lee
  • Patent number: 11894003
    Abstract: Audio objects are associated with positional metadata. A received downmix signal comprises downmix channels that are linear combinations of one or more audio objects and are associated with respective positional locators. In a first aspect, the downmix signal, the positional metadata and frequency-dependent object gains are received. An audio object is reconstructed by applying the object gain to an upmix of the downmix signal in accordance with coefficients based on the positional metadata and the positional locators. In a second aspect, audio objects have been encoded together with at least one bed channel positioned at a positional locator of a corresponding downmix channel. The decoding system receives the downmix signal and the positional metadata of the audio objects. A bed channel is reconstructed by suppressing the content representing audio objects from the corresponding downmix channel on the basis of the positional locator of the corresponding downmix channel.
    Type: Grant
    Filed: February 10, 2023
    Date of Patent: February 6, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Toni Hirvonen, Heiko Purnhagen, Leif Jonas Samuelsson, Lars Villemoes
  • Patent number: 11895482
    Abstract: A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound field and a second set of decoded higher order ambisonics signals representing ambient components of the sound field. The method also includes combining the first set of decoded higher order ambisonics signals and the second set of decoded higher order ambisonics signals to produce a combined set of decoded higher order ambisonics signals.
    Type: Grant
    Filed: January 25, 2023
    Date of Patent: February 6, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Peter Jax, Johannes Boehm, William G. Redmann
  • Patent number: 11894002
    Abstract: An apparatus for processing an input audio signal relies on a cascade of filterbanks, the cascade having a synthesis filterbank for synthesizing an audio intermediate signal from the input audio signal, the input audio signal being represented by a plurality of first subband signals generated by an analysis filterbank, wherein a number of filterbank channels of the synthesis filterbank is smaller than a number of channels of the analysis filterbank. The apparatus furthermore has a further analysis filterbank for generating a plurality of second subband signals from the audio intermediate signal, wherein the further analysis filterbank has a number of channels being different from the number of channels of the synthesis filterbank, so that a sampling rate of a subband signal of the plurality of second subband signals is different from a sampling rate of a first subband signal of the plurality of first subband signals.
    Type: Grant
    Filed: October 21, 2022
    Date of Patent: February 6, 2024
    Assignees: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung, Dolby International AB
    Inventors: Lars Villemoes, Per Ekstrand, Sascha Disch, Frederik Nagel, Stephan Wilde
  • Patent number: 11894006
    Abstract: The processing of audio signals during playback is provided, so that audio signals that fall below a specified threshold loudness level are processed to avoid making unwanted background noise audible. N-channel audio is received from a playback volume controller/leveler (101). The level of the audio is compared with a threshold level. If the level is greater than the threshold level, the audio is processed with a first amount of gain in accordance with a first dynamic range control (DRC) compression curve that is tuned for professionally produced audio. If the level is less than or equal to the threshold level, the audio is processed with a second amount of gain in accordance with a second DRC compression curve that is designed to avoid boosting unwanted background noise. After applying the gain to the audio, the audio is sent to a downstream device.
    Type: Grant
    Filed: July 18, 2019
    Date of Patent: February 6, 2024
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Zhongjin Wang, Andrew Peter Reilly, Michael William Mason
  • Patent number: 11894005
    Abstract: An audio communication endpoint receives a bitstream containing spectral components representing spectral content of an audio signal, wherein the spectral components relate to a first range extending up to a first break frequency, above which any spectral components are unassigned. The endpoint adapts the received bitstream in accordance with a second range extending up to a second break frequency by removing spectral components or adding neutral-valued spectral components relating to a range between the first and second break frequencies. The endpoint then attenuates spectral content in a neighbourhood of the least of the first and second break frequencies for thereby achieving a gradual spectral decay. After this, reconstructing the audio signal is reconstructed by an inverse transform operating on spectral components relating to said second range in the adapted and attenuated received bitstream. At small computational expense, the endpoint may to adapt to different sample rates in received bitstreams.
    Type: Grant
    Filed: November 15, 2019
    Date of Patent: February 6, 2024
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Heiko Purnhagen, Leif Sehlstrom, Lars Villemoes, Glenn N. Dickins, Mark S Vinton
  • Publication number: 20240039499
    Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
    Type: Application
    Filed: July 20, 2023
    Publication date: February 1, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
  • Publication number: 20240038248
    Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (?2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (?) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (?2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.
    Type: Application
    Filed: August 7, 2023
    Publication date: February 1, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Dirk Jeroen BREEBAART
  • Publication number: 20240038258
    Abstract: A method of audio content identification includes using a two-stage classifier. The first stage includes previously-existing classifiers and the second stage includes a new classifier. The outputs of the first and second stages calculated over different time periods are combined to generate a steering signal. The final classification results from a combination of the steering signal and the outputs of the first and second stages. In this manner, a new classifier may be added without disrupting existing classifiers.
    Type: Application
    Filed: August 18, 2021
    Publication date: February 1, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guiping Wang, Lie Lu
  • Publication number: 20240040327
    Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (?)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (?)} was determined based on the spherical modelling grid and the HOA order N.
    Type: Application
    Filed: July 26, 2023
    Publication date: February 1, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20240040043
    Abstract: Disclosed is a method for managing acoustic feedback in real-time audio communications in a communications system, the method comprising determining, by means of a detection module, whether a first communication device is in loudspeaker mode, whether the first communication device is in real-time audio communications with a second communication, and whether the first communication device and the second communication device are in a same acoustic space. Upon determining that this is the case a request signal for requesting one or more measures against acoustic feedback is provided to a mitigation module. Further disclosed are a device and a system configured to perform the method, a non-transitory computer-readable medium, an encoder and a decoder.
    Type: Application
    Filed: December 22, 2021
    Publication date: February 1, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Qianqian FANG, Kai LI, Yanmeng GUO, Wei HUANG, Yang LIU