Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240121420
    Abstract: A method is provided for coding at least one image split up into partitions, a current partition to be coded containing data, at least one data item of which is allotted a sign. The coding method includes, for the current partition, the following steps: calculating the value of a function representative of the data of the current partition with the exclusion of the sign; comparing the calculated value with a predetermined value of the sign; as a function of the result of the comparison, modifying or not modifying at least one of the data items of the current partition, in the case of modification, coding the at least one modified data item.
    Type: Application
    Filed: December 19, 2023
    Publication date: April 11, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Felix Henry, Gordon Clare
  • Publication number: 20240121424
    Abstract: Methods and systems for canvas size scalability across the same or different bitstream layers of a video coded bitstream are described. Offset parameters for a conformance window, a reference region of interest (ROI) in a reference layer, and a current ROI in a current layer are received. The width and height of a current ROI and a reference ROI are computed based on the offset parameters and they are used to generate a width and height scaling factor to be used by a reference picture resampling unit to generate an output picture based on the current ROI and the reference ROI.
    Type: Application
    Filed: December 18, 2023
    Publication date: April 11, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Taoran Lu, Fangjun Pu, Peng Yin, Sean Thomas McCarthy, Tao Chen
  • Publication number: 20240114153
    Abstract: A first image and a second image of different dynamic ranges are derived from the same source image. Based on a chroma sampling format of the first image, it is determined whether edge preserving filtering is to be used to generate chroma upsampled image data in a reconstructed image. If so, image metadata for performing the edge preserving filtering is generated. The first image, the second image and the image metadata are encoded into an image data container to enable a recipient device to generate the reconstructed image.
    Type: Application
    Filed: September 1, 2023
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Anustup Kumar Atanu CHOUDHURY, Guan-Ming SU
  • Publication number: 20240114307
    Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
    Type: Application
    Filed: September 12, 2023
    Publication date: April 4, 2024
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Stefan BRUHN
  • Publication number: 20240114309
    Abstract: Some examples involve rendering received audio data by determining a first relative activation of a set of loudspeakers in an environment according to a first rendering configuration corresponding to a first set of speaker activations, receiving a first rendering transition indication indicating a transition from the first rendering configuration to a second rendering configuration and determining a second set of speaker activations corresponding to a simplified version of the second rendering configuration. Some examples involve performing a first transition from the first set of speaker activations to the second set of speaker activations, determining a third set of speaker activations corresponding to a complete version of the second rendering configuration and performing a second transition to the third set of speaker activations without requiring completion of the first transition.
    Type: Application
    Filed: December 2, 2021
    Publication date: April 4, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Joshua B. LANDO, Alan J. SEEFELDT
  • Publication number: 20240114306
    Abstract: An multi-input, multi-output audio process is implemented as a linear system for use in an audio filterbank to convert a set of frequency-domain input audio signals into a set of frequency-domain output audio signals. A transfer function from one input to one output is defined as a frequency dependent gain function. In some implementations, the transfer function includes a direct component that is substantially defined as a frequency dependent gain, and one or more decorrelated components that have frequency-varying group phase response. The transfer function is formed from a set of sub-band functions, with each sub-band function being formed from a set of corresponding component transfer functions including direct component and one or more decorrelated components.
    Type: Application
    Filed: September 2, 2020
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: David S. MCGRATH
  • Publication number: 20240114127
    Abstract: Methods, systems, and devices implement intra-prediction for hexagonally-sampled compression and decompression of videos and images having a regular grid of hexagonally-shaped pixels. For encoding, a prediction unit (PU) shape is selected at a sequence level from the group consisting of parallelogram, zigzag-square, hexagonal super-pixel, a rectangular zigzag and an arrow, and the hexagonally-sampled image is divided into regions based on the PU shape. For each region: a prediction mode and a PU size are determined; reference pixels are determined for each predicted pixel in the PU shape based on the prediction mode; a weighted factor is determined for each of the reference pixels based on a distance between the reference pixel and the predicted pixel; and a predicted value of each of the predicted pixels in the PU shape is determined using the corresponding reference pixels and the weighted factors.
    Type: Application
    Filed: February 10, 2022
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Zhaobin ZHANG, Neeraj J. GADGIL, Guan-Ming SU
  • Publication number: 20240114308
    Abstract: Some methods involve receiving, by a control system that is configured for implementing a plurality of renderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each renderer and according to the listening configuration data, the received audio data to obtain a set of renderer-specific loudspeaker feed signals for a corresponding listening configuration. Each renderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a renderer-specific set of frequency bands and combining the renderer-specific frequency bands of each renderer to produce an output set of loudspeaker feed signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alan J. SEEFELDT, C. Phillip BROWN
  • Publication number: 20240107252
    Abstract: An attenuation or “gap” may be inserted into at least a first frequency range of at least first and second audio playback signals of a content stream during at least a first time interval to generate at least first and second modified audio playback signals. Corresponding audio device playback sound may be provided by at least first and second audio devices. At least one microphone may detect at least the first audio device playback sound and the second audio device playback sound and may generate corresponding microphone signals. Audio data may be extracted from the microphone signals in at least the first frequency range, to produce extracted audio data. A far-field audio environment impulse response and/or audio environment noise may be estimated based, at least in part, on the extracted audio data.
    Type: Application
    Filed: December 2, 2021
    Publication date: March 28, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Christopher Graham HINES, Benjamin John SOUTHWELL
  • Publication number: 20240105192
    Abstract: Embodiments are disclosed for spatial noise filling in multi-channel codecs. In an embodiment, a method of regenerating background noise ambience in a multi-channel codec by generating spatial hole filling noise comprises: computing noise estimates based on a primary downmix channel generated from an input audio signal representing a spatial audio scene with background noise ambience; computing spectral shaping filter coefficients based on the noise estimates; spectrally shaping the multi-channel noise signal using the spectral shaping filter coefficients and a noise distribution, the spectral shaping resulting in a diffused, multi-channel noise signal with uncorrelated channels; spatially shaping the diffused, uncorrelated multi-channel noise signal with uncorrelated channels based on a noise ambience of the spatial audio scene; and adding the spatially and spectrally shaped multi-channel noise to a multi-channel codec output to synthesize the background noise ambience of the spatial audio scene.
    Type: Application
    Filed: December 1, 2021
    Publication date: March 28, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Rishabh TYAGI, Michael ECKERT
  • Publication number: 20240107255
    Abstract: Some methods involve receiving, by a control system configured for implementing a plurality of Tenderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each Tenderer and according to the listening configuration data, the received audio data to obtain a set of Tenderer-specific loudspeaker feed signals for a corresponding listening configuration. Each Tenderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a Tenderer-specific set of frequency bands and combining the renderer-specific frequency bands of each Tenderer to produce an output set of loudspeaker feed signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: March 28, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alan J. SEEFELDT, C. Phillip BROWN
  • Publication number: 20240103801
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.
    Type: Application
    Filed: October 9, 2023
    Publication date: March 28, 2024
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Karl Jonas ROEDEN
  • Publication number: 20240105191
    Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
    Type: Application
    Filed: November 29, 2023
    Publication date: March 28, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Per EKSTRAND, Lars VILLEMOES
  • Publication number: 20240105186
    Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.
    Type: Application
    Filed: October 16, 2023
    Publication date: March 28, 2024
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson, Jeroen Koppens, Rhonda J. Wilson, Heiko Purnhagen, Alexander Stahlmann
  • Publication number: 20240098446
    Abstract: Images are acquired through image sensors operating in conjunction with a media consumption system. The acquired images are used to determine a user's movement in a plurality of degrees of freedom. Sound images depicted in spatial audio rendered by audio speakers operating in conjunction with the media consumption system are adapted based at least in part on the user's movement in the plurality of degrees of freedom.
    Type: Application
    Filed: November 27, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, William Anthony ROZZI
  • Publication number: 20240095893
    Abstract: A first reshaping mapping is performed on a first image represented in a first domain to generate a second image represented in a second domain. The first domain is of a first dynamic range different from a second dynamic range of which the second domain is. A second reshaping mapping is performed on the second image represented in the second domain to generate a third image represented in the first domain. The third image is perceptually different from the first image in at least one of: global contrast, global saturation, local contrast, local saturation, etc. A display image is derived from the third image and rendered on a display device.
    Type: Application
    Filed: January 26, 2022
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming SU, Harshad KADU, Per Jonas Andreas KLITTMARK, Tao CHEN
  • Publication number: 20240098436
    Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.
    Type: Application
    Filed: November 9, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20240098438
    Abstract: The present invention is directed to methods and apparatus for translating a first plurality of audio input channels to a second plurality of audio output channels. This includes determining that there is pair-wise coding among any of the first plurality of audio input channels, determining an input/output-mapping matrix for mapping at least a first set of the first plurality of audio input channels to at least a second set of the second plurality of audio output channels; and deriving the second plurality of audio output channels based on first plurality of audio input channels, the input/output-mapping matrix and the determined pair-wise coding. The first plurality of audio input channels represent the same soundfield represented by the second plurality of audio output channels.
    Type: Application
    Filed: September 25, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Mark F. DAVIS
  • Publication number: 20240098264
    Abstract: The present invention relates to a block partitioning structure in video coding technology, and a video encoding and decoding method and apparatus using the same, wherein the video encoding and decoding method includes the steps of: acquiring quad-partitioning information of a block; acquiring bi-partitioning information of the block when the acquired quad-partitioning information of the block does not indicate four partitions; acquiring partitioning direction information for bi-partitioning of the block when the acquired bi-partitioning information of the block indicates two partitions; acquiring information on whether to perform any other type of partitioning, when the acquired bi-partitioning information of the block does not indicate two partitions; and acquiring additional information required for the any other type of partitioning, when the acquired information on whether to perform any other type of partitioning indicates that the any other type of partitioning is performed.
    Type: Application
    Filed: November 29, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ho Chan RYU, Yong Jo AHN
  • Publication number: 20240098435
    Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
    Type: Application
    Filed: September 18, 2023
    Publication date: March 21, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, David S. MCGRATH, Glenn N. DICKINS
  • Publication number: 20240098286
    Abstract: An electronic device for encoding a picture is described. The electronic device includes a processor and instructions stored in memory that are in electronic communication with the processor. The instructions are executable to encode a step-wise temporal sub-layer access (STSA) sample grouping. The instructions are further executable to send and/or store the STSA sample grouping.
    Type: Application
    Filed: November 21, 2023
    Publication date: March 21, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Sachin G. Deshpande
  • Publication number: 20240098229
    Abstract: Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller, a first modulator, receiving light from the light source and rendering a halftone image of said the input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from input image, correcting for misalignment between the first and second modulators and calibrating the projector system—e.g. over time—for continuous image improvement are also disclosed.
    Type: Application
    Filed: November 22, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jerome SHIELDS, Martin J. RICHARDS, Juan P. PERTIERRA
  • Publication number: 20240087584
    Abstract: An encoding system encodes an N-channel audio signal (X), wherein N?3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters ({tilde over (C)}, {tilde over (P)}). In a decoding system, a decorrelating section outputs, based on the downmix signal, an (N?1)-channel decorrelated signal (Z); a dry upmix section maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section combines outputs from the upmix sections to obtain a reconstructed signal ({circumflex over (X)}) corresponding to the signal to be reconstructed.
    Type: Application
    Filed: September 25, 2023
    Publication date: March 14, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heidi-Maria Lehtonen, Heiko Purnhagen, Toni Hirvonen
  • Publication number: 20240089474
    Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 14, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
  • Publication number: 20240087590
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
    Type: Application
    Filed: November 14, 2023
    Publication date: March 14, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES, Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20240089438
    Abstract: According to the present invention, an adaptive scheme is applied to an image encoding apparatus that includes an inter-predictor, an intra-predictor, a transformer, a quantizer, an inverse quantizer, and an inverse transformer, wherein input images are classified into two or more different categories, and two or more modules from among the inter-predictor, the intra-predictor, the transformer, the quantizer, and the inverse quantizer are implemented to perform respective operations in different schemes according to the category to which an input image belongs. Thus, the invention has the advantage of efficiently encoding an image without the loss of important information as compared to a conventional image encoding apparatus which adopts a packaged scheme.
    Type: Application
    Filed: November 21, 2023
    Publication date: March 14, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jong Ki HAN, Chan Won SEO, Kwang Hyun CHOI
  • Publication number: 20240080489
    Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Fangjun PU, Taoran LU, Peng YIN, Sean Thomas MCCARTHY
  • Publication number: 20240079015
    Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
    Type: Application
    Filed: September 19, 2023
    Publication date: March 7, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
  • Publication number: 20240080608
    Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.
    Type: Application
    Filed: December 14, 2021
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
  • Publication number: 20240080465
    Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.
    Type: Application
    Filed: November 13, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
  • Publication number: 20240080479
    Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.
    Type: Application
    Filed: November 7, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
  • Publication number: 20240079019
    Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.
    Type: Application
    Filed: November 13, 2023
    Publication date: March 7, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
  • Publication number: 20240073459
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Application
    Filed: October 31, 2023
    Publication date: February 29, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
  • Publication number: 20240073444
    Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.
    Type: Application
    Filed: November 8, 2023
    Publication date: February 29, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jong Ki HAN, Jae Yung LEE
  • Publication number: 20240073357
    Abstract: Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.
    Type: Application
    Filed: January 31, 2022
    Publication date: February 29, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Martin J. Richards
  • Publication number: 20240071411
    Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.
    Type: Application
    Filed: January 4, 2022
    Publication date: February 29, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jundai SUN, Lie LU, Shaofan YANG, Rhonda J. WILSON, Dirk Jeroen BREEBAART
  • Publication number: 20240062765
    Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component which each performs stereo decoding on one channel resulting from the first stereo decoding component, and one channel resulting from the second stereo decoding component.
    Type: Application
    Filed: September 1, 2023
    Publication date: February 22, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Harald MUNDT, Heiko PURNHAGEN
  • Publication number: 20240056610
    Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.
    Type: Application
    Filed: October 13, 2023
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
  • Publication number: 20240056755
    Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.
    Type: Application
    Filed: August 28, 2023
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Florian KEILER, Johannes Boehm
  • Publication number: 20240056760
    Abstract: A method of audio processing includes performing spatial analysis on a binaural signal to estimate level differences and phase differences characteristic of a binaural filter of the binaural signal, performing object extraction on the binaural audio signal using the estimated level and phase differences to generate a left/right main component signal and a left/right residual component signal. The system may process the left/right main and left/right residual components differently using different object processing parameters for e.g. repositioning, equalization, compression, upmixing, channel remapping or storage to generate a processed binaural signal that provides an improved listening experience. Repositioning may be based on head tracking sensor data.
    Type: Application
    Filed: December 16, 2021
    Publication date: February 15, 2024
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Dirk Jeroen BREEBAART, Giulio CENGARLE, C. Phillip BROWN
  • Publication number: 20240055006
    Abstract: Described herein is a method for setting up a decoder for generating processed audio data from an audio bitstream, the decoder comprising a Generator of a Generative Adversarial Network, GAN, for processing of the audio data, wherein the method includes the steps of (a) pre-configuring the Generator for processing of audio data with a set of parameters for the Generator, the parameters being determined by training, at training time, the Generator using the full concatenated distribution; and (b) pre-configuring the decoder to determine, at decoding time, a truncation mode for modifying the concatenated distribution and to apply the determined truncation mode to the concatenated distribution. Described are further a method of generating processed audio data from an audio bitstream using a Generator of a Generative Adversarial Network, GAN, for processing of the audio data and a respective apparatus. Moreover, described are also respective systems and computer program products.
    Type: Application
    Filed: December 15, 2021
    Publication date: February 15, 2024
    Applicant: Dolby International AB
    Inventor: Arijit BISWAS
  • Publication number: 20240056757
    Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
  • Publication number: 20240056613
    Abstract: Disclosed is an encoding/decoding method and apparatus related to adaptive deblocking filtering. There is provided an image decoding method performing adaptive filtering in inter-prediction, the method including: reconstructing, from a bitstream, an image signal including a reference block on which block matching is performed in inter-prediction of a current block to be encoded; obtaining, from the bitstream, a flag indicating whether the reference block exists within a current picture where the current block is positioned; reconstructing the current block by using the reference block; adaptively applying an in-loop filter for the reconstructed current block based on the obtained flag; and storing the current block to which the in-loop filter is or is not applied in a decoded picture buffer (DPB).
    Type: Application
    Filed: October 24, 2023
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Je Chang JEONG, Ki Baek KIM
  • Publication number: 20240055010
    Abstract: An apparatus and method are disclosed for processing an audio signal. The apparatus includes an input interface, a digital filterbank having an analysis part and a synthesis part, a first phase shifter, a spectral envelope adjuster, a second phase shifter, and an output interface. The first phase shifter and the second phase shifter reduce a complexity of the digital filterbank, which includes both analysis and synthesis filters that are complex-exponential modulated versions of a prototype filter.
    Type: Application
    Filed: August 21, 2023
    Publication date: February 15, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Per EKSTRAND
  • Publication number: 20240056649
    Abstract: A method for delivering media content to one or more clients over a distributed system is disclosed. The method may include generating a plurality of network-coded symbols from a plurality of original symbols representing a first media asset. The method may further include generating an original plurality of coded variants of the first media asset. The method may further include distributing a first coded variant of the original plurality of coded variants to a first cache on a first server device for storage in the first cache. The method may further include distributing a second coded variant of the original plurality of coded variants to a second cache on a second server device for storage in the second cache.
    Type: Application
    Filed: December 16, 2021
    Publication date: February 15, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jeffrey RIEDMILLER, Mingchao YU, Jason Michael CLOUD
  • Publication number: 20240048931
    Abstract: Some methods may involve receiving a first content stream that includes first audio signals, rendering the first audio signals to produce first audio playback signals, generating first direct sequence spread spectrum (DSSS) signals, generating first modified audio playback signals by inserting the first DSSS signals into the first audio playback signals, and causing a loudspeaker system to play back the first modified audio playback signals, to generate first audio device playback sound. The method(s) may involve receiving microphone signals corresponding to at least the first audio device playback sound and to second through Nth audio device playback sound corresponding to second through Nth modified audio playback signals (including second through Nth DSSS signals) played back by second through Nth audio devices, extracting second through Nth DSSS signals from the microphone signals and estimating at least one acoustic scene metric based, at least partly, on the second through Nth DSSS signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: February 8, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Benjamin John SOUTHWELL, David GUNAWAN, Mark R.P. THOMAS, Christopher Graham HINES
  • Publication number: 20240048932
    Abstract: An apparatus and method of generating personalized HRTFs. The system is prepared by calculating a model for HRTFs described as the relationship between a finite example set of input data, namely anthropometric measures and demographic information for a set of individuals, and a corresponding set of output data, namely HRTFs numerically simulated using a high-resolution database of 3D scans of the same set of individuals. At the time of use, the system queries the user for their demographic information, and then from a series of images of the user, the system detects and measures various anthropometric characteristics. The system then applies the prepared model to the anthropometric and demographic data as part of generating a personalized HRTF. In this manner, the personalized HRTF can be generated with more convenience than by performing a high-resolution scan or an acoustic measurement of the user, and with less computational complexity than by numerically simulating their HRTF.
    Type: Application
    Filed: August 24, 2023
    Publication date: February 8, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: McGregor Steele JOYNER, Alex BRANDMEYER, Scott DALY, Jeffrey Ross BAKER, Andrea FANELLI, Poppy Anne Carrie CRUM
  • Publication number: 20240046940
    Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.
    Type: Application
    Filed: October 5, 2023
    Publication date: February 8, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Lars Villemoes
  • Publication number: 20240039499
    Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
    Type: Application
    Filed: July 20, 2023
    Publication date: February 1, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
  • Publication number: 20240040327
    Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix {circumflex over (D)} with smoothing coefficients. The first decode matrix {circumflex over (D)} is based on a mix matrix G and a mode matrix {tilde over (?)}, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix {tilde over (?)} was determined based on the spherical modelling grid and the HOA order N.
    Type: Application
    Filed: July 26, 2023
    Publication date: February 1, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER