Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240151799
    Abstract: A method for performing calibration of magnetometers is provided. In some embodiments, the method involves obtaining a sequence of gyroscope measurements from one or more gyroscopes and a sequence of magnetometer measurements from one or more magnetometers. In some embodiments, the method involves determining a sequence of angular velocity estimates based on the sequence of gyroscope measurements. In some embodiments, the method involves determining a first estimate of a derivative of an external magnetic field based on the sequence of magnetometer measurements. In some embodiments, the method involves determining a second estimate of the derivative of the external magnetic field based on the sequence of angular velocity estimates. In some embodiments, the method involves identifying magnetometer calibration constants based on a difference between the first estimate of the derivative and the second estimate of the derivative.
    Type: Application
    Filed: April 19, 2022
    Publication date: May 9, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: David S. MCGRATH
  • Publication number: 20240155156
    Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame and reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.
    Type: Application
    Filed: January 16, 2024
    Publication date: May 9, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
  • Publication number: 20240155304
    Abstract: A method (700) for rendering an audio signal of an audio source (211, 212, 213) in a virtual reality rendering environment (180) is described. The method (700) comprises determining (701) whether or not a directivity pattern (232) of the audio source (211, 212, 213) is to be taken into account for a listening situation of a listener (181) within the virtual reality rendering environment (180). Furthermore, the method (700) comprises rendering (702) an audio signal of the audio source (211, 212, 213) without taking into account the directivity pattern (232) of the audio source (211, 212, 213), if it is determined that the directivity pattern (232) of the audio source (211, 212, 213) is not to be taken into account for the listening situation of the listener (181).
    Type: Application
    Filed: May 10, 2022
    Publication date: May 9, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leon Terentiv, Christof Joseph Fersch, Panji Setiawan, Daniel Fischer
  • Publication number: 20240153512
    Abstract: A method for performing gain control on audio signals is provided. In some implementations, the method involves determining downmixed signals associated with one or more downmix channels associated with a current frame of an audio signal to be encoded. In some implementations, the method involves determining whether an overload condition exists for an encoder. In some implementations, the method involves determining a gain parameter. In some implementations, the method involves determining at least one gain transition function based on the gain parameter and a gain parameter associated with a preceding frame of the audio signal. In some implementations, the method involves applying the at least one gain transition function to one or more of the downmixed signals. In some implementations, the method involves encoding the downmixed signals in connection with information indicative of gain control applied to the current frame.
    Type: Application
    Filed: March 8, 2022
    Publication date: May 9, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Panji Setiawan, Rishabh Tyagi, Stefan Bruhn
  • Publication number: 20240153517
    Abstract: A method for decoding an encoded audio bitstream in an audio processing system is disclosed. The method includes extracting from the encoded audio bitstream a first waveform-coded signal comprising spectral coefficients corresponding to frequencies up to a first cross-over frequency for a time frame and performing parametric decoding at a second cross-over frequency for the time frame to generate a reconstructed signal. The second cross-over frequency is above the first cross-over frequency and the parametric decoding uses reconstruction parameters derived from the encoded audio bitstream to generate the reconstructed signal. The method also includes extracting from the encoded audio bitstream a second waveform-coded signal comprising spectral coefficients corresponding to a subset of frequencies above the first cross-over frequency for the time frame and interleaving the second waveform-coded signal with the reconstructed signal to produce an interleaved signal for the time frame.
    Type: Application
    Filed: November 8, 2023
    Publication date: May 9, 2024
    Applicant: Dolby International AB
    Inventors: Kristofer Kjörling, Heiko Purnhagen, Harald Mundt, Karl Jonas Roeden, Leif Sehlström
  • Publication number: 20240155143
    Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
    Type: Application
    Filed: January 16, 2024
    Publication date: May 9, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
  • Publication number: 20240155289
    Abstract: Embodiments are disclosed for context aware soundscape control. In an embodiment, an audio processing method comprises: capturing, using a first set of microphones on a mobile device, a first audio signal from an audio scene; capturing, using a second set of microphones on a pair of earbuds, a second audio signal from the audio scene; capturing, using a camera on the mobile device, a video signal from a video scene; generating, with at least one processor, a processed audio signal from the first audio signal and the second audio signal, the processed audio signal generated with adaptive soundscape control based on context information; and combining, with the at least one processor, the processed audio signal and the captured video signal as multimedia output.
    Type: Application
    Filed: April 28, 2022
    Publication date: May 9, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Zhiwei SHUANG, Yuanxing MA, Yang LIU
  • Publication number: 20240153515
    Abstract: An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.
    Type: Application
    Filed: November 16, 2023
    Publication date: May 9, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jeffrey RIEDMILLER, Michael WARD
  • Publication number: 20240155095
    Abstract: A volumetric image of a scene can be created, in one embodiment, by recording, through a camera in a device, a series of images of the scene as the camera is moved along a path relative to the scene; during the recording, the device stores motion path metadata about the path, and the series of images is associated with the motion path metadata and a metadata label is associated with the series of images, the metadata label indicating that the recorded series of images represent a volumetric image of the scene. The series of images, the motion path metadata and the metadata label can be assembled into a package for distribution to devices that can view the volumetric image, which may be referred to as a limited volumetric image. The devices that receive the volumetric image can display the individual images in the series of images or as a video.
    Type: Application
    Filed: May 5, 2022
    Publication date: May 9, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Robin ATKINS
  • Publication number: 20240155427
    Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.
    Type: Application
    Filed: November 6, 2023
    Publication date: May 9, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20240155144
    Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
    Type: Application
    Filed: January 16, 2024
    Publication date: May 9, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
  • Publication number: 20240155161
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Application
    Filed: January 5, 2024
    Publication date: May 9, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
  • Publication number: 20240155207
    Abstract: A method for delivering media to a playback device includes outputting first test media to be viewed by a first user. The method further includes receiving a first user input related to a first perception of the first test media by the first user and indicating a first personalized quality of experience of the first user with respect to the first test media. The method further includes generating a first personalized sensitivity profile including one or more viewing characteristics of the first user based on the first user input, and determining, based at least in part on the first personalized sensitivity profile, a first media parameter. The first media parameter is determined in order to increase an efficiency of media delivery to the first playback device over a network while preserving the first personalized quality of experience of the first user.
    Type: Application
    Filed: November 16, 2023
    Publication date: May 9, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Doh-Suk KIM, Sean Thomas MCCARTHY, Scott DALY, Jeffrey RIEDMILLER, Ludovic Christophe MALFAIT, Raphael Marc ULLMANN, Jason Michael CLOUD
  • Publication number: 20240155277
    Abstract: Disclosed is a portable computing device (1) comprising a keyboard (13) and an acoustic transducer (16, 17), the keyboard (13) comprising a key (14, 15), wherein the acoustic transducer (16, 17) is placed in the key (14, 15), and wherein the key (14, 15) comprises a sound port (150) allowing sound generated by the transducer (16, 17) to propagate.
    Type: Application
    Filed: March 10, 2022
    Publication date: May 9, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Pengfeng ZHANG, Tiezhong LIU, Ruozhou HUANG, Nengkun LV, Wenjie GUI
  • Publication number: 20240144895
    Abstract: A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.
    Type: Application
    Filed: December 18, 2023
    Publication date: May 2, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jon Scott MILLER, Scott DALY, Mahdi NEZAMABADI, Robin ATKINS
  • Publication number: 20240147173
    Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.
    Type: Application
    Filed: October 16, 2023
    Publication date: May 2, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexander KRUEGER, Sven KORDON, Johannes BOEHM, Johann-Markus BATKE
  • Publication number: 20240144940
    Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
    Type: Application
    Filed: November 6, 2023
    Publication date: May 2, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
  • Publication number: 20240142861
    Abstract: A projection system for etendue utilization includes a first light source configured to emit a light, the light including a first etendue component and a second etendue component, wherein the first etendue component has a lower etendue than the second etendue component, a first projection optics configured to project a first image on a screen, a second projection optics configured to project a second image on the screen, and an etendue splitter component. The etendue splitter component is configured to receive the light from the light source, extract, from the light, the first etendue component and the second etendue component, provide the first etendue component to the first projection optics, and provide the second etendue component to the second projection optics.
    Type: Application
    Filed: March 10, 2022
    Publication date: May 2, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Juan Pablo PERTIERRA, Martin J. RICHARDS, Barrett LIPPEY, Trevor DAVIES, John Frederick ARNTSEN
  • Publication number: 20240144941
    Abstract: The present document relates to audio coding systems. In particular, the present document relates to efficient methods and systems for parametric multi-channel audio coding. An audio encoding system configured to generate a bitstream indicative of a downmix signal and spatial metadata for generating a multi-channel upmix signal from the downmix signal is described. The system comprises a downmix processing unit configured to generate the downmix signal from a multi-channel input signal; wherein the downmix signal comprises m channels and wherein the multi-channel input signal comprises n channels; n, m being integers with m<n. Furthermore, the system comprises a parameter processing unit configured to determine the spatial metadata from the multi-channel input signal.
    Type: Application
    Filed: November 9, 2023
    Publication date: May 2, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Tobias FRIEDRICH, Alexander MUELLER, Karsten LINZMEIER, Claus-Christian SPENGER, Tobias R. WAGENBLASS
  • Publication number: 20240147180
    Abstract: Systems, methods, and computer program products implementing a sensor data prediction algorithm are disclosed. An example method comprises receiving motion data representing motions of a head-mounted listening device; transforming the motion data into quaternion domain; predicting, by one or more processors, future motions of the head-mounted listening device, the predicting including creating angular acceleration data from the transformed motion data and applying one or more smoothing filters to the angular acceleration data, the predicted future motions including rotation angles around corresponding axes in the quaternion domain; and providing the predicted future motions of the head-mounted listening device to a processor for adjusting a sound field presented by the listening device such that the sound field follows predicted movements of the head-mounted listening device.
    Type: Application
    Filed: March 18, 2022
    Publication date: May 2, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Qi Huang, Baoli Yan, Zhifang Liu, Libin Luo
  • Publication number: 20240135940
    Abstract: A method for modifying object reconstruction information, comprising obtaining a set of N spatial audio objects, each spatial audio object including an audio signal and spatial metadata, obtaining an audio presentation representing the N spatial audio objects, obtaining object reconstruction information configured to reconstruct the N spatial audio objects from the audio presentation, applying the reconstruction information to the audio presentation to form a set of N reconstructed spatial audio objects, using a first rendering configuration, rendering the N spatial audio objects to obtain a first rendered presentation, and rendering the N reconstructed spatial audio objects to obtain a second rendered presentation, and modifying the reconstruction information based on a difference between the first rendered presentation and the second rendered presentation, thereby forming modified reconstruction information.
    Type: Application
    Filed: February 9, 2022
    Publication date: April 25, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leif Jonas SAMUELSSON, Heiko PURNHAGEN, Lars VILLEMOES
  • Publication number: 20240135937
    Abstract: Disclosed is an audio signal encoding/decoding method that uses an encoding downmix strategy applied at an encoder that is different than a decoding re-mix/upmix strategy applied at a decoder. Based on the type of downmix coding scheme, the method comprises: computing input downmixing gains to be applied to the input audio signal to construct a primary downmix channel; determining downmix scaling gains to scale the primary downmix channel; generating prediction gains based on the input audio signal, the input downmixing gains and the downmix scaling gains; determining residual channel(s) from the side channels by using the primary downmix channel and the prediction gains to generate side channel predictions and subtracting the side channel predictions from the side channels; determining decorrelation gains based on energy in the residual channels; encoding the primary downmix channel, the residual channel(s), the prediction gains and the decorrelation gains; and sending the bitstream to a decoder.
    Type: Application
    Filed: December 2, 2021
    Publication date: April 25, 2024
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Harald Mundt, David S. McGrath, Rishabh Tyagi
  • Publication number: 20240127831
    Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. A method for decoding an encoded bitstream of multi-channel audio data and associated metadata is provided, including transforming the first Ambisonics format of the multi-channel audio data to a second Ambisonics format representation of the multi-channel audio data, wherein the transforming maps the first Ambisonics format of the multi-channel audio data into the second Ambisonics format representation of the multi-channel audio data.
    Type: Application
    Filed: October 18, 2023
    Publication date: April 18, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Oliver WUEBBOLT, Peter JAX, Johannes BOEHM
  • Publication number: 20240127829
    Abstract: Methods and systems for advanced stereo processing of an audio signal are disclosed. The methods and systems include selecting a coding mode of either transform coding or linear predictive coding and performing advanced stereo processing when in the selected coding mode. Both encoding and decoding operations are provided.
    Type: Application
    Filed: December 18, 2023
    Publication date: April 18, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Pontus CARLSSON, Kristofer KJOERLING
  • Publication number: 20240127845
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Application
    Filed: December 20, 2023
    Publication date: April 18, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Lars VILLEMOES
  • Publication number: 20240121420
    Abstract: A method is provided for coding at least one image split up into partitions, a current partition to be coded containing data, at least one data item of which is allotted a sign. The coding method includes, for the current partition, the following steps: calculating the value of a function representative of the data of the current partition with the exclusion of the sign; comparing the calculated value with a predetermined value of the sign; as a function of the result of the comparison, modifying or not modifying at least one of the data items of the current partition; in the case of modification, coding the at least one modified data item.
    Type: Application
    Filed: December 19, 2023
    Publication date: April 11, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Felix Henry, Gordon Clare
  • Publication number: 20240121424
    Abstract: Methods and systems for canvas size scalability across the same or different bitstream layers of a video coded bitstream are described. Offset parameters for a conformance window, a reference region of interest (ROI) in a reference layer, and a current ROI in a current layer are received. The width and height of a current ROI and a reference ROI are computed based on the offset parameters and they are used to generate a width and height scaling factor to be used by a reference picture resampling unit to generate an output picture based on the current ROI and the reference ROI.
    Type: Application
    Filed: December 18, 2023
    Publication date: April 11, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Taoran Lu, Fangjun Pu, Peng Yin, Sean Thomas McCarthy, Tao Chen
  • Publication number: 20240114153
    Abstract: A first image and a second image of different dynamic ranges are derived from the same source image. Based on a chroma sampling format of the first image, it is determined whether edge preserving filtering is to be used to generate chroma upsampled image data in a reconstructed image. If so, image metadata for performing the edge preserving filtering is generated. The first image, the second image and the image metadata are encoded into an image data container to enable a recipient device to generate the reconstructed image.
    Type: Application
    Filed: September 1, 2023
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Anustup Kumar Atanu CHOUDHURY, Guan-Ming SU
  • Publication number: 20240114307
    Abstract: There are provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
    Type: Application
    Filed: September 12, 2023
    Publication date: April 4, 2024
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Stefan BRUHN
  • Publication number: 20240114309
    Abstract: Some examples involve rendering received audio data by determining a first relative activation of a set of loudspeakers in an environment according to a first rendering configuration corresponding to a first set of speaker activations, receiving a first rendering transition indication indicating a transition from the first rendering configuration to a second rendering configuration and determining a second set of speaker activations corresponding to a simplified version of the second rendering configuration. Some examples involve performing a first transition from the first set of speaker activations to the second set of speaker activations, determining a third set of speaker activations corresponding to a complete version of the second rendering configuration and performing a second transition to the third set of speaker activations without requiring completion of the first transition.
    Type: Application
    Filed: December 2, 2021
    Publication date: April 4, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Joshua B. LANDO, Alan J. SEEFELDT
  • Publication number: 20240114306
    Abstract: A multi-input, multi-output audio process is implemented as a linear system for use in an audio filterbank to convert a set of frequency-domain input audio signals into a set of frequency-domain output audio signals. A transfer function from one input to one output is defined as a frequency dependent gain function. In some implementations, the transfer function includes a direct component that is substantially defined as a frequency dependent gain, and one or more decorrelated components that have frequency-varying group phase response. The transfer function is formed from a set of sub-band functions, with each sub-band function being formed from a set of corresponding component transfer functions including a direct component and one or more decorrelated components.
    Type: Application
    Filed: September 2, 2020
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: David S. MCGRATH
  • Publication number: 20240114127
    Abstract: Methods, systems, and devices implement intra-prediction for hexagonally-sampled compression and decompression of videos and images having a regular grid of hexagonally-shaped pixels. For encoding, a prediction unit (PU) shape is selected at a sequence level from the group consisting of parallelogram, zigzag-square, hexagonal super-pixel, a rectangular zigzag and an arrow, and the hexagonally-sampled image is divided into regions based on the PU shape. For each region: a prediction mode and a PU size are determined; reference pixels are determined for each predicted pixel in the PU shape based on the prediction mode; a weighted factor is determined for each of the reference pixels based on a distance between the reference pixel and the predicted pixel; and a predicted value of each of the predicted pixels in the PU shape is determined using the corresponding reference pixels and the weighted factors.
    Type: Application
    Filed: February 10, 2022
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Zhaobin ZHANG, Neeraj J. GADGIL, Guan-Ming SU
  • Publication number: 20240114308
    Abstract: Some methods involve receiving, by a control system that is configured for implementing a plurality of renderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each renderer and according to the listening configuration data, the received audio data to obtain a set of renderer-specific loudspeaker feed signals for a corresponding listening configuration. Each renderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a renderer-specific set of frequency bands and combining the renderer-specific frequency bands of each renderer to produce an output set of loudspeaker feed signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: April 4, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alan J. SEEFELDT, C. Phillip BROWN
  • Publication number: 20240107252
    Abstract: An attenuation or “gap” may be inserted into at least a first frequency range of at least first and second audio playback signals of a content stream during at least a first time interval to generate at least first and second modified audio playback signals. Corresponding audio device playback sound may be provided by at least first and second audio devices. At least one microphone may detect at least the first audio device playback sound and the second audio device playback sound and may generate corresponding microphone signals. Audio data may be extracted from the microphone signals in at least the first frequency range, to produce extracted audio data. A far-field audio environment impulse response and/or audio environment noise may be estimated based, at least in part, on the extracted audio data.
    Type: Application
    Filed: December 2, 2021
    Publication date: March 28, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Christopher Graham HINES, Benjamin John SOUTHWELL
  • Publication number: 20240103801
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether loudness parameters for a first group of audio playback devices are available in the bitstream. Responsive to determining that the loudness parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameters based on the one or more characteristics.
    Type: Application
    Filed: October 9, 2023
    Publication date: March 28, 2024
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Karl Jonas ROEDEN
  • Publication number: 20240105192
    Abstract: Embodiments are disclosed for spatial noise filling in multi-channel codecs. In an embodiment, a method of regenerating background noise ambience in a multi-channel codec by generating spatial hole filling noise comprises: computing noise estimates based on a primary downmix channel generated from an input audio signal representing a spatial audio scene with background noise ambience; computing spectral shaping filter coefficients based on the noise estimates; spectrally shaping the multi-channel noise signal using the spectral shaping filter coefficients and a noise distribution, the spectral shaping resulting in a diffused, multi-channel noise signal with uncorrelated channels; spatially shaping the diffused, uncorrelated multi-channel noise signal with uncorrelated channels based on a noise ambience of the spatial audio scene; and adding the spatially and spectrally shaped multi-channel noise to a multi-channel codec output to synthesize the background noise ambience of the spatial audio scene.
    Type: Application
    Filed: December 1, 2021
    Publication date: March 28, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Rishabh TYAGI, Michael ECKERT
  • Publication number: 20240105191
    Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particularly, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
    Type: Application
    Filed: November 29, 2023
    Publication date: March 28, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Per EKSTRAND, Lars VILLEMOES
  • Publication number: 20240105186
    Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.
    Type: Application
    Filed: October 16, 2023
    Publication date: March 28, 2024
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson, Jeroen Koppens, Rhonda J. Wilson, Heiko Purnhagen, Alexander Stahlmann
  • Publication number: 20240107255
    Abstract: Some methods involve receiving, by a control system configured for implementing a plurality of renderers, audio data and listening configuration data for a plurality of listening configurations, each listening configuration of the plurality of listening configurations corresponding to a listening position and a listening orientation in an audio environment, and rendering, by each renderer and according to the listening configuration data, the received audio data to obtain a set of renderer-specific loudspeaker feed signals for a corresponding listening configuration. Each renderer may be configured to render the audio data for a different listening configuration. Some such methods may involve decomposing each set of renderer-specific loudspeaker feed signals into a renderer-specific set of frequency bands and combining the renderer-specific frequency bands of each renderer to produce an output set of loudspeaker feed signals.
    Type: Application
    Filed: December 2, 2021
    Publication date: March 28, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alan J. SEEFELDT, C. Phillip BROWN
  • Publication number: 20240098438
    Abstract: The present invention is directed to methods and apparatus for translating a first plurality of audio input channels to a second plurality of audio output channels. This includes determining that there is pair-wise coding among any of the first plurality of audio input channels; determining an input/output-mapping matrix for mapping at least a first set of the first plurality of audio input channels to at least a second set of the second plurality of audio output channels; and deriving the second plurality of audio output channels based on the first plurality of audio input channels, the input/output-mapping matrix and the determined pair-wise coding. The first plurality of audio input channels represent the same soundfield represented by the second plurality of audio output channels.
    Type: Application
    Filed: September 25, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Mark F. DAVIS
  • Publication number: 20240098436
    Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.
    Type: Application
    Filed: November 9, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20240095893
    Abstract: A first reshaping mapping is performed on a first image represented in a first domain to generate a second image represented in a second domain. The first domain has a first dynamic range that differs from a second dynamic range of the second domain. A second reshaping mapping is performed on the second image represented in the second domain to generate a third image represented in the first domain. The third image is perceptually different from the first image in at least one of: global contrast, global saturation, local contrast, local saturation, etc. A display image is derived from the third image and rendered on a display device.
    Type: Application
    Filed: January 26, 2022
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming SU, Harshad KADU, Per Jonas Andreas KLITTMARK, Tao CHEN
  • Publication number: 20240098446
    Abstract: Images are acquired through image sensors operating in conjunction with a media consumption system. The acquired images are used to determine a user's movement in a plurality of degrees of freedom. Sound images depicted in spatial audio rendered by audio speakers operating in conjunction with the media consumption system are adapted based at least in part on the user's movement in the plurality of degrees of freedom.
    Type: Application
    Filed: November 27, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, William Anthony ROZZI
  • Publication number: 20240098229
    Abstract: Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller; a first modulator, receiving light from the light source and rendering a halftone image of the input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from an input image, correcting for misalignment between the first and second modulators and calibrating the projector system (e.g., over time) for continuous image improvement are also disclosed.
    Type: Application
    Filed: November 22, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jerome SHIELDS, Martin J. RICHARDS, Juan P. PERTIERRA
  • Publication number: 20240098435
    Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
    Type: Application
    Filed: September 18, 2023
    Publication date: March 21, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, David S. MCGRATH, Glenn N. DICKINS
  • Publication number: 20240098264
    Abstract: The present invention relates to a block partitioning structure in video coding technology, and a video encoding and decoding method and apparatus using the same, wherein the video encoding and decoding method includes the steps of: acquiring quad-partitioning information of a block; acquiring bi-partitioning information of the block when the acquired quad-partitioning information of the block does not indicate four partitions; acquiring partitioning direction information for bi-partitioning of the block when the acquired bi-partitioning information of the block indicates two partitions; acquiring information on whether to perform any other type of partitioning, when the acquired bi-partitioning information of the block does not indicate two partitions; and acquiring additional information required for the any other type of partitioning, when the acquired information on whether to perform any other type of partitioning indicates that the any other type of partitioning is performed.
    Type: Application
    Filed: November 29, 2023
    Publication date: March 21, 2024
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ho Chan RYU, Yong Jo AHN
  • Publication number: 20240098286
    Abstract: An electronic device for encoding a picture is described. The electronic device includes a processor and instructions stored in memory that are in electronic communication with the processor. The instructions are executable to encode a step-wise temporal sub-layer access (STSA) sample grouping. The instructions are further executable to send and/or store the STSA sample grouping.
    Type: Application
    Filed: November 21, 2023
    Publication date: March 21, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Sachin G. Deshpande
  • Publication number: 20240089474
    Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.
    Type: Application
    Filed: November 10, 2023
    Publication date: March 14, 2024
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
  • Publication number: 20240087590
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
    Type: Application
    Filed: November 14, 2023
    Publication date: March 14, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES, Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20240087584
    Abstract: An encoding system encodes an N-channel audio signal (X), wherein N≥3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters (C̃, P̃). In a decoding system, a decorrelating section outputs, based on the downmix signal, an (N−1)-channel decorrelated signal (Z); a dry upmix section maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section combines outputs from the upmix sections to obtain a reconstructed signal (X̂) corresponding to the signal to be reconstructed.
    Type: Application
    Filed: September 25, 2023
    Publication date: March 14, 2024
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heidi-Maria Lehtonen, Heiko Purnhagen, Toni Hirvonen