Patents Assigned to Dolby Laboratories Licensing Corporation
  • Publication number: 20250088816
    Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
    Type: Application
    Filed: November 26, 2024
    Publication date: March 13, 2025
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee
  • Publication number: 20250078858
    Abstract: Disclosed herein are methods, systems, and computer-program products for segmenting a binaural recording of speech into parts containing self-speech and parts containing external speech, and processing each category with different settings, to obtain an enhanced overall presentation. The segmentation is based on a combination of: i) feature-based frame-by-frame classification, and ii) detecting dissimilarity by statistical methods. The segmentation information is then used by a speech enhancement chain, where independent settings are used to process the self- and external speech parts.
    Type: Application
    Filed: January 12, 2022
    Publication date: March 6, 2025
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Giulio CENGARLE, Yuanxing MA
  • Publication number: 20250078849
    Abstract: Many portable playback devices cannot decode and play back encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.
    Type: Application
    Filed: November 18, 2024
    Publication date: March 6, 2025
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey RIEDMILLER, Harald MUNDT, Michael SCHUG, Martin WOLTERS
  • Publication number: 20250080937
    Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by rendering an audio sound field representation for arbitrary spatial loudspeaker setups and/or by a decoder that decodes based on a decode matrix (D). The decode matrix (D) is based on smoothing and scaling of a first decode matrix D̂ with smoothing coefficients. The first decode matrix D̂ is based on a mix matrix G and a mode matrix Ψ̃, where the mix matrix G was determined based on L speakers and positions of a spherical modelling grid related to a HOA order N, and the mode matrix Ψ̃ was determined based on the spherical modelling grid and the HOA order N.
    Type: Application
    Filed: September 18, 2024
    Publication date: March 6, 2025
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20250080749
    Abstract: Overlapped block disparity estimation and compensation is described. Compensating for images with overlapped block disparity compensation (OBDC) involves determining if OBDC is enabled in a video bit stream, and determining if OBDC is enabled for one or more macroblocks that neighbor a first macroblock within the video bit stream. The neighboring macroblocks may be transform coded. If OBDC is enabled in the video bit stream and for the one or more neighboring macroblocks, predictions may be made for a region of the first macroblock that has an edge adjacent with the neighboring macroblocks. OBDC can be causally applied. Disparity compensation parameters or modes may be shared amongst views or layers. A variety of predictions may be used with causally-applied OBDC.
    Type: Application
    Filed: November 20, 2024
    Publication date: March 6, 2025
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexandros Tourapis, Athanasios Leontaris
  • Publication number: 20250080943
    Abstract: Some embodiments are virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel, including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
    Type: Application
    Filed: September 9, 2024
    Publication date: March 6, 2025
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Kuan-Chieh YEN, Dirk Jeroen BREEBAART, Grant A. DAVIDSON, Rhonda WILSON, David M. COOPER, Zhiwei SHUANG
  • Publication number: 20250080934
    Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (X_PS(k−1)) and a frame of an ambient HOA component (C̃_AMB(k−1)). The ambient HOA component (C̃_AMB(k−1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (c_n(k−1)) in lower positions and second HOA coefficient sequences (c_AMB,n(k−1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
    Type: Application
    Filed: August 2, 2024
    Publication date: March 6, 2025
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
  • Patent number: 12245012
    Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.
    Type: Grant
    Filed: October 16, 2023
    Date of Patent: March 4, 2025
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Alexander Krueger, Sven Kordon, Johannes Boehm, Johann-Markus Batke
  • Patent number: 12245013
    Abstract: There are two representations for Higher Order Ambisonics denoted HOA: spatial domain and coefficient domain. The invention generates from a coefficient domain representation a mixed spatial/coefficient domain representation, wherein the number of said HOA signals can be variable. An aspect of the invention further relates to methods and apparatus decoding multiplexed and perceptually encoded HOA signals, including transforming a vector of PCM encoded spatial domain signals of the HOA representation to a corresponding vector of coefficient domain signals by multiplying the vector of PCM encoded spatial domain signals with a transform matrix and de-normalizing the vector of PCM encoded and normalized coefficient domain signals, wherein said de-normalizing comprises.
    Type: Grant
    Filed: November 22, 2023
    Date of Patent: March 4, 2025
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 12244859
    Abstract: The present invention provides a method and a device for deriving an inter-view motion merging candidate. A method for deriving an inter-view motion merging candidate, according to an embodiment of the present invention, can comprise the steps of: on the basis of encoding information of an inter-view reference block derived by means of a variation vector of a current block, determining whether or not inter-view motion merging of the current block is possible; and, if inter-view motion merging of the current block is not possible, generating an inter-view motion merging candidate of the current block by using encoding information of an adjacent block that is spatially adjacent to the inter-view reference block.
    Type: Grant
    Filed: May 17, 2023
    Date of Patent: March 4, 2025
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Gwang Hoon Park, Young Su Heo
  • Patent number: 12243548
    Abstract: Some noise compensation methods involve receiving microphone signals corresponding to ambient noise from a noise source location in or near an audio environment, determining or estimating a listener position in the audio environment and estimating at least one critical distance, which is a distance from the noise source location at which directly propagated sound pressure is equal to diffuse field sound pressure. Some examples involve estimating whether the listener position is within the at least one critical distance and implementing a noise compensation method for the ambient noise based, at least in part, on an estimate of whether the listener position is within the critical distance.
    Type: Grant
    Filed: December 8, 2020
    Date of Patent: March 4, 2025
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Benjamin Alexander Jancovich, Timothy Alan Port, Andrew P. Reilly, Richard J. Cartwright
  • Patent number: 12244867
    Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.
    Type: Grant
    Filed: November 10, 2023
    Date of Patent: March 4, 2025
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Fangjun Pu, Taoran Lu, Peng Yin, Sean Thomas McCarthy
  • Patent number: 12244872
    Abstract: An input image of a first bit depth in an input domain is received. Forward reshaping operations are performed on the input image to generate a forward reshaped image of a second bit depth in a reshaping domain. An image container containing image data derived from the forward reshaped image is encoded into an output video signal of the second bit depth.
    Type: Grant
    Filed: November 10, 2021
    Date of Patent: March 4, 2025
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Janos Horvath, Harshad Kadu, Guan-Ming Su
  • Patent number: 12245014
    Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions, and the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor g = 1/√L. The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.
    Type: Grant
    Filed: August 28, 2023
    Date of Patent: March 4, 2025
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Florian Keiler, Johannes Boehm
  • Patent number: 12244869
    Abstract: Given a sequence of images in a first codeword representation, methods, processes, and systems are presented for image reshaping using rate distortion optimization, wherein reshaping allows the images to be coded in a second codeword representation which allows more efficient compression than using the first codeword representation. Syntax methods for signaling reshaping parameters are also presented.
    Type: Grant
    Filed: March 29, 2022
    Date of Patent: March 4, 2025
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Peng Yin, Fangjun Pu, Taoran Lu, Tao Chen, Walter J. Husak, Sean Thomas McCarthy
  • Publication number: 20250071479
    Abstract: Disclosed are methods and systems which convert a multi-microphone input signal to a multichannel output signal making use of a time- and frequency-varying matrix. For each time and frequency tile, the matrix is derived as a function of a dominant direction of arrival and a steering strength parameter. Likewise, the dominant direction and steering strength parameter are derived from characteristics of the multi-microphone signals, where those characteristics include values representative of the inter-channel amplitude and group-delay differences.
    Type: Application
    Filed: September 5, 2024
    Publication date: February 27, 2025
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: David S. MCGRATH
  • Publication number: 20250070737
    Abstract: Example embodiments disclosed herein relate to audio signal loudness control. A method for controlling loudness of an audio signal is disclosed. The method includes, responsive to determining presence of a noise signal, deriving a target partial loudness adjustment based, at least in part, on at least one of a first factor related to the noise signal and a second factor related to the audio signal. The method further includes determining a target partial loudness of the audio signal based, at least in part, on the target partial loudness adjustment. A corresponding system, apparatus and computer program product are also disclosed.
    Type: Application
    Filed: September 5, 2024
    Publication date: February 27, 2025
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Guilin MA, Xiguang ZHENG
  • Publication number: 20250069607
    Abstract: Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.
    Type: Application
    Filed: May 24, 2024
    Publication date: February 27, 2025
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jeffrey RIEDMILLER, Regunathan RADHAKRISHNAN, Marvin PRIBADI, Farhad FARAHANI, Michael SMITHERS
  • Publication number: 20250069611
    Abstract: Embodiments are disclosed for channel-based audio (CBA) (e.g., 22.2-ch audio) to object-based audio (OBA) conversion. The conversion includes converting CBA metadata to object audio metadata (OAMD) and reordering the CBA channels based on channel shuffle information derived in accordance with channel ordering constraints of the OAMD. The OBA with reordered channels is rendered in a playback device using the OAMD or in a source device, such as a set-top box or audio/video recorder. In an embodiment, the CBA metadata includes signaling that indicates a specific OAMD representation to be used in the conversion of the metadata. In an embodiment, pre-computed OAMD is transmitted in a native audio bitstream (e.g., AAC) for transmission (e.g., over HDMI) or for rendering in a source device. In an embodiment, pre-computed OAMD is transmitted in a transport layer bitstream (e.g., ISO BMFF, MPEG4 audio bitstream) to a playback device or source device.
    Type: Application
    Filed: September 10, 2024
    Publication date: February 27, 2025
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Michael C. WARD, Freddie SANCHEZ, Christof FERSCH
  • Publication number: 20250069200
    Abstract: Novel methods and systems are described for providing interactive motion blur on an image by motion inputs from movements of the mobile device displaying the image. The device can process the motion blur by modules providing motion blur parameter estimation, blur application, and image composition based on metadata and a baseline image from the encoder. A pre-loaded filter bank can provide blur kernels for blur application.
    Type: Application
    Filed: December 7, 2022
    Publication date: February 27, 2025
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Dae Yeol Lee, Neeraj J. Gadgil, Guan-Ming Su
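
    Illustrative note on publication 20250069200: the abstract mentions a pre-loaded filter bank that provides blur kernels for blur application. The following is a minimal sketch of that general idea only, not the method claimed in the application. The line-shaped kernels, the bank sizes, the names `line_kernel`, `FILTER_BANK`, `select_key` and `apply_kernel`, and the assumption that device motion arrives as a (dx, dy) vector in pixels are all hypothetical.

```python
import math

def line_kernel(length, angle_deg):
    """Build a normalised length x length kernel that averages along a line."""
    kernel = [[0.0] * length for _ in range(length)]
    centre = (length - 1) / 2.0
    theta = math.radians(angle_deg)
    for i in range(length):
        t = i - centre
        x = int(round(centre + t * math.cos(theta)))
        y = int(round(centre - t * math.sin(theta)))  # image rows grow downwards
        kernel[y][x] = 1.0  # samples that land on the same cell simply collapse
    total = sum(sum(row) for row in kernel)
    return [[v / total for v in row] for row in kernel]

# Hypothetical pre-loaded bank: a few odd lengths and quantised angles.
LENGTHS = (3, 5, 7)
ANGLES = (0, 45, 90, 135)
FILTER_BANK = {(n, a): line_kernel(n, a) for n in LENGTHS for a in ANGLES}

def select_key(dx, dy):
    """Map a motion vector (assumed to be in pixels) to the nearest bank key."""
    magnitude = math.hypot(dx, dy)
    angle = math.degrees(math.atan2(dy, dx)) % 180.0  # blur direction is unsigned
    length = min(LENGTHS, key=lambda n: abs(n - magnitude))

    def angular_gap(a):
        d = abs(a - angle) % 180.0
        return min(d, 180.0 - d)

    return length, min(ANGLES, key=angular_gap)

def apply_kernel(image, kernel):
    """Correlate a 2D list image with the kernel, clamping at the edges."""
    h, w, n = len(image), len(image[0]), len(kernel)
    half = n // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for j in range(n):
                for i in range(n):
                    yy = min(max(y + j - half, 0), h - 1)
                    xx = min(max(x + i - half, 0), w - 1)
                    acc += kernel[j][i] * image[yy][xx]
            out[y][x] = acc
    return out
```

    Under these assumptions, a blurred frame would be obtained with `apply_kernel(image, FILTER_BANK[select_key(dx, dy)])`. Precomputing the kernels keeps the per-frame work to a lookup plus one filtering pass, which is one plausible reason to pre-load a bank on a mobile device.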