Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210006924
    Abstract: A method (900) for rendering audio in a virtual reality rendering environment (180) is described. The method (900) comprises rendering (901) an origin audio signal of an origin audio source (113) of an origin audio scene (111) from an origin source position on a sphere (114) around a listening position (201) of a listener (181). Furthermore, the method (900) comprises determining (902) that the listener (181) moves from the listening position (201) within the origin audio scene (111) to a listening position (202) within a different destination audio scene (112). In addition, the method (900) comprises applying (903) a fade-out gain to the origin audio signal to determine a modified origin audio signal, and rendering (903) the modified origin audio signal of the origin audio source (113) from the origin source position on the sphere (114) around the listening position (201, 202).
    Type: Application
    Filed: December 18, 2018
    Publication date: January 7, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leon Terentiv, Christof Fersch, Daniel Fischer
  • Publication number: 20210005211
    Abstract: A technique including receiving and decoding a coded bitstream encoded with audio content including first audio objects corresponding to a first media content type of two consecutive media content types and second audio objects corresponding to a second media content type of the two consecutive media content types, and audio metadata corresponding to the audio content. The audio metadata including first and second audio object gains, for the first and second audio objects, generated in part based on a first fading curve of the first media content type and a second fading curve of the second media content type, respectively. The technique further includes applying the first and second audio object gains to the first and second audio objects, and rendering a sound field represented by the first audio object with the applied first audio object gain and the second audio object with the applied second audio object gain.
    Type: Application
    Filed: July 1, 2020
    Publication date: January 7, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Alexander Stahlmann, Reinhold Boehm, Mark C. Leddy, Karsten Linzmeier, Vinay Mathew, Simon Plain, Heiko Purnhagen, Leif Sehlström, Robin Thesing
  • Publication number: 20210005212
    Abstract: The present disclosure relates to an apparatus for decoding an encoded Unified Audio and Speech stream. The apparatus comprises a core decoder for decoding the encoded Unified Audio and Speech stream. The core decoder includes a fast Fourier transform, FFT, module implementation based on a Cooley-Tuckey algorithm. The FFT module is configured to determine a discrete Fourier transform, DFT. Determining the DFT involves recursively breaking down the DFT into small FFTs based on the Cooley-Tucker algorithm and using radix-4 if a number of points of the FFT is a power of 4 and using mixed radix if the number is not a power of 4. Performing the small FFTs involves applying twiddle factors. Applying the twiddle factors involves referring to pre-computed values for the twiddle factors.
    Type: Application
    Filed: December 19, 2018
    Publication date: January 7, 2021
    Applicant: Dolby International AB
    Inventors: Rajat KUMAR, Ramesh KATURI, Saketh SATHUVALLI, Reshma RAI
  • Publication number: 20200410976
    Abstract: Computer-implemented methods for speech synthesis are provided. A speech synthesizer may be trained to generate synthesized audio data that corresponds to words uttered by a source speaker according to speech characteristics of a target speaker. The speech synthesizer may be trained by time-stamped phoneme sequences, pitch contour data and speaker identification data. The speech synthesizer may include a voice modeling neural network and a conditioning neural network.
    Type: Application
    Filed: February 14, 2019
    Publication date: December 31, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Cong ZHOU, Michael Getty HORGAN, Vivek KUMAR, Jaime H. MORALES, Cristina Michel VASCO
  • Publication number: 20200411017
    Abstract: The present disclosure provides methods, devices and computer program products for encoding and decoding of a vector of parameters in an audio coding system. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system. According to the disclosure, a modulo differential approach for coding and encoding a vector of a non-periodic quantity may improve the coding efficiency and provide encoders and decoders with less memory requirements. Moreover, an efficient method for encoding and decoding a sparse matrix is provided.
    Type: Application
    Filed: July 10, 2020
    Publication date: December 31, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leif Jonas SAMUELSSON, Heiko PURNHAGEN
  • Publication number: 20200411024
    Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.
    Type: Application
    Filed: July 17, 2020
    Publication date: December 31, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Publication number: 20200413099
    Abstract: Sequence-level parameters are generated for an image frame sequence including sequence-level indicators for indicating metadata types present for each image frame in the sequence of image frames. Frame-present parameters are generated for a specific image frame in the sequence including frame-present indicators corresponding to the metadata types as indicated in the sequence-level parameters. The frame-present indicators identify first metadata types for which metadata parameter values are to be encoded in a coded bitstream as metadata payloads. The image frame sequence, the sequence-level parameters, the frame-present parameters and the metadata payloads are encoded in the coded bitstream. A recipient device can generate, from the specific image frame based partly on the metadata parameter values determined for the first metadata types, a target display image for a target display.
    Type: Application
    Filed: September 21, 2018
    Publication date: December 31, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming SU, Neeraj J. GADGIL, Tao CHEN, Sheng QU
  • Publication number: 20200411019
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
    Type: Application
    Filed: January 28, 2019
    Publication date: December 31, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES, Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20200403593
    Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.
    Type: Application
    Filed: July 2, 2020
    Publication date: December 24, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
  • Publication number: 20200402518
    Abstract: Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components.
    Type: Application
    Filed: June 3, 2020
    Publication date: December 24, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
  • Publication number: 20200404336
    Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.
    Type: Application
    Filed: September 1, 2020
    Publication date: December 24, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
  • Publication number: 20200395023
    Abstract: The invention provides methods and devices for outputting a stereo audio signal having a left channel and a right channel. The apparatus includes a demultiplexer, decoder, and upmixer. The upmixer is configured operate either in a prediction mode or a non-prediction mode based on a parameter encoded in the audio bitstream.
    Type: Application
    Filed: July 16, 2020
    Publication date: December 17, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
  • Publication number: 20200395027
    Abstract: The disclosure relates to methods, apparatus and systems for side load processing of packetized media streams. In an embodiment, the apparatus comprises: a receiver for receiving a bitstream, and a splitter for identifying a packet type in the bitstream and splitting, based on the identification of a value of the packet type in the bit stream into a main stream and an auxiliary stream.
    Type: Application
    Filed: February 22, 2019
    Publication date: December 17, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Stephan Schreiner, Christof Fersch
  • Publication number: 20200395022
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Application
    Filed: July 1, 2020
    Publication date: December 17, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20200394974
    Abstract: Apparatus and methods are provided that employ one or more of a variety of techniques for reducing the time required to display high resolution images on a high dynamic range display having a light source layer and a display layer. In one technique, the image resolution is reduced, an effective luminance pattern is determined for the reduced resolution image, and the resolution of the effective luminance pattern is then increased to the resolution of the display layer. In another technique, the light source layer's point spread function is decomposed into a plurality of components, and an effective luminance pattern is determined for each component. The effective luminance patterns are then combined to produce a total effective luminance pattern. Additional image display time reduction techniques are provided.
    Type: Application
    Filed: January 20, 2020
    Publication date: December 17, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lorne A. WHITEHEAD, Helge SEETZEN, Gregory John WARD, Wolfgang HEIDRICH
  • Publication number: 20200396559
    Abstract: Embodiments are directed to upward-firing speakers that reflect sound off a ceiling to a listening location at a distance from a speaker. The reflected sound provides height cues to reproduce audio objects that have overhead audio components. A virtual height filter based on a directional hearing model is applied to the upward-firing driver signal to improve the perception of height for audio signals transmitted by the virtual height speaker to provide optimum reproduction of the overhead reflected sound. The upward firing driver is tilted at an inclination angle of approximately 20 degrees to the horizontal axis of the speaker and separate height and direct terminal connections are provided to interface to an adaptive audio rendering system.
    Type: Application
    Filed: July 28, 2020
    Publication date: December 17, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Brett G. CROCKETT, Alan J. SEEFELDT, C. Phillip BROWN
  • Publication number: 20200395025
    Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.
    Type: Application
    Filed: June 30, 2020
    Publication date: December 17, 2020
    Applicant: Dolby International AB
    Inventor: Lars Villemoes
  • Publication number: 20200395031
    Abstract: Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to the substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuating segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
    Type: Application
    Filed: June 3, 2020
    Publication date: December 17, 2020
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Per Hedelin, Arijit Biswas, Michael Schug, Vinay Melkote
  • Publication number: 20200395908
    Abstract: Configurable amplifier systems are described in which the power supply rail of a linear amplifier, e.g., a class A amplifier, is modulated by a switching amplifier, e.g., a class D amplifier, that may also be configured to operate independently of the linear amplifier. Techniques are also described by which the standing current of the output stage of a linear amplifier is modulated based on the input signal to the linear amplifier or based on modulation of the power supply rail of the linear amplifier.
    Type: Application
    Filed: December 19, 2018
    Publication date: December 17, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Kenneth SCHINDLER, Scott P. ROBINSON, Joel A. BUTLER
  • Publication number: 20200396555
    Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
    Type: Application
    Filed: June 22, 2020
    Publication date: December 17, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, David S. MCGRATH, Glenn N. DICKINS
  • Publication number: 20200389648
    Abstract: Methods, processes, and systems are presented for combining signal reshaping (also referred to as luma mapping chroma residuals scaling) with local illumination compensation (LIC) in video coding. Examples and trade-offs when the LIC model parameters are computed in the original domain, the reshaped domain, or a mixed domain, are presented.
    Type: Application
    Filed: June 4, 2020
    Publication date: December 10, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jay Nitin Shingala, Ashwin Natesan, Peng Yin
  • Publication number: 20200389815
    Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.
    Type: Application
    Filed: June 19, 2020
    Publication date: December 10, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20200388077
    Abstract: Spatial information that describes spatial locations of visual objects as in a three-dimensional (3D) image space as represented in one or more multi-view unlayered images is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.
    Type: Application
    Filed: April 10, 2018
    Publication date: December 10, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, Neil MAMMEN, Tyrome Y. BROWN
  • Publication number: 20200389718
    Abstract: Personal audio systems and methods are disclosed. A personal audio system includes a voice activity detector to determine whether or not an ambient audio stream contains voice activity, a pitch estimator to determine a frequency of a fundamental component of an annoyance noise contained in the ambient audio stream, and a filter bank to attenuate the fundamental component and at least one harmonic component of the annoyance noise to generate a personal audio stream. The filter bank implements a first filter function when the ambient audio stream does not contain voice activity, or a second filter function when the ambient audio stream contains voice activity.
    Type: Application
    Filed: January 6, 2020
    Publication date: December 10, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Gints KLIMANIS, Anthony PARKS, Richard Fritz LANMAN, III, Noah KRAFT, Matthew J. JAFFE, Jeffrey Ross BAKER
  • Publication number: 20200388300
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion add brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Application
    Filed: June 23, 2020
    Publication date: December 10, 2020
    Applicant: Dolby International AB
    Inventor: Lars VILLEMOES
  • Publication number: 20200389749
    Abstract: Embodiments of source separation for reverberant environment are disclosed. According to a method, first microphone signals for each individual one of at least one source are captured respectively by at least two microphones for a period during which only the individual one produces sounds. Mixing parameters for modeling acoustic paths between the at least one source and the at least two microphones are learned by a processor based on the first microphone signals. Second microphone signals are captured respectively by the at least two microphones for a period during which all of the at least one source produce sounds. The reconstruction model is estimated by the processor based on the mixing parameters and second microphone signals. The processor performs the source separation by applying the reconstruction model.
    Type: Application
    Filed: May 20, 2020
    Publication date: December 10, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Jun WANG
  • Publication number: 20200388294
    Abstract: The present invention relates to a new method and apparatus for improvement of High Frequency Reconstruction (HFR) techniques using frequency translation or folding or a combination thereof. The proposed invention is applicable to audio source coding systems, and offers significantly reduced computational complexity. This is accomplished by means of frequency translation or folding in the subband domain, preferably integrated with spectral envelope adjustment in the same domain. The concept of dissonance guard-band filtering is further presented. The proposed invention offers a low-complexity, intermediate quality HFR method useful in speech and natural audio coding applications.
    Type: Application
    Filed: June 23, 2020
    Publication date: December 10, 2020
    Applicant: Dolby International AB
    Inventors: Lars G. LILJERYD, Per EKSTRAND, Fredrik HENN, Kristofer KJOERLING
  • Publication number: 20200382892
    Abstract: Embodiments are described for a system of rendering object-based audio content through a system that includes individually addressable drivers, including at least one driver that is configured to project sound waves toward one or more surfaces within a listening environment for reflection to a listening area within the listening environment; a renderer configured to receive and process audio streams and one or more metadata sets associated with each of the audio streams and specifying a playback location of a respective audio stream; and a playback system coupled to the renderer and configured to render the audio streams to a plurality of audio feeds corresponding to the array of audio drivers in accordance with the one or more metadata sets.
    Type: Application
    Filed: August 24, 2020
    Publication date: December 3, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sripal S. Mehta, Brett G. Crockett, Spencer Hooks, Alan Seefeldt, Christophe Chabanne, C. Phillip Brown, Joshua B. Lando, Brad Basler, Stewart Murrie
  • Publication number: 20200380979
    Abstract: One or more context aware processing parameters and an ambient audio stream are received. One or more sound characteristics associated with the ambient audio stream are identified using a machine learning model. One or more actions to perform are determined using the machine learning model and based on the one or more context aware processing parameters and the identified one or more sound characteristics. The one or more actions are performed.
    Type: Application
    Filed: February 3, 2020
    Publication date: December 3, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jacob Meacham, Matthew Sills, Richard Fritz Lanman, III, Jeffrey Baker
  • Publication number: 20200380999
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.
    Type: Application
    Filed: June 14, 2020
    Publication date: December 3, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES
  • Publication number: 20200382802
    Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.
    Type: Application
    Filed: June 15, 2020
    Publication date: December 3, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
  • Publication number: 20200382889
    Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor g=1/?{square root over (L)}. The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.
    Type: Application
    Filed: June 16, 2020
    Publication date: December 3, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Florian Keiler, Johannes Boehm
  • Publication number: 20200381003
    Abstract: Methods (700, 800, 900), systems (200, 300, 400, 500, 600) and computer program products are provided. Location metadata (620) associated with an audio object is received (801). The location metadata defines a position of the audio object in an audio scene. It is estimated (630, 802), based on the location metadata, whether the audio object includes dialog. A value representative of a result of the estimation is assigned (803) to an object type parameter (231). In some example embodiments, audio objects are selected (661, 662, 804) based on values of their respective of object type parameters. In some example embodiments, at least one of the selected audio objects is submitted to dialog enhancement (690, 807).
    Type: Application
    Filed: July 26, 2018
    Publication date: December 3, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Mark William Gerrard
  • Publication number: 20200380997
    Abstract: The present disclosure relates to an apparatus for decoding an encoded Unified Audio and Speech stream. The apparatus comprises a core decoder for decoding the encoded Unified Audio and Speech stream. The core decoder includes an upmixing unit adapted to perform mono to stereo upmixing. The upmixing unit includes a decorrelator unit D adapted to apply a decorrelation filter to an input signal. The decorrelator unit is adapted to determine filter coefficients for the decorrelation filter by referring to pre-computed values. The present disclosure further relates to a an apparatus for encoding a Unified Audio and Speech stream, as well as to corresponding methods and storage media.
    Type: Application
    Filed: December 19, 2018
    Publication date: December 3, 2020
    Applicant: Dolby International AB
    Inventors: Rajat KUMAR, Ramesh KATURI, Saketh SATHUVALLI, Reshma RAI
  • Publication number: 20200366994
    Abstract: Embodiments are described for a method of simultaneously localizing a set of speakers and microphones, having only the times of arrival between each of the speakers and microphones. An autodiscovery process uses an external input to set: a global translation (3 continuous parameters), a global rotation (3 continuous parameters), and discrete symmetries, i.e., an exchange of any axis pairs and/or reversal of any axis. Different time of arrival acquisition techniques may be used, such as ultrasonic sweeps or generic multitrack audio content. The autodiscovery algorithm is based in minimizing a certain cost function, and the process allows for latencies in the recordings, possibly linked to the latencies in the emission.
    Type: Application
    Filed: August 6, 2020
    Publication date: November 19, 2020
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Daniel ARTEAGA, Giulio CENGARLE, David Matthew FISCHER, Antonio MATEOS SOLE, Davide SCAINI, Alan SEEFELDT
  • Publication number: 20200367003
    Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
    Type: Application
    Filed: August 6, 2020
    Publication date: November 19, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Louis D. Fielder, Zhiwei Shuang, Grant A. Davidson, Xiguang Zheng, Mark S. Vinton
  • Publication number: 20200364025
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.
    Type: Application
    Filed: June 1, 2020
    Publication date: November 19, 2020
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Karl Jonas ROEDEN
  • Publication number: 20200357422
    Abstract: Apparatus and methods for generating an encoded audio bitstream, including by including program loudness metadata and audio data in the bitstream, and optionally also program boundary metadata in at least one segment (e.g., frame) of the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, e.g., including by performing adaptive loudness processing of the audio data of an audio program indicated by the bitstream, or authentication and/or validation of metadata and/or audio data of such an audio program. Another aspect is an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.
    Type: Application
    Filed: May 28, 2020
    Publication date: November 12, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael GRANT, Scott Gregory NORCROSS, Jeffrey RIEDMILLER, Michael WARD
  • Publication number: 20200359152
    Abstract: Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.
    Type: Application
    Filed: May 26, 2020
    Publication date: November 12, 2020
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Michael Ward, Jeffrey Riedmiller, Scott Gregory Norcross, Alexander Stahlmann
  • Publication number: 20200359150
    Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.
    Type: Application
    Filed: April 23, 2020
    Publication date: November 12, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20200359154
    Abstract: A system and method of adjusting an audio output in one location so that its propagation into another location is reduced. As a first device in a first location generates sound, a second device in a second location detects the propagated sound. The first device then adjusts its output based on the detected sound.
    Type: Application
    Filed: January 8, 2019
    Publication date: November 12, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: C. Phillip BROWN, Michael J. SMITHERS, Remi S. AUDFRAY, Patrick David SAUNDERS
  • Publication number: 20200359026
    Abstract: The quantization parameter QP is well-known in digital video compression as an indication of picture quality. Digital symbols representing a moving image are quantized with a quantizing step that is a function QSN of the quantization parameter QP, which function QSN has been normalized to the most significant bit of the bit depth of the digital symbols. As a result, the effect of a given QP is essentially independent of bit depth a particular QP value has a standard effect on image quality, regardless of bit depth. The invention is useful, for example, in encoding and decoding at different bit depths, to generate compatible, bitstreams having different bit depths, and to allow different bit depths for different components of a video signal by compressing each with the same fidelity (i.e., the same QP).
    Type: Application
    Filed: July 27, 2020
    Publication date: November 12, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Walter C. GISH, Christopher J. VOGT
  • Publication number: 20200357420
    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
    Type: Application
    Filed: May 26, 2020
    Publication date: November 12, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Dirk Jeroen BREEBAART, David Matthew COOPER, Leif Jonas SAMUELSSON
  • Publication number: 20200351524
    Abstract: A standard dynamic range (SDR) image and a reference backward reshaping mapping are received. The reference backward reshaping mapping comprises a reference luma backward reshaping mapping. A color preservation mapping function is used with inputs generated from the SDR image and the reference backward reshaping mapping to determine luminance increase for SDR luma histogram bins generated based on luma codewords in the SDR image. A modified backward reshaping mapping is generated and comprises a modified luma backward reshaping mapping generated from the reference backward reshaping function based on the luminance increase for the SDR luma histogram bins. The SDR image and the modified backward reshaping mapping are encoded into an SDR video signal.
    Type: Application
    Filed: April 30, 2020
    Publication date: November 5, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Yoon Yung LEE, Neeraj J. GADGIL, Guan-Ming SU
  • Publication number: 20200351606
    Abstract: An apparatus and method of rendering audio. The method includes deriving filters by defining a binaural error, defining an activation penalty, and minimizing a cost function that is a combination of the binaural error and the activation penalty. In this manner, the listening experience is improved by reducing the signal level output by loudspeakers further from an audio objects desired position.
    Type: Application
    Filed: October 24, 2018
    Publication date: November 5, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Alan J. SEEFELDT
  • Publication number: 20200349911
    Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of ?f.
    Type: Application
    Filed: May 18, 2020
    Publication date: November 5, 2020
    Applicant: Dolby International AB
    Inventors: Per EKSTRAND, Lars VILLEMOES, Per HEDELIN
  • Publication number: 20200351465
    Abstract: An apparatus, method and system for connecting High-Definition Multimedia Interface (HDMI) devices. A loopback device connects between a first source device and a sink device on a first connection; a second source device connects to the sink device on a second connection. The loopback device manages the first connection, passes transition-minimized differential signaling (TMDS) or fixed-rate link (FRL) signals through to the sink device, and outputs audio received from the sink device on the audio return channel (ARC) or enhanced audio return channel (eARC). In this manner, audio that originates from any source device may be output without requiring a direct connection to the loopback device.
    Type: Application
    Filed: December 13, 2018
    Publication date: November 5, 2020
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Christian Wolff, David Matthew Fischer
  • Publication number: 20200344287
    Abstract: A service request for communication services for communication clients is received. In response, a communication service network is set up to support the communication services. Routing metadata is generated for each of the communication clients. The routing metadata is to be used by each of the communication clients for sharing service quality information with a respective peer communication client over a light-weight peer-to-peer (P2P) network. The routing metadata is downloaded to each of the communication clients. A communication client may exchange service signaling packets or service data packets over the communication service network. When the communication client determines that there is a problematic region in a bitstream received from the communication server, the communication client can request a peer communication client for a service quality information portion related to the problematic region.
    Type: Application
    Filed: July 13, 2020
    Publication date: October 29, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Shen HUANG, Doh-Suk KIM, Xuejing SUN
  • Publication number: 20200342884
    Abstract: A method for generating a bitstream indicative of an object based audio program is described. The bitstream comprises a sequence of containers. A first container of the sequence of containers comprises a plurality of substream entities for a plurality of substreams of the object based audio program and a presentation section. The method comprises determining a set of object channels. The method further comprises providing a set of object related metadata for the set of object channels. In addition, the method comprises inserting a first set of object channel frames and a first set of object related metadata frames into a respective set of substream entities of the first container. Furthermore, the method comprises inserting presentation data into the presentation section.
    Type: Application
    Filed: May 12, 2020
    Publication date: October 29, 2020
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Christof FERSCH, Alexander STAHLMANN
  • Publication number: 20200342883
    Abstract: Systems and methods for providing forward error correction for a multi-channel audio signal are described. Blocks of an audio stream are buffered into a frame. A transformation can be applied that compacts the energy of each block into a plurality of transformed channels. The energy compaction transform may compact the most energy of a block into the first transformed channel and to compact decreasing amounts of energy into each subsequent transformed channel. The transformed frame may be encoded using any suitable codec and transmitted in a packet over a network. Improved forward error correction may be provided by attaching a low bit rate encoding of the first transformed channel to a subsequent packet. To reconstruct a lost packet, the low bit rate encoding of the first channel for the lost packet may be combined with a packet loss concealment version of the other channels, constructed from a previously-received packet.
    Type: Application
    Filed: July 14, 2020
    Publication date: October 29, 2020
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Shen HUANG, Michael ECKERT, Glenn N. DICKINS