Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

METHOD AND DEVICE FOR PERSONALIZATION OF MEDIA DATA FOR PLAYBACK

Publication number: 20210014578

Abstract: Described herein is a method for processing of media data for playback, wherein the method includes the steps of: (a) fetching, by a web proxy, from two or more media servers media data and a media manifest file including metadata information relating to the fetched media data, and merging, by the web proxy, the media data; (b) modifying, by said web proxy, the content of the media manifest file and/or the content of the media data; and (c) providing, by said web proxy, the media manifest file and the media data as modified in step (b) to a media retrieval element for receiving and processing the media manifest file and the media data for decoding or playback, wherein a localhost address is assigned to the web proxy and the web proxy acts as a server for said media retrieval element. Described are further a respective device and computer program product.

Type: Application

Filed: July 3, 2020

Publication date: January 14, 2021

Applicant: Dolby International AB

Inventors: Wolfgang A. Schildbach, Christof Fersch, Holger Hoerich
METHOD AND SYSTEM FOR HANDLING GLOBAL TRANSITIONS BETWEEN LISTENING POSITIONS IN A VIRTUAL REALITY ENVIRONMENT

Publication number: 20210006924

Abstract: A method (900) for rendering audio in a virtual reality rendering environment (180) is described. The method (900) comprises rendering (901) an origin audio signal of an origin audio source (113) of an origin audio scene (111) from an origin source position on a sphere (114) around a listening position (201) of a listener (181). Furthermore, the method (900) comprises determining (902) that the listener (181) moves from the listening position (201) within the origin audio scene (111) to a listening position (202) within a different destination audio scene (112). In addition, the method (900) comprises applying (903) a fade-out gain to the origin audio signal to determine a modified origin audio signal, and rendering (903) the modified origin audio signal of the origin audio source (113) from the origin source position on the sphere (114) around the listening position (201, 202).

Type: Application

Filed: December 18, 2018

Publication date: January 7, 2021

Applicant: DOLBY INTERNATIONAL AB

Inventors: Leon Terentiv, Christof Fersch, Daniel Fischer
USING METADATA TO AGGREGATE SIGNAL PROCESSING OPERATIONS

Publication number: 20210005211

Abstract: A technique including receiving and decoding a coded bitstream encoded with audio content including first audio objects corresponding to a first media content type of two consecutive media content types and second audio objects corresponding to a second media content type of the two consecutive media content types, and audio metadata corresponding to the audio content. The audio metadata including first and second audio object gains, for the first and second audio objects, generated in part based on a first fading curve of the first media content type and a second fading curve of the second media content type, respectively. The technique further includes applying the first and second audio object gains to the first and second audio objects, and rendering a sound field represented by the first audio object with the applied first audio object gain and the second audio object with the applied second audio object gain.

Type: Application

Filed: July 1, 2020

Publication date: January 7, 2021

Applicant: DOLBY INTERNATIONAL AB

Inventors: Alexander Stahlmann, Reinhold Boehm, Mark C. Leddy, Karsten Linzmeier, Vinay Mathew, Simon Plain, Heiko Purnhagen, Leif Sehlström, Robin Thesing
METHODS AND APPARATUS SYSTEMS FOR UNIFIED SPEECH AND AUDIO DECODING IMPROVEMENTS

Publication number: 20210005212

Abstract: The present disclosure relates to an apparatus for decoding an encoded Unified Audio and Speech stream. The apparatus comprises a core decoder for decoding the encoded Unified Audio and Speech stream. The core decoder includes a fast Fourier transform, FFT, module implementation based on a Cooley-Tuckey algorithm. The FFT module is configured to determine a discrete Fourier transform, DFT. Determining the DFT involves recursively breaking down the DFT into small FFTs based on the Cooley-Tucker algorithm and using radix-4 if a number of points of the FFT is a power of 4 and using mixed radix if the number is not a power of 4. Performing the small FFTs involves applying twiddle factors. Applying the twiddle factors involves referring to pre-computed values for the twiddle factors.

Type: Application

Filed: December 19, 2018

Publication date: January 7, 2021

Applicant: Dolby International AB

Inventors: Rajat KUMAR, Ramesh KATURI, Saketh SATHUVALLI, Reshma RAI
SPEECH STYLE TRANSFER

Publication number: 20200410976

Abstract: Computer-implemented methods for speech synthesis are provided. A speech synthesizer may be trained to generate synthesized audio data that corresponds to words uttered by a source speaker according to speech characteristics of a target speaker. The speech synthesizer may be trained by time-stamped phoneme sequences, pitch contour data and speaker identification data. The speech synthesizer may include a voice modeling neural network and a conditioning neural network.

Type: Application

Filed: February 14, 2019

Publication date: December 31, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Cong ZHOU, Michael Getty HORGAN, Vivek KUMAR, Jaime H. MORALES, Cristina Michel VASCO
AUDIO ENCODER AND DECODER

Publication number: 20200411017

Abstract: The present disclosure provides methods, devices and computer program products for encoding and decoding of a vector of parameters in an audio coding system. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system. According to the disclosure, a modulo differential approach for coding and encoding a vector of a non-periodic quantity may improve the coding efficiency and provide encoders and decoders with less memory requirements. Moreover, an efficient method for encoding and decoding a sparse matrix is provided.

Type: Application

Filed: July 10, 2020

Publication date: December 31, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Leif Jonas SAMUELSSON, Heiko PURNHAGEN
DECODING AUDIO BITSTREAMS WITH ENHANCED SPECTRAL BAND REPLICATION METADATA IN AT LEAST ONE FILL ELEMENT

Publication number: 20200411024

Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.

Type: Application

Filed: July 17, 2020

Publication date: December 31, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
BACKWARD COMPATIBLE DISPLAY MANAGEMENT METADATA COMPRESSION

Publication number: 20200413099

Abstract: Sequence-level parameters are generated for an image frame sequence including sequence-level indicators for indicating metadata types present for each image frame in the sequence of image frames. Frame-present parameters are generated for a specific image frame in the sequence including frame-present indicators corresponding to the metadata types as indicated in the sequence-level parameters. The frame-present indicators identify first metadata types for which metadata parameter values are to be encoded in a coded bitstream as metadata payloads. The image frame sequence, the sequence-level parameters, the frame-present parameters and the metadata payloads are encoded in the coded bitstream. A recipient device can generate, from the specific image frame based partly on the metadata parameter values determined for the first metadata types, a target display image for a target display.

Type: Application

Filed: September 21, 2018

Publication date: December 31, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Guan-Ming SU, Neeraj J. GADGIL, Tao CHEN, Sheng QU
BACKWARD-COMPATIBLE INTEGRATION OF HIGH FREQUENCY RECONSTRUCTION TECHNIQUES FOR AUDIO SIGNALS

Publication number: 20200411019

Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.

Type: Application

Filed: January 28, 2019

Publication date: December 31, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Kristofer KJOERLING, Lars VILLEMOES, Heiko PURNHAGEN, Per EKSTRAND
VOLUME LEVELER CONTROLLER AND CONTROLLING METHOD

Publication number: 20200403593

Abstract: Volume leveler controller and controlling method are disclosed. In one embodiment, A volume leveler controller includes an audio content classifier for identifying the content type of an audio signal in real time; and an adjusting unit for adjusting a volume leveler in a continuous manner based on the content type as identified. The adjusting unit may configured to positively correlate the dynamic gain of the volume leveler with informative content types of the audio signal, and negatively correlate the dynamic gain of the volume leveler with interfering content types of the audio signal.

Type: Application

Filed: July 2, 2020

Publication date: December 24, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jun WANG, Lie LU, Alan J. SEEFELDT
METHODS AND APPARATUS FOR DECODING A COMPRESSED HOA SIGNAL

Publication number: 20200402518

Abstract: Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bit stream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components.

Type: Application

Filed: June 3, 2020

Publication date: December 24, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
Source Color Volume Information Messaging

Publication number: 20200404336

Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.

Type: Application

Filed: September 1, 2020

Publication date: December 24, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK
AUDIO UPMIXER OPERABLE IN PREDICTION OR NON-PREDICTION MODE

Publication number: 20200395023

Abstract: The invention provides methods and devices for outputting a stereo audio signal having a left channel and a right channel. The apparatus includes a demultiplexer, decoder, and upmixer. The upmixer is configured operate either in a prediction mode or a non-prediction mode based on a parameter encoded in the audio bitstream.

Type: Application

Filed: July 16, 2020

Publication date: December 17, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
METHOD AND APPARATUS FOR PROCESSING OF AUXILIARY MEDIA STREAMS EMBEDDED IN A MPEGH 3D AUDIO STREAM

Publication number: 20200395027

Abstract: The disclosure relates to methods, apparatus and systems for side load processing of packetized media streams. In an embodiment, the apparatus comprises: a receiver for receiving a bitstream, and a splitter for identifying a packet type in the bitstream and splitting, based on the identification of a value of the packet type in the bit stream into a main stream and an auxiliary stream.

Type: Application

Filed: February 22, 2019

Publication date: December 17, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Stephan Schreiner, Christof Fersch
LAYERED CODING FOR COMPRESSED SOUND OR SOUND FIELD REPRESENTENTATIONS

Publication number: 20200395022

Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.

Type: Application

Filed: July 1, 2020

Publication date: December 17, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Sven KORDON, Alexander KRUEGER
Rapid Estimation of Effective Illuminance Patterns for Projected Light Fields

Publication number: 20200394974

Abstract: Apparatus and methods are provided that employ one or more of a variety of techniques for reducing the time required to display high resolution images on a high dynamic range display having a light source layer and a display layer. In one technique, the image resolution is reduced, an effective luminance pattern is determined for the reduced resolution image, and the resolution of the effective luminance pattern is then increased to the resolution of the display layer. In another technique, the light source layer's point spread function is decomposed into a plurality of components, and an effective luminance pattern is determined for each component. The effective luminance patterns are then combined to produce a total effective luminance pattern. Additional image display time reduction techniques are provided.

Type: Application

Filed: January 20, 2020

Publication date: December 17, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Lorne A. WHITEHEAD, Helge SEETZEN, Gregory John WARD, Wolfgang HEIDRICH
AUDIO SPEAKERS HAVING UPWARD FIRING DRIVERS FOR REFLECTED SOUND RENDERING

Publication number: 20200396559

Abstract: Embodiments are directed to upward-firing speakers that reflect sound off a ceiling to a listening location at a distance from a speaker. The reflected sound provides height cues to reproduce audio objects that have overhead audio components. A virtual height filter based on a directional hearing model is applied to the upward-firing driver signal to improve the perception of height for audio signals transmitted by the virtual height speaker to provide optimum reproduction of the overhead reflected sound. The upward firing driver is tilted at an inclination angle of approximately 20 degrees to the horizontal axis of the speaker and separate height and direct terminal connections are provided to interface to an adaptive audio rendering system.

Type: Application

Filed: July 28, 2020

Publication date: December 17, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Brett G. CROCKETT, Alan J. SEEFELDT, C. Phillip BROWN
CROSS PRODUCT ENHANCED SUBBAND BLOCK BASED HARMONIC TRANSPOSITION

Publication number: 20200395025

Abstract: The invention provides an efficient implementation of cross-product enhanced high-frequency reconstruction (HFR), wherein a new component at frequency Q?+r?0 is generated on the basis of existing components at ? and ?+?0. The invention provides a block-based harmonic transposition, wherein a time block of complex subband samples is processed with a common phase modification. Superposition of several modified samples has the net effect of limiting undesirable intermodulation products, thereby enabling a coarser frequency resolution and/or lower degree of oversampling to be used. In one embodiment, the invention further includes a window function suitable for use with block-based cross-product enhanced HFR. A hardware embodiment of the invention may include an analysis filter bank, a subband processing unit configurable by control data and a synthesis filter bank.

Type: Application

Filed: June 30, 2020

Publication date: December 17, 2020

Applicant: Dolby International AB

Inventor: Lars Villemoes
COMPANDING SYSTEM AND METHOD TO REDUCE QUANTIZATION NOISE USING ADVANCED SPECTRAL EXTENSION

Publication number: 20200395031

Abstract: Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to the substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuating segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.

Type: Application

Filed: June 3, 2020

Publication date: December 17, 2020

Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Per Hedelin, Arijit Biswas, Michael Schug, Vinay Melkote
CONFIGURABLE MODAL AMPLIFIER SYSTEM

Publication number: 20200395908

Abstract: Configurable amplifier systems are described in which the power supply rail of a linear amplifier, e.g., a class A amplifier, is modulated by a switching amplifier, e.g., a class D amplifier, that may also be configured to operate independently of the linear amplifier. Techniques are also described by which the standing current of the output stage of a linear amplifier is modulated based on the input signal to the linear amplifier or based on modulation of the power supply rail of the linear amplifier.

Type: Application

Filed: December 19, 2018

Publication date: December 17, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Kenneth SCHINDLER, Scott P. ROBINSON, Joel A. BUTLER
METHOD OF RENDERING ONE OR MORE CAPTURED AUDIO SOUNDFIELDS TO A LISTENER

Publication number: 20200396555

Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.

Type: Application

Filed: June 22, 2020

Publication date: December 17, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Richard J. CARTWRIGHT, David S. MCGRATH, Glenn N. DICKINS
IN-LOOP RESHAPING WITH LOCAL ILLUMINATION COMPENSATION IN IMAGE CODING

Publication number: 20200389648

Abstract: Methods, processes, and systems are presented for combining signal reshaping (also referred to as luma mapping chroma residuals scaling) with local illumination compensation (LIC) in video coding. Examples and trade-offs when the LIC model parameters are computed in the original domain, the reshaped domain, or a mixed domain, are presented.

Type: Application

Filed: June 4, 2020

Publication date: December 10, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jay Nitin Shingala, Ashwin Natesan, Peng Yin
METHOD AND APPARATUS FOR AUDIO DECODING BASED ON DEQUANTIZATION OF QUANTIZED PARAMETERS

Publication number: 20200389815

Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.

Type: Application

Filed: June 19, 2020

Publication date: December 10, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Heiko PURNHAGEN, Per EKSTRAND
LAYERED AUGMENTED ENTERTAINMENT EXPERIENCES

Publication number: 20200388077

Abstract: Spatial information that describes spatial locations of visual objects as in a three-dimensional (3D) image space as represented in one or more multi-view unlayered images is accessed. Based on the spatial information, a cinema image layer and one or more device image layers are generated from the one or more multi-view unlayered images. A multi-layer multi-view video signal comprising the cinema image layer and the device image layers is sent to downstream devices for rendering.

Type: Application

Filed: April 10, 2018

Publication date: December 10, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Ajit NINAN, Neil MAMMEN, Tyrome Y. BROWN
Annoyance Noise Suppression

Publication number: 20200389718

Abstract: Personal audio systems and methods are disclosed. A personal audio system includes a voice activity detector to determine whether or not an ambient audio stream contains voice activity, a pitch estimator to determine a frequency of a fundamental component of an annoyance noise contained in the ambient audio stream, and a filter bank to attenuate the fundamental component and at least one harmonic component of the annoyance noise to generate a personal audio stream. The filter bank implements a first filter function when the ambient audio stream does not contain voice activity, or a second filter function when the ambient audio stream contains voice activity.

Type: Application

Filed: January 6, 2020

Publication date: December 10, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Gints KLIMANIS, Anthony PARKS, Richard Fritz LANMAN, III, Noah KRAFT, Matthew J. JAFFE, Jeffrey Ross BAKER
Subband Block Based Harmonic Transposition

Publication number: 20200388300

Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion add brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.

Type: Application

Filed: June 23, 2020

Publication date: December 10, 2020

Applicant: Dolby International AB

Inventor: Lars VILLEMOES
SOURCE SEPARATION FOR REVERBERANT ENVIRONMENT

Publication number: 20200389749

Abstract: Embodiments of source separation for reverberant environment are disclosed. According to a method, first microphone signals for each individual one of at least one source are captured respectively by at least two microphones for a period during which only the individual one produces sounds. Mixing parameters for modeling acoustic paths between the at least one source and the at least two microphones are learned by a processor based on the first microphone signals. Second microphone signals are captured respectively by the at least two microphones for a period during which all of the at least one source produce sounds. The reconstruction model is estimated by the processor based on the mixing parameters and second microphone signals. The processor performs the source separation by applying the reconstruction model.

Type: Application

Filed: May 20, 2020

Publication date: December 10, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Jun WANG
Spectral Translation/Folding in the Subband Domain

Publication number: 20200388294

Abstract: The present invention relates to a new method and apparatus for improvement of High Frequency Reconstruction (HFR) techniques using frequency translation or folding or a combination thereof. The proposed invention is applicable to audio source coding systems, and offers significantly reduced computational complexity. This is accomplished by means of frequency translation or folding in the subband domain, preferably integrated with spectral envelope adjustment in the same domain. The concept of dissonance guard-band filtering is further presented. The proposed invention offers a low-complexity, intermediate quality HFR method useful in speech and natural audio coding applications.

Type: Application

Filed: June 23, 2020

Publication date: December 10, 2020

Applicant: Dolby International AB

Inventors: Lars G. LILJERYD, Per EKSTRAND, Fredrik HENN, Kristofer KJOERLING
SYSTEM FOR RENDERING AND PLAYBACK OF OBJECT BASED AUDIO IN VARIOUS LISTENING ENVIRONMENTS

Publication number: 20200382892

Abstract: Embodiments are described for a system of rendering object-based audio content through a system that includes individually addressable drivers, including at least one driver that is configured to project sound waves toward one or more surfaces within a listening environment for reflection to a listening area within the listening environment; a renderer configured to receive and process audio streams and one or more metadata sets associated with each of the audio streams and specifying a playback location of a respective audio stream; and a playback system coupled to the renderer and configured to render the audio streams to a plurality of audio feeds corresponding to the array of audio drivers in accordance with the one or more metadata sets.

Type: Application

Filed: August 24, 2020

Publication date: December 3, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Sripal S. Mehta, Brett G. Crockett, Spencer Hooks, Alan Seefeldt, Christophe Chabanne, C. Phillip Brown, Joshua B. Lando, Brad Basler, Stewart Murrie
CONTEXT AWARE HEARING OPTIMIZATION ENGINE

Publication number: 20200380979

Abstract: One or more context aware processing parameters and an ambient audio stream are received. One or more sound characteristics associated with the ambient audio stream are identified using a machine learning model. One or more actions to perform are determined using the machine learning model and based on the one or more context aware processing parameters and the identified one or more sound characteristics. The one or more actions are performed.

Type: Application

Filed: February 3, 2020

Publication date: December 3, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Jacob Meacham, Matthew Sills, Richard Fritz Lanman, III, Jeffrey Baker
METHOD FOR REDUCTION OF ALIASING INTRODUCED BY SPECTRAL ENVELOPE ADJUSTMENT IN REAL-VALUED FILTERBANKS

Publication number: 20200380999

Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain-calculation, gives an improved energy estimate of the real valued subband signals in the filterbank.

Type: Application

Filed: June 14, 2020

Publication date: December 3, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Kristofer KJOERLING, Lars VILLEMOES
FRAME-RATE SCALABLE VIDEO CODING

Publication number: 20200382802

Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.

Type: Application

Filed: June 15, 2020

Publication date: December 3, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
METHOD FOR AND APPARATUS FOR DECODING/RENDERING AN AMBISONICS AUDIO SOUNDFIELD REPRESENTATION FOR AUDIO PLAYBACK USING 2D SETUPS

Publication number: 20200382889

Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor g=1/?{square root over (L)}. The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.

Type: Application

Filed: June 16, 2020

Publication date: December 3, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Florian Keiler, Johannes Boehm
AUDIO OBJECT CLASSIFICATION BASED ON LOCATION METADATA

Publication number: 20200381003

Abstract: Methods (700, 800, 900), systems (200, 300, 400, 500, 600) and computer program products are provided. Location metadata (620) associated with an audio object is received (801). The location metadata defines a position of the audio object in an audio scene. It is estimated (630, 802), based on the location metadata, whether the audio object includes dialog. A value representative of a result of the estimation is assigned (803) to an object type parameter (231). In some example embodiments, audio objects are selected (661, 662, 804) based on values of their respective of object type parameters. In some example embodiments, at least one of the selected audio objects is submitted to dialog enhancement (690, 807).

Type: Application

Filed: July 26, 2018

Publication date: December 3, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Mark William Gerrard
METHODS, APPARATUS AND SYSTEMS FOR UNIFIED SPEECH AND AUDIO DECODING AND ENCODING DECORRELATION FILTER IMPROVEMENTS

Publication number: 20200380997

Abstract: The present disclosure relates to an apparatus for decoding an encoded Unified Audio and Speech stream. The apparatus comprises a core decoder for decoding the encoded Unified Audio and Speech stream. The core decoder includes an upmixing unit adapted to perform mono to stereo upmixing. The upmixing unit includes a decorrelator unit D adapted to apply a decorrelation filter to an input signal. The decorrelator unit is adapted to determine filter coefficients for the decorrelation filter by referring to pre-computed values. The present disclosure further relates to a an apparatus for encoding a Unified Audio and Speech stream, as well as to corresponding methods and storage media.

Type: Application

Filed: December 19, 2018

Publication date: December 3, 2020

Applicant: Dolby International AB

Inventors: Rajat KUMAR, Ramesh KATURI, Saketh SATHUVALLI, Reshma RAI
AUTOMATIC DISCOVERY AND LOCALIZATION OF SPEAKER LOCATIONS IN SURROUND SOUND SYSTEMS

Publication number: 20200366994

Abstract: Embodiments are described for a method of simultaneously localizing a set of speakers and microphones, having only the times of arrival between each of the speakers and microphones. An autodiscovery process uses an external input to set: a global translation (3 continuous parameters), a global rotation (3 continuous parameters), and discrete symmetries, i.e., an exchange of any axis pairs and/or reversal of any axis. Different time of arrival acquisition techniques may be used, such as ultrasonic sweeps or generic multitrack audio content. The autodiscovery algorithm is based in minimizing a certain cost function, and the process allows for latencies in the recordings, possibly linked to the latencies in the emission.

Type: Application

Filed: August 6, 2020

Publication date: November 19, 2020

Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB

Inventors: Daniel ARTEAGA, Giulio CENGARLE, David Matthew FISCHER, Antonio MATEOS SOLE, Davide SCAINI, Alan SEEFELDT
REVERBERATION GENERATION FOR HEADPHONE VIRTUALIZATION

Publication number: 20200367003

Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.

Type: Application

Filed: August 6, 2020

Publication date: November 19, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Louis D. Fielder, Zhiwei Shuang, Grant A. Davidson, Xiguang Zheng, Mark S. Vinton
SYSTEM AND METHOD FOR OPTIMIZING LOUDNESS AND DYNAMIC RANGE ACROSS DIFFERENT PLAYBACK DEVICES

Publication number: 20200364025

Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.

Type: Application

Filed: June 1, 2020

Publication date: November 19, 2020

Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Karl Jonas ROEDEN
DECODING OF ENCODED AUDIO BITSTREAM WITH METADATA CONTAINER LOCATED IN RESERVED DATA SPACE

Publication number: 20200357422

Abstract: Apparatus and methods for generating an encoded audio bitstream, including by including program loudness metadata and audio data in the bitstream, and optionally also program boundary metadata in at least one segment (e.g., frame) of the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, e.g., including by performing adaptive loudness processing of the audio data of an audio program indicated by the bitstream, or authentication and/or validation of metadata and/or audio data of such an audio program. Another aspect is an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.

Type: Application

Filed: May 28, 2020

Publication date: November 12, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Michael GRANT, Scott Gregory NORCROSS, Jeffrey RIEDMILLER, Michael WARD
LOUDNESS ADJUSTMENT FOR DOWNMIXED AUDIO CONTENT

Publication number: 20200359152

Abstract: Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.

Type: Application

Filed: May 26, 2020

Publication date: November 12, 2020

Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Michael Ward, Jeffrey Riedmiller, Scott Gregory Norcross, Alexander Stahlmann
METHOD AND DEVICE FOR APPLYING DYNAMIC RANGE COMPRESSION TO A HIGHER ORDER AMBISONICS SIGNAL

Publication number: 20200359150

Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.

Type: Application

Filed: April 23, 2020

Publication date: November 12, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Johannes BOEHM, Florian KEILER
REDUCING UNWANTED SOUND TRANSMISSION

Publication number: 20200359154

Abstract: A system and method of adjusting an audio output in one location so that its propagation into another location is reduced. As a first device in a first location generates sound, a second device in a second location detects the propagated sound. The first device then adjusts its output based on the detected sound.

Type: Application

Filed: January 8, 2019

Publication date: November 12, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: C. Phillip BROWN, Michael J. SMITHERS, Remi S. AUDFRAY, Patrick David SAUNDERS
Quantization Control for Variable Bit Depth

Publication number: 20200359026

Abstract: The quantization parameter QP is well-known in digital video compression as an indication of picture quality. Digital symbols representing a moving image are quantized with a quantizing step that is a function QSN of the quantization parameter QP, which function QSN has been normalized to the most significant bit of the bit depth of the digital symbols. As a result, the effect of a given QP is essentially independent of bit depth a particular QP value has a standard effect on image quality, regardless of bit depth. The invention is useful, for example, in encoding and decoding at different bit depths, to generate compatible, bitstreams having different bit depths, and to allow different bit depths for different components of a video signal by compressing each with the same fidelity (i.e., the same QP).

Type: Application

Filed: July 27, 2020

Publication date: November 12, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Walter C. GISH, Christopher J. VOGT
AUDIO DECODER AND DECODING METHOD

Publication number: 20200357420

Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.

Type: Application

Filed: May 26, 2020

Publication date: November 12, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Dirk Jeroen BREEBAART, David Matthew COOPER, Leif Jonas SAMUELSSON
COLOR APPEARANCE PRESERVATION IN VIDEO CODECS

Publication number: 20200351524

Abstract: A standard dynamic range (SDR) image and a reference backward reshaping mapping are received. The reference backward reshaping mapping comprises a reference luma backward reshaping mapping. A color preservation mapping function is used with inputs generated from the SDR image and the reference backward reshaping mapping to determine luminance increase for SDR luma histogram bins generated based on luma codewords in the SDR image. A modified backward reshaping mapping is generated and comprises a modified luma backward reshaping mapping generated from the reference backward reshaping function based on the luminance increase for the SDR luma histogram bins. The SDR image and the modified backward reshaping mapping are encoded into an SDR video signal.

Type: Application

Filed: April 30, 2020

Publication date: November 5, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Yoon Yung LEE, Neeraj J. GADGIL, Guan-Ming SU
VIRTUAL RENDERING OF OBJECT BASED AUDIO OVER AN ARBITRARY SET OF LOUDSPEAKERS

Publication number: 20200351606

Abstract: An apparatus and method of rendering audio. The method includes deriving filters by defining a binaural error, defining an activation penalty, and minimizing a cost function that is a combination of the binaural error and the activation penalty. In this manner, the listening experience is improved by reducing the signal level output by loudspeakers further from an audio objects desired position.

Type: Application

Filed: October 24, 2018

Publication date: November 5, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Alan J. SEEFELDT
Efficient Combined Harmonic Transposition

Publication number: 20200349911

Abstract: The present document relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), and to digital effect processors, e.g. so-called exciters, where generation of harmonic distortion adds brightness to the processed signal. In particular, a system configured to generate a high frequency component of a signal from a low frequency component of the signal is described. The system may comprise an analysis filter bank (501) configured to provide a set of analysis subband signals from the low frequency component of the signal; wherein the set of analysis subband signals comprises at least two analysis subband signals; wherein the analysis filter bank (501) has a frequency resolution of ?f.

Type: Application

Filed: May 18, 2020

Publication date: November 5, 2020

Applicant: Dolby International AB

Inventors: Per EKSTRAND, Lars VILLEMOES, Per HEDELIN
AUDIO DEVICE FOR HDMI

Publication number: 20200351465

Abstract: An apparatus, method and system for connecting High-Definition Multimedia Interface (HDMI) devices. A loopback device connects between a first source device and a sink device on a first connection; a second source device connects to the sink device on a second connection. The loopback device manages the first connection, passes transition-minimized differential signaling (TMDS) or fixed-rate link (FRL) signals through to the sink device, and outputs audio received from the sink device on the audio return channel (ARC) or enhanced audio return channel (eARC). In this manner, audio that originates from any source device may be output without requiring a direct connection to the loopback device.

Type: Application

Filed: December 13, 2018

Publication date: November 5, 2020

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventors: Christian Wolff, David Matthew Fischer
In-Service Quality Monitoring System with Intelligent Retransmission and Interpolation

Publication number: 20200344287

Abstract: A service request for communication services for communication clients is received. In response, a communication service network is set up to support the communication services. Routing metadata is generated for each of the communication clients. The routing metadata is to be used by each of the communication clients for sharing service quality information with a respective peer communication client over a light-weight peer-to-peer (P2P) network. The routing metadata is downloaded to each of the communication clients. A communication client may exchange service signaling packets or service data packets over the communication service network. When the communication client determines that there is a problematic region in a bitstream received from the communication server, the communication client can request a peer communication client for a service quality information portion related to the problematic region.

Type: Application

Filed: July 13, 2020

Publication date: October 29, 2020

Applicant: Dolby Laboratories Licensing Corporation

Inventors: Shen HUANG, Doh-Suk KIM, Xuejing SUN
METHODS, APPARATUS AND SYSTEM FOR RENDERING AN AUDIO PROGRAM

Publication number: 20200342884

Abstract: A method for generating a bitstream indicative of an object based audio program is described. The bitstream comprises a sequence of containers. A first container of the sequence of containers comprises a plurality of substream entities for a plurality of substreams of the object based audio program and a presentation section. The method comprises determining a set of object channels. The method further comprises providing a set of object related metadata for the set of object channels. In addition, the method comprises inserting a first set of object channel frames and a first set of object related metadata frames into a respective set of substream entities of the first container. Furthermore, the method comprises inserting presentation data into the presentation section.

Type: Application

Filed: May 12, 2020

Publication date: October 29, 2020

Applicant: DOLBY INTERNATIONAL AB

Inventors: Christof FERSCH, Alexander STAHLMANN

prev … 12 13 14 15 16 17 18 19 20 … next