Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

  • Publication number: 20210118452
    Abstract: Some disclosed methods involve encoding or decoding directional audio data. Some encoding methods may involve receiving a mono audio signal corresponding to an audio object and a representation of a radiation pattern corresponding to the audio object. The radiation pattern may include sound levels corresponding to a plurality of sample times, a plurality of frequency bands and a plurality of directions. The methods may involve encoding the mono audio signal and encoding the source radiation pattern to determine radiation pattern metadata. Encoding the radiation pattern may involve determining a spherical harmonic transform of the representation of the radiation pattern and compressing the spherical harmonic transform to obtain encoded radiation pattern metadata.
    Type: Application
    Filed: April 15, 2019
    Publication date: April 22, 2021
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Nicolas R. Tsingos, Mark R. P. Thomas, Christof Fersch
  • Publication number: 20210120306
    Abstract: A video content controller includes a memory and a microprocessor. The memory is configured to store non-transitory computer-readable instructions and video data representing a temporally-varying scene having a plurality of scene-regions. The microprocessor is adapted to execute the instructions to (i) receive a current stream-segment of a video stream corresponding to a first scene-region of the plurality of scene-regions during a current playback-time interval, (ii) download, to a memory, a first video segment of a tagged scene-region within the first scene-region, (iii) receive a first display-region signal indicating at least one of a selection and a feature of the first scene-region, (iv) combine the current stream-segment and the first video segment as a video signal, and (v) transmit, in response to the first display-region signal, the video signal to a display device.
    Type: Application
    Filed: December 28, 2020
    Publication date: April 22, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Timo Kunkel
  • Publication number: 20210112224
    Abstract: A novel high efficiency image projection system includes a beam-steering modulator, an amplitude modulator, and a controller. In a particular embodiment the controller generates beam-steering drive values from image data and uses the beam-steering drive values to drive the beam-steering modulator. Additionally, the controller utilizes the beam-steering drive values to generate a lightfield simulation of a lightfield projected onto the amplitude modulator by the beam-steering modulator. The controller utilizes the lightfield simulation to generate amplitude drive values for driving the amplitude modulator in order to project a high quality version of the image described by the image data.
    Type: Application
    Filed: December 23, 2020
    Publication date: April 15, 2021
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Juan P. Pertierra, Martin J. Richards, Christopher John Orlick, Clement Le Barbenchon, Angelo M. Pires Arrifano
  • Publication number: 20210112358
    Abstract: Embodiments are directed to a method of rendering adaptive audio by receiving input audio comprising channel-based audio, audio objects, and dynamic objects, wherein the dynamic objects are classified as sets of low-priority dynamic objects and high-priority dynamic objects, rendering the channel-based audio, the audio objects, and the low-priority dynamic objects in a first rendering processor of an audio processing system, and rendering the high-priority dynamic objects in a second rendering processor of the audio processing system. The rendered audio is then subject to virtualization and post-processing steps for playback through soundbars and other similar limited height capable speakers.
    Type: Application
    Filed: May 16, 2020
    Publication date: April 15, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Joshua Brandon Lando, Freddie Sanchez, Alan J. Seefeldt
  • Publication number: 20210104254
    Abstract: Methods for echo estimation or echo management (echo suppression or cancellation) on an input audio signal, with at least one of adaptation of a sparse prediction filter set, modification (for example, truncation) of adapted prediction filter impulse responses, generation of a composite impulse response from adapted prediction filter impulse responses, or use of echo estimation and/or echo management resources in a manner determined at least in part by classification of the input audio signal as being (or not being) echo free. Other aspects are systems configured to perform any embodiment of any of the methods.
    Type: Application
    Filed: October 20, 2020
    Publication date: April 8, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Dong SHI, Kai LI, Hannes MUESCH, David GUNAWAN, Paul HOLMBERG, Glenn N. DICKINS
  • Publication number: 20210104252
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
    Type: Application
    Filed: October 23, 2020
    Publication date: April 8, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Publication number: 20210105530
    Abstract: The present document describes a method (500) for providing combined audio and video content from a source device (210) to an audio sink device (230) and to a video sink device (220). The method (500) comprises determining (501) latency information regarding the video latency for processing the video content and the audio latency for processing the audio content of the combined audio and video content along the device chain (100) between the source device (210) and the audio and video sink devices (220, 230). Furthermore, the method (500) comprises delaying (502) the audio content or the video content in dependence of the latency information, and providing (503) the combined audio and video content with the delayed audio content or the delayed video content.
    Type: Application
    Filed: October 2, 2020
    Publication date: April 8, 2021
    Applicant: Dolby International AB
    Inventors: Gael Lassure, Tobias Heinl, Christian Schindler
  • Publication number: 20210093967
    Abstract: A system is configured to receive player multimedia having live coverage of physical expressions or actions of a player in an online activity and activity multimedia of live participation in the online activity associated with one or more play accounts. In a near continuous fashion, the system is configured to select in real time, for each time point, one or more player items being portions of the player multimedia or one or more activity items being portions of the activity multimedia corresponding to the time point to form a composite item. The system is configured to further transmit in real time the composite item to one or more viewer accounts. The system is configured to then receive viewer data in response and produce future composite items based on the viewer data.
    Type: Application
    Filed: September 30, 2020
    Publication date: April 1, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Stephen Spencer HOOKS, Aaron Joseph KRASNOV, Eric Whelan YEARGAN, Stewart MURRIE
  • Publication number: 20210099693
    Abstract: Based on viewing tracking data, a viewer's view direction to a three-dimensional (3D) scene depicted by a first video image is determined. The first video image has been streamed in a video stream to the streaming client device before the first time point and rendered with the streaming client device to the viewer at the first time point. Based on the viewer's view direction, a target view portion is identified in a second video image to be streamed in the video stream to the streaming client device to be rendered at a second time point subsequent to the first time point. The target view portion is encoded into the video stream with a higher target spatiotemporal resolution than that used to encode remaining non-target view portions in the second video image.
    Type: Application
    Filed: December 10, 2020
    Publication date: April 1, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Ajit Ninan
  • Publication number: 20210092461
    Abstract: In some embodiments, an encoder device is disclosed to receive an input video stream containing images in a first dynamic range including a first image. The device receives a second image representing the first image. The device obtains statistical data for the first and the second images. The device determines, at a first time delay, scene cut data from the input video stream and stores the scene cut data in a first sliding window. The device determines, at a second time delay, a first smoothing mapping function based on a second sliding window and the determined scene cut data. The device determines, at a third time delay, a second smoothing mapping function based on a third sliding window and the determined scene cut data. The device generates, at the third time delay, composer metadata for the first image based on the first and second smoothing mapping functions.
    Type: Application
    Filed: February 28, 2019
    Publication date: March 25, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Neeraj J. GADGIL, Guan-Ming SU
  • Publication number: 20210092523
    Abstract: Example embodiments disclosed herein relate to orientation-aware surround sound playback. A method for processing audio on an electronic device that includes a plurality of loudspeakers is disclosed, the loudspeakers arranged in more than one dimension of the electronic device. The method includes, responsive to receipt of a plurality of received audio streams, generating a rendering component associated with the plurality of received audio streams, determining an orientation dependent component of the rendering component, processing the rendering component by updating the orientation dependent component according to an orientation of the loudspeakers and dispatching the received audio streams to the plurality of loudspeakers for playback based on the processed rendering component. Corresponding system and computer program products are also disclosed.
    Type: Application
    Filed: November 19, 2020
    Publication date: March 25, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing SUN, Guilin MA, Xiguang ZHENG
  • Publication number: 20210087047
    Abstract: In one embodiment, dual modulator display systems and methods for rendering target image data upon the dual modulator display system are disclosed, where the display system receives target image data, possibly HDR image data, and first calculates display control signals and then calculates backlight control signals from the display control signals. This order of calculating display signals and then backlight control signals later as a function of the display systems may tend to reduce clipping artifacts. In other embodiments, it is possible to split the input target HDR image data into a base layer and a detail layer, wherein the base layer is the low spatial resolution image data that may be utilized for backlight illumination data. The detail layer is higher spatial resolution image data that may be utilized for display control data.
    Type: Application
    Filed: December 8, 2020
    Publication date: March 25, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin ATKINS, Rabab K. WARD
  • Publication number: 20210091885
    Abstract: A control unit of a multipath data transportation system that optimizes the load of the multiple communication paths of this system when the system transmits a data segment over these paths in parallel with forward error correction. The control unit determines an optimized number of packets to send over each path based on a prediction of quality for each path. The transmitted packets include systematic packets and coded packets.
    Type: Application
    Filed: December 7, 2020
    Publication date: March 25, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Mingchao Yu, Mark Craig Reed
  • Publication number: 20210090583
    Abstract: A method performed in an audio decoder for decoding M encoded audio channels representing N audio channels is disclosed. The method includes receiving a bitstream containing the M encoded audio channels and a set of spatial parameters, decoding the M encoded audio channels, and extracting the set of spatial parameters from the bitstream. The method also includes analyzing the M audio channels to detect a location of a transient, decorrelating the M audio channels, and deriving N audio channels from the M audio channels and the set of spatial parameters. A first decorrelation technique is applied to a first subset of each audio channel and a second decorrelation technique is applied to a second subset of each audio channel. The first decorrelation technique represents a first mode of operation of a decorrelator, and the second decorrelation technique represents a second mode of operation of the decorrelator.
    Type: Application
    Filed: October 5, 2020
    Publication date: March 25, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Mark F. DAVIS
  • Publication number: 20210090580
    Abstract: A method for rendering an audio output based on an audio data stream including M audio signals, side information including a series of reconstruction instances of a reconstruction matrix C and first timing data, the side information allowing reconstruction of N audio objects from the M audio signals, and object metadata defining spatial relationships between the N audio objects. The method includes generating a synchronized rendering matrix based on the object metadata, the first timing data, and information relating to a current playback system configuration, the synchronized rendering matrix having a rendering instance for each reconstruction instance, multiplying each reconstruction instance with a corresponding rendering instance to form a corresponding instance of an integrated rendering matrix, and applying the integrated rendering matrix to the audio signals in order to render an audio output.
    Type: Application
    Filed: December 7, 2020
    Publication date: March 25, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Klaus PEICHL, Tobias FRIEDRICH, Robin THESING, Heiko PURNHAGEN, Martin WOLTERS
  • Publication number: 20210092546
    Abstract: A method (910) for rendering an audio signal in a virtual reality rendering environment (180) is described. The method (910) comprises rendering (911) an origin audio signal of an audio source (311, 312, 313) from an origin source position on an origin sphere (114) around an origin listening position (301) of a listener (181). Furthermore, the method (910) comprises determining (912) that the listener (181) moves from the origin listening position (301) to a destination listening position (302). In addition, the method (910) comprises determining (913) a destination source position of the audio source (311, 312, 313) on a destination sphere (114) around the destination listening position (302) based on the origin source position, and determining (914) a destination audio signal of the audio source (311, 312, 313) based on the origin audio signal.
    Type: Application
    Filed: December 18, 2018
    Publication date: March 25, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leon TERENTIV, Christof FERSCH, Daniel FISCHER
  • Publication number: 20210082444
    Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.
    Type: Application
    Filed: April 10, 2019
    Publication date: March 18, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
  • Publication number: 20210082451
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag. The high frequency regeneration is performed as a post-processing operation with a delay of 3010 samples per audio channel.
    Type: Application
    Filed: April 25, 2019
    Publication date: March 18, 2021
    Applicant: Dolby International AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES, Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20210082440
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Application
    Filed: July 24, 2020
    Publication date: March 18, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander Krueger
  • Publication number: 20210084307
    Abstract: To support lossless mode at the block level when in-loop reshaping (LMCS) is enabled, the following changes are proposed to the existing LMCS pipeline. In intra mode, encode lossless blocks in the original domain, thus bypassing inverse mapping after reconstruction in the decoder. In inter mode, encode lossless blocks in the original domain, thus bypassing both forward mapping after motion compensation and inverse mapping after reconstruction in the decoder. In both modes, disable any LMCS-related color scaling.
    Type: Application
    Filed: September 1, 2020
    Publication date: March 18, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy
  • Publication number: 20210080737
    Abstract: A projector system comprising a laser light source, a collimating lens, a fly's-eye lens, an integrating rod and a first modulator is disclosed. The light from a laser light source/fiber illuminates a collimator to substantially collimate the light and then is transmitted through a fly's-eye lens. The fly's-eye lens provides a desired angular/spatial light distribution for further processing to a first modulator of the projector system.
    Type: Application
    Filed: November 24, 2020
    Publication date: March 18, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Martin J. RICHARDS, Duane Scott DEWALD, Nathan WAINWRIGHT, Barret LIPPEY
  • Publication number: 20210076042
    Abstract: A method for generating metadata for use by a video decoder for displaying video content encoded by a video encoder includes: (1) accessing a target tone mapping curve; (2) accessing a decoder tone curve corresponding to a tone curve used by the video decoder for tone mapping the video content; (3) generating a plurality of parameters of a trim-pass function used by the video decoder to apply after applying the decoder tone curve to the video content, wherein the parameters of the trim-pass function are generated to approximate the target tone curve with the combination of the trim-pass function and the decoder tone curve, and (4) generating the metadata for use by the video decoder, including said plurality of parameters of the trim-pass function.
    Type: Application
    Filed: September 5, 2018
    Publication date: March 11, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Anustup Kumar Atanu CHOUDHURY, Robin ATKINS, Thaddeus BEIER, Ali ZANDIFAR, Ian GODIN
  • Publication number: 20210076079
    Abstract: Methods, processes, and systems are presented for inter-layer scaling and mapping to perform dynamic-range scalability in video coding using single-loop video decoders. Two approaches are proposed: a) A mapping function is applied to each of the color components, b) a mapping function is applied to the luma component and luma-based scaling is applied to the chroma components. Both schemes may be combined with existing schemes for in-loop reshaping, such as luma mapping and chroma residual scaling employed in the versatile video codec (VVC).
    Type: Application
    Filed: August 28, 2020
    Publication date: March 11, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Taoran Lu, Peng Yin, Fangjun Pu
  • Publication number: 20210065728
    Abstract: A method (600) for decoding an encoded audio signal (102) is described. The encoded audio signal (102) comprises a sequence of frames. Furthermore, the encoded audio signal (102) is indicative of a plurality of different dynamic range control (DRC) profiles for a corresponding plurality of different rendering modes. Different subsets of DRC profiles from the plurality of DRC profiles are comprised within different frames of the sequence of frames, such that two or more frames of the sequence of frames jointly comprise the plurality of DRC profiles.
    Type: Application
    Filed: September 16, 2020
    Publication date: March 4, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Holger HOERICH, Jeroen KOPPENS
  • Publication number: 20210065724
    Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in an increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
    Type: Application
    Filed: November 11, 2020
    Publication date: March 4, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Juan Felix TORRES, David S. MCGRATH, Michael William MASON
  • Publication number: 20210056978
    Abstract: Methods and audio processing units for generating an object based audio program including conditional rendering metadata corresponding to at least one object channel of the program, where the conditional rendering metadata is indicative of at least one rendering constraint, based on playback speaker array configuration, which applies to each corresponding object channel, and methods for rendering audio content determined by such a program, including by rendering content of at least one audio channel of the program in a manner compliant with each applicable rendering constraint in response to at least some of the conditional rendering metadata. Rendering of a selected mix of content of the program may provide an immersive experience.
    Type: Application
    Filed: August 18, 2020
    Publication date: February 25, 2021
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Stewart MURRIE
  • Publication number: 20210056984
    Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.
    Type: Application
    Filed: April 24, 2019
    Publication date: February 25, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Chunmao ZHANG, Lianwu CHEN, Ziyu YANG, Joshua Brandon LANDO, David Matthew FISCHER, Lie LU
  • Publication number: 20210058729
    Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (X_PS(k−1)) and a frame of an ambient HOA component (C̃_AMB(k−1)). The ambient HOA component (C̃_AMB(k−1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (c_n(k−1)) in lower positions and second HOA coefficient sequences (c_AMB,n(k−1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
    Type: Application
    Filed: September 3, 2020
    Publication date: February 25, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
  • Publication number: 20210050834
    Abstract: A variable-resolution graphic equalizer providing an improved interface for controlling gain values across the entire audio spectrum using many narrow-band filters (e.g., 120). It allows user selection of a frequency range for graphic equalization and automatically maps a reduced and fixed number of sliders to the selected range based on the number of filter bands falling within the selected range. In an audio processing system, specific user interface regions are highlighted to display selected frequency ranges and corresponding selected sliders to allow for rapid and precise equalization of the full audio spectrum using the many narrow-band filters.
    Type: Application
    Filed: June 6, 2017
    Publication date: February 18, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Ethan A. GROSSMAN
  • Publication number: 20210047061
    Abstract: Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller; a first modulator, receiving light from the light source and rendering a halftone image of said input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from an input image, correcting for misalignment between the first and second modulators and calibrating the projector system (e.g., over time) for continuous image improvement are also disclosed.
    Type: Application
    Filed: June 23, 2020
    Publication date: February 18, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jerome SHIELDS, Martin J. RICHARDS, Juan P. PERTIERRA
  • Publication number: 20210049747
    Abstract: A coded video sequence is received in a bitstream with a set of content scan adaptive metadata. It is ascertained if the set of content scan adaptive metadata is received. The set of content scan adaptive metadata includes a maximum content light level parameter and a maximum frame average light level parameter. The maximum content light level parameter and maximum frame average light level parameter are both dependent on a scan type of the frames of the coded video sequence, the scan type being at least one of a progressive frame type, complementary field pair type, macroblock-adaptive frame-field frame type, and individual field picture type.
    Type: Application
    Filed: January 8, 2019
    Publication date: February 18, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Sean Thomas McCarthy
  • Publication number: 20210051435
    Abstract: In some embodiments, virtualization methods for generating a binaural signal in response to channels of a multi-channel audio signal, which apply a binaural room impulse response (BRIR) to each channel including by using at least one feedback delay network (FDN) to apply a common late reverberation to a downmix of the channels. In some embodiments, input signal channels are processed in a first processing path to apply to each channel a direct response and early reflection portion of a single-channel BRIR for the channel, and the downmix of the channels is processed in a second processing path including at least one FDN which applies the common late reverberation. Typically, the common late reverberation emulates collective macro attributes of late reverberation portions of at least some of the single-channel BRIRs. Other aspects are headphone virtualizers configured to perform any embodiment of the method.
    Type: Application
    Filed: September 4, 2020
    Publication date: February 18, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Kuan-Chieh YEN, Dirk Jeroen BREEBAART, Grant A. DAVIDSON, Rhonda WILSON, David M. COOPER, Zhiwei SHUANG
  • Publication number: 20210051432
    Abstract: A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound field and a second set of decoded higher order ambisonics signals representing ambient components of the sound field. The method also includes combining the first set of decoded higher order ambisonics signals and the second set of decoded higher order ambisonics signals to produce a combined set of decoded higher order ambisonics signals.
    Type: Application
    Filed: August 26, 2020
    Publication date: February 18, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Peter JAX, Johannes BOEHM, William REDMANN
  • Publication number: 20210050022
    Abstract: The present document describes a method (400) for encoding a soundfield representation (SR) input signal (101, 301) describing a soundfield at a reference position, wherein the SR input signal (101, 301) comprises a plurality of channels for a plurality of different directivity patterns of the soundfield at the reference position. The method (400) comprises extracting (401) one or more audio objects (103, 303) from the SR input signal (101, 301). Furthermore, the method (400) comprises determining (402) a residual signal (102, 302) based on the SR input signal (101, 301) and based on the one or more audio objects (103, 303). The method (400) also comprises performing joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302). In addition, the method (400) comprises generating (403) a bitstream (701) based on data generated in the context of joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302).
    Type: Application
    Filed: January 17, 2019
    Publication date: February 18, 2021
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Kristofer KJOERLING, David S. MCGRATH, Heiko PURNHAGEN, Mark R.P. THOMAS
  • Publication number: 20210044887
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for wireless audio source switching. One of the methods includes receiving user input selecting a user interface control corresponding to one of a plurality of paired audio source devices with a wireless audio sink device that is a slave device of a first master/slave connection with a first paired audio source device of the plurality of paired audio source devices. The wireless audio sink device initiates a request to become a master device of a second master/slave connection with a second paired audio source device corresponding to the selected user interface control. The wireless audio sink device sends to the second paired audio source device a request to become a slave device of the second master/slave connection. The wireless audio sink device receives audio input from the second paired audio source device and outputs the received audio input.
    Type: Application
    Filed: October 26, 2020
    Publication date: February 11, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jack T. HA, Adam Scott KONIAK, Gareth James SELFE
  • Publication number: 20210035588
    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.
    Type: Application
    Filed: July 10, 2020
    Publication date: February 4, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20210035592
    Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.
    Type: Application
    Filed: October 16, 2020
    Publication date: February 4, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Barbara RESCH, Kristofer KJÖRLING, Lars VILLEMOES
  • Publication number: 20210037335
    Abstract: Described is a method of processing position information indicative of an object position of an audio object, wherein the object position is usable for rendering of the audio object, that comprises: obtaining listener orientation information indicative of an orientation of a listener's head; obtaining listener displacement information indicative of a displacement of the listener's head; determining the object position from the position information; modifying the object position based on the listener displacement information by applying a translation to the object position; and further modifying the modified object position based on the listener orientation information. Further described is a corresponding apparatus for processing position information indicative of an object position of an audio object, wherein the object position is usable for rendering of the audio object.
    Type: Application
    Filed: April 9, 2019
    Publication date: February 4, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Christof FERSCH, Leon TERENTIV, Daniel FISCHER
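    Note: the two-stage modification described in this abstract (translate by the listener displacement, then apply the listener orientation) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the application: the function name, the axis convention (x forward, y left, z up), and the restriction to yaw-only rotation are all hypothetical.

```python
import math

def modify_object_position(obj_pos, displacement, yaw_rad):
    """Hypothetical sketch: translate, then rotate, an object position.

    obj_pos and displacement are (x, y, z) tuples in a room-fixed frame.
    yaw_rad is the head rotation about the z axis, positive to the left.
    """
    # Step 1: translation. A head displacement d moves the object by -d
    # relative to the listener.
    tx = obj_pos[0] - displacement[0]
    ty = obj_pos[1] - displacement[1]
    tz = obj_pos[2] - displacement[2]
    # Step 2: orientation. Rotating the head by +yaw rotates the scene
    # by -yaw in head-relative coordinates.
    c, s = math.cos(-yaw_rad), math.sin(-yaw_rad)
    return (c * tx - s * ty, s * tx + c * ty, tz)

# An object straight ahead ends up on the listener's right (negative y)
# after the head turns 90 degrees to the left.
pos = modify_object_position((1.0, 0.0, 0.0), (0.0, 0.0, 0.0), math.pi / 2)
```

    The order matters in this sketch: the translation is applied in room coordinates first, and only the translated position is rotated into the head frame.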
  • Publication number: 20210029482
    Abstract: Embodiments are described for rendering spatial audio content through a system that is configured to reflect audio off of one or more surfaces of a listening environment. The system includes an array of audio drivers distributed around a room, wherein at least one driver of the array of drivers is configured to project sound waves toward one or more surfaces of the listening environment for reflection to a listening area within the listening environment and a renderer configured to receive and process audio streams and one or more metadata sets that are associated with each of the audio streams and that specify a playback location in the listening environment.
    Type: Application
    Filed: August 11, 2020
    Publication date: January 28, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Brett G. CROCKETT, Spencer HOOKS, Alan SEEFELDT, Joshua B. LANDO, C. Phillip BROWN, Sripal S. MEHTA, Stewart MURRIE
  • Publication number: 20210027423
    Abstract: A method for demosaicing an image captured by an image sensor includes (a) computing, for each of a first plurality of interpolated pixel-values and based on neighboring primary pixel-values, a respective first confidence value; (b) generating a first thresholded mapping from a first coordinate mapping by removing each interpolated pixel-value having a first confidence value less than a threshold value; (c) repeating steps of computing and generating for a second and third sub-plurality of sensor pixels to yield a second and a third thresholded mapping; (d) determining high-confidence array-coordinates as array-coordinates included in all thresholded mappings, remaining array-coordinates being low-confidence array-coordinates; (e) forming a refined image including, at each high-confidence array-coordinate, a respective pixel-value-triplet including one primary pixel-value and two interpolated pixel-values; and (f) filling the refined image by assigning, for each low-confidence array-coordinate, a pixel-value
    Type: Application
    Filed: January 15, 2019
    Publication date: January 28, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Gregory John Ward
  • Publication number: 20210029009
    Abstract: Some implementations involve analyzing audio packets received during a time interval that corresponds with a conversation analysis segment to determine network jitter dynamics data and conversational interactivity data. The network jitter dynamics data may provide an indication of jitter in a network that relays the audio data packets. The conversational interactivity data may provide an indication of interactivity between participants of a conversation represented by the audio data. A jitter buffer size may be controlled according to the network jitter dynamics data and the conversational interactivity data. The time interval may include a plurality of talkspurts.
    Type: Application
    Filed: August 7, 2020
    Publication date: January 28, 2021
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Kai LI, Xuejing SUN, Gary SPITTLE
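    Note: the abstract says a jitter buffer size may be controlled from both network jitter data and conversational interactivity data, without giving formulas. The sketch below is one possible reading under stated assumptions: jitter is taken as the standard deviation of inter-arrival deviations, interactivity as the fraction of consecutive talkspurts that change speaker, and the 4x to 2x headroom rule and all function names are hypothetical.

```python
from statistics import pstdev

def jitter_ms(arrival_ms, send_interval_ms=20.0):
    """Assumed jitter metric: spread of inter-arrival gaps around the
    nominal packet interval, in milliseconds."""
    gaps = [b - a for a, b in zip(arrival_ms, arrival_ms[1:])]
    return pstdev([g - send_interval_ms for g in gaps])

def interactivity(talkspurts):
    """Assumed interactivity metric in [0, 1]: fraction of consecutive
    talkspurts, given as (speaker, start_ms, end_ms), that change speaker."""
    if len(talkspurts) < 2:
        return 0.0
    pairs = list(zip(talkspurts, talkspurts[1:]))
    changes = sum(1 for a, b in pairs if a[0] != b[0])
    return changes / len(pairs)

def jitter_buffer_ms(jitter, interact, min_ms=20.0, max_ms=200.0):
    """Hypothetical rule: more headroom for monologues (latency matters
    less), less headroom for rapid turn-taking (latency matters more)."""
    multiplier = 4.0 - 2.0 * interact
    return min(max_ms, max(min_ms, multiplier * jitter))

arrivals = [0.0, 30.0, 40.0, 70.0, 80.0]          # gaps alternate 30, 10 ms
spurts = [("a", 0, 100), ("b", 100, 200), ("a", 200, 300)]
size = jitter_buffer_ms(jitter_ms(arrivals), interactivity(spurts))
```

    The design choice shown is only the trade-off itself: a larger buffer absorbs more jitter but adds delay, which hurts most when participants alternate quickly.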
  • Publication number: 20210027795
    Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
    Type: Application
    Filed: July 10, 2020
    Publication date: January 28, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
  • Publication number: 20210029361
    Abstract: Given a representation of a forward reshaping function in an M-piecewise linear representation (M-PWL), methods, processes, and systems are presented for generating the forward reshaping function in an N-PWL representation (where N<M), by properly adjusting the first and last segments, and for generating a decoded bitstream using the adjusted N-PWL reshaping representation. Syntax methods for signaling reshaping parameters are also presented.
    Type: Application
    Filed: May 18, 2020
    Publication date: January 28, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Taoran LU, Peng YIN
  • Publication number: 20210020186
    Abstract: The present disclosure relates to an apparatus for decoding an encoded Unified Audio and Speech stream. The apparatus comprises a core decoder for decoding the encoded Unified Audio and Speech stream. The core decoder includes an eSBR unit for extending a bandwidth of an input signal, the eSBR unit including a QMF based harmonic transposer. The QMF based harmonic transposer is configured to process the input signal in the QMF domain, in each of a plurality of synthesis subbands, to extend the bandwidth of the input signal. The QMF based harmonic transposer is configured to operate at least in part based on pre-computed information. The present disclosure further relates to corresponding methods and storage media.
    Type: Application
    Filed: December 19, 2018
    Publication date: January 21, 2021
    Applicant: Dolby International AB
    Inventors: Rajat KUMAR, Ramesh KATURI, Saketh SATHUVALLI, Reshma RAI
  • Publication number: 20210021794
    Abstract: A method for displaying image data includes defining virtual color gamuts based on a plurality of primary display colors associated with a light source. At least one of the virtual color gamuts is defined to approximate an established color gamut. Intensity values associated with the virtual color gamuts are generated based on received video data, and the intensity values associated with the virtual color gamuts are used to generate drive values for the primary colors of the light source. A display using one or more virtual color gamuts is also disclosed.
    Type: Application
    Filed: October 6, 2020
    Publication date: January 21, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Trevor DAVIES, Martin J. RICHARDS, Ashley PENNA
  • Publication number: 20210021247
    Abstract: The invention relates to methods and apparatus for adjusting a level of an audio signal. An audio signal is divided into a plurality of frequency bands. Modification parameters are obtained for at least one of the plurality of frequency bands. Gain factors are derived for at least one of the plurality of frequency bands, the gain factors determined based on the amplitude scale factors. The gain factors are smoothed. A level of noise from noise compensation factors is determined. The gain factors are applied to at least one of the frequency bands to generate gain adjusted frequency bands. The level of noise is adjusted based on the gain adjusted frequency bands. At least one of the frequency bands is filtered with a filter generated with the filter coefficients. The plurality of frequency bands is synthesized to generate an output audio signal.
    Type: Application
    Filed: July 20, 2020
    Publication date: January 21, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Alan Jeffrey SEEFELDT
  • Publication number: 20210014615
    Abstract: Some disclosed methods may involve receiving audio reproduction data and determining, based on the audio reproduction data, a sound source location at which a sound is to be rendered. A near-field gain and a far-field gain may be based, at least in part, on a sound source distance between the sound source location and a reproduction environment location. Room speaker feed signals may be based, at least in part, on room speaker positions, the sound source location and the far-field gain. Near-field speaker feed signals may be based, at least in part, on the near-field gain, the sound source location and a position of near-field speakers.
    Type: Application
    Filed: February 17, 2020
    Publication date: January 14, 2021
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Remi S. AUDFRAY, Nicolas R. TSINGOS, Pradeep Kumar GOVINDARAJU
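    Note: the abstract states only that the near-field and far-field gains depend on the sound source distance. As a rough illustration, the sketch below assumes an equal-power crossfade between two radii. The radii, the crossfade law, and the function name are hypothetical choices, not details from the application.

```python
import math

def near_far_gains(source, reference, near_radius=0.5, far_radius=2.0):
    """Hypothetical distance-based crossfade.

    Returns (near_gain, far_gain). Sources at or inside near_radius go
    entirely to the near-field speakers, sources at or beyond far_radius
    go entirely to the room speakers.
    """
    distance = math.dist(source, reference)
    # Normalised position between the two radii, clamped to [0, 1].
    x = (distance - near_radius) / (far_radius - near_radius)
    x = min(1.0, max(0.0, x))
    # Equal-power law: near_gain**2 + far_gain**2 == 1 at every distance.
    near_gain = math.cos(x * math.pi / 2)
    far_gain = math.sin(x * math.pi / 2)
    return near_gain, far_gain

near, far = near_far_gains((1.25, 0.0, 0.0), (0.0, 0.0, 0.0))
```

    In a fuller renderer these two gains would scale the near-field speaker feeds and the room speaker feeds respectively, with the panning inside each set driven by the source location and speaker positions.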
  • Publication number: 20210014628
    Abstract: Described herein is a method (30) of rendering an audio signal (17) for playback in an audio environment (27) defined by a target loudspeaker system (23), the audio signal (17) including audio data relating to an audio object and associated position data indicative of an object position. Method (30) includes the initial step (31) of receiving the audio signal (17). At step (32) loudspeaker layout data for the target loudspeaker system (23) is received. At step (33) control data is received that is indicative of a position modification to be applied to the audio object in the audio environment (27). At step (38) in response to the position data, loudspeaker layout data and control data, rendering modification data is generated. Finally, at step (39) the audio signal (17) is rendered with the rendering modification data to output the audio signal (17) with the audio object at a modified object position that is between loudspeakers within the audio environment (27).
    Type: Application
    Filed: July 24, 2020
    Publication date: January 14, 2021
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen BREEBAART, Antonio MATEOS SOLE, Heiko PURNHAGEN, Nicolas R. TSINGOS
  • Publication number: 20210012781
    Abstract: Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which is represented by one or more audio signals. The encoder generates a bit stream which comprises downmix signals and side information which includes individual matrix elements of a reconstruction matrix which enables reconstruction of the one or more audio signals in the decoder.
    Type: Application
    Filed: July 24, 2020
    Publication date: January 14, 2021
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Leif Jonas SAMUELSSON, Toni HIRVONEN
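    Note: the decoder-side idea in this abstract (side information carrying individual elements of a reconstruction matrix, which is then applied to the downmix signals) can be sketched as follows. The (row, column, value) element format, the zero default for unsent elements, and both function names are assumptions for illustration only.

```python
def build_matrix(elements, n_outputs, n_downmix):
    """Assemble a reconstruction matrix from individually signalled
    elements. Elements that were not sent default to zero (assumption)."""
    matrix = [[0.0] * n_downmix for _ in range(n_outputs)]
    for row, col, value in elements:
        matrix[row][col] = value
    return matrix

def reconstruct(downmix, matrix):
    """Apply the matrix to the downmix: each output signal is a weighted
    sum of the downmix signals, computed sample by sample."""
    n_samples = len(downmix[0])
    outputs = []
    for row in matrix:
        if len(row) != len(downmix):
            raise ValueError("matrix width must equal number of downmix signals")
        outputs.append([
            sum(coef * channel[t] for coef, channel in zip(row, downmix))
            for t in range(n_samples)
        ])
    return outputs

# Two downmix signals, three reconstructed signals.
side_info = [(0, 0, 1.0), (1, 1, 1.0), (2, 0, 0.5), (2, 1, 0.5)]
signals = reconstruct([[1.0, 2.0], [3.0, 4.0]], build_matrix(side_info, 3, 2))
```

    Sending elements individually means a sparse matrix costs only as much side information as it has non-zero entries, which is the motivation assumed in this sketch.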
  • Publication number: 20210014578
    Abstract: Described herein is a method for processing of media data for playback, wherein the method includes the steps of: (a) fetching, by a web proxy, from two or more media servers media data and a media manifest file including metadata information relating to the fetched media data, and merging, by the web proxy, the media data; (b) modifying, by said web proxy, the content of the media manifest file and/or the content of the media data; and (c) providing, by said web proxy, the media manifest file and the media data as modified in step (b) to a media retrieval element for receiving and processing the media manifest file and the media data for decoding or playback, wherein a localhost address is assigned to the web proxy and the web proxy acts as a server for said media retrieval element. Further described are a respective device and computer program product.
    Type: Application
    Filed: July 3, 2020
    Publication date: January 14, 2021
    Applicant: Dolby International AB
    Inventors: Wolfgang A. Schildbach, Christof Fersch, Holger Hoerich