Dolby Labs Patent Applications

Patents granted to Dolby Labs by the U.S. Patent and Trademark Office (USPTO).

  • Publication number: 20180268829
    Abstract: Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive, perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects, and typically also a default set of objects which will be rendered in the absence of a selection by a user) and a bed of speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
    Type: Application
    Filed: May 24, 2018
    Publication date: September 20, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robert Andrew FRANCE, Thomas ZIEGLER, Sripal S. MEHTA, Andrew Jonathan DOWELL, Prinyar SAUNGSOMBOON, Michael David DWYER, Farhad FARAHANI, Nicolas R. Tsingos, Freddie SANCHEZ
  • Publication number: 20180268831
    Abstract: An encoding system (400) encodes an N-channel audio signal (X), wherein N?3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters ({tilde over (C)}, {tilde over (P)}). In a decoding system (200), a decorrelating section (101) outputs, based on the downmix signal, an (N?1)-channel decorrelated signal (Z); a dry upmix section (102) maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section (103) populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section (104) combines outputs from the upmix sections to obtain a reconstructed signal ({circumflex over (X)}) corresponding to the signal to be reconstructed.
    Type: Application
    Filed: May 21, 2018
    Publication date: September 20, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars VILLEMOES, Heidi-Maria LEHTONEN, Heiko PURNHAGEN, Toni HIRVONEN
  • Publication number: 20180268827
    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.
    Type: Application
    Filed: October 7, 2016
    Publication date: September 20, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20180270451
    Abstract: Systems and methods are described for detecting and remedying potential incongruence in a video conference. A camera of a video conferencing system may capture video images of a conference room. A processor of the video conferencing system may identify locations of a plurality of participants within an image plane of a video image. Using face and shape detection, a location of a center point of each identified participant's torso may be calculated. A region of congruence bounded by key parallax lines may be calculated, the key parallax lines being a subset of all parallax lines running through the center points of each identified participant. When the audio device location is not within the region of congruence, audio captured by an audio device may be adjusted to reduce effects of incongruence when the captured audio is replayed at a far end of the video conference.
    Type: Application
    Filed: March 12, 2018
    Publication date: September 20, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Glenn N. DICKINS, Ludovic Christophe MALFAIT, David GUNAWAN
  • Publication number: 20180270598
    Abstract: Audio perception in local proximity to visual cues is provided. A device includes a video display, first row of audio transducers, and second row of audio transducers. The first and second rows can be vertically disposed above and below the video display. An audio transducer of the first row and an audio transducer of the second row form a column to produce, in concert, an audible signal. The perceived emanation of the audible signal is from a plane of the video display (e.g., a location of a visual cue) by weighing outputs of the audio transducers of the column. In certain embodiments, the audio transducers are spaced farther apart at a periphery for increased fidelity in a center portion of the plane and less fidelity at the periphery.
    Type: Application
    Filed: October 19, 2016
    Publication date: September 20, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Christophe Chabanne, Nicolas R. Tsingos, Charles Q. Robinson
  • Publication number: 20180270600
    Abstract: For generating 3D audio content from a two-channel stereo signal, the stereo signal (x(t)) is partitioned into overlapping sample blocks and is transformed into time-frequency domain. From the stereo signal directional and ambient signal components are separated, wherein the estimated directions of the directional components are changed by a predetermined factor, wherein, if changes are within a predetermined interval, they are combined in order to form a directional centre channel object signal. For the other directions an encoding to Higher Order Ambisonics (HOA) is performed. Additional ambient signal channels are generated by de-correlation and rating by gain factors, followed by encoding to HOA. The directional HOA signals and the ambient HOA signals are combined, and the combined HOA signal and the centre channel object signals are transformed to time domain.
    Type: Application
    Filed: September 29, 2016
    Publication date: September 20, 2018
    Applicant: DOLBY INTERNATIONAL
    Inventors: Johannes BOEHM, Xiaoming CHEN
  • Publication number: 20180262769
    Abstract: Pixel data of a video sequence with enhanced dynamic range (EDR) are predicted based on pixel data of a corresponding video sequence with standard dynamic range (SDR) and an inter-layer predictor. Under a highlights clipping constrain, conventional SDR to EDR prediction is adjusted as follows: a) given a highlights threshold, the SDR to EDR predictor is adjusted to output a fixed output value for all input SDR pixel values larger than the highlights threshold, and b) given a dark-regions threshold, the residual values between the input EDR signal and its predicted value are set to zero for all input SDR pixel values lower than the dark-regions threshold. Example processes to determine the highlights and dark-regions thresholds and whether highlights clipping is occurring are provided.
    Type: Application
    Filed: February 16, 2016
    Publication date: September 13, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Guan-Ming SU, Qian CHEN
  • Publication number: 20180261184
    Abstract: A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.
    Type: Application
    Filed: September 21, 2016
    Publication date: September 13, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Ajit NINAN
  • Publication number: 20180262856
    Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
    Type: Application
    Filed: February 9, 2016
    Publication date: September 13, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
  • Publication number: 20180262551
    Abstract: In one embodiment, a method for optimizing delivery of a digital program having a plurality of selectable program components includes delivering to a first node a composite set of program components, assembling from the composite set first and second subsets of program components, the first and second subsets differing by at least one program component, delivering the first subset of program components to a first user, and delivering the second subset of program components to a second user. The program components relate to multiple program categories and each of the multiple program categories is associated with a program presentation aspect and comprises a plurality of selections.
    Type: Application
    Filed: September 21, 2016
    Publication date: September 13, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael F. DEMEYER, Timothy E. ONDERS
  • Publication number: 20180255398
    Abstract: A system and method of providing a signal to wireless speakers. An audio stick connects to a source device and provides the audio signal to the wireless speakers. The audio stick combines the functionality of two separate devices: an audio receiver device and a transmitter device for the wireless speakers.
    Type: Application
    Filed: September 1, 2016
    Publication date: September 6, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: David Matthew FISCHER, Sripal S. MEHTA, Timothy James EGGERDING
  • Publication number: 20180254053
    Abstract: Example embodiments disclosed herein relate to filter coefficient updating in time domain filtering. A method of processing an audio signal is disclosed. The method includes obtaining a predetermined number of target gains for a first portion of the audio signal by analyzing the first portion of the audio signal. Each of the target gains is corresponding to a linear subband of the audio signal. The method also includes determining a filter coefficients for time domain filtering the first portion of the audio signal so as to approximate a frequency response given by the target gains. The filter coefficients are determined by iteratively selecting at least one target gain from the target gains and updating the filter coefficient based on the selected at least one target gain. Corresponding system and computer program product for processing an audio signal are also disclosed.
    Type: Application
    Filed: September 15, 2016
    Publication date: September 6, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Dong SHI, Xuejing SUN
  • Publication number: 20180255206
    Abstract: Methods and systems for color transforms are disclosed. A memory footprint of look up tables for color transforms can be reduced by separating the look up tables into factors, applying frequency domain transforms, dividing the look up tables into zones, or establishing hierarchical levels with increasing resolution. The methods can be applied to still image or video cameras with limited computation resources that can benefit from reduced memory footprints.
    Type: Application
    Filed: September 29, 2016
    Publication date: September 6, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Joonsoo KIM, Jon S. MCELVAIN
  • Publication number: 20180254047
    Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
    Type: Application
    Filed: February 22, 2018
    Publication date: September 6, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Juan Felix TORRES, David S. MCGRATH, Michael William MASON
  • Publication number: 20180247656
    Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. An improved method for encoding pre-processed audio data comprises encoding the pre-processed audio data, and encoding auxiliary data that indicate the particular audio pre-processing. An improved method for decoding encoded audio data comprises determining that the encoded audio data had been pre-processed before encoding, decoding the audio data, extracting from received data information about the pre-processing, and post-processing the decoded audio data according to the extracted pre-processing information.
    Type: Application
    Filed: April 30, 2018
    Publication date: August 30, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Oliver WUEBBOLT, Johannes BOEHM, Peter JAX
  • Publication number: 20180247660
    Abstract: An audio processing system (100) accepts an audio bitstream having one of a plurality of predefined audio frame rates. The system comprises a front-end component (110), which receives a variable number of quantized spectral components, corresponding to one audio frame in any of the predefined audio frame rates, and performs an inverse quantization according to predetermined, frequency-dependent quantization levels. The front-end component may be agnostic of the audio frame rate. The audio processing system further comprises a frequency-domain processing stage (120) and a sample rate converter (130), which provide a reconstructed audio signal sampled at a target sampling frequency independent of the audio frame rate. By its frame-rate adaptability, the system can be configured to operate frame-synchronously in parallel with a video processing system that accepts plural video frame rates.
    Type: Application
    Filed: February 23, 2018
    Publication date: August 30, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Kristofer KJOERLING, Alexander STAHLMANN, Jens POPP, Karl Jonas ROEDEN
  • Publication number: 20180247646
    Abstract: One or more context aware processing parameters and an ambient audio stream are received. One or more sound characteristics associated with the ambient audio stream are identified using a machine learning model. One or more actions to perform are determined using the machine learning model and based on the one or more context aware processing parameters and the identified one or more sound characteristics. The one or more actions are performed.
    Type: Application
    Filed: December 21, 2017
    Publication date: August 30, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jacob Meacham, Matthew Sills, Richard Fritz Lanman, III, Jeffrey Baker
  • Publication number: 20180240465
    Abstract: The present disclosure provides methods, devices and computer program products for encoding and decoding of a vector of parameters in an audio coding system. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system. According to the disclosure, a modulo differential approach for coding and encoding a vector of a non-periodic quantity may improve the coding efficiency and provide encoders and decoders with less memory requirements. Moreover, an efficient method for encoding and decoding a sparse matrix is provided.
    Type: Application
    Filed: April 5, 2018
    Publication date: August 23, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leif Jonas SAMUELSSON, Heiko PURNHAGEN
  • Publication number: 20180240437
    Abstract: A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.
    Type: Application
    Filed: April 23, 2018
    Publication date: August 23, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jon Scott MILLER, Scott DALY, Mahdi NEZAMABADI, Robin ATKINS
  • Publication number: 20180240469
    Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
    Type: Application
    Filed: April 18, 2018
    Publication date: August 23, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexander KRUEGER, Sven KORDON, Oliver WUEBBOLT
  • Publication number: 20180240470
    Abstract: Example embodiments disclosed herein relate to source separation in audio content. A method for separating sources from audio content is disclosed, the audio content being of a multi-channel format based on a plurality of channels. The method comprises performing a component analysis on the audio content for each of the plurality of channels to generate a plurality of components, each of the plurality of components comprising a plurality of time-frequency tiles in full frequency band; generating at least one dominant source with at least one of the time-frequency tiles from the plurality of the components and separating the sources from the audio content by estimating spatial parameters and spectral parameters based on the dominant source. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: February 11, 2016
    Publication date: August 23, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Jun WANG
  • Publication number: 20180242077
    Abstract: A soundbar speaker for transmitting reflected sound waves off an upper surface down to a listening environment, comprising: a cabinet containing a plurality of audio drivers, direct-firing drivers within the cabinet oriented to transmit sound along a horizontal axis substantially perpendicular to a front surface of the cabinet, and a pair of upward-firing slotted drivers placed proximate to ends of an top surface of the cabinet and oriented at an inclination angle relative to the horizontal axis. The slotted drivers are configured to create an overlapping reflected sound projection for high frequency sound when reflected down to a listening position located at a distance in front of the speaker pair. Such a speaker projects reflected sound that provides wider horizontal or side-to-side dispersion to better cover the listening area.
    Type: Application
    Filed: August 11, 2016
    Publication date: August 23, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael J. SMITHERS, Alan J. SEEFELDT
  • Publication number: 20180234612
    Abstract: Methods, systems, and computer program products for automatically positioning a content capturing device are disclosed. A vehicle, e.g., an UAV, carries the content capturing device, e.g., a camcorder. The UAV can position the content capturing device at a best location for viewing a subject based on one or more audio or visual cues. The UAV can follow movement of the subject to achieve best audio or visual effect. In some implementations, a controller device carried by the subject can generate one or more signals for the UAV to follow. The controller device may be coupled to a microphone that records audio. The signals can be used to temporally synchronize video captured at the UAV and audio captured by the microphone.
    Type: Application
    Filed: October 17, 2017
    Publication date: August 16, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Timo KUNKEL, Cong ZHOU, Vivek KUMAR, Remi S. AUDFRAY
  • Publication number: 20180234784
    Abstract: Higher Order Ambisonics (HOA) signals can be compressed by decomposition into a predominant sound component and a residual ambient component. The compressed representation comprises pre-dominant sound signals, coefficient sequences of the ambient component and side information. For efficiently combining HOA decompression and HOA rendering to obtain loudspeaker signals, combined rendering and decoding of the compressed HOA signal comprises perceptually decoding the perceptually coded portion and decoding the side information, without reconstructing HOA coefficient sequences. For reconstructing components of a first type, fading of coefficient sequences is not required, while for components of a second type fading is required. For each second type component, different linear operations are determined: one for coefficient sequences that in a current frame require no fading, one for those that require fading-in, and one for those that require fading-out.
    Type: Application
    Filed: March 1, 2016
    Publication date: August 16, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20180234069
    Abstract: A method in a computing device configured to perform volume-levelling processing on input audio data by at least applying one or more filters to the input audio data, the computing device being configured to obtain an estimated difference between a target output loudness level and a loudness level associated with the input audio data, and to adapt the filter coefficients of the one or more filters based on the estimated difference. The method involves starting, or stopping, the volume-levelling processing. The method comprises gradually increasing, or decreasing, a weighting applied to the estimated difference, in response to obtaining an indication to start, or stop, the volume-levelling processing.
    Type: Application
    Filed: August 17, 2016
    Publication date: August 16, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Mark David DE BURGH
  • Publication number: 20180234785
    Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k?1)) and a frame of an ambient HOA component ({tilde over (C)}AMB(k?1)). The ambient HOA component ({tilde over (C)}AMB(k?1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (cAMB,n(k?1)) in lower positions and second HOA coefficient sequences (cAMB,n(k?1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
    Type: Application
    Filed: February 8, 2018
    Publication date: August 16, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
  • Publication number: 20180234704
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Application
    Filed: August 3, 2016
    Publication date: August 16, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Robin ATKINS, Peng YIN, Taoran LU, Jaclyn Anne PYTLARZ
  • Publication number: 20180234109
    Abstract: The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values.
    Type: Application
    Filed: April 12, 2018
    Publication date: August 16, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Oliver Wuebbolt
  • Publication number: 20180233156
    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
    Type: Application
    Filed: August 23, 2016
    Publication date: August 16, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen BREEBAART, David Matthew COOPER, Leif Jonas SAMUELSSON
  • Publication number: 20180227691
    Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.
    Type: Application
    Filed: August 4, 2016
    Publication date: August 9, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Publication number: 20180218576
    Abstract: Techniques for low bit rate parametric encoding of haptic-tactile signals. The techniques encompass a parametric encoding method. The parametric encoding method includes the steps of: for at least one frame of a plurality of frames of a source haptic-tactile signal, representing the source haptic-tactile signal in the frame as a set of parameters and according to a functional representation; and including the set of parameters in a bit stream that encodes the source haptic-tactile signal. The functional representation is based on one of a set of orthogonal functionals, or polynomial approximation. For example, the functional representation can be based on one of Chebyshev functionals of the first kind through order n, Chebyshev functionals of the second kind through order n, or k-th order polynomial approximation.
    Type: Application
    Filed: August 3, 2016
    Publication date: August 2, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Sunil BHARITKAR, Charles Q. ROBINSON, Vivek KUMAR, Jeffrey RIEDMILLER, Christof FERSCH
  • Publication number: 20180218749
    Abstract: Methods for detecting whether a rendered version of a specified seamless connection (“SSC”) at a connection point between two audio segment sequences results in an audible discontinuity, and methods for analyzing at least one SSC between audio segment sequences to determine whether a renderable version of each SSC would have an audible discontinuity at the connection point when rendered, and in appropriate cases, for a SSC having a renderable version which is determined to have an audible discontinuity when rendered, correcting at least one audio segment of at least one segment sequence to be connected in accordance with the SSC in an effort to ensure that rendering of the SSC will result in seamless connection without an audible discontinuity. Other aspects are editing systems configured to implement any of the methods, and storage media and rendering systems which store audio data generated in accordance with any of the methods.
    Type: Application
    Filed: July 26, 2016
    Publication date: August 2, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Roy M. FEJGIN, Freddie SANCHEZ, Vinay MELKOTE, Michael WARD
  • Publication number: 20180218741
    Abstract: From an HOA signal representation (c(t)) of a sound field having an order of N and a number 0=(N+1)2 of coefficient sequences a mezzanine HOA signal representation (wMEZZ(t)) is generated that consists of an arbitrary number I<0 of virtual loudspeaker signals WMEZZ1(t), WMEZZ,2(t), . . . , WMEZZ,I(t). 0 directions are computed which are nearly uniformly distributed on the unit sphere. The mode vectors with respect to these directions are linearly weighted for constructing a matrix, of which the pseudo-inverse is used for multiplying the HOA signal representation (c(t)) in order to form (11) the mezzanine HOA signal representation (WMEZZ(t)).
    Type: Application
    Filed: July 29, 2016
    Publication date: August 2, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Florian Keiler, Sven Kordon, Alexander Krueger
  • Publication number: 20180220248
    Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.
    Type: Application
    Filed: March 21, 2018
    Publication date: August 2, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexander Krüger, Sven Kordon, Johannes Boehm, Johann-Markus Batke
  • Publication number: 20180217485
    Abstract: Projector display systems comprising a light dimmer and first modulator are disclosed. The light dimmer may comprise an adjustable iris, adjustable light sources and/or LCD stack that is capable of lowering the luminance of the light source illuminated the first modulator. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the light dimmer and first modulator. The display system may render a desired dynamic range for rendering a projected image by a combination of such control signals.
    Type: Application
    Filed: March 27, 2018
    Publication date: August 2, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Douglas J. GORNY, Douglas J. DARROW, Craig TODD, Martin J. RICHARDS
  • Publication number: 20180220144
    Abstract: In a method to code and transmit scalable HDR video signals, HDR signals are processed and encoded in the IPT-PQ color space to generate a base layer at reduced spatial resolution and/or dynamic range, and an enhancement layer with a residual signal. A signal reshaping block before the base layer encoder allows for improved coding of HDR signals using a reduced bit depth. A decoder can use a BL decoder and backward reshaping to generate a decoded BL HDR signal at a reduced dynamic range and/or spatial resolution, or it can combine the decoded BL HDR signal and the EL stream to generate a decoded HDR signal at full dynamic range and full resolution.
    Type: Application
    Filed: July 20, 2016
    Publication date: August 2, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming Su, Sheng Qu, Samir N. Hulyalkar, David Brooks
  • Publication number: 20180211682
    Abstract: In an apparatus configured to perform signal processing on audio data of a first sampling rate, methods disclosed herein comprise receiving audio data of a second sampling rate, the second sampling rate being higher than the first sampling rate. The methods comprise applying filtering to the audio data of the second sampling rate to thereby produce first filtered audio data and second filtered audio data, the first filtered audio data comprising mainly component frequencies which are audible to the human ear, the second filtered audio data comprising mainly components frequencies which are substantially inaudible to the human ear. The methods further comprise applying first signal processing to the first filtered audio data; and applying second signal processing to the second filtered audio data, the second signal processing having a lower computational complexity than the first signal processing. Corresponding apparatus and computer readable media are also disclosed herein.
    Type: Application
    Filed: September 21, 2016
    Publication date: July 26, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Nicholas L. ENGEL, Nicholas Luke APPLETON, Alan J. SEEFELDT
  • Publication number: 20180210220
    Abstract: Shaped glasses have curved surface lenses with spectrally complementary filters disposed thereon. The filters curved surface lenses are configured to compensate for wavelength shifts occurring due to viewing angles and other sources. Complementary images are projected for viewing through projection filters having passbands that pre-shift to compensate for subsequent wavelength shifts. At least one filter may have more than 3 primary passbands. For example, two filters include a first filter having passbands of low blue, high blue, low green, high green, and red, and a second filter having passbands of blue, green, and red. The additional passbands may be utilized to more closely match a color space and white point of a projector in which the filters are used. The shaped glasses and projection filters together may be utilized as a system for projecting and viewing 3D images.
    Type: Application
    Filed: March 21, 2018
    Publication date: July 26, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Martin J. Richards, Wilson Allen, Gary D. Gomes
  • Publication number: 20180210217
    Abstract: 3D glasses include an absorptive layer in a single lens of the glasses. The absorptive layer may be specifically tailored for spectral separation characteristics of a 3D filter portion of the lens. The absorptive layer may be combined with, and work in conjunction with, interference layers of a lens while also operating separately as an absorber. The absorptive layer may include biometric variations and/or positive runout. The absorptive layer may selectively absorb more of one color than other colors. A balancing absorber may be included in an opposite eye channel.
    Type: Application
    Filed: January 23, 2018
    Publication date: July 26, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Barret LIPPEY, Martin J. RICHARDS
  • Publication number: 20180210695
    Abstract: Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.
    Type: Application
    Filed: March 23, 2018
    Publication date: July 26, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Nicolas R. TSINGOS, Rhonda WILSON, Sunil BHARITKAR, C. Phillip BROWN, Alan J. SEEFELDT, Remi AUDFRAY
  • Publication number: 20180211440
    Abstract: Directional image sensor data may be acquired with one or more directional image sensors. A light source and illumination image may be generated based on the directional image sensor data. A number of operations may be caused to be performed for an image based at least in part on light source information in the light source image. The operations may include display management operations, device positional operations, augmented reality superimposition operations, ambient light control operations, etc.
    Type: Application
    Filed: July 21, 2016
    Publication date: July 26, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Timo KUNKEL, Patrick GRIFFIS, Gregory John WARD
  • Publication number: 20180213246
    Abstract: Implementations are provided that relate, for example, to view tiling in video encoding and decoding. A particular method includes accessing a video picture that includes multiple pictures combined into a single picture (826), accessing information indicating how the multiple pictures in the accessed video picture are combined (806, 808, 822), decoding the video picture to provide a decoded representation of at least one of the multiple pictures (824, 826), and providing the accessed information and the decoded video picture as output (824, 826). Some other implementations format or process the information that indicates how multiple pictures included in a single video picture are combined into the single video picture, and format or process an encoded representation of the combined multiple pictures.
    Type: Application
    Filed: March 26, 2018
    Publication date: July 26, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Purvin Bibhas PANDIT, Peng YIN, Dong TIAN
  • Publication number: 20180211677
    Abstract: The present document relates an audio encoding and decoding system (referred to as an audio codec system). In particular, the present document relates to a transform-based audio codec system which is particularly well suited for voice encoding/decoding. A quantization unit configured to quantize a first coefficient of a block of coefficients is described. The block of coefficients comprises a plurality of coefficients for a plurality of corresponding frequency bins. The quantization unit is configured to provide a set of quantizers. The set of quantizers comprises a plurality of different quantizers associated with a plurality of different signal-to-noise ratios, referred to as SNR, respectively. The plurality of different quantizers includes a noise-filling quantizer; one or more dithered quantizers; and one or more un-dithered quantizers.
    Type: Application
    Filed: March 22, 2018
    Publication date: July 26, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Janusz KLEJSA, Lars VILLEMOES, Per HEDELIN
  • Publication number: 20180211672
    Abstract: A method for performing audio inpainting, wherein missing portions in an input audio signal are recovered and a recovered audio signal is obtained, comprises computing a Short-Time Fourier Transform (STFT) on portions of the input audio signal, computing conditional expectations of the source power spectra of the input audio signal, wherein estimated source power spectra P(f, n, j) are obtained and wherein the variance tensor V and complex Short-Time Fourier Transform (STFT) coefficients of the input audio signals are used, iteratively re-calculating the variance tensor V from the estimated power spectra P(f, n, j) and re-calculating updated estimated power spectra P(f, n, j), computing an array of STFT coefficients ? from the resulting variance tensor V according to ?(f, n, j)=E{S(f, n, j)|x, Is, IL, V}, and converting the array of STFT coefficients ? to the time domain, wherein coefficients {tilde over (s)}1, {tilde over (s)}2, . . . , {tilde over (s)}j of the recovered audio signal are obtained.
    Type: Application
    Filed: April 6, 2016
    Publication date: July 26, 2018
    Applicant: Dolby International AB
    Inventors: Cagdas BILEN, Alexey OZEROV, Patrick PEREZ
  • Publication number: 20180204581
    Abstract: According to an aspect of the present invention, a method for reconstructing an audio signal having a baseband portion and a highband portion is disclosed. The method includes obtaining a decoded baseband audio signal by decoding an encoded audio signal and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal. The method further includes generating a noise component based on a noise parameter. Finally, the method includes adjusting a phase of the high-frequency reconstructed signal and obtaining a time-domain reconstructed audio signal by combining the decoded baseband audio signal and the combined high-frequency signal to obtain a time-domain reconstructed audio signal.
    Type: Application
    Filed: March 15, 2018
    Publication date: July 19, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael M. Truman, Mark S. Vinton
  • Publication number: 20180197561
    Abstract: Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to the substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuating segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
    Type: Application
    Filed: March 7, 2018
    Publication date: July 12, 2018
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Per Hedelin, Arijit Biswas, Michael Schug, Vinay Melkote
  • Publication number: 20180199130
    Abstract: A time heuristic audio control system, comprises a receiver for receiving time-based data from a personal computing device and a memory storing one or more sets processing parameters comprising instructions for processing the ambient sound based upon the time-based data. The system further includes a processor coupled to the memory and the receiver configured to adjust the ambient sound as directed by a selected set of processing parameters retrieved from the memory to create adjusted audio, the selected set of processing parameters retrieved based upon the time-based data and at least one speaker for outputting the adjusted audio.
    Type: Application
    Filed: March 9, 2018
    Publication date: July 12, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Matthew J. Jaffe, Noah Kraft, Richard Fritz Lanman, III, Jeffrey Baker
  • Publication number: 20180197562
    Abstract: Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A compression process reduces an original dynamic range of an initial audio signal through a compression process that divides the initial audio signal into a plurality of segments using a defined window shape, calculates a wideband gain in the frequency domain using a non-energy based average of frequency domain samples of the initial audio signal, and applies individual gain values to amplify segments of relatively low intensity and attenuate segments of relatively high intensity. The compressed audio signal is then expanded back to the substantially the original dynamic range that applies inverse gain values to amplify segments of relatively high intensity and attenuating segments of relatively low intensity. A QMF filterbank is used to analyze the initial audio signal to obtain a frequency domain representation.
    Type: Application
    Filed: March 8, 2018
    Publication date: July 12, 2018
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Per Hedelin, Arijit Biswas, Michael Schug, Vinay Melkote
  • Publication number: 20180192062
    Abstract: In a decoder, a processor extracts a control map of false contour filtering from a part of a multi-layer video signal that includes a low dynamic range image mapped from an original high-dynamic range (HDR) image. It determines one or more filter parameters for a sparse finite-impulse-response (FIR) filter, where the one or more filter parameters relate to at least in part on the control map of false contour filtering and a predicted image predicted from the low dynamic range image. It applies the sparse FIR filter to filter pixel values in a portion of the predicted image based at least in part on the control map of false contour filtering, and it reconstructs a version of the original HDR image based at least in part on the portion of the predicted image as filtered by the FIR filter.
    Type: Application
    Filed: March 30, 2017
    Publication date: July 5, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming Su, Qian Chen, Peng Yin, Sheng Qu
  • Publication number: 20180192230
    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent.
    Type: Application
    Filed: February 26, 2018
    Publication date: July 5, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Charles Q. ROBINSON, Nicolas R. TSINGOS, Christophe CHABANNE