Dolby Labs Patents

Dolby Laboratories, Inc. licenses its audio technologies, including its noise-reduction systems, to the media industry. Its product portfolio includes Dolby Digital Plus (DD+), Dolby Digital (DD), AAC and HE-AAC, Dolby TrueHD, Dolby Atmos, Dolby AC-4, Dolby Voice and Dolby Vision. Products that incorporate Dolby technologies include televisions, set-top boxes, computers, DVD and Blu-ray devices, soundbars, smartphones, tablets, video game consoles, and automobile entertainment systems.

Dolby Labs Patents by Type

  • Patent number: 10068577
    Abstract: A method of encoding adaptive audio, comprising receiving N objects and associated spatial metadata that describes the continuing motion of these objects, and partitioning the audio into segments based on the spatial metadata. The method encodes adaptive audio having objects and channel beds by capturing a continuing motion of a number N objects in a time-varying matrix trajectory comprising a sequence of matrices, coding coefficients of the time-varying matrix trajectory in spatial metadata to be transmitted via a high-definition audio format for rendering the adaptive audio through a number M output channels, and segmenting the sequence of matrices into a plurality of sub-segments based on the spatial metadata, wherein the plurality of sub-segments are configured to facilitate coding of one or more characteristics of the adaptive audio.
    Type: Grant
    Filed: April 23, 2015
    Date of Patent: September 4, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Vinay Melkote, Malcolm James Law, Roy M. Fejgin
  • Publication number: 20180247646
    Abstract: One or more context aware processing parameters and an ambient audio stream are received. One or more sound characteristics associated with the ambient audio stream are identified using a machine learning model. One or more actions to perform are determined using the machine learning model and based on the one or more context aware processing parameters and the identified one or more sound characteristics. The one or more actions are performed.
    Type: Application
    Filed: December 21, 2017
    Publication date: August 30, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jacob Meacham, Matthew Sills, Richard Fritz Lanman, III, Jeffrey Baker
  • Publication number: 20180247656
    Abstract: Conventional audio compression technologies perform a standardized signal transformation, independent of the type of the content. Multi-channel signals are decomposed into their signal components, subsequently quantized and encoded. This is disadvantageous due to lack of knowledge on the characteristics of scene composition, especially for e.g. multi-channel audio or Higher-Order Ambisonics (HOA) content. An improved method for encoding pre-processed audio data comprises encoding the pre-processed audio data, and encoding auxiliary data that indicate the particular audio pre-processing. An improved method for decoding encoded audio data comprises determining that the encoded audio data had been pre-processed before encoding, decoding the audio data, extracting from received data information about the pre-processing, and post-processing the decoded audio data according to the extracted pre-processing information.
    Type: Application
    Filed: April 30, 2018
    Publication date: August 30, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Oliver WUEBBOLT, Johannes BOEHM, Peter JAX
  • Publication number: 20180247660
    Abstract: An audio processing system (100) accepts an audio bitstream having one of a plurality of predefined audio frame rates. The system comprises a front-end component (110), which receives a variable number of quantized spectral components, corresponding to one audio frame in any of the predefined audio frame rates, and performs an inverse quantization according to predetermined, frequency-dependent quantization levels. The front-end component may be agnostic of the audio frame rate. The audio processing system further comprises a frequency-domain processing stage (120) and a sample rate converter (130), which provide a reconstructed audio signal sampled at a target sampling frequency independent of the audio frame rate. By its frame-rate adaptability, the system can be configured to operate frame-synchronously in parallel with a video processing system that accepts plural video frame rates.
    Type: Application
    Filed: February 23, 2018
    Publication date: August 30, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Kristofer KJOERLING, Alexander STAHLMANN, Jens POPP, Karl Jonas ROEDEN
  • Patent number: 10063845
    Abstract: A 3D display is characterized by a quality of viewing experience (QVE) mapping which represents a display-specific input-output relationship between input depth values and output QVE values. Examples of QVE mappings based on a metric of “viewing blur” are presented. Given reference depth data generated for a reference display and a representation of an artist's mapping function, which represents an input-output relationship between original input depth data and QVE data generated using a QVE mapping for a reference display, a decoder may reconstruct the reference depth data and apply an inverse QVE mapping for a target display to generate output depth data optimized for the target display.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: August 28, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Gopi Lakshminarayan, Samir Hulyalkar, Tao Chen, Hariharan Ganapathy, Santosh Chilkunda
  • Patent number: 10063985
    Abstract: Audio signals (201) are received. The audio signals include left and right surround channels (206). The audio signals are played back using far-field loudspeakers (101-108, 401-406) distributed around a space (111, 409) having a plurality of listener positions (112, 410). The left and right surround channels are played back by a pair of far-field loudspeakers (103, 106, 403, 405) arranged at opposite sides of the space having the plurality of listener positions. An audio component (208) coinciding with or approximating audio content common to the left and right surround channels is obtained. The audio component is played back using at least a pair of near-field transducers (109, 110, 407, 408) arranged at one of the listener positions. Associated systems (100, 400), methods (800) and computer program products are provided.
    Type: Grant
    Filed: May 12, 2016
    Date of Patent: August 28, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Remi Audfray, Nicolas R. Tsingos, Jurgen W. Scharpf
  • Patent number: 10062333
    Abstract: Systems and methods are disclosed for dynamically adjusting the backlight of a display during video playback or for generating filtered video metadata. Given an input video stream and associated metadata values of minimum, average, or maximum luminance values of the video frames in the video stream, values of a function of the frame min, mid, or max luminance values are filtered using a temporal filter to generate a filtered output value for each frame. At least one filtering coefficient of the temporal filter is adapted based on a logistic function controlled by slope and sensitivity values. The instantaneous dynamic range of a target display is determined based on the filtered metadata values and the minimum and maximum brightness values of the display.
    Type: Grant
    Filed: October 5, 2016
    Date of Patent: August 28, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Suzanne Farrell, Scott Daly, Robin Atkins, Timo Kunkel, Kimball Darr Thurston, III
  • Publication number: 20180240437
    Abstract: A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.
    Type: Application
    Filed: April 23, 2018
    Publication date: August 23, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jon Scott MILLER, Scott DALY, Mahdi NEZAMABADI, Robin ATKINS
  • Publication number: 20180240470
    Abstract: Example embodiments disclosed herein relate to source separation in audio content. A method for separating sources from audio content is disclosed, the audio content being of a multi-channel format based on a plurality of channels. The method comprises performing a component analysis on the audio content for each of the plurality of channels to generate a plurality of components, each of the plurality of components comprising a plurality of time-frequency tiles in full frequency band; generating at least one dominant source with at least one of the time-frequency tiles from the plurality of the components and separating the sources from the audio content by estimating spatial parameters and spectral parameters based on the dominant source. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: February 11, 2016
    Publication date: August 23, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Jun WANG
  • Publication number: 20180242077
    Abstract: A soundbar speaker for transmitting reflected sound waves off an upper surface down to a listening environment, comprising: a cabinet containing a plurality of audio drivers, direct-firing drivers within the cabinet oriented to transmit sound along a horizontal axis substantially perpendicular to a front surface of the cabinet, and a pair of upward-firing slotted drivers placed proximate to ends of an top surface of the cabinet and oriented at an inclination angle relative to the horizontal axis. The slotted drivers are configured to create an overlapping reflected sound projection for high frequency sound when reflected down to a listening position located at a distance in front of the speaker pair. Such a speaker projects reflected sound that provides wider horizontal or side-to-side dispersion to better cover the listening area.
    Type: Application
    Filed: August 11, 2016
    Publication date: August 23, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael J. SMITHERS, Alan J. SEEFELDT
  • Publication number: 20180240469
    Abstract: Higher Order Ambisonics represents three-dimensional sound independent of a specific loudspeaker set-up. However, transmission of an HOA representation results in a very high bit rate. Therefore compression with a fixed number of channels is used, in which directional and ambient signal components are processed differently. For coding, portions of the original HOA representation are predicted from the directional signal components. This prediction provides side information which is required for a corresponding decoding. By using some additional specific purpose bits, a known side information coding processing is improved in that the required number of bits for coding that side information is reduced on average.
    Type: Application
    Filed: April 18, 2018
    Publication date: August 23, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexander KRUEGER, Sven KORDON, Oliver WUEBBOLT
  • Publication number: 20180240465
    Abstract: The present disclosure provides methods, devices and computer program products for encoding and decoding of a vector of parameters in an audio coding system. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system. According to the disclosure, a modulo differential approach for coding and encoding a vector of a non-periodic quantity may improve the coding efficiency and provide encoders and decoders with less memory requirements. Moreover, an efficient method for encoding and decoding a sparse matrix is provided.
    Type: Application
    Filed: April 5, 2018
    Publication date: August 23, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leif Jonas SAMUELSSON, Heiko PURNHAGEN
  • Patent number: 10056042
    Abstract: Systems and methods are disclosed for filtering metadata to be used in display management. Given an input video stream and input metadata comprising at least one of minimum, average, or maximum luminance values of the video frames in the video stream, values of a function of the input metadata are filtered using a temporal filter to generate filtered metadata, wherein the filtering is based only on metadata for input frames in the same scene. Methods for temporal filtering based on an exponential moving average filter or a look-ahead sliding window filter are presented, including methods for scene-change detection using the input metadata.
    Type: Grant
    Filed: May 11, 2016
    Date of Patent: August 21, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Robin Atkins, Gregory John Ward, Samir N. Hulyalkar, Guan-Ming Su, Sheng Qu
  • Patent number: 10056055
    Abstract: Apparatus and methods for mapping video signal parameters such as tone and color may be applied at various points in a video generation and delivery pipeline. apparatus may be configured to control mappings based on a range of inputs which may include one or more of: ambient conditions, user inputs, control information, adaptation models. Apparatus and methods may be applied to display video or other images so as to preserve a creative intent embodied in video or other image data.
    Type: Grant
    Filed: February 18, 2011
    Date of Patent: August 21, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Helge Seetzen, Robin Atkins, Neil W. Messmer, Gerwin Damberg
  • Patent number: 10055866
    Abstract: Systems and methods for overlaying a second image/video data onto a first image/video data are described herein. The first image/video data may be intended to be rendered on a display with certain characteristics—e.g., HDR, EDR, VDR or UHD capabilities. The second image/video data may comprise graphics, closed captioning, text, advertisement—or any data that may be desired to be overlaid and/or composited onto the first image/video data. The second image/video data may be appearance mapped according to the image statistics and/or characteristics of the first image/video data. In addition, such appearance mapping may be made according to the characteristics of the display that the composite data is to be rendered. Such appearance mapping is desired to render a composite data that is visually pleasing to a viewer, rendered upon a desired display.
    Type: Grant
    Filed: January 27, 2014
    Date of Patent: August 21, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Timo Kunkel, Ning Xu, Tao Chen, Bongsun Lee, Samir N. Hulyalkar
  • Patent number: 10057808
    Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters relating to parametric spatial coding of audio signals. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.
    Type: Grant
    Filed: May 2, 2017
    Date of Patent: August 21, 2018
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Per Ekstrand
  • Patent number: 10057707
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations involve receiving or determining conversational dynamics data. One or more variables of a cost function may be based, at least in part, on the conversational dynamics data. The cost function may be a spatial optimization cost function of a vector describing a virtual conference participant position for each of the conference participants in a virtual acoustic space. The virtual acoustic space may be determined relative to a listener's head. The virtual conference participant positions may be assigned according to a solution of the cost function.
    Type: Grant
    Filed: February 3, 2016
    Date of Patent: August 21, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Richard J. Cartwright, Hannes Muesch
  • Patent number: 10057581
    Abstract: Methods and systems for improving coding decoding efficiency of video by providing a syntax modeler, a buffer, and a decoder. The syntax modeler may associate a first sequence of symbols with syntax elements. The buffer may store tables, each represented by a symbol in the first sequence, and each used to associate a respective symbol in a second sequence of symbols with encoded data. The decoder decodes the data into a bitstream using the second sequence retrieved from a table.
    Type: Grant
    Filed: June 15, 2017
    Date of Patent: August 21, 2018
    Assignee: Dolby International AB
    Inventors: Yeping Su, Christopher A. Segall
  • Patent number: 10057708
    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent.
    Type: Grant
    Filed: February 26, 2018
    Date of Patent: August 21, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Charles Q. Robinson, Nicolas R. Tsingos, Christophe Chabanne
  • Publication number: 20180234069
    Abstract: A method in a computing device configured to perform volume-levelling processing on input audio data by at least applying one or more filters to the input audio data, the computing device being configured to obtain an estimated difference between a target output loudness level and a loudness level associated with the input audio data, and to adapt the filter coefficients of the one or more filters based on the estimated difference. The method involves starting, or stopping, the volume-levelling processing. The method comprises gradually increasing, or decreasing, a weighting applied to the estimated difference, in response to obtaining an indication to start, or stop, the volume-levelling processing.
    Type: Application
    Filed: August 17, 2016
    Publication date: August 16, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Mark David DE BURGH
  • Publication number: 20180234704
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Application
    Filed: August 3, 2016
    Publication date: August 16, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Robin ATKINS, Peng YIN, Taoran LU, Jaclyn Anne PYTLARZ
  • Publication number: 20180234785
    Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (XPS(k?1)) and a frame of an ambient HOA component ({tilde over (C)}AMB(k?1)). The ambient HOA component ({tilde over (C)}AMB(k?1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (cAMB,n(k?1)) in lower positions and second HOA coefficient sequences (cAMB,n(k?1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
    Type: Application
    Filed: February 8, 2018
    Publication date: August 16, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
  • Publication number: 20180234109
    Abstract: The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values.
    Type: Application
    Filed: April 12, 2018
    Publication date: August 16, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Oliver Wuebbolt
  • Publication number: 20180233156
    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
    Type: Application
    Filed: August 23, 2016
    Publication date: August 16, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen BREEBAART, David Matthew COOPER, Leif Jonas SAMUELSSON
  • Publication number: 20180234612
    Abstract: Methods, systems, and computer program products for automatically positioning a content capturing device are disclosed. A vehicle, e.g., an UAV, carries the content capturing device, e.g., a camcorder. The UAV can position the content capturing device at a best location for viewing a subject based on one or more audio or visual cues. The UAV can follow movement of the subject to achieve best audio or visual effect. In some implementations, a controller device carried by the subject can generate one or more signals for the UAV to follow. The controller device may be coupled to a microphone that records audio. The signals can be used to temporally synchronize video captured at the UAV and audio captured by the microphone.
    Type: Application
    Filed: October 17, 2017
    Publication date: August 16, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Timo KUNKEL, Cong ZHOU, Vivek KUMAR, Remi S. AUDFRAY
  • Publication number: 20180234784
    Abstract: Higher Order Ambisonics (HOA) signals can be compressed by decomposition into a predominant sound component and a residual ambient component. The compressed representation comprises pre-dominant sound signals, coefficient sequences of the ambient component and side information. For efficiently combining HOA decompression and HOA rendering to obtain loudspeaker signals, combined rendering and decoding of the compressed HOA signal comprises perceptually decoding the perceptually coded portion and decoding the side information, without reconstructing HOA coefficient sequences. For reconstructing components of a first type, fading of coefficient sequences is not required, while for components of a second type fading is required. For each second type component, different linear operations are determined: one for coefficient sequences that in a current frame require no fading, one for those that require fading-in, and one for those that require fading-out.
    Type: Application
    Filed: March 1, 2016
    Publication date: August 16, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Patent number: 10049683
    Abstract: This disclosure falls into the field of audio coding, in particular it is related to the field of spatial audio coding, where the audio information is represented by multiple signals, where the signals may comprise audio channels or/and audio objects. In particular the disclosure provides a method and apparatus for reconstructing audio objects in an audio decoding system. Furthermore, this disclosure provides a method and apparatus for encoding such audio objects.
    Type: Grant
    Filed: October 21, 2014
    Date of Patent: August 14, 2018
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Janusz Klejsa, Lars Villemoes, Toni Hirvonen
  • Patent number: 10051400
    Abstract: A method of outputting audio in a teleconferencing environment includes receiving audio streams, processing the audio streams according to information regarding effective spatial positions, and outputting, by at least three speakers arranged in more than one dimension, the audio streams having been processed. The information regarding the plurality of effective spatial positions corresponds to a perceived spatial scene that extends beyond the speakers in at least two dimensions. In this manner, participants in the teleconference perceive the audio from the remote participants as originating at different positions in the teleconference room.
    Type: Grant
    Filed: March 21, 2013
    Date of Patent: August 14, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: David S. McGrath, Glenn N. Dickins, Paul Holmberg, Gary Spittle, Michael Eckert
  • Publication number: 20180227691
    Abstract: Example embodiments disclosed herein relate to audio signal processing. The audio signal has multiple audio objects. A method of processing an audio signal is disclosed. The method includes obtaining an object position for each of the audio objects; and determining cluster positions for grouping the audio objects into clusters based on the object positions, a plurality of object-to-cluster gains, and a set of metrics. The metrics indicate a quality of the cluster positions and a quality of the object-to-cluster gains, each of the cluster positions is a centroid of a respective one of the clusters, and one of the object-to-cluster gains defines a ratio of the respective audio object in one of the clusters. The method also includes determining the object-to-cluster gains based on the object positions, the cluster positions and the set of metrics; and generating a cluster signal based on the determined cluster positions and object-to-cluster gains.
    Type: Application
    Filed: August 4, 2016
    Publication date: August 9, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Dirk Jeroen Breebaart
  • Patent number: 10044337
    Abstract: Equalizer controller and controlling method are disclosed. In one embodiment, an equalizer controller includes an audio classifier for identifying the audio type of an audio signal in real time; and an adjusting unit for adjusting an equalizer in a continuous manner based on the confidence value of the audio type as identified.
    Type: Grant
    Filed: February 15, 2017
    Date of Patent: August 7, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lie Lu, Jun Wang, Alan J. Seefeldt, Mingqing Hu
  • Patent number: 10043526
    Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
    Type: Grant
    Filed: October 13, 2015
    Date of Patent: August 7, 2018
    Assignee: Dolby International AB
    Inventors: Per Ekstrand, Lars Villemoes
  • Patent number: 10043528
    Abstract: The present document relates an audio encoding and decoding system (referred to as an audio codec system). In particular, the present document relates to a transform-based audio codec system which is particularly well suited for voice encoding/decoding. A transform-based speech encoder (100, 170) configured to encode a speech signal into a bitstream is described. The encoder (100, 170) comprises a framing unit (101) configured to receive a set (132, 332) of blocks; wherein the set (132, 332) of blocks comprises a plurality of sequential blocks (131) of transform coefficients; wherein the plurality of blocks (131) is indicative of samples of the speech signal; wherein a block (131) of transform coefficients comprises a plurality of transform coefficients for a corresponding plurality of frequency bins (301).
    Type: Grant
    Filed: April 4, 2014
    Date of Patent: August 7, 2018
    Assignee: Dolby International AB
    Inventors: Lars Villemoes, Janusz Klejsa, Per Hedelin
  • Patent number: 10045115
    Abstract: Personal audio systems and methods are disclosed. A personal audio system includes a voice activity detector to determine whether or not an ambient audio stream contains voice activity, a pitch estimator to determine a frequency of a fundamental component of an annoyance noise contained in the ambient audio stream, and a filter bank to attenuate the fundamental component and at least one harmonic component of the annoyance noise to generate a personal audio stream. The filter bank implements a first filter function when the ambient audio stream does not contain voice activity, or a second filter function when the ambient audio stream contains voice activity.
    Type: Grant
    Filed: December 19, 2016
    Date of Patent: August 7, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Gints Klimanis, Anthony Parks
  • Publication number: 20180217485
    Abstract: Projector display systems comprising a light dimmer and first modulator are disclosed. The light dimmer may comprise an adjustable iris, adjustable light sources and/or LCD stack that is capable of lowering the luminance of the light source illuminated the first modulator. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the light dimmer and first modulator. The display system may render a desired dynamic range for rendering a projected image by a combination of such control signals.
    Type: Application
    Filed: March 27, 2018
    Publication date: August 2, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Douglas J. GORNY, Douglas J. DARROW, Craig TODD, Martin J. RICHARDS
  • Publication number: 20180220144
    Abstract: In a method to code and transmit scalable HDR video signals, HDR signals are processed and encoded in the IPT-PQ color space to generate a base layer at reduced spatial resolution and/or dynamic range, and an enhancement layer with a residual signal. A signal reshaping block before the base layer encoder allows for improved coding of HDR signals using a reduced bit depth. A decoder can use a BL decoder and backward reshaping to generate a decoded BL HDR signal at a reduced dynamic range and/or spatial resolution, or it can combine the decoded BL HDR signal and the EL stream to generate a decoded HDR signal at full dynamic range and full resolution.
    Type: Application
    Filed: July 20, 2016
    Publication date: August 2, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Guan-Ming Su, Sheng Qu, Samir N. Hulyalkar, David Brooks
  • Publication number: 20180218741
    Abstract: From an HOA signal representation (c(t)) of a sound field having an order of N and a number 0=(N+1)2 of coefficient sequences a mezzanine HOA signal representation (wMEZZ(t)) is generated that consists of an arbitrary number I<0 of virtual loudspeaker signals WMEZZ1(t), WMEZZ,2(t), . . . , WMEZZ,I(t). 0 directions are computed which are nearly uniformly distributed on the unit sphere. The mode vectors with respect to these directions are linearly weighted for constructing a matrix, of which the pseudo-inverse is used for multiplying the HOA signal representation (c(t)) in order to form (11) the mezzanine HOA signal representation (WMEZZ(t)).
    Type: Application
    Filed: July 29, 2016
    Publication date: August 2, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Florian Keiler, Sven Kordon, Alexander Krueger
  • Publication number: 20180220248
    Abstract: A method and apparatus for decompressing a Higher Order Ambisonics (HOA) signal representation is disclosed. The apparatus includes an input interface that receives an encoded directional signal and an encoded ambient signal and an audio decoder that perceptually decodes the encoded directional signal and encoded ambient signal to produce a decoded directional signal and a decoded ambient signal, respectively. The apparatus further includes an extractor for obtaining side information related to the directional signal and an inverse transformer for converting the decoded ambient signal from a spatial domain to an HOA domain representation of the ambient signal. The apparatus also includes a synthesizer for recomposing a Higher Order Ambisonics (HOA) signal from the HOA domain representation of the ambient signal and the decoded directional signal. The side information includes a direction of the directional signal selected from a set of uniformly spaced directions.
    Type: Application
    Filed: March 21, 2018
    Publication date: August 2, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexander Krüger, Sven Kordon, Johannes Boehm, Johann-Markus Batke
  • Publication number: 20180218749
    Abstract: Methods for detecting whether a rendered version of a specified seamless connection (“SSC”) at a connection point between two audio segment sequences results in an audible discontinuity, and methods for analyzing at least one SSC between audio segment sequences to determine whether a renderable version of each SSC would have an audible discontinuity at the connection point when rendered, and in appropriate cases, for a SSC having a renderable version which is determined to have an audible discontinuity when rendered, correcting at least one audio segment of at least one segment sequence to be connected in accordance with the SSC in an effort to ensure that rendering of the SSC will result in seamless connection without an audible discontinuity. Other aspects are editing systems configured to implement any of the methods, and storage media and rendering systems which store audio data generated in accordance with any of the methods.
    Type: Application
    Filed: July 26, 2016
    Publication date: August 2, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Roy M. FEJGIN, Freddie SANCHEZ, Vinay MELKOTE, Michael WARD
  • Publication number: 20180218576
    Abstract: Techniques for low bit rate parametric encoding of haptic-tactile signals. The techniques encompass a parametric encoding method. The parametric encoding method includes the steps of: for at least one frame of a plurality of frames of a source haptic-tactile signal, representing the source haptic-tactile signal in the frame as a set of parameters and according to a functional representation; and including the set of parameters in a bit stream that encodes the source haptic-tactile signal. The functional representation is based on one of a set of orthogonal functionals, or polynomial approximation. For example, the functional representation can be based on one of Chebyshev functionals of the first kind through order n, Chebyshev functionals of the second kind through order n, or k-th order polynomial approximation.
    Type: Application
    Filed: August 3, 2016
    Publication date: August 2, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Sunil BHARITKAR, Charles Q. ROBINSON, Vivek KUMAR, Jeffrey RIEDMILLER, Christof FERSCH
  • Patent number: 10037764
    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (?e) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to ?e=? log2(?log2(?{square root over (KMAX)}·O)?+1)?.
    Type: Grant
    Filed: September 12, 2017
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Alexander Krueger, Sven Kordon
  • Patent number: 10036938
    Abstract: Projection systems and/or methods for efficient use of light by recycling a portion of the light energy are disclosed. In one embodiment, a projection display system is disclosed comprising a light source; an integrated rod that receives light from said light source at a proximal end that comprise a reflective surface which may reflecting/recycle light down said integrating rod; and a modulator comprising at least one moveable mirror that reflects light received from the integrating rod in either a projection direction or a light recycling direction. In other embodiments, dual and multiple modulator projector display systems are disclosed. A first modulator may affect either a pre-modulated halftone image or may affect a highlights modulated image for a desired image to be displayed. A second modulator may be provided for primary modulation of a desired image.
    Type: Grant
    Filed: June 19, 2015
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Douglas J. Gorny, Martin J. Richards
  • Patent number: 10037762
    Abstract: Soundfield signals such as e.g. Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics (HOA) uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design. A method for improved decoding an audio soundfield representation for audio playback comprises calculating a panning function (W) using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix (?) from the loudspeaker positions, calculating a pseudo-inverse mode matrix (?+) and decoding the audio soundfield representation. The decoding is based on a decode matrix (D) that is obtained from the panning function (W) and the pseudo-inverse mode matrix (?+).
    Type: Grant
    Filed: August 21, 2017
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Johann-Markus Batke, Florian Keiler, Johannes Boehm
  • Patent number: 10037763
    Abstract: Apparatus and methods for generating an encoded audio bitstream, including by including substream structure metadata (SSM) and/or program information metadata (PIM) and audio data in the bitstream. Other aspects are apparatus and methods for decoding such a bitstream, and an audio processing unit (e.g., an encoder, decoder, or post-processor) configured (e.g., programmed) to perform any embodiment of the method or which includes a buffer memory which stores at least one frame of an audio bitstream generated in accordance with any embodiment of the method.
    Type: Grant
    Filed: June 12, 2014
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Jeffrey Riedmiller, Michael Ward
  • Patent number: 10038916
    Abstract: Encoding and decoding architectures for 3D video delivery are described, such as 2D compatible 3D video delivery and frame compatible 3D video delivery. The architectures include pre-processing stages to pre-process the output of a base layer video encoder and/or decoder and input the pre-processed output into an enhancement layer video encoder and/or decoder of one or more enhancement layers. Multiplexing methods of how to combine the base and enhancement layer videos are also described.
    Type: Grant
    Filed: August 11, 2017
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Alexandros Tourapis, Peshala V. Pahalawatta, Athanasios Leontaris, Kevin J. Stec, Walter J. Husak
  • Patent number: 10038965
    Abstract: The invention improves HOA sound field representation compression. The HOA representation is analyzed for the presence of dominant sound sources and their directions are estimated. Then the HOA representation is decomposed into a number of dominant directional signals and a residual component. This residual component is transformed into the discrete spatial domain in order to obtain general plane wave functions at uniform sampling directions, which are predicted from the dominant directional signals. Finally, the prediction error is transformed back to the HOA domain and represents the residual ambient HOA component for which an order reduction is performed, followed by perceptual encoding of the dominant directional signals and the residual component.
    Type: Grant
    Filed: February 16, 2017
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Alexander Krueger, Sven Kordon, Johannes Boehm
  • Patent number: 10038961
    Abstract: Example embodiments disclosed herein relate to modelling a frequency response characteristic of an electro-acoustic transducer. A method includes obtaining at least one measurement of the frequency response characteristic for at least one electro-acoustic transducer of the category. A model of a frequency response characteristic specific to a category of electro-acoustic transducers is generated at least in part based on perceptual importance of a frequency band, an averaged, normalized or microphone compensated measurement such that the distortion of the model is optimized. A further method for estimating a frequency response characteristic of an electro-acoustic transducer is based on the generated model and the sensitivity of the electro-acoustic transducer or headphone. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: June 2, 2015
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Guilin Ma, Xiguang Zheng, C. Phillip Brown
  • Patent number: 10038891
    Abstract: Stereoscopic images are subsampled and placed in a “checkerboard” pattern in an image. The image is encoded in a monoscopic video format. The monoscopic video is transmitted to a device where the “checkerboard” is decoded. Portions of the checkerboard (e.g., “black” portions) are used to reconstruct one of the stereoscopic images and the other portion of the checkerboard (e.g., “white” portions) are used to reconstruct the other image. The subsamples are, for example, taken from the image in a location coincident to the checkerboard position in which the subsamples are encoded.
    Type: Grant
    Filed: May 2, 2018
    Date of Patent: July 31, 2018
    Assignee: Dolby Laboratories Licensing Coporation
    Inventors: Walter J. Husak, David Ruhoff, Alexandros Tourapis, Athanasios Leontaris
  • Publication number: 20180210217
    Abstract: 3D glasses include an absorptive layer in a single lens of the glasses. The absorptive layer may be specifically tailored for spectral separation characteristics of a 3D filter portion of the lens. The absorptive layer may be combined with, and work in conjunction with, interference layers of a lens while also operating separately as an absorber. The absorptive layer may include biometric variations and/or positive runout. The absorptive layer may selectively absorb more of one color than other colors. A balancing absorber may be included in an opposite eye channel.
    Type: Application
    Filed: January 23, 2018
    Publication date: July 26, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Barret LIPPEY, Martin J. RICHARDS
  • Publication number: 20180210695
    Abstract: Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.
    Type: Application
    Filed: March 23, 2018
    Publication date: July 26, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Nicolas R. TSINGOS, Rhonda WILSON, Sunil BHARITKAR, C. Phillip BROWN, Alan J. SEEFELDT, Remi AUDFRAY
  • Publication number: 20180211440
    Abstract: Directional image sensor data may be acquired with one or more directional image sensors. A light source and illumination image may be generated based on the directional image sensor data. A number of operations may be caused to be performed for an image based at least in part on light source information in the light source image. The operations may include display management operations, device positional operations, augmented reality superimposition operations, ambient light control operations, etc.
    Type: Application
    Filed: July 21, 2016
    Publication date: July 26, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Timo KUNKEL, Patrick GRIFFIS, Gregory John WARD