Dolby Labs Patent Applications

Patent applications filed by Dolby Labs and published by the U.S. Patent and Trademark Office (USPTO).

  • Publication number: 20190058886
    Abstract: For each content-mapped frame of a scene, it is determined whether the content-mapped frame is susceptible to object fragmentation with respect to texture in a homogeneous region based on statistical values derived from the content-mapped image and a source image mapped into the content-mapped image. The homogeneous region is a region of consistent texture in the source image. Based on a count of content-mapped frames susceptible to object fragmentation in the homogeneous region, it is determined whether the scene is susceptible to object fragmentation in the homogeneous region. If so, an upper limit for mapped codewords for a prediction function for predicting codewords of a predicted image from the mapped codewords in the content-mapped image is adjusted. Mapped codewords above the upper limit are clipped to the upper limit.
    Type: Application
    Filed: September 22, 2016
    Publication date: February 21, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Qian CHEN, Guan-Ming SU
  • Publication number: 20190057713
    Abstract: A method for hybrid speech enhancement which employs parametric-coded enhancement (or a blend of parametric-coded and waveform-coded enhancement) under some signal conditions and waveform-coded enhancement (or a different blend of parametric-coded and waveform-coded enhancement) under other signal conditions. Other aspects are methods for generating a bitstream indicative of an audio program including speech and other content, such that hybrid speech enhancement can be performed on the program, a decoder including a buffer which stores at least one segment of an encoded audio bitstream generated by any embodiment of the inventive method, and a system or device (e.g., an encoder or decoder) configured (e.g., programmed) to perform any embodiment of the inventive method. At least some of the speech enhancement operations are performed by a recipient audio decoder with Mid/Side speech enhancement metadata generated by an upstream audio encoder.
    Type: Application
    Filed: October 22, 2018
    Publication date: February 21, 2019
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeroen KOPPENS, Hannes MUESCH
  • Publication number: 20190058892
    Abstract: The precision of up-sampling operations in a layered coding system is preserved when operating on video data with high bit-depth. In response to bit-depth requirements of the video coding or decoding system, scaling and rounding parameters are determined for a separable up-scaling filter. Input data are first filtered across a first spatial direction using a first rounding parameter to generate first up-sampled data. First intermediate data are generated by scaling the first up-sampled data using a first shift parameter. The intermediate data are then filtered across a second spatial direction using a second rounding parameter to generate second up-sampled data. Second intermediate data are generated by scaling the second up-sampled data using a second shift parameter. Final up-sampled data may be generated by clipping the second intermediate data.
    Type: Application
    Filed: October 23, 2018
    Publication date: February 21, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Peng YIN, Taoran LU, Tao CHEN
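The filter, round, shift, and clip sequence in the abstract above can be sketched as follows. This is only an illustrative sketch, not the method claimed in the application: the 2x bilinear taps with a gain of 64, the rule `shift1 = bit_depth - 8`, and the function names `upsample_1d` and `upsample_2d` are all assumptions chosen to keep the example small.

```python
def upsample_1d(line, rounding, shift):
    """2x up-sample one line with integer taps of gain 64, then round and shift."""
    out = []
    n = len(line)
    for i in range(n):
        nxt = line[min(i + 1, n - 1)]        # clamp at the right edge
        even = 64 * line[i]                   # phase 0: copy the sample
        odd = 32 * line[i] + 32 * nxt         # phase 1: average of two neighbours
        out.append((even + rounding) >> shift)
        out.append((odd + rounding) >> shift)
    return out


def upsample_2d(image, bit_depth):
    """Separable 2x up-sampling with bit-depth-dependent rounding and shifts."""
    # Assumed parameter choice: the two stages together remove the
    # combined filter gain of 64 * 64 = 2**12.
    shift1 = max(bit_depth - 8, 0)
    shift2 = 12 - shift1
    round1 = (1 << (shift1 - 1)) if shift1 > 0 else 0
    round2 = 1 << (shift2 - 1)

    # First spatial direction (horizontal): filter, round, shift.
    horiz = [upsample_1d(row, round1, shift1) for row in image]

    # Second spatial direction (vertical): transpose, filter, transpose back.
    cols = [list(c) for c in zip(*horiz)]
    vert = [upsample_1d(col, round2, shift2) for col in cols]
    rows = [list(r) for r in zip(*vert)]

    # Final step: clip to the legal range for the bit depth.
    max_val = (1 << bit_depth) - 1
    return [[min(max(v, 0), max_val) for v in row] for row in rows]
```

Keeping only a small first-stage shift at low bit depths leaves more fractional precision in the intermediate data, which is the kind of trade-off the abstract says is driven by the bit-depth requirements.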
  • Publication number: 20190057694
    Abstract: The present disclosure relates to methods for processing a decoded audio signal and for selectively applying speech/dialog enhancement to the decoded audio signal. The present disclosure also relates to a method of operating a headset for computer-mediated reality. A method of processing a decoded audio signal comprises obtaining a measure of a cognitive load of a listener that listens to a rendering of the audio signal, determining whether speech/dialog enhancement shall be applied based on the obtained measure of the cognitive load, and performing speech/dialog enhancement based on the determination. A method of operating a headset for computer-mediated reality comprises obtaining eye-tracking data of a wearer of the headset, determining a measure of a cognitive load of the wearer of the headset based on the eye-tracking data, and outputting an indication of the cognitive load of the wearer of the headset.
    Type: Application
    Filed: August 16, 2018
    Publication date: February 21, 2019
    Applicant: Dolby International AB
    Inventor: Arijit Biswas
  • Publication number: 20190058944
    Abstract: A method of processing a series of microphone inputs of an audio conference, the method including the steps of: (a) conducting a spatial analysis and feature extraction of the audio conference based on current audio activity; (b) aggregating historical information to obtain information about the approximate relative location of recent sound objects relative to the series of microphone inputs; (c) utilising the relative location or distance of the sound objects from the series of microphone inputs to determine if beam forming should be utilised to enhance the audio reception from recent sound objects.
    Type: Application
    Filed: February 23, 2017
    Publication date: February 21, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: David GUNAWAN, Glenn N. DICKINS
  • Publication number: 20190052956
    Abstract: An acoustic manifold for altering a sound wavefront shape from a loudspeaker having a substantially planar driver, comprising a mounting surface configured to attach to a front surface of a case surrounding the driver and having two vertical openings matching corresponding vertical openings in the case to allow sound from the driver to project therethrough, and a waveguide portion coupled to the mounting surface and having a structure channeling sound projected from the driver through the two vertical openings to be combined in one output area. The structure has a plurality of reflective surfaces configured to create output sound that has a consistent dispersion pattern over a defined area. The manifold is configured to increase a vertical and/or horizontal beamwidth of the projected sound so that listeners positioned off an axis of the loudspeaker will hear a wide range of audible frequencies at a substantially similar sound level.
    Type: Application
    Filed: February 22, 2017
    Publication date: February 14, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Michael J. SMITHERS, Garth Norman SHOWALTER
  • Publication number: 20190051312
    Abstract: The present invention provides improvements to prior art audio codecs that generate a stereo-illusion through post-processing of a received mono signal. These improvements are accomplished by extraction of stereo-image describing parameters at the encoder side, which are transmitted and subsequently used for control of a stereo generator at the decoder side. Furthermore, the invention bridges the gap between simple pseudo-stereo methods, and current methods of true stereo-coding, by using a new form of parametric stereo coding. A stereo-balance parameter is introduced, which enables more advanced stereo modes, and in addition forms the basis of a new method of stereo-coding of spectral envelopes, of particular use in systems where guided HFR (High Frequency Reconstruction) is employed. As a special case, the application of this stereo-coding scheme in scalable HFR-based codecs is described.
    Type: Application
    Filed: October 11, 2018
    Publication date: February 14, 2019
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Fredrik HENN, Kristofer KJORLING, Lars LILJERYD, Jonas RODEN, Jonas ENGDEGARD
  • Publication number: 20190052991
    Abstract: Example embodiments disclosed herein relate to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object, and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. A corresponding system and computer program product are described as well.
    Type: Application
    Filed: February 9, 2016
    Publication date: February 14, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
  • Publication number: 20190052989
    Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
    Type: Application
    Filed: October 18, 2018
    Publication date: February 14, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Louis D. FIELDER, Zhiwei SHUANG, Grant A. DAVIDSON, Xiguang ZHENG, Mark S. VINTON
  • Publication number: 20190052990
    Abstract: A method for performing DRC on a HOA signal comprises transforming the HOA signal to the spatial domain, analyzing the transformed HOA signal, and obtaining, from results of said analyzing, gain factors that are usable for dynamic compression. The gain factors can be transmitted together with the HOA signal. When applying the DRC, the HOA signal is transformed to the spatial domain, the gain factors are extracted and multiplied with the transformed HOA signal in the spatial domain, wherein a gain compensated transformed HOA signal is obtained. The gain compensated transformed HOA signal is transformed back into the HOA domain, wherein a gain compensated HOA signal is obtained. The DRC may be applied in the QMF-filter bank domain.
    Type: Application
    Filed: February 7, 2018
    Publication date: February 14, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20190043518
    Abstract: Methods and systems employing an internal microphone and an external microphone of a headset to capture own voice content in the presence of noise, extract the own voice content from background noise (by performing noise reduction on the microphone outputs to generate a noise reduced signal indicative of the own voice content), and optionally also perform voice activity detection to identify segments of own voice presence or absence. In some embodiments, the external microphone is employed to capture the own voice content, the internal microphone signal is employed to infer the noise captured by the external microphone, and the inferred noise is subtracted from the external microphone signal to generate the noise reduced signal. Aspects include methods performed by any embodiment of the system, and a system or device configured (e.g., programmed) to perform any embodiment of the method.
    Type: Application
    Filed: February 24, 2017
    Publication date: February 7, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Chunjian LI
  • Publication number: 20190045312
    Abstract: Described herein are audio capture systems and methods. One embodiment provides an audio capture system (1) including: microphones (9-11) positioned to capture respective audio signals from different directions or locations within an audio environment; a mixing module (7) configured to mix the audio signals in accordance with a mixing control signal to produce an output audio mix, wherein, upon the detection of vibration activity, the mixing control signal controls the mixing module (7) to selectively temporarily modify one or more of the audio signals to reduce the presence of noise associated with vibration activity in the output audio mix.
    Type: Application
    Filed: February 16, 2017
    Publication date: February 7, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: David GUNAWAN, Glenn N. DICKINS
  • Publication number: 20190045177
    Abstract: A projection display system includes a spatial modulator that is controlled to compensate for flare in a lens of the projector. The spatial modulator increases achievable intra-frame contrast and facilitates increased peak luminance without unacceptable black levels. Some embodiments provide 3D projection systems in which the spatial modulator is combined with a polarization control panel.
    Type: Application
    Filed: October 8, 2018
    Publication date: February 7, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Gregory John Ward, Robin Atkins
  • Publication number: 20190045181
    Abstract: In some embodiments, a display device for optically communicating display parameters is disclosed. The device receives input image data. Embedded in the input image data is a code value identifying a request for a portion of a display parameter of the display device. The device decodes the embedded code value. The device generates an optical image based on the request and transmits the generated optical image to an output of the display device to communicate the requested portion of the requested display parameter.
    Type: Application
    Filed: August 2, 2018
    Publication date: February 7, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Timo KUNKEL, Robin ATKINS, Tao CHEN
  • Publication number: 20190045315
    Abstract: A method for interactive and user guided manipulation of multichannel audio content, the method including the steps of: providing a content preview facility for replay and review of multichannel audio content by a user; providing a user interface for the user selection of a segment of multichannel audio content having an unsatisfactory audio content; processing the audio content to include associated audio object activity spatial or signal space regions, to create a time line of activity where one or more spatial or signal space regions are active at any given time; matching the user's gesture input against at least one of the active spatial or signal space regions; signal processing the audio emanating from selected active spatial or signal space region using a number of differing techniques to determine at least one processed alternative; providing the user with an interactive playback facility to listen to the processed alternative.
    Type: Application
    Filed: February 9, 2017
    Publication date: February 7, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Glenn N. DICKINS, David GUNAWAN
  • Publication number: 20190037333
    Abstract: An audio processing system and method calculate, based on spatial metadata of the audio objects, a panning coefficient for each of the audio objects in relation to each of a plurality of predefined channel coverage zones. The audio signal is converted into submixes in relation to the predefined channel coverage zones based on the calculated panning coefficients and the audio objects, each of the submixes indicating a sum of components of the plurality of the audio objects in relation to one of the predefined channel coverage zones. A submix gain is generated by applying audio processing to each of the submixes, and an object gain applied to each of the audio objects is controlled, the object gain being a function of the panning coefficients for each of the audio objects and the submix gains in relation to each of the predefined channel coverage zones.
    Type: Application
    Filed: September 26, 2018
    Publication date: January 31, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alan J. SEEFELDT, Lie LU, Chen ZHANG
  • Publication number: 20190037230
    Abstract: Implementations are provided that relate, for example, to view tiling in video encoding and decoding. A particular method includes accessing a video picture that includes multiple pictures combined into a single picture (826), accessing information indicating how the multiple pictures in the accessed video picture are combined (806, 808, 822), decoding the video picture to provide a decoded representation of at least one of the multiple pictures (824, 826), and providing the accessed information and the decoded video picture as output (824, 826). Some other implementations format or process the information that indicates how multiple pictures included in a single video picture are combined into the single video picture, and format or process an encoded representation of the combined multiple pictures.
    Type: Application
    Filed: September 27, 2018
    Publication date: January 31, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Purvin Bibhas PANDIT, Peng YIN, Dong TIAN
  • Publication number: 20190037331
    Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
    Type: Application
    Filed: January 26, 2017
    Publication date: January 31, 2019
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Leif Jonas SAMUELSSON, Dirk Jeroen BREEBAART, David Matthew COOPER, Jeroen KOPPENS
  • Publication number: 20190035411
    Abstract: A method for generating a bitstream indicative of an object based audio program is described. The bitstream comprises a sequence of containers. A first container of the sequence of containers comprises a plurality of substream entities for a plurality of substreams of the object based audio program and a presentation section. The method comprises determining a set of object channels. The method further comprises providing a set of object related metadata for the set of object channels. In addition, the method comprises inserting a first set of object channel frames and a first set of object related metadata frames into a respective set of substream entities of the first container. Furthermore, the method comprises inserting presentation data into the presentation section.
    Type: Application
    Filed: October 2, 2018
    Publication date: January 31, 2019
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Christof FERSCH, Alexander STAHLMANN
  • Publication number: 20190035410
    Abstract: Encoding/decoding an audio signal having one or more audio components, wherein each audio component is associated with a spatial location. A first audio signal presentation (z) of the audio components, a first set of transform parameters (w(f)), and signal level data (?2) are encoded and transmitted to the decoder. The decoder uses the first set of transform parameters (w(f)) to form a reconstructed simulation input signal intended for an acoustic environment simulation, and applies a signal level modification (?) to the reconstructed simulation input signal. The signal level modification is based on the signal level data (?2) and data (p2) related to the acoustic environment simulation. The attenuated reconstructed simulation input signal is then processed in an acoustic environment simulator. With this process, the decoder does not need to determine the signal level of the simulation input signal, thereby reducing processing load.
    Type: Application
    Filed: January 23, 2017
    Publication date: January 31, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Dirk Jeroen BREEBAART
  • Publication number: 20190037301
    Abstract: Personal audio systems and methods are disclosed. A personal audio system includes a voice activity detector to determine whether or not an ambient audio stream contains voice activity, a pitch estimator to determine a frequency of a fundamental component of an annoyance noise contained in the ambient audio stream, and a filter bank to attenuate the fundamental component and at least one harmonic component of the annoyance noise to generate a personal audio stream. The filter bank implements a first filter function when the ambient audio stream does not contain voice activity, or a second filter function when the ambient audio stream contains voice activity.
    Type: Application
    Filed: August 2, 2018
    Publication date: January 31, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Gints KLIMANIS, Anthony PARKS
  • Publication number: 20190037214
    Abstract: The quantization parameter QP is well-known in digital video compression as an indication of picture quality. Digital symbols representing a moving image are quantized with a quantizing step that is a function QSN of the quantization parameter QP, which function QSN has been normalized to the most significant bit of the bit depth of the digital symbols. As a result, the effect of a given QP is essentially independent of bit depth: a particular QP value has a standard effect on image quality, regardless of bit depth. The invention is useful, for example, in encoding and decoding at different bit depths, to generate compatible bitstreams having different bit depths, and to allow different bit depths for different components of a video signal by compressing each with the same fidelity (i.e., the same QP).
    Type: Application
    Filed: October 2, 2018
    Publication date: January 31, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Walter C. Gish, Christopher J. Vogt
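The idea of a quantizing step normalized to bit depth can be illustrated with a short sketch. The abstract does not give the function QSN, so the exponential mapping below (step doubling every 6 QP values, scaled by a power of two relative to an 8-bit reference) and the names `normalized_qstep`, `quantize`, and `dequantize` are assumptions made purely for illustration.

```python
def normalized_qstep(qp, bit_depth, ref_bit_depth=8):
    """Hypothetical QSN: step size that scales with the bit depth.

    The base step doubles every 6 QP values (an assumed mapping); it is then
    scaled so that the same QP gives the same step relative to full scale.
    """
    base = 2.0 ** ((qp - 4) / 6.0)
    return base * 2.0 ** (bit_depth - ref_bit_depth)


def quantize(value, qp, bit_depth):
    """Map a sample to an integer level using the normalized step."""
    return int(value / normalized_qstep(qp, bit_depth) + 0.5)


def dequantize(level, qp, bit_depth):
    """Reconstruct a sample from its level."""
    return level * normalized_qstep(qp, bit_depth)
```

Under these assumptions, an 8-bit sample of 100 and the equivalent 10-bit sample of 400 map to the same level at the same QP, so the error relative to full scale is the same at both bit depths.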
  • Publication number: 20190037283
    Abstract: Personalized audio metadata is generated based on audio program elements and presentation configuration metadata to specify personalized audio presentations for a media presentation. The personalized audio metadata is transmitted to an adaptive streaming client in response to receiving a media presentation request by the adaptive streaming client for the media presentation. Audio program elements of a specific personalized audio presentation are transmitted to the adaptive streaming client in response to receiving subsequent media streaming requests for the audio program elements.
    Type: Application
    Filed: January 31, 2017
    Publication date: January 31, 2019
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Kurt KRAUSS, Michael Peter ASSENTI
  • Publication number: 20190027157
    Abstract: An importance metric, based at least in part on an energy metric, may be determined for each of a plurality of received audio objects. Some methods may involve: determining a global importance metric for all of the audio objects, based, at least in part, on a total energy value calculated by summing the energy metric of each of the audio objects; determining an estimated quantization bit depth and a quantization error for each of the audio objects; calculating a total noise metric for all of the audio objects, the total noise metric being based, at least in part, on a total quantization error corresponding with the estimated quantization bit depth; calculating a total signal-to-noise ratio corresponding with the total noise metric and the total energy value; and determining a final quantization bit depth for each of the audio objects by applying a signal-to-noise ratio threshold to the total signal-to-noise ratio.
    Type: Application
    Filed: January 26, 2017
    Publication date: January 24, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Nicolas R. TSINGOS, Zachary Gideon COHEN, Vivek KUMAR
  • Publication number: 20190028827
    Abstract: Audio content coded for a reference speaker configuration is downmixed to downmix audio content coded for a specific speaker configuration. One or more gain adjustments are performed on individual portions of the downmix audio content coded for the specific speaker configuration. Loudness measurements are then performed on the individual portions of the downmix audio content. An audio signal that comprises the audio content coded for the reference speaker configuration and downmix loudness metadata is generated. The downmix loudness metadata is created based at least in part on the loudness measurements on the individual portions of the downmix audio content.
    Type: Application
    Filed: August 28, 2018
    Publication date: January 24, 2019
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Michael Ward, Jeffrey Riedmiller, Scott Gregory Norcross, Alexander Stahlmann
  • Publication number: 20190019528
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Application
    Filed: September 19, 2018
    Publication date: January 17, 2019
    Applicant: Dolby International AB
    Inventor: Lars Villemoes
  • Publication number: 20190013034
    Abstract: The present document relates to audio source coding systems. In particular, the present document relates to audio source coding systems which make use of linear prediction in combination with a filterbank. A method for estimating a first sample (615) of a first subband signal in a first subband of an audio signal is described. The first subband signal of the audio signal is determined using an analysis filterbank (612) comprising a plurality of analysis filters which provide a plurality of subband signals in a plurality of subbands from the audio signal, respectively.
    Type: Application
    Filed: September 12, 2018
    Publication date: January 10, 2019
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Lars VILLEMOES
  • Publication number: 20190013786
    Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.
    Type: Application
    Filed: September 12, 2018
    Publication date: January 10, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Brett G. Crockett, Alan J. Seefeldt
  • Publication number: 20190012133
    Abstract: In some embodiments, a method for benchmarking an audio processing algorithm (“APA”) while the APA is executed in a manner simulating expected real time execution by a deployed system. Other embodiments include a method including steps of determining a synthetic APA which corresponds to a counterpart APA (intended for real use by a first deployed system), and benchmarking the synthetic APA while it is executed in a manner simulating expected real time execution of the synthetic APA by a contemplated deployed system. Other aspects include a system or device configured to implement any embodiment of the inventive method, or including a memory which stores data indicative of at least one synthetic APA determined in accordance with, or a benchmark generated by, an embodiment of the inventive method or steps thereof, and a computer readable medium which stores code for implementing any embodiment of the inventive method or steps thereof.
    Type: Application
    Filed: December 17, 2015
    Publication date: January 10, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Andrew P. REILLY, Marcus ALTMAN, Niall BATTSON, Nicholas ENGEL
  • Publication number: 20190013029
    Abstract: The present disclosure relates to a method of downmixing a plurality of input audio channels. The method includes obtaining, for each of the input audio channels, a plurality of frequency coefficients in a plurality of corresponding frequency bins, and applying, for at least one frequency bin, a downmix matrix to a first array formed by the frequency coefficients of the plurality of input audio channels for the respective frequency bin to obtain a second array formed by the frequency coefficients of a plurality of intermediate audio channels for the respective frequency bin. The method further involves determining a third array including only the non-zero entries of the downmix matrix, and determining a fourth array including, for each entry of the third array, an entry indicative of a position of the respective entry of the third array within the downmix matrix.
    Type: Application
    Filed: February 3, 2017
    Publication date: January 10, 2019
    Applicant: Dolby International AB
    Inventor: Vesa RUOPPILA
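The third and fourth arrays described in the abstract above resemble a sparse-matrix representation, which can be sketched as follows. This is a minimal illustration, not the application's implementation: encoding each position as a flat row-major index, and the function names `compress_downmix` and `apply_sparse_downmix`, are assumptions.

```python
def compress_downmix(matrix):
    """Return (values, positions) for the non-zero entries of a downmix matrix.

    values    -> the "third array": only the non-zero coefficients
    positions -> the "fourth array": where each coefficient sits in the matrix,
                 stored here as a row-major index (an assumed encoding)
    """
    n_cols = len(matrix[0])
    values, positions = [], []
    for r, row in enumerate(matrix):
        for c, coeff in enumerate(row):
            if coeff != 0:
                values.append(coeff)
                positions.append(r * n_cols + c)
    return values, positions


def apply_sparse_downmix(values, positions, n_rows, n_cols, inputs):
    """Downmix one frequency bin, touching only the non-zero coefficients."""
    out = [0] * n_rows
    for coeff, pos in zip(values, positions):
        r, c = divmod(pos, n_cols)     # recover output and input channel
        out[r] += coeff * inputs[c]
    return out
```

For a 2x3 matrix `[[1, 0, 0.5], [0, 1, 0.5]]`, the sketch stores four coefficients instead of six and performs four multiplications per frequency bin.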
  • Publication number: 20190007754
    Abstract: An input media signal that carries input media content is received. The input media content is used to generate output media content in an output media signal. It is determined whether identification-and-timing (IAT) data is to be authored for the output media content. In response to determining that the output IAT data is to be authored for the output media content, output IAT data is authored for the output media content. At least a part of the output IAT data for at least a part of the output media content is encoded, along with the part of the output media content, into the output media signal. In some example scenarios, this output media signal then contains the IAT data and other related data for synchronization of additional media content with the output media content in content rendering/presentation operations.
    Type: Application
    Filed: June 29, 2018
    Publication date: January 3, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Aaron S. MASTER, Marvin PRIBADI
  • Publication number: 20190007779
    Abstract: Multi-channel audio content is mixed for a particular loudspeaker setup. However, a consumer's audio setup is very likely to use a different placement of speakers. The present invention provides a method of rendering multi-channel audio that assures replay of the spatial signal components with equal loudness of the signal. A method for obtaining an energy preserving mixing matrix (G) for mixing L1 input audio channels to L2 output channels comprises steps of obtaining a first mixing matrix Ĝ, performing a singular value decomposition on the first mixing matrix Ĝ to obtain a singularity matrix S, processing the singularity matrix S to obtain a processed singularity matrix Ŝ, determining a scaling factor a, and calculating an improved mixing matrix G according to G = a U Ŝ V^T. The perceived sound, loudness, timbre and spatial impression of multi-channel audio replayed on an arbitrary loudspeaker setup practically equals that of the original speaker setup.
    Type: Application
    Filed: September 6, 2018
    Publication date: January 3, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Johannes BOEHM
  • Publication number: 20190005919
    Abstract: Apparatus and methods for mapping video signal parameters such as tone and color may be applied at various points in a video generation and delivery pipeline. Apparatus may be configured to control mappings based on a range of inputs which may include one or more of: ambient conditions, user inputs, control information, adaptation models. Apparatus and methods may be applied to display video or other images so as to preserve a creative intent embodied in video or other image data.
    Type: Application
    Filed: August 20, 2018
    Publication date: January 3, 2019
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Helge Seetzen, Robin Atkins, Neil W. Messmer, Gerwin Damberg
  • Publication number: 20190007703
    Abstract: A method for encoding a LUT defined as a lattice of vertices is disclosed. At least one value is associated with each vertex of the lattice. The method comprises for a current vertex: predicting the at least one value of said current vertex from another value which is for example obtained from reconstructed values of neighboring vertices; and encoding in a bitstream at least one residue computed between the at least one value of the current vertex and its prediction.
    Type: Application
    Filed: September 10, 2018
    Publication date: January 3, 2019
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Philippe BORDES, Pierre ANDRIVON, Emmanuel JOLLY
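    The predict-then-encode-residue idea in this abstract can be sketched as follows. This is a simplified illustration, not the method of the application: it assumes a one-dimensional lattice, a lossless residue, and a predictor that simply reuses the previously reconstructed vertex value. The function names are hypothetical.

```python
def encode_residues(vertex_values):
    """Return one residue per vertex: value minus its prediction."""
    residues = []
    prediction = 0  # assumed prediction for the first vertex
    for value in vertex_values:
        residues.append(value - prediction)
        # Lossless case: the reconstructed value equals the original value,
        # so it can serve directly as the prediction for the next vertex.
        prediction = value
    return residues


def decode_residues(residues):
    """Rebuild vertex values by adding each residue to the running prediction."""
    values = []
    prediction = 0
    for residue in residues:
        value = prediction + residue
        values.append(value)
        prediction = value
    return values


lut = [0, 64, 130, 190, 255]
residues = encode_residues(lut)
restored = decode_residues(residues)
```

    After the first vertex, the residues are small and similar in size, which is what makes them cheaper to entropy-code than the raw vertex values.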
  • Publication number: 20180376146
    Abstract: Novel methods and systems for encoding standard dynamic range video to improve the final quality after converting standard dynamic range video into enhanced dynamic range video are disclosed. A dual layer codec structure that amplifies certain codeword ranges can be used to send enhanced information to the decoder in order to achieve an enhanced (higher bit depth) image signal. The enhanced standard dynamic range signal can then be up-converted to enhanced dynamic range video without banding artifacts in the areas corresponding to those certain codeword ranges.
    Type: Application
    Filed: July 26, 2016
    Publication date: December 27, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Guan-Ming SU
  • Publication number: 20180374496
    Abstract: Example embodiments disclosed herein relate to audio signal processing. A method of processing an audio signal is disclosed. The method includes detecting, based on a power distribution of the audio signal, a type of content of a frame of the audio signal, generating a first gain based on a sound level of the frame for adjusting the sound level, processing the audio signal by applying the first gain to the frame; and in response to the type of content being detected to be a breath sound, generating a second gain for mitigating the breath sound and processing the audio signal by applying the second gain to the frame. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: December 15, 2016
    Publication date: December 27, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Dong SHI, David GUNAWAN, Glenn N. DICKINS
  • Publication number: 20180374192
    Abstract: A spherical image of a spatial environment is received and contains spherically arranged pixel values indexed by a time value. The spherical image is represented in a content creation coordinate system in reference to a spatial position in the spatial environment. The spatial position is indexed by the time value. A spatial relationship is determined between the content creation coordinate system and a spherical image reference coordinate system. Based at least in part on the spatial relationship and the spherically arranged pixel values, spherical distributions of image metadata are determined for the spherical image.
    Type: Application
    Filed: December 22, 2016
    Publication date: December 27, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Timo KUNKEL, Cristina Michel VASCO, Scott DALY
  • Publication number: 20180373129
    Abstract: A novel spatial light modulator (SLM) includes a cover glass, a modulation layer, and a plurality of pixel mirrors, and separates unwanted, reflected light from desired, modulated light. In one embodiment, a geometrical relationship exists between the cover glass and the pixel mirrors, such that light that reflects from the cover glass is separated from light that reflects from the pixel mirrors and is transmitted from the SLM. In one example, one of the cover glass or the pixel mirrors is angled with respect to the modulation layer. In another example embodiment, the cover glass has a particular thickness, which introduces destructive interference between light that reflects from the top and bottom surfaces of the cover glass. In another embodiment, antireflective coatings are disposed between optical interfaces of the SLM. In another embodiment, light from the SLM is directed through an optical filter to remove unwanted light.
    Type: Application
    Filed: June 19, 2018
    Publication date: December 27, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Juan P. PERTIERRA, Martin J. RICHARDS, Barret LIPPEY
  • Publication number: 20180367939
    Abstract: Systems and methods are described for an adaptive audio system that renders reflected sound for adaptive audio systems in different ways depending on the orientation of at least one speaker in a set of speakers. A speaker of the system may comprise an integrated speaker having front-firing and upward-firing drivers, a sensor to determine the orientation of the speaker (e.g., horizontal or vertical) and a transceiver and control unit that transmits the orientation to a decoder and receives updated speaker feeds from the renderer based on the orientation.
    Type: Application
    Filed: December 14, 2016
    Publication date: December 20, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: David Matthew FISCHER, Warren MANSFIELD, Adam Christopher NOEL, Timothy James EGGERDING, Philip NICOL
  • Publication number: 20180366131
    Abstract: A method for compressing a HOA signal being an input HOA representation with input time frames (C(k)) of HOA coefficient sequences comprises spatial HOA encoding of the input time frames and subsequent perceptual encoding and source encoding. Each input time frame is decomposed (802) into a frame of predominant sound signals (X_PS(k-1)) and a frame of an ambient HOA component (C̃_AMB(k-1)). The ambient HOA component (C̃_AMB(k-1)) comprises, in a layered mode, first HOA coefficient sequences of the input HOA representation (c_n(k-1)) in lower positions and second HOA coefficient sequences (c_AMB,n(k-1)) in remaining higher positions. The second HOA coefficient sequences are part of an HOA representation of a residual between the input HOA representation and the HOA representation of the predominant sound signals.
    Type: Application
    Filed: August 28, 2018
    Publication date: December 20, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Sven Kordon, Alexander Krueger, Oliver Wuebbolt
  • Publication number: 20180366136
    Abstract: Example embodiments disclosed herein relate to audio signal processing. A method of indicating a presence of a nuisance in an audio signal is disclosed. The method includes determining a probability of the presence of the nuisance in a frame of the audio signal based on a feature of the audio signal, the nuisance representing an unwanted sound made by a user, in response to the probability of the presence of the nuisance exceeding a threshold, tracking the audio signal based on a metric over a plurality of frames following the frame, determining, based on the tracking, that the presence of the nuisance is to be indicated to the user, and in response to the determination, presenting to the user a notification of the presence of the nuisance. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: December 14, 2016
    Publication date: December 20, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Dong Shi, David Gunawan, Glenn N. Dickins
  • Publication number: 20180366070
    Abstract: Systems and methods are disclosed for dynamically adjusting the backlight of a display during video playback. Given an input video stream and associated minimum, average, or maximum luminance values of the video frames in the video stream, values of a function of the frame min, mid, or max luminance values are filtered using a temporal filter to generate a filtered output value for each frame. The instantaneous dynamic range of a target display is determined based on the filtered output value and the minimum and maximum brightness values of the display. A backlight control level is computed based on the instantaneous dynamic range, and the input signal is tone mapped by a display management process to be displayed on the target display at the selected backlight level. The design of a temporal filter based on an exponential moving average filter and scene-change detection is presented.
    Type: Application
    Filed: May 11, 2016
    Publication date: December 20, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Suzanne FARRELL, Scott DALY, Robin ATKINS, Timo KUNKEL, Gregory John WARD, Samir N. HULYALKAR, Ning XU
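    The temporal filter mentioned at the end of this abstract can be illustrated with a minimal sketch of an exponential moving average that restarts at scene changes. The reset-on-cut behavior, the smoothing factor `alpha`, and the function name are assumptions made for illustration; the abstract does not specify them.

```python
def filter_luminance(frame_values, scene_cuts, alpha=0.1):
    """Smooth per-frame luminance values with an exponential moving average.

    frame_values: one luminance statistic per frame (e.g. the frame mid value).
    scene_cuts:   set of frame indices at which a scene change was detected.
    alpha:        weight of the current frame (assumed parameter).
    """
    filtered = []
    state = None
    for index, value in enumerate(frame_values):
        if state is None or index in scene_cuts:
            # Restart the filter so a new scene is not dragged toward the old one.
            state = value
        else:
            state = alpha * value + (1 - alpha) * state
        filtered.append(state)
    return filtered


mids = [100.0, 110.0, 90.0, 400.0, 420.0]
smoothed = filter_luminance(mids, scene_cuts={3}, alpha=0.5)
```

    Within a scene the output changes gradually, which avoids visible backlight flicker, while the restart at frame 3 lets the level jump immediately to the brighter scene.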
  • Publication number: 20180366132
    Abstract: Encoding and decoding devices for encoding the channels of an audio system having at least four channels are disclosed. The decoding device has a first stereo decoding component which subjects a first pair of input channels to a first stereo decoding, and a second stereo decoding component which subjects a second pair of input channels to a second stereo decoding. The results of the first and second stereo decoding components are crosswise coupled to a third and a fourth stereo decoding component, each of which performs stereo decoding on one channel resulting from the first stereo decoding component and one channel resulting from the second stereo decoding component.
    Type: Application
    Filed: August 28, 2018
    Publication date: December 20, 2018
    Applicant: Dolby International AB
    Inventors: Kristofer Kjoerling, Harald Mundt, Heiko Purnhagen
  • Publication number: 20180367934
    Abstract: The invention discloses rendering sound field signals, such as Higher-Order Ambisonics (HOA), for arbitrary loudspeaker setups, where the rendering results in highly improved localization properties and is energy preserving. This is obtained by a new type of decode matrix for sound field data, and a new way to obtain the decode matrix.
    Type: Application
    Filed: August 28, 2018
    Publication date: December 20, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Johannes BOEHM, Florian KEILER
  • Publication number: 20180358028
    Abstract: Embodiments are directed to a companding method and system for reducing coding noise in an audio codec. A method of processing an audio signal comprises receiving an audio signal, classifying the audio signal as one of pure sinusoidal, hybrid, or pure transient signal using two defined threshold values, and selectively applying a companding operation by switching between a companding off mode, a companding on mode, and an average companding mode, comprising selecting between the companding on mode and the average companding mode for a classified hybrid signal using a companding rule that uses a temporal sharpness measure in a frequency domain.
    Type: Application
    Filed: October 27, 2016
    Publication date: December 13, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Arijit BISWAS
  • Publication number: 20180359489
    Abstract: A target view to a 3D scene depicted by a multiview image is determined. The multiview image comprises multiple sampled views. Each sampled view comprises multiple texture images and multiple depth images in multiple image layers. The target view is used to select, from the multiple sampled views of the multiview image, sampled views. A texture image and a depth image for each sampled view in the selected sampled views are encoded into a multiview video signal to be transmitted to a downstream device.
    Type: Application
    Filed: June 7, 2018
    Publication date: December 13, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Haricharan LAKSHMAN, Ajit NINAN
  • Publication number: 20180359596
    Abstract: A method of encoding channel or object based input audio for playback, the method including the steps of: (a) initially rendering the channel or object based input audio into an initial output presentation; (b) determining an estimate of the dominant audio component from the channel or object based input audio and determining a series of dominant audio component weighting factors for mapping the initial output presentation into the dominant audio component; (c) determining an estimate of the dominant audio component direction or position; and (d) encoding the initial output presentation, the dominant audio component weighting factors, and the dominant audio component direction or position as the encoded signal for playback.
    Type: Application
    Filed: November 17, 2016
    Publication date: December 13, 2018
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen BREEBAART, David Matthew COOPER, Mark F. DAVIS, David S. McGRATH, Kristofer KJOERLING, Harald MUNDT, Rhonda J. WILSON
  • Publication number: 20180350393
    Abstract: Systems and methods are described for measuring capture performance of multiple voice signals. A first speech signal is applied to a device, and measured at a far-end of a testing environment. A second speech signal is separately applied to the device, and is also measured at the far-end. The measured speech signals are added, and a quality assessment model is applied to the first far-end combined signal to obtain a first quality metric. The first speech signal and the second speech signal are then both applied at the same time to the device and measured at the far-end. The quality assessment model is applied to the second far-end combined signal to obtain a second quality metric. The quality metric for the second far-end combined signal is normalized, based on the first quality metric, to obtain a performance index for the device.
    Type: Application
    Filed: January 17, 2017
    Publication date: December 6, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Doh-Suk KIM
  • Publication number: 20180352475
    Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters relating to parametric spatial coding of audio signals. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.
    Type: Application
    Filed: August 10, 2018
    Publication date: December 6, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Per EKSTRAND
  • Publication number: 20180352366
    Abstract: The positions of a plurality of speakers at a media consumption site are determined. Audio information in an object-based format is received. A gain adjustment value for a sound content portion in the object-based format may be determined based on the position of the sound content portion and the positions of the plurality of speakers. Audio information in a ring-based channel format is received. A gain adjustment value for each ring-based channel in a set of ring-based channels may be determined based on the ring to which the ring-based channel belongs and the positions of the speakers at the media consumption site.
    Type: Application
    Filed: June 25, 2018
    Publication date: December 6, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Nicolas R. TSINGOS, David S. MCGRATH, Freddie SANCHEZ, Antonio MATEOS SOLE