Dolby Labs Patent Applications

Patent applications filed by Dolby Labs and published by the U.S. Patent and Trademark Office (USPTO).

  • Publication number: 20180018984
    Abstract: Example embodiments disclosed herein relate to assessment and adjustment for an audio environment. A computer-implemented method is provided. The method includes obtaining a first audio signal captured by a device located in an environment. The method also includes analyzing a characteristic of the first audio signal to determine an acoustic performance metric for the environment. The method further includes, in response to the acoustic performance metric being below a threshold, providing a first task for a user to perform based on the characteristic of the first audio signal. The first task is related to an adjustment to a setting of the environment. Embodiments in this regard further provide a corresponding computer program product. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: July 12, 2017
    Publication date: January 18, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Glenn N. DICKINS, Paul HOLMBERG, Dong SHI
  • Publication number: 20180020224
    Abstract: Statistical values are computed based on received source images. An adaptive reshaping function is selected for one or more source images based on the one or more statistical values. A portion of source video content is adaptively reshaped, based on the selected adaptive reshaping function to generate a portion of reshaped video content. The portion of source video content is represented by the one or more source images. An approximation of an inverse of the selected adaptive reshaping function is generated. The reshaped video content and a set of adaptive reshaping parameters defining the approximation of the inverse of the selected adaptive reshaping function are encoded into a reshaped video signal. The reshaped video signal may be processed by a downstream recipient device to generate a version of reconstructed source images, for example, for rendering with a display device.
    Type: Application
    Filed: March 17, 2016
    Publication date: January 18, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Guan-Ming SU, Qiuwei WANG, Hsuan-Ting CHOU, Navaneeth KAMBALLUR KOTTAYIL
  • Publication number: 20180018932
    Abstract: A processor receives input video data of a video dynamic range and input dynamic metadata. It also receives: input graphics data of a graphics dynamic range and input static metadata, display identification data from a target display over a video interface, and a blending priority map characterizing a per-pixel priority of output pixels in an image generated by blending the input video data and the input graphics data. A video mapping function and a graphics mapping function which map data from the input video and graphics dynamic ranges to the target dynamic range are generated based on the dynamic and static metadata. Then, the input and graphics data are blended based on the blending priority map and a per-pixel decision to map pixels to the target dynamic range using either the video mapping function or the graphics mapping function.
    Type: Application
    Filed: September 28, 2017
    Publication date: January 18, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Robin ATKINS
  • Publication number: 20180018977
    Abstract: Some methods involve receiving an input audio signal that includes N input audio channels, the input audio signal representing a first soundfield format having a first soundfield format resolution, N being an integer ≥2. A first decorrelation process may be applied to two or more of the input audio channels to produce a first set of decorrelated channels, the first decorrelation process maintaining an inter-channel correlation of the set of input audio channels. A first modulation process may be applied to the first set of decorrelated channels to produce a first set of decorrelated and modulated output channels. The first set of decorrelated and modulated output channels may be combined with two or more undecorrelated output channels to produce an output audio signal that includes O output audio channels representing a second and relatively higher-resolution soundfield format than the first soundfield format, O being an integer ≥3.
    Type: Application
    Filed: March 2, 2016
    Publication date: January 18, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: David S. MCGRATH
  • Publication number: 20180020310
    Abstract: Embodiments are described for rendering spatial audio content through a system that is configured to reflect audio off of one or more surfaces of a listening environment. The system includes an array of audio drivers distributed around a room, wherein at least one driver of the array of drivers is configured to project sound waves toward one or more surfaces of the listening environment for reflection to a listening area within the listening environment and a renderer configured to receive and process audio streams and one or more metadata sets that are associated with each of the audio streams and that specify a playback location in the listening environment.
    Type: Application
    Filed: September 26, 2017
    Publication date: January 18, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Brett G. CROCKETT, Spencer HOOKS, Alan SEEFELDT, Joshua B. LANDO, C. Phillip BROWN, Sripal S. MEHTA, Stewart MURRIE
  • Publication number: 20180020226
    Abstract: The present invention relates generally to images. More particularly, an embodiment of the present invention relates to the pixel group segmented quantization and de-quantization of the residual signal in layered coding of high dynamic range images. By assigning the pixels in the residual image to different pixel groups based on the pixel value of the corresponding pixel in the decoded base layer signal, and by applying pixel group quantizing functions to assigned pixels a more efficient coding can be achieved.
    Type: Application
    Filed: January 25, 2016
    Publication date: January 18, 2018
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Klaas Heinrich SCHUEUER, Uwe Michael KOWALIK, Arion NEDDENS, Philipp KRAETZER, Guan-Ming SU
  • Publication number: 20180012609
    Abstract: This disclosure falls into the field of audio coding, in particular it is related to the field of providing a framework for providing loudness consistency among differing audio output signals. In particular, the disclosure relates to methods, computer program products and apparatus for encoding and decoding of audio data bitstreams in order to attain a desired loudness level of an output audio signal.
    Type: Application
    Filed: August 15, 2017
    Publication date: January 11, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeroen KOPPENS, Scott Gregory NORCROSS
  • Publication number: 20180013879
    Abstract: This disclosure falls into the field of voice communication systems, more specifically it is related to the field of voice quality estimation in a packet based voice communication system. In particular the disclosure provides a method and device for reducing a prediction error of the voice quality estimation by considering the content of lost packets. Furthermore, this disclosure provides a method and device which uses a voice quality estimating algorithm to calculate the voice quality estimate based on an input which is switchable between a first and a second input mode.
    Type: Application
    Filed: December 23, 2015
    Publication date: January 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Doh-Suk KIM, Shen HUANG
  • Publication number: 20180012610
    Abstract: An audio processing unit (APU) is disclosed. The APU includes a buffer memory configured to store at least one frame of an encoded audio bitstream, where the encoded audio bitstream includes audio data and a metadata container. The metadata container includes a header and one or more metadata payloads after the header. The one or more metadata payloads include dynamic range compression (DRC) metadata, and the DRC metadata is or includes profile metadata indicative of whether the DRC metadata includes dynamic range compression (DRC) control values for use in performing dynamic range compression in accordance with at least one compression profile on audio content indicated by at least one block of the audio data.
    Type: Application
    Filed: September 1, 2017
    Publication date: January 11, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jeffrey RIEDMILLER, Michael WARD
  • Publication number: 20180014139
    Abstract: Described herein is a method for creating an object-based audio signal from an audio input, the audio input including one or more audio channels that are recorded to collectively define an audio scene. The one or more audio channels are captured from a respective one or more spatially separated microphones disposed in a stable spatial configuration. The method includes the steps of: a) receiving the audio input; b) performing spatial analysis on the one or more audio channels to identify one or more audio objects within the audio scene; c) determining contextual information relating to the one or more audio objects; d) defining respective audio streams including audio data relating to at least one of the identified one or more audio objects; and e) outputting an object-based audio signal including the audio streams and the contextual information.
    Type: Application
    Filed: February 2, 2016
    Publication date: January 11, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Glenn N. DICKINS, Richard J. CARTWRIGHT
  • Publication number: 20180013927
    Abstract: A display management processor receives an input image with enhanced dynamic range to be displayed on a target display which has a different dynamic range than a reference display. The input image is first transformed into a perceptually-quantized (PQ) color space, preferably the IPT-PQ color space. A color volume mapping function, which includes an adaptive tone-mapping function and an adaptive gamut mapping function, generates a mapped image. A detail-preservation step is applied to the intensity component of the mapped image to generate a final mapped image with a filtered tone-mapped intensity image. The final mapped image is then translated back to the display's preferred color space. Examples of the adaptive tone mapping and gamut mapping functions are provided.
    Type: Application
    Filed: January 14, 2016
    Publication date: January 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Robin ATKINS
  • Publication number: 20180014015
    Abstract: Techniques are provided to encode and decode image data comprising a tone mapped (TM) image with HDR reconstruction data in the form of luminance ratios and color residual values. In an example embodiment, luminance ratio values and residual values in color channels of a color space are generated on an individual pixel basis based on a high dynamic range (HDR) image and a derivative tone-mapped (TM) image that comprises one or more color alterations that would not be recoverable from the TM image with a luminance ratio image. The TM image with HDR reconstruction data derived from the luminance ratio values and the color-channel residual values may be outputted in an image file to a downstream device, for example, for decoding, rendering, and/or storing. The image file may be decoded to generate a restored HDR image free of the color alterations.
    Type: Application
    Filed: September 25, 2017
    Publication date: January 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Wenhui JIA, Ajit NINAN, Arkady TEN, Gregory John WARD
  • Publication number: 20180011365
    Abstract: A display provides increased contrast and resolution via a first LCD panel energized to generate an image and a second LCD panel configured to increase contrast of the image. The second panel is an LCD panel without color filters and is configured to increase contrast by decreasing black levels of dark portions of images using polarization rotation and filtration. The second LCD panel may have higher resolution than the first LCD panel. A half wave plate and/or film is placed in between the first and the second panel. The panels may be directly illuminated or edge lit, and may be globally or locally dimmed lights that may also include individual control of color intensities for each image or frame displayed.
    Type: Application
    Filed: September 8, 2017
    Publication date: January 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jerome D. Shields, Marek Hadlaw, Gopal Erinjippurath
  • Publication number: 20180007374
    Abstract: Downsampled video content is generated in a subsampling color space from linearized video content in the subsampling color space. The linearized video content represents a first spatial dimension, whereas the downsampled video content represents a second spatial dimension lower than the first spatial dimension. Opponent channel data is derived in a transmission color space from the downsampled video content. Output video content is generated from luminance data in the linearized video content and the opponent channel data in the transmission color space. The output video content may be decoded by a downstream recipient device to generate video content in an output color space.
    Type: Application
    Filed: March 24, 2016
    Publication date: January 4, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin ATKINS, Jaclyn Anne PYTLARZ
  • Publication number: 20180005639
    Abstract: According to an aspect of the present invention, a method for reconstructing an audio signal having a baseband portion and a highband portion is disclosed. The method includes obtaining a decoded baseband audio signal by decoding an encoded audio signal and obtaining a plurality of subband signals by filtering the decoded baseband audio signal. The method further includes generating a high-frequency reconstructed signal by copying a number of consecutive subband signals of the plurality of subband signals and obtaining an envelope adjusted high-frequency signal. The method further includes generating a noise component based on a noise parameter. Finally, the method includes adjusting a phase of the high-frequency reconstructed signal and obtaining a time-domain reconstructed audio signal by combining the decoded baseband audio signal and the combined high-frequency signal.
    Type: Application
    Filed: September 12, 2017
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael M. Truman, Mark S. Vinton
  • Publication number: 20180005643
    Abstract: In some embodiments, a method, apparatus and computer program for reducing noise from an audio signal captured by a drone (e.g., canceling the noise signature of a drone from the audio signal) using a model of noise emitted by the drone's propulsion system set, where the propulsion system set includes one or more propulsion systems, each of the propulsion systems including an electric motor, and wherein the noise reduction is performed in response to voltage data indicative of instantaneous voltage supplied to each electric motor of the propulsion system set. In some other embodiments, a method, apparatus and computer program for generating a noise model by determining the noise signature of at least one drone based upon a database of noise signals corresponding to at least one propulsion system and canceling the noise signature of the drone in an audio signal based upon the noise model.
    Type: Application
    Filed: January 20, 2016
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Nicolas R. TSINGOS
  • Publication number: 20180006011
    Abstract: An overall displacement tolerance applicable to each pixel tile in a plurality of pixel tiles to be used as parts of an image rendering surface is determined. Each pixel tile in the plurality of pixel tiles comprises a plurality of sub-pixels. Random displacements are generated in each pixel tile in the plurality of pixel tiles based on the overall displacement tolerance. The plurality of image rendering tiles with the random displacements are combined into the image rendering surface.
    Type: Application
    Filed: January 11, 2016
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, Tyrome Y. BROWN
  • Publication number: 20180006837
    Abstract: Some aspects of the present disclosure involve the recording, processing and playback of audio data corresponding to conferences, such as teleconferences. In some teleconference implementations, the audio experience heard when a recording of the conference is played back may be substantially different from the audio experience of an individual conference participant during the original teleconference. In some implementations, the recorded audio data may include at least some audio data that was not available during the teleconference. In some examples, the spatial characteristics of the played-back audio data may be different from that of the audio heard by participants of the teleconference.
    Type: Application
    Filed: February 3, 2016
    Publication date: January 4, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, Glenn N. DICKINS
  • Publication number: 20180007327
    Abstract: Projection systems and/or methods for efficient use of light by recycling a portion of the light energy for future use are disclosed. In one embodiment, a projection display system is disclosed comprising a light source; an integrating rod that receives light from said light source at a proximal end that comprises a reflective surface which may reflect/recycle light down said integrating rod; a relay optical system, said relay optical system further comprising optical elements that are capable of moving the focal plane of the projector display system; and a modulator comprising at least one moveable mirror that reflects light received from the integrating rod in either a projection direction or a light recycling direction.
    Type: Application
    Filed: December 23, 2015
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Martin J. RICHARDS, Nathan WAINWRIGHT, Douglas J. GORNY, Duane Scott DEWALD
  • Publication number: 20180005641
    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (β_e) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to β_e = ⌈log2(⌈log2(√(K_MAX)·O)⌉ + 1)⌉.
    Type: Application
    Filed: September 12, 2017
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexander KRUEGER, Sven KORDON
  • Publication number: 20180007484
    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (β_e) of bits the HOA data frame representation (C(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (C(k)). Then the lowest integer number of bits is set to β_e = ⌈log2(⌈log2(√(K_MAX)·O)⌉ + 1)⌉.
    Type: Application
    Filed: September 12, 2017
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20180007356
    Abstract: In a system for coding high dynamic range (HDR) images using lower-dynamic range (LDR) images, a reshaping function allows for a more efficient distribution of the codewords in the lower dynamic range images for improved compression. A trim pass of the LDR images by a colorist may satisfy a director's intent for a given “look,” but may also result in unpleasant clipping artifacts in the reconstructed HDR images. Given an original forward reshaping function which maps HDR luminance values to LDR pixel values, a processor identifies areas of potential clipping and generates modified forward and backward reshaping functions to reduce the visibility of potential artifacts from the trim pass process while preserving the director's intent.
    Type: Application
    Filed: September 8, 2017
    Publication date: January 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Harshad KADU, Guan-Ming SU
  • Publication number: 20170373857
    Abstract: The present document relates to transcoding of metadata, and in particular to a method and system for transcoding metadata with reduced computational complexity. A transcoder configured to transcode an inbound bitstream comprising an inbound content frame and an associated inbound metadata frame into an outbound bitstream comprising an outbound content frame and an associated outbound metadata frame is described. The inbound content frame is indicative of a signal encoded according to a first codec system and the outbound content frame is indicative of the signal encoded according to a second codec system. The transcoder is configured to identify an inbound block of metadata from the inbound metadata frame, the inbound block of metadata associated with an inbound descriptor indicative of one or more properties of metadata comprised within the inbound block of metadata, and to generate the outbound metadata frame from the inbound metadata frame based on the inbound descriptor.
    Type: Application
    Filed: September 2, 2017
    Publication date: December 28, 2017
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Andreas SCHNEIDER, Christof FERSCH, Martin WOLTERS, Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Michael GRANT
  • Publication number: 20170371167
    Abstract: Discrete light fiber inputs for high powered image projector display systems are disclosed herein. Various embodiments disclosed herein may employ a bundle of light fiber inputs, a diffuser and reducing relay optic to convert the fiber input array into a smaller pattern of spots that may be interfaced to a projector display system that may perform light recycling. Many embodiments herein may facilitate higher power laser light for illumination and, possibly, recycling. In these embodiments, laser fibers may be individually collimated and illuminate a diffuser. The diffuser spots may be then imaged through a common path relay that can be resized to allow room for the individual lasers and collimation lenses. The diffuser spots may be imaged through holes in a mirror that is on the input side of an integration rod which recycles the light.
    Type: Application
    Filed: December 23, 2015
    Publication date: December 28, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Duane Scott DEWALD, Nathan WAINWRIGHT, Douglas J. GORNY, Martin J. RICHARDS
  • Publication number: 20170374485
    Abstract: The encoding and decoding of HOA signals using Singular Value Decomposition includes forming, based on sound source direction values and an Ambisonics order, corresponding ket vectors (|Y(Ωs)⟩) of spherical harmonics and an encoder mode matrix (ΞO×S). From the audio input signal (|x(Ωs)⟩) a singular threshold value (σS) is determined. On the encoder mode matrix a Singular Value Decomposition is carried out in order to get related singular values which are compared with the threshold value, leading to a final encoder mode matrix rank (rfins). Based on direction values (Ωl) of loudspeakers and a decoder Ambisonics order (Nl), corresponding ket vectors (|Y(Ωl)⟩) and a decoder mode matrix (ΨO×L) are formed. On the decoder mode matrix a Singular Value Decomposition is carried out, providing a final decoder mode matrix rank (rfind).
    Type: Application
    Filed: August 14, 2017
    Publication date: December 28, 2017
    Applicant: Dolby International AB
    Inventors: Holger KROPP, Stefan ABELING
  • Publication number: 20170372707
    Abstract: An error-concealing audio decoding method comprises: receiving a packet comprising a set of MDCT coefficients encoding a frame of time-domain samples of an audio signal; identifying the received packet as erroneous; generating estimated MDCT coefficients to replace the set of MDCT coefficients of the erroneous packet, based on corresponding MDCT coefficients associated with a received packet directly preceding the erroneous packet; assigning signs of a first subset of MDCT coefficients of the estimated MDCT coefficients, wherein the first subset comprises such MDCT coefficients that are associated with tonal-like spectral bins, to coincide with signs of corresponding MDCT coefficients of said preceding packet; randomly assigning signs of a second subset of MDCT coefficients of the estimated MDCT coefficients, wherein the second subset comprises MDCT coefficients associated with noise-like spectral bins; replacing the erroneous packet by a concealment packet containing the estimated MDCT coefficients and the s
    Type: Application
    Filed: December 8, 2015
    Publication date: December 28, 2017
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Arijit BISWAS, Tobias FRIEDRICH, Klaus PEICHL
  • Publication number: 20170372709
    Abstract: Soundfield signals such as Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics (HOA) uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design. A method for improved decoding of an audio soundfield representation for audio playback comprises calculating a panning function (W) using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix (Ξ) from the loudspeaker positions, calculating a pseudo-inverse mode matrix (Ξ+) and decoding the audio soundfield representation. The decoding is based on a decode matrix (D) that is obtained from the panning function (W) and the pseudo-inverse mode matrix (Ξ+).
    Type: Application
    Filed: August 21, 2017
    Publication date: December 28, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Johann-Markus BATKE, Florian KEILER, Johannes BOEHM
  • Publication number: 20170371168
    Abstract: Integrating rod modules are disclosed comprising a plurality of single and/or solid integrating rods that are mated together by straps. Such modules tend to comprise a greater length than the single and/or solid integrating rods and provide good illumination to a modulator when light from a light source is transmitted through the integrating rod module. The straps may comprise a material (e.g., glass) that has substantially the same or similar thermal characteristics as the integrating rods. The straps may be glued to the integrating rods by a glue having a substantially different (e.g., lower) index of refraction than the integrating rods, so as not to disturb the internal reflectance of the rods. The straps may be reinforced by braces that may allow the integrating rod module to be set within a projection display system at an angle substantially different from horizontal.
    Type: Application
    Filed: December 23, 2015
    Publication date: December 28, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Duane Scott DEWALD, Nathan WAINWRIGHT, Douglas J. GORNY, Martin J. RICHARDS
  • Publication number: 20170372719
    Abstract: The present disclosure relates to sibilance detection and mitigation in a voice signal. A method of sibilance detection and mitigation is described. In the method, a predetermined spectrum feature is extracted from a voice signal, the predetermined spectrum feature representing a distribution of signal energy over a voice frequency band. Sibilance is then identified based on the predetermined spectrum feature. Excessive sibilance is further identified from the identified sibilance based on a level of the identified sibilance. Then the voice signal is processed by decreasing a level of the excessive sibilance so as to suppress the excessive sibilance. Corresponding system and computer program products are described as well.
    Type: Application
    Filed: June 19, 2017
    Publication date: December 28, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Kai LI, David GUNAWAN
  • Publication number: 20170373656
    Abstract: A method for generating a perceptual equalization (EQ) filter applicable to an audio signal to equalize the audio signal, including: generating a full EQ filter for use in performing full equalization on the signal; and modifying the frequency-amplitude spectrum of the full EQ filter in accordance with a dip detection threshold function, thereby generating the perceptual EQ filter, where the dip detection threshold function is indicative of minimum perceivable amplitude of each of at least a number of different dips in the frequency-amplitude spectrum of an acoustic signal. Also, a method for equalizing an audio signal, including: generating a full EQ filter for use in performing full equalization on the signal, modifying the frequency-amplitude spectrum of the full EQ filter in accordance with at least one dip detection threshold value, thereby generating a perceptual EQ filter, and applying the perceptual EQ filter to perceptually equalize the signal.
    Type: Application
    Filed: February 17, 2016
    Publication date: December 28, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sunil BHARITKAR, Charles Q. ROBINSON
  • Publication number: 20170374484
    Abstract: Embodiments are directed to a method of rendering adaptive audio by receiving input audio comprising channel-based audio, audio objects, and dynamic objects, wherein the dynamic objects are classified as sets of low-priority dynamic objects and high-priority dynamic objects, rendering the channel-based audio, the audio objects, and the low-priority dynamic objects in a first rendering processor of an audio processing system, and rendering the high-priority dynamic objects in a second rendering processor of the audio processing system. The rendered audio is then subject to virtualization and post-processing steps for playback through soundbars and other similar limited height capable speakers.
    Type: Application
    Filed: February 4, 2016
    Publication date: December 28, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Joshua Brandon LANDO, Freddie SANCHEZ, Alan J SEEFELDT
  • Publication number: 20170374317
    Abstract: Example embodiments disclosed herein relate to spatial congruency adjustment. A method for adjusting spatial congruency in a video conference is disclosed. The method includes unwarping a visual scene captured by a video endpoint device into at least one rectilinear scene, the video endpoint device being configured to capture the visual scene in an omnidirectional manner, and detecting spatial congruency between the at least one rectilinear scene and an auditory scene captured by an audio endpoint device that is positioned in relation to the video endpoint device, the spatial congruency being a degree of alignment between the auditory scene and the at least one rectilinear scene. In response to the detected spatial congruency being below a threshold, the method adjusts the spatial congruency. Corresponding system and computer program products are also disclosed.
    Type: Application
    Filed: November 18, 2015
    Publication date: December 28, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing SUN, Michael ECKERT
  • Publication number: 20170365273
    Abstract: A method of audio source separation from audio content is disclosed. The method includes determining a spatial parameter of an audio source based on a linear combination characteristic of the audio source and an orthogonality characteristic of two or more audio sources to be separated in the audio content. The method also includes separating the audio source from the audio content based on the spatial parameter. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: February 12, 2016
    Publication date: December 21, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jun WANG, David S. MC GRATH
  • Publication number: 20170364752
    Abstract: Image data relating to real-world objects or persons is collected from a scene while collecting audio data relating to the real-world objects or persons from the same scene. The audio data is used to derive sound objects corresponding to the real-world objects or persons. The image data is used to derive video objects corresponding to the real-world objects or persons. Based on the sound objects and the video objects, candidate salient objects are generated. A salient object is selected from among the candidate salient objects. Perceptual enhancement operations are performed on the selected salient object.
    Type: Application
    Filed: June 15, 2017
    Publication date: December 21, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Cong ZHOU, Timo KUNKEL, Cristina Michel VASCO
  • Publication number: 20170365261
    Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
    Type: Application
    Filed: August 7, 2017
    Publication date: December 21, 2017
    Applicant: Dolby International AB
    Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
  • Publication number: 20170365268
    Abstract: A method performed in an audio decoder for decoding M encoded audio channels representing N audio channels is disclosed. The method includes receiving a bitstream containing the M encoded audio channels and a set of spatial parameters, decoding the M encoded audio channels, and extracting the set of spatial parameters from the bitstream. The method also includes analyzing the M audio channels to detect a location of a transient, decorrelating the M audio channels, and deriving N audio channels from the M audio channels and the set of spatial parameters. A first decorrelation technique is applied to a first subset of each audio channel and a second decorrelation technique is applied to a second subset of each audio channel. The first decorrelation technique represents a first mode of operation of a decorrelator, and the second decorrelation technique represents a second mode of operation of the decorrelator.
    Type: Application
    Filed: August 30, 2017
    Publication date: December 21, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Mark F. Davis
  • Publication number: 20170359045
    Abstract: The invention relates to the measurement and control of the perceived sound loudness and/or the perceived spectral balance of an audio signal. An audio signal is modified in response to calculations performed at least in part in the perceptual (psychoacoustic) loudness domain. The invention is useful, for example, in one or more of: loudness-compensating volume control, automatic gain control, dynamic range control (including, for example, limiters, compressors, expanders, etc.), dynamic equalization, and compensating for background noise interference in an audio playback environment. The invention includes not only methods but also corresponding computer programs and apparatus.
    Type: Application
    Filed: August 9, 2017
    Publication date: December 14, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Alan Jeffrey Seefeldt
  • Publication number: 20170353810
    Abstract: An audio object including audio content and object metadata is received. The object metadata indicates an object spatial position of the audio object to be rendered by audio speakers in a playback environment. Based on the object spatial position and source spatial positions of the audio speakers, initial gain values for the audio speakers are determined. The initial gain values can be used to select a set of audio speakers from among the audio speakers. Based on the object spatial position and a set of source spatial positions at which the set of audio speakers are respectively located in the playback environment, a set of non-negative optimized gain values for the set of audio speakers is determined. The audio object at the object spatial position is rendered with the set of optimized gain values for the set of audio speakers.
    Type: Application
    Filed: July 11, 2017
    Publication date: December 7, 2017
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Jun Wang, Giulio Cengarle, Juan Felix Torres, Daniel Arteaga
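
The gain optimization step in the abstract above can be illustrated with a minimal sketch. It assumes the optimized gains are chosen so that the gain-weighted sum of speaker positions approximates the object position, and it uses projected gradient descent to keep every gain non-negative. The function name `nonnegative_gains`, the least-squares objective, and the step size are illustrative assumptions, not the method claimed in the application; the initial-gain speaker selection step is omitted.

```python
def nonnegative_gains(speakers, target, step=0.1, iterations=500):
    """Hypothetical helper: least-squares gains constrained to g >= 0.

    speakers: list of speaker position tuples (assumed layout).
    target:   object position tuple with the same dimensionality.
    """
    gains = [0.0] * len(speakers)
    dims = range(len(target))
    for _ in range(iterations):
        # Residual between the gain-weighted speaker positions and the target.
        residual = [
            sum(g * s[d] for g, s in zip(gains, speakers)) - target[d]
            for d in dims
        ]
        # Gradient step on the squared error, then projection onto g >= 0.
        gains = [
            max(0.0, g - step * 2.0 * sum(r * s[d] for d, r in zip(dims, residual)))
            for g, s in zip(gains, speakers)
        ]
    return gains


# Three speakers on a unit circle (right, front, left) and an object
# positioned between the right and front speakers.
example_gains = nonnegative_gains([(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0)], (0.5, 0.5))
```

In this toy layout the speaker on the far side of the object is driven to zero by the projection rather than receiving a negative gain, which is the property the non-negativity constraint is meant to provide.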
  • Publication number: 20170352330
    Abstract: Techniques for optimizing light output profiles in display systems are described. A light output profile is defined in relation to a plurality of sample locations on an illuminated surface. Point spread functions that satisfy illumination performance values specified in the light output profile in aggregate are computed or derived. A design process that adds or removes optical components to a display light assembly derives an optimal design of a light illumination layer for display systems. Relationships and parameter values determined in the design process may be configured into display systems along with the optical components for the purpose of generating optimized light output profiles in the display systems.
    Type: Application
    Filed: August 22, 2017
    Publication date: December 7, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Henry Hang-Kei Ip, Chun Chi Wan, Ka Wing Terence Lau
  • Publication number: 20170353696
    Abstract: The disclosure relates to handling nuisance in a teleconference system. An endpoint device (400) for use in a teleconference includes an acquiring unit (401), a judging unit (402), a controller (403) and a processing unit (404). The acquiring unit acquires a media stream for presentation in the teleconference, and receives information from another device. The information includes a first estimation on whether the media stream is a nuisance to the teleconference. A nuisance to a teleconference is an audio or video signal that users perceive as not relevant to the conference session or as causing an unpleasant feeling or confusion. The judging unit decides whether the media stream is a nuisance based at least on the information. The controller controls the processing of the media stream to degrade or suppress the presentation of the media stream if the media stream is judged to be a nuisance. The processing unit processes the media stream under the control of the controller.
    Type: Application
    Filed: February 16, 2016
    Publication date: December 7, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Taoran LU, Hariharan GANAPATHY-KATHIRVELU, Peng YIN, Glenn N. DICKINS, Xuejing SUN
  • Publication number: 20170352355
    Abstract: A method for encoding multi-channel HOA audio signals for noise reduction comprises the steps of: decorrelating the channels using an inverse adaptive DSHT, the inverse adaptive DSHT comprising a rotation operation and an inverse DSHT (iDSHT), with the rotation operation rotating the spatial sampling grid of the iDSHT; perceptually encoding each of the decorrelated channels; encoding rotation information, the rotation information comprising parameters defining said rotation operation; and transmitting or storing the perceptually encoded audio channels and the encoded rotation information.
    Type: Application
    Filed: August 24, 2017
    Publication date: December 7, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Johannes BOEHM, Sven KORDON, Alexander KRÜGER, Peter JAX
  • Publication number: 20170344852
    Abstract: A method is disclosed for audio object extraction from audio content having a plurality of channels. The method includes identifying a first set of projection spaces including a first subset for a first channel and a second subset for a second channel of the plurality of channels. The method may further include determining a first set of correlations between the first and second channels, each of the first set of correlations corresponding to one of the first subset of projection spaces and one of the second subset of projection spaces. Still further, the method may include extracting an audio object from an audio signal of the first channel based at least in part on a first correlation among the first set of correlations and the projection space from the first subset corresponding to the first correlation, the first correlation being greater than a first predefined threshold. Corresponding system and computer program products are also disclosed.
    Type: Application
    Filed: December 18, 2015
    Publication date: November 30, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Mingqing HU, Lie LU, Lianwu CHEN
  • Publication number: 20170347125
    Abstract: An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
    Type: Application
    Filed: August 16, 2017
    Publication date: November 30, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexandros TOURAPIS, Athanasios LEONTARIS, Peshala V. PAHALAWATTA, Kevin J. STEC
  • Publication number: 20170346460
    Abstract: The invention relates to the measurement and control of the perceived sound loudness and/or the perceived spectral balance of an audio signal. An audio signal is modified in response to calculations performed at least in part in the perceptual (psychoacoustic) loudness domain. The invention is useful, for example, in one or more of: loudness-compensating volume control, automatic gain control, dynamic range control (including, for example, limiters, compressors, expanders, etc.), dynamic equalization, and compensating for background noise interference in an audio playback environment. The invention includes not only methods but also corresponding computer programs and apparatus.
    Type: Application
    Filed: July 28, 2017
    Publication date: November 30, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Alan Jeffrey Seefeldt
  • Publication number: 20170345432
    Abstract: The present invention relates to a new method and apparatus for improvement of High Frequency Reconstruction (HFR) techniques using frequency translation or folding or a combination thereof. The proposed invention is applicable to audio source coding systems, and offers significantly reduced computational complexity. This is accomplished by means of frequency translation or folding in the subband domain, preferably integrated with spectral envelope adjustment in the same domain. The concept of dissonance guard-band filtering is further presented. The proposed invention offers a low-complexity, intermediate quality HFR method useful in speech and natural audio coding applications.
    Type: Application
    Filed: August 15, 2017
    Publication date: November 30, 2017
    Applicant: Dolby International AB
    Inventors: Lars G. Liljeryd, Per Ekstrand, Fredrik Henn, Kristofer Kjoerling
  • Publication number: 20170347215
    Abstract: Some methods may involve receiving a block of audio data, the block including N pulse code modulated (PCM) audio channels, including audio samples for each of the N channels, receiving metadata associated with the block of audio data and receiving a first set of values corresponding to reference audio samples. A second set of values, corresponding to audio samples from the block of audio data, may be determined. The first and second set of values may be compared. Based on the comparison, it may be determined whether the block of audio data is synchronized with the metadata.
    Type: Application
    Filed: May 23, 2017
    Publication date: November 30, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Kent Bennett TERRY, Scott Gregory NORCROSS, Jeffrey RIEDMILLER
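
The comparison described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the "values" are taken to be one CRC32 per channel computed over the first few 16-bit PCM samples of the block, and the reference values are assumed to arrive alongside the metadata. The function names `block_fingerprints` and `is_synchronized`, the choice of CRC32, and the sample count are hypothetical and do not come from the application.

```python
import struct
import zlib


def block_fingerprints(block, samples_per_channel=8):
    """Return one CRC32 per channel over the first few 16-bit PCM samples.

    block: list of N channels, each a list of signed 16-bit sample values.
    """
    fingerprints = []
    for channel in block:
        head = channel[:samples_per_channel]
        # Pack the samples as little-endian signed 16-bit integers.
        packed = struct.pack("<%dh" % len(head), *head)
        fingerprints.append(zlib.crc32(packed))
    return fingerprints


def is_synchronized(block, reference_values, samples_per_channel=8):
    """Compare values derived from the block with the received reference values."""
    return block_fingerprints(block, samples_per_channel) == list(reference_values)


# Two-channel example block; the reference values stand in for what would
# be received with the metadata.
pcm_block = [[10, -20, 30, -40, 50, -60, 70, -80, 90],
             [1, 2, 3, 4, 5, 6, 7, 8, 9]]
reference = block_fingerprints(pcm_block)
```

A block whose samples have slipped by one position relative to the metadata would produce different per-channel values, so the comparison fails and the block can be flagged as out of sync.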
  • Publication number: 20170339504
    Abstract: Embodiments are described for designing a filter in a magnitude domain performing an impedance filtering function over a frequency domain to compensate for directional cues for the left and right ears of the listener as a function of virtual source angles during headphone virtual sound reproduction. The filter is derived by obtaining blocked ear canal and open ear canal transfer functions for loudspeakers placed in a room, obtaining an open ear canal transfer function for a headphone placed on a listening subject, and dividing the loudspeaker transfer functions by the headphone transfer function to invert a headphone response at the entrance of the ear canal and map the ear canal function from the headphone to free field.
    Type: Application
    Filed: October 28, 2015
    Publication date: November 23, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Sunil BHARITKAR, Louis D. FIELDER
  • Publication number: 20170339429
    Abstract: Encoding and decoding architectures for 3D video delivery are described, such as 2D compatible 3D video delivery and frame compatible 3D video delivery. The architectures include pre-processing stages to pre-process the output of a base layer video encoder and/or decoder and input the pre-processed output into an enhancement layer video encoder and/or decoder of one or more enhancement layers. Multiplexing methods of how to combine the base and enhancement layer videos are also described.
    Type: Application
    Filed: August 11, 2017
    Publication date: November 23, 2017
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexandros TOURAPIS, Peshala V. PAHALAWATTA, Athanasios LEONTARIS, Kevin J. STEC, Walter J. HUSAK
  • Publication number: 20170339505
    Abstract: A control section (1009) receives signaling (S) indicating one of at least two coding formats (F1, F2, F3) of an M-channel audio signal (L, LS, LB, TFL, TBL), the coding formats corresponding to different partitions of the channels of the audio signal into respective first and second groups (601, 602), wherein, in the indicated coding format, first and second channels (L1, L2) of a downmix signal correspond to linear combinations of the first and second groups, respectively; and a decoding section (900) reconstructs the audio signal based on the downmix signal and associated upmix parameters (?L). In the decoding section: a decorrelation input signal (D1, D2, D3) is determined based on the downmix signal and the indicated coding format; and wet and dry upmix coefficients, controlling linear mappings of the downmix signal and a decorrelated signal, generated based on the decorrelation input signal, are determined based on the upmix parameters and the indicated coding format.
    Type: Application
    Filed: October 29, 2015
    Publication date: November 23, 2017
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Heidi-Maria LEHTONEN, Janusz KLEJSA
  • Publication number: 20170338785
    Abstract: The invention relates to the measurement and control of the perceived sound loudness and/or the perceived spectral balance of an audio signal. An audio signal is modified in response to calculations performed at least in part in the perceptual (psychoacoustic) loudness domain. The invention is useful, for example, in one or more of: loudness-compensating volume control, automatic gain control, dynamic range control (including, for example, limiters, compressors, expanders, etc.), dynamic equalization, and compensating for background noise interference in an audio playback environment. The invention includes not only methods but also corresponding computer programs and apparatus.
    Type: Application
    Filed: August 9, 2017
    Publication date: November 23, 2017
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Alan Jeffrey Seefeldt