Dolby Labs Patent Applications

Patents granted to Dolby Labs by the U.S. Patent and Trademark Office (USPTO).

  • Publication number: 20180324543
    Abstract: Embodiments are described for an adaptive audio system that processes audio data comprising a number of independent monophonic audio streams. One or more of the streams has associated with it metadata that specifies whether the stream is a channel-based or object-based stream. Channel-based streams have rendering information encoded by means of channel name; and the object-based streams have location information encoded through location expressions encoded in the associated metadata. A codec packages the independent audio streams into a single serial bitstream that contains all of the audio data. This configuration allows for the sound to be rendered according to an allocentric frame of reference, in which the rendering location of a sound is based on the characteristics of the playback environment (e.g., room size, shape, etc.) to correspond to the mixer's intent.
    Type: Application
    Filed: July 13, 2018
    Publication date: November 8, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Charles Q. ROBINSON, Nicolas R. TSINGOS, Christophe CHABANNE
  • Publication number: 20180324540
    Abstract: Example embodiments disclosed herein relate to content-adaptive surround sound virtualization. A method of virtualizing surround sound is disclosed. The method includes receiving a set of input audio signals, each of the input audio signals being indicative of sound from one of different sound sources, and determining a probability of the set of input audio signals belonging to a predefined audio content category. The method also includes determining a virtualization amount based on the determined probability, the virtualization amount indicating to which extent the set of input audio signals is virtualized as surround sound. The method further includes performing surround sound virtualization on two or more input audio signals in the set based on the determined virtualization amount and generating output audio signals based on the virtualized input audio signals and other input audio signals in the set. Corresponding system and computer program product for virtualizing surround sound are also disclosed.
    Type: Application
    Filed: November 2, 2016
    Publication date: November 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xin Liu, Lie LU, Alan J. Seefeldt
  • Publication number: 20180322889
    Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.
    Type: Application
    Filed: July 19, 2018
    Publication date: November 8, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Publication number: 20180322890
    Abstract: On the basis of a bitstream (P), an n-channel audio signal (X) is reconstructed by deriving an m-channel core signal (Y) and multichannel coding parameters (a) from the bitstream, where 1?m<n. Also derived from the bitstream are pre-processing dynamic range control, DRC, parameters (DRC2) quantifying an encoder-side dynamic range limiting of the core signal. The n-channel audio signal is obtained by parametric synthesis in accordance with the multichannel coding parameters and while cancelling any encoder-side dynamic range limiting based on the pre-processing DRC parameters. In particular embodiments, the reconstruction further includes use of compensated post-processing DRC parameters quantifying a potential decoder-side dynamic range compression. Cancellation of an encoder-side range limitation and range compression are preferably performed by different decoder-side components. Cancellation and compression may be coordinated by a DRC pre-processor.
    Type: Application
    Filed: July 19, 2018
    Publication date: November 8, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey RIEDMILLER, Karl J. ROEDEN, Kristofer KJOERLING, Heiko PURNHAGEN, Vinay MELKOTE, Leif SEHLSTROM
  • Publication number: 20180322679
    Abstract: Systems and methods for overlaying a second image/video data onto a first image/video data are described herein. The first image/video data may be intended to be rendered on a display with certain characteristics—e.g., HDR, EDR, VDR or UHD capabilities. The second image/video data may comprise graphics, closed captioning, text, advertisement—or any data that may be desired to be overlaid and/or composited onto the first image/video data. The second image/video data may be appearance mapped according to the image statistics and/or characteristics of the first image/video data. In addition, such appearance mapping may be made according to the characteristics of the display that the composite data is to be rendered. Such appearance mapping is desired to render a composite data that is visually pleasing to a viewer, rendered upon a desired display.
    Type: Application
    Filed: July 17, 2018
    Publication date: November 8, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Timo KUNKEL, Ning XU, Tao CHEN, Bongsun LEE, Samir N. HULYALKAR
  • Publication number: 20180322886
    Abstract: The present document relates an audio encoding and decoding system (referred to as an audio codec system). In particular, the present document relates to a audio codec system which is particularly well suited for voice encoding/decoding. A transform-based speech encoder is configured to encode a speech signal into a bitstream is described. A speech decoder configured to decode audio signals from a bitstream is further described.
    Type: Application
    Filed: July 11, 2018
    Publication date: November 8, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars VILLEMOES, Janusz KLEJSA, Per HEDELIN
  • Publication number: 20180315434
    Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
    Type: Application
    Filed: July 5, 2018
    Publication date: November 1, 2018
    Applicant: Dolby International AB
    Inventors: Per Ekstrand, Lars Villemoes
  • Publication number: 20180314205
    Abstract: Dual or multi-modulation display systems are disclosed that comprise projector systems with at least one modulator that may employ non-mechanical beam steering modulation. Many embodiments disclosed herein employ a non-mechanical beam steering and/or polarizer to provide for a highlights modulator.
    Type: Application
    Filed: April 23, 2018
    Publication date: November 1, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Douglas J. Gorny, Martin J. Richards
  • Publication number: 20180315432
    Abstract: For converting a channel-based 3D audio signal to a higher-order Ambisonics HOA audio signal, the channel-based 3D audio signal is transformed (21) from time domain to frequency domain. A primary ambient decomposition (22) is carried out for three-channel triplets of blocks of the domain channel-based 3D audio signal, wherein directional signals and ambient signals are provided (37) for each triplet. From the directional signals directional information of a total directional signal for each triple is derived (23). That total directional signal is HOA encoded (25) according to the derived directions, and ambient signals are HOA encoded (24) according to channel positions. The HOA coefficients of the HOA encoded directional signal and the HOA coefficients of the HOA encoded ambient signal are superimposed (27) in order to obtain a HOA coefficients signal for the channel-based 3D audio signal, followed by a transformation (26) into time domain.
    Type: Application
    Filed: November 16, 2016
    Publication date: November 1, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Johannes BOEHM, Xiaoming CHEN
  • Publication number: 20180308496
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Application
    Filed: October 7, 2016
    Publication date: October 25, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20180310027
    Abstract: In a method to improve the dynamic range of high-dynamic range (HDR) signals using an enhancement layer, a piecewise-linear inter-layer predictor and a residual masking operator are applied. The generation of the piecewise-linear inter-layer prediction function is based on a computed scene-significance histogram based on the average of frame-significance histograms indicating pixel values where coding artifacts are most likely to occur. For each segment in the prediction function, its slope is inversely proportional to a measure of energy in the segment under the scene-significance histogram. Bit rate constrains for the enhancement layer are also taken into consideration in determining the piecewise-linear prediction function.
    Type: Application
    Filed: October 26, 2016
    Publication date: October 25, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Navaneeth KAMBALLUR KOTTAYIL, Guan-Ming SU
  • Publication number: 20180308498
    Abstract: Soundfield signals such as e.g. Ambisonics carry a representation of a desired sound field. The Ambisonics format is based on spherical harmonic decomposition of the soundfield, and Higher Order Ambisonics (HOA) uses spherical harmonics of at least 2nd order. However, commonly used loudspeaker setups are irregular and lead to problems in decoder design. A method for improved decoding an audio soundfield representation for audio playback comprises calculating a panning function (W) using a geometrical method based on the positions of a plurality of loudspeakers and a plurality of source directions, calculating a mode matrix (?) from the loudspeaker positions, calculating a pseudo-inverse mode matrix (?+) and decoding the audio soundfield representation. The decoding is based on a decode matrix (D) that is obtained from the panning function (W) and the pseudo-inverse mode matrix (?+).
    Type: Application
    Filed: June 26, 2018
    Publication date: October 25, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Johann-Markus BATKE, Florian KEILER, Johannes BOEHM
  • Publication number: 20180308500
    Abstract: When compressing an HOA data frame representation, a gain control (15, 151) is applied for each channel signal before it is perceptually encoded (16). The gain values are transferred in a differential manner as side information. However, for starting decoding of such streamed compressed HOA data frame representation absolute gain values are required, which should be coded with a minimum number of bits. For determining such lowest integer number (?e) of bits the HOA data frame representation (c(k)) is rendered in spatial domain to virtual loudspeaker signals lying on a unit sphere, followed by normalisation of the HOA data frame representation (c(k)). Then the lowest integer number of bits is set to ?e=?log2(?log2(?{square root over (KMAX)}·O)?+1)?.
    Type: Application
    Filed: June 26, 2018
    Publication date: October 25, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexander KRUEGER, Sven KORDON
  • Publication number: 20180310023
    Abstract: Encoding and decoding architectures for 3D video delivery are described, such as 2D compatible 3D video delivery and frame compatible 3D video delivery. The architectures include pre-processing stages to pre-process the output of a base layer video encoder and/or decoder and input the pre-processed output into an enhancement layer video encoder and/or decoder of one or more enhancement layers. Multiplexing methods of how to combine the base and enhancement layer videos are also described.
    Type: Application
    Filed: June 18, 2018
    Publication date: October 25, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexandros TOURAPIS, Peshala V. PAHALAWATTA, Athanasios LEONTARIS, Kevin J. STEC, Walter J. HUSAK
  • Publication number: 20180308507
    Abstract: Example embodiments disclosed herein relate to audio signal processing with low latency. A method of processing an audio signal is disclosed. The method includes obtaining frequency parameters of a current frame of the audio signal. The method also includes generating intermediate frequency domain outputs for a set of predefined frequency bands based on the frequency parameters using predefined frequency band filter banks, a frequency band filter bank being specific to a respective frequency band in the set. The method further includes determining frequency band energies for the set of predefined frequency bands based on the intermediate frequency domain outputs, and processing the current frame based on the determined frequency band energies. Corresponding system, computer program product, and device for processing an audio signal are also disclosed.
    Type: Application
    Filed: January 13, 2017
    Publication date: October 25, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Zhiwei SHUANG, David S. MCGRATH, Michael William MASON
  • Publication number: 20180310112
    Abstract: The invention improves HOA sound field representation compression. The HOA representation is analysed for the presence of dominant sound sources and their directions are estimated. Then the HOA representation is decomposed into a number of dominant directional signals and a residual component. This residual component is transformed into the discrete spatial domain in order to obtain general plane wave functions at uniform sampling directions, which are predicted from the dominant directional signals. Finally, the prediction error is transformed back to the HOA domain and represents the residual ambient HOA component for which an order reduction is performed, followed by perceptual encoding of the dominant directional signals and the residual component.
    Type: Application
    Filed: June 26, 2018
    Publication date: October 25, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexander KRUEGER, Sven KORDON, Johannes BOEHM
  • Publication number: 20180302623
    Abstract: Techniques are provided to encode and decode image data comprising a tone mapped (TM) image with HDR reconstruction data in the form of luminance ratios and color residual values. In an example embodiment, luminance ratio values and residual values in color channels of a color space are generated on an individual pixel basis based on a high dynamic range (HDR) image and a derivative tone-mapped (TM) image that comprises one or more color alterations that would not be recoverable from the TM image with a luminance ratio image. The TM image with HDR reconstruction data derived from the luminance ratio values and the color-channel residual values may be outputted in an image file to a downstream device, for example, for decoding, rendering, and/or storing. The image file may be decoded to generate a restored HDR image free of the color alterations.
    Type: Application
    Filed: June 18, 2018
    Publication date: October 18, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Wenhui JIA, Ajit NINAN, Arkady TEN, Gregory John WARD
  • Publication number: 20180301156
    Abstract: Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which at least comprises one or more audio objects (106a). The encoder (108, 110) generates a bit stream (116) which comprises downmix signals (112) and side information which includes individual matrix elements (114) of a reconstruction matrix which enables reconstruction of the one or more audio objects (106a) in the decoder (120).
    Type: Application
    Filed: June 21, 2018
    Publication date: October 18, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Leif Jonas SAMUELSSON, Toni HIRVONEN
  • Publication number: 20180301157
    Abstract: Example embodiments disclosed herein relate to impulsive noise suppression. A method of impulsive noise suppression in an audio signal is disclosed. The method includes determining an impulsive noise related feature from a current frame of the audio signal. The method also includes detecting an impulsive noise in the current frame based on the impulsive noise related feature, and in response to detecting the impulsive noise in the current frame, applying a suppression gain to the current frame to suppress the impulsive noise. Corresponding system and computer program product of impulsive noise suppression in an audio signal are also disclosed.
    Type: Application
    Filed: April 27, 2016
    Publication date: October 18, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: David GUNAWAN, Dong SHI, Glenn N. DICKINS
  • Publication number: 20180295240
    Abstract: Teleconference audio data including a plurality of individual uplink data packet streams, may be received during a teleconference. Each uplink data packet stream may corresponding to a telephone endpoint used by one or more teleconference participants. The teleconference audio data may be analyzed to determine a plurality of suppressive gain coefficients, which may be applied to first instances of the teleconference audio data during the teleconference, to produce first gain-suppressed audio data provided to the telephone endpoints during the teleconference. Second instances of the teleconference audio data, as well as gain coefficient data corresponding to the plurality of suppressive gain coefficients, may be sent to a memory system as individual uplink data packet streams. The second instances of the teleconference audio data may be less gain-suppressed than the first gain-suppressed audio data.
    Type: Application
    Filed: June 15, 2016
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Glenn N. DICKINS, Richard J. CARTWRIGHT
  • Publication number: 20180295241
    Abstract: Embodiments are described for a soundfield system that receives a transmitting soundfield, wherein the transmitting soundfield includes a sound source at a location in the transmitting soundfield. The system determines a rotation angle for rotating the transmitting soundfield based on a desired location for the sound source. The transmitting soundfield is rotated by the determined angle and the system obtains a listener's soundfield based on the rotated transmitting soundfield. The listener's soundfield is transmitted for rendering to a listener.
    Type: Application
    Filed: April 18, 2018
    Publication date: October 11, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Richard J. CARTWRIGHT
  • Publication number: 20180295464
    Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
    Type: Application
    Filed: June 14, 2018
    Publication date: October 11, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen BREEBAART, Lie LU, Nicolas R. TSINGOS, Antonio MATEOS SOLE
  • Publication number: 20180292663
    Abstract: A projector system comprising a laser light source, a collimating lens, a fly-eye lens, an integrating rod and a first modulator is disclosed. The light from a laser light source/fiber illuminates a collimator to substantially collimate the light and then is transmitted through a fly's-eye lens. The fly's-eye lens provides a desired angular/spatial light distribution for further processing to a first modulator of the projector system.
    Type: Application
    Filed: October 11, 2016
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Martin J. RICHARDS, Duane Scott DEWALD, Nathan WAINWRIGHT, Barret LIPPEY
  • Publication number: 20180295330
    Abstract: Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller, a first modulator, receiving light from the light source and rendering a halftone image of said the input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from input image, correcting for misalignment between the first and second modulators and calibrating the projector system—e.g. over time—for continuous image improvement are also disclosed.
    Type: Application
    Filed: May 31, 2018
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Jerome SHIELDS, Martin J. RICHARDS, Juan P. PERTIERRA
  • Publication number: 20180295352
    Abstract: A spatial direction of a wearable device that represents an actual viewing direction of the wearable device is determined. The spatial direction of the wearable device is used to select, from a multi-view image comprising single-view images, a set of single-view images. A display image is caused to be rendered on a device display of the wearable device. The display image represents a single-view image as viewed from the actual viewing direction of the wearable device. The display image is constructed based on the spatial direction of the wearable device and the set of single-view images.
    Type: Application
    Filed: April 10, 2018
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, Neil Mammen
  • Publication number: 20180295351
    Abstract: A wearable device comprises a left view optical stack for a viewer to view left view cinema display images rendered on a cinema display and a right view optical stack for the viewer to view right view cinema display images rendered on the cinema display. The left view cinema display images and the right view cinema display images form stereoscopic cinema images. The wearable device further comprises a left view imager that renders left view device display images, to the viewer, on a device display, and a right view imager that renders right view device display images, to the viewer, on the device display. The left view device display images and the right view device display images form stereoscopic device images complementary to the stereoscopic cinema images.
    Type: Application
    Filed: April 4, 2018
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, Neil MAMMEN
  • Publication number: 20180295329
    Abstract: Projection systems and/or methods comprising a blurring element are disclosed In one embodiment, a blurring element may comprise a first plate having a pattern on a first surface and second plate. The first plate and the second plate may comprise material having a slight difference in their respective index of refraction. In another embodiment, a blurring element may comprise a first plate having a pattern thereon and a second immersing material. The blurring element may be placed in between two modulators in a dual or multi-modulator projector system. The blurring element may be configured to give a desired shape to the light transmitted from a first modulator to a second modulator.
    Type: Application
    Filed: May 10, 2016
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Martin J. RICHARDS, Nathan Shawn WAINWRIGHT, Duane Scott DEWALD, Barret LIPPEY, Brad WALKER
  • Publication number: 20180295459
    Abstract: A computer implemented system for rendering captured audio soundfields to a listener comprises apparatus to deliver the audio soundfields to the listener. The delivery apparatus delivers the audio soundfields to the listener with first and second audio elements perceived by the listener as emanating from first and second virtual source locations, respectively, and with the first audio element and/or the second audio element delivered to the listener from a third virtual source location. The first virtual source location and the second virtual source location are perceived by the listener as being located to the front of the listener, and the third virtual source location is located to the rear or the side of the listener.
    Type: Application
    Filed: June 14, 2018
    Publication date: October 11, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, David S. MCGRATH, Glenn N. DICKINS
  • Publication number: 20180293752
    Abstract: At a first time point, a first light capturing device at a first spatial location in a three-dimensional (3D) space captures first light rays from light sources located at designated spatial locations on a viewer device in the 3D space. At the first time point, a second light capturing device at a second spatial location in the 3D space captures second light rays from the light sources located at the designated spatial locations on the viewer device in the 3D space. Based on the first light rays captured by the first light capturing device and the second light rays captured by the second light capturing device, at least one of a spatial position and a spatial direction, at the first time point, of the viewer device is determined.
    Type: Application
    Filed: April 10, 2018
    Publication date: October 11, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, Neil MAMMEN
  • Publication number: 20180287910
    Abstract: In a packet switched voice delivery application which utilizes a jitter buffer for the delivery of sequential packet data, a method of determining a measure of the output jitter of taking packets out of the buffer, the method including the step of: (a) forming a pull jitter measure comprising the differential fetch times between sequential pull packets dived by an expected time interval between packets.
    Type: Application
    Filed: September 27, 2016
    Publication date: October 4, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xuejing Sun, JiaQuan Huo, Paul Holmberg
  • Publication number: 20180287918
    Abstract: This disclosure falls into the field of voice communication systems, more specifically it is related to the field of voice quality estimation in a packet based voice communication system. In particular the disclosure provides methods, computer program products and devices for reducing a prediction error of the voice quality estimation by considering forward error correction of lost voice packets.
    Type: Application
    Filed: May 5, 2016
    Publication date: October 4, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Doh-Suk KIM
  • Publication number: 20180284590
    Abstract: The present invention provides a cinema screen that improves audience perception of brightness at, for example, a premium theater without additional illumination cost. The screen is produced from materials that also help mitigate speckle from laser illumination. The screen has properties and includes structures that may be tuned to the specific capabilities of the projection system, arrangement of the theater, and projector (and angle of projection, angle of viewing). Light reflected from the screen are direct toward audience members and away from walls and ceilings.
    Type: Application
    Filed: June 5, 2018
    Publication date: October 4, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Douglas J. GORNY, Martin J. RICHARDS, Timo KUNKEL, David Lloyd SCHNUELLE
  • Publication number: 20180277127
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Application
    Filed: October 7, 2016
    Publication date: September 27, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20180278963
    Abstract: In a method to reconstruct a high dynamic range video signal, a decoder receives a base layer standard dynamic range video signal, an enhancement layer video signal, a metadata bitstream for a reference processing unit and a CRC code related to the metadata. A decoder reconstructs a high-dynamic range video output signal based on the base layer video signal, the enhancement layer video signal, and the data syntax and metadata specified by the metadata bitstream.
    Type: Application
    Filed: November 1, 2016
    Publication date: September 27, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Klaas Heinrich SCHUEUER, BROOKS David, Satoshi TESHIGAWARA, Tao CHEN
  • Publication number: 20180277047
    Abstract: A display system with temperature compensation includes (a) a backlight unit containing a light emitting diode (LED) array, (b) a liquid crystal display (LCD) containing a plurality of pixels for spatially modulating, according to respective LCD drive values of the pixels, transmission of light generated by the LED array, (c) a plurality of temperature probes mounted to the backlight unit for measuring a respective plurality of temperatures at the LED array, (d) a light-field simulator for simulating, at least in part based upon the temperatures, a light field at the LCD as generated by the LED array, and (e) an LCD drive solver for processing a target image and the light field simulated by the light-field simulator, to determine the LCD drive values required to display the target image as compensated for temperatures of the LED array.
    Type: Application
    Filed: March 20, 2018
    Publication date: September 27, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Qiqin DAI, Jon S. McElvain
  • Publication number: 20180279063
    Abstract: A method for processing audio data, the method comprising: receiving audio data corresponding to a plurality of instances of audio, including at least one of: (a) audio data from multiple endpoints, recorded separately or (b) audio data from a single endpoint corresponding to multiple talkers and including spatial information for each of the multiple talkers; rendering the audio data in a virtual acoustic space such that each of the instances of audio has a respective different virtual position in the virtual acoustic space; and scheduling the instances of audio to be played back with a playback overlap between at least two of the instances of audio, wherein the scheduling is performed, at least in part, according to a set of perceptually-motivated rules.
    Type: Application
    Filed: February 3, 2016
    Publication date: September 27, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Xuejing SUN, Richard J. CARTWRIGHT, Michael P. HOLLIER, Michael ECKERT
  • Publication number: 20180278930
    Abstract: Inter-color image prediction is based on multi-channel multiple regression (MMR) models. Image prediction is applied to the efficient coding of images and video signals of high dynamic range. MMR models may include first order parameters, second order parameters, and cross-pixel parameters. MMR models using extension parameters incorporating neighbor pixel relations are also presented. Using minimum means-square error criteria, closed form solutions for the prediction parameters are presented for a variety of MMR models.
    Type: Application
    Filed: May 24, 2018
    Publication date: September 27, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Guan-Ming SU, Sheng QU, Hubert KOEPFER, Yufei YUAN, Samir HULYALKAR
  • Publication number: 20180277128
    Abstract: The present invention relates to a new method and apparatus for improvement of High Frequency Reconstruction (HFR) techniques using frequency translation or folding or a combination thereof. The proposed invention is applicable to audio source coding systems, and offers significantly reduced computational complexity. This is accomplished by means of frequency translation or folding in the subband domain, preferably integrated with spectral envelope adjustment in the same domain. The concept of dissonance guard-band filtering is further presented. The proposed invention offers a low-complexity, intermediate quality HFR method useful in speech and natural audio coding applications.
    Type: Application
    Filed: May 24, 2018
    Publication date: September 27, 2018
    Applicant: Dolby International AB
    Inventors: Lars G. Liljeryd, Per Ekstrand, Fredrik Henn, Kristofer Kjoerling
  • Publication number: 20180268829
    Abstract: Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive, perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects, and typically also a default set of objects which will be rendered in the absence of a selection by a user) and a bed of speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
    Type: Application
    Filed: May 24, 2018
    Publication date: September 20, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robert Andrew FRANCE, Thomas ZIEGLER, Sripal S. MEHTA, Andrew Jonathan DOWELL, Prinyar SAUNGSOMBOON, Michael David DWYER, Farhad FARAHANI, Nicolas R. Tsingos, Freddie SANCHEZ
  • Publication number: 20180268827
    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.
    Type: Application
    Filed: October 7, 2016
    Publication date: September 20, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Sven KORDON, Alexander KRUEGER
  • Publication number: 20180270600
    Abstract: For generating 3D audio content from a two-channel stereo signal, the stereo signal (x(t)) is partitioned into overlapping sample blocks and is transformed into time-frequency domain. From the stereo signal directional and ambient signal components are separated, wherein the estimated directions of the directional components are changed by a predetermined factor, wherein, if changes are within a predetermined interval, they are combined in order to form a directional centre channel object signal. For the other directions an encoding to Higher Order Ambisonics (HOA) is performed. Additional ambient signal channels are generated by de-correlation and rating by gain factors, followed by encoding to HOA. The directional HOA signals and the ambient HOA signals are combined, and the combined HOA signal and the centre channel object signals are transformed to time domain.
    Type: Application
    Filed: September 29, 2016
    Publication date: September 20, 2018
    Applicant: DOLBY INTERNATIONAL
    Inventors: Johannes BOEHM, Xiaoming CHEN
  • Publication number: 20180270598
    Abstract: Audio perception in local proximity to visual cues is provided. A device includes a video display, first row of audio transducers, and second row of audio transducers. The first and second rows can be vertically disposed above and below the video display. An audio transducer of the first row and an audio transducer of the second row form a column to produce, in concert, an audible signal. The perceived emanation of the audible signal is from a plane of the video display (e.g., a location of a visual cue) by weighing outputs of the audio transducers of the column. In certain embodiments, the audio transducers are spaced farther apart at a periphery for increased fidelity in a center portion of the plane and less fidelity at the periphery.
    Type: Application
    Filed: October 19, 2016
    Publication date: September 20, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Christophe Chabanne, Nicolas R. Tsingos, Charles Q. Robinson
  • Publication number: 20180268831
    Abstract: An encoding system (400) encodes an N-channel audio signal (X), wherein N?3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters ({tilde over (C)}, {tilde over (P)}). In a decoding system (200), a decorrelating section (101) outputs, based on the downmix signal, an (N?1)-channel decorrelated signal (Z); a dry upmix section (102) maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section (103) populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section (104) combines outputs from the upmix sections to obtain a reconstructed signal ({circumflex over (X)}) corresponding to the signal to be reconstructed.
    Type: Application
    Filed: May 21, 2018
    Publication date: September 20, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars VILLEMOES, Heidi-Maria LEHTONEN, Heiko PURNHAGEN, Toni HIRVONEN
  • Publication number: 20180270451
    Abstract: Systems and methods are described for detecting and remedying potential incongruence in a video conference. A camera of a video conferencing system may capture video images of a conference room. A processor of the video conferencing system may identify locations of a plurality of participants within an image plane of a video image. Using face and shape detection, a location of a center point of each identified participant's torso may be calculated. A region of congruence bounded by key parallax lines may be calculated, the key parallax lines being a subset of all parallax lines running through the center points of each identified participant. When the audio device location is not within the region of congruence, audio captured by an audio device may be adjusted to reduce effects of incongruence when the captured audio is replayed at a far end of the video conference.
    Type: Application
    Filed: March 12, 2018
    Publication date: September 20, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Glenn N. DICKINS, Ludovic Christophe MALFAIT, David GUNAWAN
  • Publication number: 20180262551
    Abstract: In one embodiment, a method for optimizing delivery of a digital program having a plurality of selectable program components includes delivering to a first node a composite set of program components, assembling from the composite set first and second subsets of program components, the first and second subsets differing by at least one program component, delivering the first subset of program components to a first user, and delivering the second subset of program components to a second user. The program components relate to multiple program categories and each of the multiple program categories is associated with a program presentation aspect and comprises a plurality of selections.
    Type: Application
    Filed: September 21, 2016
    Publication date: September 13, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Michael F. DEMEYER, Timothy E. ONDERS
  • Publication number: 20180262856
    Abstract: Example embodiments disclosed herein relates to upmixing of audio signals. A method of upmixing an audio signal is described. The method includes decomposing the audio signal into a diffuse signal and a direct signal, generating an audio bed at least in part based on the diffuse signal, the audio bed including a height channel, extracting an audio object from the direct signal, estimating metadata of the audio object, the metadata including height information of the audio object; and rendering the audio bed and the audio object as an upmixed audio signal, wherein the audio bed is rendered to a predefined position and the audio object is rendered according to the metadata. Corresponding system and computer program product are described as well.
    Type: Application
    Filed: February 9, 2016
    Publication date: September 13, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jun WANG, Lie LU, Lianwu CHEN, Mingqing HU
  • Publication number: 20180262769
    Abstract: Pixel data of a video sequence with enhanced dynamic range (EDR) are predicted based on pixel data of a corresponding video sequence with standard dynamic range (SDR) and an inter-layer predictor. Under a highlights clipping constrain, conventional SDR to EDR prediction is adjusted as follows: a) given a highlights threshold, the SDR to EDR predictor is adjusted to output a fixed output value for all input SDR pixel values larger than the highlights threshold, and b) given a dark-regions threshold, the residual values between the input EDR signal and its predicted value are set to zero for all input SDR pixel values lower than the dark-regions threshold. Example processes to determine the highlights and dark-regions thresholds and whether highlights clipping is occurring are provided.
    Type: Application
    Filed: February 16, 2016
    Publication date: September 13, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Guan-Ming SU, Qian CHEN
  • Publication number: 20180261184
    Abstract: A handheld imaging device has a data receiver that is configured to receive reference encoded image data. The data includes reference code values, which are encoded by an external coding system. The reference code values represent reference gray levels, which are being selected using a reference grayscale display function that is based on perceptual non-linearity of human vision adapted at different light levels to spatial frequencies. The imaging device also has a data converter that is configured to access a code mapping between the reference code values and device-specific code values of the imaging device. The device-specific code values are configured to produce gray levels that are specific to the imaging device. Based on the code mapping, the data converter is configured to transcode the reference encoded image data into device-specific image data, which is encoded with the device-specific code values.
    Type: Application
    Filed: September 21, 2016
    Publication date: September 13, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Ajit NINAN
  • Publication number: 20180255206
    Abstract: Methods and systems for color transforms are disclosed. A memory footprint of look up tables for color transforms can be reduced by separating the look up tables into factors, applying frequency domain transforms, dividing the look up tables into zones, or establishing hierarchical levels with increasing resolution. The methods can be applied to still image or video cameras with limited computation resources that can benefit from reduced memory footprints.
    Type: Application
    Filed: September 29, 2016
    Publication date: September 6, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Joonsoo KIM, Jon S. MCELVAIN
  • Publication number: 20180254047
    Abstract: Systems, methods, and computer program products of audio processing based on Adaptive Intermediate Spatial Format (AISF) are described. The AISF is an extension to ISF that allows spatial resolution around an ISF ring to be adjusted dynamically with respect to content of incoming audio objects. An AISF encoder device adaptively warps each ISF ring during ISF encoding to adjust angular distance between objects, resulting in increase in uniformity of energy distribution around the ISF ring. At an AISF decoder device, matrices that decode sound positions to the output speaker take into account the warping that was performed at the AISF encoder device to reproduce the true positions of sound sources.
    Type: Application
    Filed: February 22, 2018
    Publication date: September 6, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Juan Felix TORRES, David S. MCGRATH, Michael William MASON