Dolby Labs Patent Applications

Patent applications filed by Dolby Labs and published by the U.S. Patent and Trademark Office (USPTO).

  • Publication number: 20180103333
    Abstract: Example embodiments relate to audio object extraction. A method for audio object extraction from audio content is disclosed. The method comprises determining a sub-band object probability for a sub-band of the audio signal in a frame of the audio content, the sub-band object probability indicating a probability of the sub-band of the audio signal containing an audio object. The method further comprises splitting the sub-band of the audio signal into an audio object portion and a residual audio portion based on the determined sub-band object probability. A corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: October 16, 2017
    Publication date: April 12, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Lie LU
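One way to picture the split described in publication 20180103333 is as a soft mask driven by the sub-band object probability. The sketch below is only an illustration: the function name `split_subband` and the linear weighting are assumptions, since the abstract does not say how the probability controls the split.

```python
def split_subband(samples, object_probability):
    """Split one sub-band into object and residual parts (hypothetical soft mask)."""
    if not 0.0 <= object_probability <= 1.0:
        raise ValueError("object_probability must lie in [0, 1]")
    # Assumption: the object portion scales linearly with the probability.
    object_part = [object_probability * s for s in samples]
    # The residual is whatever remains, so the two portions sum back to the input.
    residual_part = [s - o for s, o in zip(samples, object_part)]
    return object_part, residual_part
```

With this weighting, a probability of 1.0 sends the whole sub-band to the object portion and a probability of 0.0 leaves it entirely in the residual.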
  • Publication number: 20180103253
    Abstract: Methods to improve the quality of coding high-dynamic range (HDR) signals are presented. Instead of using a single chroma quantization table for all color formats, a video encoder may adaptively use separate tables for each one, and transmit the table's ID to a decoder. Examples for chroma quantization tables for video content encoded in the YCbCr (PQ) and ICtCp (PQ) color formats under a variety of color gamut containers are provided.
    Type: Application
    Filed: October 10, 2017
    Publication date: April 12, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Taoran LU, Fangjun PU, Peng YIN, Tao CHEN
  • Publication number: 20180098169
    Abstract: A multi-channel decoder for generating a binaural signal from a downmix signal using upmix rule information on an energy-error introducing upmix rule for calculating a gain factor based on the upmix rule information and characteristics of head related transfer function based filters corresponding to upmix channels. The one or more gain factors are used by a filter processor for filtering the downmix signal so that an energy corrected binaural signal having a left binaural channel and a right binaural channel is obtained.
    Type: Application
    Filed: November 21, 2017
    Publication date: April 5, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Lars VILLEMOES
  • Publication number: 20180095718
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether loudness parameters for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameters based on the one or more characteristics.
    Type: Application
    Filed: December 7, 2017
    Publication date: April 5, 2018
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey RIEDMILLER, Scott Gregory NORCROSS, Karl Jonas ROEDEN
  • Publication number: 20180098094
    Abstract: An SDR CDF is constructed based on an SDR histogram generated from a distribution of SDR codewords in SDR images. An HDR CDF is constructed based on an HDR histogram generated from a distribution of HDR codewords in HDR images that correspond to the SDR images. A histogram transfer function is generated based on the SDR CDF and the HDR CDF. The SDR images are transmitted along with backward reshaping metadata to recipient devices. The backward reshaping metadata is generated based at least in part on the histogram transfer function.
    Type: Application
    Filed: October 4, 2017
    Publication date: April 5, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Bihan WEN, Harshad KADU, Guan-Ming SU
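The CDF-based construction in publication 20180098094 resembles classic histogram matching. The sketch below is a minimal version of that general technique, assuming non-empty histograms given as lists of bin counts; the function names `build_cdf` and `histogram_transfer` are hypothetical, and the application's actual transfer function may differ.

```python
from bisect import bisect_left
from itertools import accumulate

def build_cdf(histogram):
    """Normalized cumulative distribution from a list of bin counts."""
    total = sum(histogram)
    return [c / total for c in accumulate(histogram)]

def histogram_transfer(sdr_hist, hdr_hist):
    """Map each SDR codeword to the first HDR codeword whose CDF reaches the SDR CDF value."""
    sdr_cdf = build_cdf(sdr_hist)
    hdr_cdf = build_cdf(hdr_hist)
    last = len(hdr_cdf) - 1
    # bisect_left finds the smallest HDR bin with hdr_cdf[bin] >= p.
    return [min(bisect_left(hdr_cdf, p), last) for p in sdr_cdf]
```

The returned list acts as a look-up table from SDR codeword index to HDR codeword index, which is the kind of mapping a backward reshaping step could apply.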
  • Publication number: 20180098170
    Abstract: A multi-channel decoder for generating a binaural signal from a downmix signal using upmix rule information on an energy-error introducing upmix rule for calculating a gain factor based on the upmix rule information and characteristics of head related transfer function based filters corresponding to upmix channels. The one or more gain factors are used by a filter processor for filtering the downmix signal so that an energy corrected binaural signal having a left binaural channel and a right binaural channel is obtained.
    Type: Application
    Filed: November 22, 2017
    Publication date: April 5, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Lars VILLEMOES
  • Publication number: 20180096692
    Abstract: There are provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
    Type: Application
    Filed: November 22, 2017
    Publication date: April 5, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Kristofer KJOERLING, Toni HIRVONEN, Lars VILLEMOES, Dirk Jeroen BREEBAART
  • Publication number: 20180098046
    Abstract: Given existing color remapping information (CRI) messaging variables, methods are described to communicate color volume information for a targeted display to a downstream receiver. Bits 7:0 of the 32-bit colour_remap_id are used to extract a first value. If the first value is not a reserved value, then the first value is used as an index to a look-up table to generate a first luminance value for a targeted display, otherwise a second value is generated based on bits 31:9 in the colour_remap_id messaging variable and the first luminance value for a targeted display is generated based on the second value. The methods may be applied to communicate via CRI messaging a minimum luminance value, a maximum luminance value, and color primaries information of the targeted display.
    Type: Application
    Filed: October 3, 2017
    Publication date: April 5, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Walter J. Husak, Robin Atkins, Peng Yin, Taoran Lu, Tao Chen
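The bit-field logic in publication 20180098046 can be sketched as follows. The bit ranges (7:0 and 31:9) come from the abstract; the reserved value `RESERVED_INDEX`, the contents of `LUMINANCE_LUT`, and the direct use of the second value as a luminance are all hypothetical placeholders, not values taken from the application.

```python
RESERVED_INDEX = 0xFF  # hypothetical reserved value, not from the application
LUMINANCE_LUT = {0: 100.0, 1: 1000.0, 2: 4000.0}  # hypothetical table, in nits

def first_luminance(colour_remap_id):
    """Derive a target-display luminance from a 32-bit colour_remap_id."""
    first_value = colour_remap_id & 0xFF  # bits 7:0
    if first_value != RESERVED_INDEX:
        # Non-reserved value: use it as an index into the look-up table.
        return LUMINANCE_LUT[first_value]
    second_value = (colour_remap_id >> 9) & 0x7FFFFF  # bits 31:9 (23 bits)
    # Assumption: the second value encodes the luminance directly, in nits.
    return float(second_value)
```

An index missing from the hypothetical table raises `KeyError` here; a real decoder would need a defined behavior for that case.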
  • Publication number: 20180091914
    Abstract: A multi-channel decoder for generating a binaural signal from a downmix signal using upmix rule information on an energy-error introducing upmix rule for calculating a gain factor based on the upmix rule information and characteristics of head related transfer function based filters corresponding to upmix channels. The one or more gain factors are used by a filter processor for filtering the downmix signal so that an energy corrected binaural signal having a left binaural channel and a right binaural channel is obtained.
    Type: Application
    Filed: November 21, 2017
    Publication date: March 29, 2018
    Applicant: Dolby International AB
    Inventor: Lars Villemoes
  • Publication number: 20180091803
    Abstract: A projection display system includes a spatial modulator that is controlled to compensate for flare in a lens of the projector. The spatial modulator increases achievable intra-frame contrast and facilitates increased peak luminance without unacceptable black levels. Some embodiments provide 3D projection systems in which the spatial modulator is combined with a polarization control panel.
    Type: Application
    Filed: November 29, 2017
    Publication date: March 29, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Gregory John Ward, Robin Atkins
  • Publication number: 20180082429
    Abstract: Motion characteristics related to the images are determined. A motion characteristics metadata portion is generated based on the motion characteristics, and is to be used for determining an optimal FRC operational mode with a downstream device for the images. The images are encoded into a video stream. The motion characteristics metadata portion is encoded into the video stream as a part of image metadata. The video stream is transmitted to the downstream device. The downstream device receives the video stream and operates the optimal FRC operational mode to generate, based on the images, additional images. The images and the additional images are rendered on a display device at an image refresh rate different from an input image refresh rate represented by images encoded in the video stream.
    Type: Application
    Filed: September 13, 2017
    Publication date: March 22, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Anustup Kumar Atanu Choudhury, Tao Chen, Robin Atkins, Samir N. Hulyalkar
  • Publication number: 20180082698
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.
    Type: Application
    Filed: December 1, 2017
    Publication date: March 22, 2018
    Applicant: Dolby International AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES
  • Publication number: 20180082658
    Abstract: Methods for designing metamerically stable RGBW displays are presented. Display parameters are selected so that given a reference spectral power distribution (SPD) for the white color primary (e.g., one based on D65), and a test spectral power distribution for the white color primary, deviations in color appearance measurements between the two SPDs among N different observers are minimized. Given a display with a metamerically stable white (W), given linear input R, G, and B values, output R, G, B, and W values are generated to optimize metameric stability instead of reducing power consumption or increasing total brightness.
    Type: Application
    Filed: September 14, 2017
    Publication date: March 22, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Robin ATKINS
  • Publication number: 20180082695
    Abstract: Disclosed is a system and computer program product of encoding audio content and corresponding method. The method includes determining a characteristic of the audio content, the characteristic of the audio content including at least one of a type or a property of the audio content. The method also includes classifying the audio content based on the characteristic of the audio content and determining probabilities for multiple predefined audio coding symbols associated with the audio content by calculating a probability for each of the audio coding symbols based on the result of the classification, the probability for an audio coding symbol indicating a frequency at which the audio coding symbol occurs in the audio content. Further, the method encodes the audio content based on the audio coding symbols and the corresponding probabilities to obtain a code value, the code value representing a compression coding format of the audio content.
    Type: Application
    Filed: April 13, 2016
    Publication date: March 22, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Xuejing SUN, Dong SHI, Janusz KLEJSA
  • Publication number: 20180075862
    Abstract: In an audio processing system (300), a filtering section (350, 400): receives subband signals (410, 420, 430) corresponding to audio content of a reference signal (301) in respective frequency subbands; receives subband signals (411, 421, 431) corresponding to audio content of a response signal (304) in the respective subbands; and forms filtered inband references (412, 422, 432) by applying respective filters (413, 423, 433) to the subband signals of the reference signal.
    Type: Application
    Filed: March 21, 2016
    Publication date: March 15, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Dong SHI, Glenn N. DICKINS, David GUNAWAN, Xuejing SUN
  • Publication number: 20180077510
    Abstract: Sound scenes in 3D can be synthesized or captured as a natural sound field. For decoding, a decode matrix is required that is specific for a given loudspeaker setup and is generated using the known loudspeaker positions. However, some source directions are attenuated for 2D loudspeaker setups such as 5.1 surround. An improved method for decoding an encoded audio signal in soundfield format for L loudspeakers at known positions comprises steps of adding (10) a position of at least one virtual loudspeaker to the positions of the L loudspeakers, generating (11) a 3D decode matrix ($D'$), wherein the positions ($\hat{\Omega}_1 \ldots \hat{\Omega}_L$) of the L loudspeakers and the at least one virtual position ($\hat{\Omega}'_{L+1}$) are used, downmixing (12) the 3D decode matrix ($D'$), and decoding (14) the encoded audio signal (i14) using the downscaled 3D decode matrix ($\tilde{D}$). As a result, a plurality of decoded loudspeaker signals (q14) is obtained.
    Type: Application
    Filed: September 28, 2017
    Publication date: March 15, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Florian Keiler, Johannes Boehm
  • Publication number: 20180075865
    Abstract: The present document relates to audio source coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR), as well as to digital effect processors, e.g. exciters, where generation of harmonic distortion adds brightness to the processed signal, and to time stretchers where a signal duration is prolonged with maintained spectral content. A system and method configured to generate a time stretched and/or frequency transposed signal from an input signal is described. The system comprises an analysis filterbank configured to provide an analysis subband signal from the input signal; wherein the analysis subband signal comprises a plurality of complex valued analysis samples, each having a phase and a magnitude. Furthermore, the system comprises a subband processing unit configured to determine a synthesis subband signal from the analysis subband signal using a subband transposition factor Q and a subband stretch factor S.
    Type: Application
    Filed: November 27, 2017
    Publication date: March 15, 2018
    Applicant: Dolby International AB
    Inventor: Lars Villemoes
  • Publication number: 20180077491
    Abstract: Embodiments are described for a hybrid amplification architecture that separates individual audio amplifier stages from the power supply and a simple two- or three-conductor bus that transmits both power and audio signal to a plurality of daisy-chained speakers to play back adaptive audio content in an expanded surround-sound environment including surround and overhead speakers or for use within professional live sound applications and/or distributed audio systems. A control unit generates digital audio and power and transmits both simultaneously over the bus to individual speaker units associated with each speaker. The speaker units recover the power and decode the channel assignment to route the audio to the appropriate speakers.
    Type: Application
    Filed: March 30, 2016
    Publication date: March 15, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Joel A. BUTLER, Garth Norman SHOWALTER
  • Publication number: 20180077515
    Abstract: Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.
    Type: Application
    Filed: November 3, 2017
    Publication date: March 15, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Nicolas R. TSINGOS, Charles Q. ROBINSON, Jurgen W. SCHARPF
  • Publication number: 20180077511
    Abstract: Embodiments are described for a system of rendering object-based audio content through a system that includes individually addressable drivers, including at least one driver that is configured to project sound waves toward one or more surfaces within a listening environment for reflection to a listening area within the listening environment; a renderer configured to receive and process audio streams and one or more metadata sets associated with each of the audio streams and specifying a playback location of a respective audio stream; and a playback system coupled to the renderer and configured to render the audio streams to a plurality of audio feeds corresponding to the array of audio drivers in accordance with the one or more metadata sets.
    Type: Application
    Filed: November 17, 2017
    Publication date: March 15, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sripal S. MEHTA, Brett G. CROCKETT, Spencer HOOKS, Alan SEEFELDT, Christophe CHABANNE, C. Phillip BROWN, Joshua B. LANDO, Brad BASLER, Stewart MURRIE
  • Publication number: 20180068670
    Abstract: Apparatus and methods for audio classifying and processing are disclosed. In one embodiment, an audio processing apparatus includes an audio classifier for classifying an audio signal into at least one audio type in real time; an audio improving device for improving experience of audience; and an adjusting unit for adjusting at least one parameter of the audio improving device in a continuous manner based on the confidence value of the at least one audio type.
    Type: Application
    Filed: November 9, 2017
    Publication date: March 8, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Lie LU, Alan J. SEEFELDT, Jun WANG
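The continuous, confidence-driven adjustment mentioned in publication 20180068670 could look something like the sketch below. It is an assumption-laden illustration: the function name `adjust_parameter`, the linear blend between two target settings, and the one-pole smoothing are hypothetical choices, not details from the abstract.

```python
def adjust_parameter(previous, target_on, target_off, confidence, smoothing=0.9):
    """Steer a parameter toward a confidence-weighted target without abrupt jumps."""
    # Blend between the setting for "audio type present" and "audio type absent".
    target = confidence * target_on + (1.0 - confidence) * target_off
    # One-pole smoothing keeps the parameter moving gradually between calls.
    return smoothing * previous + (1.0 - smoothing) * target
```

Calling the function once per frame with the classifier's latest confidence value moves the parameter gradually rather than switching it on a hard decision.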
  • Publication number: 20180068666
    Abstract: Techniques for adaptive processing of media data based on separate data specifying a state of the media data are provided. A device in a media processing chain may determine whether a type of media processing has already been performed on an input version of media data. If so, the device may adapt its processing of the media data to disable performing the type of media processing. If not, the device performs the type of media processing. The device may create a state of the media data specifying the type of media processing. The device may communicate the state of the media data and an output version of the media data to a recipient device in the media processing chain, for the purpose of supporting the recipient device's adaptive processing of the media data.
    Type: Application
    Filed: November 9, 2017
    Publication date: March 8, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jeffrey RIEDMILLER, Regunathan RADHAKRISHNAN, Marvin PRIBADI, Farhad FARAHANI, Michael SMITHERS
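The adaptive behavior in publication 20180068666 amounts to checking a processing-state record before acting. A minimal sketch follows, assuming the state is a dictionary with a `"performed"` list; that layout, the function name `process`, and the `apply_fn` callback are hypothetical and not taken from the application.

```python
def process(media, state, kind, apply_fn):
    """Apply `kind` processing unless the state says it was already performed."""
    done = set(state.get("performed", ()))
    if kind not in done:
        media = apply_fn(media)
        done.add(kind)
    # Pass the (possibly updated) state along with the output media.
    return media, {"performed": sorted(done)}
```

A downstream device that receives the returned state can run the same check and skip a step that an upstream device has already applied.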
  • Publication number: 20180069517
    Abstract: In some embodiments, a method for processing an audio signal in an audio processing apparatus is disclosed. The method includes receiving an audio signal and a parameter, the parameter indicating a location of an auditory event boundary. An audio portion between consecutive auditory event boundaries constitutes an auditory event. The method further includes applying a modification to the audio signal based in part on an occurrence of the auditory event. The parameter may be generated by monitoring a characteristic of the audio signal and identifying a change in the characteristic.
    Type: Application
    Filed: November 10, 2017
    Publication date: March 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Brett G. Crockett, Alan J. Seefeldt
  • Publication number: 20180068637
    Abstract: An input media signal encoded with a portion of image data to be rendered with a target display device is received. It is determined, based on the portion of image data, whether a first power profile is to be applied to rendering the portion of image data with the target display device. In response to determining, based on the portion of image data, that the first power profile is not to be applied to rendering the portion of image data with the target display device, a second power profile is applied to rendering the portion of image data with the target display device.
    Type: Application
    Filed: March 22, 2016
    Publication date: March 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Ajit NINAN, Chun Chi WAN
  • Publication number: 20180070071
    Abstract: Stereoscopic images are subsampled and placed in a “checkerboard” pattern in an image. The image is encoded in a monoscopic video format. The monoscopic video is transmitted to a device where the “checkerboard” is decoded. Portions of the checkerboard (e.g., “black” portions) are used to reconstruct one of the stereoscopic images and the other portions of the checkerboard (e.g., “white” portions) are used to reconstruct the other image. The subsamples are, for example, taken from the image in a location coincident to the checkerboard position in which the subsamples are encoded.
    Type: Application
    Filed: November 9, 2017
    Publication date: March 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Walter J. Husak, David Ruhoff, Alexandros Tourapis, Athanasios Leontaris
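The checkerboard arrangement in publication 20180070071 can be illustrated with two small helpers. This is a sketch under assumptions: images are plain nested lists of equal size, the "black" cells are those where x + y is even, and the names `checkerboard_pack` and `checkerboard_unpack` are hypothetical.

```python
def checkerboard_pack(left, right):
    """Interleave two equally sized images: left on 'black' cells, right on 'white' cells."""
    return [
        [left[y][x] if (x + y) % 2 == 0 else right[y][x] for x in range(len(row))]
        for y, row in enumerate(left)
    ]

def checkerboard_unpack(packed):
    """Recover the subsampled views; missing pixels are None (to be interpolated later)."""
    left = [[v if (x + y) % 2 == 0 else None for x, v in enumerate(row)]
            for y, row in enumerate(packed)]
    right = [[v if (x + y) % 2 == 1 else None for x, v in enumerate(row)]
             for y, row in enumerate(packed)]
    return left, right
```

Each subsample stays at the position it occupied in its source image, which matches the abstract's note that subsamples are taken from locations coincident with their checkerboard positions.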
  • Publication number: 20180070064
    Abstract: Dual and multi-modulator projector display systems and techniques are disclosed. In one embodiment, a projector display system comprises a light source; a controller; a first modulator receiving light from the light source and rendering a halftone image of the input image; a blurring optical system that blurs said halftone image with a Point Spread Function (PSF); and a second modulator receiving the blurred halftone image and rendering a pulse width modulated image which may be projected to form the desired screen image. Systems and techniques for forming a binary halftone image from an input image, correcting for misalignment between the first and second modulators and calibrating the projector system (e.g., over time) for continuous image improvement are also disclosed.
    Type: Application
    Filed: November 9, 2017
    Publication date: March 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Jerome SHIELDS
  • Publication number: 20180063482
    Abstract: Systems and methods are described for automatically framing participants in a video conference using a single camera of a video conferencing system. A camera of a video conferencing system may capture video images of a conference room. A processor of the video conferencing system may identify a potential region of interest within a video image of the captured video images, the potential region of interest including an identified participant. Feature detection may be executed on the potential region of interest, and a region of interest may be computed based on the executed feature detection. The processor may then automatically frame the identified participant within the computed region of interest, the automatic framing including at least one of cropping the video image to match the computed region of interest and rescaling the video image to a desired resolution.
    Type: Application
    Filed: August 21, 2017
    Publication date: March 1, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Erwin Goesnar
  • Publication number: 20180061427
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.
    Type: Application
    Filed: October 31, 2017
    Publication date: March 1, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES
  • Publication number: 20180054676
    Abstract: A multi-channel input signal having at least three original channels is represented by a parameter representation of the multi-channel signal. A first balance parameter, a first coherence parameter, or a first inter-channel time difference between a first channel pair and a second balance parameter, or a second coherence parameter, or a second inter-channel time difference parameter between a second channel pair are calculated. This set of parameters is the parameter representation of the original signals. The first channel pair has two channels, which are different from two channels of a second channel pair. Furthermore, each channel of the two channel pairs is one of the original channels, or a weighted combination of the original channels, and the first channel pair and the second channel pair include information on the three original channels.
    Type: Application
    Filed: April 19, 2013
    Publication date: February 22, 2018
    Applicant: Dolby International AB
    Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Jonas ENGDEGARD, Jonas ROEDEN, Kristofer KJOERLING
  • Publication number: 20180053515
    Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
    Type: Application
    Filed: October 24, 2017
    Publication date: February 22, 2018
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
  • Publication number: 20180054689
    Abstract: Embodiments of the present invention relate to video content assisted audio object extraction. A method of audio object extraction from channel-based audio content is disclosed. The method comprises extracting at least one video object from video content associated with the channel-based audio content, and determining information about the at least one video object. The method further comprises extracting from the channel-based audio content an audio object to be rendered as an upmixed audio signal based on the determined information. Corresponding system and computer program product are also disclosed.
    Type: Application
    Filed: February 24, 2016
    Publication date: February 22, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu CHEN, Xuejing SUN, Lie LU
  • Publication number: 20180054688
    Abstract: Some disclosed implementations include an interface system and a control system. The control system may be capable of receiving, via the interface system, microphone data. The control system may be capable of determining, based at least in part on the microphone data, instances of one or more acoustic events. The instances of one or more acoustic events may, in some examples, include conversational dynamics data. The control system may be capable of providing behavior modification feedback, via the interface system, corresponding with the instances of the one or more acoustic events.
    Type: Application
    Filed: August 15, 2017
    Publication date: February 22, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Richard J. CARTWRIGHT, Peter MARTIN, Christopher Stanley MCGRATH, Glenn N. DICKINS
  • Publication number: 20180053517
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.
    Type: Application
    Filed: October 31, 2017
    Publication date: February 22, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES
  • Publication number: 20180053516
    Abstract: The present invention proposes a new method for improving the performance of a real-valued filterbank based spectral envelope adjuster. By adaptively locking the gain values for adjacent channels dependent on the sign of the channels, as defined in the application, reduced aliasing is achieved. Furthermore, the grouping of the channels during gain calculation gives an improved energy estimate of the real-valued subband signals in the filterbank.
    Type: Application
    Filed: October 31, 2017
    Publication date: February 22, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Kristofer KJOERLING, Lars VILLEMOES
  • Publication number: 20180047405
    Abstract: In some embodiments, a pitch filter for filtering a preliminary audio signal generated from an audio bitstream is disclosed. The pitch filter has an operating mode selected from one of either: (i) an active mode where the preliminary audio signal is filtered using filtering information to obtain a filtered audio signal, and (ii) an inactive mode where the pitch filter is disabled. The preliminary audio signal is generated in an audio encoder or audio decoder having a coding mode selected from at least two distinct coding modes, and the pitch filter is capable of being selectively operated in either the active mode or the inactive mode while operating in the coding mode based on control information.
    Type: Application
    Filed: October 24, 2017
    Publication date: February 15, 2018
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Barbara RESCH, Kristofer KJÖRLING, Lars VILLEMOES
  • Publication number: 20180048974
    Abstract: There are two representations for Higher Order Ambisonics denoted HOA: spatial domain and coefficient domain. The invention generates from a coefficient domain representation a mixed spatial/coefficient domain representation, wherein the number of said HOA signals can be variable. A vector of coefficient domain signals is separated into a vector of coefficient domain signals having a constant number of HOA coefficients and a vector of coefficient domain signals having a variable number of HOA coefficients. The constant-number HOA coefficients vector is transformed to a corresponding spatial domain signal vector. In order to facilitate high-quality coding, without creating signal discontinuities the variable-number HOA coefficients vector of coefficient domain signals is adaptively normalised and multiplexed with the vector of spatial domain signals.
    Type: Application
    Filed: October 23, 2017
    Publication date: February 15, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven Kordon, Alexander Krueger
  • Publication number: 20180048768
    Abstract: In an audio conferencing environment, including multiple users participating by means of a series of associated audio input devices for the provision of audio input, and a series of audio output devices for the output of audio output streams to the multiple users, with the audio input and output devices being interconnected to a mixing control server for the control and mixing of the audio inputs from each of the audio input devices to present a series of audio streams to the audio output devices, a method of reducing the effects of cross talk pickup of at least a first audio conversation by multiple audio input devices, the method including the steps of: (a) monitoring the series of audio input devices for the presence of a duplicate audio conversation input from at least two input audio sources in an audio output stream; and (b) where a duplicate audio conversation input is detected, suppressing the presence of the duplicate audio conversation input in the audio output stream.
    Type: Application
    Filed: February 8, 2016
    Publication date: February 15, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Gary SPITTLE, Glenn DICKINS, David GUNAWAN, Guilin MA
  • Publication number: 20180047411
    Abstract: The present invention relates to coding of audio signals, and in particular to high frequency reconstruction methods including a frequency domain harmonic transposer. A system and method for generating a high frequency component of a signal from a low frequency component of the signal is described.
    Type: Application
    Filed: October 25, 2017
    Publication date: February 15, 2018
    Applicant: Dolby International AB
    Inventors: Lars Villemoes, Per Ekstrand
  • Publication number: 20180048904
    Abstract: Implementations are provided that relate, for example, to view tiling in video encoding and decoding. A particular method includes accessing a video picture that includes multiple pictures combined into a single picture (826), accessing information indicating how the multiple pictures in the accessed video picture are combined (806, 808, 822), decoding the video picture to provide a decoded representation of at least one of the multiple pictures (824, 826), and providing the accessed information and the decoded video picture as output (824, 826). Some other implementations format or process the information that indicates how multiple pictures included in a single video picture are combined into the single video picture, and format or process an encoded representation of the combined multiple pictures.
    Type: Application
    Filed: October 23, 2017
    Publication date: February 15, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Purvin Bibhas PANDIT, Peng YIN, Dong TIAN
  • Publication number: 20180048683
    Abstract: Apparatus comprising an interface for receiving a respective uplink data stream from each of three or more further apparatuses, and for transmitting a respective downlink data stream to each of the further apparatuses; and a logic system in communication with the interface. The logic system is configured: to receive first data in the uplink data stream received from a first one of the further apparatuses; and in a first mode, to include at least some of the first data in the respective downlink data streams transmitted to every other one of the further apparatuses, or, in a second mode, to include at least some of the first data in the downlink data stream transmitted to a second one of the further apparatuses and to omit or attenuate substantially all of the first data in the downlink data stream transmitted to at least a third one of the further apparatuses. Corresponding methods and computer readable media are disclosed.
    Type: Application
    Filed: August 8, 2017
    Publication date: February 15, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Rowan James KATEKAR, Glenn N. DICKINS
  • Publication number: 20180041759
    Abstract: A content-adaptive quantizer processor receives an input image with an input bit depth. A noise-mask generation process is applied to the input image to generate a noise mask image which characterizes each pixel in the input image in terms of its perceptual relevance in masking quantization noise. A noise mask histogram is generated based on the input image and the noise mask image. A masking-noise level to bit-depth function is applied to the noise mask histogram to generate minimal bit depth values for each bin in the noise mask histogram. A codeword mapping function is generated based on the input bit depth, a target bit depth, and the minimal bit depth values. The codeword mapping function is applied to the input image to generate an output image in the target bit depth.
    Type: Application
    Filed: March 1, 2016
    Publication date: February 8, 2018
    Applicants: DOLBY INTERNATIONAL AB, Dolby Laboratories Licensing Corporation
    Inventors: Jan FROEHLICH, Guan-Ming SU, Robin ATKINS, Scott DALY, Jon Scott MILLER
  • Publication number: 20180039169
    Abstract: A locally dimmed display has a spatial light modulator illuminated by a light source. The spatial light modulator is illuminated with a low resolution version of a desired image. The illumination may comprise a series of lighting elements that vary smoothly from one element to another at the spatial light modulator.
    Type: Application
    Filed: October 17, 2017
    Publication date: February 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Lorne A. Whitehead, Gregory John Ward, Wolfgang Stuerzlinger, Helge Seetzen
  • Publication number: 20180040336
    Abstract: A system and method of blind bandwidth extension. The system selects a prediction model from a number of stored prediction models that were generated using an unsupervised clustering method (e.g., a k-means method) and a supervised regression process (e.g., a support vector machine), and extends the bandwidth of an input musical audio signal.
    Type: Application
    Filed: August 2, 2017
    Publication date: February 8, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Chih-Wei WU, Mark S. VINTON
  • Publication number: 20180041639
    Abstract: Systems and methods are described for modifying one of far-end signal playback and capture of local audio on an audio device. Frames of both a far-end audio stream and a near-end audio stream may be analyzed using a measure of voice activity, the analyzing producing voice data associated with each frame. Based on the voice data, a conference state may be determined, and one of playback of the far-end audio stream and capture of local audio on an audio device may be modified based on the determined conference state. By associating the likely intent with a predefined state, the device may further cull or remove unwanted or unlikely content from the device input and output. This may have a substantial advantage in allowing for full duplex operation in the case of more meaningful and continuing voice activity, particularly in the case where there are many connected endpoints.
    Type: Application
    Filed: August 2, 2017
    Publication date: February 8, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: David GUNAWAN, Glenn N. DICKINS
  • Publication number: 20180031921
    Abstract: Techniques for driving a dual modulation display include generating backlight drive signals to drive individually-controllable illumination sources. The illumination sources emit first light onto a light conversion layer. The light conversion layer converts the first light, such as blue or ultraviolet light, into second light, such as white light. The light conversion layer can include quantum dot materials. Liquid crystal display (LCD) modulation drive signals are generated to determine transmission of the second light through individual color subpixels of the display. These LCD modulation drive signals can be adjusted based on one or more light field simulations to account for non-uniform, spatial color shifts. Alternatively, one or more light field simulations based on a uniformity assumption determine intermediate LCD modulation drive signals. A compensation field simulation, using backlight drive signals, is then used to adjust the intermediate LCD modulation drive signal for color correction.
    Type: Application
    Filed: October 5, 2017
    Publication date: February 1, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Chun Chi WAN, Ajit NINAN
  • Publication number: 20180033446
    Abstract: The present invention relates to audio coding systems which make use of a harmonic transposition method for high frequency reconstruction (HFR). A system and a method for generating a high frequency component of a signal from a low frequency component of the signal is described. The system comprises an analysis filter bank providing a plurality of analysis subband signals of the low frequency component of the signal. It also comprises a non-linear processing unit to generate a synthesis subband signal with a synthesis frequency by modifying the phase of a first and a second of the plurality of analysis subband signals and by combining the phase-modified analysis subband signals. Finally, it comprises a synthesis filter bank for generating the high frequency component of the signal from the synthesis subband signal.
    Type: Application
    Filed: September 20, 2017
    Publication date: February 1, 2018
    Applicant: Dolby International AB
    Inventors: Lars Villemoes, Per Hedelin
  • Publication number: 20180033453
    Abstract: According to one aspect, a method for detecting voice activity is disclosed, the method including receiving a frame of an input audio signal, the input audio signal having a sample rate; dividing the frame into a plurality of subbands based on the sample rate, the plurality of subbands including at least a lowest subband and a highest subband; filtering the lowest subband with a moving average filter to reduce an energy of the lowest subband; estimating a noise level for each of the plurality of subbands; calculating a signal to noise ratio value for each of the plurality of subbands; and determining a speech activity level of the frame based on an average of the calculated signal to noise ratio values and a weighted average of an energy of each of the plurality of subbands. Other aspects include audio decoders that decode audio that was encoded using the methods described herein.
    Type: Application
    Filed: October 12, 2017
    Publication date: February 1, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Hannes Muesch
  • Publication number: 20180035233
    Abstract: The present disclosure relates to reverberation generation for headphone virtualization. A method of generating one or more components of a binaural room impulse response (BRIR) for headphone virtualization is described. In the method, directionally-controlled reflections are generated, wherein directionally-controlled reflections impart a desired perceptual cue to an audio input signal corresponding to a sound source location. Then at least the generated reflections are combined to obtain the one or more components of the BRIR. Corresponding system and computer program products are described as well.
    Type: Application
    Filed: February 11, 2016
    Publication date: February 1, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Louis D. FIELDER, Zhiwei SHUANG, Grant A. DAVIDSON, Xiguang ZHENG, Mark S. VINTON
  • Publication number: 20180027245
    Abstract: Techniques for selecting a coding mode for an image coding process are described. Coding modes can be selected through a coding mode transition state machine, a re-quantization process, selection of an optimal transform size, by skipping some quantization parameters, or by performing motion search.
    Type: Application
    Filed: August 7, 2017
    Publication date: January 25, 2018
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Alexandros TOURAPIS, Yan YE
  • Publication number: 20180027123
    Abstract: Various disclosed implementations involve processing and/or playback of a recording of a conference involving a plurality of conference participants. Some implementations disclosed herein involve receiving audio data corresponding to a recording of at least one conference involving a plurality of conference participants. The audio data may include conference participant speech data from multiple endpoints, recorded separately and/or conference participant speech data from a single endpoint corresponding to multiple conference participants and including spatial information for each conference participant of the multiple conference participants. A search of the audio data may be based on one or more search parameters. The search may be a concurrent search for multiple features of the audio data. Instances of conference participant speech may be rendered to at least two different virtual conference participant positions of a virtual acoustic space.
    Type: Application
    Filed: February 3, 2016
    Publication date: January 25, 2018
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, Shen HUANG