Patents Assigned to Dolby Laboratories Licensing Corporation
-
Patent number: 11929091Abstract: An apparatus and method of blind detection of binauralized audio. If the input content is detected as binaural, a second binauralization may be avoided. In this manner, the user experience avoids audio artifacts introduced by multiple binauralizations.Type: GrantFiled: March 1, 2022Date of Patent: March 12, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Chunmao Zhang, Lianwu Chen, Ziyu Yang, Joshua Brandon Lando, David Matthew Fischer, Lie Lu
-
Patent number: 11930347Abstract: A method of processing audio content including a plurality of audio elements comprises: clustering the plurality of audio elements into a plurality of clusters of audio elements; and for a cluster among the plurality of clusters: for each audio element in the cluster, determining a measure of energy that the audio element contributes to the cluster; for at least one audio element in the cluster, determining a compensation gain based at least in part on the measures of energy for the audio elements in the cluster; and applying the compensation gain to the at least one audio element in the cluster.Type: GrantFiled: February 12, 2020Date of Patent: March 12, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Lianwu Chen, Lie Lu
-
Publication number: 20240080479Abstract: Sampled data is packaged in checkerboard format for encoding and decoding. The sampled data may be quincunx sampled multi-image video data (e.g., 3D video or a multi-program stream), and the data may also be divided into sub-images of each image which are then multiplexed, or interleaved, in frames of a video stream to be encoded and then decoded using a standardized video encoder. A system for viewing may utilize a standard video decoder and a formatting device that de-interleaves the decoded sub-images of each frame reformats the images for a display device. A 3D video may be encoded using a most advantageous interleaving format such that a preferred quality and compression ratio is reached. In one embodiment, the invention includes a display device that accepts data in multiple formats.Type: ApplicationFiled: November 7, 2023Publication date: March 7, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Alexandros Tourapis, Walter J. Husak, Peshala V. Pahalawatta, Athanasios Leontaris
-
Publication number: 20240079019Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.Type: ApplicationFiled: November 13, 2023Publication date: March 7, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Roy M. FEJGIN, Grant A. DAVIDSON, Chih-Wei WU, Vivek KUMAR
-
Publication number: 20240080465Abstract: Methods and systems for frame rate scalability are described. Support is provided for input and output video sequences with variable frame rate and variable shutter angle across scenes, or for input video sequences with fixed input frame rate and input shutter angle, but allowing a decoder to generate a video output at a different output frame rate and shutter angle than the corresponding input values. Techniques allowing a decoder to decode more computationally-efficiently a specific backward compatible target frame rate and shutter angle among those allowed are also presented.Type: ApplicationFiled: November 13, 2023Publication date: March 7, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Robin Atkins, Peng Yin, Taoran Lu, Fangjun Pu, Sean Thomas McCarthy, Walter J. Husak, Tao Chen, Guan-Ming Su
-
Publication number: 20240080489Abstract: A quantization parameter signalling mechanism for both SDR and HDR content in video coding is described using two approaches. The first approach is to send the user-defined QpC table directly in high level syntax. This leads to more flexible and efficient QP control for future codec development and video content coding. The second approach is to signal luma and chroma QPs independently. This approach eliminates the need for QpC tables and removes the dependency of chroma quantization parameter on luma QP.Type: ApplicationFiled: November 10, 2023Publication date: March 7, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Fangjun PU, Taoran LU, Peng YIN, Sean Thomas MCCARTHY
-
Publication number: 20240079015Abstract: Methods for generating an object based audio program, renderable in a personalizable manner, and including a bed of speaker channels renderable in the absence of selection of other program content (e.g., to provide a default full range audio experience). Other embodiments include steps of delivering, decoding, and/or rendering such a program. Rendering of content of the bed, or of a selected mix of other content of the program, may provide an immersive experience. The program may include multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects), the bed of speaker channels, and other speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.Type: ApplicationFiled: September 19, 2023Publication date: March 7, 2024Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL ABInventors: Sripal S. MEHTA, Thomas ZIEGLER, Giles BAKER, Jeffrey RIEDMILLER, Prinyar SAUNGSOMBOON
-
Publication number: 20240080608Abstract: A method of audio processing includes capturing a binaural audio signal, calculating noise reduction gains using a machine learning model, and generating a modified binaural audio signal. The method may further including performing various corrections to the audio to account for video captured by different cameras such as a front camera and a rear camera. The method may further include performing smooth switching of the binaural audio when switching between the front camera and the rear camera. In this manner, noise may be reduced in the binaural audio, and the user perception of the combined video and binaural audio may be improved.Type: ApplicationFiled: December 14, 2021Publication date: March 7, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Yuanxing MA, Zhiwei SHUANG, Yang LIU
-
Patent number: 11924619Abstract: An apparatus and method of rendering audio. A binaural signal is split on an amplitude weighting basis into a front binaural signal and a rear binaural signal, based on perceived position information of the audio. In this manner, the front-back differentiation of the binaural signal is improved.Type: GrantFiled: September 12, 2022Date of Patent: March 5, 2024Assignee: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Mark F. Davis, Nicolas R. Tsingos, C. Phillip Brown
-
Patent number: 11924477Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.Type: GrantFiled: June 27, 2023Date of Patent: March 5, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
-
Patent number: 11922639Abstract: A method for generating an high-dynamic-range (HDR) color image from a dual-exposure-time single-shot HDR color image sensor includes obtaining pixel values generated by a local region of sensor pixels of the image sensor, determining a motion parameter for the local region from pixel values associated with a first color, and demosaicing the pixel values of the local region to determine, for each of three colors, an output value of the images pixel, wherein relative contributions of short-exposure-time pixels and long-exposure-time pixels to the output value are weighted according to the motion parameter.Type: GrantFiled: June 4, 2019Date of Patent: March 5, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Yuelong Li, Jon Scott McElvain, Mohammad Tofighi
-
Publication number: 20240071411Abstract: Disclosed is a method for determining one or more dialog quality metrics of a mixed audio signal comprising a dialog component and a noise component, the method comprising separating an estimated dialog component from the mixed audio signal by means of a dialog separator using a dialog separating model determined by training the dialog separator based on the one or more quality metrics; providing the estimated dialog component from the dialog separator to a quality metrics estimator; and determining the one or more quality metrics by means of the quality metrics estimator based on the mixed signal and the estimated dialog component. Further disclosed is a method for training a dialog separator, a system comprising circuitry configured to perform the method, and a non-transitory computer-readable storage medium.Type: ApplicationFiled: January 4, 2022Publication date: February 29, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Jundai SUN, Lie LU, Shaofan YANG, Rhonda J. WILSON, Dirk Jeroen BREEBAART
-
Publication number: 20240073357Abstract: Dual or multi-modulation display systems comprising a first modulator and a second modulator are disclosed. The first modulator may comprise a plurality of analog mirrors (e.g. MEMS array) and the second modulator may comprise a plurality of mirrors (e.g., DMD array). The display system may further comprise a controller that sends control signals to the first and second modulator. The display system may render highlight features within a projected image by affecting a time multiplexing scheme. In one embodiment, the first modulator may be switched on a sub-frame basis such that a desired proportion of the available light may be focused or directed onto the second modulator to form the highlight feature on a sub-frame rendering basis.Type: ApplicationFiled: January 31, 2022Publication date: February 29, 2024Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventor: Martin J. Richards
-
Publication number: 20240073459Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.Type: ApplicationFiled: October 31, 2023Publication date: February 29, 2024Applicant: DOLBY LABORATORIES LICENSING CORPORATIONInventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
-
Publication number: 20240073444Abstract: A video encoding method according to an embodiment of the present invention includes generating header information that includes information about resolutions of motion vectors of respective blocks, determined based on motion prediction for a unit image. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions. Further, a video decoding method according to another embodiment of the present invention includes extracting information about resolutions of motion vectors of each unit image from header information included in a target bitstream to be decoded; and a decoding unit for decoding the unit image based on the resolution information. Here, the header information includes flag information indicating whether resolutions of all motion vectors included in the unit image are integer-pixel resolutions.Type: ApplicationFiled: November 8, 2023Publication date: February 29, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Jong Ki HAN, Jae Yung LEE
-
Patent number: 11917386Abstract: Methods and systems for performing at least one audio activity (e.g., conducting a phone call or playing music or other audio content) in an environment including by determining an estimated location of a user in the environment in response to sound uttered by the user (e.g., a voice command), and controlling the audio activity in response to determining the estimated user location. The environment may have zones which are indicated by a zone map and estimation of the user location may include estimating in which of the zones the user is located. The audio activity may be performed using microphones and loudspeakers which are implemented in or coupled to smart audio devices.Type: GrantFiled: March 7, 2023Date of Patent: February 27, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Carlos Eduardo Medaglia Dyonisio, David Gunawan
-
Patent number: 11917171Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.Type: GrantFiled: July 7, 2023Date of Patent: February 27, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
-
Patent number: 11910025Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.Type: GrantFiled: October 31, 2023Date of Patent: February 20, 2024Assignee: Dolby Laboratories Licensing CorporationInventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
-
Publication number: 20240056755Abstract: Improved methods and/or apparatus for decoding an encoded audio signal in soundfield format for L loudspeakers. The method and/or apparatus can render an Ambisonics format audio signal to 2D loudspeaker setup(s) based on a rendering matrix. The rendering matrix has elements based on loudspeaker positions and wherein the rendering matrix is determined based on weighting at least an element of a first matrix with a weighting factor ? = 1 L . The first matrix is determined based on positions of the L loudspeakers and at least a virtual position of at least a virtual loudspeaker that is added to the positions of the L loudspeakers.Type: ApplicationFiled: August 28, 2023Publication date: February 15, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Florian KEILER, Johannes Boehm
-
Publication number: 20240056610Abstract: Methods are described to communicate source color volume information in a coded bitstream using SEI messaging. Such data include at least the minimum, maximum, and average luminance values in the source data plus optional data that may include the color volume x and y chromaticity coordinates for the input color primaries (e.g., red, green, and blue) of the source data, and the color x and y chromaticity coordinates for the color primaries corresponding to the minimum, average, and maximum luminance values in the source data. Messaging data signaling an active region in each picture may also be included.Type: ApplicationFiled: October 13, 2023Publication date: February 15, 2024Applicant: Dolby Laboratories Licensing CorporationInventors: Tao CHEN, Peng YIN, Taoran LU, Walter J. HUSAK