Dolby Labs Patent Applications

Dolby Labs patent applications that are pending before the United States Patent and Trademark Office (USPTO).

  • Publication number: 20230421811
    Abstract: An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
    Type: Application
    Filed: September 14, 2023
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexandros TOURAPIS, Athanasios LEONTARIS, Peshala V. PAHALAWATTA, Kevin J. STEC
  • Publication number: 20230419983
    Abstract: A method, an apparatus, and logic to post-process raw gains determined by input processing to generate post-processed gains, comprising using one or both of delta gain smoothing and decision-directed gain smoothing. The delta gain smoothing comprises applying a smoothing filter to the raw gain with a smoothing factor that depends on the gain delta: the absolute value of the difference between the raw gain for the current frame and the post-processed gain for a previous frame. The decision-directed gain smoothing comprises converting the raw gain to a signal-to-noise ratio, applying a smoothing filter with a smoothing factor to the signal-to-noise ratio to calculate a smoothed signal-to-noise ratio, and converting the smoothed signal-to-noise ratio to determine the second smoothed gain, with smoothing factor possibly dependent on the gain delta.
    Type: Application
    Filed: June 29, 2023
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Xuejing SUN, Glenn N. DICKINS
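A minimal sketch of the delta gain smoothing described in the abstract above: a first-order smoothing filter whose factor depends on the gain delta, that is, the absolute difference between the current raw gain and the previous post-processed gain. The abstract does not say how the factor is derived from the delta, so the linear mapping and the names `base_alpha` and `slope` are assumptions made purely for illustration.

```python
def delta_gain_smoothing(raw_gains, base_alpha=0.2, slope=2.0):
    """Smooth per-frame raw gains with a delta-dependent smoothing factor.

    base_alpha and slope are hypothetical parameters; the abstract does not
    specify how the smoothing factor depends on the gain delta.
    """
    smoothed = []
    prev = None  # post-processed gain of the previous frame
    for raw in raw_gains:
        if prev is None:
            out = raw  # no history yet: pass the first frame through
        else:
            # Gain delta as defined in the abstract.
            delta = abs(raw - prev)
            # Assumed mapping: a larger delta gives faster tracking, capped at 1.
            alpha = min(1.0, base_alpha + slope * delta)
            out = prev + alpha * (raw - prev)
        smoothed.append(out)
        prev = out
    return smoothed
```

Under this assumed mapping, small frame-to-frame fluctuations are smoothed heavily, while a large jump in the raw gain is followed almost immediately.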
  • Publication number: 20230421953
    Abstract: Methods and systems of improving bass response for a speaker in a portable computing device are described. One portable computing device includes first and second cover parts that are joined together to form a casing of the portable computing device, wherein a speaker volume is formed between portions of the first and second cover parts; a speaker arranged within the speaker volume; and one or more elastic spacers arranged between the first and second cover parts. The one or more elastic spacers are arranged to counteract, by their elastic recoil forces, a compression of the speaker volume when the first and second cover parts are under external compressing forces. The one or more elastic spacers are arranged between the first and second cover parts to be partially compressed by the first and second cover parts in the absence of external compressing forces on the first and second cover parts.
    Type: Application
    Filed: November 17, 2021
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Xiaojun Xu, Tiezhong Liu
  • Publication number: 20230419975
    Abstract: Methods and apparatus for decoding a compressed Higher Order Ambisonics (HOA) representation of a sound or soundfield. The method may include receiving a bitstream containing the compressed HOA representation and decoding, based on a determination that there are multiple layers, the compressed HOA representation from the bitstream to obtain a sequence of decoded HOA representations. A first subset of the sequence of decoded HOA representations is determined based only on corresponding ambient HOA components. A second subset of the sequence of decoded HOA representations is determined based on corresponding ambient HOA components and corresponding predominant sound components.
    Type: Application
    Filed: September 11, 2023
    Publication date: December 28, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Sven KORDON, Alexander KRUEGER, Oliver WUEBBOLT
  • Publication number: 20230421812
    Abstract: An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
    Type: Application
    Filed: September 14, 2023
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexandros TOURAPIS, Athanasios LEONTARIS, Peshala V. PAHALAWATTA, Kevin J. STEC
  • Publication number: 20230421952
    Abstract: Some implementations involve receiving, from a first subband domain acoustic echo canceller (AEC) of a first audio device in an audio environment, first adaptive filter management data from each of a plurality of first adaptive filter management modules, each first adaptive filter management module corresponding to a subband of the first subband domain AEC, each first adaptive filter management module being configured to control a first plurality of adaptive filters. The first plurality of adaptive filters may include at least a first adaptive filter type and a second adaptive filter type. Some implementations involve extracting, from the first adaptive filter management data, a first plurality of extracted features corresponding to a plurality of subbands of the first subband domain AEC and estimating a current local acoustic state based, at least in part, on the first plurality of extracted features.
    Type: Application
    Filed: December 2, 2021
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Benjamin John Southwell, David Gunawan, Christopher Graham Hines
  • Publication number: 20230421507
    Abstract: Embodiments are disclosed for timestamp smoothing to remove jitter. In some embodiments, a method of smoothing timestamps associated with audio packets comprises: receiving, using at least one processor, a series of input timestamps for audio packets and their respective packet lengths; estimating, using the at least one processor, an initial timestamp based on the series of input timestamps, the packet lengths and a sample time; calculating, using the at least one processor, a predicted timestamp based on the estimated initial timestamp; and smoothing, using the at least one processor, the predicted timestamp.
    Type: Application
    Filed: November 17, 2021
    Publication date: December 28, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Shanush PREMA THASARATHAN, Ning WANG, Senaka Chandranath SAMARASEKERA
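The steps in the abstract above can be sketched roughly as follows, assuming that each packet's ideal timestamp equals an initial timestamp plus the number of preceding samples times the sample time. The exponential averaging of the initial-timestamp estimate and the `alpha` parameter are assumptions for illustration; the abstract does not name a particular estimator or smoothing filter.

```python
def smooth_timestamps(timestamps, packet_lengths, sample_time, alpha=0.1):
    """Return jitter-reduced timestamps for a series of audio packets.

    timestamps: observed (jittery) arrival timestamps, in seconds.
    packet_lengths: number of samples in each packet.
    sample_time: duration of one sample, in seconds.
    alpha: hypothetical smoothing weight, not taken from the source.
    """
    smoothed = []
    samples_so_far = 0   # samples contained in all earlier packets
    t0_estimate = None   # running estimate of the initial timestamp
    for ts, length in zip(timestamps, packet_lengths):
        # Initial timestamp implied by this packet alone.
        implied_t0 = ts - samples_so_far * sample_time
        if t0_estimate is None:
            t0_estimate = implied_t0
        else:
            # Assumed smoothing step: exponential moving average.
            t0_estimate += alpha * (implied_t0 - t0_estimate)
        # Predicted timestamp from the estimated initial timestamp.
        smoothed.append(t0_estimate + samples_so_far * sample_time)
        samples_so_far += length
    return smoothed
```

For example, with 10-sample packets and a 1 ms sample time, an observed series of 0.0, 0.013, 0.02 seconds yields a second timestamp of about 0.0103 seconds, much closer to the ideal 0.01 than the observed 0.013.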
  • Publication number: 20230419973
    Abstract: Methods for generating an object based audio program which is renderable in a personalizable manner, e.g., to provide an immersive perception of audio content of the program. Other embodiments include steps of delivering (e.g., broadcasting), decoding, and/or rendering such a program. Rendering of audio objects indicated by the program may provide an immersive experience. The audio content of the program may be indicative of multiple object channels (e.g., object channels indicative of user-selectable and user-configurable objects, and typically also a default set of objects which will be rendered in the absence of a selection by a user) and a bed of speaker channels. Another aspect is an audio processing unit (e.g., encoder or decoder) configured to perform, or which includes a buffer memory which stores at least one frame (or other segment) of an object based audio program (or bitstream thereof) generated in accordance with, any embodiment of the method.
    Type: Application
    Filed: July 3, 2023
    Publication date: December 28, 2023
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Robert Andrew France, Thomas ZIEGLER, Sripal S. Mehta, Andrew Jonathan DOWELL, Prinyar SAUNGSOMBOON, Michael David DWYER, Farhad FARAHANI, Nicolas R. TSINGOS, Freddie SANCHEZ
  • Publication number: 20230421813
    Abstract: An encoding device evaluates a plurality of processing and/or post-processing algorithms and/or methods to be applied to a video stream, and signals a selected method, algorithm, class or category of methods/algorithms either in an encoded bitstream or as side information related to the encoded bitstream. A decoding device or post-processor utilizes the signaled algorithm or selects an algorithm/method based on the signaled method or algorithm. The selection is based, for example, on availability of the algorithm/method at the decoder/post-processor and/or cost of implementation. The video stream may comprise, for example, downsampled multiplexed stereoscopic images and the selected algorithm may include any of upconversion and/or error correction techniques that contribute to a restoration of the downsampled images.
    Type: Application
    Filed: September 14, 2023
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Alexandros TOURAPIS, Athanasios LEONTARIS, Peshala V. PAHALAWATTA, Kevin J. STEC
  • Publication number: 20230421174
    Abstract: The invention proposes a method and a device for arithmetic encoding of a current spectral coefficient using preceding spectral coefficients. Said preceding spectral coefficients are already encoded and both, said preceding and current spectral coefficients, are comprised in one or more quantized spectra resulting from quantizing time-frequency-transform of video, audio or speech signal sample values.
    Type: Application
    Filed: September 12, 2023
    Publication date: December 28, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Oliver WUEBBOLT
  • Publication number: 20230410829
    Abstract: In an embodiment, a method comprises: receiving bands of power spectra of an input audio signal and a microphone covariance, and for each band: estimating, using a classifier, respective probabilities of speech and noise; estimating, using a directionality model, a set of means for speech and noise, or a set of means and covariances for speech and noise, based on the microphone covariance for the band and the probabilities; estimating, using a level model, a mean and covariance of noise power based on the probabilities and the power spectra; determining a first noise suppression gain based on the directionality model; determining a second noise suppression gain based on the level model; selecting the first or second noise suppression gain or their sum based on a signal-to-noise ratio of the input audio signal; and scaling a time-frequency representation of the input signal by the selected noise suppression gain.
    Type: Application
    Filed: November 4, 2021
    Publication date: December 21, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Richard J. CARTWRIGHT, Ning WANG
  • Publication number: 20230403091
    Abstract: A distributed amplification and packetized audio transmission system for clock synchronization and alignment between an audio/power source and endpoints with dedicated amplifiers and speakers. An Ethernet audio signal is combined with a Power-Line Communications (PLC) signal for transmission from the source to the endpoints over a common conductor. A single master clock in the source synchronizes the Ethernet audio transmitter with the PLC transmitter. Each endpoint has a PLC receiver to recover the master clock for use by its Ethernet audio receiver to provide reliable clock synchronization between the source clock and the endpoint clocks. The endpoints can adjust and re-timestamp the PTP packetized clock based upon symbol and timing information from the PLC receiver.
    Type: Application
    Filed: October 7, 2021
    Publication date: December 14, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Joel BUTLER, Jeremy SOMMERFELD
  • Publication number: 20230401429
    Abstract: Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. A first CNN architecture may comprise a contracting path of a U-net, a multi-scale CNN, and an expansive path of a U-net. The contracting path may comprise a first encoding layer and may be configured to generate an output representation of the contracting path. The multi-scale CNN may be configured to generate, based on the output representation of the contracting path, an intermediate representation. The multi-scale CNN may comprise at least two parallel convolution paths. The expansive path may comprise a first decoding layer and may be configured to generate a final representation based on the intermediate representation generated by the multi-scale CNN.
    Type: Application
    Filed: October 19, 2021
    Publication date: December 14, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jundai Sun, Lie Lu, Zhiwei Shuang
  • Publication number: 20230395089
    Abstract: A neural network system is provided, implementing a generative model for autoregressively generating a distribution for a plurality of current filter-bank samples of an audio signal, wherein the current samples correspond to a current time slot, and each current sample corresponds to a channel of the filter-bank. The system includes a hierarchy of a plurality of neural network processing tiers ordered from a top to a bottom tier, each tier trained to generate conditioning information based on previous filter-bank samples and, for at least each tier but the top tier, also on the conditioning information from a tier higher up in the hierarchy, and an output stage trained to generate the probability distribution based on previous samples for one or more previous time slots and the conditioning information from the lowest processing tier.
    Type: Application
    Filed: October 15, 2021
    Publication date: December 7, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Per EKSTRAND, Janusz KLESJA, Pedro Jafeth VILLASANA TINAJERO, Lars VILLEMOES
  • Publication number: 20230395086
    Abstract: Described herein is a method of processing an audio signal using a neural network or using a first and a second neural network. Described is further a method of training said neural network or of jointly training a set of said first and said second neural network. Moreover, described is a method of obtaining and transmitting a latent feature space representation of a perceptual domain audio signal using a neural network and a method of obtaining an audio signal from a latent feature space representation of a perceptual domain audio signal using a neural network. Described are also respective apparatuses and computer program products.
    Type: Application
    Filed: October 14, 2021
    Publication date: December 7, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Mark S. VINTON, Cong ZHOU, Roy M. FEJGIN, Grant A. DAVIDSON
  • Publication number: 20230394287
    Abstract: A neural network system for predicting frequency coefficients of a media signal, the neural network system comprising a time predicting portion including at least one neural network trained to predict a first set of output variables representing a specific frequency band of a current time frame given coefficients of one or several previous time frames, and a frequency predicting portion including at least one neural network trained to predict a second set of output variables representing a specific frequency band given coefficients of one or several frequency bands adjacent to the specific frequency band in said current time frame. Such a neural network system forms a predictor capable of capturing both temporal and frequency dependencies occurring in time-frequency tiles of a media signal.
    Type: Application
    Filed: October 12, 2021
    Publication date: December 7, 2023
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Cong Zhou, Mark S. Vinton, Grant A. Davidson, Lars Villemoes
  • Publication number: 20230393452
    Abstract: A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a fold mirror and an integrating rod, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as off-state light to a light dump; determining a deviation between an actual angle of orientation and an expected angle of orientation of a respective micromirror of the plurality of micromirrors; calculating a first amount of rotational adjustment corresponding to the fold mirror and a second amount of lateral adjustment corresponding to the integrating rod, and actuating the fold mirror and integrating rod according to the corresponding first and second amount.
    Type: Application
    Filed: October 20, 2021
    Publication date: December 7, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: John David Jackson, Darren Hennigan, Nathan Shawn Wainwright
  • Publication number: 20230388702
    Abstract: Embodiments are described for a high-frequency waveguide that improves the performance of large-scale surround sound and immersive audio environments. A horn waveguide is configured to be asymmetric about one of a vertical axis and horizontal axis of the waveguide to form an asymmetric horn waveguide. A spherical enclosure surrounds the asymmetric horn waveguide to form a horn speaker, and a three-axis mounting system is configured to fix the horn speaker to one of a wall or ceiling surface of the venue, wherein the mounting system facilitates rotating the horn speaker to a location that provides maximum coverage of the venue within the passband of the asymmetric horn waveguide.
    Type: Application
    Filed: April 18, 2023
    Publication date: November 30, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Garth Norman SHOWALTER, Mario DI COLA, John Michael GOTT, Patrick Ross SPURLOCK, Gregory Lynn CARNEY, Bryce Joseph GOTT
  • Publication number: 20230388555
    Abstract: In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment. When a parent scene of a secondary scene is processed by two or more neighboring nodes, initial forward reshaping functions and trim-pass correction parameters are adjusted using reference tone-mapping functions and updated scene-based trim-pass correction parameters.
    Type: Application
    Filed: September 17, 2021
    Publication date: November 30, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Guan-Ming Su
  • Publication number: 20230384656
    Abstract: A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an illumination optical system configured to steer the light, the illumination optical system including a first lens group and a second lens group, a digital micromirror device (DMD) including a plurality of micromirrors respectively configured to reflect the steered light to a predetermined location as on-state light or to reflect the steered light as off-state light to a light dump; determining a deviation between an actual angle of orientation and an expected angle of orientation of a respective micromirror of the plurality of micromirrors; calculating a first amount of lateral adjustment corresponding to the first lens group and a second amount of lateral adjustment corresponding to the second lens group, and actuating the first and second lens groups according to the corresponding first and second amount.
    Type: Application
    Filed: October 21, 2021
    Publication date: November 30, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: John David JACKSON, Darren HENNIGAN, Nathan Shawn WAINWRIGHT
  • Publication number: 20230385013
    Abstract: Embodiments are described for a method of rendering audio for playback through headphones comprising receiving digital audio content, receiving binaural rendering metadata generated by an authoring tool processing the received digital audio content, receiving playback metadata generated by a playback device, and combining the binaural rendering metadata and playback metadata to optimize playback of the digital audio content through the headphones.
    Type: Application
    Filed: April 24, 2023
    Publication date: November 30, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Nicolas R. TSINGOS, Rhonda WILSON, Sunil BHARITKAR, C. Phillip BROWN, Alan J. SEEFELDT, Remi AUDFRAY
  • Publication number: 20230386486
    Abstract: The present invention relates to a method for predicting transform coefficients representing frequency content of an adaptive block length media signal, by receiving a frame and receiving block length information indicating a number of quantized transform coefficients for each block in the frame, the number of quantized transform coefficients being one of a first or second number, wherein the first number is greater than the second number, determining a first block has the second number of quantized transform coefficients, converting the first block into a converted block having the first number of quantized transform coefficients, conditioning a main neural network trained to predict at least one output variable given at least one conditioning variable, the at least one conditioning variable being based on information regarding the converted block and block length information for the first block, and providing at least one predicted transform coefficient from an output stage of the main neural network.
    Type: Application
    Filed: October 15, 2021
    Publication date: November 30, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Cong ZHOU, Grant A. DAVIDSON, Mark S. VINTON
  • Publication number: 20230388738
    Abstract: Improved tools for authoring and rendering audio reproduction data are provided. Some such authoring tools allow audio reproduction data to be generalized for a wide variety of reproduction environments. Audio reproduction data may be authored by creating metadata for audio objects. The metadata may be created with reference to speaker zones. During the rendering process, the audio reproduction data may be reproduced according to the reproduction speaker layout of a particular reproduction environment.
    Type: Application
    Filed: May 1, 2023
    Publication date: November 30, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Nicolas R. Tsingos, Charles Q. Robinson, Jurgen W. Scharpf
  • Publication number: 20230386500
    Abstract: Systems, methods, and computer program products for audio processing based on convolutional neural network (CNN) are described. The CNN architecture may comprise a multi-scale input block and a multi-scale nested block. The multi-scale input block may be configured to receive input data and to generate a first downsampled input data set by downsampling the input data. The multi-scale nested block may comprise a first encoding layer configured to generate a first encoded data set by performing a convolution based on the input data. The multi-scale nested block may comprise a second encoding layer configured to generate a second encoded data set by performing a convolution based on the first downsampled input data set. Furthermore, the multi-scale nested block may comprise a first convolutional layer configured to generate a first output data set by upsampling the second encoded data set, concatenating the first encoded data set and the upsampled second encoded data set, and performing a convolution.
    Type: Application
    Filed: October 19, 2021
    Publication date: November 30, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Jundai Sun, Lie Lu, Zhiwei Shuang
  • Publication number: 20230377589
    Abstract: A method for decoding an encoded audio bitstream is disclosed. The method includes receiving the encoded audio bitstream and decoding the audio data to generate a decoded lowband audio signal. The method further includes extracting high frequency reconstruction metadata and filtering the decoded lowband audio signal with an analysis filterbank to generate a filtered lowband audio signal. The method also includes extracting a flag indicating whether either spectral translation or harmonic transposition is to be performed on the audio data and regenerating a highband portion of the audio signal using the filtered lowband audio signal and the high frequency reconstruction metadata in accordance with the flag.
    Type: Application
    Filed: July 31, 2023
    Publication date: November 23, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Publication number: 20230377584
    Abstract: The present disclosure relates to a method and system for performing packet loss concealment using a neural network system. The method comprises obtaining a representation of an incomplete audio signal, inputting the representation of the incomplete audio signal to an encoder neural network and outputting a latent representation of a predicted complete audio signal. The latent representation is input to a decoder neural network which outputs a representation of a predicted complete audio signal comprising a reconstruction of the original portion of the complete audio signal, wherein said encoder neural network and said decoder neural network have been trained with an adversarial neural network.
    Type: Application
    Filed: October 14, 2021
    Publication date: November 23, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Santiago PASCUAL, Joan SERRA, Jordi PONS PUIG
  • Publication number: 20230368805
    Abstract: Embodiments relate to an audio processing unit that includes a buffer, bitstream payload deformatter, and a decoding subsystem. The buffer stores at least one block of an encoded audio bitstream. The block includes a fill element that begins with an identifier followed by fill data. The fill data includes at least one flag identifying whether enhanced spectral band replication (eSBR) processing is to be performed on audio content of the block. A corresponding method for decoding an encoded audio bitstream is also provided.
    Type: Application
    Filed: May 16, 2023
    Publication date: November 16, 2023
    Applicant: Dolby International AB
    Inventors: Lars Villemoes, Heiko Purnhagen, Per Ekstrand
  • Publication number: 20230368807
    Abstract: A system for suppressing noise and enhancing speech and a related method are disclosed. The system trains a neural network model that takes banded energies corresponding to an original noisy waveform and produces a speech value indicating the amount of speech present in each band at each frame. The neural model comprises a feature extraction block that implements some lookahead. The feature extraction block is followed by an encoder with steady down-sampling along the frequency domain forming a contracting path. The encoder is followed by a corresponding decoder with steady up-sampling along the frequency domain forming an expanding path. The decoder receives scaled output feature maps from the encoder at a corresponding level. The decoder is followed by a classification block that generates a speech value indicating an amount of speech present for each frequency band of the plurality of frequency bands at each frame of the plurality of frames.
    Type: Application
    Filed: October 29, 2021
    Publication date: November 16, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Xiaoyu LIU, Michael Getty HORGAN, Roy M. FEJGIN, Paul HOLMBERG
  • Publication number: 20230368344
    Abstract: Using a standard-based RGB to YCbCr color transform, a new RGB to YCC 3×3 transformation matrix and a 3×1 offset vector are derived under a set of coding-efficiency constraints. The new RGB to YCC 3×3 transform comprises a luminance scaling factor and a 2×2 chroma sub-matrix that preserves the energy of the standard-based RGB to YCbCr transform while maintaining or improving coding efficiency. It also adds support for an authorization or watermarking mechanism in streaming video applications. Examples of using the new color transform using image reshaping are also provided.
    Type: Application
    Filed: October 14, 2021
    Publication date: November 16, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Guan-Ming SU
  • Publication number: 20230370646
    Abstract: A global index value is generated for selecting a global reshaping function for an input image of a relatively low dynamic range using luma codewords in the input image. Image filtering is applied to the input image to generate a filtered image. The filtered values of the filtered image provide a measure of local brightness levels in the input image. Local index values are generated for selecting specific local reshaping functions for the input image using the global index value and the filtered values of the filtered image. A reshaped image of a relatively high dynamic range is generated by reshaping the input image with the specific local reshaping functions selected using the local index values.
    Type: Application
    Filed: October 1, 2021
    Publication date: November 16, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Tsung-Wei Huang, Guan-Ming Su, Neeraj J. Gadgil
  • Publication number: 20230362575
    Abstract: A method (910) for rendering an audio signal in a virtual reality rendering environment (180) is described. The method (910) comprises rendering (911) an origin audio signal of an audio source (311, 312, 313) from an origin source position on an origin sphere (114) around an origin listening position (301) of a listener (181). Furthermore, the method (910) comprises determining (912) that the listener (181) moves from the origin listening position (301) to a destination listening position (302). In addition, the method (910) comprises determining (913) a destination source position of the audio source (311, 312, 313) on a destination sphere (114) around the destination listening position (302) based on the origin source position, and determining (914) a destination audio signal of the audio source (311, 312, 313) based on the origin audio signal.
    Type: Application
    Filed: July 13, 2023
    Publication date: November 9, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Leon TERENTIV, Christof FERSCH, Daniel FISCHER
  • Publication number: 20230360662
    Abstract: The present invention relates to a method and device for processing a first and a second audio signal representing an input binaural audio signal acquired by a binaural recording device. The present invention further relates to a method for rendering a binaural audio signal on a speaker system. The method for processing a binaural signal comprises extracting audio information from the first audio signal, computing band gains for reducing noise in the first audio signal, and applying the band gains to respective frequency bands of the first audio signal in accordance with a dynamic scaling factor, to provide a first output audio signal, wherein the dynamic scaling factor has a value between zero and one and is selected so as to reduce quality degradation for the first audio signal.
    Type: Application
    Filed: September 15, 2021
    Publication date: November 9, 2023
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Zhiwei Shuang, Yuanxing Ma, Yang Liu, Ziyu Yang, Giulio Cengarle
  • Publication number: 20230359430
    Abstract: Media input audio data corresponding to a media stream and microphone input audio data from at least one microphone may be received. A first level of at least one of a plurality of frequency bands of the media input audio data, as well as a second level of at least one of a plurality of frequency bands of the microphone input audio data, may be determined. Media output audio data and microphone output audio data may be produced by adjusting levels of one or more of the first and second plurality of frequency bands based on the perceived loudness of the microphone input audio data, of the microphone output audio data, of the media output audio data and the media input audio data. One or more processes may be modified upon receipt of a mode-switching indication.
    Type: Application
    Filed: July 12, 2023
    Publication date: November 9, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Mark Alexander, Chunjian Li, Joshua Brandon Lando, Alan J. Seefeldt, C. Phillip Brown, Dirk Jeroen Breebaart
  • Publication number: 20230360659
    Abstract: A method for representing a second presentation of audio channels or objects as a data stream, the method comprising the steps of: (a) providing a set of base signals, the base signals representing a first presentation of the audio channels or objects; (b) providing a set of transformation parameters, the transformation parameters intended to transform the first presentation into the second presentation; the transformation parameters further being specified for at least two frequency bands and including a set of multi-tap convolution matrix parameters for at least one of the frequency bands.
    Type: Application
    Filed: July 13, 2023
    Publication date: November 9, 2023
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson
  • Publication number: 20230352058
    Abstract: A deep-learning-based system for performing automated multitrack mixing based on a plurality of input audio tracks is described herein. The system comprises one or more instances of a deep-learning-based first network and one or more instances of a deep-learning-based second network. Particularly, the first network is configured to, based on the input audio tracks, generate parameters for use in the automated multitrack mixing. The second network is configured to, based on the parameters, apply signal processing and at least one mixing gain to the input audio tracks, for generating an output mix of the audio tracks.
    Type: Application
    Filed: June 16, 2021
    Publication date: November 2, 2023
    Applicant: Dolby International AB
    Inventors: Christian James Steinmetz, Joan Serra
  • Publication number: 20230353740
    Abstract: A method for coding includes: segmenting an image into blocks; grouping blocks into a number of subsets; coding, using an entropy coding module, each subset, by associating digital information with symbols of each block of a subset, including, for the first block of the image, initializing state variables of the coding module; and generating a data sub-stream representative of at least one of the coded subsets of blocks. Where a current block is the first block to be coded of a subset, symbol occurrence probabilities for the first current block are determined based on those for a coded and decoded predetermined block of at least one other subset. Where the current block is the last coded block of the subset: writing, in the sub-stream representative of the subset, the entire digital information associated with the symbols during coding of the blocks of the subset, and implementing the initializing sub-step.
    Type: Application
    Filed: July 5, 2023
    Publication date: November 2, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Felix Henry, Stephane Pateux, Gordon Clare
  • Publication number: 20230353781
    Abstract: A method of coding at least one image comprising the steps of splitting the image into a plurality of blocks, of grouping said blocks into a predetermined number of subsets of blocks, of coding each of said subsets of blocks in parallel, the blocks of a subset considered being coded according to a predetermined sequential order of traversal. The coding step comprises, for a current block of a subset considered, the sub-step of predictive coding of said current block with respect to at least one previously coded and decoded block, and the sub-step of entropy coding of said current block on the basis of at least one probability of appearance of a symbol.
    Type: Application
    Filed: July 6, 2023
    Publication date: November 2, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Felix Henry, Stephane Pateux
  • Publication number: 20230353762
    Abstract: Several embodiments of scalable image processing systems and methods are disclosed herein whereby color management processing of source image data to be displayed on a target display is changed according to varying levels of metadata.
    Type: Application
    Filed: July 7, 2023
    Publication date: November 2, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Neil W. Messmer, Robin Atkins, Steve Margerm, Peter W. Longhurst
  • Publication number: 20230353970
    Abstract: Diffuse or spatially large audio objects may be identified for special processing. A decorrelation process may be performed on audio signals corresponding to the large audio objects to produce decorrelated large audio object audio signals. These decorrelated large audio object audio signals may be associated with object locations, which may be stationary or time-varying locations. For example, the decorrelated large audio object audio signals may be rendered to virtual or actual speaker locations. The output of such a rendering process may be input to a scene simplification process. The decorrelation, associating and/or scene simplification processes may be performed prior to a process of encoding the audio data.
    Type: Application
    Filed: July 10, 2023
    Publication date: November 2, 2023
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen BREEBAART, Lie LU, Nicolas R. TSINGOS, Antonio MATEOS SOLE
  • Publication number: 20230344970
    Abstract: A projection system and calibration method therefor relate to a light source configured to emit a light in response to an image data, an optical system configured to project the light emitted by the light source; receiving an input associated with a plurality of light values corresponding to a plurality of primary lightfields; converting the input associated with the plurality of light values to a plurality of projector primary color values; determining a gain map based on the plurality of projector primary color values; applying the gain map to an image to perform a chromaticity uniformity correction by adjusting levels of the plurality of primary lightfields so that a primary mixture is the same over an image frame, and projecting the image with the optical system in the image frame, wherein the second image is corrected by the gain map.
    Type: Application
    Filed: January 28, 2021
    Publication date: October 26, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventor: Jerome D. Shields
  • Publication number: 20230343344
    Abstract: A method of generating a substitution frame for a lost audio frame of an audio signal is presented. The method may comprise determining an audio filter based on samples of a valid audio frame preceding the lost audio frame. The method may comprise generating the substitution frame based on the audio filter and the samples of the valid audio frame preceding the lost audio frame. The method may be advantageously applied to a low frequency effects (LFE) channel of a multi-channel audio signal.
    Type: Application
    Filed: June 10, 2021
    Publication date: October 26, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Stefan BRUHN
  • Publication number: 20230343347
    Abstract: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.
    Type: Application
    Filed: April 20, 2023
    Publication date: October 26, 2023
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey RIEDMILLER, Harald MUNDT, Michael SCHUG, Martin WOLTERS
  • Publication number: 20230345055
    Abstract: In a method to improve backwards compatibility when decoding high-dynamic range images coded in a wide color gamut (WCG) space which may not be compatible with legacy color spaces, hue and/or saturation values of images in an image database are computed for both a legacy color space (say, YCbCr-gamma) and a preferred WCG color space (say, IPT-PQ). Based on a cost function, a reshaped color space is computed so that the distance between the hue values in the legacy color space and rotated hue values in the preferred color space is minimized. HDR images are coded in the reshaped color space. Legacy devices can still decode standard dynamic range images assuming they are coded in the legacy color space, while updated devices can use color reshaping information to decode HDR images in the preferred color space at full dynamic range.
    Type: Application
    Filed: June 27, 2023
    Publication date: October 26, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Robin Atkins, Peng Yin, Taoran Lu, Jaclyn Anne Pytlarz
  • Publication number: 20230345176
    Abstract: A method and apparatus for reconstructing N audio channels from M audio channels is disclosed. The method includes receiving a bitstream containing an encoded audio signal representing the M audio channels and decoding the encoded audio signal to obtain a frequency domain representation of the M audio channels. The method further includes extracting a parameter from the bitstream and reconstructing at least one of the N audio channels using the parameter. The parameter represents an angle between two signals, at least one of which is included in the M audio channels.
    Type: Application
    Filed: May 3, 2023
    Publication date: October 26, 2023
    Applicant: DOLBY INTERNATIONAL AB
    Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Jonas ENGDEGARD, Jonas ROEDEN, Kristofer KJOERLING
  • Publication number: 20230343100
    Abstract: In a cloud-based system for encoding high dynamic range (HDR) video, each node receives a video segment and bumper frames. Each segment is subdivided into primary scenes and secondary scenes to derive scene-based forward reshaping functions that minimize the amount of reshaping-related metadata when coding the video segment, while maintaining temporal continuity among scenes processed by multiple nodes. Methods to generate scene-based forward and backward reshaping functions to optimize video coding and improve the coding efficiency of reshaping-related metadata are also examined.
    Type: Application
    Filed: September 17, 2021
    Publication date: October 26, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: Harshad Kadu, Guan-Ming Su, Neeraj J. Gadgil, Tsung-Wei Huang
  • Publication number: 20230341754
    Abstract: A novel spatial light modulator (SLM) includes a cover glass, a modulation layer, and a plurality of pixel mirrors, and separates unwanted, reflected light from desired, modulated light. In one embodiment, a geometrical relationship exists between the cover glass and the pixel mirrors, such that light that reflects from the cover glass is separated from light that reflects from the pixel mirrors and is transmitted from the SLM. In one example, one of the cover glass or the pixel mirrors is angled with respect to the modulation layer. In another example embodiment, the cover glass has a particular thickness, which introduces destructive interference between light that reflects from the top and bottom surfaces of the cover glass. In another embodiment, antireflective coatings are disposed between optical interfaces of the SLM. In another embodiment, light from the SLM is directed through an optical filter to remove unwanted light.
    Type: Application
    Filed: June 30, 2023
    Publication date: October 26, 2023
    Applicant: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Juan P. PERTIERRA, Martin J. RICHARDS, Barret LIPPEY
  • Publication number: 20230345192
    Abstract: Methods for dialogue enhancing audio content, comprising providing a first audio signal presentation of the audio components, providing a second audio signal presentation, receiving a set of dialogue estimation parameters configured to enable estimation of dialogue components from the first audio signal presentation, applying said set of dialogue estimation parameters to said first audio signal presentation, to form a dialogue presentation of the dialogue components; and combining the dialogue presentation with said second audio signal presentation to form a dialogue enhanced audio signal presentation for reproduction on the second audio reproduction system, wherein at least one of said first and second audio signal presentation is a binaural audio signal presentation.
    Type: Application
    Filed: April 28, 2023
    Publication date: October 26, 2023
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Leif Jonas Samuelsson, Dirk Jeroen Breebaart, David Matthew Cooper, Jeroen Koppens
  • Publication number: 20230343346
    Abstract: Described is a method of frame-wise encoding metadata for an input signal, the metadata comprising a plurality of at least partially interrelated parameters calculable from the input signal. The method comprises, for each frame: iteratively performing, by using a looping process, steps of: determining a processing strategy from a plurality of processing strategies for calculating and quantizing the parameters; calculating and quantizing the parameters based on the determined processing strategy to obtain quantized parameters; and encoding the quantized parameters. In particular, each of the plurality of processing strategies comprises a respective first indication indicative of an ordering related to the calculation and quantization of individual parameters; and the processing strategy is determined based on at least one bitrate threshold.
    Type: Application
    Filed: June 10, 2021
    Publication date: October 26, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: David S. MCGRATH, Rishabh TYAGI, Stefanie BROWN, Juan Felix Torres
  • Publication number: 20230335142
    Abstract: A method comprising receiving a first input bit stream for a first parametrically coded input audio signal, the first input bit stream including data representing a first input core audio signal and a first set including at least one spatial parameter relating to the first parametrically coded input audio signal. A first covariance matrix of the first parametrically coded audio signal is determined based on the spatial parameter(s) of the first set. A modified set including at least one spatial parameter is determined based on the determined first covariance matrix, wherein the modified set is different from the first set. An output core audio signal is determined, which is based on, or constituted by, the first input core audio signal. An output bit stream for a parametrically coded output audio signal is generated, the output bit stream including data representing the output core audio signal and the modified set.
    Type: Application
    Filed: September 7, 2021
    Publication date: October 19, 2023
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen Breebaart, Michael Eckert, Heiko Purnhagen
  • Publication number: 20230326469
    Abstract: Embodiments are disclosed for a matrix coded stereo signal with periphonic elements. A mixing matrix, suitable for processing a multi-channel audio input signal, is constructed so that the resulting multi-channel output signal contains the same audio elements from the input signal, wherein the spatial relationships between audio elements, as defined by panning rules associated with the input signal format, are preserved in the output signal, as defined by matrix encoding rules associated with the output signal format. The choice of the coefficients of the mixing matrix is governed by a phase-preference rule that is used to determine the appropriate phase to apply to each input signal channel.
    Type: Application
    Filed: August 26, 2021
    Publication date: October 12, 2023
    Applicant: Dolby Laboratories Licensing Corporation
    Inventors: David S. MCGRATH, Hao LUO