Patents by Inventor Stefan Bruhn

Stefan Bruhn has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250140267
    Abstract: The application provides a method (500) for encoding an Ambisonics input audio signal. The method (500) comprises providing (501) the input audio signal to a SPAR encoder and to a DirAC analyzer and parameter encoder. Furthermore, the method (500) comprises generating (502) an encoder bitstream based on output of the SPAR encoder and based on output of the DirAC analyzer and parameter encoder. The application also provides a method for decoding the encoder bitstream by generating an intermediate Ambisonics signal using a SPAR decoder based on the encoder bitstream and processing the intermediate Ambisonics signal using a DirAC synthesizer to provide an output audio signal for rendering. The DirAC synthesizer may use DirAC parameters of the bitstream or DirAC parameters obtained by analyzing the intermediate Ambisonics signal.
    Type: Application
    Filed: November 30, 2022
    Publication date: May 1, 2025
    Applicant: DOLBY INTERNATIONAL AB
    Inventor: Stefan BRUHN
  • Publication number: 20250119698
    Abstract: There are provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
    Type: Application
    Filed: October 24, 2024
    Publication date: April 10, 2025
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Stefan BRUHN
  • Publication number: 20250095660
    Abstract: Described herein is a method of encoding Higher Order Ambisonics, HOA, audio, the method including: receiving an input HOA audio signal having more than four Ambisonics channels; encoding the HOA audio signal using a SPAR coding framework and a core audio encoder; and providing the encoded HOA audio signal to a downstream device, the encoded HOA audio signal including core encoded SPAR downmix channels and encoded SPAR metadata. Further described are a method of decoding Higher Order Ambisonics, HOA, audio, respective apparatuses and computer program products.
    Type: Application
    Filed: January 9, 2023
    Publication date: March 20, 2025
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Stefanie BROWN, Stefan BRUHN, Rishabh TYAGI
  • Publication number: 20250088816
    Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
    Type: Application
    Filed: November 26, 2024
    Publication date: March 13, 2025
    Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee
  • Publication number: 20250061900
    Abstract: There are provided mechanisms for frame loss concealment. A method is performed by a receiving entity. The method comprises adding, in association with constructing a substitution frame for a lost frame, a noise component to the substitution frame. The noise component has a frequency characteristic corresponding to a low-resolution spectral representation of a signal in a previously received frame.
    Type: Application
    Filed: October 24, 2024
    Publication date: February 20, 2025
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventor: Stefan BRUHN
  • Publication number: 20250037724
    Abstract: Concealment of a lost audio frame can be performed by identifying at least one peak of a magnitude spectrum of a prototype frame; identifying frequencies in the vicinity of the at least one peak to identify a sinusoidal frequency fk with higher resolution than the frequency resolution of the used frequency domain transform; calculating a phase shift for a sinusoid k; shifting a phase of all spectral coefficients in the prototype frame included in an interval Mk around the sinusoid k by that phase shift while retaining the magnitude of those spectral coefficients; randomizing phases of spectral coefficients that are not phase shifted; and creating a substitution frame by performing an inverse frequency transform of a frequency spectrum of the prototype frame.
    Type: Application
    Filed: October 16, 2024
    Publication date: January 30, 2025
    Inventor: Stefan BRUHN
  • Patent number: 12167219
    Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
    Type: Grant
    Filed: November 12, 2019
    Date of Patent: December 10, 2024
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee
  • Patent number: 12159635
    Abstract: There are provided mechanisms for frame loss concealment. A method is performed by a receiving entity. The method comprises adding, in association with constructing a substitution frame for a lost frame, a noise component to the substitution frame. The noise component has a frequency characteristic corresponding to a low-resolution spectral representation of a signal in a previously received frame.
    Type: Grant
    Filed: May 19, 2023
    Date of Patent: December 3, 2024
    Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)
    Inventor: Stefan Bruhn
  • Publication number: 20240395268
    Abstract: A method performed by an encoder. The method comprises determining envelope representation residual coefficients as first compressed envelope representation coefficients subtracted from the input envelope representation coefficients. The method comprises transforming the envelope representation residual coefficients into a warped domain so as to obtain transformed envelope representation residual coefficients. The method comprises applying at least one of a plurality of gain-shape coding schemes on the transformed envelope representation residual coefficients in order to achieve gain-shape coded envelope representation residual coefficients, where the plurality of gain-shape coding schemes have mutually different trade-offs in one or more of gain resolution and shape resolution for one or more of the transformed envelope representation residual coefficients.
    Type: Application
    Filed: April 29, 2024
    Publication date: November 28, 2024
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Jonas SVEDBERG, Stefan BRUHN, Martin SEHLSTEDT
  • Patent number: 12156012
    Abstract: There are provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
    Type: Grant
    Filed: September 12, 2023
    Date of Patent: November 26, 2024
    Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Stefan Bruhn
  • Patent number: 12148434
    Abstract: Concealing a lost audio frame of a received audio signal is provided by performing a sinusoidal analysis of a part of a previously received or reconstructed audio signal, wherein the sinusoidal analysis involves identifying frequencies of sinusoidal components of the audio signal, applying a sinusoidal model on a segment of the previously received or reconstructed audio signal, wherein said segment is used as a prototype frame in order to create a substitution frame for a lost audio frame, and creating the substitution frame for the lost audio frame by time-evolving sinusoidal components of the prototype frame, up to the time instance of the lost audio frame, in response to the corresponding identified frequencies.
    Type: Grant
    Filed: September 20, 2022
    Date of Patent: November 19, 2024
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventor: Stefan Bruhn
  • Publication number: 20240347069
    Abstract: The present document describes a method for generating a bitstream, wherein the bitstream comprises a sequence of superframes for a sequence of frames of an immersive audio signal. The method comprises, repeatedly for the sequence of superframes, inserting coded audio data for one or more frames of one or more downmix channel signals derived from the immersive audio signal, into data fields of a superframe; and inserting metadata for reconstructing one or more frames of the immersive audio signal from the coded audio data, into a metadata field of the superframe.
    Type: Application
    Filed: June 21, 2024
    Publication date: October 17, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Stefan BRUHN, Juan Felix TORRES
  • Publication number: 20240331708
    Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported or not supported by an encoding unit of the audio device. Based on the determining, the simplification unit converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding unit.
    Type: Application
    Filed: May 8, 2024
    Publication date: October 3, 2024
    Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Stefan BRUHN, Michael ECKERT, Juan Felix TORRES, Stefanie BROWN, David S. MCGRATH
  • Patent number: 12020718
    Abstract: The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).
    Type: Grant
    Filed: July 2, 2019
    Date of Patent: June 25, 2024
    Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Stefan Bruhn, Juan Felix Torres
  • Patent number: 12014745
    Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported or not supported by an encoding unit of the audio device. Based on the determining, the simplification unit converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding unit.
    Type: Grant
    Filed: August 8, 2022
    Date of Patent: June 18, 2024
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath
  • Publication number: 20240196156
    Abstract: An aspect of the present disclosure relates to processing audio comprising decoding a first bitstream (b1) to obtain decoded immersive audio content (A), decoding a second bitstream (bp) to obtain pose information associated with a user of a lightweight processing device, determining a first head pose based on the pose information, providing a downmix representation (Dmx) of the immersive audio content (A) corresponding to the first head pose, rendering a set of binaural representations (BINn) of the immersive audio content (A), wherein the binaural representations correspond to a second set of head poses (Pn), computing reconstruction metadata (M) to enable reconstruction of the set of binaural representations from the downmix representation (Dmx), the metadata (M) including the first head pose, and encoding the downmix representation (Dmx) and the reconstruction metadata (M) in a third bitstream (b2).
    Type: Application
    Filed: February 7, 2024
    Publication date: June 13, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Rishabh TYAGI, Stefan BRUHN, Juan Felix TORRES
  • Patent number: 11990145
    Abstract: A method performed by an encoder. The method comprises determining envelope representation residual coefficients as first compressed envelope representation coefficients subtracted from the input envelope representation coefficients. The method comprises transforming the envelope representation residual coefficients into a warped domain so as to obtain transformed envelope representation residual coefficients. The method comprises applying at least one of a plurality of gain-shape coding schemes on the transformed envelope representation residual coefficients in order to achieve gain-shape coded envelope representation residual coefficients, where the plurality of gain-shape coding schemes have mutually different trade-offs in one or more of gain resolution and shape resolution for one or more of the transformed envelope representation residual coefficients.
    Type: Grant
    Filed: August 22, 2022
    Date of Patent: May 21, 2024
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Jonas Svedberg, Stefan Bruhn, Martin Sehlstedt
  • Publication number: 20240153512
    Abstract: A method for performing gain control on audio signals is provided. In some implementations, the method involves determining downmixed signals associated with one or more downmix channels associated with a current frame of an audio signal to be encoded. In some implementations, the method involves determining whether an overload condition exists for an encoder. In some implementations, the method involves determining a gain parameter. In some implementations, the method involves determining at least one gain transition function based on the gain parameter and a gain parameter associated with a preceding frame of the audio signal. In some implementations, the method involves applying the at least one gain transition function to one or more of the downmixed signals. In some implementations, the method involves encoding the downmixed signals in connection with information indicative of gain control applied to the current frame.
    Type: Application
    Filed: March 8, 2022
    Publication date: May 9, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Panji Setiawan, Rishabh Tyagi, Stefan Bruhn
  • Publication number: 20240114307
    Abstract: There are provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
    Type: Application
    Filed: September 12, 2023
    Publication date: April 4, 2024
    Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
    Inventor: Stefan BRUHN
  • Publication number: 20240013793
    Abstract: A method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, the method involves determining rotation parameters based on the determined spatial direction and a direction preference of a coding scheme to be used to encode the input audio signal. In some implementations, the method involves rotating sound components of the frame based on the rotation parameters such that, after being rotated, the dominant sound component has a spatial direction that aligns with the direction preference of the coding scheme. In some implementations, the method involves encoding the rotated sound components of the frame of the input audio signal using the coding scheme in connection with an indication of the rotation parameters or an indication of the spatial direction of the dominant sound component.
    Type: Application
    Filed: December 2, 2021
    Publication date: January 11, 2024
    Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Stefan BRUHN, Harald MUNDT, David S. MCGRATH, Stefanie BROWN
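
Illustrative sketch (not taken from any of the listed filings): the rotation idea in the abstract of publication 20240013793 can be pictured with a first-order Ambisonics frame. The sketch below assumes a frame given as four NumPy arrays (w, x, y, z), estimates the dominant azimuth from the time-averaged product of w with x and y, and rotates the horizontal components about the vertical axis so that the dominant component points at a preferred azimuth. The function names, the intensity-style direction estimate, the yaw-only rotation, and the default preference of 0 radians are all assumptions made for illustration; the abstract does not specify them.

```python
import numpy as np

def dominant_azimuth(w, x, y):
    """Estimate the azimuth (radians) of the dominant sound in one frame.

    Assumption: the time-averaged products w*x and w*y act as a simple
    intensity-style direction estimate in the horizontal plane.
    """
    ix = float(np.mean(w * x))
    iy = float(np.mean(w * y))
    return float(np.arctan2(iy, ix))

def rotate_yaw(x, y, angle):
    """Rotate the horizontal components by `angle` radians about the vertical axis."""
    c, s = np.cos(angle), np.sin(angle)
    return c * x - s * y, s * x + c * y

def align_frame(w, x, y, z, preferred_azimuth=0.0):
    """Rotate a frame so its dominant component points at `preferred_azimuth`.

    Returns the rotated frame and the applied rotation angle; the angle is
    the side information a decoder would need to undo the rotation.
    """
    rotation = preferred_azimuth - dominant_azimuth(w, x, y)
    xr, yr = rotate_yaw(x, y, rotation)
    return (w, xr, yr, z), rotation

# Example: a single 440 Hz source at 60 degrees azimuth in a 20 ms frame.
t = np.arange(960) / 48000.0
src = np.sin(2 * np.pi * 440.0 * t)
az = np.deg2rad(60.0)
frame = (src, src * np.cos(az), src * np.sin(az), np.zeros_like(src))
rotated, rotation = align_frame(*frame)
# A decoder-side inverse is the same rotation with the opposite sign.
x_back, y_back = rotate_yaw(rotated[1], rotated[2], -rotation)
```

In this sketch the rotation angle plays the role of the "indication of the rotation parameters" mentioned in the abstract; a real codec would quantize it and would typically handle elevation and higher-order components as well.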