Patents by Inventor Stefan Bruhns
Stefan Bruhns has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250140267
Abstract: The application provides a method (500) for encoding an Ambisonics input audio signal. The method (500) comprises providing (501) the input audio signal to a SPAR encoder and to a DirAC analyzer and parameter encoder. Furthermore, the method (500) comprises generating (502) an encoder bit stream based on output of the SPAR encoder and based on output of the DirAC analyzer and parameter encoder. The application also provides a method for decoding the encoder bitstream by generating an intermediate Ambisonics signal using a SPAR decoder based on the encoder bitstream and processing the intermediate Ambisonics signal using a DirAC synthesizer to provide an output audio signal for rendering. The DirAC synthesizer may use DirAC parameters of the bitstreams or DirAC parameters obtained by analyzing the intermediate Ambisonics signal.
Type: Application
Filed: November 30, 2022
Publication date: May 1, 2025
Applicant: DOLBY INTERNATIONAL AB
Inventor: Stefan BRUHN
-
Publication number: 20250119698
Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
Type: Application
Filed: October 24, 2024
Publication date: April 10, 2025
Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Stefan BRUHN
-
Publication number: 20250095660
Abstract: Described herein is a method of encoding Higher Order Ambisonics, HOA, audio, the method including: receiving an input HOA audio signal having more than four Ambisonics channels; encoding the HOA audio signal using a SPAR coding framework and a core audio encoder; and providing the encoded HOA audio signal to a downstream device, the encoded HOA audio signal including core encoded SPAR downmix channels and encoded SPAR metadata. Further described are a method of decoding Higher Order Ambisonics, HOA, audio, respective apparatuses and computer program products.
Type: Application
Filed: January 9, 2023
Publication date: March 20, 2025
Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
Inventors: Stefanie BROWN, Stefan BRUHN, Rishabh TYAGI
-
Publication number: 20250088816
Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
Type: Application
Filed: November 26, 2024
Publication date: March 13, 2025
Applicants: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee
-
Publication number: 20250061900
Abstract: There is provided mechanisms for frame loss concealment. A method is performed by a receiving entity. The method comprises adding, in association with constructing a substitution frame for a lost frame, a noise component to the substitution frame. The noise component has a frequency characteristic corresponding to a low-resolution spectral representation of a signal in a previously received frame.
Type: Application
Filed: October 24, 2024
Publication date: February 20, 2025
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventor: Stefan BRUHN
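The noise-shaping idea in the abstract above can be illustrated with a minimal sketch. It is not the patented method: the band size, the `noise_gain` scaling, and the function names `band_envelope` and `shaped_noise` are all assumptions made for illustration. The sketch averages the magnitude spectrum of a previously received frame over coarse bands (a low-resolution spectral representation) and produces random-phase noise coefficients that follow that envelope.

```python
import cmath
import math
import random


def band_envelope(magnitudes, band_size):
    """Average the magnitudes over consecutive bands of band_size bins."""
    envelope = []
    for start in range(0, len(magnitudes), band_size):
        band = magnitudes[start:start + band_size]
        envelope.append(sum(band) / len(band))
    return envelope


def shaped_noise(prev_spectrum, band_size=4, noise_gain=0.1, rng=None):
    """Return noise coefficients shaped by the coarse envelope of prev_spectrum.

    band_size and noise_gain are illustrative assumptions, not values
    taken from the source.
    """
    rng = rng or random.Random(0)
    magnitudes = [abs(c) for c in prev_spectrum]
    envelope = band_envelope(magnitudes, band_size)
    noise = []
    for i in range(len(magnitudes)):
        level = noise_gain * envelope[i // band_size]
        phase = rng.uniform(-math.pi, math.pi)  # random phase per bin
        noise.append(cmath.rect(level, phase))
    return noise
```

In a concealment scheme of this kind, the returned coefficients would be added to the spectrum of the substitution frame before the inverse transform; how the noise level is chosen is left open here.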
-
Publication number: 20250037724
Abstract: Concealment of a lost audio frame can be performed by identifying at least one peak of a magnitude spectrum of the prototype frame; identifying frequencies in the vicinity of the at least one peak to identify a sinusoidal frequency fk with higher resolution than the frequency resolution of the used frequency domain transform; calculating a phase shift θk for a sinusoid k; shifting a phase of all spectral coefficients in the prototype frame included in an interval Mk around the sinusoid k by θk while retaining the magnitude of those spectral coefficients; randomizing phases of spectral coefficients that are not phase shifted; and creating a substitution frame by performing an inverse frequency transform of a frequency spectrum of the prototype frame.
Type: Application
Filed: October 16, 2024
Publication date: January 30, 2025
Inventor: Stefan BRUHN
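The phase-shifting step described in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the claimed procedure: the abstract does not say how the phase shift is computed, so the sketch assumes it is the phase a sinusoid of frequency `freq_hz` advances over `offset_samples` samples, namely 2π · f · n / fs. The function name `conceal_spectrum`, the `half_width` interval parameter, and the `(bin_index, freq_hz)` peak format are hypothetical. Peak picking, frequency refinement, and the inverse transform are omitted.

```python
import cmath
import math
import random


def conceal_spectrum(spectrum, peaks, sample_rate, offset_samples,
                     half_width=1, rng=None):
    """Phase-shift bins around each sinusoid; randomize the phase elsewhere.

    spectrum: complex coefficients of the prototype frame.
    peaks: list of (bin_index, freq_hz) pairs, one per sinusoid (assumed input).
    Magnitudes are preserved for every coefficient.
    """
    rng = rng or random.Random(0)
    out = list(spectrum)
    shifted = set()
    for bin_index, freq_hz in peaks:
        # Assumed phase advance of the sinusoid over the time offset.
        theta = 2.0 * math.pi * freq_hz * offset_samples / sample_rate
        rotation = cmath.exp(1j * theta)
        first = max(0, bin_index - half_width)
        last = min(len(spectrum) - 1, bin_index + half_width)
        for i in range(first, last + 1):
            if i not in shifted:
                out[i] = spectrum[i] * rotation  # magnitude unchanged
                shifted.add(i)
    for i in range(len(out)):
        if i not in shifted:
            random_phase = rng.uniform(-math.pi, math.pi)
            out[i] = cmath.rect(abs(spectrum[i]), random_phase)
    return out
```

A real decoder operating on a real-valued signal would also have to keep the spectrum conjugate-symmetric before the inverse transform; the sketch treats the coefficient list as a one-sided spectrum and leaves that step out.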
-
Patent number: 12167219
Abstract: The disclosure herein generally relates to capturing, acoustic pre-processing, encoding, decoding, and rendering of directional audio of an audio scene. In particular, it relates to a device adapted to modify a directional property of a captured directional audio in response to spatial data of a microphone system capturing the directional audio. The disclosure further relates to a rendering device configured to modify a directional property of a received directional audio in response to received spatial data.
Type: Grant
Filed: November 12, 2019
Date of Patent: December 10, 2024
Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
Inventors: Stefan Bruhn, Juan Felix Torres, David S. McGrath, Brian B. Lee
-
Patent number: 12159635
Abstract: There is provided mechanisms for frame loss concealment. A method is performed by a receiving entity. The method comprises adding, in association with constructing a substitution frame for a lost frame, a noise component to the substitution frame. The noise component has a frequency characteristic corresponding to a low-resolution spectral representation of a signal in a previously received frame.
Type: Grant
Filed: May 19, 2023
Date of Patent: December 3, 2024
Assignee: TELEFONAKTIEBOLAGET L M ERICSSON (PUBL)
Inventor: Stefan Bruhn
-
Publication number: 20240395268
Abstract: A method performed by an encoder. The method comprises determining envelope representation residual coefficients as first compressed envelope representation coefficients subtracted from the input envelope representation coefficients. The method comprises transforming the envelope representation residual coefficients into a warped domain so as to obtain transformed envelope representation residual coefficients. The method comprises applying, at least one of a plurality of gain-shape coding schemes on the transformed envelope representation residual coefficients in order to achieve gain-shape coded envelope representation residual coefficients, where the plurality of gain-shape coding schemes have mutually different trade-offs in one or more of gain resolution and shape resolution for one or more of the transformed envelope representation residual coefficients.
Type: Application
Filed: April 29, 2024
Publication date: November 28, 2024
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Jonas SVEDBERG, Stefan BRUHN, Martin SEHLSTEDT
-
Patent number: 12156012
Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
Type: Grant
Filed: September 12, 2023
Date of Patent: November 26, 2024
Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Stefan Bruhn
-
Patent number: 12148434
Abstract: Concealing a lost audio frame of a received audio signal is provided by performing a sinusoidal analysis of a part of a previously received or reconstructed audio signal, wherein the sinusoidal analysis involves identifying frequencies of sinusoidal components of the audio signal, applying a sinusoidal model on a segment of the previously received or reconstructed audio signal, wherein said segment is used as a prototype frame in order to create a substitution frame for a lost audio frame, and creating the substitution frame for the lost audio frame by time-evolving sinusoidal components of the prototype frame, up to the time instance of the lost audio frame, in response to the corresponding identified frequencies.
Type: Grant
Filed: September 20, 2022
Date of Patent: November 19, 2024
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventor: Stefan Bruhn
-
Publication number: 20240347069
Abstract: The present document describes a method for generating a bitstream, wherein the bitstream comprises a sequence of superframes for a sequence of frames of an immersive audio signal. The method comprises, repeatedly for the sequence of superframes, inserting coded audio data for one or more frames of one or more downmix channel signals derived from the immersive audio signal, into data fields of a superframe; and inserting metadata for reconstructing one or more frames of the immersive audio signal from the coded audio data, into a metadata field of the superframe.
Type: Application
Filed: June 21, 2024
Publication date: October 17, 2024
Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
Inventors: Stefan BRUHN, Juan Felix TORRES
-
Publication number: 20240331708
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
Type: Application
Filed: May 8, 2024
Publication date: October 3, 2024
Applicants: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Stefan BRUHN, Michael ECKERT, Juan Felix TORRES, Stefanie BROWN, David S. MCGRATH
-
Patent number: 12020718
Abstract: The present document describes a method (500) for generating a bitstream (101), wherein the bitstream (101) comprises a sequence of superframes (400) for a sequence of frames of an immersive audio signal (111). The method (500) comprises, repeatedly for the sequence of superframes (400), inserting (501) coded audio data (206) for one or more frames of one or more downmix channel signals (203) derived from the immersive audio signal (111), into data fields (411, 421, 412, 422) of a superframe (400); and inserting (502) metadata (202, 205) for reconstructing one or more frames of the immersive audio signal (111) from the coded audio data (206), into a metadata field (403) of the superframe (400).
Type: Grant
Filed: July 2, 2019
Date of Patent: June 25, 2024
Assignees: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
Inventors: Stefan Bruhn, Juan Felix Torres
-
Patent number: 12014745
Abstract: The disclosed embodiments enable converting audio signals captured in various formats by various capture devices into a limited number of formats that can be processed by an audio codec (e.g., an Immersive Voice and Audio Services (IVAS) codec). In an embodiment, a simplification unit of the audio device receives an audio signal captured by one or more audio capture devices coupled to the audio device. The simplification unit determines whether the audio signal is in a format that is supported/not supported by an encoding unit of the audio device. Based on the determining, the simplification unit, converts the audio signal into a format that is supported by the encoding unit. In an embodiment, if the simplification unit determines that the audio signal is in a spatial format, the simplification unit can convert the audio signal into a spatial “mezzanine” format supported by the encoding.
Type: Grant
Filed: August 8, 2022
Date of Patent: June 18, 2024
Assignee: Dolby Laboratories Licensing Corporation
Inventors: Stefan Bruhn, Michael Eckert, Juan Felix Torres, Stefanie Brown, David S. McGrath
-
Publication number: 20240196156
Abstract: An aspect of the present disclosure relates to processing audio comprising decoding a first bitstream (b1) to obtain decoded immersive audio content (A), decoding a second bitstream (bp) to obtain pose information (P, V, V?) associated with a user of a lightweight processing device, determining a first head-pose (P?) based on the pose information, providing a downmix representation (Dmx) of the immersive audio content (A) corresponding to the first head pose (P?), rendering a set of binaural representations (BINn) of the immersive audio content (A), wherein the binaural representations correspond to a second set of head poses (Pn), computing reconstruction metadata (M) to enable reconstruction of the set of binaural representations from the downmix representation (Dmx), the metadata (M) including the first head pose (P?), and encoding the downmix representation (Dmx) and the reconstruction metadata (M) in a third bitstream (b2).
Type: Application
Filed: February 7, 2024
Publication date: June 13, 2024
Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
Inventors: Rishabh TYAGI, Stefan BRUHN, Juan Felix TORRES
-
Patent number: 11990145
Abstract: A method performed by an encoder. The method comprises determining envelope representation residual coefficients as first compressed envelope representation coefficients subtracted from the input envelope representation coefficients. The method comprises transforming the envelope representation residual coefficients into a warped domain so as to obtain transformed envelope representation residual coefficients. The method comprises applying, at least one of a plurality of gain-shape coding schemes on the transformed envelope representation residual coefficients in order to achieve gain-shape coded envelope representation residual coefficients, where the plurality of gain-shape coding schemes have mutually different trade-offs in one or more of gain resolution and shape resolution for one or more of the transformed envelope representation residual coefficients.
Type: Grant
Filed: August 22, 2022
Date of Patent: May 21, 2024
Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Jonas Svedberg, Stefan Bruhn, Martin Sehlstedt
-
Publication number: 20240153512
Abstract: A method for performing gain control on audio signals is provided. In some implementations, the method involves determining downmixed signals associated with one or more downmix channels associated with a current frame of an audio signal to be encoded. In some implementations, the method involves determining whether an overload condition exists for an encoder. In some implementations, the method involves determining a gain parameter. In some implementations, the method involves determining at least one gain transition function based on the gain parameter and a gain parameter associated with a preceding frame of the audio signal. In some implementations, the method involves applying the at least one gain transition function to one or more of the downmixed signals. In some implementations, the method involves encoding the downmixed signals in connection with information indicative of gain control applied to the current frame.
Type: Application
Filed: March 8, 2022
Publication date: May 9, 2024
Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
Inventors: Panji Setiawan, Rishabh Tyagi, Stefan Bruhn
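The gain-control steps listed in this abstract can be sketched roughly as follows. Everything specific here is an assumption for illustration and is not taken from the source: the overload test (peak above `limit`), the gain parameter as an integer exponent `e` with gain `2 ** -e`, the linear shape of the transition function, and the function names.

```python
import math


def gain_exponent(frame, limit=1.0):
    """Smallest integer e >= 0 such that peak * 2**-e <= limit (assumed rule)."""
    peak = max(abs(x) for x in frame)
    if peak <= limit:
        return 0  # no overload condition
    return math.ceil(math.log2(peak / limit))


def gain_transition(prev_exp, cur_exp, length):
    """Linear ramp from the previous frame's gain to the current frame's gain."""
    g_prev = 2.0 ** -prev_exp
    g_cur = 2.0 ** -cur_exp
    if length == 1:
        return [g_cur]
    step = (g_cur - g_prev) / (length - 1)
    return [g_prev + step * n for n in range(length)]


def apply_gain_control(frame, prev_exp, limit=1.0):
    """Scale one downmix frame; return the scaled frame and its gain exponent."""
    cur_exp = gain_exponent(frame, limit)
    gains = gain_transition(prev_exp, cur_exp, len(frame))
    scaled = [x * g for x, g in zip(frame, gains)]
    return scaled, cur_exp
```

The returned exponent stands in for the "information indicative of gain control" that would be sent with the frame so a decoder can apply the inverse ramp. Note that a ramp starting at the previous gain does not by itself keep early samples of the frame under the limit; a practical design would need look-ahead or a faster transition, which this sketch does not attempt.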
-
Publication number: 20240114307
Abstract: There is provided encoding and decoding methods for representing spatial audio that is a combination of directional sound and diffuse sound. An exemplary encoding method includes inter alia creating a single- or multi-channel downmix audio signal by downmixing input audio signals from a plurality of microphones in an audio capture unit capturing the spatial audio; determining first metadata parameters associated with the downmix audio signal, wherein the first metadata parameters are indicative of one or more of: a relative time delay value, a gain value, and a phase value associated with each input audio signal; and combining the created downmix audio signal and the first metadata parameters into a representation of the spatial audio.
Type: Application
Filed: September 12, 2023
Publication date: April 4, 2024
Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
Inventor: Stefan BRUHN
-
Publication number: 20240013793
Abstract: A method for encoding scene-based audio is provided. In some implementations, the method involves determining, by an encoder, a spatial direction of a dominant sound component in a frame of an input audio signal. In some implementations, the method involves determining rotation parameters based on the determined spatial direction and a direction preference of a coding scheme to be used to encode the input audio signal. In some implementations, the method involves rotating sound components of the frame based on the rotation parameters such that, after being rotated, the dominant sound component has a spatial direction that aligns with the direction preference of the coding scheme. In some implementations, the method involves encoding the rotated sound components of the frame of the input audio signal using the coding scheme in connection with an indication of the rotation parameters or an indication of the spatial direction of the dominant sound component.
Type: Application
Filed: December 2, 2021
Publication date: January 11, 2024
Applicants: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
Inventors: Stefan BRUHN, Harald MUNDT, David S. MCGRATH, Stefanie BROWN
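The rotation idea in this abstract can be illustrated for the simplest case: a first-order Ambisonics frame (channels W, X, Y, Z) rotated about the vertical axis only. The restriction to yaw, the direction estimate from the time-averaged products of W with X and Y, the channel convention (X pointing front, Y pointing left, W with positive gain), and all function names are assumptions for this sketch, not details from the source.

```python
import math


def dominant_azimuth(w, x, y):
    """Estimate the dominant horizontal direction from W*X and W*Y averages."""
    ix = sum(a * b for a, b in zip(w, x))
    iy = sum(a * b for a, b in zip(w, y))
    return math.atan2(iy, ix)


def rotate_yaw(x, y, alpha):
    """Rotate the X and Y channels by alpha radians about the vertical axis."""
    c, s = math.cos(alpha), math.sin(alpha)
    x_rot = [c * a - s * b for a, b in zip(x, y)]
    y_rot = [s * a + c * b for a, b in zip(x, y)]
    return x_rot, y_rot


def align_frame(w, x, y, z, preferred_azimuth=0.0):
    """Rotate a frame so its dominant azimuth matches preferred_azimuth.

    Returns the rotated channels and the rotation angle alpha, which plays
    the role of the rotation parameter signalled alongside the coded frame.
    """
    alpha = preferred_azimuth - dominant_azimuth(w, x, y)
    x_rot, y_rot = rotate_yaw(x, y, alpha)
    return (w, x_rot, y_rot, z), alpha
```

A decoder that receives `alpha` can restore the original orientation with `rotate_yaw(x_rot, y_rot, -alpha)`. W and Z are unchanged by a rotation about the vertical axis, which is why the sketch passes them through.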