Patents by Inventor Sascha DICK

Sascha DICK has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multi-channel audio decoder, multi-channel audio encoder, methods and computer program using a residual-signal-based adjustment of a contribution of a decorrelated signal

Patent number: 12658193

Abstract: A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channel audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.

Type: Grant

Filed: August 25, 2020

Date of Patent: June 16, 2026

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V

Inventors: Sascha Dick, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer
Apparatus and Method for Stereo Filling in Multichannel Coding

Publication number: 20260080880

Abstract: An apparatus for decoding an encoded multichannel signal of a current frame to obtain three or more current audio output channels is provided. A multichannel processor is adapted to select two decoded channels from three or more decoded channels depending on first multichannel parameters. Moreover, the multichannel processor is adapted to generate a first group of two or more processed channels based on the selected channels. A noise filling module is adapted to identify for at least one of the selected channels, one or more frequency bands, within which all spectral lines are quantized to zero, and to generate a mixing channel using, depending on side information, a proper subset of three or more previous audio output channels that have been decoded, and to fill the spectral lines of frequency bands, within which all spectral lines are quantized to zero, with noise generated using spectral lines of the mixing channel.

Type: Application

Filed: July 23, 2025

Publication date: March 19, 2026

Inventors: Sascha DICK, Christian HELMRICH, Nikolaus RETTELBACH, Florian SCHUH, Richard FUEG, Frederik NAGEL
FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING

Publication number: 20260065920

Abstract: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility.

Type: Application

Filed: November 5, 2025

Publication date: March 5, 2026

Inventors: Sascha DICK, Christian HELMRICH, Andreas HOELZER
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

Publication number: 20260065921

Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.

Type: Application

Filed: July 2, 2025

Publication date: March 5, 2026

Inventors: Simone FUEG, Jan PLOGSTIES, Sascha DICK, Johannes HILPERT, Julien ROBILLIARD, Achim KUNTZ, Andreas HOELZER
AUDIO ENCODER, AUDIO DECODER, METHODS AND COMPUTER PROGRAM USING JOINTLY ENCODED RESIDUAL SIGNALS

Publication number: 20260024535

Abstract: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.

Type: Application

Filed: July 25, 2025

Publication date: January 22, 2026

Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
Frequency-domain audio coding supporting transform length switching

Patent number: 12488804

Abstract: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility.

Type: Grant

Filed: December 14, 2023

Date of Patent: December 2, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Dick, Christian Helmrich, Andreas Hoelzer
Apparatus and method for encoding or decoding a multi-channel signal

Patent number: 12462819

Abstract: An apparatus for encoding a multi-channel signal having at least three channels includes an iteration processor, a channel encoder and an output interface. The iteration processor is configured to calculate inter-channel correlation values between each pair of the at least three channels, for selecting a pair including a highest value or including a value above a threshold, and for processing the selected pair using a multi-channel processing operation to derive first multi-channel parameters for the selected pair and to derive first processed channels. The iteration processor is configured to perform the calculating, the selecting and the processing using at least one of the processed channels to derive second multi-channel parameters and second processed channels. The channel encoder is configured to encode channels resulting from an iteration processing to obtain encoded channels.

Type: Grant

Filed: March 29, 2024

Date of Patent: November 4, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Dick, Florian Schuh, Nikolaus Rettelbach, Tobias Schwegler, Richard Fueg, Johannes Hilpert, Matthias Neusinger
APPARATUS AND METHOD FOR PERCEPTION-BASED CLUSTERING OF OBJECT-BASED AUDIO SCENES

Publication number: 20250287169

Abstract: An apparatus according to an embodiment is provided The apparatus comprises an input interface for receiving information on three or more audio objects. Moreover, the apparatus comprises a cluster generator for generating two or more audio object clusters by associating each of the three or more audio objects with at least one of the two or more audio object clusters, such that, for each of the two or more audio object clusters, at least one of the three or more audio objects is associated to said audio object cluster, and such that, for each of at least one of the two or more audio object clusters, at least two of the three or more audio objects are associated with said audio object cluster. The cluster generator is configured to generate the two or more audio object clusters depending on a perception-based model.

Type: Application

Filed: March 28, 2025

Publication date: September 11, 2025

Applicant: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Sascha DICK, Jürgen HERRE
APPARATUS AND METHOD EMPLOYING A PERCEPTION-BASED DISTANCE METRIC FOR SPATIAL AUDIO

Publication number: 20250287170

Abstract: An apparatus according to an embodiment is provided. The apparatus comprises an input interface for receiving a plurality of audio objects of an audio sound scene. Moreover, the apparatus comprises a processor. Each of the plurality of audio objects represents a sound source being different from any other sound source being represented by any other audio object of the plurality of audio objects; or at least two of the plurality of audio objects represent a same sound source at different locations. The processor is configured to obtain information on a perceptual difference between two audio objects of the plurality of audio objects depending on a distance metric, wherein the distance metric represents perceptual differences in spatial properties of the audio sound scene. And/or, the processor is configured to process the plurality of audio objects to obtain a plurality of audio object clusters or a plurality of processed audio objects depending on the distance metric.

Type: Application

Filed: March 28, 2025

Publication date: September 11, 2025

Applicant: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Sascha DICK, Jürgen HERRE, Pablo Manuel DELGADO
Apparatus and method for stereo filling in multichannel coding

Patent number: 12387731

Abstract: An apparatus for decoding an encoded multichannel signal of a current frame to obtain three or more current audio output channels is provided. A multichannel processor is adapted to select two decoded channels from three or more decoded channels depending on first multichannel parameters. Moreover, the multichannel processor is adapted to generate a first group of two or more processed channels based on the selected channels. A noise filling module is adapted to identify for at least one of the selected channels, one or more frequency bands, within which all spectral lines are quantized to zero, and to generate a mixing channel using, depending on side information, a proper subset of three or more previous audio output channels that have been decoded, and to fill the spectral lines of frequency bands, within which all spectral lines are quantized to zero, with noise generated using spectral lines of the mixing channel.

Type: Grant

Filed: July 11, 2023

Date of Patent: August 12, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Dick, Christian Helmrich, Nikolaus Rettelbach, Florian Schuh, Richard Fueg, Frederik Nagel
Apparatus and method for screen related audio object remapping

Patent number: 12380903

Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.

Type: Grant

Filed: January 18, 2024

Date of Patent: August 5, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Simone Fueg, Jan Plogsties, Sascha Dick, Johannes Hilpert, Julien Robilliard, Achim Kuntz, Andreas Hoelzer
Audio encoder, audio decoder, methods and computer program using jointly encoded residual signals

Patent number: 12380899

Abstract: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.

Type: Grant

Filed: May 22, 2023

Date of Patent: August 5, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
Multisignal audio coding using signal whitening as processing

Patent number: 12367883

Abstract: A multisignal encoder for encoding at least three audio signals, including: a signal preprocessor for individually preprocessing each audio signal to obtain at least three preprocessed audio signals, wherein the preprocessing is performed so that a preprocessed audio signal is whitened with respect to the signal before preprocessing; an adaptive joint signal processor for performing a processing of the at least three preprocessed audio signals to obtain at least three jointly processed signals or at least two jointly processed signals and an unprocessed signal; a signal encoder for encoding each signal to obtain one or more encoded signals; and an output interface for transmitting or storing an encoded multisignal audio signal including the one or more encoded signals, side information relating to the preprocessing and side information relating to the processing.

Type: Grant

Filed: December 17, 2020

Date of Patent: July 22, 2025

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Eleni Fotopoulou, Markus Multrus, Sascha Dick, Goran Markovic, Pallavi Maben, Srikanth Korse, Stefan Bayer, Sascha Disch, Jürgen Herre
Directional loudness map based audio processing

Patent number: 12183360

Abstract: An audio analyzer configured to obtain spectral domain representations of two or more input audio signals. Additionally the audio analyzer is configured to obtain directional information associated with spectral bands of the spectral domain representations and to obtain loudness information associated with different directions as an analysis result. Contributions to the loudness information are determined in dependence on the directional information.

Type: Grant

Filed: April 26, 2021

Date of Patent: December 31, 2024

Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.

Inventors: Jürgen Herre, Pablo Manuel Delgado, Sascha Dick
APPARATUS AND METHOD FOR SCREEN RELATED AUDIO OBJECT REMAPPING

Publication number: 20240265930

Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.

Type: Application

Filed: January 18, 2024

Publication date: August 8, 2024

Inventors: Simone FUEG, Jan PLOGSTIES, Sascha DICK, Johannes HILPERT, Julien ROBILLIARD, Achim KUNTZ, Andreas HOELZER
Apparatus and Method for Encoding or Decoding a Multi-Channel Signal

Publication number: 20240249732

Abstract: An apparatus for encoding a multi-channel signal having at least three channels includes an iteration processor, a channel encoder and an output interface. The iteration processor is configured to calculate inter-channel correlation values between each pair of the at least three channels, for selecting a pair including a highest value or including a value above a threshold, and for processing the selected pair using a multi-channel processing operation to derive first multi-channel parameters for the selected pair and to derive first processed channels. The iteration processor is configured to perform the calculating, the selecting and the processing using at least one of the processed channels to derive second multi-channel parameters and second processed channels. The channel encoder is configured to encode channels resulting from an iteration processing to obtain encoded channels.

Type: Application

Filed: March 29, 2024

Publication date: July 25, 2024

Inventors: Sascha DICK, Florian SCHUH, Nikolaus RETTELBACH, Tobias SCHWEGLER, Richard FUEG, Johannes HILPERT, Matthias NEUSINGER
Concept for audio encoding and decoding for audio channels and audio objects

Patent number: 11984131

Abstract: Audio encoder for encoding audio input data to obtain audio output data includes an input interface for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel including audio data of a channel and audio data of at least one object; a core encoder for core encoding core encoder input data; and a metadata compressor for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes.

Type: Grant

Filed: December 13, 2021

Date of Patent: May 14, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Alexander Adami, Christian Borss, Sascha Dick, Christian Ertel, Simone Neukam, Juergen Herre, Johannes Hilpert, Andreas Hoelzer, Michael Kratschmer, Fabian Kuech, Achim Kuntz, Adrian Murtaza, Jan Plogsties, Andreas Silzle, Hanne Stenzel
FREQUENCY-DOMAIN AUDIO CODING SUPPORTING TRANSFORM LENGTH SWITCHING

Publication number: 20240127836

Abstract: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility.

Type: Application

Filed: December 14, 2023

Publication date: April 18, 2024

Inventors: Sascha DICK, Christian HELMRICH, Andreas HOELZER
Apparatus and method for encoding or decoding a multi-channel signal

Patent number: 11955131

Abstract: An apparatus for encoding a multi-channel signal having at least three channels includes an iteration processor, a channel encoder and an output interface. The iteration processor is configured to calculate inter-channel correlation values between each pair of the at least three channels, for selecting a pair including a highest value or including a value above a threshold, and for processing the selected pair using a multi-channel processing operation to derive first multi-channel parameters for the selected pair and to derive first processed channels. The iteration processor is configured to perform the calculating, the selecting and the processing using at least one of the processed channels to derive second multi-channel parameters and second processed channels. The channel encoder is configured to encode channels resulting from an iteration processing to obtain encoded channels.

Type: Grant

Filed: October 18, 2022

Date of Patent: April 9, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Dick, Florian Schuh, Nikolaus Rettelbach, Tobias Schwegler, Richard Fueg, Johannes Hilpert, Matthias Neusinger
Apparatus and method for screen related audio object remapping

Patent number: 11900955

Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.

Type: Grant

Filed: November 18, 2022

Date of Patent: February 13, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Simone Fueg, Jan Plogsties, Sascha Dick, Johannes Hilpert, Julien Robilliard, Achim Kuntz, Andreas Hoelzer

1 2 3 4 next