Patents by Inventor Sascha DICK

Sascha DICK has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11227616
    Abstract: Audio encoder for encoding audio input data to obtain audio output data includes an input interface for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel including audio data of a channel and audio data of at least one object; a core encoder for core encoding core encoder input data; and a metadata compressor for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes.
    Type: Grant
    Filed: February 15, 2019
    Date of Patent: January 18, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Alexander Adami, Christian Borss, Sascha Dick, Christian Ertel, Simone Neukam, Juergen Herre, Johannes Hilpert, Andreas Hoelzer, Michael Kratschmer, Fabian Kuech, Achim Kuntz, Adrian Murtaza, Jan Plogsties, Andreas Silzle, Hanne Stenzel
  • Publication number: 20210383820
    Abstract: An audio analyzer configured to obtain spectral domain representations of two or more input audio signals. Additionally the audio analyzer is configured to obtain directional information associated with spectral bands of the spectral domain representations and to obtain loudness information associated with different directions as an analysis result. Contributions to the loudness information are determined in dependence on the directional information.
    Type: Application
    Filed: April 26, 2021
    Publication date: December 9, 2021
    Inventors: Jürgen HERRE, Pablo Manuel DELGADO, Sascha DICK
  • Publication number: 20210233543
    Abstract: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.
    Type: Application
    Filed: August 11, 2020
    Publication date: July 29, 2021
    Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
  • Publication number: 20210233545
    Abstract: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility.
    Type: Application
    Filed: April 9, 2021
    Publication date: July 29, 2021
    Inventors: Sascha DICK, Christian HELMRICH, Andreas HOELZER
  • Patent number: 10984809
    Abstract: A frequency-domain audio codec is provided with the ability to additionally support a certain transform length in a backward-compatible manner, by the following: the frequency-domain coefficients of a respective frame are transmitted in an interleaved manner irrespective of the signalization signaling for the frames as to which transform length actually applies, and additionally the frequency-domain coefficient extraction and the scale factor extraction operate independent from the signalization. By this measure, old-fashioned frequency-domain audio coders/decoders, insensitive for the signalization, would be able to nevertheless operate without faults and with reproducing a reasonable quality. Concurrently, frequency-domain audio coders/decoders able to support the additional transform length would offer even better quality despite the backward compatibility.
    Type: Grant
    Filed: February 25, 2019
    Date of Patent: April 20, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Helmrich, Andreas Hoelzer
  • Publication number: 20210104249
    Abstract: A multisignal encoder for encoding at least three audio signals, including: a signal preprocessor for individually preprocessing each audio signal to obtain at least three preprocessed audio signals, wherein the preprocessing is performed so that a preprocessed audio signal is whitened with respect to the signal before preprocessing; an adaptive joint signal processor for performing a processing of the at least three preprocessed audio signals to obtain at least three jointly processed signals or at least two jointly processed signals and an unprocessed signal; a signal encoder for encoding each signal to obtain one or more encoded signals; and an output interface for transmitting or storing an encoded multisignal audio signal including the one or more encoded signals, side information relating to the preprocessing and side information relating to the processing.
    Type: Application
    Filed: December 17, 2020
    Publication date: April 8, 2021
    Inventors: Eleni FOTOPOULOU, Markus Multrus, Sascha Dick, Goran Markovic, Pallavi Maben, Srikanth Korse, Stefan Bayer, Sascha Disch, Jürgen Herre
  • Publication number: 20210065722
    Abstract: The invention refers to audio encoders, audio decoders, and audio encoding methods and audio decoding methods. In some examples, the invention refers to improved stereo coding. An encoder provides an encoded representation of an audio signal. The encoder applies a spectral whitening to a separate-channel representation of the input audio signal, to obtain a whitened separate-channel representation of the signal. The audio encoder applies a spectral whitening to a mid-side representation of the signal, to obtain a whitened mid-side representation of the signal. The audio encoder decides whether to encode the whitened separate-channel representation of the signal, to obtain the encoded representation of the signal, or to encode the whitened mid-side representation of the signal, to obtain the encoded representation of the signal.
    Type: Application
    Filed: August 28, 2020
    Publication date: March 4, 2021
    Inventors: Goran MARKOVIC, Sascha DICK, Eleni FOTOPOULOU, Stefan BAYER
  • Publication number: 20210065729
    Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
    Type: Application
    Filed: November 17, 2020
    Publication date: March 4, 2021
    Inventors: Simone FUEG, Jan PLOGSTIES, Sascha DICK, Johannes HILPERT, Julien ROBILLIARD, Achim KUNTZ, Andreas HOELZER
  • Publication number: 20210056979
    Abstract: An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation provides first and second downmix signals on the basis of a jointly encoded representation of the first and second downmix signals using a multi-channel decoding and provides at least first and second audio channel signals on the basis of the first downmix signal using a multi-channel decoding, and provides at least third and fourth audio channel signals on the basis of the second downmix signal using a multi-channel decoding. It performs a multi-channel bandwidth extension on the basis of the first and third audio channel signals, to obtain first and third bandwidth-extended channel signals, and performs a multi-channel bandwidth extension on the basis of the second and fourth audio channel signals, to obtain second and fourth bandwidth extended channel signals. An audio encoder uses a related concept.
    Type: Application
    Filed: September 3, 2020
    Publication date: February 25, 2021
    Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
  • Publication number: 20210012783
    Abstract: An apparatus for encoding a multi-channel signal having at least three channels includes an iteration processor, a channel encoder and an output interface. The iteration processor is configured to calculate inter-channel correlation values between each pair of the at least three channels, for selecting a pair including a highest value or including a value above a threshold, and for processing the selected pair using a multi-channel processing operation to derive first multi-channel parameters for the selected pair and to derive first processed channels. The iteration processor is configured to perform the calculating, the selecting and the processing using at least one of the processed channels to derive second multi-channel parameters and second processed channels. The channel encoder is configured to encode channels resulting from an iteration processing to obtain encoded channels.
    Type: Application
    Filed: August 17, 2020
    Publication date: January 14, 2021
    Inventors: Sascha DICK, Florian SCHUH, Nikolaus RETTELBACH, Tobias SCHWEGLER, Richard FUEG, Johannes HILPERT, Matthias NEUSINGER
  • Publication number: 20200388293
    Abstract: A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channel audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.
    Type: Application
    Filed: August 25, 2020
    Publication date: December 10, 2020
    Applicant: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer
  • Patent number: 10854213
    Abstract: An apparatus for generating loudspeaker signals includes an object metadata processor configured to receive metadata, to calculate a second position of the audio object depending on the first position of the audio object and on a size of a screen if the audio object is indicated in the metadata as being screen-related, to feed the first position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being not screen-related, and to feed the second position of the audio object as the position information into the object renderer if the audio object is indicated in the metadata as being screen-related. The apparatus further includes an object renderer configured to receive an audio object and to generate the loudspeaker signals depending on the audio object and on position information.
    Type: Grant
    Filed: December 28, 2018
    Date of Patent: December 1, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Simone Fueg, Jan Plogsties, Sascha Dick, Johannes Hilpert, Julien Robilliard, Achim Kuntz, Andreas Hoelzer
  • Patent number: 10839812
    Abstract: A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channel audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.
    Type: Grant
    Filed: January 22, 2016
    Date of Patent: November 17, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer
  • Publication number: 20200357418
    Abstract: An apparatus for decoding an encoded multichannel signal of a current frame to obtain three or more current audio output channels is provided. A multichannel processor is adapted to select two decoded channels from three or more decoded channels depending on first multichannel parameters. Moreover, the multichannel processor is adapted to generate a first group of two or more processed channels based on the selected channels. A noise filling module is adapted to identify for at least one of the selected channels, one or more frequency bands, within which all spectral lines are quantized to zero, and to generate a mixing channel using, depending on side information, a proper subset of three or more previous audio output channels that have been decoded, and to fill the spectral lines of frequency bands, within which all spectral lines are quantized to zero, with noise generated using spectral lines of the mixing channel.
    Type: Application
    Filed: July 1, 2020
    Publication date: November 12, 2020
    Inventors: Sascha DICK, Christian HELMRICH, Nikolaus RETTELBACH, Florian SCHUH, Richard FUEG, Frederik NAGEL
  • Publication number: 20200359149
    Abstract: An apparatus for converting an object position of an audio object from a Cartesian representation to a spherical representation is described. A basis area of the Cartesian representation is subdivided into a plurality of basis area triangles, and wherein a plurality of spherical-domain triangles are inscribed into a circle of a spherical representation. The apparatus is configured to determine, in which of the basis area triangles a projection of the object position of the audio object into the base area is arranged; and the apparatus is configured to determine a mapped position of the projection of the object position using a linear transform, which maps the base area triangle onto its associated spherical domain triangle. The apparatus is configured to derive an azimuth angle and an intermediate radius value from the mapped position.
    Type: Application
    Filed: July 30, 2020
    Publication date: November 12, 2020
    Inventors: Oliver WÜBBOLT, Achim KUNTZ, Christian ERTEL, Sascha DICK, Frederik NAGEL, Matthias NEUSINGER
  • Patent number: 10770080
    Abstract: An audio decoder for providing at least four bandwidth-extended channel signals on the basis of an encoded representation provides first and second downmix signals on the basis of a jointly encoded representation of the first and second downmix signals using a multi-channel decoding and provides at least first and second audio channel signals on the basis of the first downmix signal using a multi-channel decoding, and provides at least third and fourth audio channel signals on the basis of the second downmix signal using a multi-channel decoding. It performs a multi-channel bandwidth extension on the basis of the first and third audio channel signals, to obtain first and third bandwidth-extended channel signals, and performs a multi-channel bandwidth extension on the basis of the second and fourth audio channel signals, to obtain second and fourth bandwidth extended channel signals. An audio encoder uses a related concept.
    Type: Grant
    Filed: December 4, 2018
    Date of Patent: September 8, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
  • Patent number: 10762909
    Abstract: An apparatus for encoding a multi-channel signal having at least three channels includes an iteration processor, a channel encoder and an output interface. The iteration processor is configured to calculate inter-channel correlation values between each pair of the at least three channels, for selecting a pair including a highest value or including a value above a threshold, and for processing the selected pair using a multi-channel processing operation to derive first multi-channel parameters for the selected pair and to derive first processed channels. The iteration processor is configured to perform the calculating, the selecting and the processing using at least one of the processed channels to derive second multi-channel parameters and second processed channels. The channel encoder is configured to encode channels resulting from an iteration processing to obtain encoded channels.
    Type: Grant
    Filed: May 15, 2019
    Date of Patent: September 1, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Florian Schuh, Nikolaus Rettelbach, Tobias Schwegler, Richard Fueg, Johannes Hilpert, Matthias Neusinger
  • Patent number: 10755720
    Abstract: A multi-channel audio decoder for providing at least two output audio signals on the basis of an encoded representation is configured to perform a weighted combination of a downmix signal, a decorrelated signal and a residual signal, to obtain one of the output audio signals. The multi-channel audio decoder is configured to determine a weight describing a contribution of the decorrelated signal in the weighted combination in dependence on the residual signal. A multi-channel audio encoder for providing an encoded representation of a multi-channel audio signal is configured to obtain a downmix signal on the basis of the multi-channel audio signal, to provide parameters describing dependencies between the channels of the multi-channel audio signal, and to provide a residual signal. The multi-channel audio encoder is configured to vary an amount of residual signal included into the encoded representation in dependence on the multi-channel audio signal.
    Type: Grant
    Filed: October 16, 2017
    Date of Patent: August 25, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer
  • Patent number: 10741188
    Abstract: An audio decoder for providing at least four audio channel signals on the basis of an encoded representation is configured to provide a first residual signal and a second residual signal on the basis of a jointly encoded representation of the first residual signal and of the second residual signal using a multi-channel decoding. The audio decoder is configured to provide a first audio channel signal and a second audio channel signal on the basis of a first downmix signal and the first residual signal using a residual-signal-assisted multi-channel decoding. The audio decoder is configured to provide a third audio channel signal and a fourth audio channel signal on the basis of a second downmix signal and the second residual signal using a residual-signal-assisted multi-channel decoding. An audio encoder is based on corresponding considerations.
    Type: Grant
    Filed: April 9, 2018
    Date of Patent: August 11, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Ertel, Christian Helmrich, Johannes Hilpert, Andreas Hoelzer, Achim Kuntz
  • Patent number: 10733999
    Abstract: An apparatus for decoding an encoded multichannel signal of a current frame to obtain three or more current audio output channels is provided. A multichannel processor is adapted to select two decoded channels from three or more decoded channels depending on first multichannel parameters. Moreover, the multichannel processor is adapted to generate a first group of two or more processed channels based on the selected channels. A noise filling module is adapted to identify for at least one of the selected channels, one or more frequency bands, within which all spectral lines are quantized to zero, and to generate a mixing channel using, depending on side information, a proper subset of three or more previous audio output channels that have been decoded, and to fill the spectral lines of frequency bands, within which all spectral lines are quantized to zero, with noise generated using spectral lines of the mixing channel.
    Type: Grant
    Filed: August 17, 2018
    Date of Patent: August 4, 2020
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Dick, Christian Helmrich, Nikolaus Rettelbach, Florian Schuh, Richard Fueg, Frederik Nagel
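
Illustrative sketch (not taken from the patent text): the noise filling step described in the abstract of patent 10733999 can be pictured roughly as below. This is a minimal sketch under stated assumptions, not the claimed method. The function names, the band layout given as `band_edges`, the simple averaging used to build the mixing channel, and the single `gain` factor are all hypothetical choices made for illustration; the abstract only states that zero-quantized bands are filled with noise generated from the spectral lines of a mixing channel built from a proper subset of previously decoded output channels.

```python
from typing import List, Sequence, Tuple


def find_zero_bands(quantized: Sequence[int],
                    band_edges: Sequence[int]) -> List[Tuple[int, int]]:
    """Return (start, stop) index pairs of bands whose lines are all zero."""
    zero_bands = []
    for start, stop in zip(band_edges[:-1], band_edges[1:]):
        if all(q == 0 for q in quantized[start:stop]):
            zero_bands.append((start, stop))
    return zero_bands


def build_mixing_channel(previous_outputs: Sequence[Sequence[float]],
                         selected: Sequence[int]) -> List[float]:
    """Average the chosen subset of previous output channels, line by line.

    Assumption: plain averaging; the abstract does not say how the subset
    is combined.
    """
    chosen = [previous_outputs[i] for i in selected]
    return [sum(lines) / len(chosen) for lines in zip(*chosen)]


def noise_fill(quantized: Sequence[int],
               band_edges: Sequence[int],
               mixing_channel: Sequence[float],
               gain: float) -> List[float]:
    """Fill every all-zero band with scaled lines of the mixing channel.

    Assumption: one hypothetical gain for all bands.
    """
    filled = [float(q) for q in quantized]
    for start, stop in find_zero_bands(quantized, band_edges):
        for k in range(start, stop):
            filled[k] = gain * mixing_channel[k]
    return filled


if __name__ == "__main__":
    # Eight spectral lines split into four bands of two lines each.
    quantized = [3, 0, 0, 0, -1, 2, 0, 0]
    band_edges = [0, 2, 4, 6, 8]
    # Three previously decoded output channels; channels 0 and 1 form the
    # proper subset that the (hypothetical) side information selects.
    previous_outputs = [[1.0] * 8, [3.0] * 8, [100.0] * 8]
    mixing = build_mixing_channel(previous_outputs, selected=[0, 1])
    print(noise_fill(quantized, band_edges, mixing, gain=0.5))
```

In this sketch only the bands covering lines 2 to 3 and 6 to 7 are touched, because the first band still contains one nonzero line; bands with any surviving coefficient are left exactly as decoded.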