Patents by Inventor Frank Baumgarte

Frank Baumgarte has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20170032801
    Abstract: A system for producing an encoded digital audio recording has an audio encoder that encodes a digital audio recording having a number of audio channels or audio objects. An equalization (EQ) value generator produces a sequence of EQ values which define EQ filtering that is to be applied when decoding the encoded digital audio recording, wherein the EQ filtering is to be applied to a group of one or more of the audio channels or audio objects of the recording independent of any downmix. A bitstream multiplexer combines the encoded digital audio recording with the sequence of EQ values, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording.
    Type: Application
    Filed: March 3, 2016
    Publication date: February 2, 2017
    Inventor: Frank Baumgarte
  • Patent number: 9559651
    Abstract: An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.
    Type: Grant
    Filed: March 26, 2014
    Date of Patent: January 31, 2017
    Assignee: Apple Inc.
    Inventors: Frank Baumgarte, Eric A. Allamanche, Stefan K. O. Strommer
  • Publication number: 20140294200
    Abstract: An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.
    Type: Application
    Filed: March 26, 2014
    Publication date: October 2, 2014
    Inventors: Frank Baumgarte, Eric A. Allamanche, Stefan K. O. Strommer
  • Patent number: 8515768
    Abstract: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with extension data and decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal.
    Type: Grant
    Filed: August 31, 2009
    Date of Patent: August 20, 2013
    Assignee: Apple Inc.
    Inventors: Frank Baumgarte, William Stewart, Shyh-Shiaw Kuo
  • Patent number: 8224661
    Abstract: According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.
    Type: Grant
    Filed: September 25, 2011
    Date of Patent: July 17, 2012
    Assignee: Apple Inc.
    Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
  • Patent number: 8200500
    Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.
    Type: Grant
    Filed: March 14, 2011
    Date of Patent: June 12, 2012
    Assignee: Agere Systems Inc.
    Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
  • Publication number: 20120016679
    Abstract: According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.
    Type: Application
    Filed: September 25, 2011
    Publication date: January 19, 2012
    Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
  • Patent number: 8090122
    Abstract: According to one embodiment, during mixing of an N-channel input signal to generate an M-channel output signal, in at least one frequency sub-band, magnitude equalization is applied to the mixed channel signals such that an amplitude sum magnitude for the N input channels (e.g., the magnitude of a sum of estimated amplitudes of the N input channels) is approximately equal to an amplitude sum magnitude for the M output channels (e.g., the magnitude of a sum of estimated amplitudes of the M output channels). In one implementation, magnitude equalization is applied to one or more sub-bands (e.g., those below 1 kHz), and power equalization is applied to one or more other sub-bands (e.g., those above 1 kHz) to reduce coloration effects in the output signal.
    Type: Grant
    Filed: November 11, 2009
    Date of Patent: January 3, 2012
    Assignee: Agere Systems Inc.
    Inventor: Frank Baumgarte
  • Patent number: 8060375
    Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.
    Type: Grant
    Filed: January 12, 2011
    Date of Patent: November 15, 2011
    Assignee: Apple Inc.
    Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
  • Publication number: 20110164756
    Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “.1” indicates a single low-frequency effects (LFE) channel and “.2” indicates two LFE channels.
    Type: Application
    Filed: March 14, 2011
    Publication date: July 7, 2011
    Applicant: AGERE SYSTEMS INC.
    Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
  • Patent number: 7941320
    Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.
    Type: Grant
    Filed: August 27, 2009
    Date of Patent: May 10, 2011
    Assignee: Agere Systems Inc.
    Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
  • Publication number: 20110106544
    Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.
    Type: Application
    Filed: January 12, 2011
    Publication date: May 5, 2011
    Applicant: APPLE INC.
    Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
  • Publication number: 20110054911
    Abstract: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with extension data and decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal.
    Type: Application
    Filed: August 31, 2009
    Publication date: March 3, 2011
    Applicant: APPLE INC.
    Inventors: Frank Baumgarte, William Stewart, Shyh-Shiaw Kuo
  • Patent number: 7899677
    Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.
    Type: Grant
    Filed: November 24, 2009
    Date of Patent: March 1, 2011
    Assignee: Apple Inc.
    Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
  • Publication number: 20100111330
    Abstract: According to one embodiment, during mixing of an N-channel input signal to generate an M-channel output signal, in at least one frequency sub-band, magnitude equalization is applied to the mixed channel signals such that an amplitude sum magnitude for the N input channels (e.g., the magnitude of a sum of estimated amplitudes of the N input channels) is approximately equal to an amplitude sum magnitude for the M output channels (e.g., the magnitude of a sum of estimated amplitudes of the M output channels). In one implementation, magnitude equalization is applied to one or more sub-bands (e.g., those below 1 kHz), and power equalization is applied to one or more other sub-bands (e.g., those above 1 kHz) to reduce coloration effects in the output signal.
    Type: Application
    Filed: November 11, 2009
    Publication date: May 6, 2010
    Applicant: AGERE SYSTEMS INC.
    Inventor: Frank Baumgarte
  • Patent number: 7693721
    Abstract: Part of the spectrum of two or more input signals is encoded using conventional coding techniques, while encoding the rest of the spectrum using binaural cue coding (BCC). In BCC coding, spectral components of the input signals are downmixed and BCC parameters (e.g., inter-channel level and/or time differences) are generated. In a stereo implementation, after converting the left and right channels to the frequency domain, pairs of left- and right-channel spectral components are downmixed to mono. The mono components are then converted back to the time domain, along with those left- and right-channel spectral components that were not downmixed, to form hybrid stereo signals, which can then be encoded using conventional coding techniques. For playback, the encoded bitstream is decoded using conventional decoding techniques. BCC synthesis techniques may then apply the BCC parameters to synthesize an auditory scene based on the mono components as well as the unmixed stereo components.
    Type: Grant
    Filed: December 10, 2007
    Date of Patent: April 6, 2010
    Assignee: Agere Systems Inc.
    Inventors: Frank Baumgarte, Peter Kroon
  • Publication number: 20100070287
    Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds is also calculated for the 8 short blocks corresponding to the long block. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.
    Type: Application
    Filed: November 24, 2009
    Publication date: March 18, 2010
    Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
  • Patent number: 7644003
    Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.
    Type: Grant
    Filed: September 8, 2004
    Date of Patent: January 5, 2010
    Assignee: Agere Systems Inc.
    Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
  • Patent number: 7639823
    Abstract: According to one embodiment, during mixing of an N-channel input signal to generate an M-channel output signal, in at least one frequency sub-band, magnitude equalization is applied to the mixed channel signals such that an amplitude sum magnitude for the N input channels (e.g., the magnitude of a sum of estimated amplitudes of the N input channels) is approximately equal to an amplitude sum magnitude for the M output channels (e.g., the magnitude of a sum of estimated amplitudes of the M output channels). In one implementation, magnitude equalization is applied to one or more sub-bands (e.g., those below 1 kHz), and power equalization is applied to one or more other sub-bands (e.g., those above 1 kHz) to reduce coloration effects in the output signal.
    Type: Grant
    Filed: May 25, 2004
    Date of Patent: December 29, 2009
    Assignee: Agere Systems Inc.
    Inventor: Frank Baumgarte
  • Publication number: 20090319281
    Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.
    Type: Application
    Filed: August 27, 2009
    Publication date: December 24, 2009
    Applicant: AGERE SYSTEMS INC.
    Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
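
For readers who want a concrete picture of the dynamic range control (DRC) entries in the listing above (patent 9559651 and publication 20140294200), the following is a minimal Python sketch of the flow their abstract describes: a normalization gain is applied to the signal, and DRC gain values are then computed from the normalized signal according to one selected characteristic. It is an illustration under stated assumptions, not the patented method. The characteristic curves, the block-wise RMS level estimate, the block size, and every name in the code (DRC_CHARACTERISTICS, rms_db, drc_gain_metadata) are hypothetical. Audio encoding and bitstream packaging of the metadata are not shown.

```python
import math

# Hypothetical DRC characteristics: each maps a normalized input level (dB)
# to a gain (dB). These curves are illustrative and not taken from the patent.
DRC_CHARACTERISTICS = {
    "none": lambda level_db: 0.0,
    # Above -20 dB, reduce the level excess by half (a 2:1 style curve).
    "compress": lambda level_db: -0.5 * (level_db + 20.0) if level_db > -20.0 else 0.0,
}


def rms_db(block):
    """RMS level of a block of samples in dB relative to full scale (1.0)."""
    mean_square = sum(x * x for x in block) / len(block)
    # Floor the value to avoid log10(0) on digital silence.
    return 10.0 * math.log10(max(mean_square, 1e-12))


def drc_gain_metadata(samples, norm_gain_db, characteristic, block_size=4):
    """Apply a normalization gain, then compute one DRC gain (dB) per block.

    The returned list stands in for the metadata that would accompany the
    encoded audio signal.
    """
    scale = 10.0 ** (norm_gain_db / 20.0)
    normalized = [x * scale for x in samples]
    curve = DRC_CHARACTERISTICS[characteristic]
    gains = []
    for start in range(0, len(normalized), block_size):
        block = normalized[start:start + block_size]
        gains.append(curve(rms_db(block)))
    return gains


if __name__ == "__main__":
    # A loud block followed by a quiet block, with no normalization change.
    signal = [1.0] * 4 + [0.1] * 4
    print(drc_gain_metadata(signal, 0.0, "compress"))
```

In this sketch the gains are only computed, never applied to the signal. That matches the abstract's description of the gains being provided as metadata alongside the encoded audio.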