Patents by Inventor Frank Baumgarte
Frank Baumgarte has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20170032801Abstract: A system for producing an encoded digital audio recording has an audio encoder that encodes a digital audio recording having a number of audio channels or audio objects. An equalization (EQ) value generator produces a sequence of EQ values which define EQ filtering that is to be applied when decoding the encoded digital audio recording, wherein the EQ filtering is to be applied to a group of one or more of the audio channels or audio objects of the recording independent of any downmix. A bitstream multiplexer combines the encoded digital audio recording with the sequence of EQ values, the latter as metadata associated with the encoded digital audio recording. Other embodiments are also described including a system for decoding the encoded audio recording.Type: ApplicationFiled: March 3, 2016Publication date: February 2, 2017Inventor: Frank Baumgarte
-
Patent number: 9559651Abstract: An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.Type: GrantFiled: March 26, 2014Date of Patent: January 31, 2017Assignee: Apple Inc.Inventors: Frank Baumgarte, Eric A. Allamanche, Stefan K. O. Strommer
-
Publication number: 20140294200Abstract: An audio normalization gain value is applied to an audio signal to produce a normalized signal. The normalized signal is processed to compute dynamic range control (DRC) gain values in accordance with a selected one of several pre-defined DRC characteristics. The audio signal is encoded, and the DRC gain values are provided as metadata associated with the encoded audio signal. Several other embodiments are also described and claimed.Type: ApplicationFiled: March 26, 2014Publication date: October 2, 2014Inventors: Frank Baumgarte, Eric A. Allamanche, Stefan K. O. Strommer
-
Patent number: 8515768Abstract: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with extension data and decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal.Type: GrantFiled: August 31, 2009Date of Patent: August 20, 2013Assignee: Apple Inc.Inventors: Frank Baumgarte, William Stewart, Shyh-Shiaw Kuo
-
Patent number: 8224661Abstract: According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.Type: GrantFiled: September 25, 2011Date of Patent: July 17, 2012Assignee: Apple Inc.Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
-
Patent number: 8200500Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.Type: GrantFiled: March 14, 2011Date of Patent: June 12, 2012Assignee: Agere Systems Inc.Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
-
Publication number: 20120016679Abstract: According to one embodiment, an improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.Type: ApplicationFiled: September 25, 2011Publication date: January 19, 2012Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
-
Patent number: 8090122Abstract: According to one embodiment, during mixing of an N-channel input signal to generate an M-channel output signal, in at least one frequency sub-band, magnitude equalization is applied to the mixed channel signals such that an amplitude sum magnitude for the N input channels (e.g., the magnitude of a sum of estimated amplitudes of the N input channels) is approximately equal to an amplitude sum magnitude for the M output channels (e.g., the magnitude of a sum of estimated amplitudes of the M output channels). In one implementation, magnitude equalization is applied to one or more sub-bands (e.g., those below 1 kHz), and power equalization is applied to one or more other sub-bands (e.g., those above 1 kHz) to reduce coloration effects in the output signal.Type: GrantFiled: November 11, 2009Date of Patent: January 3, 2012Assignee: Agere Systems Inc.Inventor: Frank Baumgarte
-
Patent number: 8060375Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.Type: GrantFiled: January 12, 2011Date of Patent: November 15, 2011Assignee: Apple Inc.Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
-
Publication number: 20110164756Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “.1” indicates a single low-frequency effects (LFE) channel and “.2” indicates two LFE channels.Type: ApplicationFiled: March 14, 2011Publication date: July 7, 2011Applicant: AGERE SYSTEMS INC.Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
-
Patent number: 7941320Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.Type: GrantFiled: August 27, 2009Date of Patent: May 10, 2011Assignee: Agere Systems, Inc.Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
-
Publication number: 20110106544Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.Type: ApplicationFiled: January 12, 2011Publication date: May 5, 2011Applicant: APPLE INC.Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
-
Publication number: 20110054911Abstract: Methods, systems, and apparatus are presented for decoding an audio signal that includes bandwidth extension data. An audio signal that includes core audio data and bandwidth extension data can be received in a decoder. The core audio data can be associated with a core portion of an audio signal, such as the frequency range below a cutoff frequency, and the bandwidth extension data can be associated with an extended portion of the audio signal, such as a frequency range above the cutoff frequency. The core audio data can be decoded to generate a decoded core audio signal in a time domain representation. Further, an extended portion of the audio signal can be reconstructed in accordance with extension data and decoded core audio signal. Additionally, the decoded core audio signal can be lowpass filtered and the extended portion can be highpass filtered before being combined to generate a decoded output signal.Type: ApplicationFiled: August 31, 2009Publication date: March 3, 2011Applicant: APPLE INC.Inventors: Frank Baumgarte, William Stewart, Shyh-Shiaw Kuo
-
Patent number: 7899677Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.Type: GrantFiled: November 24, 2009Date of Patent: March 1, 2011Assignee: Apple Inc.Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
-
Publication number: 20100111330Abstract: According to one embodiment, during mixing of an N-channel input signal to generate an M-channel output signal, in at least one frequency sub-band, magnitude equalization is applied to the mixed channel signals such that an amplitude sum magnitude for the N input channels (e.g., the magnitude of a sum of estimated amplitudes of the N input channels) is approximately equal to an amplitude sum magnitude for the M output channels (e.g., the magnitude of a sum of estimated amplitudes of the M output channels). In one implementation, magnitude equalization is applied to one or more sub-bands (e.g., those below 1 kHz), and power equalization is applied to one or more other sub-bands (e.g., those above 1 kHz) to reduce coloration effects in the output signal.Type: ApplicationFiled: November 11, 2009Publication date: May 6, 2010Applicant: AGERE SYSTEMS INC.Inventor: Frank Baumgarte
-
Patent number: 7693721Abstract: Part of the spectrum of two or more input signals is encoded using conventional coding techniques, while encoding the rest of the spectrum using binaural cue coding (BCC). In BCC coding, spectral components of the input signals are downmixed and BCC parameters (e.g., inter-channel level and/or time differences) are generated. In a stereo implementation, after converting the left and right channels to the frequency domain, pairs of left- and right-channel spectral components are downmixed to mono. The mono components are then converted back to the time domain, along with those left- and right-channel spectral components that were not downmixed, to form hybrid stereo signals, which can then be encoded using conventional coding techniques. For playback, the encoded bitstream is decoded using conventional decoding techniques. BCC synthesis techniques may then apply the BCC parameters to synthesize an auditory scene based on the mono components as well as the unmixed stereo components.Type: GrantFiled: December 10, 2007Date of Patent: April 6, 2010Assignee: Agere Systems Inc.Inventors: Frank Baumgarte, Peter Kroon
-
Publication number: 20100070287Abstract: An improved audio coding technique encodes audio having a low frequency transient signal, using a long block, but with a set of adapted masking thresholds. Upon identifying an audio window that contains a low frequency transient signal, masking thresholds for the long block may be calculated as usual. A set of masking thresholds calculated for the 8 short blocks corresponding to the long block are calculated. The masking thresholds for low frequency critical bands are adapted based on the thresholds calculated for the short blocks, and the resulting adapted masking thresholds are used to encode the long block of audio data. The result is encoded audio with rich harmonic content and negligible coder noise resulting from the low frequency transient signal.Type: ApplicationFiled: November 24, 2009Publication date: March 18, 2010Inventors: Shyh-Shiaw Kuo, Frank Baumgarte
-
Patent number: 7644003Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.Type: GrantFiled: September 8, 2004Date of Patent: January 5, 2010Assignee: Agere Systems Inc.Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller
-
Patent number: 7639823Abstract: According to one embodiment, during mixing of an N-channel input signal to generate an M-channel output signal, in at least one frequency sub-band, magnitude equalization is applied to the mixed channel signals such that an amplitude sum magnitude for the N input channels (e.g., the magnitude of a sum of estimated amplitudes of the N input channels) is approximately equal to an amplitude sum magnitude for the M output channels (e.g., the magnitude of a sum of estimated amplitudes of the M output channels). In one implementation, magnitude equalization is applied to one or more sub-bands (e.g., those below 1 kHz), and power equalization is applied to one or more other sub-bands (e.g., those above 1 kHz) to reduce coloration effects in the output signal.Type: GrantFiled: May 25, 2004Date of Patent: December 29, 2009Assignee: Agere Systems Inc.Inventor: Frank Baumgarte
-
Publication number: 20090319281Abstract: Generic and specific C-to-E binaural cue coding (BCC) schemes are described, including those in which one or more of the input channels are transmitted as unmodified channels that are not downmixed at the BCC encoder and not upmixed at the BCC decoder. The specific BCC schemes described include 5-to-2, 6-to-5, 7-to-5, 6.1-to-5.1, 7.1-to-5.1, and 6.2-to-5.1, where “0.1” indicates a single low-frequency effects (LFE) channel and “0.2” indicates two LFE channels.Type: ApplicationFiled: August 27, 2009Publication date: December 24, 2009Applicant: AGERE SYSTEMS INC.Inventors: Frank Baumgarte, Jiashu Chen, Christof Faller