Patents by Inventor Tobias Herbig

Tobias Herbig has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11950067
    Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing it back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
    Type: Grant
    Filed: February 6, 2023
    Date of Patent: April 2, 2024
    Assignee: Cerence Operating Company
    Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
  • Publication number: 20240062770
    Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
    Type: Application
    Filed: November 3, 2023
    Publication date: February 22, 2024
    Inventors: Tobias Herbig, Stefan Richardt
  • Patent number: 11817115
    Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
    Type: Grant
    Filed: September 1, 2016
    Date of Patent: November 14, 2023
    Assignee: Cerence Operating Company
    Inventors: Tobias Herbig, Stefan Richardt
  • Patent number: 11798576
    Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.
    Type: Grant
    Filed: November 1, 2019
    Date of Patent: October 24, 2023
    Assignee: Cerence Operating Company
    Inventors: Tobias Herbig, Meik Pfeffinger, Bernd Iser
  • Publication number: 20230209260
    Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing it back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
    Type: Application
    Filed: February 6, 2023
    Publication date: June 29, 2023
    Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
  • Publication number: 20230178077
    Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
    Type: Application
    Filed: January 30, 2023
    Publication date: June 8, 2023
    Applicant: CERENCE OPERATING COMPANY
    Inventors: Meik PFEFFINGER, Timo MATHEJA, Tobias HERBIG, Tim HAULICK
  • Patent number: 11600269
    Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
    Type: Grant
    Filed: June 15, 2016
    Date of Patent: March 7, 2023
    Assignee: Cerence Operating Company
    Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
  • Patent number: 11575990
    Abstract: An communication system supports communication paths within an environment by receiving speech signals of a speaker and playing it back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.
    Type: Grant
    Filed: May 1, 2017
    Date of Patent: February 7, 2023
    Assignee: Cerence Operating Company
    Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
  • Patent number: 11176957
    Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.
    Type: Grant
    Filed: August 17, 2017
    Date of Patent: November 16, 2021
    Assignee: Cerence Operating Company
    Inventors: Simon Graf, Tobias Herbig, Markus Buck
  • Publication number: 20210134311
    Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.
    Type: Application
    Filed: August 17, 2017
    Publication date: May 6, 2021
    Inventors: Simon Graf, Tobias Herbig, Markus Buck
  • Patent number: 10783899
    Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user's speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
    Type: Grant
    Filed: November 18, 2016
    Date of Patent: September 22, 2020
    Assignee: Cerence Operating Company
    Inventors: Simon Graf, Tobias Herbig, Markus Buck
  • Publication number: 20200176012
    Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.
    Type: Application
    Filed: November 1, 2019
    Publication date: June 4, 2020
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Tobias HERBIG, Meik PFEFFINGER, Bernd ISER
  • Publication number: 20190311715
    Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
    Type: Application
    Filed: June 15, 2016
    Publication date: October 10, 2019
    Applicant: Nuance Communications, Inc.
    Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
  • Publication number: 20190156855
    Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
    Type: Application
    Filed: September 1, 2016
    Publication date: May 23, 2019
    Inventors: Tobias Herbig, Stefan Richardt
  • Publication number: 20190139567
    Abstract: Speech processing methods may rely on voice activity detection (VAD) that separates speech from noise. Example embodiments of a computationally low complex VAD feature that is robust against various types of noise is introduced. By considering an alternating excitation structure of low and high frequencies, speech is detected with a high confidence. The computationally low complex VAD feature can cope even with the limited spectral resolution that may be typical for a communication system, such as an in-car-communication (ICC) system. Simulation results confirm the robustness of the computationally low complex VAD feature and show an increase in performance relative to established VAD features.
    Type: Application
    Filed: February 17, 2017
    Publication date: May 9, 2019
    Inventors: Simon Graf, Tobias Herbig, Markus Buck
  • Patent number: 10229686
    Abstract: Methods and apparatus to process microphone signals by a speech enhancement module to generate an audio stream signal including first and second metadata for use by a speech recognition module. In an embodiment, speech recognition is performed using endpointing information including transitioning from a silence state to a maybe speech state, in which data is buffered, based on the first metadata and transitioning to a speech state, in which speech recognition is performed, based upon the second metadata.
    Type: Grant
    Filed: August 18, 2014
    Date of Patent: March 12, 2019
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Markus Buck, Tobias Herbig, Simon Graf, Christophe Ris
  • Publication number: 20190013036
    Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user's speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
    Type: Application
    Filed: November 18, 2016
    Publication date: January 10, 2019
    Inventors: Simon Graf, Tobias Herbig, Markus Buck
  • Patent number: 9767826
    Abstract: Method and apparatus to determine a speaker activity detection measure from energy-based characteristics of signals from a plurality of speaker-dedicated microphones, detect acoustic events using power spectra for the microphone signals, and determine a robust speaker activity detection measure from the speaker activity measure and the detected acoustic events.
    Type: Grant
    Filed: September 27, 2013
    Date of Patent: September 19, 2017
    Assignee: NUANCE COMMUNICATIONS, INC.
    Inventors: Timo Matheja, Tobias Herbig, Markus Buck
  • Publication number: 20170251304
    Abstract: An communication system supports communication paths within an environment by receiving speech signals of a speaker and playing it back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.
    Type: Application
    Filed: May 1, 2017
    Publication date: August 31, 2017
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
  • Publication number: 20170213556
    Abstract: Methods and apparatus to process microphone signals by a speech enhancement module to generate an audio stream signal including first and second metadata for use by a speech recognition module. In an embodiment, speech recognition is performed using endpointing information including transitioning from a silence state to a maybe speech state, in which data is buffered, based on the first metadata and transitioning to a speech state, in which speech recognition is performed, based upon the second metadata.
    Type: Application
    Filed: August 18, 2014
    Publication date: July 27, 2017
    Applicant: NUANCE COMMUNICATIONS, INC.
    Inventors: Markus BUCK, Tobias HERBIG, Simon GRAF, Christophe RIS