Patents by Inventor Tobias Herbig

Tobias Herbig has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Communication system for multiple acoustic zones

Patent number: 11950067

Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.

Type: Grant

Filed: February 6, 2023

Date of Patent: April 2, 2024

Assignee: Cerence Operating Company

Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
ENHANCED DE-ESSER FOR IN-CAR COMMUNICATIONS SYSTEMS

Publication number: 20240062770

Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.

Type: Application

Filed: November 3, 2023

Publication date: February 22, 2024

Inventors: Tobias Herbig, Stefan Richardt
Enhanced de-esser for in-car communication systems

Patent number: 11817115

Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.

Type: Grant

Filed: September 1, 2016

Date of Patent: November 14, 2023

Assignee: Cerence Operating Company

Inventors: Tobias Herbig, Stefan Richardt
Methods and apparatus for adaptive gain control in a communication system

Patent number: 11798576

Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.

Type: Grant

Filed: November 1, 2019

Date of Patent: October 24, 2023

Assignee: Cerence Operating Company

Inventors: Tobias Herbig, Meik Pfeffinger, Bernd Iser
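
The gain control loop described in the abstract above can be illustrated with a minimal sketch. It is not the patented method: the function name `update_gain`, the dB-domain levels, the fixed target step, and the first-order smoothing are all assumptions made for illustration. The sketch estimates an SNR from speech and noise levels, nudges a target gain whenever the SNR leaves the selected range, and lets the actual gain follow the target.

```python
def update_gain(speech_db, noise_db, gain_db, target_db,
                snr_range=(10.0, 20.0), step_db=0.5, smooth=0.1):
    """One control step; all levels in dB. Returns (gain_db, target_db).

    Hypothetical helper: parameter names and default values are
    illustrative assumptions, not taken from the patent.
    """
    snr_min, snr_max = snr_range
    # SNR of the amplified speech relative to the noise estimate.
    snr_db = (speech_db + gain_db) - noise_db
    if snr_db < snr_min:
        target_db += step_db   # speech too weak against noise: raise target
    elif snr_db > snr_max:
        target_db -= step_db   # more gain than needed: lower target
    # The actual gain follows the target gain with first-order smoothing.
    gain_db += smooth * (target_db - gain_db)
    return gain_db, target_db


# Usage: run the loop on constant level estimates.
gain, target = 0.0, 0.0
for _ in range(500):
    gain, target = update_gain(60.0, 55.0, gain, target)
print(round(60.0 + gain - 55.0, 2))  # resulting SNR in dB
```

Separating the target gain from the actual gain keeps the applied gain smooth even when the level estimates fluctuate from frame to frame.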
Communication System For Multiple Acoustic Zones

Publication number: 20230209260

Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.

Type: Application

Filed: February 6, 2023

Publication date: June 29, 2023

Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
TECHNIQUES FOR WAKE-UP WORD RECOGNITION AND RELATED SYSTEMS AND METHODS

Publication number: 20230178077

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Application

Filed: January 30, 2023

Publication date: June 8, 2023

Applicant: CERENCE OPERATING COMPANY

Inventors: Meik PFEFFINGER, Timo MATHEJA, Tobias HERBIG, Tim HAULICK
Techniques for wake-up word recognition and related systems and methods

Patent number: 11600269

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Grant

Filed: June 15, 2016

Date of Patent: March 7, 2023

Assignee: Cerence Operating Company

Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
Communication system for multiple acoustic zones

Patent number: 11575990

Abstract: A communication system supports communication paths within an environment by receiving speech signals of a speaker and playing them back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.

Type: Grant

Filed: May 1, 2017

Date of Patent: February 7, 2023

Assignee: Cerence Operating Company

Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
Low complexity detection of voiced speech and pitch estimation

Patent number: 11176957

Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.

Type: Grant

Filed: August 17, 2017

Date of Patent: November 16, 2021

Assignee: Cerence Operating Company

Inventors: Simon Graf, Tobias Herbig, Markus Buck
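
The idea in the abstract above, reading a pitch period off the phase difference between the spectra of two very short frames, can be sketched under strong simplifying assumptions. Each frame here contains exactly one ideal impulse, only DFT bin 1 is evaluated, and the helper names `dft_bin` and `pitch_period` are hypothetical. This illustrates the underlying phase relationship only, not the patented algorithm, which evaluates low-resolution spectra and checks the phase differences for linearity.

```python
import cmath
import math


def dft_bin(frame, k):
    """Single DFT coefficient X[k] of a short frame (direct summation)."""
    n = len(frame)
    return sum(x * cmath.exp(-2j * math.pi * k * i / n)
               for i, x in enumerate(frame))


def pitch_period(frame_a, frame_b, hop):
    """Estimate the spacing in samples between the impulse in frame_a and
    the impulse in frame_b, where frame_b starts `hop` samples later.

    Assumes equal-length frames with one impulse each, and an impulse
    offset difference smaller than half the frame length.
    """
    n = len(frame_a)
    # An impulse at offset p has phase -2*pi*k*p/n in bin k, so the phase
    # difference in bin 1 encodes the difference of the two offsets.
    cross = dft_bin(frame_b, 1) * dft_bin(frame_a, 1).conjugate()
    shift = -cmath.phase(cross) * n / (2 * math.pi)
    return hop + shift


# Usage: impulses at offsets 3 and 6 in frames that start 16 samples apart.
a = [0.0] * 16
b = [0.0] * 16
a[3] = 1.0
b[6] = 1.0
print(pitch_period(a, b, hop=16))
```

The estimate stays in the frequency domain throughout, so no inverse transform is needed to obtain the spacing.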
LOW COMPLEXITY DETECTION OF VOICED SPEECH AND PITCH ESTIMATION

Publication number: 20210134311

Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.

Type: Application

Filed: August 17, 2017

Publication date: May 6, 2021

Inventors: Simon Graf, Tobias Herbig, Markus Buck
Babble noise suppression

Patent number: 10783899

Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.

Type: Grant

Filed: November 18, 2016

Date of Patent: September 22, 2020

Assignee: Cerence Operating Company

Inventors: Simon Graf, Tobias Herbig, Markus Buck
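
As a rough illustration of the control step in the abstract above, the sketch below maps a soft speech likelihood to an amount of attenuation. The linear mapping, the function name `suppression_gain`, and the attenuation limits are assumptions chosen for the example and are not taken from the patent.

```python
def suppression_gain(speech_likelihood, max_atten_db=20.0, min_atten_db=6.0):
    """Map a soft speech-detection result in [0, 1] to a linear gain.

    Hypothetical mapping: a low likelihood of foreground speech gives
    strong attenuation (babble is suppressed); a high likelihood gives
    mild attenuation (foreground speech is preserved).
    """
    p = min(1.0, max(0.0, speech_likelihood))  # clamp to [0, 1]
    atten_db = max_atten_db - p * (max_atten_db - min_atten_db)
    return 10.0 ** (-atten_db / 20.0)          # dB attenuation to linear gain


# Usage: the gain rises as foreground speech becomes more likely.
for likelihood in (0.0, 0.5, 1.0):
    print(likelihood, round(suppression_gain(likelihood), 3))
```

Because the detector output is soft rather than binary, the attenuation changes gradually instead of switching abruptly between two settings.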
METHODS AND APPARATUS FOR ADAPTIVE GAIN CONTROL IN A COMMUNICATION SYSTEM

Publication number: 20200176012

Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.

Type: Application

Filed: November 1, 2019

Publication date: June 4, 2020

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Tobias HERBIG, Meik PFEFFINGER, Bernd ISER
TECHNIQUES FOR WAKE-UP WORD RECOGNITION AND RELATED SYSTEMS AND METHODS

Publication number: 20190311715

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Application

Filed: June 15, 2016

Publication date: October 10, 2019

Applicant: Nuance Communications, Inc.

Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
Enhanced De-Esser For In-Car Communication Systems

Publication number: 20190156855

Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.

Type: Application

Filed: September 1, 2016

Publication date: May 23, 2019

Inventors: Tobias Herbig, Stefan Richardt
Voice Activity Detection Feature Based on Modulation-Phase Differences

Publication number: 20190139567

Abstract: Speech processing methods may rely on voice activity detection (VAD) that separates speech from noise. Example embodiments of a computationally low complex VAD feature that is robust against various types of noise are introduced. By considering an alternating excitation structure of low and high frequencies, speech is detected with a high confidence. The computationally low complex VAD feature can cope even with the limited spectral resolution that may be typical for a communication system, such as an in-car-communication (ICC) system. Simulation results confirm the robustness of the computationally low complex VAD feature and show an increase in performance relative to established VAD features.

Type: Application

Filed: February 17, 2017

Publication date: May 9, 2019

Inventors: Simon Graf, Tobias Herbig, Markus Buck
Methods and apparatus for speech segmentation using multiple metadata

Patent number: 10229686

Abstract: Methods and apparatus to process microphone signals by a speech enhancement module to generate an audio stream signal including first and second metadata for use by a speech recognition module. In an embodiment, speech recognition is performed using endpointing information including transitioning from a silence state to a maybe speech state, in which data is buffered, based on the first metadata and transitioning to a speech state, in which speech recognition is performed, based upon the second metadata.

Type: Grant

Filed: August 18, 2014

Date of Patent: March 12, 2019

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Markus Buck, Tobias Herbig, Simon Graf, Christophe Ris
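
The endpointing states named in the abstract above (silence, maybe speech, speech) can be sketched as a small state machine. The abstract does not say how the two metadata streams are encoded or when the system returns to silence, so the boolean flags, the fallback transitions, and the names `State` and `segment` are assumptions made for illustration.

```python
from enum import Enum


class State(Enum):
    SILENCE = "silence"
    MAYBE_SPEECH = "maybe speech"
    SPEECH = "speech"


def segment(frames):
    """Return the audio frames that would be passed to recognition.

    `frames` is an iterable of (audio, first_flag, second_flag) tuples,
    where the two flags stand in for the first and second metadata.
    """
    state = State.SILENCE
    buffered, recognized = [], []
    for audio, first, second in frames:
        if state is State.SILENCE:
            if first:                      # first metadata: start buffering
                state = State.MAYBE_SPEECH
                buffered = [audio]
        elif state is State.MAYBE_SPEECH:
            if second:                     # second metadata: confirm speech
                state = State.SPEECH
                recognized.extend(buffered + [audio])
                buffered = []
            elif first:
                buffered.append(audio)     # still unconfirmed, keep buffering
            else:
                state = State.SILENCE      # assumed fallback: drop the buffer
                buffered = []
        else:                              # State.SPEECH
            if first or second:
                recognized.append(audio)
            else:
                state = State.SILENCE      # assumed end of utterance
    return recognized


# Usage: frame "b" is buffered, then released once "c" confirms speech.
stream = [("a", False, False), ("b", True, False), ("c", True, True),
          ("d", True, False), ("e", False, False)]
print(segment(stream))
```

Buffering in the intermediate state means the onset of an utterance is not lost while the second metadata is still pending.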
Babble Noise Suppression

Publication number: 20190013036

Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.

Type: Application

Filed: November 18, 2016

Publication date: January 10, 2019

Inventors: Simon Graf, Tobias Herbig, Markus Buck
Methods and apparatus for robust speaker activity detection

Patent number: 9767826

Abstract: Method and apparatus to determine a speaker activity detection measure from energy-based characteristics of signals from a plurality of speaker-dedicated microphones, detect acoustic events using power spectra for the microphone signals, and determine a robust speaker activity detection measure from the speaker activity measure and the detected acoustic events.

Type: Grant

Filed: September 27, 2013

Date of Patent: September 19, 2017

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Timo Matheja, Tobias Herbig, Markus Buck
Communication System For Multiple Acoustic Zones

Publication number: 20170251304

Abstract: A communication system supports communication paths within an environment by receiving speech signals of a speaker and playing them back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.

Type: Application

Filed: May 1, 2017

Publication date: August 31, 2017

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
Methods And Apparatus For Speech Segmentation Using Multiple Metadata

Publication number: 20170213556

Abstract: Methods and apparatus to process microphone signals by a speech enhancement module to generate an audio stream signal including first and second metadata for use by a speech recognition module. In an embodiment, speech recognition is performed using endpointing information including transitioning from a silence state to a maybe speech state, in which data is buffered, based on the first metadata and transitioning to a speech state, in which speech recognition is performed, based upon the second metadata.

Type: Application

Filed: August 18, 2014

Publication date: July 27, 2017

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Markus BUCK, Tobias HERBIG, Simon GRAF, Christophe RIS
