Patents by Inventor Tobias Herbig
Tobias Herbig has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11950067
Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
Type: Grant
Filed: February 6, 2023
Date of Patent: April 2, 2024
Assignee: Cerence Operating Company
Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
-
Publication number: 20240062770
Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
Type: Application
Filed: November 3, 2023
Publication date: February 22, 2024
Inventors: Tobias Herbig, Stefan Richardt
-
Patent number: 11817115
Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
Type: Grant
Filed: September 1, 2016
Date of Patent: November 14, 2023
Assignee: Cerence Operating Company
Inventors: Tobias Herbig, Stefan Richardt
-
Patent number: 11798576
Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.
Type: Grant
Filed: November 1, 2019
Date of Patent: October 24, 2023
Assignee: Cerence Operating Company
Inventors: Tobias Herbig, Meik Pfeffinger, Bernd Iser
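Illustration (not part of the patent record): the gain logic summarized in this abstract can be pictured with a minimal sketch. The function names, the dB step size, the smoothing rate, and the use of linear power estimates are all assumptions made for this example; the abstract does not specify any of them.

```python
import math


def snr_db(speech_level: float, noise_level: float) -> float:
    """SNR in dB from linear power estimates (assumed representation)."""
    return 10.0 * math.log10(speech_level / noise_level)


def update_target_gain(target_gain_db: float, snr: float,
                       snr_min_db: float, snr_max_db: float,
                       step_db: float = 0.5) -> float:
    """Nudge the target gain so the SNR moves toward the selected range.

    The fixed step size is a hypothetical choice for this sketch.
    """
    if snr < snr_min_db:
        return target_gain_db + step_db   # too quiet relative to noise
    if snr > snr_max_db:
        return target_gain_db - step_db   # louder than necessary
    return target_gain_db                 # already inside the range


def adapt_gain(actual_gain_db: float, target_gain_db: float,
               rate: float = 0.1) -> float:
    """Let the actual gain follow the target gain with first-order smoothing."""
    return actual_gain_db + rate * (target_gain_db - actual_gain_db)


# One update per frame: estimate the SNR, adjust the target, follow it.
snr = snr_db(speech_level=100.0, noise_level=10.0)
target = update_target_gain(0.0, snr, snr_min_db=12.0, snr_max_db=20.0)
actual = adapt_gain(0.0, target)
```

Separating the target from the actual gain keeps the applied gain from jumping when the SNR estimate fluctuates from frame to frame.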
-
Publication number: 20230209260
Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
Type: Application
Filed: February 6, 2023
Publication date: June 29, 2023
Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
-
Publication number: 20230178077
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
Type: Application
Filed: January 30, 2023
Publication date: June 8, 2023
Applicant: CERENCE OPERATING COMPANY
Inventors: Meik PFEFFINGER, Timo MATHEJA, Tobias HERBIG, Tim HAULICK
-
Patent number: 11600269
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
Type: Grant
Filed: June 15, 2016
Date of Patent: March 7, 2023
Assignee: Cerence Operating Company
Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
-
Patent number: 11575990
Abstract: A communication system supports communication paths within an environment by receiving speech signals of a speaker and playing them back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.
Type: Grant
Filed: May 1, 2017
Date of Patent: February 7, 2023
Assignee: Cerence Operating Company
Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
-
Patent number: 11176957
Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.
Type: Grant
Filed: August 17, 2017
Date of Patent: November 16, 2021
Assignee: Cerence Operating Company
Inventors: Simon Graf, Tobias Herbig, Markus Buck
-
Publication number: 20210134311
Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.
Type: Application
Filed: August 17, 2017
Publication date: May 6, 2021
Inventors: Simon Graf, Tobias Herbig, Markus Buck
-
Patent number: 10783899
Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
Type: Grant
Filed: November 18, 2016
Date of Patent: September 22, 2020
Assignee: Cerence Operating Company
Inventors: Simon Graf, Tobias Herbig, Markus Buck
-
Publication number: 20200176012
Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.
Type: Application
Filed: November 1, 2019
Publication date: June 4, 2020
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Tobias HERBIG, Meik PFEFFINGER, Bernd ISER
-
Publication number: 20190311715
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
Type: Application
Filed: June 15, 2016
Publication date: October 10, 2019
Applicant: Nuance Communications, Inc.
Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
-
Publication number: 20190156855
Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
Type: Application
Filed: September 1, 2016
Publication date: May 23, 2019
Inventors: Tobias Herbig, Stefan Richardt
-
Publication number: 20190139567
Abstract: Speech processing methods may rely on voice activity detection (VAD) that separates speech from noise. Example embodiments of a computationally low complex VAD feature that is robust against various types of noise are introduced. By considering an alternating excitation structure of low and high frequencies, speech is detected with a high confidence. The computationally low complex VAD feature can cope even with the limited spectral resolution that may be typical for a communication system, such as an in-car-communication (ICC) system. Simulation results confirm the robustness of the computationally low complex VAD feature and show an increase in performance relative to established VAD features.
Type: Application
Filed: February 17, 2017
Publication date: May 9, 2019
Inventors: Simon Graf, Tobias Herbig, Markus Buck
-
Patent number: 10229686
Abstract: Methods and apparatus to process microphone signals by a speech enhancement module to generate an audio stream signal including first and second metadata for use by a speech recognition module. In an embodiment, speech recognition is performed using endpointing information including transitioning from a silence state to a maybe speech state, in which data is buffered, based on the first metadata and transitioning to a speech state, in which speech recognition is performed, based upon the second metadata.
Type: Grant
Filed: August 18, 2014
Date of Patent: March 12, 2019
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Markus Buck, Tobias Herbig, Simon Graf, Christophe Ris
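Illustration (not part of the patent record): the three endpointing states named in this abstract can be sketched as a small state machine. The class and attribute names, the boolean form of the two metadata flags, and the fallback from "maybe speech" to silence are assumptions for this example; the abstract does not describe how the metadata is encoded or how an utterance ends.

```python
from enum import Enum


class State(Enum):
    SILENCE = "silence"
    MAYBE_SPEECH = "maybe_speech"
    SPEECH = "speech"


class Endpointer:
    """Hypothetical endpointer driven by two per-frame metadata flags."""

    def __init__(self) -> None:
        self.state = State.SILENCE
        self.buffer = []          # frames held while speech is only suspected
        self.to_recognizer = []   # frames handed to speech recognition

    def push(self, frame, first_meta: bool, second_meta: bool) -> State:
        if self.state is State.SILENCE:
            if first_meta:
                # First metadata: speech may be starting, begin buffering.
                self.state = State.MAYBE_SPEECH
                self.buffer.append(frame)
        elif self.state is State.MAYBE_SPEECH:
            self.buffer.append(frame)
            if second_meta:
                # Second metadata: speech confirmed, flush buffered frames.
                self.state = State.SPEECH
                self.to_recognizer.extend(self.buffer)
                self.buffer.clear()
            elif not first_meta:
                # Assumption: a false alarm drops the buffer.
                self.state = State.SILENCE
                self.buffer.clear()
        else:
            # SPEECH: frames go straight to recognition. End-of-utterance
            # handling is outside the scope of this sketch.
            self.to_recognizer.append(frame)
        return self.state


ep = Endpointer()
for frame, m1, m2 in [("f0", False, False), ("f1", True, False),
                      ("f2", True, True), ("f3", False, False)]:
    ep.push(frame, m1, m2)
```

Buffering in the intermediate state means the onset of an utterance is not lost if confirmation arrives a few frames late.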
-
Publication number: 20190013036
Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
Type: Application
Filed: November 18, 2016
Publication date: January 10, 2019
Inventors: Simon Graf, Tobias Herbig, Markus Buck
-
Patent number: 9767826
Abstract: Method and apparatus to determine a speaker activity detection measure from energy-based characteristics of signals from a plurality of speaker-dedicated microphones, detect acoustic events using power spectra for the microphone signals, and determine a robust speaker activity detection measure from the speaker activity measure and the detected acoustic events.
Type: Grant
Filed: September 27, 2013
Date of Patent: September 19, 2017
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Timo Matheja, Tobias Herbig, Markus Buck
-
Publication number: 20170251304
Abstract: A communication system supports communication paths within an environment by receiving speech signals of a speaker and playing them back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.
Type: Application
Filed: May 1, 2017
Publication date: August 31, 2017
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Tobias Herbig, Markus Buck, Meik Pfeffinger
-
Publication number: 20170213556
Abstract: Methods and apparatus to process microphone signals by a speech enhancement module to generate an audio stream signal including first and second metadata for use by a speech recognition module. In an embodiment, speech recognition is performed using endpointing information including transitioning from a silence state to a maybe speech state, in which data is buffered, based on the first metadata and transitioning to a speech state, in which speech recognition is performed, based upon the second metadata.
Type: Application
Filed: August 18, 2014
Publication date: July 27, 2017
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Markus BUCK, Tobias HERBIG, Simon GRAF, Christophe RIS