Patents by Inventor Timo Matheja

Timo Matheja has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

TECHNIQUES FOR WAKE-UP WORK RECOGNITION AND RELATED SYSTEMS AND METHODS

Publication number: 20230178077

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Application

Filed: January 30, 2023

Publication date: June 8, 2023

Applicant: CERENCE OPERATING COMPANY

Inventors: Meik PFEFFINGER, Timo MATHEJA, Tobias HERBIG, Tim HAULICK
Techniques for wake-up word recognition and related systems and methods

Patent number: 11600269

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Grant

Filed: June 15, 2016

Date of Patent: March 7, 2023

Assignee: Cerence Operating Company

Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
Multi-microphone speech dialog system for multiple spatial zones

Patent number: 11367437

Abstract: There is provided a speech dialog system that includes a first microphone, a second microphone, a processor and a memory. The first microphone captures first audio from a first spatial zone, and produces a first audio signal. The second microphone captures second audio from a second spatial zone, and produces a second audio signal. The processor receives the first audio signal and the second audio signal, and the memory contains instructions that control the processor to perform operations of a speech enhancement module, an automatic speech recognition module, and a speech dialog module that performs a zone-dedicated speech dialog.

Type: Grant

Filed: May 30, 2019

Date of Patent: June 21, 2022

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Timo Matheja, Markus Buck, Andreas Kirbach, Martin Roessler, Tim Haulick, Julien Premont, Josef Anastasiadis, Rudi Vuerinckx, Christophe Ris, Stijn Verschaeren, Hakan Ari, Dieter Ranz
Multi-channel microphone signal gain equalization based on evaluation of cross talk components

Patent number: 10917717

Abstract: Gain mismatch and related problems can be solved by a system and method that applies an automatic microphone signal gain equalization without any direct absolute reference or calibration phase. The system and method performs the steps of receiving, by a computing device, a speech signal from a speaking person via a plurality of microphones, determining a speech signal component in the time-frequency domain for each microphone of the plurality of microphones, calculating an instantaneous cross-talk coupling matrix based on the speech signal components across the microphones, estimating gain factors based on calculated cross-talk couplings and a given expected cross-talk attenuation, limiting the gain factors to appropriate maximum and minimum values, and applying the gain factors to the speech signal used in the control path to control further speech enhancement algorithms or used in the signal path for direct influence on the speech enhanced audio output signal.

Type: Grant

Filed: May 30, 2019

Date of Patent: February 9, 2021

Assignee: Nuance Communications, Inc.

Inventors: Timo Matheja, Markus Buck
MULTI-MICROPHONE SPEECH DIALOG SYSTEM FOR MULTIPLE SPATIAL ZONES

Publication number: 20200380967

Abstract: There is provided a speech dialog system that includes a first microphone, a second microphone, a processor and a memory. The first microphone captures first audio from a first spatial zone, and produces a first audio signal. The second microphone captures second audio from a second spatial zone, and produces a second audio signal. The processor receives the first audio signal and the second audio signal, and the memory contains instructions that control the processor to perform operations of a speech enhancement module, an automatic speech recognition module, and a speech dialog module that performs a zone-dedicated speech dialog.

Type: Application

Filed: May 30, 2019

Publication date: December 3, 2020

Inventors: Timo MATHEJA, Markus BUCK, Andreas KIRBACH, Martin ROESSLER, Tim HAULICK, Julien PREMONT, Josef ANASTASIADIS, Rudi VUERINCKX, Christophe RIS, Stijn VERSCHAEREN, Hakan ARI, Dieter RANZ
MULTI-CHANNEL MICROPHONE SIGNAL GAIN EQUALIZATION BASED ON EVALUATION OF CROSS TALK COMPONENTS

Publication number: 20200382863

Abstract: Gain mismatch and related problems can be solved by a system and method that applies an automatic microphone signal gain equalization without any direct absolute reference or calibration phase. The system and method performs the steps of receiving, by a computing device, a speech signal from a speaking person via a plurality of microphones, determining a speech signal component in the time-frequency domain for each microphone of the plurality of microphones, calculating an instantaneous cross-talk coupling matrix based on the speech signal components across the microphones, estimating gain factors based on calculated cross-talk couplings and a given expected cross-talk attenuation, limiting the gain factors to appropriate maximum and minimum values, and applying the gain factors to the speech signal used in the control path to control further speech enhancement algorithms or used in the signal path for direct influence on the speech enhanced audio output signal.

Type: Application

Filed: May 30, 2019

Publication date: December 3, 2020

Inventors: Timo MATHEJA, Markus BUCK
SYSTEM AND METHOD FOR ACOUSTIC LOCALIZATION OF MULTIPLE SOURCES USING SPATIAL PRE-FILTERING

Publication number: 20200184994

Abstract: A method, computer program product, and computer system for identifying, by a computing device, a plurality of sources, wherein a first source of the plurality of sources is a source of interest and wherein a second source of the plurality of sources is an interference source. The first source and the second source may be monitored simultaneously by implementing a spatial pre-filter for acoustic source localization.

Type: Application

Filed: December 7, 2018

Publication date: June 11, 2020

Inventors: Tobias Wolff, Simon Graf, Timo Matheja
Methods and apparatus for selective microphone signal combining

Patent number: 10536773

Abstract: Methods and apparatus for frequency selective signal mixing for speech enhancement. In one embodiment frequency-based channel selection is performed for signal magnitude, signal energy, and noise estimate using speaker activity detection information, signal-to-noise ratio, and/or signal level, Frequency-based channel selection is performed for a dynamic spectral floor to adjust the noise estimate using speaker dominance information. Noise reduction is performed on the signal for the selected channel.

Type: Grant

Filed: October 30, 2013

Date of Patent: January 14, 2020

Assignee: Cerence Operating Company

Inventors: Timo Matheja, Markus Buck, Julien Premont
TECHNIQUES FOR WAKE-UP WORD RECOGNITION AND RELATED SYSTEMS AND METHODS

Publication number: 20190311715

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Application

Filed: June 15, 2016

Publication date: October 10, 2019

Applicant: Nuance Communications, Inc.

Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
System and method for temporal and power based zone detection in speaker dependent microphone environments

Patent number: 10332545

Abstract: A method, computer program product, and computer system for receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones. A temporal cue based confidence may be determined for at least a portion of the plurality of microphone zones. A power cue based confidence may be determined for at least a portion of the plurality of microphone zones. A microphone zone of the plurality of microphone zones from which to use an output signal of the speaker may be identified based upon, at least in part, a combination of the temporal cue based confidence and the power cue based confidence.

Type: Grant

Filed: November 28, 2017

Date of Patent: June 25, 2019

Assignee: Nuance Communications, Inc.

Inventors: Timo Matheja, Markus Buck, Simon Graf
SYSTEM AND METHOD FOR TEMPORAL AND POWER BASED ZONE DETECTION IN SPEAKER DEPENDENT MICROPHONE ENVIRONMENTS

Publication number: 20190164568

Abstract: A method, computer program product, and computer system for receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones. A temporal cue based confidence may be determined for at least a portion of the plurality of microphone zones. A power cue based confidence may be determined for at least a portion of the plurality of microphone zones. A microphone zone of the plurality of microphone zones from which to use an output signal of the speaker may be identified based upon, at least in part, a combination of the temporal cue based confidence and the power cue based confidence.

Type: Application

Filed: November 28, 2017

Publication date: May 30, 2019

Inventors: Timo Matheja, Markus Buck, Simon Graf
System and method for speech enhancement using a coherent to diffuse sound ratio

Patent number: 10242690

Abstract: Embodiments of the present disclosure may include a system and method for speech enhancement using the coherent to diffuse sound ratio. Embodiments may include receiving an audio signal at one or more microphones and controlling one or more adaptive filters of a beamformer using a coherent to diffuse ratio (“CDR”).

Type: Grant

Filed: December 12, 2014

Date of Patent: March 26, 2019

Assignee: Nuance Communications, Inc.

Inventors: Tobias Wolff, Timo Matheja, Markus Buck
SYSTEM AND METHOD FOR IDENTIFYING SUBOPTIMAL MICROPHONE PERFORMANCE

Publication number: 20180262831

Abstract: Embodiments disclosed herein may include determining a signal parameter of a first microphone and a second microphone associated with a computing device. Embodiments may include generating a reference parameter based upon at least one of the parameter of the first microphone and the parameter of the second microphone. Embodiments may include adjusting a tolerance of at least one of the first microphone and the second microphone, based upon the reference parameter. Embodiments may include receiving, at the first microphone, a first speech signal, the first speech signal having a first speech signal magnitude and receiving, at the second microphone, a second speech signal, the second speech signal having a second speech signal magnitude. Embodiments may include comparing at least one of the first speech signal magnitude and the second speech signal magnitude with a third speech signal magnitude and detecting an obstructed microphone based upon the comparison.

Type: Application

Filed: February 6, 2018

Publication date: September 13, 2018

Inventors: Timo Matheja, Markus Buck
Combined voice recognition, hands-free telephony and in-car communication

Patent number: 9978389

Abstract: A multi-mode speech communication system is described that has different operating modes for different speech applications. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes microphone input signals to produce a set user input signals for each speech application that are limited to currently active system users for that speech application. The output processing module processes application output communications from the speech applications to produce loudspeaker output signals to the system users, wherein for each different speech application, the loudspeaker output signals are directed only to system users currently active in that speech application.

Type: Grant

Filed: February 27, 2017

Date of Patent: May 22, 2018

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Markus Buck, Tim Haulick, Timo Matheja
System and method for identifying suboptimal microphone performance

Patent number: 9888316

Abstract: Embodiments disclosed herein may include determining a signal parameter of a first microphone and a second microphone associated with a computing device. Embodiments may include generating a reference parameter based upon at least one of the parameter of the first microphone and the parameter of the second microphone. Embodiments may include adjusting a tolerance of at least one of the first microphone and the second microphone, based upon the reference parameter. Embodiments may include receiving, at the first microphone, a first speech signal, the first speech signal having a first speech signal magnitude and receiving, at the second microphone, a second speech signal, the second speech signal having a second speech signal magnitude. Embodiments may include comparing at least one of the first speech signal magnitude and the second speech signal magnitude with a third speech signal magnitude and detecting an obstructed microphone based upon the comparison.

Type: Grant

Filed: March 21, 2013

Date of Patent: February 6, 2018

Assignee: Nuance Communications, Inc.

Inventors: Timo Matheja, Markus Buck
SYSTEM AND METHOD FOR SPEECH ENHANCEMENT USING A COHERENT TO DIFFUSE SOUND RATIO

Publication number: 20170330580

Abstract: Embodiments of the present disclosure may include a system and method for speech enhancement using the coherent to diffuse sound ratio.

Type: Application

Filed: December 12, 2014

Publication date: November 16, 2017

Inventors: Tobias Wolff, Timo Matheja, Markus Buck
Methods and apparatus for robust speaker activity detection

Patent number: 9767826

Abstract: Method and apparatus to determine a speaker activity detection measure from energy-based characteristics of signals from a plurality of speaker-dedicated microphones, detect acoustic events using power spectra for the microphone signals, and determine a robust speaker activity detection measure from the speaker activity measure and the detected acoustic events.

Type: Grant

Filed: September 27, 2013

Date of Patent: September 19, 2017

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Timo Matheja, Tobias Herbig, Markus Buck
COMBINED VOICE RECOGNITION, HANDS-FREE TELEPHONY AND IN-CAR COMMUNICATION

Publication number: 20170169836

Abstract: A multi-mode speech communication system is described that has different operating modes for different speech applications. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes microphone input signals to produce a set user input signals for each speech application that are limited to currently active system users for that speech application. The output processing module processes application output communications from the speech applications to produce loudspeaker output signals to the system users, wherein for each different speech application, the loudspeaker output signals are directed only to system users currently active in that speech application.

Type: Application

Filed: February 27, 2017

Publication date: June 15, 2017

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Markus Buck, Tim Haulick, Timo Matheja
Speech communication system for combined voice recognition, hands-free telephony and in-car communication

Patent number: 9620146

Abstract: A multi-mode speech communication system is described that has different operating modes for different speech applications. A speech service compartment contains multiple system users, multiple input microphones that develop microphone input signals from the system users to the system, and multiple output loudspeakers that develop loudspeaker output signals from the system to the system users. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes the microphone input signals to produce a set user input signals for each speech application that are limited to currently active system users for that speech application.

Type: Grant

Filed: May 16, 2012

Date of Patent: April 11, 2017

Assignee: NUANCE COMMUNICATIONS, INC.

Inventors: Markus Buck, Tim Haulick, Timo Matheja
Methods And Apparatus For Selective Microphone Signal Combining

Publication number: 20160261951

Abstract: Methods and apparatus for frequency selective signal mixing for speech enhancement. In one embodiment frequency-based channel selection is performed for signal magnitude, signal energy, and noise estimate using speaker activity detection information, signal-to-noise ratio, and/or signal level, Frequency-based channel selection is performed for a dynamic spectral floor to adjust the noise estimate using speaker dominance information. Noise reduction is performed on the signal for the selected channel.

Type: Application

Filed: October 30, 2013

Publication date: September 8, 2016

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Timo Matheja, Markus Buck, Julien Premont

1 2 next