Patents by Inventor Timo Matheja
Timo Matheja has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230178077
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
Type: Application
Filed: January 30, 2023
Publication date: June 8, 2023
Applicant: CERENCE OPERATING COMPANY
Inventors: Meik PFEFFINGER, Timo MATHEJA, Tobias HERBIG, Tim HAULICK
-
Patent number: 11600269
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
Type: Grant
Filed: June 15, 2016
Date of Patent: March 7, 2023
Assignee: Cerence Operating Company
Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
-
Patent number: 11367437
Abstract: There is provided a speech dialog system that includes a first microphone, a second microphone, a processor and a memory. The first microphone captures first audio from a first spatial zone, and produces a first audio signal. The second microphone captures second audio from a second spatial zone, and produces a second audio signal. The processor receives the first audio signal and the second audio signal, and the memory contains instructions that control the processor to perform operations of a speech enhancement module, an automatic speech recognition module, and a speech dialog module that performs a zone-dedicated speech dialog.
Type: Grant
Filed: May 30, 2019
Date of Patent: June 21, 2022
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Timo Matheja, Markus Buck, Andreas Kirbach, Martin Roessler, Tim Haulick, Julien Premont, Josef Anastasiadis, Rudi Vuerinckx, Christophe Ris, Stijn Verschaeren, Hakan Ari, Dieter Ranz
-
Patent number: 10917717
Abstract: Gain mismatch and related problems can be solved by a system and method that applies an automatic microphone signal gain equalization without any direct absolute reference or calibration phase. The system and method performs the steps of receiving, by a computing device, a speech signal from a speaking person via a plurality of microphones, determining a speech signal component in the time-frequency domain for each microphone of the plurality of microphones, calculating an instantaneous cross-talk coupling matrix based on the speech signal components across the microphones, estimating gain factors based on calculated cross-talk couplings and a given expected cross-talk attenuation, limiting the gain factors to appropriate maximum and minimum values, and applying the gain factors to the speech signal used in the control path to control further speech enhancement algorithms or used in the signal path for direct influence on the speech enhanced audio output signal.
Type: Grant
Filed: May 30, 2019
Date of Patent: February 9, 2021
Assignee: Nuance Communications, Inc.
Inventors: Timo Matheja, Markus Buck
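The steps named in the abstract of patent 10917717 (coupling matrix, gain estimation against an expected attenuation, limiting) can be illustrated with a minimal sketch for a single time-frequency bin. This is not the patented method: the coupling definition (power ratio between microphones), the use of the loudest microphone as the active talker, the square-root gain rule, and all names (`coupling_matrix`, `estimate_gains`, `g_min`, `g_max`) are assumptions made only for illustration.

```python
import math

EPS = 1e-12  # guards against division by zero


def coupling_matrix(speech_power):
    """Assumed definition: coupling[i][j] is the speech power seen at
    microphone j relative to microphone i, for one time-frequency bin."""
    n = len(speech_power)
    return [
        [speech_power[j] / max(speech_power[i], EPS) for j in range(n)]
        for i in range(n)
    ]


def estimate_gains(speech_power, expected_attenuation, g_min=0.5, g_max=2.0):
    """Estimate one amplitude gain per microphone (hypothetical rule).

    The microphone with the highest speech power is treated as the one
    closest to the talker. Every other microphone gets the gain that would
    make its observed power coupling equal `expected_attenuation` (a linear
    power ratio); the result is then limited to [g_min, g_max].
    """
    n = len(speech_power)
    coupling = coupling_matrix(speech_power)
    dominant = max(range(n), key=lambda i: speech_power[i])
    gains = []
    for j in range(n):
        if j == dominant:
            gain = 1.0  # the dominant microphone is left unchanged
        else:
            observed = max(coupling[dominant][j], EPS)
            gain = math.sqrt(expected_attenuation / observed)
        gains.append(min(max(gain, g_min), g_max))  # limiting step
    return gains


if __name__ == "__main__":
    # Three microphones; the talker sits at microphone 0.
    print(estimate_gains([1.0, 0.05, 0.4], expected_attenuation=0.1))
```

In a real system the estimate would presumably be smoothed over time and frequency before being applied; the sketch omits that and only shows how the limiting step bounds an otherwise unbounded ratio.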
-
Publication number: 20200380967
Abstract: There is provided a speech dialog system that includes a first microphone, a second microphone, a processor and a memory. The first microphone captures first audio from a first spatial zone, and produces a first audio signal. The second microphone captures second audio from a second spatial zone, and produces a second audio signal. The processor receives the first audio signal and the second audio signal, and the memory contains instructions that control the processor to perform operations of a speech enhancement module, an automatic speech recognition module, and a speech dialog module that performs a zone-dedicated speech dialog.
Type: Application
Filed: May 30, 2019
Publication date: December 3, 2020
Inventors: Timo MATHEJA, Markus BUCK, Andreas KIRBACH, Martin ROESSLER, Tim HAULICK, Julien PREMONT, Josef ANASTASIADIS, Rudi VUERINCKX, Christophe RIS, Stijn VERSCHAEREN, Hakan ARI, Dieter RANZ
-
Publication number: 20200382863
Abstract: Gain mismatch and related problems can be solved by a system and method that applies an automatic microphone signal gain equalization without any direct absolute reference or calibration phase. The system and method performs the steps of receiving, by a computing device, a speech signal from a speaking person via a plurality of microphones, determining a speech signal component in the time-frequency domain for each microphone of the plurality of microphones, calculating an instantaneous cross-talk coupling matrix based on the speech signal components across the microphones, estimating gain factors based on calculated cross-talk couplings and a given expected cross-talk attenuation, limiting the gain factors to appropriate maximum and minimum values, and applying the gain factors to the speech signal used in the control path to control further speech enhancement algorithms or used in the signal path for direct influence on the speech enhanced audio output signal.
Type: Application
Filed: May 30, 2019
Publication date: December 3, 2020
Inventors: Timo MATHEJA, Markus BUCK
-
Publication number: 20200184994
Abstract: A method, computer program product, and computer system for identifying, by a computing device, a plurality of sources, wherein a first source of the plurality of sources is a source of interest and wherein a second source of the plurality of sources is an interference source. The first source and the second source may be monitored simultaneously by implementing a spatial pre-filter for acoustic source localization.
Type: Application
Filed: December 7, 2018
Publication date: June 11, 2020
Inventors: Tobias Wolff, Simon Graf, Timo Matheja
-
Patent number: 10536773
Abstract: Methods and apparatus for frequency selective signal mixing for speech enhancement. In one embodiment, frequency-based channel selection is performed for signal magnitude, signal energy, and noise estimate using speaker activity detection information, signal-to-noise ratio, and/or signal level. Frequency-based channel selection is performed for a dynamic spectral floor to adjust the noise estimate using speaker dominance information. Noise reduction is performed on the signal for the selected channel.
Type: Grant
Filed: October 30, 2013
Date of Patent: January 14, 2020
Assignee: Cerence Operating Company
Inventors: Timo Matheja, Markus Buck, Julien Premont
-
Publication number: 20190311715
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
Type: Application
Filed: June 15, 2016
Publication date: October 10, 2019
Applicant: Nuance Communications, Inc.
Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
-
Patent number: 10332545
Abstract: A method, computer program product, and computer system for receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones. A temporal cue based confidence may be determined for at least a portion of the plurality of microphone zones. A power cue based confidence may be determined for at least a portion of the plurality of microphone zones. A microphone zone of the plurality of microphone zones from which to use an output signal of the speaker may be identified based upon, at least in part, a combination of the temporal cue based confidence and the power cue based confidence.
Type: Grant
Filed: November 28, 2017
Date of Patent: June 25, 2019
Assignee: Nuance Communications, Inc.
Inventors: Timo Matheja, Markus Buck, Simon Graf
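The zone selection described in the abstract of patent 10332545 combines two per-zone confidences. The sketch below shows one way such a combination could look. It is an illustration under stated assumptions, not the patented method: the temporal cue is assumed to be the speech onset time per zone (earlier means closer), the power cue is each zone's share of total power, and the two are merged by a weighted sum. The function names, `scale_ms`, and `weight` are hypothetical.

```python
def temporal_confidence(onsets_ms, scale_ms=1.0):
    """Assumed temporal cue: the zone where speech arrives first scores 1.0,
    and zones with later onsets score progressively lower."""
    first = min(onsets_ms)
    return [1.0 / (1.0 + (t - first) / scale_ms) for t in onsets_ms]


def power_confidence(powers):
    """Assumed power cue: each zone's share of the total signal power."""
    total = sum(powers)
    return [p / total if total > 0 else 0.0 for p in powers]


def select_zone(onsets_ms, powers, weight=0.5):
    """Return (index of the selected zone, combined confidences).

    `weight` balances the temporal cue against the power cue; the weighted
    sum is an illustrative choice of combination rule.
    """
    temporal = temporal_confidence(onsets_ms)
    power = power_confidence(powers)
    combined = [weight * t + (1.0 - weight) * p for t, p in zip(temporal, power)]
    best = max(range(len(combined)), key=combined.__getitem__)
    return best, combined


if __name__ == "__main__":
    zone, scores = select_zone(onsets_ms=[0.0, 0.5, 1.0], powers=[4.0, 1.0, 1.0])
    print(zone, scores)
```

Using two cues rather than one makes the choice less sensitive to a single unreliable measurement, for example when microphone gains differ between zones and power alone would mislead.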
-
Publication number: 20190164568
Abstract: A method, computer program product, and computer system for receiving, by a computing device, a speech signal from a speaker via a plurality of microphone zones. A temporal cue based confidence may be determined for at least a portion of the plurality of microphone zones. A power cue based confidence may be determined for at least a portion of the plurality of microphone zones. A microphone zone of the plurality of microphone zones from which to use an output signal of the speaker may be identified based upon, at least in part, a combination of the temporal cue based confidence and the power cue based confidence.
Type: Application
Filed: November 28, 2017
Publication date: May 30, 2019
Inventors: Timo Matheja, Markus Buck, Simon Graf
-
Patent number: 10242690
Abstract: Embodiments of the present disclosure may include a system and method for speech enhancement using the coherent to diffuse sound ratio. Embodiments may include receiving an audio signal at one or more microphones and controlling one or more adaptive filters of a beamformer using a coherent to diffuse ratio (“CDR”).
Type: Grant
Filed: December 12, 2014
Date of Patent: March 26, 2019
Assignee: Nuance Communications, Inc.
Inventors: Tobias Wolff, Timo Matheja, Markus Buck
-
Publication number: 20180262831
Abstract: Embodiments disclosed herein may include determining a signal parameter of a first microphone and a second microphone associated with a computing device. Embodiments may include generating a reference parameter based upon at least one of the parameter of the first microphone and the parameter of the second microphone. Embodiments may include adjusting a tolerance of at least one of the first microphone and the second microphone, based upon the reference parameter. Embodiments may include receiving, at the first microphone, a first speech signal, the first speech signal having a first speech signal magnitude and receiving, at the second microphone, a second speech signal, the second speech signal having a second speech signal magnitude. Embodiments may include comparing at least one of the first speech signal magnitude and the second speech signal magnitude with a third speech signal magnitude and detecting an obstructed microphone based upon the comparison.
Type: Application
Filed: February 6, 2018
Publication date: September 13, 2018
Inventors: Timo Matheja, Markus Buck
-
Patent number: 9978389
Abstract: A multi-mode speech communication system is described that has different operating modes for different speech applications. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes microphone input signals to produce a set of user input signals for each speech application that are limited to currently active system users for that speech application. The output processing module processes application output communications from the speech applications to produce loudspeaker output signals to the system users, wherein for each different speech application, the loudspeaker output signals are directed only to system users currently active in that speech application.
Type: Grant
Filed: February 27, 2017
Date of Patent: May 22, 2018
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Markus Buck, Tim Haulick, Timo Matheja
-
Patent number: 9888316
Abstract: Embodiments disclosed herein may include determining a signal parameter of a first microphone and a second microphone associated with a computing device. Embodiments may include generating a reference parameter based upon at least one of the parameter of the first microphone and the parameter of the second microphone. Embodiments may include adjusting a tolerance of at least one of the first microphone and the second microphone, based upon the reference parameter. Embodiments may include receiving, at the first microphone, a first speech signal, the first speech signal having a first speech signal magnitude and receiving, at the second microphone, a second speech signal, the second speech signal having a second speech signal magnitude. Embodiments may include comparing at least one of the first speech signal magnitude and the second speech signal magnitude with a third speech signal magnitude and detecting an obstructed microphone based upon the comparison.
Type: Grant
Filed: March 21, 2013
Date of Patent: February 6, 2018
Assignee: Nuance Communications, Inc.
Inventors: Timo Matheja, Markus Buck
-
Publication number: 20170330580
Abstract: Embodiments of the present disclosure may include a system and method for speech enhancement using the coherent to diffuse sound ratio.
Type: Application
Filed: December 12, 2014
Publication date: November 16, 2017
Inventors: Tobias Wolff, Timo Matheja, Markus Buck
-
Patent number: 9767826
Abstract: Method and apparatus to determine a speaker activity detection measure from energy-based characteristics of signals from a plurality of speaker-dedicated microphones, detect acoustic events using power spectra for the microphone signals, and determine a robust speaker activity detection measure from the speaker activity measure and the detected acoustic events.
Type: Grant
Filed: September 27, 2013
Date of Patent: September 19, 2017
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Timo Matheja, Tobias Herbig, Markus Buck
-
Publication number: 20170169836
Abstract: A multi-mode speech communication system is described that has different operating modes for different speech applications. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes microphone input signals to produce a set of user input signals for each speech application that are limited to currently active system users for that speech application. The output processing module processes application output communications from the speech applications to produce loudspeaker output signals to the system users, wherein for each different speech application, the loudspeaker output signals are directed only to system users currently active in that speech application.
Type: Application
Filed: February 27, 2017
Publication date: June 15, 2017
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Markus Buck, Tim Haulick, Timo Matheja
-
Patent number: 9620146
Abstract: A multi-mode speech communication system is described that has different operating modes for different speech applications. A speech service compartment contains multiple system users, multiple input microphones that develop microphone input signals from the system users to the system, and multiple output loudspeakers that develop loudspeaker output signals from the system to the system users. A signal processing module is in communication with the speech applications and includes an input processing module and an output processing module. The input processing module processes the microphone input signals to produce a set of user input signals for each speech application that are limited to currently active system users for that speech application.
Type: Grant
Filed: May 16, 2012
Date of Patent: April 11, 2017
Assignee: NUANCE COMMUNICATIONS, INC.
Inventors: Markus Buck, Tim Haulick, Timo Matheja
-
Publication number: 20160261951
Abstract: Methods and apparatus for frequency selective signal mixing for speech enhancement. In one embodiment, frequency-based channel selection is performed for signal magnitude, signal energy, and noise estimate using speaker activity detection information, signal-to-noise ratio, and/or signal level. Frequency-based channel selection is performed for a dynamic spectral floor to adjust the noise estimate using speaker dominance information. Noise reduction is performed on the signal for the selected channel.
Type: Application
Filed: October 30, 2013
Publication date: September 8, 2016
Applicant: NUANCE COMMUNICATIONS, INC.
Inventors: Timo Matheja, Markus Buck, Julien Premont