Patents by Inventor Munir Nikolai Alexander Georges

Munir Nikolai Alexander Georges has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11437020
    Abstract: According to some aspects, a system for detecting a designated wake-up word is provided, the system comprising a plurality of microphones to detect acoustic information from a physical space having a plurality of acoustic zones, at least one processor configured to receive a first acoustic signal representing the acoustic information received by the plurality of microphones, process the first acoustic signal to identify content of the first acoustic signal originating from each of the plurality of acoustic zones, provide a plurality of second acoustic signals, each of the plurality of second acoustic signals substantially corresponding to the content identified as originating from a respective one of the plurality of acoustic zones, and perform automatic speech recognition on each of the plurality of second acoustic signals to determine whether the designated wake-up word was spoken.
    Type: Grant
    Filed: February 10, 2016
    Date of Patent: September 6, 2022
    Assignee: CERENCE OPERATING COMPANY
    Inventors: Julien Prémont, Tim Haulick, Emanuele Dalmasso, Munir Nikolai Alexander Georges, Andreas Kellner, Gaetan Martens, Oliver Van Porten, Holger Quast, Martin Roessler, Tobias Wolff, Markus Buck
  • Patent number: 11074249
    Abstract: Techniques are provided for dynamic adaptation of language understanding systems to acoustic environments. A methodology implementing the techniques according to an embodiment includes generating a trigger in response to recognition of a wake-on-voice key-phrase in or prior to an audio stream. The trigger serves to switch processing modes from an adaptation mode to a query recognition mode. The method further includes performing automatic speech recognition on the audio stream during the query recognition mode, to recognize an in-domain query. The method further includes applying both a static language understanding classifier and a dynamic language understanding classifier to the recognized in-domain query. The static language understanding classifier employs a static semantic model and the dynamic language understanding classifier employs a dynamic semantic model.
    Type: Grant
    Filed: April 10, 2018
    Date of Patent: July 27, 2021
    Assignee: Intel Corporation
    Inventor: Munir Nikolai Alexander Georges
  • Patent number: 10909972
    Abstract: An example apparatus for detecting intent in voiced audio includes a receiver to receive one or more word sequence hypotheses related to a voiced audio and a dynamic vocabulary. The apparatus also includes a natural language understander (NLU) to detect an intent and recognize a property related to the intent based on the word sequence hypothesis and the dynamic vocabulary. The apparatus further includes a transmitter to transmit the detected intent and recognized associated property to an application.
    Type: Grant
    Filed: November 7, 2017
    Date of Patent: February 2, 2021
    Assignee: Intel Corporation
    Inventors: Munir Nikolai Alexander Georges, Grzegorz Wojdyga, Tomasz Noczynski, Jakub Nowicki, Szymon Jessa
  • Patent number: 10770094
    Abstract: An example apparatus for routing audio streams includes an audio receiver to receive audio from a microphone. The apparatus also includes a classifier to semantically generate a result set based on the audio. The apparatus further includes a scheduler to select a spoken language understanding (SLU) engine based on the result set. The apparatus includes a router to route the audio to the selected SLU engine.
    Type: Grant
    Filed: January 9, 2018
    Date of Patent: September 8, 2020
    Assignee: Intel IP Corporation
    Inventors: Munir Nikolai Alexander Georges, Jakub Nowicki
  • Patent number: 10672380
    Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.
    Type: Grant
    Filed: December 27, 2017
    Date of Patent: June 2, 2020
    Assignee: Intel IP Corporation
    Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
  • Patent number: 10657952
    Abstract: Techniques are provided for reducing the latency of automatic speech recognition using hypothesis score trend analysis. A methodology implementing the techniques according to an embodiment includes generating complete-phrase hypotheses and partial-phrase hypotheses, along with associated likelihood scores, based on a segment of speech. The method also includes selecting the complete-phrase hypothesis associated with the highest of the complete-phrase hypotheses likelihood scores, and selecting the partial-phrase hypothesis associated with the highest of the partial-phrase hypotheses likelihood scores. The method further includes calculating a relative likelihood score based on a ratio of the likelihood score associated with the selected complete-phrase hypothesis to the likelihood score associated with the selected partial-phrase hypothesis.
    Type: Grant
    Filed: February 9, 2018
    Date of Patent: May 19, 2020
    Assignee: Intel IP Corporation
    Inventors: Joachim Hofer, Georg Stemmer, Josef G. Bauer, Munir Nikolai Alexander Georges
  • Publication number: 20200090657
    Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.
    Type: Application
    Filed: November 22, 2019
    Publication date: March 19, 2020
    Applicant: INTEL CORPORATION
    Inventors: Krzysztof Czarnowski, Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer
  • Publication number: 20200082837
    Abstract: An example apparatus for concealing phrases in audio includes a receiver to receive a detected phrase via a network. The detected phrase is based on audio captured near a source of an audio stream. The apparatus also includes a speech recognizer to generate a trigger in response to detecting that a section of the audio stream contains a confirmed phrase. The apparatus further includes a phrase concealer to conceal the section of the audio stream in response to the trigger.
    Type: Application
    Filed: November 14, 2019
    Publication date: March 12, 2020
    Inventors: Munir Nikolai Alexander Georges, Joachim Hofer, Tobias Bocklet, Josef Bauer, Georg Stemmer
  • Patent number: 10504510
    Abstract: A method or associated system for motion adaptive speech processing includes dynamically estimating a motion profile that is representative of a user's motion based on data from one or more resources, such as sensors and non-speech resources, associated with the user. The method includes effecting processing of a speech signal received from the user, for example, while the user is in motion, the processing taking into account the estimated motion profile to produce an interpretation of the speech signal. Dynamically estimating the motion profile can include computing a motion weight vector using the data from the one or more resources associated with the user, and can further include interpolating a plurality of models using the motion weight vector to generate a motion adaptive model. The motion adaptive model can be used to enhance voice destination entry for the user and re-used for other users who do not provide motion profiles.
    Type: Grant
    Filed: June 10, 2015
    Date of Patent: December 10, 2019
    Assignee: Cerence Operating Company
    Inventors: Munir Nikolai Alexander Georges, Josef Damianus Anastasiadis, Oliver Bender
  • Publication number: 20190348036
    Abstract: A method for context-aware query recognition in an electronic device includes receiving user speech from an input device. A speech signal is generated from the user speech. It is determined if the speech signal includes an action to be performed and if the electronic device is the intended recipient of the user speech. If the recognized speech signal includes the action and the intended recipient of the user speech is the electronic device, a command is generated for the electronic device to perform the action.
    Type: Application
    Filed: December 4, 2018
    Publication date: November 14, 2019
    Inventors: Munir Nikolai Alexander Georges, Georg Stemmer, Joachim Hofer
  • Publication number: 20190073999
    Abstract: According to some aspects, a system for detecting a designated wake-up word is provided, the system comprising a plurality of microphones to detect acoustic information from a physical space having a plurality of acoustic zones, at least one processor configured to receive a first acoustic signal representing the acoustic information received by the plurality of microphones, process the first acoustic signal to identify content of the first acoustic signal originating from each of the plurality of acoustic zones, provide a plurality of second acoustic signals, each of the plurality of second acoustic signals substantially corresponding to the content identified as originating from a respective one of the plurality of acoustic zones, and perform automatic speech recognition on each of the plurality of second acoustic signals to determine whether the designated wake-up word was spoken.
    Type: Application
    Filed: February 10, 2016
    Publication date: March 7, 2019
    Applicant: Nuance Communications, Inc.
    Inventors: Julien Prémont, Tim Haulick, Emanuele Dalmasso, Munir Nikolai Alexander Georges, Andreas Kellner, Gaetan Martens, Oliver van Porten, Holger Quast, Martin Rößler, Tobias Wolff, Markus Buck
  • Publication number: 20190043476
    Abstract: Techniques are provided for reducing the latency of automatic speech recognition using hypothesis score trend analysis. A methodology implementing the techniques according to an embodiment includes generating complete-phrase hypotheses and partial-phrase hypotheses, along with associated likelihood scores, based on a segment of speech. The method also includes selecting the complete-phrase hypothesis associated with the highest of the complete-phrase hypotheses likelihood scores, and selecting the partial-phrase hypothesis associated with the highest of the partial-phrase hypotheses likelihood scores. The method further includes calculating a relative likelihood score based on a ratio of the likelihood score associated with the selected complete-phrase hypothesis to the likelihood score associated with the selected partial-phrase hypothesis.
    Type: Application
    Filed: February 9, 2018
    Publication date: February 7, 2019
    Applicant: INTEL CORPORATION
    Inventors: Joachim Hofer, Georg Stemmer, Josef G. Bauer, Munir Nikolai Alexander Georges
  • Publication number: 20190043481
    Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.
    Type: Application
    Filed: December 27, 2017
    Publication date: February 7, 2019
    Applicant: INTEL IP CORPORATION
    Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
  • Publication number: 20190043497
    Abstract: Techniques are provided for dynamic adaptation of language understanding systems to acoustic environments. A methodology implementing the techniques according to an embodiment includes generating a trigger in response to recognition of a wake-on-voice key-phrase in or prior to an audio stream. The trigger serves to switch processing modes from an adaptation mode to a query recognition mode. The method further includes performing automatic speech recognition on the audio stream during the query recognition mode, to recognize an in-domain query. The method further includes applying both a static language understanding classifier and a dynamic language understanding classifier to the recognized in-domain query. The static language understanding classifier employs a static semantic model and the dynamic language understanding classifier employs a dynamic semantic model.
    Type: Application
    Filed: April 10, 2018
    Publication date: February 7, 2019
    Applicant: INTEL IP CORPORATION
    Inventor: Munir Nikolai Alexander Georges
  • Publication number: 20190043527
    Abstract: An example apparatus for routing audio streams includes an audio receiver to receive audio from a microphone. The apparatus also includes a classifier to semantically generate a result set based on the audio. The apparatus further includes a scheduler to select a spoken language understanding (SLU) engine based on the result set. The apparatus includes a router to route the audio to the selected SLU engine.
    Type: Application
    Filed: January 9, 2018
    Publication date: February 7, 2019
    Applicant: INTEL IP CORPORATION
    Inventors: Munir Nikolai Alexander Georges, Jakub Nowicki
  • Publication number: 20190027133
    Abstract: An example apparatus for detecting intent in voiced audio includes a receiver to receive one or more word sequence hypotheses related to a voiced audio and a dynamic vocabulary. The apparatus also includes a natural language understander (NLU) to detect an intent and recognize a property related to the intent based on the word sequence hypothesis and the dynamic vocabulary. The apparatus further includes a transmitter to transmit the detected intent and recognized associated property to an application.
    Type: Application
    Filed: November 7, 2017
    Publication date: January 24, 2019
    Applicant: INTEL CORPORATION
    Inventors: Munir Nikolai Alexander Georges, Grzegorz Wojdyga, Tomasz Noczynski, Jakub Nowicki, Szymon Jessa
  • Publication number: 20180366123
    Abstract: Systems and methods for processing results from plural speech services are described. A method includes receiving speech service results from plural speech services and service specifications corresponding to the speech service results. The results are at least one data structure representing information according to functionality of the speech services. The service specifications describe the data structure and its interpretation for each speech service. The speech service results are encoded into a unified conceptual knowledge representation of the results based on the service specification. The unified conceptual knowledge representation is provided to an application module. A method includes assessing speech service results received asynchronously from plural speech services to determine, based on a reliability measure, whether there is a reliable result among the speech service results received.
    Type: Application
    Filed: May 31, 2016
    Publication date: December 20, 2018
    Inventors: Munir Nikolai Alexander Georges, Friederike Eva Anabel Niedtner, Josef Damianus Anastasiadis, Oliver Bender, Jeroen Maurice Decroos
  • Publication number: 20180357998
    Abstract: Techniques are provided for language identification performed in conjunction with wake-on-voice keyword detection. A methodology implementing the techniques according to an embodiment includes applying phrase models to a user-spoken keyword. Each of the phrase models is configured to detect the keyword in a selected language and to generate a probability associated with the detection. The method further includes scoring the probabilities associated with the keyword detection in each of the languages, and identifying the language of the keyword based on the scoring. Automatic speech recognition and spoken language understanding systems may then be configured or selected to process further speech from the user in the identified language. In some embodiments, the phrase models are generated, in an offline process, based on provided grapheme sequences representing the keyword in the language associated with the phrase model.
    Type: Application
    Filed: June 13, 2017
    Publication date: December 13, 2018
    Applicant: INTEL IP CORPORATION
    Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet
  • Publication number: 20180349794
    Abstract: Techniques are provided for rejecting out-of-domain (OD) queries in a language understanding system. A methodology implementing the techniques according to an embodiment includes generating a plurality of in-domain (ID) utterances based on variations of provided ID sentences, and generating a plurality of OD utterances based on variations of provided OD sentences. The method may further include training an ID language model based on the generated ID utterances and training an OD language model based on the generated OD utterances. The ID language model is configured to generate an ID dataset based on calculated probabilities associated with the generated ID utterances. The OD language model is configured to generate an OD dataset based on calculated probabilities associated with the generated OD utterances. The method further includes training a classifier to detect OD queries from a plurality of received queries, the training based on the ID dataset and the OD dataset.
    Type: Application
    Filed: June 1, 2017
    Publication date: December 6, 2018
    Applicant: INTEL IP CORPORATION
    Inventors: Munir Nikolai Alexander Georges, Szymon Jessa, Georg Stemmer
  • Patent number: 10147423
    Abstract: A method for context-aware query recognition in an electronic device includes receiving user speech from an input device. A speech signal is generated from the user speech. It is determined if the speech signal includes an action to be performed and if the electronic device is the intended recipient of the user speech. If the recognized speech signal includes the action and the intended recipient of the user speech is the electronic device, a command is generated for the electronic device to perform the action.
    Type: Grant
    Filed: September 29, 2016
    Date of Patent: December 4, 2018
    Assignee: Intel IP Corporation
    Inventors: Munir Nikolai Alexander Georges, Georg Stemmer, Joachim Hofer
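
Illustrative note on the relative likelihood score described in the abstract of patent 10657952 (also published as 20190043476): the abstract describes picking the highest-scoring complete-phrase hypothesis and the highest-scoring partial-phrase hypothesis, then taking the ratio of their likelihood scores. The sketch below shows only that arithmetic. It is not the patented implementation; the `Hypothesis` type, the `relative_likelihood` function, the use of linear-domain (not log) likelihoods, and the choice to return `None` when either group is empty are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Optional


@dataclass(frozen=True)
class Hypothesis:
    """Hypothetical container for one recognizer hypothesis."""
    text: str
    likelihood: float  # assumed: positive, linear-domain likelihood score
    complete: bool     # True for a complete-phrase hypothesis


def relative_likelihood(hypotheses: Iterable[Hypothesis]) -> Optional[float]:
    """Ratio of the best complete-phrase score to the best partial-phrase score.

    Returns None when either group is empty (an assumption; the abstract
    does not say how that case is handled).
    """
    hyps = list(hypotheses)
    complete = [h for h in hyps if h.complete]
    partial = [h for h in hyps if not h.complete]
    if not complete or not partial:
        return None

    # Select the highest-likelihood hypothesis in each group.
    best_complete = max(complete, key=lambda h: h.likelihood)
    best_partial = max(partial, key=lambda h: h.likelihood)

    return best_complete.likelihood / best_partial.likelihood


# Made-up scores for a single segment of speech.
segment = [
    Hypothesis("play music", 0.5, complete=True),
    Hypothesis("play", 0.125, complete=True),
    Hypothesis("play music by", 0.25, complete=False),
    Hypothesis("play mu", 0.0625, complete=False),
]
score = relative_likelihood(segment)
print(score)
```

A ratio above 1 means the best complete-phrase hypothesis currently outscores the best partial-phrase hypothesis. How that value is tracked over time to reduce latency is not specified in the abstract and is left out here.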