Patents by Inventor Tobias Bocklet

Tobias Bocklet has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10714122
    Abstract: Speech or non-speech detection techniques are discussed and include updating a speech pattern model using probability scores from an acoustic model to generate a score for each state of the speech pattern model, such that the speech pattern model includes a first non-speech state having multiple self loops each associated with a non-speech probability score of the probability scores, a plurality of speech states following the first non-speech state, and a second non-speech state following the speech states, and detecting speech based on a comparison of a score of the first non-speech state and a score of the last speech state of the multiple speech states.
    Type: Grant
    Filed: June 6, 2018
    Date of Patent: July 14, 2020
    Assignee: Intel Corporation
    Inventors: Maciej Muchlinski, Tobias Bocklet
  • Patent number: 10692492
    Abstract: Techniques are disclosed for client-side analysis of audio samples to identify one or more characteristics associated with captured audio. The client-side analysis may then allow a user device, e.g., a smart phone, laptop computer, in-car infotainment system, and so on, to provide the one or more identified characteristics as configuration data to a voice recognition service at or shortly after connection with the same. In turn, the voice recognition service may load one or more recognition components, e.g., language models and/or application modules/engines, based on the received configuration data. Thus, latency may be reduced based on the voice recognition engine having “hints” that allow components to be loaded without necessarily having to process audio samples first. The reduction of latency may reduce processing time relative to other approaches to voice recognition systems that exclusively perform server-side context recognition/classification.
    Type: Grant
    Filed: September 29, 2017
    Date of Patent: June 23, 2020
    Assignee: Intel IP Corporation
    Inventors: Piotr Rozen, Tobias Bocklet, Jakub Nowicki, Munir Georges
  • Patent number: 10672380
    Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.
    Type: Grant
    Filed: December 27, 2017
    Date of Patent: June 2, 2020
    Assignee: Intel IP Corporation
    Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
  • Patent number: 10665222
    Abstract: A system, article, and method provide temporal-domain feature extraction for automatic speech recognition.
    Type: Grant
    Filed: June 28, 2018
    Date of Patent: May 26, 2020
    Assignee: Intel Corporation
    Inventors: Suyoung Bang, Muhammad Khellah, Somnath Paul, Charles Augustine, Turbo Majumder, Wootaek Lim, Tobias Bocklet, David Pearce
  • Patent number: 10650807
    Abstract: A method and system are directed to autonomous neural network keyphrase detection and include generating and using a multiple element state score vector by using neural network operations and without substantial use of a digital signal processor (DSP) to perform the keyphrase detection.
    Type: Grant
    Filed: September 18, 2018
    Date of Patent: May 12, 2020
    Assignee: Intel Corporation
    Inventors: Tobias Bocklet, Jacek Ossowski, Tomasz Dorau, Maciej Muchlinski, David Pearce, Piotr Rozen
  • Patent number: 10629184
    Abstract: Cepstral variance normalization is described for audio feature extraction.
    Type: Grant
    Filed: December 22, 2014
    Date of Patent: April 21, 2020
    Assignee: Intel Corporation
    Inventors: Tobias Bocklet, Adam Marek
  • Publication number: 20200090657
    Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.
    Type: Application
    Filed: November 22, 2019
    Publication date: March 19, 2020
    Applicant: INTEL CORPORATION
    Inventors: Krzysztof Czarnowski, Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer
  • Publication number: 20200082837
    Abstract: An example apparatus for concealing phrases in audio includes a receiver to receive a detected phrase via a network. The detected phrase is based on audio captured near a source of an audio stream. The apparatus also includes a speech recognizer to generate a trigger in response to detecting that a section of the audio stream contains a confirmed phrase. The apparatus further includes a phrase concealer to conceal the section of the audio stream in response to the trigger.
    Type: Application
    Filed: November 14, 2019
    Publication date: March 12, 2020
    Inventors: Munir Nikolai Alexander Georges, Joachim Hofer, Tobias Bocklet, Josef Bauer, Georg Stemmer
  • Publication number: 20190371326
    Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.
    Type: Application
    Filed: May 21, 2019
    Publication date: December 5, 2019
    Applicant: Intel IP Corporation
    Inventors: Tobias Bocklet, Joachim Hofer
  • Publication number: 20190355379
    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.
    Type: Application
    Filed: August 5, 2019
    Publication date: November 21, 2019
    Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
  • Patent number: 10468032
    Abstract: Techniques related to speaker recognition are discussed. Such techniques include determining context aware confidence values formed of false accept and false reject rates determined by using adaptively updated acoustic environment score distributions matched to current score distributions.
    Type: Grant
    Filed: April 10, 2017
    Date of Patent: November 5, 2019
    Assignee: Intel Corporation
    Inventors: Jonathan J. Huang, Gokcen Cilingir, Tobias Bocklet
  • Publication number: 20190266240
    Abstract: An embodiment of a spoken intent detection device includes technology to detect a phrase in an electronic representation of an audio stream based on a pre-defined vocabulary, associate a time stamp with the detected phrase, and classify a spoken intent based on a sequence of detected phrases and the respective associated time stamps. Other embodiments are disclosed and claimed.
    Type: Application
    Filed: May 9, 2019
    Publication date: August 29, 2019
    Applicant: Intel Corporation
    Inventors: Munir Georges, Wenda Chen, Tobias Bocklet, Jonathan Huang
  • Patent number: 10373630
    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.
    Type: Grant
    Filed: March 31, 2017
    Date of Patent: August 6, 2019
    Assignee: Intel Corporation
    Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
  • Publication number: 20190221205
    Abstract: Techniques related to keyphrase detection for applications such as wake on voice are disclosed herein. Such techniques may have high accuracy by using scores of phone positions in triphones to select which triphones to use with a rejection model, using context-related phones for the rejection model, adding silence before keyphrase sounds for a keyphrase model, or any combination of these.
    Type: Application
    Filed: March 29, 2019
    Publication date: July 18, 2019
    Applicant: Intel Corporation
    Inventors: Sebastian Czyryba, Tobias Bocklet, Kuba Lopatka
  • Publication number: 20190221206
    Abstract: An embodiment of a wake-on-intent speech recognition device includes technology to detect one or more keywords in a digital representation of a spoken natural language utterance, determine an intent of the spoken natural language utterance based on the detected keywords, and provide the spoken natural language utterance to a speech recognition and interpretation system if the determined intent is to further process the spoken natural language utterance. Other embodiments are disclosed and claimed.
    Type: Application
    Filed: March 27, 2019
    Publication date: July 18, 2019
    Applicant: Intel Corporation
    Inventors: Wenda Chen, Jonathan Huang, Tobias Bocklet, Munir Georges
  • Patent number: 10325594
    Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.
    Type: Grant
    Filed: October 17, 2017
    Date of Patent: June 18, 2019
    Assignee: Intel IP Corporation
    Inventors: Tobias Bocklet, Joachim Hofer
  • Publication number: 20190103100
    Abstract: Techniques are disclosed for client-side analysis of audio samples to identify one or more characteristics associated with captured audio. The client-side analysis may then allow a user device, e.g., a smart phone, laptop computer, in-car infotainment system, and so on, to provide the one or more identified characteristics as configuration data to a voice recognition service at or shortly after connection with the same. In turn, the voice recognition service may load one or more recognition components, e.g., language models and/or application modules/engines, based on the received configuration data. Thus, latency may be reduced based on the voice recognition engine having “hints” that allow components to be loaded without necessarily having to process audio samples first. The reduction of latency may reduce processing time relative to other approaches to voice recognition systems that exclusively perform server-side context recognition/classification.
    Type: Application
    Filed: September 29, 2017
    Publication date: April 4, 2019
    Inventors: Piotr Rozen, Tobias Bocklet, Jakub Nowicki, Munir Georges
  • Publication number: 20190049989
    Abstract: Embodiments include apparatuses, systems, and methods for a computer-aided or autonomous driving (CA/AD) system to identify and respond to an audio signal, e.g., an emergency alarm signal. In embodiments, the CA/AD driving system may include a plurality of microphones disposed to capture the audio signal included in sounds surrounding a semi-autonomous or autonomous (SA/AD) vehicle. In embodiments, an audio analysis unit may receive the audio signal to extract audio features from the audio signal. In embodiments, a neural network such as a Deep Neural Network (DNN) may receive the extracted audio features from the audio analysis unit and generate a probability score to allow identification of the audio signal. In embodiments, the CA/AD driving system may control driving elements of the SA/AD vehicle to autonomously or semi-autonomously drive the SA/AD vehicle in response to the identification. Other embodiments may also be described and claimed.
    Type: Application
    Filed: November 17, 2017
    Publication date: February 14, 2019
    Inventors: Sarang Akotkar, Mithil Ramteke, Tobias Bocklet, Sivasubramanian Sundaram
  • Publication number: 20190051299
    Abstract: Techniques related to a method and system of audio false keyphrase rejection using speaker recognition are described herein. Such techniques use speaker recognition of a computer-originated voice so that actions normally triggered when a keyphrase is present in captured audio are omitted when the speech in the captured audio was spoken by the computer-originated voice.
    Type: Application
    Filed: June 25, 2018
    Publication date: February 14, 2019
    Applicant: Intel Corporation
    Inventors: Jacek Ossowski, Tobias Bocklet, Kuba Lopatka
  • Publication number: 20190043488
    Abstract: A method and system are directed to autonomous neural network keyphrase detection and include generating and using a multiple element state score vector by using neural network operations and without substantial use of a digital signal processor (DSP) to perform the keyphrase detection.
    Type: Application
    Filed: September 18, 2018
    Publication date: February 7, 2019
    Applicant: Intel Corporation
    Inventors: Tobias Bocklet, Jacek Ossowski, Tomasz Dorau, Maciej Muchlinski, David Pearce, Piotr Rozen