Patents by Inventor Tobias Bocklet

Tobias Bocklet has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech classification of audio for wake on voice

Patent number: 10714122

Abstract: Speech or non-speech detection techniques are discussed and include updating a speech pattern model using probability scores from an acoustic model to generate a score for each state of the speech pattern model, such that the speech pattern model includes a first non-speech state having multiple self loops each associated with a non-speech probability score of the probability scores, a plurality of speech states following the first non-speech state, and a second non-speech state following the speech states, and detecting speech based on a comparison of a score of the first non-speech state and a score of the last speech state of the multiple speech states.

Type: Grant

Filed: June 6, 2018

Date of Patent: July 14, 2020

Assignee: Intel Corporation

Inventors: Maciej Muchlinski, Tobias Bocklet
Techniques for client-side speech domain detection using gyroscopic data and a system using the same

Patent number: 10692492

Abstract: Techniques are disclosed for client-side analysis of audio samples to identify one or more characteristics associated with captured audio. The client-side analysis may then allow a user device, e.g., a smart phone, laptop computer, in-car infotainment system, and so on, to provide the one or more identified characteristics as configuration data to a voice recognition service at or shortly after connection with the same. In turn, the voice recognition service may load one or more recognition components, e.g., language models and/or application modules/engines, based on the received configuration data. Thus, latency may be reduced based on the voice recognition engine having “hints” that allow components to be loaded without necessarily having to process audio samples first. The reduction of latency may reduce processing time relative to other approaches to voice recognitions systems that exclusively perform server-side context recognition/classification.

Type: Grant

Filed: September 29, 2017

Date of Patent: June 23, 2020

Assignee: Intel IP Corporation

Inventors: Piotr Rozen, Tobias Bocklet, Jakub Nowicki, Munir Georges
Dynamic enrollment of user-defined wake-up key-phrase for speech enabled computer system

Patent number: 10672380

Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.

Type: Grant

Filed: December 27, 2017

Date of Patent: June 2, 2020

Assignee: Intel IP Corporation

Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
Method and system of temporal-domain feature extraction for automatic speech recognition

Patent number: 10665222

Abstract: A system, article, and method provide temporal-domain feature extraction for automatic speech recognition.

Type: Grant

Filed: June 28, 2018

Date of Patent: May 26, 2020

Assignee: Intel Corporation

Inventors: Suyoung Bang, Muhammad Khellah, Somnath Paul, Charles Augustine, Turbo Majumder, Wootaek Lim, Tobias Bocklet, David Pearce
Method and system of neural network keyphrase detection

Patent number: 10650807

Abstract: A method and system are directed to autonomous neural network keyphrase detection and includes generating and using a multiple element state score vector by using neural network operations and without substantial use of a digital signal processor (DSP) to perform the keyphrase detection.

Type: Grant

Filed: September 18, 2018

Date of Patent: May 12, 2020

Assignee: Intel Corporation

Inventors: Tobias Bocklet, Jacek Ossowski, Tomasz Dorau, Maciej Muchlinski, David Pearce, Piotr Rozen
Cepstral variance normalization for audio feature extraction

Patent number: 10629184

Abstract: Cepstral variance normalization is described for audio feature extraction.

Type: Grant

Filed: December 22, 2014

Date of Patent: April 21, 2020

Assignee: Intel Corporation

Inventors: Tobias Bocklet, Adam Marek
ADAPTIVELY RECOGNIZING SPEECH USING KEY PHRASES

Publication number: 20200090657

Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.

Type: Application

Filed: November 22, 2019

Publication date: March 19, 2020

Applicant: INTEL CORPORATION

Inventors: Krzysztof Czarnowski, Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer
CONCEALING PHRASES IN AUDIO TRAVELING OVER AIR

Publication number: 20200082837

Abstract: An example apparatus for concealing phrases in audio includes a receiver to receive a detected phrase via a network. The detected phrase is based on audio captured near a source of an audio stream. The apparatus also includes a speech recognizer to generate a trigger in response to detecting that a section of the audio stream contains a confirmed phrase. The apparatus further includes a phrase concealer to conceal the section of the audio stream in response to the trigger.

Type: Application

Filed: November 14, 2019

Publication date: March 12, 2020

Inventors: Munir Nikolai Alexander Georges, Joachim Hofer, Tobias Bocklet, Josef Bauer, Georg Stemmer
LOW RESOURCE KEY PHRASE DETECTION FOR WAKE ON VOICE

Publication number: 20190371326

Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.

Type: Application

Filed: May 21, 2019

Publication date: December 5, 2019

Applicant: Intel IP Corporation

Inventors: Tobias Bocklet, Joachim Hofer
SYSTEMS AND METHODS FOR ENERGY EFFICIENT AND LOW POWER DISTRIBUTED AUTOMATIC SPEECH RECOGNITION ON WEARABLE DEVICES

Publication number: 20190355379

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.

Type: Application

Filed: August 5, 2019

Publication date: November 21, 2019

Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
Method and system of speaker recognition using context aware confidence modeling

Patent number: 10468032

Abstract: Techniques related to speaker recognition are discussed. Such techniques include determining context aware confidence values formed of false accept and false reject rates determined by using adaptively updated acoustic environment score distributions matched to current score distributions.

Type: Grant

Filed: April 10, 2017

Date of Patent: November 5, 2019

Assignee: Intel Corporation

Inventors: Jonathan J. Huang, Gokcen Cilingir, Tobias Bocklet
TIME ASYNCHRONOUS SPOKEN INTENT DETECTION

Publication number: 20190266240

Abstract: An embodiment of a spoken intent detection device includes technology to detect a phrase in an electronic representation of an audio stream based on a pre-defined vocabulary, associate a time stamp with the detected phrase, and classify a spoken intent based on a sequence of detected phrases and the respective associated time stamps. Other embodiments are disclosed and claimed.

Type: Application

Filed: May 9, 2019

Publication date: August 29, 2019

Applicant: Intel Corporation

Inventors: Munir Georges, Wenda Chen, Tobias Bocklet, Jonathan Huang
Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices

Patent number: 10373630

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.

Type: Grant

Filed: March 31, 2017

Date of Patent: August 6, 2019

Assignee: Intel Corporation

Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
METHOD AND SYSTEM OF HIGH ACCURACY KEYPHRASE DETECTION FOR LOW RESOURCE DEVICES

Publication number: 20190221205

Abstract: Techniques related to keyphrase detection for applications such as wake on voice are disclosed herein. Such techniques may have high accuracy by using scores of phone positions in triphones to select which triphones to use with a rejection model, using context-related phones for the rejection model, adding silence before keyphrase sounds for a keyphrase model, or any combination of these.

Type: Application

Filed: March 29, 2019

Publication date: July 18, 2019

Applicant: Intel Corporation

Inventors: Sebastian Czyryba, Tobias Bocklet, Kuba Lopatka
SPOKEN KEYWORD DETECTION BASED UTTERANCE-LEVEL WAKE ON INTENT SYSTEM

Publication number: 20190221206

Abstract: An embodiment of a wake-on-intent speech recognition device includes technology to detect one or more keywords in a digital representation of a spoken natural language utterance, determine an intent of the spoken natural language utterance based on the detected keywords, and provide the spoken natural language utterance to a speech recognition and interpretation system if the determined intent is to further process the spoken natural language utterance. Other embodiments are disclosed and claimed.

Type: Application

Filed: March 27, 2019

Publication date: July 18, 2019

Applicant: Intel Corporation

Inventors: Wenda Chen, Jonathan Huang, Tobias Bocklet, Munir Georges
Low resource key phrase detection for wake on voice

Patent number: 10325594

Abstract: Techniques related to key phrase detection for applications such as wake on voice are discussed. Such techniques may include updating a start state based rejection model and a key phrase model based on scores of sub-phonetic units from an acoustic model to generate a rejection likelihood score and a key phrase likelihood score and determining whether received audio input is associated with a predetermined key phrase based on the rejection likelihood score and the key phrase likelihood score.

Type: Grant

Filed: October 17, 2017

Date of Patent: June 18, 2019

Assignee: Intel IP Corporation

Inventors: Tobias Bocklet, Joachim Hofer
TECHNIQUES FOR CLIENT-SIDE SPEECH DOMAIN DETECTION AND A SYSTEM USING THE SAME

Publication number: 20190103100

Abstract: Techniques are disclosed for client-side analysis of audio samples to identify one or more characteristics associated with captured audio. The client-side analysis may then allow a user device, e.g., a smart phone, laptop computer, in-car infotainment system, and so on, to provide the one or more identified characteristics as configuration data to a voice recognition service at or shortly after connection with the same. In turn, the voice recognition service may load one or more recognition components, e.g., language models and/or application modules/engines, based on the received configuration data. Thus, latency may be reduced based on the voice recognition engine having “hints” that allow components to be loaded without necessarily having to process audio samples first. The reduction of latency may reduce processing time relative to other approaches to voice recognitions systems that exclusively perform server-side context recognition/classification.

Type: Application

Filed: September 29, 2017

Publication date: April 4, 2019

Inventors: PIOTR ROZEN, TOBIAS BOCKLET, JAKUB NOWICKI, MUNIR GEORGES
IDENTIFICATION OF AUDIO SIGNALS IN SURROUNDING SOUNDS AND GUIDANCE OF AN AUTONOMOUS VEHICLE IN RESPONSE TO THE SAME

Publication number: 20190049989

Abstract: Embodiments include apparatuses, systems, and methods for a computer-aided or autonomous driving (CA/AD) system to identify and respond to an audio signal, e.g., an emergency alarm signal. In embodiments, the CA/AD driving system may include a plurality of microphones disposed to capture the audio signal included in surrounding sounds to a semi-autonomous or autonomous (SA/AD) vehicle. In embodiments, an audio analysis unit may receive the audio signal to extract audio features from the audio signal. In embodiments, a neural network such as a Deep Neural Network (DNN) may receive the extracted audio features from the audio analysis unit and to generate a probability score to allow identification of the audio signal. In embodiments, the CA/AD driving system may control driving elements of the SA/AD vehicle to autonomously or semi-autonomously drive the SA/AD vehicle in response to the identification. Other embodiments may also be described and claimed.

Type: Application

Filed: November 17, 2017

Publication date: February 14, 2019

Inventors: Sarang Akotkar, Mithil Ramteke, Tobias Bocklet, Sivasubramanian Sundaram
METHOD AND SYSTEM OF AUDIO FALSE KEYPHRASE REJECTION USING SPEAKER RECOGNITION

Publication number: 20190051299

Abstract: Techniques related to a method and system of audio false keyphrase rejection using speaker recognition are described herein. Such techniques use speaker recognition of a computer originated voice to omit actions triggered when a keyphrase is present in captured audio and omitted when speech of the captured audio was spoken by the computer originated voice.

Type: Application

Filed: June 25, 2018

Publication date: February 14, 2019

Applicant: Intel Corporation

Inventors: Jacek Ossowski, Tobias Bocklet, Kuba Lopatka
METHOD AND SYSTEM OF NEURAL NETWORK KEYPHRASE DETECTION

Publication number: 20190043488

Abstract: A method and system are directed to autonomous neural network keyphrase detection and includes generating and using a multiple element state score vector by using neural network operations and without substantial use of a digital signal processor (DSP) to perform the keyphrase detection.

Type: Application

Filed: September 18, 2018

Publication date: February 7, 2019

Applicant: Intel Corporation

Inventors: Tobias Bocklet, Jacek Ossowski, Tomasz Dorau, Maciej Muchlinski, David Pearce, Piotr Rozen

prev 1 2 3 4 next