Patents by Inventor Georg Stemmer

Georg Stemmer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Statistical-analysis-based reset of recurrent neural networks for automatic speech recognition

Patent number: 10255909

Abstract: Techniques are provided for calculating reset parameters for recurrent neural networks (RNN). A methodology implementing the techniques according to an embodiment includes generating a sequence of statistics. The calculation of each statistic is based on outputs of an RNN that is periodically re-initialized at a selected RNN reset time such that each of the calculated statistics is associated with a unique RNN reset time selected from a pre-determined range of reset times. The method further includes analyzing the sequence to identify a maximum interval during which the sequence remains relatively constant. The method further includes selecting a reset time parameter and reset context duration parameter, for re-initialization of the RNN during operation. The reset time parameter is based on the duration of the identified maximum interval and the reset context duration parameter is based on a time associated with the starting point of the identified maximum interval.

Type: Grant

Filed: June 29, 2017

Date of Patent: April 9, 2019

Assignee: INTEL IP CORPORATION

Inventors: Joachim Hofer, Josef G. Bauer, Piotr Rozen, Georg Stemmer
System and method of automatic speech recognition using parallel processing for weighted finite state transducer-based speech decoding

Patent number: 10255911

Abstract: A computer-implemented method of speech recognition comprises forming a weighted finite state transducer (WFST) having nodes associated with states and interconnected by arcs, and to identify at least one word or word sequence hypothesis, identifying multiple sub-graphs on the WFST, each sub-graph having the same arrangement of multiple states and at least one arc, and propagating tokens in parallel through the sub-graphs, where each sub-graph is stored as a supertoken each having an array of tokens.

Type: Grant

Filed: December 17, 2014

Date of Patent: April 9, 2019

Assignee: Intel Corporation

Inventors: Lukasz M. Malinowski, Piotr Jerzy Majcher, Georg Stemmer, Piotr Rozen, Joachim Hofer, Josef G. Bauer
Technologies for improved keyword spotting

Patent number: 10217458

Abstract: Technologies for improved keyword spotting are disclosed. A compute device may capture speech data from a user of the compute device, and perform automatic speech recognition on the captured speech data. The automatic speech recognition algorithm is configured to both spot keywords as well as provide a full transcription of the captured speech data. The automatic speech recognition algorithm may preferentially match the keywords compared to similar words. The recognized keywords may be used to improve parsing of the transcribed speech data or to improve an assistive agent in holding a dialog with a user of the compute device.

Type: Grant

Filed: September 23, 2016

Date of Patent: February 26, 2019

Assignee: Intel Corporation

Inventors: Praful Mangalath, Josef G. Bauer, Georg Stemmer
DYNAMIC ENROLLMENT OF USER-DEFINED WAKE-UP KEY-PHRASE FOR SPEECH ENABLED COMPUTER SYSTEM

Publication number: 20190043481

Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.

Type: Application

Filed: December 27, 2017

Publication date: February 7, 2019

Applicant: INTEL IP CORPORATION

Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
SCORE TREND ANALYSIS FOR REDUCED LATENCY AUTOMATIC SPEECH RECOGNITION

Publication number: 20190043476

Abstract: Techniques are provided for reducing the latency of automatic speech recognition using hypothesis score trend analysis. A methodology implementing the techniques according to an embodiment includes generating complete-phrase hypotheses and partial-phrase hypotheses, along with associated likelihood scores, based on a segment of speech. The method also includes selecting the complete-phrase hypothesis associated with the highest of the complete-phrase hypotheses likelihood scores, and selecting the partial-phrase hypothesis associated with the highest of the partial-phrase hypotheses likelihood scores. The method further includes calculating a relative likelihood score based on a ratio of the likelihood score associated with the selected complete-phrase hypothesis to the likelihood score associated with the selected partial-phrase hypothesis.

Type: Application

Filed: February 9, 2018

Publication date: February 7, 2019

Applicant: INTEL CORPORATION

Inventors: Joachim Hofer, Georg Stemmer, Josef G. Bauer, Munir Nikolai Alexander Georges
STATISTICAL-ANALYSIS-BASED RESET OF RECURRENT NEURAL NETWORKS FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20190005945

Abstract: Techniques are provided for calculating reset parameters for recurrent neural networks (RNN). A methodology implementing the techniques according to an embodiment includes generating a sequence of statistics. The calculation of each statistic is based on outputs of an RNN that is periodically re-initialized at a selected RNN reset time such that each of the calculated statistics is associated with a unique RNN reset time selected from a pre-determined range of reset times. The method further includes analyzing the sequence to identify a maximum interval during which the sequence remains relatively constant. The method further includes selecting a reset time parameter and reset context duration parameter, for re-initialization of the RNN during operation. The reset time parameter is based on the duration of the identified maximum interval and the reset context duration parameter is based on a time associated with the starting point of the identified maximum interval.

Type: Application

Filed: June 29, 2017

Publication date: January 3, 2019

Applicant: INTEL IP CORPORATION

Inventors: Joachim Hofer, Josef G. Bauer, Piotr Rozen, Georg Stemmer
QUERY REJECTION FOR LANGUAGE UNDERSTANDING

Publication number: 20180349794

Abstract: Techniques are provided for rejecting out-of-domain (OD) queries in a language understanding system. A methodology implementing the techniques according to an embodiment includes generating a plurality of in-domain (ID) utterances based on variations of provided ID sentences, and generating a plurality of OD utterances based on variations of provided OD sentences. The method may further include training an ID language model based on the generated ID utterances and training an OD language model based on the generated OD utterances. The ID language model is configured to generate an ID dataset based on calculated probabilities associated with the generated ID utterances. The OD language model is configured to generate an OD dataset based on calculated probabilities associated with the generated OD utterances. The method further includes training a classifier to detect OD queries from a plurality of received queries, the training based on the ID dataset and the OD dataset.

Type: Application

Filed: June 1, 2017

Publication date: December 6, 2018

Applicant: INTEL IP CORPORATION

Inventors: Munir Nikolai Alexander Georges, Szymon Jessa, Georg Stemmer
Context-aware query recognition for electronic devices

Patent number: 10147423

Abstract: A method for context-aware query recognition in an electronic device includes receiving user speech from an input device. A speech signal is generated from the user speech. It is determined if the speech signal includes an action to be performed and if the electronic device is the intended recipient of the user speech. If the recognized speech signal include the action and the intended recipient of the user speech is the electronic device, a command is generated for the electronic device to perform the action.

Type: Grant

Filed: September 29, 2016

Date of Patent: December 4, 2018

Assignee: Intel IP Corporation

Inventors: Munir Nikolai Alexander Georges, Georg Stemmer, Joachim Hofer
Optimizations to decoding of WFST models for automatic speech recognition

Patent number: 10127902

Abstract: A method in a computing device for decoding a weighted finite state transducer (WFST) for automatic speech recognition is described. The method includes sorting a set of one or more WFST arcs based on their arc weight in ascending order. The method further includes iterating through each arc in the sorted set of arcs according to the ascending order until the score of the generated token corresponding to an arc exceeds a score threshold. The method further includes discarding any remaining arcs in the set of arcs that have yet to be considered.

Type: Grant

Filed: June 6, 2017

Date of Patent: November 13, 2018

Assignee: Intel Corporation

Inventors: Joachim Hofer, Georg Stemmer
SPOKEN LANGUAGE UNDERSTANDING BASED ON BUFFERED KEYWORD SPOTTING AND SPEECH RECOGNITION

Publication number: 20180293974

Abstract: Techniques are provided for spoken language understanding based on keyword spotting and speech recognition. A methodology implementing the techniques according to an embodiment includes detecting a user spoken keyword or key-phrase embedded in an initial segment of a received audio signal, which is stored in a buffer. The method further includes triggering an automatic speech recognition (ASR) processor in response to the key-phrase detection. The method further includes performing automatic speech recognition, by the ASR processor, on a combination of the buffered initial segment and one or more additional received segments of the audio signal which include further speech from the user. The method still further includes performing natural language understanding on the recognized speech to determine a user request. The key-phrase is user selectable and serves to wake the ASR processor from a sleeping or idle lower power consumption state, into an active higher power consumption recognition state.

Type: Application

Filed: April 10, 2017

Publication date: October 11, 2018

Applicant: INTEL IP CORPORATION

Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
SYSTEMS AND METHODS FOR ENERGY EFFICIENT AND LOW POWER DISTRIBUTED AUTOMATIC SPEECH RECOGNITION ON WEARABLE DEVICES

Publication number: 20180286414

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.

Type: Application

Filed: March 31, 2017

Publication date: October 4, 2018

Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
MISSPEAK RESOLUTION IN NATURAL LANGUAGE UNDERSTANDING FOR A MAN-MACHINE INTERFACE

Publication number: 20180268813

Abstract: Misspeaking is resolved in a natural language understanding system to interface with a machine. In one example, a user speech utterance is received. A sequence of classifiers is applied to words of the utterance to determine a meaning of the utterance. The meaning in interpreted as a command and the command is applied to a device for execution. The classifiers may include a first classifier to determine an intended function that is a subject of the utterance, a second classifier to determine words with properties that are related to the intended function, and a third classifier to select a property to apply to the function.

Type: Application

Filed: March 17, 2017

Publication date: September 20, 2018

Applicant: Intel IP Corporation

Inventors: Munir Nikolai Alexander Georges, Georg Stemmer, Szymon Jessa
VARIABLE WORD LENGTH NEURAL NETWORK ACCELERATOR CIRCUIT

Publication number: 20180232627

Abstract: A processing system includes a processor to execute a neural network application comprising an operation associated with a weight parameter and an input value, and an accelerator circuit, associated with the processor, to perform the operation, the accelerator circuit comprising a weight storage device to store a bit stream encoding the weight parameter, a controller to request a bit from the bit stream, an input data storage to store the input value, and an arithmetic logic unit (ALU) comprising an accumulator circuit to store an accumulation value and an operator circuit to receive the bit and the input value, receive a control signal from the controller, and responsive to determining that the control signal is set to a first value corresponding to a first operation and that that the bit encodes a first status, increase the accumulation value, stored in the accumulation circuit, by the input value.

Type: Application

Filed: February 16, 2017

Publication date: August 16, 2018

Inventors: Piotr Rozen, Ramya Rasipuram, Georg Stemmer
TECHNOLOGIES FOR IMPROVED KEYWORD SPOTTING

Publication number: 20180090131

Abstract: Technologies for improved keyword spotting are disclosed. A compute device may capture speech data from a user of the compute device, and perform automatic speech recognition on the captured speech data. The automatic speech recognition algorithm is configured to both spot keywords as well as provide a full transcription of the captured speech data. The automatic speech recognition algorithm may preferentially match the keywords compared to similar words. The recognized keywords may be used to improve parsing of the transcribed speech data or to improve an assistive agent in holding a dialog with a user of the compute device.

Type: Application

Filed: September 23, 2016

Publication date: March 29, 2018

Inventors: Praful Mangalath, Josef G. Bauer, Georg Stemmer
CONTEXT-AWARE QUERY RECOGNITION FOR ELECTRONIC DEVICES

Publication number: 20180090140

Abstract: A method for context-aware query recognition in an electronic device includes receiving user speech from an input device. A speech signal is generated from the user speech. It is determined if the speech signal includes an action to be performed and if the electronic device is the intended recipient of the user speech. If the recognized speech signal include the action and the intended recipient of the user speech is the electronic device, a command is generated for the electronic device to perform the action.

Type: Application

Filed: September 29, 2016

Publication date: March 29, 2018

Inventors: Munir Nikolai Alexander Georges, Georg Stemmer, Joachim Hofer
METHOD AND SYSTEM OF AUTOMATIC SPEECH RECOGNITION USING POSTERIOR CONFIDENCE SCORES

Publication number: 20180068653

Abstract: A system, article, and method include techniques of automatic speech recognition using posterior confidence scores.

Type: Application

Filed: September 8, 2016

Publication date: March 8, 2018

Inventors: David J. Trawick, Joachim Hofer, Josef G. Bauer, Georg Stemmer, Da-Ming Chiang
Technologies for robust crying detection using temporal characteristics of acoustic features

Patent number: 9899034

Abstract: Technologies for identifying sounds are disclosed. A sound identification device may capture sound data, and split the sound data into frames. The sound identification device may then determine an acoustic feature vector for each frame, and determine parameters based on how each acoustic feature varies over the duration of time corresponding to the frames. The sound identification device may then determine if the sound matches a pre-defined sound based on the parameters. In one embodiment, the sound identification device may be a baby monitor, and the pre-defined sound may be a baby crying.

Type: Grant

Filed: December 22, 2015

Date of Patent: February 20, 2018

Assignee: Intel IP Corporation

Inventors: Joachim Hofer, Tobias Bocklet, Georg Stemmer, David Pearce, Sebastian Czyryba, Josef G. Bauer
OPTIMIZATIONS TO DECODING OF WFST MODELS FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20180025720

Abstract: A method in a computing device for decoding a weighted finite state transducer (WFST) for automatic speech recognition is described. The method includes sorting a set of one or more WFST arcs based on their arc weight in ascending order. The method further includes iterating through each arc in the sorted set of arcs according to the ascending order until the score of the generated token corresponding to an arc exceeds a score threshold. The method further includes discarding any remaining arcs in the set of arcs that have yet to be considered.

Type: Application

Filed: June 6, 2017

Publication date: January 25, 2018

Inventors: Joachim HOFER, Georg STEMMER
SYSTEM AND METHOD OF AUTOMATIC SPEECH RECOGNITION USING PARALLEL PROCESSING FOR WEIGHTED FINITE STATE TRANSDUCER-BASED SPEECH DECODING

Publication number: 20170323638

Abstract: A system, article, and method of automatic speech recognition using parallel processing for weighted finite state transducer-based speech decoding.

Type: Application

Filed: December 12, 2014

Publication date: November 9, 2017

Inventors: Lukasz MALINOWSKI, Piotr MAJCHER, Georg STEMMER, Piotr ROZEN, Joachim HOFER, Josef BAUER
Method and system of automatic speech recognition with dynamic vocabularies

Patent number: 9740678

Abstract: A system, article, and method of automatic speech recognition with dynamic vocabularies is described herein.

Type: Grant

Filed: June 25, 2015

Date of Patent: August 22, 2017

Assignee: Intel Corporation

Inventors: Joachim Hofer, Georg Stemmer, Josef Bauer

prev 1 2 3 next