Patents by Inventor Georg Stemmer

Georg Stemmer has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS AND APPARATUS FOR AUDIO ADJUSTMENT BASED ON VOCAL EFFORT

Publication number: 20230410810

Abstract: Methods and apparatus to audio adjustment based on vocal effort are disclosed herein. An example apparatus comprising interface circuitry, machine readable instructions, and programmable circuitry to at least one of instantiate or execute the machine readable instructions to identify speech with a soft voice type in audio from a first user device, the speech with the soft voice type including phonation, modify the audio to generate modified audio based on the identification of the speech with the soft voice type, and output the modified audio from a second user device.

Type: Application

Filed: August 28, 2023

Publication date: December 21, 2023

Inventors: Hector Alfonso Cordourier Maruri, Georg Stemmer, Lukasz Kurylo, Himanshu Bhalla
ENHANCED SPATIAL AUDIO-BASED VIRTUAL SEATING ARRANGEMENTS

Publication number: 20220343289

Abstract: This disclosure describes systems, methods, and devices related to presenting video conferencing virtual seating arrangements. A method may include generating a first similarity score indicative of a first similarity between a first voice of a first virtual meeting user and a second voice of a second virtual meeting user; generating a second similarity score indicative of a second similarity between the first voice of the first virtual meeting user and a third voice of a third virtual meeting user; determining, based on the first similarity score and the second similarity score, a similarity loss for a virtual seating arrangement; determining that the similarity loss is a minimum similarity loss of respective similarity losses for different virtual seating arrangements; generating presentation data, for the virtual meeting, including virtual representations of the virtual meeting users arranged based on the virtual seating arrangement; and presenting the presentation data.

Type: Application

Filed: June 29, 2022

Publication date: October 27, 2022

Inventors: Georg Stemmer, Willem Beltman, Hector Cordourier Maruri
METHODS AND APPARATUS TO GENERATE BINAURAL SOUNDS FOR HEARING DEVICES

Publication number: 20220286798

Abstract: Methods, apparatus, systems, and articles of manufacture are disclosed to generate binaural sounds for hearing devices. An example apparatus includes processor circuitry to at least access audio data corresponding to multiple devices, ones of the multiple devices positioned at spatial locations relative to a listener, identify a position of the listener relative to the multiple devices, adjust, based on the spatial locations and the position of the listener, the audio data associated with at least one of the multiple devices, transmit the adjusted audio data to a hearing device associated with the listener, the adjusted audio data including a binaural sound corresponding to each of the spatial locations.

Type: Application

Filed: May 27, 2022

Publication date: September 8, 2022

Inventors: Georg Stemmer, Hector Cordourier Maruri, Willem Beltman
METHOD AND SYSTEM OF AUTOMATIC CONTEXT-BOUND DOMAIN-SPECIFIC SPEECH RECOGNITION

Publication number: 20220122596

Abstract: A system, article, and method of automatic context-bound domain-specific speech recognition uses general language models.

Type: Application

Filed: December 24, 2021

Publication date: April 21, 2022

Applicant: Intel Corporation

Inventors: Szymon Jessa, Jakub Nowicki, Michal Papaj, Piotr Hoffmann, Krzysztof Swider, Georg Stemmer
Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices

Patent number: 11308978

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.

Type: Grant

Filed: August 5, 2019

Date of Patent: April 19, 2022

Assignee: INTEL CORPORATION

Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
AUTOMATIC PERSONAL IDENTIFIABLE INFORMATION REMOVAL FROM AUDIO

Publication number: 20220084521

Abstract: This disclosure describes systems, methods, and devices related to automatic personal identifiable information (PII) removal. A system may detect a sound signal received from a vicinity of a machine during the operation of the machine. The system may perform speech detection to detect a segment of the sound signal that comprises a speech signal. The system may modify the sound signal at the segment of the sound signal by performing a segment replacement mechanism. The system may generate a filtered sound signal to be used for monitoring the operation of the machine.

Type: Application

Filed: November 23, 2021

Publication date: March 17, 2022

Inventors: Raju Arvind, Jose Lopez, Georg Stemmer
DEEPFAKE DETECTION MODELS UTILIZING SUBJECT-SPECIFIC LIBRARIES

Publication number: 20220004904

Abstract: An apparatus to facilitate deepfake detection models utilizing subject-specific libraries is disclosed. The apparatus includes one or more processors to store a plurality of deepfake detection models corresponding to a plurality of subjects of interest; receive a query to identify whether data pertaining to a target subject of interest is a deepfake, the target subject of interest comprised in the plurality of subjects of interest and associated with a subject identifier (ID); identify a deepfake detection model corresponding to the subject ID; extract features for deepfake detection from the data; input the extracted features to the identified deepfake detection model corresponding to the subject ID; and responsive to an output of the deepfake detection model exceeding a determined deepfake threshold, generate a notification, in response to the query, indicating a possible deepfake attack corresponding to the target subject of interest.

Type: Application

Filed: September 22, 2021

Publication date: January 6, 2022

Applicant: Intel Corporation

Inventors: Georg Stemmer, Carl Marshall, Satyam Srivastava, Ilke Demir
Continuous topic detection and adaption in audio environments

Patent number: 11031005

Abstract: A mechanism is described for facilitating continuous topic detection and adaption in audio environments, according to one embodiment. A method of embodiments, as described herein, includes detecting a term relating to a topic in an audio input received from one or more microphones of the computing device including a voice-enabled device; analyzing the term based on the topic to determine an action to be performed by the computing device; and triggering an event to facilitate the computing device to perform the action consistent with the term and the topic.

Type: Grant

Filed: December 17, 2018

Date of Patent: June 8, 2021

Assignee: INTEL CORPORATION

Inventors: Georg Stemmer, Andrzej Mialkowski, Joachim Hofer, Piotr Rozen, Tomasz Szmelczynski
FIXED POINT INTEGER IMPLEMENTATIONS FOR NEURAL NETWORKS

Publication number: 20210004686

Abstract: Techniques related to implementing neural networks for speech recognition systems are discussed. Such techniques may include processing a node of the neural network by determining a score for the node as a product of weights and inputs such that the weights are fixed point integer values, applying a correction to the score based on a correction value associated with at least one of the weights, and generating an output from the node based on the corrected score.

Type: Application

Filed: September 24, 2020

Publication date: January 7, 2021

Applicant: Intel Corporation

Inventors: Piotr Rozen, Georg Stemmer
Fixed point integer implementations for neural networks

Patent number: 10803381

Abstract: Techniques related to implementing neural networks for speech recognition systems are discussed. Such techniques may include processing a node of the neural network by determining a score for the node as a product of weights and inputs such that the weights are fixed point integer values, applying a correction to the score based on a correction value associated with at least one of the weights, and generating an output from the node based on the corrected score.

Type: Grant

Filed: September 9, 2014

Date of Patent: October 13, 2020

Assignee: Intel Corporation

Inventors: Piotr Rozen, Georg Stemmer
Dynamic enrollment of user-defined wake-up key-phrase for speech enabled computer system

Patent number: 10672380

Abstract: Techniques are provided for wake-on-voice (WOV) key-phrase enrollment. A methodology implementing the techniques according to an embodiment includes generating a WOV key-phrase model based on identification of the sequence of sub-phonetic units of a user-provided key-phrase. The WOV key-phrase model is employed by a WOV processor for detection of the user spoken key-phrase and triggering operation of an automatic speech recognition (ASR) processor in response to the detection. The method further includes updating an ASR language model based on the user-provided key-phrase. The update includes one of embedding the WOV key-phrase model into the ASR language model, converting sub-phonetic units of the WOV key-phrase model and embedding the converted WOV key-phrase model into the ASR language model, or generating an ASR key-phrase model by applying a phoneme-syllable based statistical language model to the user-provided key-phrase and embedding the generated ASR key-phrase model into the ASR language model.

Type: Grant

Filed: December 27, 2017

Date of Patent: June 2, 2020

Assignee: Intel IP Corporation

Inventors: Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer, Joachim Hofer, Josef G. Bauer
Score trend analysis for reduced latency automatic speech recognition

Patent number: 10657952

Abstract: Techniques are provided for reducing the latency of automatic speech recognition using hypothesis score trend analysis. A methodology implementing the techniques according to an embodiment includes generating complete-phrase hypotheses and partial-phrase hypotheses, along with associated likelihood scores, based on a segment of speech. The method also includes selecting the complete-phrase hypothesis associated with the highest of the complete-phrase hypotheses likelihood scores, and selecting the partial-phrase hypothesis associated with the highest of the partial-phrase hypotheses likelihood scores. The method further includes calculating a relative likelihood score based on a ratio of the likelihood score associated with the selected complete-phrase hypothesis to the likelihood score associated with the selected partial-phrase hypothesis.

Type: Grant

Filed: February 9, 2018

Date of Patent: May 19, 2020

Assignee: Intel IP Corporation

Inventors: Joachim Hofer, Georg Stemmer, Josef G. Bauer, Munir Nikolai Alexander Georges
ADAPTIVELY RECOGNIZING SPEECH USING KEY PHRASES

Publication number: 20200090657

Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.

Type: Application

Filed: November 22, 2019

Publication date: March 19, 2020

Applicant: INTEL CORPORATION

Inventors: Krzysztof Czarnowski, Munir Nikolai Alexander Georges, Tobias Bocklet, Georg Stemmer
CONCEALING PHRASES IN AUDIO TRAVELING OVER AIR

Publication number: 20200082837

Abstract: An example apparatus for concealing phrases in audio includes a receiver to receive a detected phrase via a network. The detected phrase is based on audio captured near a source of an audio stream. The apparatus also includes a speech recognizer to generate a trigger in response to detecting that a section of the audio stream contains a confirmed phrase. The apparatus further includes a phrase concealer to conceal the section of the audio stream in response to the trigger.

Type: Application

Filed: November 14, 2019

Publication date: March 12, 2020

Inventors: Munir Nikolai Alexander Georges, Joachim Hofer, Tobias Bocklet, Josef Bauer, Georg Stemmer
SYSTEMS AND METHODS FOR ENERGY EFFICIENT AND LOW POWER DISTRIBUTED AUTOMATIC SPEECH RECOGNITION ON WEARABLE DEVICES

Publication number: 20190355379

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.

Type: Application

Filed: August 5, 2019

Publication date: November 21, 2019

Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
CONTEXT-AWARE QUERY RECOGNITION FOR ELECTRONIC DEVICES

Publication number: 20190348036

Abstract: A method for context-aware query recognition in an electronic device includes receiving user speech from an input device. A speech signal is generated from the user speech. It is determined if the speech signal includes an action to be performed and if the electronic device is the intended recipient of the user speech. If the recognized speech signal include the action and the intended recipient of the user speech is the electronic device, a command is generated for the electronic device to perform the action.

Type: Application

Filed: December 4, 2018

Publication date: November 14, 2019

Inventors: Munir Nikolai Alexander Georges, Georg Stemmer, Joachim Hofer
Method and system of automatic speech recognition using posterior confidence scores

Patent number: 10403268

Abstract: A system, article, and method include techniques of automatic speech recognition using posterior confidence scores.

Type: Grant

Filed: September 8, 2016

Date of Patent: September 3, 2019

Assignee: Intel IP Corporation

Inventors: David J. Trawick, Joachim Hofer, Josef G. Bauer, Georg Stemmer, Da-Ming Chiang
Systems and methods for energy efficient and low power distributed automatic speech recognition on wearable devices

Patent number: 10373630

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for distributed automatic speech recognition. An example apparatus includes a detector to process an input audio signal and identify a portion of the input audio signal including a sound to be evaluated, the sound to be evaluated organized into a plurality of audio features representing the sound. The example apparatus includes a quantizer to process the audio features using a quantization process to reduce the audio features to generate a reduced set of audio features for transmission. The example apparatus includes a transmitter to transmit the reduced set of audio features over a low-energy communication channel for processing.

Type: Grant

Filed: March 31, 2017

Date of Patent: August 6, 2019

Assignee: Intel Corporation

Inventors: Binuraj K. Ravindran, Francis M. Tharappel, Prabhakar R. Datta, Tobias Bocklet, Maciej Muchlinski, Tomasz Dorau, Josef G. Bauer, Saurin Shah, Georg Stemmer
Language model modification for local speech recognition systems using remote sources

Patent number: 10325590

Abstract: A language model is modified for a local speech recognition system using remote speech recognition sources. In one example, a speech utterance is received. The speech utterance is sent to at least one remote speech recognition system. Text results corresponding to the utterance are received from the remote speech recognition system. A local text result is generated using local vocabulary. The received text results and the generated text result are compared to determine words that are out of the local vocabulary and the local vocabulary is updated using the out of vocabulary words.

Type: Grant

Filed: June 26, 2015

Date of Patent: June 18, 2019

Assignee: INTEL CORPORATION

Inventors: Michael Deisher, Georg Stemmer
CONTINUOUS TOPIC DETECTION AND ADAPTION IN AUDIO ENVIRONMENTS

Publication number: 20190147875

Abstract: A mechanism is described for facilitating continuous topic detection and adaption in audio environments, according to one embodiment. A method of embodiments, as described herein, includes detecting a term relating to a topic in an audio input received from one or more microphones of the computing device including a voice-enabled device; analyzing the term based on the topic to determine an action to be performed by the computing device; and triggering an event to facilitate the computing device to perform the action consistent with the term and the topic.

Type: Application

Filed: December 17, 2018

Publication date: May 16, 2019

Applicant: Intel Corporation

Inventors: GEORG STEMMER, ANDRZEJ MIALKOWSKI, JOACHIM HOFER, PIOTR ROZEN, TOMASZ SZMELCZYNSKI

1 2 3 next