Patents by Inventor Alan Bekker

Alan Bekker has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speech to intent

Patent number: 11984114

Abstract: Systems and methods are provided for performing speech to intent classification. The systems and methods perform operations comprising: receiving an audio file comprising speech input; processing, by a speech recognition engine, the audio file comprising the speech input to generate an initial character-based representation of the speech input; processing, by an intent classifier, the initial character-based representation of the speech input to generate an estimated intent of the speech input; and generating, by the speech recognition engine, a textual representation of the speech input based on the estimated intent of the speech input.
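
The flow in this abstract can be illustrated with a toy two-pass sketch. Everything here is an assumption made for illustration: the n-best list, the scores, the per-intent vocabularies, and the rescoring rule are hypothetical stand-ins, not the engine or classifier the patent describes.

```python
# Hypothetical n-best hypotheses a recognizer might emit for one utterance,
# with made-up scores (higher is better).
NBEST = [("play sum music", 0.52), ("play some music", 0.48)]

# Hypothetical per-intent vocabularies used to bias the second pass.
INTENT_VOCAB = {
    "play_music": {"play", "some", "music"},
    "get_weather": {"weather", "rain"},
}

def first_pass(nbest):
    """Return the top-scoring hypothesis as the initial representation."""
    return max(nbest, key=lambda h: h[1])[0]

def classify_intent(text):
    """Toy intent classifier: pick the intent whose vocabulary overlaps most."""
    words = set(text.split())
    return max(INTENT_VOCAB, key=lambda i: len(words & INTENT_VOCAB[i]))

def second_pass(nbest, intent, bonus=0.05):
    """Rescore hypotheses, adding a bonus for each word in the intent vocabulary."""
    vocab = INTENT_VOCAB[intent]

    def score(hypothesis):
        text, base = hypothesis
        return base + bonus * sum(w in vocab for w in text.split())

    return max(nbest, key=score)[0]

initial = first_pass(NBEST)             # initial representation
intent = classify_intent(initial)       # estimated intent
final = second_pass(NBEST, intent)      # text regenerated using the intent
```

In this toy run the estimated intent tips the second pass toward the hypothesis that better matches the intent's vocabulary.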

Type: Grant

Filed: October 6, 2021

Date of Patent: May 14, 2024

Assignee: SNAP INC.

Inventors: Alan Bekker, Itamar Schen, Jackie Assa, Einav Itamar, Nave Algarici

Conversation guided augmented reality experience

Patent number: 11983462

Abstract: Systems and methods are provided for providing an augmented reality experience. The systems and methods perform operations comprising: generating, for display by a messaging application, an image comprising one or more augmented reality elements, the one or more augmented reality elements being associated with a configurable entity; receiving, by the messaging application, speech input from a user; determining a schema associated with the one or more augmented reality elements; causing the speech input to be processed by a speech understanding model in accordance with the schema to determine one or more configurable state entity update values; updating the configurable entity associated with the one or more augmented reality elements based on the one or more configurable state entity update values; and modifying the one or more augmented reality elements in the image based on the updated configurable entity.
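
A minimal sketch of the schema-driven update step follows. The schema layout, the slot names, and the keyword matcher are hypothetical; the matcher only stands in for the speech understanding model, whose design the abstract does not describe.

```python
def extract_updates(transcript, schema):
    """Return {slot: value} for every schema value mentioned in the transcript.

    A keyword match is used here purely as a placeholder for a real
    speech understanding model.
    """
    words = transcript.lower().split()
    return {
        slot: value
        for slot, values in schema.items()
        for value in values
        if value in words
    }

# Hypothetical schema for one augmented reality element (a virtual hat).
schema = {"color": ["red", "blue"], "size": ["small", "large"]}

# Current state of the configurable entity.
entity = {"color": "red", "size": "small"}

updates = extract_updates("make the hat blue and large", schema)
entity = {**entity, **updates}  # updated entity that would drive re-rendering
```

The schema restricts which values the model may return, so the entity can only move between states the augmented reality element knows how to render.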

Type: Grant

Filed: August 31, 2021

Date of Patent: May 14, 2024

Assignee: SNAP INC.

Inventors: Jackie Assa, Alan Bekker, Gilad Landau

GROUPING SIMILAR WORDS IN A LANGUAGE MODEL

Publication number: 20240062752

Abstract: Systems and methods are provided for performing automated speech recognition. The systems and methods access an LM that includes a plurality of n-grams, each of the plurality of n-grams comprising a respective sequence of words and corresponding LM score, and receive a list of words associated with a group classification, each word in the list of words being associated with a respective weight. The systems and methods compute, based on the LM scores of the plurality of n-grams, a probability that a given word in the list of words associated with the group classification appears in an n-gram in the LM comprising an individual sequence of words, and add one or more new n-grams to the LM comprising one or more words in the list of words in combination with the individual sequence of words and associated with a particular LM score based on the computed probability.
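
One possible reading of this abstract is sketched below. It assumes the LM stores plain probabilities keyed by word tuples, that the group probability for a context is the summed probability of the group words already seen after that context, and that this mass is shared out in proportion to the word weights. The function name and the formula are assumptions; the abstract does not give the exact computation.

```python
def add_group_ngrams(lm, history, group_weights):
    """Return a copy of lm with one n-gram per group word after `history`.

    lm maps word tuples to probabilities. The probability mass the group
    already holds after `history` is redistributed across all group words
    in proportion to their weights (an assumed rule).
    """
    group_mass = sum(
        p for ngram, p in lm.items()
        if ngram[:-1] == history and ngram[-1] in group_weights
    )
    total_weight = sum(group_weights.values())
    updated = dict(lm)
    for word, weight in group_weights.items():
        updated[history + (word,)] = group_mass * weight / total_weight
    return updated

# Toy LM: "oslo" never appears after "fly to" in the original model.
lm = {
    ("fly", "to", "paris"): 0.3,
    ("fly", "to", "rome"): 0.1,
    ("fly", "to", "the"): 0.2,
}
cities = {"paris": 1.0, "rome": 1.0, "oslo": 2.0}  # hypothetical weights
new_lm = add_group_ngrams(lm, ("fly", "to"), cities)
```

Here the unseen word "oslo" gains an n-gram after "fly to" because other members of its group already appear in that context, while n-grams outside the group are left unchanged.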

Type: Application

Filed: August 22, 2022

Publication date: February 22, 2024

Inventors: Jackie Assa, Alan Bekker, Zach Moshe

BOOSTING WORDS IN AUTOMATED SPEECH RECOGNITION

Publication number: 20240021195

Abstract: Systems and methods are provided for performing automated speech recognition. The systems and methods perform operations comprising: accessing a language model that includes a plurality of n-grams, each of the plurality of n-grams comprising a respective sequence of words and corresponding LM score; selecting a target word to boost in the language model; receiving a boosting factor for the target word; identifying a target n-gram in the language model that includes the target word; identifying a subset of n-grams of the plurality of n-grams that include words in a portion of the target n-gram; and adjusting the LM score of the target n-gram based on the LM scores of the subset of n-grams and the boosting factor.
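
The adjustment step can be sketched as follows, under stated assumptions: LM scores are log10 probabilities, the "subset of n-grams" is taken to be the n-grams that share the target's history, and the boosted scores are renormalized so that subset keeps its original total probability. None of these choices is confirmed by the abstract, and `boost_ngram` is a hypothetical name.

```python
import math

def boost_ngram(lm, target, factor):
    """Return a copy of lm with `target` boosted by `factor`.

    lm maps word tuples to log10 probabilities. The target's probability
    is multiplied by `factor`, then every n-gram sharing the target's
    history is rescaled so their combined probability is unchanged.
    """
    history = target[:-1]
    siblings = [
        ngram for ngram in lm
        if len(ngram) == len(target) and ngram[:-1] == history
    ]
    probs = {ngram: 10 ** lm[ngram] for ngram in siblings}
    total_before = sum(probs.values())
    probs[target] *= factor
    scale = total_before / sum(probs.values())
    boosted = dict(lm)
    for ngram, p in probs.items():
        boosted[ngram] = math.log10(p * scale)
    return boosted

# Toy LM with made-up probabilities.
lm = {
    ("call", "mom"): math.log10(0.2),
    ("call", "bob"): math.log10(0.6),
    ("text", "mom"): math.log10(0.5),
}
boosted = boost_ngram(lm, ("call", "mom"), factor=3.0)
```

With a boosting factor of 3, "call mom" rises from 0.2 to 0.4 and "call bob" falls from 0.6 to 0.4, so the mass assigned after "call" stays at 0.8 and n-grams with other histories are untouched.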

Type: Application

Filed: July 14, 2022

Publication date: January 18, 2024

Inventors: Jackie Assa, Alan Bekker, Zach Moshe

ANIMATED SPEECH REFINEMENT USING MACHINE LEARNING

Publication number: 20230326445

Abstract: Systems and methods are provided for providing animated speech refinement. The systems and methods perform operations comprising: receiving an audio stream comprising one or more spoken words; processing the audio stream by an automated speech recognition (ASR) engine to identify base timing of one or more phonemes corresponding to the one or more spoken words; and applying a machine learning model to the base timing of the one or more phonemes to estimate an adjustment to the base timing of the one or more phonemes.

Type: Application

Filed: April 11, 2022

Publication date: October 12, 2023

Inventors: Guy Adam, Jackie Assa, Alan Bekker

EMOTION-BASED TEXT TO SPEECH

Publication number: 20230252972

Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.

Type: Application

Filed: February 8, 2022

Publication date: August 10, 2023

Inventors: Liron Harazi, Jackie Assa, Alan Bekker

SPEECH TO ENTITY

Publication number: 20230197064

Abstract: Systems and methods are provided for extracting entities from received speech. The systems and methods perform operations comprising receiving an audio file comprising speech input and processing, by a speech recognition engine, the audio file comprising the speech input to generate an initial character-based representation of the speech input. The operations further comprise processing, by an entity extractor, the initial character-based representation of the speech input to generate an estimated set of entities of the speech input. The operations further comprise generating, by the speech recognition engine, a textual representation of the speech input based on the estimated set of entities of the speech input.

Type: Application

Filed: December 17, 2021

Publication date: June 22, 2023

Inventors: Alan Bekker, Jackie Assa, Itamar Schen, Einav Itamar

SPEECH TO INTENT

Publication number: 20230104583

Abstract: Systems and methods are provided for performing speech to intent classification. The systems and methods perform operations comprising: receiving an audio file comprising speech input; processing, by a speech recognition engine, the audio file comprising the speech input to generate an initial character-based representation of the speech input; processing, by an intent classifier, the initial character-based representation of the speech input to generate an estimated intent of the speech input; and generating, by the speech recognition engine, a textual representation of the speech input based on the estimated intent of the speech input.

Type: Application

Filed: October 6, 2021

Publication date: April 6, 2023

Inventors: Alan Bekker, Itamar Schen, Jackie Assa, Einav Itamar, Nave Algarici

CONVERSATION GUIDED AUGMENTED REALITY EXPERIENCE

Publication number: 20230067305

Abstract: Systems and methods are provided for providing an augmented reality experience. The systems and methods perform operations comprising: generating, for display by a messaging application, an image comprising one or more augmented reality elements, the one or more augmented reality elements being associated with a configurable entity; receiving, by the messaging application, speech input from a user; determining a schema associated with the one or more augmented reality elements; causing the speech input to be processed by a speech understanding model in accordance with the schema to determine one or more configurable state entity update values; updating the configurable entity associated with the one or more augmented reality elements based on the one or more configurable state entity update values; and modifying the one or more augmented reality elements in the image based on the updated configurable entity.

Type: Application

Filed: August 31, 2021

Publication date: March 2, 2023

Inventors: Jackie Assa, Alan Bekker, Gilad Landau