Patents by Inventor Alan Bekker

Alan Bekker has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11984114
    Abstract: Systems and methods are provided for performing speech to intent classification. The systems and methods perform operations comprising: receiving an audio file comprising speech input; processing, by a speech recognition engine, the audio file comprising the speech input to generate an initial character-based representation of the speech input; processing, by an intent classifier, the initial character-based representation of the speech input to generate an estimated intent of the speech input; and generating, by the speech recognition engine, a textual representation of the speech input based on the estimated intent of the speech input.
    Type: Grant
    Filed: October 6, 2021
    Date of Patent: May 14, 2024
    Assignee: SNAP INC.
    Inventors: Alan Bekker, Itamar Schen, Jackie Assa, Einav Itamar, Nave Algarici
  • Patent number: 11983462
    Abstract: Systems and methods are provided for providing an augmented reality experience. The systems and methods perform operations comprising: generating, for display by a messaging application, an image comprising one or more augmented reality elements, the one or more augmented reality elements being associated with a configurable entity; receiving, by the messaging application, speech input from a user; determining a schema associated with the one or more augmented reality elements; causing the speech input to be processed by a speech understanding model in accordance with the schema to determine one or more configurable state entity update values; updating the configurable entity associated with the one or more augmented reality elements based on the one or more configurable state entity update values; and modifying the one or more augmented reality elements in the image based on the updated configurable entity.
    Type: Grant
    Filed: August 31, 2021
    Date of Patent: May 14, 2024
    Assignee: SNAP INC.
    Inventors: Jackie Assa, Alan Bekker, Gilad Landau
  • Publication number: 20240062752
    Abstract: Systems and methods are provided for performing automated speech recognition. The systems and methods access a language model (LM) that includes a plurality of n-grams, each of the plurality of n-grams comprising a respective sequence of words and corresponding LM score, and receive a list of words associated with a group classification, each word in the list of words being associated with a respective weight. The systems and methods compute, based on the LM scores of the plurality of n-grams, a probability that a given word in the list of words associated with the group classification appears in an n-gram in the LM comprising an individual sequence of words, and add one or more new n-grams to the LM comprising one or more words in the list of words in combination with the individual sequence of words and associated with a particular LM score based on the computed probability.
    Type: Application
    Filed: August 22, 2022
    Publication date: February 22, 2024
    Inventors: Jackie Assa, Alan Bekker, Zach Moshe
  • Publication number: 20240021195
    Abstract: Systems and methods are provided for performing automated speech recognition. The systems and methods perform operations comprising: accessing a language model (LM) that includes a plurality of n-grams, each of the plurality of n-grams comprising a respective sequence of words and corresponding LM score; selecting a target word to boost in the language model; receiving a boosting factor for the target word; identifying a target n-gram in the language model that includes the target word; identifying a subset of n-grams of the plurality of n-grams that include words in a portion of the target n-gram; and adjusting the LM score of the target n-gram based on the LM scores of the subset of n-grams and the boosting factor.
    Type: Application
    Filed: July 14, 2022
    Publication date: January 18, 2024
    Inventors: Jackie Assa, Alan Bekker, Zach Moshe
  • Publication number: 20230326445
    Abstract: Systems and methods are provided for providing animated speech refinement. The systems and methods perform operations comprising: receiving an audio stream comprising one or more spoken words; processing the audio stream by an automated speech recognition (ASR) engine to identify base timing of one or more phonemes corresponding to the one or more spoken words; and applying a machine learning model to the base timing of the one or more phonemes to estimate an adjustment to the base timing of the one or more phonemes.
    Type: Application
    Filed: April 11, 2022
    Publication date: October 12, 2023
    Inventors: Guy Adam, Jackie Assa, Alan Bekker
  • Publication number: 20230252972
    Abstract: Systems and methods are provided for providing emotion-based text to speech. The systems and methods perform operations comprising accessing a text string; storing a plurality of embeddings associated with a plurality of speakers, a first embedding for a first speaker being associated with a first emotion and a second embedding for a second speaker of the plurality of speakers being associated with a second emotion; selecting the first speaker to speak one or more words of the text string; determining that the one or more words are associated with the second emotion; generating, based on the first embedding and the second embedding, a third embedding for the first speaker associated with the second emotion; and applying the third embedding and the text string to a vocoder to generate an audio stream comprising the one or more words being spoken by the first speaker with the second emotion.
    Type: Application
    Filed: February 8, 2022
    Publication date: August 10, 2023
    Inventors: Liron Harazi, Jackie Assa, Alan Bekker
  • Publication number: 20230197064
    Abstract: Systems and methods are provided for extracting entities from received speech. The systems and methods perform operations comprising receiving an audio file comprising speech input and processing, by a speech recognition engine, the audio file comprising the speech input to generate an initial character-based representation of the speech input. The operations further comprise processing, by an entity extractor, the initial character-based representation of the speech input to generate an estimated set of entities of the speech input. The operations further comprise generating, by the speech recognition engine, a textual representation of the speech input based on the estimated set of entities of the speech input.
    Type: Application
    Filed: December 17, 2021
    Publication date: June 22, 2023
    Inventors: Alan Bekker, Jackie Assa, Itamar Schen, Einav Itamar
  • Publication number: 20230104583
    Abstract: Systems and methods are provided for performing speech to intent classification. The systems and methods perform operations comprising: receiving an audio file comprising speech input; processing, by a speech recognition engine, the audio file comprising the speech input to generate an initial character-based representation of the speech input; processing, by an intent classifier, the initial character-based representation of the speech input to generate an estimated intent of the speech input; and generating, by the speech recognition engine, a textual representation of the speech input based on the estimated intent of the speech input.
    Type: Application
    Filed: October 6, 2021
    Publication date: April 6, 2023
    Inventors: Alan Bekker, Itamar Schen, Jackie Assa, Einav Itamar, Nave Algarici
  • Publication number: 20230067305
    Abstract: Systems and methods are provided for providing an augmented reality experience. The systems and methods perform operations comprising: generating, for display by a messaging application, an image comprising one or more augmented reality elements, the one or more augmented reality elements being associated with a configurable entity; receiving, by the messaging application, speech input from a user; determining a schema associated with the one or more augmented reality elements; causing the speech input to be processed by a speech understanding model in accordance with the schema to determine one or more configurable state entity update values; updating the configurable entity associated with the one or more augmented reality elements based on the one or more configurable state entity update values; and modifying the one or more augmented reality elements in the image based on the updated configurable entity.
    Type: Application
    Filed: August 31, 2021
    Publication date: March 2, 2023
    Inventors: Jackie Assa, Alan Bekker, Gilad Landau