Patents by Inventor Leonid Velikovich

Leonid Velikovich has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Word lattice augmentation for automatic speech recognition

Patent number: 11797772

Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

Type: Grant

Filed: January 31, 2022

Date of Patent: October 24, 2023

Assignee: GOOGLE LLC

Inventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
Lattice Speech Corrections

Publication number: 20230186898

Abstract: A method includes receiving audio data corresponding to a query spoken and processing the audio data to generate multiple candidate hypotheses each represented by a respective sequence of hypothesized terms. For each candidate hypothesis, the method includes determining whether the sequence of hypothesized terms includes a source phrase from a list of phrase correction pairs. Each phrase correction pair includes a corresponding source phrase that was misrecognized and a corresponding target phrase replacing the source phrase. When the respective sequence of hypothesized terms includes the source phrase, the method includes generating a corresponding additional candidate hypothesis that replaces the source phrase.

Type: Application

Filed: December 15, 2021

Publication date: June 15, 2023

Applicant: Google LLC

Inventors: Ágoston Weisz, Leonid Velikovich
CONTEXTUAL TAGGING AND BIASING OF GRAMMARS INSIDE WORD LATTICES

Publication number: 20220310082

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.

Type: Application

Filed: June 16, 2022

Publication date: September 29, 2022

Applicant: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
WORD LATTICE AUGMENTATION FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20220229992

Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

Type: Application

Filed: January 31, 2022

Publication date: July 21, 2022

Inventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
Contextual tagging and biasing of grammars inside word lattices

Patent number: 11386889

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.

Type: Grant

Filed: November 27, 2019

Date of Patent: July 12, 2022

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
Word lattice augmentation for automatic speech recognition

Patent number: 11238227

Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

Type: Grant

Filed: June 27, 2019

Date of Patent: February 1, 2022

Assignee: Google LLC

Inventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
WORD LATTICE AUGMENTATION FOR AUTOMATIC SPEECH RECOGNITION

Publication number: 20210064822

Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.

Type: Application

Filed: June 27, 2019

Publication date: March 4, 2021

Inventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
SPEECH INPUT PROCESSING

Publication number: 20200175969

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.

Type: Application

Filed: November 27, 2019

Publication date: June 4, 2020

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
Semantic model for tagging of word lattices

Patent number: 10529322

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for tagging during speech recognition. A word lattice that indicates probabilities for sequences of words in an utterance is obtained. A conditional probability transducer that indicates a frequency that sequences of both the words and semantic tags for the words appear is obtained. The word lattice and the conditional probability transducer are composed to construct a word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words. The word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words is used to generate a transcription that includes the words in the utterance and the semantic tags for the words.

Type: Grant

Filed: August 21, 2017

Date of Patent: January 7, 2020

Assignee: Google LLC

Inventors: Petar Aleksic, Michael D. Riley, Pedro J. Moreno Mengibar, Leonid Velikovich
SEMANTIC MODEL FOR TAGGING OF WORD LATTICES

Publication number: 20180366112

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for tagging during speech recognition. A word lattice that indicates probabilities for sequences of words in an utterance is obtained. A conditional probability transducer that indicates a frequency that sequences of both the words and semantic tags for the words appear is obtained. The word lattice and the conditional probability transducer are composed to construct a word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words. The word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words is used to generate a transcription that includes the words in the utterance and the semantic tags for the words.

Type: Application

Filed: August 21, 2017

Publication date: December 20, 2018

Inventors: Petar Aleksic, Michael D. Riley, Pedro J. Moreno Mengibar, Leonid Velikovich
Updating phonetic dictionaries

Patent number: 9135912

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for updating phonetic dictionaries. In one aspect, a method includes accessing a phonetic dictionary that identifies terms and one or more phonetic representations associated with each term, determining that a particular term that is identified in the phonetic dictionary is a spelling correction for another term that is identified in the phonetic dictionary, and storing, in the phonetic dictionary, one or more of the phonetic representations associated with the other term, with the particular term that is a spelling correction for the other term.

Type: Grant

Filed: September 19, 2012

Date of Patent: September 15, 2015

Assignee: Google Inc.

Inventors: Brian Strope, Leonid Velikovich