Patents by Inventor Leonid Velikovich
Leonid Velikovich has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250061889Abstract: A method includes receiving audio data corresponding to a query spoken and processing the audio data to generate multiple candidate hypotheses each represented by a respective sequence of hypothesized terms. For each candidate hypothesis, the method includes determining whether the sequence of hypothesized terms includes a source phrase from a list of phrase correction pairs. Each phrase correction pair includes a corresponding source phrase that was misrecognized and a corresponding target phrase replacing the source phrase. When the respective sequence of hypothesized terms includes the source phrase, the method includes generating a corresponding additional candidate hypothesis that replaces the source phrase.Type: ApplicationFiled: November 1, 2024Publication date: February 20, 2025Applicant: Google LLCInventors: Ágoston Weisz, Leonid Velikovich
-
Publication number: 20250046296Abstract: A method, device, and computer-readable storage medium for predicting pronunciation of a text sample. The method includes selecting a predicted text sample corresponding to an audio sample, receiving a correction text sample corresponding to the audio sample, updating an encoding of allowable pronunciations of the correction text sample based on the predicted text sample and the audio sample, the updated encoding of allowable pronunciations of the correction text sample including a pronunciation of the predicted text sample, and predicting a pronunciation of the correction text sample based on the updated encoding of allowable pronunciations of the correction text sample.Type: ApplicationFiled: July 31, 2023Publication date: February 6, 2025Applicant: GOOGLE LLCInventors: Leonid VELIKOVICH, Ágoston WEISZ
-
Publication number: 20240428785Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.Type: ApplicationFiled: September 4, 2024Publication date: December 26, 2024Applicant: Google LLCInventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Patent number: 12154549Abstract: A method includes receiving audio data corresponding to a query spoken and processing the audio data to generate multiple candidate hypotheses each represented by a respective sequence of hypothesized terms. For each candidate hypothesis, the method includes determining whether the sequence of hypothesized terms includes a source phrase from a list of phrase correction pairs. Each phrase correction pair includes a corresponding source phrase that was misrecognized and a corresponding target phrase replacing the source phrase. When the respective sequence of hypothesized terms includes the source phrase, the method includes generating a corresponding additional candidate hypothesis that replaces the source phrase.Type: GrantFiled: December 15, 2021Date of Patent: November 26, 2024Assignee: Google LLCInventors: Ágoston Weisz, Leonid Velikovich
-
Patent number: 11797772Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.Type: GrantFiled: January 31, 2022Date of Patent: October 24, 2023Assignee: GOOGLE LLCInventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
-
Publication number: 20230186898Abstract: A method includes receiving audio data corresponding to a query spoken and processing the audio data to generate multiple candidate hypotheses each represented by a respective sequence of hypothesized terms. For each candidate hypothesis, the method includes determining whether the sequence of hypothesized terms includes a source phrase from a list of phrase correction pairs. Each phrase correction pair includes a corresponding source phrase that was misrecognized and a corresponding target phrase replacing the source phrase. When the respective sequence of hypothesized terms includes the source phrase, the method includes generating a corresponding additional candidate hypothesis that replaces the source phrase.Type: ApplicationFiled: December 15, 2021Publication date: June 15, 2023Applicant: Google LLCInventors: Ágoston Weisz, Leonid Velikovich
-
Publication number: 20220310082Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.Type: ApplicationFiled: June 16, 2022Publication date: September 29, 2022Applicant: Google LLCInventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Publication number: 20220229992Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.Type: ApplicationFiled: January 31, 2022Publication date: July 21, 2022Inventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
-
Patent number: 11386889Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.Type: GrantFiled: November 27, 2019Date of Patent: July 12, 2022Assignee: Google LLCInventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Patent number: 11238227Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.Type: GrantFiled: June 27, 2019Date of Patent: February 1, 2022Assignee: Google LLCInventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
-
Publication number: 20210064822Abstract: Speech processing techniques are disclosed that enable determining a text representation of named entities in captured audio data. Various implementations include determining the location of a carrier phrase in a word lattice representation of the captured audio data, where the carrier phrase provides an indication of a named entity. Additional or alternative implementations include matching a candidate named entity with the portion of the word lattice, and augmenting the word lattice with the matched candidate named entity.Type: ApplicationFiled: June 27, 2019Publication date: March 4, 2021Inventors: Leonid Velikovich, Petar Aleksic, Pedro Moreno
-
Publication number: 20200175969Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for implementing contextual grammar selection are disclosed. In one aspect, a method includes the actions of receiving audio data of an utterance. The actions include generating a word lattice that includes multiple candidate transcriptions of the utterance and that includes transcription confidence scores. The actions include determining a context of the computing device. The actions include based on the context of the computing device, identifying grammars that correspond to the multiple candidate transcriptions. The actions include determining, for each of the multiple candidate transcriptions, grammar confidence scores that reflect a likelihood that a respective grammar is a match for a respective candidate transcription. The actions include selecting, from among the candidate transcriptions, a candidate transcription.Type: ApplicationFiled: November 27, 2019Publication date: June 4, 2020Inventors: Petar Aleksic, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Patent number: 10529322Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for tagging during speech recognition. A word lattice that indicates probabilities for sequences of words in an utterance is obtained. A conditional probability transducer that indicates a frequency that sequences of both the words and semantic tags for the words appear is obtained. The word lattice and the conditional probability transducer are composed to construct a word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words. The word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words is used to generate a transcription that includes the words in the utterance and the semantic tags for the words.Type: GrantFiled: August 21, 2017Date of Patent: January 7, 2020Assignee: Google LLCInventors: Petar Aleksic, Michael D. Riley, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Publication number: 20180366112Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for tagging during speech recognition. A word lattice that indicates probabilities for sequences of words in an utterance is obtained. A conditional probability transducer that indicates a frequency that sequences of both the words and semantic tags for the words appear is obtained. The word lattice and the conditional probability transducer are composed to construct a word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words. The word lattice that indicates probabilities for sequences of both the words in the utterance and the semantic tags for the words is used to generate a transcription that includes the words in the utterance and the semantic tags for the words.Type: ApplicationFiled: August 21, 2017Publication date: December 20, 2018Inventors: Petar Aleksic, Michael D. Riley, Pedro J. Moreno Mengibar, Leonid Velikovich
-
Patent number: 9135912Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for updating phonetic dictionaries. In one aspect, a method includes accessing a phonetic dictionary that identifies terms and one or more phonetic representations associated with each term, determining that a particular term that is identified in the phonetic dictionary is a spelling correction for another term that is identified in the phonetic dictionary, and storing, in the phonetic dictionary, one or more of the phonetic representations associated with the other term, with the particular term that is a spelling correction for the other term.Type: GrantFiled: September 19, 2012Date of Patent: September 15, 2015Assignee: Google Inc.Inventors: Brian Strope, Leonid Velikovich