Patents by Inventor Dmitry Osmakov

Dmitry Osmakov has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ACOUSTIC MODEL TRAINING USING CORRECTED TERMS

Publication number: 20230274729

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Application

Filed: May 4, 2023

Publication date: August 31, 2023

Applicant: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Acoustic model training using corrected terms

Patent number: 11682381

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Grant

Filed: December 2, 2021

Date of Patent: June 20, 2023

Assignee: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
ACOUSTIC MODEL TRAINING USING CORRECTED TERMS

Publication number: 20220093080

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Application

Filed: December 2, 2021

Publication date: March 24, 2022

Applicant: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Acoustic model training using corrected terms

Patent number: 11200887

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Grant

Filed: April 1, 2020

Date of Patent: December 14, 2021

Assignee: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
ACOUSTIC MODEL TRAINING USING CORRECTED TERMS

Publication number: 20200243070

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Application

Filed: April 1, 2020

Publication date: July 30, 2020

Applicant: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Acoustic model training using corrected terms

Patent number: 10643603

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Grant

Filed: June 29, 2018

Date of Patent: May 5, 2020

Assignee: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
ACOUSTIC MODEL TRAINING USING CORRECTED TERMS

Publication number: 20180308471

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Application

Filed: June 29, 2018

Publication date: October 25, 2018

Applicant: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Follow-up voice query prediction

Patent number: 10026398

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting follow-up queries to an initial transcription of an utterance. In some implementations, one or more follow-up queries that are pre-associated with a transcription of an initial utterance of a user are identified. A new or modified language model in which a respective probability associated with one or more of the follow-up queries is increased with respect to an initial language model is obtained. Subsequent audio data corresponding to a subsequent utterance of the user is then received. The subsequent audio data is processed using the new or modified language model to generate a transcription of the subsequent utterance. The transcription of the subsequent utterance is then provided for output to the user.

Type: Grant

Filed: July 8, 2016

Date of Patent: July 17, 2018

Assignee: Google LLC

Inventors: Behshad Behzadi, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Acoustic model training using corrected terms

Patent number: 10019986

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Grant

Filed: July 29, 2016

Date of Patent: July 10, 2018

Assignee: Google LLC

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
ACOUSTIC MODEL TRAINING USING CORRECTED TERMS

Publication number: 20180033426

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving first audio data corresponding to an utterance; obtaining a first transcription of the first audio data; receiving data indicating (i) a selection of one or more terms of the first transcription and (ii) one or more of replacement terms; determining that one or more of the replacement terms are classified as a correction of one or more of the selected terms; in response to determining that the one or more of the replacement terms are classified as a correction of the one or more of the selected terms, obtaining a first portion of the first audio data that corresponds to one or more terms of the first transcription; and using the first portion of the first audio data that is associated with the one or more terms of the first transcription to train an acoustic model for recognizing the one or more of the replacement terms.

Type: Application

Filed: July 29, 2016

Publication date: February 1, 2018

Applicant: Google Inc.

Inventors: Olga Kapralova, Evgeny A. Cherepanov, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
FOLLOW-UP VOICE QUERY PREDICTION

Publication number: 20180012594

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for predicting follow-up queries to an initial transcription of an utterance. In some implementations, one or more follow-up queries that are pre-associated with a transcription of an initial utterance of a user are identified. A new or modified language model in which a respective probability associated with one or more of the follow-up queries is increased with respect to an initial language model is obtained. Subsequent audio data corresponding to a subsequent utterance of the user is then received. The subsequent audio data is processed using the new or modified language model to generate a transcription of the subsequent utterance. The transcription of the subsequent utterance is then provided for output to the user.

Type: Application

Filed: July 8, 2016

Publication date: January 11, 2018

Inventors: Behshad Behzadi, Dmitry Osmakov, Martin Baeuml, Gleb Skobeltsyn
Contextual improvement of voice query recognition

Patent number: 9576578

Abstract: Methods, including computer programs encoded on a computer storage medium, for collaborative language model biasing. In one aspect, a method includes: obtaining (i) one or more initial candidate transcriptions, and (ii) one or more terms that are associated with a context; selecting one or more of the terms that are associated with the context, and that (i) do not occur in the candidate transcriptions, and (ii) are indicated as phonetically similar to one or more terms that do occur in the initial candidate transcriptions; generating one or more additional candidate transcriptions based on the (i) initial candidate transcriptions, and (ii) the selected terms; and providing the one or more additional candidate transcriptions to an automated speech recognizer.

Type: Grant

Filed: August 12, 2015

Date of Patent: February 21, 2017

Assignee: Google Inc.

Inventors: Gleb Skobeltsyn, Alexandru Ovidiu Dovlecel, Carl-Anton Ingmarsson, Martin Baeuml, Behshad Behzadi, Dmitry Osmakov