Patents by Inventor Pedro J. Mengibar

Pedro J. Mengibar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Context-based speech recognition

Patent number: 9311915

Abstract: A processing system receives an audio signal encoding a portion of an utterance. The processing system receives context information associated with the utterance, wherein the context information is not derived from the audio signal or any other audio signal. The processing system provides, as input to a neural network, data corresponding to the audio signal and the context information, and generates a transcription for the utterance based on at least an output of the neural network.

Type: Grant

Filed: September 18, 2013

Date of Patent: April 12, 2016

Assignee: Google Inc.

Inventors: Eugene Weinstein, Pedro J. Mengibar, Johan Schalkwyk
Utterance selection for automated speech recognizer training

Patent number: 9263033

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a set of training utterances. The methods, systems, and apparatus include actions of obtaining a target multi-dimensional distribution of characteristics in an initial set of candidate utterances and selecting a subset of the initial set of candidate utterances based on speech recognition confidence scores associated with the candidate utterances. Additional actions include selecting a particular candidate utterance from the subset of the initial set of utterances and determining that adding the particular candidate utterance to a set of training utterances reduces a divergence of a multi-dimensional distribution of the characteristics in the set of training utterances from the target multi-dimensional distribution. Further actions include adding the particular candidate utterance to the set of training utterances.

Type: Grant

Filed: June 25, 2014

Date of Patent: February 16, 2016

Assignee: Google Inc.

Inventors: Olivier Siohan, Pedro J. Mengibar
UTTERANCE SELECTION FOR AUTOMATED SPEECH RECOGNIZER TRAINING

Publication number: 20150379983

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a set of training utterances. The methods, systems, and apparatus include actions of obtaining a target multi-dimensional distribution of characteristics in an initial set of candidate utterances and selecting a subset of the initial set of candidate utterances based on speech recognition confidence scores associated with the candidate utterances. Additional actions include selecting a particular candidate utterance from the subset of the initial set of utterances and determining that adding the particular candidate utterance to a set of training utterances reduces a divergence of a multi-dimensional distribution of the characteristics in the set of training utterances from the target multi-dimensional distribution. Further actions include adding the particular candidate utterance to the set of training utterances.

Type: Application

Filed: June 25, 2014

Publication date: December 31, 2015

Inventors: Olivier Siohan, Pedro J. Mengibar
Automatic language model update

Patent number: 9159316

Abstract: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model, includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.

Type: Grant

Filed: May 7, 2013

Date of Patent: October 13, 2015

Assignee: Google Inc.

Inventors: Michael H. Cohen, Shumeet Baluja, Pedro J. Mengibar
Bootstrapping named entity canonicalizers from English using alignment models

Patent number: 9146919

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training recognition canonical representations corresponding to named-entity phrases in a second natural language based on translating a set of allowable expressions with canonical representations from a first natural language, which may be generated by expanding a context-free grammar for the allowable expressions for the first natural language.

Type: Grant

Filed: March 14, 2013

Date of Patent: September 29, 2015

Assignee: Google Inc.

Inventors: Mark Edward Epstein, Pedro J. Mengibar
Multiple recognizer speech recognition

Patent number: 9058805

Abstract: The subject matter of this specification can be embodied in, among other things, a method that includes receiving audio data that corresponds to an utterance, obtaining a first transcription of the utterance that was generated using a limited speech recognizer. The limited speech recognizer includes a speech recognizer that includes a language model that is trained over a limited speech recognition vocabulary that includes one or more terms from a voice command grammar, but that includes fewer than all terms of an expanded grammar. A second transcription of the utterance is obtained that was generated using an expanded speech recognizer. The expanded speech recognizer includes a speech recognizer that includes a language model that is trained over an expanded speech recognition vocabulary that includes all of the terms of the expanded grammar. The utterance is classified based at least on a portion of the first transcription or the second transcription.

Type: Grant

Filed: May 13, 2013

Date of Patent: June 16, 2015

Assignee: Google Inc.

Inventors: Petar Aleksic, Pedro J. Mengibar, Fadi Biadsy
Increasing semantic coverage with semantically irrelevant insertions

Patent number: 9020809

Abstract: A method includes accessing data specifying a set of actions, each action defining a user device operation and for each action: accessing a corresponding set of command sentences for the action, determining first n-grams in the set of command sentences that are semantically relevant for the action, determining second n-grams in the set of command sentences that are semantically irrelevant for the action, generating a training set of command sentences from the corresponding set of command sentences, the generating the training set of command sentences including removing each second n-gram from each sentence in the corresponding set of command sentences for the action, and generating a command model from the training set of command sentences configured to generate an action score for the action for an input sentence based on: first n-grams for the action, and second n-grams for the action that are also second n-grams for all other actions.

Type: Grant

Filed: February 28, 2013

Date of Patent: April 28, 2015

Assignee: Google Inc.

Inventors: Pedro J. Mengibar, Mark Edward Epstein, Fadi Biadsy
BOOTSTRAPPING NAMED ENTITY CANONICALIZERS FROM ENGLISH USING ALIGNMENT MODELS

Publication number: 20140200876

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training recognition canonical representations corresponding to named-entity phrases in a second natural language based on translating a set of allowable expressions with canonical representations from a first natural language, which may be generated by expanding a context-free grammar for the allowable expressions for the first natural language.

Type: Application

Filed: March 14, 2013

Publication date: July 17, 2014

Applicant: Google Inc.

Inventors: Mark Edward Epstein, Pedro J. Mengibar
Dictionary filtering using market data

Patent number: 8473293

Abstract: This specification describes technologies relating to system, methods, and articles for updating a speech recognition dictionary based on, at least in part, both search query and market data metrics. In general, one innovative aspect of the subject matter described in this specification can be embodied in a method comprising (i) identifying a candidate term for possible inclusion in a speech recognition dictionary, (ii) identifying at least one search query metric associated with the identified candidate term, (iii) identifying at least one market data metric associated with the identified candidate term, and (iv) generating a candidate term score for the identified candidate term based, at least in part, on a weighted combination of the at least one identified search query metric and the at least one identified market data metric.

Type: Grant

Filed: October 23, 2012

Date of Patent: June 25, 2013

Assignee: Google Inc.

Inventors: Pedro J. Mengibar, Jeffrey S. Sorensen