Patents by Inventor Pedro J. Moreno Mengibar

Pedro J. Moreno Mengibar has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Dynamic language model

Patent number: 10140362

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for speech recognition. One of the methods includes receiving a base language model for speech recognition including a first word sequence having a base probability value; receiving a voice search query associated with a query context; determining that a customized language model is to be used when the query context satisfies one or more criteria associated with the customized language model; obtaining the customized language model, the customized language model including the first word sequence having an adjusted probability value being the base probability value adjusted according to the query context; and converting the voice search query to a text search query based on one or more probabilities, each of the probabilities corresponding to a word sequence in a group of one or more word sequences, the group including the first word sequence having the adjusted probability value.

Type: Grant

Filed: August 8, 2016

Date of Patent: November 27, 2018

Assignee: Google LLC

Inventors: Pedro J. Moreno Mengibar, Michael H. Cohen
Determining Dialog States for Language Models

Publication number: 20180336895

Abstract: Systems, methods, devices, and other techniques are described herein for determining dialog states that correspond to voice inputs and for biasing a language model based on the determined dialog states. In some implementations, a method includes receiving, at a computing system, audio data that indicates a voice input and determining a particular dialog state, from among a plurality of dialog states, which corresponds to the voice input. A set of n-grams can be identified that are associated with the particular dialog state that corresponds to the voice input. In response to identifying the set of n-grams that are associated with the particular dialog state that corresponds to the voice input, a language model can be biased by adjusting probability scores that the language model indicates for n-grams in the set of n-grams. The voice input can be transcribed using the adjusted language model.

Type: Application

Filed: May 18, 2018

Publication date: November 22, 2018

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
Speech recognition for keywords

Patent number: 10055767

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speech recognition are disclosed. In one aspect, a method includes receiving a candidate adword from an advertiser. The method further includes generating a score for the candidate adword based on a likelihood of a speech recognizer generating, based on an utterance of the candidate adword, a transcription that includes a word that is associated with an expected pronunciation of the candidate adword. The method further includes classifying, based at least on the score, the candidate adword as an appropriate adword for use in a bidding process for advertisements that are selected based on a transcription of a speech query or as not an appropriate adword for use in the bidding process for advertisements that are selected based on the transcription of the speech query.

Type: Grant

Filed: May 13, 2015

Date of Patent: August 21, 2018

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
LANGUAGE MODEL BIASING SYSTEM

Publication number: 20180233131

Abstract: Methods, systems, and apparatus for receiving audio data corresponding to a user utterance and context data, identifying an initial set of one or more n-grams from the context data, generating an expanded set of one or more n-grams based on the initial set of n-grams, adjusting a language model based at least on the expanded set of n-grams, determining one or more speech recognition candidates for at least a portion of the user utterance using the adjusted language model, adjusting a score for a particular speech recognition candidate determined to be included in the expanded set of n-grams, determining a transcription of user utterance that includes at least one of the one or more speech recognition candidates, and providing the transcription of the user utterance for output.

Type: Application

Filed: February 14, 2017

Publication date: August 16, 2018

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
SERVER SIDE HOTWORDING

Publication number: 20180233150

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for detecting hotwords using a server. One of the methods includes receiving an audio signal encoding one or more utterances including a first utterance; determining whether at least a portion of the first utterance satisfies a first threshold of being at least a portion of a key phrase; in response to determining that at least the portion of the first utterance satisfies the first threshold of being at least a portion of a key phrase, sending the audio signal to a server system that determines whether the first utterance satisfies a second threshold of being the key phrase, the second threshold being more restrictive than the first threshold; and receiving tagged text data representing the one or more utterances encoded in the audio signal when the server system determines that the first utterance satisfies the second threshold.

Type: Application

Filed: February 14, 2017

Publication date: August 16, 2018

Inventors: Alexander H. Gruenstein, Petar Aleksic, Johan Schalkwyk, Pedro J. Moreno Mengibar
Voice recognition system

Patent number: 10049666

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with the first candidate transcription; adjusting a respective weight for each of the one or more contexts; and determining a second candidate transcription for a second segment of the voice input based in part on the adjusted weights; and providing the transcription of the plurality of segments of the voice input for output.

Type: Grant

Filed: January 6, 2016

Date of Patent: August 14, 2018

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
Automatic Language Model Update

Publication number: 20180204565

Abstract: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model, includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.

Type: Application

Filed: March 15, 2018

Publication date: July 19, 2018

Inventors: Michael H. Cohen, Shumeet Baluja, Pedro J. Moreno Mengibar
VOICE RECOGNITION SYSTEM

Publication number: 20180190293

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for voice recognition. In one aspect, a method includes the actions of receiving a voice input; determining a transcription for the voice input, wherein determining the transcription for the voice input includes, for a plurality of segments of the voice input: obtaining a first candidate transcription for a first segment of the voice input; determining one or more contexts associated with the first candidate transcription; adjusting a respective weight for each of the one or more contexts; and determining a second candidate transcription for a second segment of the voice input based in part on the adjusted weights; and providing the transcription of the plurality of segments of the voice input for output.

Type: Application

Filed: March 2, 2018

Publication date: July 5, 2018

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
Privacy-preserving training corpus selection

Patent number: 9990925

Abstract: The present disclosure relates to training a speech recognition system. A system that includes an automated speech recognizer and receives data from a client device. The system determines that at least a portion of the received data is likely sensitive data. Before the at least a portion of the received data is deleted, the system provides the at least a portion of the received data to a model training engine that trains recognition models for the automated speech recognizer. After the at least a portion of the received data is provided, the system deletes the at least a portion of the received data.

Type: Grant

Filed: December 14, 2017

Date of Patent: June 5, 2018

Assignee: Google LLC

Inventors: Eugene Weinstein, Pedro J. Moreno Mengibar
LANGUAGE MODEL BIASING MODULATION

Publication number: 20180144741

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for modulating language model biasing. In some implementations, context data is received. A likely context associated with a user is determined based on at least a portion of the context data. One or more language model biasing parameters based at least on the likely context associated with the user is selected. A context confidence score associated with the likely context based on at least a portion of the context data is determined. One or more language model biasing parameters based at least on the context confidence score is adjusted. A baseline language model based at least on the one or more of the adjusted language model biasing parameters is biased. The baseline language model is provided for use by an automated speech recognizer (ASR).

Type: Application

Filed: January 18, 2018

Publication date: May 24, 2018

Inventors: Pedro J. Moreno Mengibar, Petar Aleksic
Determining dialog states for language models

Patent number: 9978367

Abstract: Systems, methods, devices, and other techniques are described herein for determining dialog states that correspond to voice inputs and for biasing a language model based on the determined dialog states. In some implementations, a method includes receiving, at a computing system, audio data that indicates a voice input and determining a particular dialog state, from among a plurality of dialog states, which corresponds to the voice input. A set of n-grams can be identified that are associated with the particular dialog state that corresponds to the voice input. In response to identifying the set of n-grams that are associated with the particular dialog state that corresponds to the voice input, a language model can be biased by adjusting probability scores that the language model indicates for n-grams in the set of n-grams. The voice input can be transcribed using the adjusted language model.

Type: Grant

Filed: March 16, 2016

Date of Patent: May 22, 2018

Assignee: Google LLC

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
Automatic language model update

Patent number: 9953636

Abstract: A method for generating a speech recognition model includes accessing a baseline speech recognition model, obtaining information related to recent language usage from search queries, and modifying the speech recognition model to revise probabilities of a portion of a sound occurrence based on the information. The portion of a sound may include a word. Also, a method for generating a speech recognition model, includes receiving at a search engine from a remote device an audio recording and a transcript that substantially represents at least a portion of the audio recording, synchronizing the transcript with the audio recording, extracting one or more letters from the transcript and extracting the associated pronunciation of the one or more letters from the audio recording, and generating a dictionary entry in a pronunciation dictionary.

Type: Grant

Filed: October 9, 2015

Date of Patent: April 24, 2018

Inventors: Michael H. Cohen, Shumeet Baluja, Pedro J. Moreno Mengibar
PRIVACY-PRESERVING TRAINING CORPUS SELECTION

Publication number: 20180108355

Abstract: The present disclosure relates to training a speech recognition system. A system that includes an automated speech recognizer and receives data from a client device. The system determines that at least a portion of the received data is likely sensitive data. Before the at least a portion of the received data is deleted, the system provides the at least a portion of the received data to a model training engine that trains recognition models for the automated speech recognizer. After the at least a portion of the received data is provided, the system deletes the at least a portion of the received data.

Type: Application

Filed: December 14, 2017

Publication date: April 19, 2018

Inventors: Eugene Weinstein, Pedro J. Moreno Mengibar
Language model biasing modulation

Patent number: 9886946

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for modulating language model biasing. In some implementations, context data is received. A likely context associated with a user is determined based on at least a portion of the context data. One or more language model biasing parameters based at least on the likely context associated with the user is selected. A context confidence score associated with the likely context based on at least a portion of the context data is determined. One or more language model biasing parameters based at least on the context confidence score is adjusted. A baseline language model based at least on the one or more of the adjusted language model biasing parameters is biased. The baseline language model is provided for use by an automated speech recognizer (ASR).

Type: Grant

Filed: September 13, 2016

Date of Patent: February 6, 2018

Assignee: Google LLC

Inventors: Pedro J. Moreno-Mengibar, Petar Aleksic
Privacy-preserving training corpus selection

Patent number: 9881613

Abstract: The present disclosure relates to training a speech recognition system. A system that includes an automated speech recognizer and receives data from a client device. The system determines that at least a portion of the received data is likely sensitive data. Before the at least a portion of the received data is deleted, the system provides the at least a portion of the received data to a model training engine that trains recognition models for the automated speech recognizer. After the at least a portion of the received data is provided, the system deletes the at least a portion of the received data.

Type: Grant

Filed: June 29, 2015

Date of Patent: January 30, 2018

Assignee: Google LLC

Inventors: Eugene Weinstein, Pedro J. Moreno Mengibar
Language models using non-linguistic context

Patent number: 9842592

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for language models using non-linguistic context. In some implementations, context data indicating non-linguistic context for the utterance is received. Based on the context data, feature scores for one or more non-linguistic features are generated. The feature scores for the non-linguistic features are provided to a language model trained to process scores for non-linguistic features. The output from the language model is received, and a transcription for the utterance is determined using the output of the language model.

Type: Grant

Filed: February 12, 2014

Date of Patent: December 12, 2017

Assignee: Google Inc.

Inventors: Fadi Biadsy, Pedro J. Moreno Mengibar
CUSTOMIZED VOICE ACTION SYSTEM

Publication number: 20170301350

Abstract: Systems, methods, and computer-readable media that may be used to modify a voice action system to include voice actions provided by advertisers or users are provided. One method includes receiving electronic voice action bids from advertisers to modify the voice action system to include a specific voice action (e.g., a triggering phrase and an action). One or more bids may be selected. The method includes, for each of the selected bids, modifying data associated with the voice action system to include the voice action associated with the bid, such that the action associated with the respective voice action is performed when voice input from a user is received that the voice action system determines to correspond to the triggering phrase associated with the respective voice action.

Type: Application

Filed: June 29, 2017

Publication date: October 19, 2017

Applicant: Google Inc.

Inventor: Pedro J. Moreno Mengibar
NEGATIVE N-GRAM BIASING

Publication number: 20170270918

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing dynamic, stroke-based alignment of touch displays. In one aspect, a method includes obtaining a candidate transcription that an automated speech recognizer generates for an utterance, determining a particular context associated with the utterance, determining that a particular n-gram that is included in the candidate transcription is included among a set of undesirable n-grams that is associated with the context, adjusting a speech recognition confidence score associated with the transcription based on determining that the particular n-gram that is included in the candidate transcription is included among the set of undesirable n-grams that is associated with the context, and determining whether to provide the candidate transcription for output based at least on the adjusted speech recognition confidence score.

Type: Application

Filed: May 25, 2017

Publication date: September 21, 2017

Inventors: Pedro J. Moreno Mengibar, Petar Aleksic
Determining Dialog States for Language Models

Publication number: 20170270929

Abstract: Systems, methods, devices, and other techniques are described herein for determining dialog states that correspond to voice inputs and for biasing a language model based on the determined dialog states. In some implementations, a method includes receiving, at a computing system, audio data that indicates a voice input and determining a particular dialog state, from among a plurality of dialog states, which corresponds to the voice input. A set of n-grams can be identified that are associated with the particular dialog state that corresponds to the voice input. In response to identifying the set of n-grams that are associated with the particular dialog state that corresponds to the voice input, a language model can be biased by adjusting probability scores that the language model indicates for n-grams in the set of n-grams. The voice input can be transcribed using the adjusted language model.

Type: Application

Filed: March 16, 2016

Publication date: September 21, 2017

Inventors: Petar Aleksic, Pedro J. Moreno Mengibar
Identifying substitute pronunciations

Patent number: 9747897

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, including selecting terms; obtaining an expected phonetic transcription of an idealized native speaker of a natural language speaking the terms; receiving audio data corresponding to a particular user speaking the terms in the natural language; obtaining, based on the audio data, an actual phonetic transcription of the particular user speaking the terms in the natural language; aligning the expected phonetic transcription of the idealized native speaker of the natural language with the actual phonetic transcription of the particular user; identifying, based on the aligning, a portion of the expected phonetic transcription that is different than a corresponding portion of the actual phonetic transcription; and based on identifying the portion of the expected phonetic transcription, designating the expected phonetic transcription as a substitute pronunciation for the corresponding portion of the actual phonetic tr

Type: Grant

Filed: December 17, 2013

Date of Patent: August 29, 2017

Assignee: Google Inc.

Inventors: Fuchun Peng, Francoise Beaufays, Pedro J. Moreno Mengibar, Brian Patrick Strope

prev … 2 3 4 5 6 7 8 9 10 next