Patents by Inventor Raziel Alvarez

Raziel Alvarez has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Speaker Verification Using Co-Location Information

Publication number: 20200013412

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Application

Filed: September 17, 2019

Publication date: January 9, 2020

Applicant: Google LLC

Inventors: Raziel Alvarez Guevara, Othar Hansson

Speaker verification using co-location information

Patent number: 10460735

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Grant

Filed: October 26, 2018

Date of Patent: October 29, 2019

Assignee: Google LLC

Inventors: Raziel Alvarez Guevara, Othar Hansson

Individualized hotword detection models

Patent number: 10438593

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining enrollment acoustic data representing an enrollment utterance spoken by a user, obtaining a set of candidate acoustic data representing utterances spoken by other users, determining, for each candidate acoustic data of the set of candidate acoustic data, a similarity score that represents a similarity between the enrollment acoustic data and the candidate acoustic data, selecting a subset of candidate acoustic data from the set of candidate acoustic data based at least on the similarity scores, generating a detection model based on the subset of candidate acoustic data, and providing the detection model for use in detecting an utterance spoken by the user.

Type: Grant

Filed: July 22, 2015

Date of Patent: October 8, 2019

Assignee: Google LLC

Inventor: Raziel Alvarez Guevara

Speaker verification using co-location information

Publication number: 20190074017

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Application

Filed: October 26, 2018

Publication date: March 7, 2019

Applicant: Google LLC

Inventors: Raziel Alvarez Guevara, Othar Hansson

Speaker verification using co-location information

Patent number: 10147429

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Grant

Filed: September 6, 2017

Date of Patent: December 4, 2018

Assignee: Google LLC

Inventors: Raziel Alvarez Guevara, Othar Hansson

Systems and methods for performing actions in response to user gestures in captured images

Patent number: 9953216

Abstract: Systems, methods, and computer-readable media are provided for performing actions in response to gestures made by a user in captured images. In accordance with one implementation, a computer-implemented system is provided that includes an image capture device that captures at least one image, a memory device that stores instructions, and at least one processor that executes the instructions stored in the memory device. In some implementations, the processor receives, from the image capture device, at least one image including a gesture made by a user and analyzes the at least one image to identify the gesture made by the user. In some implementations, the processor also determines, based on the identified gesture, one or more actions to perform on the at least one image.

Type: Grant

Filed: January 13, 2015

Date of Patent: April 24, 2018

Assignee: Google LLC

Inventor: Raziel Alvarez

Speaker verification using co-location information

Publication number: 20180012604

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Application

Filed: September 6, 2017

Publication date: January 11, 2018

Inventors: Raziel Alvarez Guevara, Othar Hansson

Automatic selective gain control of audio data for speech recognition

Patent number: 9842608

Abstract: This specification describes, among other things, a computer-implemented method. The method can include receiving a stream of audio data at a computing device. The stream of audio data can be segmented into a plurality of audio segments. Respective intensity levels are determined for each of the plurality of audio segments. For each of the plurality of audio segments and based on the respective intensity levels, a determination can be made as to whether the audio segment includes a speech signal. Selective gain control can be performed on the stream of audio data by automatically adjusting a gain of particular ones of the plurality of audio segments that are determined to include a speech signal.

Type: Grant

Filed: June 1, 2015

Date of Patent: December 12, 2017

Assignee: Google Inc.

Inventors: Raziel Alvarez Guevara, Preetum Nakkiran

Speaker verification using co-location information

Patent number: 9792914

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Grant

Filed: July 5, 2016

Date of Patent: October 17, 2017

Assignee: Google Inc.

Inventors: Raziel Alvarez Guevara, Othar Hansson

Rank-constrained neural networks

Patent number: 9767410

Abstract: This specification describes, among other things, a computer-implemented method. The method can include training a baseline neural network using a first set of training data. For each node in a subset of interconnected nodes in the baseline neural network, a rank-k approximation of a filter for the node can be computed. A subset of nodes in a rank-constrained neural network can then be initialized with the rank-k approximations of the filters from the baseline neural network. The subset of nodes in the rank-constrained neural network can correspond to the subset of nodes in the baseline neural network. After initializing, the rank-constrained neural network can be trained using a second set of training data while maintaining a rank-k filter topology for the subset of nodes in the rank-constrained neural network.

Type: Grant

Filed: June 15, 2015

Date of Patent: September 19, 2017

Assignee: Google Inc.

Inventors: Raziel Alvarez Guevara, Preetum Nakkiran

Individualized hotword detection models

Publication number: 20170194006

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining enrollment acoustic data representing an enrollment utterance spoken by a user, obtaining a set of candidate acoustic data representing utterances spoken by other users, determining, for each candidate acoustic data of the set of candidate acoustic data, a similarity score that represents a similarity between the enrollment acoustic data and the candidate acoustic data, selecting a subset of candidate acoustic data from the set of candidate acoustic data based at least on the similarity scores, generating a detection model based on the subset of candidate acoustic data, and providing the detection model for use in detecting an utterance spoken by the user.

Type: Application

Filed: March 17, 2017

Publication date: July 6, 2017

Inventor: Raziel Alvarez Guevara

Individualized hotword detection models

Publication number: 20170186433

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining enrollment acoustic data representing an enrollment utterance spoken by a user, obtaining a set of candidate acoustic data representing utterances spoken by other users, determining, for each candidate acoustic data of the set of candidate acoustic data, a similarity score that represents a similarity between the enrollment acoustic data and the candidate acoustic data, selecting a subset of candidate acoustic data from the set of candidate acoustic data based at least on the similarity scores, generating a detection model based on the subset of candidate acoustic data, and providing the detection model for use in detecting an utterance spoken by the user.

Type: Application

Filed: June 29, 2016

Publication date: June 29, 2017

Inventor: Raziel Alvarez Guevara

Individualized hotword detection models

Publication number: 20170025125

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for presenting notifications in an enterprise system. In one aspect, a method include actions of obtaining enrollment acoustic data representing an enrollment utterance spoken by a user, obtaining a set of candidate acoustic data representing utterances spoken by other users, determining, for each candidate acoustic data of the set of candidate acoustic data, a similarity score that represents a similarity between the enrollment acoustic data and the candidate acoustic data, selecting a subset of candidate acoustic data from the set of candidate acoustic data based at least on the similarity scores, generating a detection model based on the subset of candidate acoustic data, and providing the detection model for use in detecting an utterance spoken by the user.

Type: Application

Filed: July 22, 2015

Publication date: January 26, 2017

Inventor: Raziel Alvarez Guevara

Text-dependent speaker identification

Patent number: 9542948

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker verification. The methods, systems, and apparatus include actions of inputting speech data that corresponds to a particular utterance to a first neural network and determining an evaluation vector based on output at a hidden layer of the first neural network. Additional actions include obtaining a reference vector that corresponds to a past utterance of a particular speaker. Further actions include inputting the evaluation vector and the reference vector to a second neural network that is trained on a set of labeled pairs of feature vectors to identify whether speakers associated with the labeled pairs of feature vectors are the same speaker. More actions include determining, based on an output of the second neural network, whether the particular utterance was likely spoken by the particular speaker.

Type: Grant

Filed: February 3, 2015

Date of Patent: January 10, 2017

Assignee: Google Inc.

Inventors: Dominik Roblek, Matthew Sharifi, Raziel Alvarez Guevara

Speaker verification using co-location information

Publication number: 20160314792

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Application

Filed: July 5, 2016

Publication date: October 27, 2016

Inventors: Raziel Alvarez Guevara, Othar Hansson

Speaker verification using co-location information

Patent number: 9412376

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Grant

Filed: July 22, 2015

Date of Patent: August 9, 2016

Assignee: Google Inc.

Inventors: Raziel Alvarez Guevara, Othar Hansson

Systems and methods for performing actions in response to user gestures in captured images

Publication number: 20160203360

Abstract: Systems, methods, and computer-readable media are provided for performing actions in response to gestures made by a user in captured images. In accordance with one implementation, a computer-implemented system is provided that includes an image capture device that captures at least one image, a memory device that stores instructions, and at least one processor that executes the instructions stored in the memory device. In some implementations, the processor receives, from the image capture device, at least one image including a gesture made by a user and analyzes the at least one image to identify the gesture made by the user. In some implementations, the processor also determines, based on the identified gesture, one or more actions to perform on the at least one image.

Type: Application

Filed: January 13, 2015

Publication date: July 14, 2016

Inventor: Raziel Alvarez

Unified declarative model for user interfaces

Patent number: 9372675

Abstract: In one implementation for providing a user interface of a software application, a request to enable a user to perceive the user interface of the software application may be received. A context and a user interface expression may be determined based on the request. The user interface expression may be evaluated based on the context to generate a user interface definition. The user interface definition may be provided to a user interface renderer. The user interface renderer may be configured to process the user interface definition to enable the user to perceive the user interface.

Type: Grant

Filed: January 22, 2014

Date of Patent: June 21, 2016

Assignee: Appian Corporation

Inventors: Marco Pescosolido, Annelise Dubrovsky, Brian Joseph Sullivan, Antonio Andrade Garcia, Carlos Santiago Aguayo Rico, Raziel Alvarez Guevara, Cody Allen Smith, Matthew David Hilliard

Automatic gain control for speech recognition

Publication number: 20160099007

Abstract: This specification describes, among other things, a computer-implemented method. The method can include receiving a stream of audio data at a computing device. The stream of audio data can be segmented into a plurality of audio segments. Respective intensity levels are determined for each of the plurality of audio segments. For each of the plurality of audio segments and based on the respective intensity levels, a determination can be made as to whether the audio segment includes a speech signal. Selective gain control can be performed on the stream of audio data by automatically adjusting a gain of particular ones of the plurality of audio segments that are determined to include a speech signal.

Type: Application

Filed: June 1, 2015

Publication date: April 7, 2016

Inventors: Raziel Alvarez, Preetum Nakkiran

Speaker verification using co-location information

Patent number: 9257120

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying a user in a multi-user environment. One of the methods includes receiving, by a first user device, an audio signal encoding an utterance, obtaining, by the first user device, a first speaker model for a first user of the first user device, obtaining, by the first user device for a second user of a second user device that is co-located with the first user device, a second speaker model for the second user or a second score that indicates a respective likelihood that the utterance was spoken by the second user, and determining, by the first user device, that the utterance was spoken by the first user using (i) the first speaker model and the second speaker model or (ii) the first speaker model and the second score.

Type: Grant

Filed: July 18, 2014

Date of Patent: February 9, 2016

Assignee: Google Inc.

Inventors: Raziel Alvarez Guevara, Othar Hansson
