Patents by Inventor Marc'aurelio Ranzato

Marc'aurelio Ranzato has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for facial representation

Patent number: 11210503

Abstract: Systems, methods, and non-transitory computer readable media can align face images, classify face images, and verify face images by employing a deep neural network (DNN). A 3D-aligned face image can be generated from a 2D face image. An identity of the 2D face image can be classified based on provision of the 3D-aligned face image to the DNN. The identity of the 2D face image can comprise a feature vector.

Type: Grant

Filed: September 11, 2018

Date of Patent: December 28, 2021

Assignee: Facebook, Inc.

Inventors: Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato
SYSTEMS AND METHODS FOR FACIAL REPRESENTATION

Publication number: 20190171868

Abstract: Systems, methods, and non-transitory computer readable media can align face images, classify face images, and verify face images by employing a deep neural network (DNN). A 3D-aligned face image can be generated from a 2D face image. An identity of the 2D face image can be classified based on provision of the 3D-aligned face image to the DNN. The identity of the 2D face image can comprise a feature vector.

Type: Application

Filed: September 11, 2018

Publication date: June 6, 2019

Inventors: Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato
Generating labeled images

Patent number: 9852363

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating labeled images. One of the methods includes selecting a plurality of candidate videos from videos identified in a response to a search query derived from a label for an object category; selecting one or more initial frames from each of the candidate videos; detecting one or more initial images of objects in the object category in the initial frames; for each initial frame including an initial image of an object in the object category, tracking the object through surrounding frames to identify additional images of the object; and selecting one or more images from the one or more initial images and one or more additional images as database images of objects belonging to the object category.

Type: Grant

Filed: January 5, 2016

Date of Patent: December 26, 2017

Assignee: Google Inc.

Inventors: Jonathon Shlens, Quoc V. Le, Gregory Sean Corrado, Marc'Aurelio Ranzato
Identifying objects in images

Patent number: 9613297

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes receiving an input image; down-sampling the input image to generate a second image; generating a respective first score for each of the plurality of object categories; selecting an initial patch of the input image; generating a respective second score for each of the plurality of object categories; and generating a respective third score for each of the plurality of object categories from the first scores and the second scores, wherein the respective third score for each of the plurality of object categories represents a likelihood that the input image contains an image of an object belonging to the object category.

Type: Grant

Filed: December 28, 2015

Date of Patent: April 4, 2017

Assignee: Google Inc.

Inventor: Marc'Aurelio Ranzato
Multilingual, acoustic deep neural networks

Patent number: 9460711

Abstract: Methods and systems for processing multilingual DNN acoustic models are described. An example method may include receiving training data that includes a respective training data set for each of two or more or languages. A multilingual deep neural network (DNN) acoustic model may be processed based on the training data. The multilingual DNN acoustic model may include a feedforward neural network having multiple layers of one or more nodes. Each node of a given layer may connect with a respective weight to each node of a subsequent layer, and the multiple layers of one or more nodes may include one or more shared hidden layers of nodes and a language-specific output layer of nodes corresponding to each of the two or more languages. Additionally, weights associated with the multiple layers of one or more nodes of the processed multilingual DNN acoustic model may be stored in a database.

Type: Grant

Filed: April 15, 2013

Date of Patent: October 4, 2016

Assignee: Google Inc.

Inventors: Vincent Olivier Vanhoucke, Jeffrey Adgate Dean, Georg Heigold, Marc'aurelio Ranzato, Matthieu Devin, Patrick An Phu Nguyen, Andrew William Senior
Generating labeled images

Patent number: 9256807

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating labeled images. One of the methods includes selecting a plurality of candidate videos from videos identified in a response to a search query derived from a label for an object category; selecting one or more initial frames from each of the candidate videos; detecting one or more initial images of objects in the object category in the initial frames; for each initial frame including an initial image of an object in the object category, tracking the object through surrounding frames to identify additional images of the object; and selecting one or more images from the one or more initial images and one or more additional images as database images of objects belonging to the object category.

Type: Grant

Filed: March 14, 2013

Date of Patent: February 9, 2016

Assignee: Google Inc.

Inventors: Jonathon Shlens, Quoc V. Le, Gregory S. Corrado, Marc'Aurelio Ranzato
Identifying objects in images

Patent number: 9224068

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes receiving an input image; down-sampling the input image to generate a second image; generating a respective first score for each of the plurality of object categories; selecting an initial patch of the input image; generating a respective second score for each of the plurality of object categories; and generating a respective third score for each of the plurality of object categories from the first scores and the second scores, wherein the respective third score for each of the plurality of object categories represents a likelihood that the input image contains an image of an object belonging to the object category.

Type: Grant

Filed: December 4, 2013

Date of Patent: December 29, 2015

Assignee: Google Inc.

Inventor: Marc'Aurelio Ranzato
Curriculum learning for speech recognition

Patent number: 9202464

Abstract: Methods and apparatus related to training speech recognition devices are presented. A computing device receives training samples for training a neural network to learn an acoustic speech model. A curriculum function for speech modeling can be determined. For each training sample of the training samples, a corresponding curriculum function value for the training sample can be determined using the curriculum function. The training samples can be ordered based on the corresponding curriculum function values. In some embodiments, the neural network can be trained utilizing the ordered training samples. The trained neural network can receive an input of a second plurality of samples corresponding to human speech, where the second plurality of samples differs from the training samples. In response to receiving the second plurality of samples, the trained neural network can generate a plurality of phones corresponding to the captured human speech.

Type: Grant

Filed: April 9, 2013

Date of Patent: December 1, 2015

Assignee: Google Inc.

Inventors: Andrew William Senior, Marc'Aurelio Ranzato
Identifying objects in images

Patent number: 9129190

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes obtaining a first training image; down-sampling the first training image to generate a low-resolution first training image; processing the low-resolution first training image using a first neural network to generate a plurality of features of the low-resolution first training image and first scores for the low-resolution first training image; processing the first scores and the features of the low-resolution first training image using an initial patch locator neural network to generate an initial location of an initial patch of the first training image; locally perturbing the initial location to select an adjusted location for the initial patch of the first training image; and updating the current values of the parameters of the initial patch locator neural network to generate updated values using the adjusted location.

Type: Grant

Filed: December 4, 2013

Date of Patent: September 8, 2015

Assignee: Google Inc.

Inventor: Marc'Aurelio Ranzato
SYSTEMS AND METHODS FOR FACIAL REPRESENTATION

Publication number: 20150125049

Abstract: Systems, methods, and non-transitory computer readable media can align face images, classify face images, and verify face images by employing a deep neural network (DNN). A 3D-aligned face image can be generated from a 2D face image. An identity of the 2D face image can be classified based on provision of the 3D-aligned face image to the DNN. The identity of the 2D face image can comprise a feature vector.

Type: Application

Filed: October 31, 2014

Publication date: May 7, 2015

Inventors: Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato
Automatic visual recognition of biological particles

Publication number: 20050251347

Abstract: A method and system provide the ability to automatically recognize biological particles. An image of biological particles (e.g., airborne pollen or urine) is obtained. One or more parts of the image are detected as containing one or more particles of interest. Feature vector(s) are extracted from each detected part of the image. Non-linearities are applied to each feature vector. Each part of the image is then classified into a category of biological particle based on the one or more feature vectors for each part of the image.

Type: Application

Filed: May 5, 2005

Publication date: November 10, 2005

Inventors: Pietro Perona, Marc'aurelio Ranzato, Richard Flagan