Patents by Inventor Marc'aurelio Ranzato
Marc'aurelio Ranzato has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11210503Abstract: Systems, methods, and non-transitory computer readable media can align face images, classify face images, and verify face images by employing a deep neural network (DNN). A 3D-aligned face image can be generated from a 2D face image. An identity of the 2D face image can be classified based on provision of the 3D-aligned face image to the DNN. The identity of the 2D face image can comprise a feature vector.Type: GrantFiled: September 11, 2018Date of Patent: December 28, 2021Assignee: Facebook, Inc.Inventors: Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato
-
Publication number: 20190171868Abstract: Systems, methods, and non-transitory computer readable media can align face images, classify face images, and verify face images by employing a deep neural network (DNN). A 3D-aligned face image can be generated from a 2D face image. An identity of the 2D face image can be classified based on provision of the 3D-aligned face image to the DNN. The identity of the 2D face image can comprise a feature vector.Type: ApplicationFiled: September 11, 2018Publication date: June 6, 2019Inventors: Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato
-
Patent number: 9852363Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating labeled images. One of the methods includes selecting a plurality of candidate videos from videos identified in a response to a search query derived from a label for an object category; selecting one or more initial frames from each of the candidate videos; detecting one or more initial images of objects in the object category in the initial frames; for each initial frame including an initial image of an object in the object category, tracking the object through surrounding frames to identify additional images of the object; and selecting one or more images from the one or more initial images and one or more additional images as database images of objects belonging to the object category.Type: GrantFiled: January 5, 2016Date of Patent: December 26, 2017Assignee: Google Inc.Inventors: Jonathon Shlens, Quoc V. Le, Gregory Sean Corrado, Marc'Aurelio Ranzato
-
Patent number: 9613297Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes receiving an input image; down-sampling the input image to generate a second image; generating a respective first score for each of the plurality of object categories; selecting an initial patch of the input image; generating a respective second score for each of the plurality of object categories; and generating a respective third score for each of the plurality of object categories from the first scores and the second scores, wherein the respective third score for each of the plurality of object categories represents a likelihood that the input image contains an image of an object belonging to the object category.Type: GrantFiled: December 28, 2015Date of Patent: April 4, 2017Assignee: Google Inc.Inventor: Marc'Aurelio Ranzato
-
Patent number: 9460711Abstract: Methods and systems for processing multilingual DNN acoustic models are described. An example method may include receiving training data that includes a respective training data set for each of two or more or languages. A multilingual deep neural network (DNN) acoustic model may be processed based on the training data. The multilingual DNN acoustic model may include a feedforward neural network having multiple layers of one or more nodes. Each node of a given layer may connect with a respective weight to each node of a subsequent layer, and the multiple layers of one or more nodes may include one or more shared hidden layers of nodes and a language-specific output layer of nodes corresponding to each of the two or more languages. Additionally, weights associated with the multiple layers of one or more nodes of the processed multilingual DNN acoustic model may be stored in a database.Type: GrantFiled: April 15, 2013Date of Patent: October 4, 2016Assignee: Google Inc.Inventors: Vincent Olivier Vanhoucke, Jeffrey Adgate Dean, Georg Heigold, Marc'aurelio Ranzato, Matthieu Devin, Patrick An Phu Nguyen, Andrew William Senior
-
Patent number: 9256807Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating labeled images. One of the methods includes selecting a plurality of candidate videos from videos identified in a response to a search query derived from a label for an object category; selecting one or more initial frames from each of the candidate videos; detecting one or more initial images of objects in the object category in the initial frames; for each initial frame including an initial image of an object in the object category, tracking the object through surrounding frames to identify additional images of the object; and selecting one or more images from the one or more initial images and one or more additional images as database images of objects belonging to the object category.Type: GrantFiled: March 14, 2013Date of Patent: February 9, 2016Assignee: Google Inc.Inventors: Jonathon Shlens, Quoc V. Le, Gregory S. Corrado, Marc'Aurelio Ranzato
-
Patent number: 9224068Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes receiving an input image; down-sampling the input image to generate a second image; generating a respective first score for each of the plurality of object categories; selecting an initial patch of the input image; generating a respective second score for each of the plurality of object categories; and generating a respective third score for each of the plurality of object categories from the first scores and the second scores, wherein the respective third score for each of the plurality of object categories represents a likelihood that the input image contains an image of an object belonging to the object category.Type: GrantFiled: December 4, 2013Date of Patent: December 29, 2015Assignee: Google Inc.Inventor: Marc'Aurelio Ranzato
-
Patent number: 9202464Abstract: Methods and apparatus related to training speech recognition devices are presented. A computing device receives training samples for training a neural network to learn an acoustic speech model. A curriculum function for speech modeling can be determined. For each training sample of the training samples, a corresponding curriculum function value for the training sample can be determined using the curriculum function. The training samples can be ordered based on the corresponding curriculum function values. In some embodiments, the neural network can be trained utilizing the ordered training samples. The trained neural network can receive an input of a second plurality of samples corresponding to human speech, where the second plurality of samples differs from the training samples. In response to receiving the second plurality of samples, the trained neural network can generate a plurality of phones corresponding to the captured human speech.Type: GrantFiled: April 9, 2013Date of Patent: December 1, 2015Assignee: Google Inc.Inventors: Andrew William Senior, Marc'Aurelio Ranzato
-
Patent number: 9129190Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for identifying objects in images. One of the methods includes obtaining a first training image; down-sampling the first training image to generate a low-resolution first training image; processing the low-resolution first training image using a first neural network to generate a plurality of features of the low-resolution first training image and first scores for the low-resolution first training image; processing the first scores and the features of the low-resolution first training image using an initial patch locator neural network to generate an initial location of an initial patch of the first training image; locally perturbing the initial location to select an adjusted location for the initial patch of the first training image; and updating the current values of the parameters of the initial patch locator neural network to generate updated values using the adjusted location.Type: GrantFiled: December 4, 2013Date of Patent: September 8, 2015Assignee: Google Inc.Inventor: Marc'Aurelio Ranzato
-
Publication number: 20150125049Abstract: Systems, methods, and non-transitory computer readable media can align face images, classify face images, and verify face images by employing a deep neural network (DNN). A 3D-aligned face image can be generated from a 2D face image. An identity of the 2D face image can be classified based on provision of the 3D-aligned face image to the DNN. The identity of the 2D face image can comprise a feature vector.Type: ApplicationFiled: October 31, 2014Publication date: May 7, 2015Inventors: Yaniv Taigman, Ming Yang, Marc'Aurelio Ranzato
-
Publication number: 20050251347Abstract: A method and system provide the ability to automatically recognize biological particles. An image of biological particles (e.g., airborne pollen or urine) is obtained. One or more parts of the image are detected as containing one or more particles of interest. Feature vector(s) are extracted from each detected part of the image. Non-linearities are applied to each feature vector. Each part of the image is then classified into a category of biological particle based on the one or more feature vectors for each part of the image.Type: ApplicationFiled: May 5, 2005Publication date: November 10, 2005Inventors: Pietro Perona, Marc'aurelio Ranzato, Richard Flagan