Patents by Inventor Ross Girshick

Ross Girshick has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Masked Autoencoders for Computer Vision

Publication number: 20240096072

Abstract: In particular embodiments, a computing system may access a plurality of images for pre-training a first machine-learning model that includes an encoder and a decoder. Using each image, the system may pre-train the model by dividing the image into a set of patches, selecting a first subset of the patches to be visible and a second subset of the patches to be masked during the pre-training, processing, using the encoder, the first subset of patches to generate corresponding first latent representations, processing, using the decoder, the first latent representations corresponding to the first subset of patches and mask tokens corresponding to the second subset of patches to generate reconstructed patches corresponding to the second subset of patches, the reconstructed patches and the first subset of patches being used to generate a reconstructed image, and updating the model based on comparisons between the image and the reconstructed image.
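The patch-division and masking steps described in this abstract can be sketched roughly as follows. This is a minimal illustration and not the patented implementation: the function names, the 2 x 2 patch size, the 0.75 mask ratio, and the uniform random selection are all assumptions made for the example, and the encoder, decoder, and model update are omitted.

```python
import random

def split_into_patches(image, patch):
    """Split an H x W image (a list of rows) into non-overlapping patch x patch tiles."""
    h, w = len(image), len(image[0])
    assert h % patch == 0 and w % patch == 0, "image must tile evenly"
    patches = []
    for top in range(0, h, patch):
        for left in range(0, w, patch):
            patches.append([row[left:left + patch] for row in image[top:top + patch]])
    return patches

def choose_visible_and_masked(num_patches, mask_ratio, rng):
    """Randomly partition patch indices into a visible subset and a masked subset."""
    order = list(range(num_patches))
    rng.shuffle(order)
    num_masked = int(num_patches * mask_ratio)  # mask_ratio is an assumed hyperparameter
    masked = sorted(order[:num_masked])
    visible = sorted(order[num_masked:])
    return visible, masked

# Toy 8 x 8 "image" whose pixel values are just their flat index.
image = [[r * 8 + c for c in range(8)] for r in range(8)]
patches = split_into_patches(image, patch=2)
visible, masked = choose_visible_and_masked(
    len(patches), mask_ratio=0.75, rng=random.Random(0)
)
```

In a full pipeline, only the patches indexed by `visible` would be passed to the encoder, while the positions in `masked` would be filled with mask tokens before decoding.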

Type: Application

Filed: July 27, 2022

Publication date: March 21, 2024

Inventors: Kaiming He, Piotr Dollar, Ross Girshick, Saining Xie, Xinlei Chen, Yanghao Li

Machine-learning models based on non-local neural networks

Patent number: 11562243

Abstract: In one embodiment, a method includes training a baseline machine-learning model based on a neural network comprising a plurality of stages, wherein each stage comprises a plurality of neural blocks, accessing a plurality of training samples comprising a plurality of content objects, respectively, determining one or more non-local operations, wherein each non-local operation is based on one or more pairwise functions and one or more unary functions, generating one or more non-local blocks based on the plurality of training samples and the one or more non-local operations, determining a stage from the plurality of stages of the neural network, and training a non-local machine-learning model by inserting each of the one or more non-local blocks in between at least two of the plurality of neural blocks in the determined stage of the neural network.
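One common way to write a non-local operation built from a pairwise function f and a unary function g is y_i = (1 / C(x)) * sum_j f(x_i, x_j) * g(x_j), where C(x) is a normalization factor. The sketch below is a hedged illustration of that form only: the exponentiated dot product used for f, the identity used for g, and all function names are assumptions for the example, and the abstract does not commit to any of them.

```python
import math

def non_local(features, pairwise, unary):
    """For each position i, compute a weighted sum of unary(x_j) over all positions j.

    Weights come from pairwise(x_i, x_j) and are normalized by their sum.
    """
    transformed = [unary(x_j) for x_j in features]
    dim = len(transformed[0])
    outputs = []
    for x_i in features:
        weights = [pairwise(x_i, x_j) for x_j in features]
        total = sum(weights)
        y_i = [
            sum(w * t[d] for w, t in zip(weights, transformed)) / total
            for d in range(dim)
        ]
        outputs.append(y_i)
    return outputs

def gaussian_pairwise(a, b):
    """Assumed pairwise function: exponentiated dot product of two feature vectors."""
    return math.exp(sum(p * q for p, q in zip(a, b)))

def identity_unary(x):
    """Assumed unary function: returns a copy of the feature vector unchanged."""
    return list(x)

features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
outputs = non_local(features, gaussian_pairwise, identity_unary)
```

Because every output is a normalized weighted average over all positions, each position can draw on information from the entire input rather than only from a local neighborhood.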

Type: Grant

Filed: November 15, 2018

Date of Patent: January 24, 2023

Assignee: Meta Platforms, Inc.

Inventors: Kaiming He, Ross Girshick, Xiaolong Wang

Systems and methods for optimizing pose estimation

Patent number: 10733431

Abstract: In one embodiment, a system may access first, second, and third probability models that are respectively associated with predetermined first and second body parts and a predetermined segment connecting the first and second body parts. Each model includes probability values associated with regions in an image, with each value representing the probability of the associated region containing the associated body part or segment. The system may select a first and second region based on the first probability model and a third region based on the second probability model. Based on the third probability model, the system may compute a first probability score for regions connecting the first and third regions and a second probability score for regions connecting the second and third regions. Based on the first and second probability scores, the system may select the first region to indicate where the predetermined first body part appears in the image.
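The selection logic in this abstract can be illustrated with a toy example. The sketch below is an assumption-laden simplification, not the patented method: the 4 x 4 probability maps are invented, regions are single grid cells, and the connection score is taken to be the average segment probability sampled along a straight line between two cells. The names `top_regions` and `connection_score` are hypothetical.

```python
def top_regions(prob_map, k):
    """Return the k (row, col) cells with the highest probability."""
    cells = [(p, (r, c)) for r, row in enumerate(prob_map) for c, p in enumerate(row)]
    cells.sort(key=lambda item: item[0], reverse=True)
    return [pos for _, pos in cells[:k]]

def connection_score(segment_map, start, end, samples=5):
    """Average segment probability at evenly spaced points between two cells."""
    total = 0.0
    for i in range(samples):
        t = i / (samples - 1)
        r = round(start[0] + t * (end[0] - start[0]))
        c = round(start[1] + t * (end[1] - start[1]))
        total += segment_map[r][c]
    return total / samples

# First model: two plausible locations for the first body part.
part_a = [[0.9, 0.0, 0.0, 0.0],
          [0.0, 0.0, 0.0, 0.0],
          [0.0, 0.0, 0.0, 0.0],
          [0.8, 0.0, 0.0, 0.0]]
# Second model: one strong location for the second body part.
part_b = [[0.0, 0.0, 0.0, 0.9],
          [0.0, 0.0, 0.0, 0.0],
          [0.0, 0.0, 0.0, 0.0],
          [0.0, 0.0, 0.0, 0.0]]
# Third model: the connecting segment is likely along the top row only.
segment = [[0.8, 0.8, 0.8, 0.8],
           [0.1, 0.1, 0.1, 0.1],
           [0.1, 0.1, 0.1, 0.1],
           [0.1, 0.1, 0.1, 0.1]]

candidates = top_regions(part_a, 2)   # first and second regions
target = top_regions(part_b, 1)[0]    # third region
scores = {cand: connection_score(segment, cand, target) for cand in candidates}
best = max(scores, key=scores.get)    # region chosen for the first body part
```

Here the candidate at the top-left wins because the path from it to the second body part runs through cells where the segment model assigns high probability.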

Type: Grant

Filed: December 31, 2018

Date of Patent: August 4, 2020

Assignee: Facebook, Inc.

Inventors: Peizhao Zhang, Peter Vajda, Kevin Matzen, Ross Girshick

Method and system for using machine-learning for object instance segmentation

Patent number: 10713794

Abstract: In one embodiment, a method includes a computing system accessing a training image. The system may generate a feature map for the training image using a first neural network. The system may identify a region of interest in the feature map and generate a regional feature map for the region of interest based on sampling locations defined by a sampling region. The sampling region and the region of interest may correspond to the same region in the feature map. The system may generate an instance segmentation mask associated with the region of interest by processing the regional feature map using a second neural network. The second neural network may be trained using the instance segmentation mask. Once trained, the second neural network is configured to generate instance segmentation masks for object instances depicted in images.
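Generating a regional feature map from sampling locations inside a region of interest might look something like the sketch below. The abstract does not say how values at the sampling locations are computed; bilinear interpolation at the center of each output bin is an assumption made here for illustration, as are the function names, the box format `(y_min, x_min, y_max, x_max)`, and the 2 x 2 output size.

```python
def bilinear_sample(feature_map, y, x):
    """Interpolate a 2D feature map at a fractional (y, x) location."""
    h, w = len(feature_map), len(feature_map[0])
    # Clamp the location so that the four neighbors stay inside the map.
    y = min(max(y, 0.0), h - 1)
    x = min(max(x, 0.0), w - 1)
    y0, x0 = int(y), int(x)
    y1, x1 = min(y0 + 1, h - 1), min(x0 + 1, w - 1)
    dy, dx = y - y0, x - x0
    top = feature_map[y0][x0] * (1 - dx) + feature_map[y0][x1] * dx
    bottom = feature_map[y1][x0] * (1 - dx) + feature_map[y1][x1] * dx
    return top * (1 - dy) + bottom * dy

def pool_region(feature_map, box, out_size):
    """Sample an out_size x out_size grid at bin centers inside the box."""
    y_min, x_min, y_max, x_max = box
    bin_h = (y_max - y_min) / out_size
    bin_w = (x_max - x_min) / out_size
    return [
        [
            bilinear_sample(
                feature_map,
                y_min + (i + 0.5) * bin_h,
                x_min + (j + 0.5) * bin_w,
            )
            for j in range(out_size)
        ]
        for i in range(out_size)
    ]

# Toy 4 x 4 feature map whose value at (r, c) is 4 * r + c.
feature_map = [[float(r * 4 + c) for c in range(4)] for r in range(4)]
regional = pool_region(feature_map, box=(0.0, 0.0, 3.0, 3.0), out_size=2)
```

Sampling at fractional coordinates keeps the sampling region aligned with the region of interest rather than snapping it to integer cell boundaries, which is one way to satisfy the abstract's requirement that both correspond to the same region in the feature map.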

Type: Grant

Filed: March 15, 2018

Date of Patent: July 14, 2020

Assignee: Facebook, Inc.

Inventors: Kaiming He, Georgia Gkioxari, Piotr Dollar, Ross Girshick

Systems and Methods for Optimizing Pose Estimation

Publication number: 20190171871

Abstract: In one embodiment, a system may access first, second, and third probability models that are respectively associated with predetermined first and second body parts and a predetermined segment connecting the first and second body parts. Each model includes probability values associated with regions in an image, with each value representing the probability of the associated region containing the associated body part or segment. The system may select a first and second region based on the first probability model and a third region based on the second probability model. Based on the third probability model, the system may compute a first probability score for regions connecting the first and third regions and a second probability score for regions connecting the second and third regions. Based on the first and second probability scores, the system may select the first region to indicate where the predetermined first body part appears in the image.

Type: Application

Filed: December 31, 2018

Publication date: June 6, 2019

Inventors: Peizhao Zhang, Peter Vajda, Kevin Matzen, Ross Girshick

Machine-Learning Models Based on Non-local Neural Networks

Publication number: 20190156210

Abstract: In one embodiment, a method includes training a baseline machine-learning model based on a neural network comprising a plurality of stages, wherein each stage comprises a plurality of neural blocks, accessing a plurality of training samples comprising a plurality of content objects, respectively, determining one or more non-local operations, wherein each non-local operation is based on one or more pairwise functions and one or more unary functions, generating one or more non-local blocks based on the plurality of training samples and the one or more non-local operations, determining a stage from the plurality of stages of the neural network, and training a non-local machine-learning model by inserting each of the one or more non-local blocks in between at least two of the plurality of neural blocks in the determined stage of the neural network.

Type: Application

Filed: November 15, 2018

Publication date: May 23, 2019

Inventors: Kaiming He, Ross Girshick, Xiaolong Wang

Object detection and classification in images

Patent number: 9858496

Abstract: Systems, methods, and computer-readable media for providing fast and accurate object detection and classification in images are described herein. In some examples, a computing device can receive an input image. The computing device can process the image, and generate a convolutional feature map. In some configurations, the convolutional feature map can be processed through a Region Proposal Network (RPN) to generate proposals for candidate objects in the image. In various examples, the computing device can process the convolutional feature map with the proposals through a Fast Region-Based Convolutional Neural Network (FRCN) proposal classifier to determine a class of each object in the image and a confidence score associated therewith. The computing device can then provide a requestor with an output including the object classification and/or confidence score.

Type: Grant

Filed: January 20, 2016

Date of Patent: January 2, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jian Sun, Ross Girshick, Shaoqing Ren, Kaiming He

Object Detection and Classification in Images

Publication number: 20170206431

Abstract: Systems, methods, and computer-readable media for providing fast and accurate object detection and classification in images are described herein. In some examples, a computing device can receive an input image. The computing device can process the image, and generate a convolutional feature map. In some configurations, the convolutional feature map can be processed through a Region Proposal Network (RPN) to generate proposals for candidate objects in the image. In various examples, the computing device can process the convolutional feature map with the proposals through a Fast Region-Based Convolutional Neural Network (FRCN) proposal classifier to determine a class of each object in the image and a confidence score associated therewith. The computing device can then provide a requestor with an output including the object classification and/or confidence score.

Type: Application

Filed: January 20, 2016

Publication date: July 20, 2017

Inventors: Jian Sun, Ross Girshick, Shaoqing Ren, Kaiming He