Patents by Inventor Lorenzo Torresani

Lorenzo Torresani has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Anticipating future video based on present video

Patent number: 11636681

Abstract: In one embodiment, a method includes accessing a first set of images of multiple images of a scene, wherein the first set of images show the scene during a time period. The method includes generating, by processing the first set of images using a first machine-learning model, one or more attributes representing observed actions performed in the scene during the time period. The method includes predicting, by processing the generated one or more attributes using a second machine-learning model, one or more actions that would happen in the scene after the time period.

Type: Grant

Filed: November 19, 2019

Date of Patent: April 25, 2023

Assignee: Meta Platforms, Inc.

Inventors: Heng Wang, Du Le Hong Tran, Antoine Miech, Lorenzo Torresani
CLASSIFYING A VIDEO STREAM USING A SELF-ATTENTION-BASED MACHINE-LEARNING MODEL

Publication number: 20220253633

Abstract: In one embodiment, a method includes accessing a stream of F video frames, where each of the F video frames includes N patches that are non-overlapping, generating an initial embedding vector for each of the N×F patches in the F video frames, generating a classification embedding by processing the generated N×F initial embedding vectors using a self-attention-based machine-learning model that computes a temporal attention and a spatial attention for each of the N×F patches, and determining a class of the stream of video frames based on the generated classification embedding.

Type: Application

Filed: August 30, 2021

Publication date: August 11, 2022

Inventors: Gediminas Bertasius, Heng Wang, Lorenzo Torresani
Task-Specific Text Generation Based On Multimodal Inputs

Publication number: 20220222435

Abstract: In one embodiment, a method includes accessing first sets of tokens associated with a desired task and one or more modalities associated with a context of the desired task, determining a second set of tokens for each of the one or more modalities using a classifier network associated with the modality, generating a number of embedding vectors by mapping the first sets of tokens and the second set of tokens associated with each of the one or more modalities to an embedding space, and producing a sequence of words addressing the desired task by processing the number of embedding vectors with an encoder-decoder network.

Type: Application

Filed: June 4, 2021

Publication date: July 14, 2022

Inventors: Xudong Lin, Gediminas Bertasius, Jue Wang, Devi Niru Parikh, Lorenzo Torresani
Convolutional neural network based on groupwise convolution for efficient video analysis

Patent number: 10984245

Abstract: In one embodiment, a method includes receiving a request for information associated with a video, determining the information associated with the video by processing the video using a machine-learning model which is based on a convolutional neural network comprising a plurality of layers, wherein at least one of the plurality of layers comprises one or more building blocks, wherein at least one of the one or more building blocks comprises a first filter configured to perform a three-dimensional (3D) pointwise convolutional operation and a second filter configured to perform a three-dimensional (3D) groupwise convolutional operation, and outputting the information associated with the video in response to the request.

Type: Grant

Filed: February 26, 2019

Date of Patent: April 20, 2021

Assignee: Facebook, Inc.

Inventors: Du Le Hong Tran, Kaiming He, Heng Wang, Matthew Dan Feiszli, Lorenzo Torresani
Anticipating Future Video Based on Present Video

Publication number: 20200160064

Abstract: In one embodiment, a method includes accessing a first set of images of multiple images of a scene, wherein the first set of images show the scene during a time period. The method includes generating, by processing the first set of images using a first machine-learning model, one or more attributes representing observed actions performed in the scene during the time period. The method includes predicting, by processing the generated one or more attributes using a second machine-learning model, one or more actions that would happen in the scene after the time period.

Type: Application

Filed: November 19, 2019

Publication date: May 21, 2020

Inventors: Heng Wang, Du Le Hong Tran, Antoine Miech, Lorenzo Torresani
Techniques for enabling or establishing the use of face recognition algorithms

Patent number: 9690979

Abstract: Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.

Type: Grant

Filed: January 13, 2014

Date of Patent: June 27, 2017

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Dragomir Anguelov, Lorenzo Torresani, Vincent O. Vanhoucke, Munjal Shah, Diem Thanh Vu, Kuang-chih Lee
TECHNIQUES FOR ENABLING OR ESTABLISHING THE USE OF FACE RECOGNITION ALGORITHMS

Publication number: 20170169286

Abstract: Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.

Type: Application

Filed: January 13, 2014

Publication date: June 15, 2017

Applicant: Google Inc.

Inventors: Salih Burak Gokturk, Dragomir Anguelov, Lorenzo Torresani, Vincent O. Vanhoueke, Munjal Shah, Diem Thanh Vu, Kuang-chih Lee
TECHNIQUES FOR ENABLING OR ESTABLISHING THE USE OF FACE RECOGNITION ALGORITHMS

Publication number: 20150199560

Abstract: Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.

Type: Application

Filed: January 13, 2014

Publication date: July 16, 2015

Applicant: Google Inc.

Inventors: Salih Burak Gokturk, Dragomir Anguelov, Lorenzo Torresani, Vincent O. Vanhoueke, Munjal Shah, Diem Thanh Vu, Kuang-chih Lee
System and method for enabling image searching using manual enrichment, classification, and/or segmentation

Patent number: 9082162

Abstract: A system includes an image analysis module that is configured to programmatically analyze individual images in a collection of images in order to determine information about each image in the collection. The system may also include a manual interface that is configured to (i) interface with one or more human editors, and (ii) displays a plurality of panels concurrently. Individual panels may be provided for one or more analyzed images, and individual panels may be configured to display information that is at least indicative of the one or more images of that panel and/or of the information determined from the one or more images.

Type: Grant

Filed: September 14, 2012

Date of Patent: July 14, 2015

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
System and method for search portions of objects in images and features thereof

Patent number: 9008435

Abstract: Embodiments enable searching of portions of objects in images, including programmatically analyzing each image in a collection in order to determine image data that, for individual images in the collection, represents one or more visual characteristics of a portion of an object shown in that image. A user is enabled to specify one or more search criteria that includes image data, and a search result may be determined based on one or more images in the collection that show a corresponding object that has a portion that satisfies a threshold. The threshold is defined at least in part by the one or more search criteria.

Type: Grant

Filed: September 14, 2012

Date of Patent: April 14, 2015

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
System and method for using image analysis and search in E-commerce

Patent number: 8732030

Abstract: Embodiments described herein provide a system and method for providing merchandise items at a network site. According to an embodiment, an image of a merchandise item is obtained. The image is programmatically analyzed to determine information about the merchandise item. The information is used to generate a presentation that includes the merchandise item.

Type: Grant

Filed: February 16, 2012

Date of Patent: May 20, 2014

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
Techniques for enabling or establishing the use of face recognition algorithms

Patent number: 8630493

Abstract: Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.

Type: Grant

Filed: December 7, 2010

Date of Patent: January 14, 2014

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke, Munjal Shah, Diem Vu, Kuang-Chih Lee
Techniques for enabling or establishing the use of face recognition algorithms

Patent number: 8571272

Abstract: Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.

Type: Grant

Filed: March 12, 2007

Date of Patent: October 29, 2013

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke, Munjal Shah, Diem Vu, Kuang-Chih Lee
SYSTEM AND METHOD FOR ENABLING IMAGE SEARCHING USING MANUAL ENRICHMENT, CLASSIFICATION, AND/OR SEGMENTATION

Publication number: 20130127893

Abstract: A system includes an image analysis module that is configured to programmatically analyze individual images in a collection of images in order to determine information about each image in the collection. The system may also include a manual interface that is configured to (i) interface with one or more human editors, and (ii) displays a plurality of panels concurrently. Individual panels may be provided for one or more analyzed images, and individual panels may be configured to display information that is at least indicative of the one or more images of that panel and/or of the information determined from the one or more images.

Type: Application

Filed: September 14, 2012

Publication date: May 23, 2013

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
SYSTEM AND METHOD FOR SEARCH PORTIONS OF OBJECTS IN IMAGES AND FEATURES THEREOF

Publication number: 20130121571

Abstract: Embodiments enable searching of portions of objects in images, including programmatically analyzing each image in a collection in order to determine image data that, for individual images in the collection, represents one or more visual characteristics of a portion of an object shown in that image. A user is enabled to specify one or more search criteria that includes image data, and a search result may be determined based on one or more images in the collection that show a corresponding object that has a portion that satisfies a threshold. The threshold is defined at least in part by the one or more search criteria.

Type: Application

Filed: September 14, 2012

Publication date: May 16, 2013

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
Techniques for enabling or establishing the use of face recognition algorithms

Patent number: 8385633

Abstract: Embodiments described herein facilitate or enhance the implementation of image recognition processes which can perform recognition on images to identify objects and/or faces by class or by people.

Type: Grant

Filed: December 7, 2010

Date of Patent: February 26, 2013

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke, Munjal Shah, Diem Vu, Kuang-Chih Lee
Content-based information retrieval

Patent number: 8346800

Abstract: Content-based information retrieval is described. In an example, a query item such as an image, document, email or other item is presented and items with similar content are retrieved from a database of items. In an example, each time a query is presented, a classifier is formed based on that query and using a training set of items. For example, the classifier is formed in real-time and is formed in such a way that a limit on the proportion of the items in the database that will be retrieved is set. In an embodiment, the query item is analyzed to identify tokens in that item and subsets of those tokens are selected to form the classifier. For example, the subsets of tokens are combined using Boolean operators in a manner which is efficient for searching on particular types of database.

Type: Grant

Filed: April 2, 2009

Date of Patent: January 1, 2013

Assignee: Microsoft Corporation

Inventors: Martin Szummer, Andrew Fitzgibbon, Lorenzo Torresani
System and method for search portions of objects in images and features thereof

Patent number: 8345982

Abstract: Embodiments enable searching of portions of objects in images, including programmatically analyzing each image in a collection in order to determine image data that, for individual images in the collection, represents one or more visual characteristics of a portion of an object shown in that image. A user is enabled to specify one or more search criteria that includes image data, and a search result may be determined based on one or more images in the collection that show a corresponding object that has a portion that satisfies a threshold. The threshold is defined at least in part by the one or more search criteria.

Type: Grant

Filed: December 28, 2009

Date of Patent: January 1, 2013

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
System and Method for Using Image Analysis and Search in E-Commerce

Publication number: 20120323738

Abstract: Embodiments described herein provide a system and method for providing merchandise items at a network site. According to an embodiment, an image of a merchandise item is obtained. The image is programmatically analyzed to determine information about the merchandise item. The information is used to generate a presentation that includes the merchandise item.

Type: Application

Filed: February 16, 2012

Publication date: December 20, 2012

Inventors: Salih Burak GOKTURK, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke
System and method for enabling image searching using manual enrichment, classification, and/or segmentation

Patent number: 8315442

Abstract: Embodiments described herein provide for a system for creating a data collection of recognized images. The system includes an image analysis module that is configured to programmatically analyze individual images in a collection of images in order to determine information about each image in the collection. The system may also include a manual interface that is configured to (i) interface with one or more human editors, and (ii) displays a plurality of panels concurrently. Individual panels may be provided for one or more analyzed images, and individual panels may be configured to display information that is at least indicative of the one or more images of that panel and/or of the information determined from the one or more images. Additionally, the manual interface enables the one or more human editors to view the plurality of panels concurrently and to interact with each of the plurality of panels in order to correct or remove any information that is incorrectly determined from the image of that panel.

Type: Grant

Filed: December 28, 2009

Date of Patent: November 20, 2012

Assignee: Google Inc.

Inventors: Salih Burak Gokturk, Baris Sumengen, Diem Vu, Navneet Dalal, Danny Yang, Xiaofan Lin, Azhar Khan, Munjal Shah, Dragomir Anguelov, Lorenzo Torresani, Vincent Vanhoucke

1 2 next