Patents by Inventor Vincent Michael Casser
Vincent Michael Casser has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12260576Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.Type: GrantFiled: September 13, 2023Date of Patent: March 25, 2025Assignee: Google LLCInventors: Vincent Michael Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
-
Publication number: 20230419521Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.Type: ApplicationFiled: September 13, 2023Publication date: December 28, 2023Inventors: Vincent Michael Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
-
Publication number: 20230334842Abstract: Methods, systems, and apparatus for processing inputs that include video frames using neural networks. In one aspect, a system comprises one or more computers configured to obtain a set of one or more training images and, for each training image, ground truth instance data that identifies, for each of one or more object instances, a corresponding region of the training image that depicts the object instance. For each training image in the set, the one or more computers process the training image using an instance segmentation neural network to generate an embedding output comprising a respective embedding for each of a plurality of output pixels. The one or more computers then train the instance segmentation neural network to minimize a loss function.Type: ApplicationFiled: April 18, 2023Publication date: October 19, 2023Inventors: Alex Zihao Zhu, Vincent Michael Casser, Henrik Kretzschmar, Reza Mahjourian, Soeren Pirk
-
Patent number: 11783500Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.Type: GrantFiled: September 5, 2019Date of Patent: October 10, 2023Assignee: Google LLCInventors: Vincent Michael Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
-
Publication number: 20230177822Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for rendering a new image that depicts a scene from a perspective of a camera at a new camera viewpoint.Type: ApplicationFiled: December 2, 2022Publication date: June 8, 2023Inventors: Vincent Michael Casser, Henrik Kretzschmar, Matthew Justin Tancik, Sabeek Mani Pradhan, Benjamin Joseph Mildenhall, Pratul Preeti Srinivasan, Jonathan Tilton Barron
-
Publication number: 20230110391Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for determining the visibility of query points using depth estimates generated by a neural network.Type: ApplicationFiled: September 29, 2022Publication date: April 13, 2023Inventors: Vincent Michael Casser, Bradley Dodson
-
Patent number: 11544498Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network using consistency measures. One of the methods includes processing a particular training example from a mediator training data set using a first neural network to generate a first output for a first machine learning task; processing the particular training example in the mediator training data set using each of one or more second neural networks, wherein each second neural network is configured to generate a second output for a respective second machine learning task; determining, for each second machine learning task, a consistency target output for the first machine learning task; determining, for each second machine learning task, an error between the first output and the consistency target output corresponding to the second machine learning task; and generating a parameter update for the first neural network from the determined errors.Type: GrantFiled: March 5, 2021Date of Patent: January 3, 2023Assignee: Google LLCInventors: Ariel Gordon, Soeren Pirk, Anelia Angelova, Vincent Michael Casser, Yao Lu, Anthony Brohan, Zhao Chen, Jan Dlabal
-
Publication number: 20210390407Abstract: Methods, computer systems, and apparatus, including computer programs encoded on computer storage media, for training a perspective computer vision model. The model is configured to receive input data characterizing an input scene in an environment from an input viewpoint and to process the input data in accordance with a set of model parameters to generate an output perspective representation of the scene from the input viewpoint. The system trains the model based on first data characterizing a scene in the environment from a first viewpoint and second data characterizing the scene in the environment from a second, different viewpoint.Type: ApplicationFiled: June 10, 2021Publication date: December 16, 2021Inventors: Vincent Michael Casser, Yuning Chai, Dragomir Anguelov, Hang Zhao, Henrik Kretzschmar, Reza Mahjourian, Anelia Angelova, Ariel Gordon, Soeren Pirk
-
Publication number: 20210319578Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.Type: ApplicationFiled: September 5, 2019Publication date: October 14, 2021Inventors: Vincent Michael Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
-
Publication number: 20210279511Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network using consistency measures. One of the methods includes processing a particular training example from a mediator training data set using a first neural network to generate a first output for a first machine learning task; processing the particular training example in the mediator training data set using each of one or more second neural networks, wherein each second neural network is configured to generate a second output for a respective second machine learning task; determining, for each second machine learning task, a consistency target output for the first machine learning task; determining, for each second machine learning task, an error between the first output and the consistency target output corresponding to the second machine learning task; and generating a parameter update for the first neural network from the determined errors.Type: ApplicationFiled: March 5, 2021Publication date: September 9, 2021Inventors: Ariel Gordon, Soeren Pirk, Anelia Angelova, Vincent Michael Casser, Yao Lu, Anthony Brohan, Zhao Chen, Jan Dlabal