Patents by Inventor Anelia Angelova
Anelia Angelova has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240037926
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing instance segmentation by detecting and segmenting individual objects in an image. In one aspect, a method comprises: processing an image to generate data identifying a region of the image that depicts a particular object; obtaining data defining a plurality of example object segmentations; generating a respective weight value for each of the example object segmentations; for each of a plurality of pixels in the region of the image, determining a score characterizing a likelihood that the pixel is included in the particular object depicted in the region of the image using: (i) the example object segmentations, and (ii) the weight values for the example object segmentations; and generating a segmentation of the particular object depicted in the region of the image using the scores for the pixels in the region of the image.
Type: Application
Filed: October 12, 2023
Publication date: February 1, 2024
Inventors: Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin
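The scoring step described in the abstract can be pictured as a weighted vote over example masks. The following is a minimal NumPy sketch of that idea only, with softmax weighting and a 0.5 threshold chosen as illustrative assumptions; the patented method's actual weighting and decision rule may differ.

```python
import numpy as np

def segment_from_examples(example_masks, weights):
    """Score each pixel as a weighted combination of example object
    segmentations. `example_masks` is (K, H, W) with values in [0, 1];
    `weights` is a length-K vector of raw per-example scores."""
    w = np.exp(weights - weights.max())
    w /= w.sum()                                    # softmax over examples
    # Per-pixel likelihood score: weighted average of the example masks.
    scores = np.tensordot(w, example_masks, axes=1)  # (H, W)
    return scores > 0.5                              # threshold into a mask

masks = np.stack([np.eye(4), np.ones((4, 4))])  # two toy 4x4 examples
seg = segment_from_examples(masks, np.array([2.0, 0.0]))
```

With the first example weighted heavily, the output segmentation follows the diagonal mask.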
-
Publication number: 20240029413
Abstract: A method involves training a model by dynamically adjusting the number of examples within each training batch. The dynamic adjustment is accomplished by adjusting the number of examples per task within each training batch according to the performance of the model on the tasks that the model is being trained on. In some embodiments, this method is applied to cross-modal vision-language tasks. This method may also be applied to the pre-training of a model that is later fine-tuned for one or more specific tasks.
Type: Application
Filed: July 12, 2023
Publication date: January 25, 2024
Inventors: Anthony Jacob Piergiovanni, Weicheng Kuo, Wei Li, Anelia Angelova
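One way to realize "adjusting the number of examples per task according to performance" is to allocate each task's share of the next batch in proportion to its current loss. The allocation rule and function names below are illustrative stand-ins, not details from the patent.

```python
import numpy as np

def batch_quota(task_losses, batch_size):
    """Allocate examples per task in the next batch proportionally to
    each task's current loss, so harder tasks get more examples."""
    losses = np.asarray(task_losses, dtype=float)
    share = losses / losses.sum()                 # normalized difficulty
    quota = np.floor(share * batch_size).astype(int)
    quota[np.argmax(share)] += batch_size - quota.sum()  # absorb rounding
    return quota

q = batch_quota([2.0, 1.0, 1.0], 32)  # task 0 is currently hardest
```

A real trainer would recompute the quota each step (or each evaluation) from fresh per-task metrics.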
-
Publication number: 20230419521
Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.
Type: Application
Filed: September 13, 2023
Publication date: December 28, 2023
Inventors: Vincent Michael Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
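The abstract separates motion into a camera component (from background images) and per-object components. A toy sketch of how such outputs might be composed into one per-pixel motion field; the shapes and the additive composition are illustrative assumptions, not the patented formulation.

```python
import numpy as np

def total_motion_field(camera_flow, object_flows, object_masks):
    """Compose a per-pixel motion field: camera motion applies to the
    whole frame, and each object's motion is added inside its mask."""
    field = camera_flow.copy()
    for flow, mask in zip(object_flows, object_masks):
        field[mask] += flow           # objects move on top of camera motion
    return field

cam = np.zeros((4, 4, 2))                       # static camera, toy frame
obj_mask = np.zeros((4, 4), dtype=bool)
obj_mask[1, 1] = True                           # one moving object pixel
out = total_motion_field(cam, [np.array([1.0, 0.0])], [obj_mask])
```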
-
Publication number: 20230409899
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing a network input using a computer vision neural network with learned tokenization.
Type: Application
Filed: June 21, 2022
Publication date: December 21, 2023
Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Anelia Angelova, Anurag Arnab, Mostafa Dehghani
-
Publication number: 20230394306
Abstract: Provided is an efficient multi-modal processing model. The multi-modal processing model can process input data from multiple different domains to generate a prediction for a multi-modal processing task. A machine-learned multi-modal processing model can include an adaptive tokenization layer that is configured to adaptively tokenize features generated from the multi-modal inputs into sets of tokens. Specifically, the tokens may have a smaller data size relative to the features from the inputs, enabling fewer processing operations to be performed overall and thereby improving the efficiency of the model.
Type: Application
Filed: June 2, 2023
Publication date: December 7, 2023
Inventors: Anthony J. Piergiovanni, Wei-Cheng Kuo, Anelia Angelova
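The efficiency argument is that N input features can be compressed into far fewer tokens before the heavy processing begins. A minimal sketch of attention-style tokenization; the random projection here is a stand-in for the learned tokenization parameters, and the shapes are illustrative.

```python
import numpy as np

def adaptive_tokenize(features, num_tokens, seed=0):
    """Compress an (N, D) feature map into `num_tokens` tokens by
    attending over positions with a (stand-in) learned weight matrix."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((features.shape[0], num_tokens))
    attn = np.exp(W) / np.exp(W).sum(axis=0, keepdims=True)  # softmax over positions
    return attn.T @ features          # (num_tokens, D): fewer items downstream

feats = np.ones((16, 8))              # 16 input features of dimension 8
tokens = adaptive_tokenize(feats, 4)  # reduced to 4 tokens
```

Downstream layers then operate on 4 tokens instead of 16 features, which is where the claimed operation savings come from.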
-
Patent number: 11823443
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing instance segmentation by detecting and segmenting individual objects in an image. In one aspect, a method comprises: processing an image to generate data identifying a region of the image that depicts a particular object; obtaining data defining a plurality of example object segmentations; generating a respective weight value for each of the example object segmentations; for each of a plurality of pixels in the region of the image, determining a score characterizing a likelihood that the pixel is included in the particular object depicted in the region of the image using: (i) the example object segmentations, and (ii) the weight values for the example object segmentations; and generating a segmentation of the particular object depicted in the region of the image using the scores for the pixels in the region of the image.
Type: Grant
Filed: August 14, 2019
Date of Patent: November 21, 2023
Assignee: Google LLC
Inventors: Weicheng Kuo, Anelia Angelova, Tsung-Yi Lin
-
Patent number: 11790549
Abstract: A system includes a neural network implemented by one or more computers, in which the neural network includes an image depth prediction neural network and a camera motion estimation neural network. The neural network is configured to receive a sequence of images. The neural network is configured to process each image in the sequence of images using the image depth prediction neural network to generate, for each image, a respective depth output that characterizes a depth of the image, and to process a subset of images in the sequence of images using the camera motion estimation neural network to generate a camera motion output that characterizes the motion of a camera between the images in the subset. The image depth prediction neural network and the camera motion estimation neural network have been jointly trained using an unsupervised learning technique.
Type: Grant
Filed: May 27, 2022
Date of Patent: October 17, 2023
Assignee: Google LLC
Inventors: Reza Mahjourian, Martin Wicke, Anelia Angelova
-
Patent number: 11783500
Abstract: A system for generating a depth output for an image is described. The system receives input images that depict the same scene, each input image including one or more potential objects. The system generates, for each input image, a respective background image and processes the background images to generate a camera motion output that characterizes the motion of the camera between the input images. For each potential object, the system generates a respective object motion output for the potential object based on the input images and the camera motion output. The system processes a particular input image of the input images using a depth prediction neural network (NN) to generate a depth output for the particular input image, and updates the current values of parameters of the depth prediction NN based on the particular depth output, the camera motion output, and the object motion outputs for the potential objects.
Type: Grant
Filed: September 5, 2019
Date of Patent: October 10, 2023
Assignee: Google LLC
Inventors: Vincent Michael Casser, Soeren Pirk, Reza Mahjourian, Anelia Angelova
-
Patent number: 11769269
Abstract: A method includes receiving a first depth map that includes a plurality of first pixel depths and a second depth map that includes a plurality of second pixel depths. The first depth map corresponds to a reference depth scale and the second depth map corresponds to a relative depth scale. The method includes aligning the second pixel depths with the first pixel depths. The method includes transforming the aligned region of the second pixel depths such that transformed second edge pixel depths of the aligned region are coextensive with first edge pixel depths surrounding the corresponding region of the first pixel depths. The method includes generating a third depth map. The third depth map includes a first region corresponding to the first pixel depths and a second region corresponding to the transformed and aligned region of the second pixel depths.
Type: Grant
Filed: August 1, 2022
Date of Patent: September 26, 2023
Assignee: Google LLC
Inventors: Guy Satat, Michael Quinlan, Sean Kirmani, Anelia Angelova, Ariel Gordon
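The "aligning" step must reconcile a relative depth scale with a reference one. A common way to do that is a least-squares scale-and-shift fit; the sketch below shows only this alignment step under that assumption, and does not reproduce the edge-matching transform the claim describes.

```python
import numpy as np

def align_depths(reference, relative, region):
    """Fit a least-squares scale and shift mapping relative-scale depths
    onto the reference depth scale inside `region` (a boolean mask)."""
    x = relative[region]
    A = np.stack([x, np.ones_like(x)], axis=1)
    (scale, shift), *_ = np.linalg.lstsq(A, reference[region], rcond=None)
    aligned = relative.copy()
    aligned[region] = scale * x + shift   # relative depths now on reference scale
    return aligned

rel = np.arange(9.0).reshape(3, 3) + 1.0   # relative-scale depths
ref = 0.5 * rel + 1.0                      # reference-scale depths of same scene
out = align_depths(ref, rel, np.ones((3, 3), dtype=bool))
```

Because the toy data are exactly linearly related, the fit recovers the scale and shift and the aligned map matches the reference.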
-
Patent number: 11734847
Abstract: A system includes an image depth prediction neural network implemented by one or more computers. The image depth prediction neural network is a recurrent neural network that is configured to receive a sequence of images and, for each image in the sequence: process the image in accordance with a current internal state of the recurrent neural network to (i) update the current internal state and (ii) generate a depth output that characterizes a predicted depth of a future image in the sequence.
Type: Grant
Filed: January 15, 2021
Date of Patent: August 22, 2023
Assignee: Google LLC
Inventors: Anelia Angelova, Martin Wicke, Reza Mahjourian
-
Publication number: 20230114556
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing a network input using a neural network to generate a network output.
Type: Application
Filed: July 14, 2021
Publication date: April 13, 2023
Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Anelia Angelova
-
Publication number: 20230035454
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating an optical flow label from a lidar point cloud. One of the methods includes obtaining data specifying a training example, including a first image of a scene in an environment captured at a first time point and a second image of the scene in the environment captured at a second time point. For each of a plurality of lidar points, a respective second corresponding pixel in the second image is obtained and a respective velocity estimate for the lidar point at the second time point is obtained. A respective first corresponding pixel in the first image is determined using the velocity estimate for the lidar point. A proxy optical flow ground truth for the training example is generated based on an estimate of optical flow of the pixel between the first and second images.
Type: Application
Filed: July 23, 2021
Publication date: February 2, 2023
Inventors: Daniel Rudolf Maurer, Alper Ayvaci, Robert William Anderson, Rico Jonschkowski, Austin Charles Stone, Anelia Angelova, Nichola Abdo, Christopher John Sweeney
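The geometry behind this label generation can be sketched in a few lines: rewind each lidar point with its velocity estimate, project both positions into the image, and take the pixel difference as the flow label. The pinhole camera, unit intrinsics, and constant-velocity model below are illustrative assumptions, not details from the application.

```python
import numpy as np

def proxy_flow(points_t2, velocities, dt, fx=1.0, fy=1.0):
    """Build a proxy optical-flow label from lidar points observed at
    time t2, each with a velocity estimate: move the point back to t1,
    project both positions, and difference the pixel coordinates."""
    points_t1 = points_t2 - velocities * dt       # constant-velocity rewind

    def project(p):                               # toy pinhole projection
        return np.stack([fx * p[:, 0] / p[:, 2],
                         fy * p[:, 1] / p[:, 2]], axis=1)

    return project(points_t2) - project(points_t1)  # flow from t1 to t2

pts = np.array([[1.0, 0.0, 2.0]])   # one lidar point at t2 (x, y, z)
vel = np.array([[2.0, 0.0, 0.0]])   # moving along x
flow = proxy_flow(pts, vel, dt=0.5)
```

The point was at x=0 half a second earlier, so at depth 2 the proxy flow is half a pixel along x.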
-
Patent number: 11544498
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network using consistency measures. One of the methods includes processing a particular training example from a mediator training data set using a first neural network to generate a first output for a first machine learning task; processing the particular training example in the mediator training data set using each of one or more second neural networks, wherein each second neural network is configured to generate a second output for a respective second machine learning task; determining, for each second machine learning task, a consistency target output for the first machine learning task; determining, for each second machine learning task, an error between the first output and the consistency target output corresponding to the second machine learning task; and generating a parameter update for the first neural network from the determined errors.
Type: Grant
Filed: March 5, 2021
Date of Patent: January 3, 2023
Assignee: Google LLC
Inventors: Ariel Gordon, Soeren Pirk, Anelia Angelova, Vincent Michael Casser, Yao Lu, Anthony Brohan, Zhao Chen, Jan Dlabal
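The error computation in the claim can be pictured as: map each auxiliary task's output into the first task's output space, then measure the mismatch. In this sketch the converter functions are illustrative stand-ins for the cross-task relationships (here, depth vs. its reciprocal, disparity); the patent does not specify these particular tasks.

```python
import numpy as np

def consistency_errors(first_output, second_outputs, converters):
    """For each auxiliary task, build a consistency target for the first
    task from that task's output, then take the squared error between
    the first network's output and the target."""
    errors = []
    for out, convert in zip(second_outputs, converters):
        target = convert(out)             # consistency target for task 1
        errors.append(float(np.mean((first_output - target) ** 2)))
    return errors

depth = np.array([1.0, 2.0, 3.0])            # first task: depth
disparity = np.array([1.0, 0.5, 0.25])       # second task: disparity
errs = consistency_errors(depth, [disparity], [lambda d: 1.0 / d])
```

A training step would then derive the first network's parameter update from these errors, e.g. by backpropagating their sum.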
-
Publication number: 20220366590
Abstract: A method includes receiving a first depth map that includes a plurality of first pixel depths and a second depth map that includes a plurality of second pixel depths. The first depth map corresponds to a reference depth scale and the second depth map corresponds to a relative depth scale. The method includes aligning the second pixel depths with the first pixel depths. The method includes transforming the aligned region of the second pixel depths such that transformed second edge pixel depths of the aligned region are coextensive with first edge pixel depths surrounding the corresponding region of the first pixel depths. The method includes generating a third depth map. The third depth map includes a first region corresponding to the first pixel depths and a second region corresponding to the transformed and aligned region of the second pixel depths.
Type: Application
Filed: August 1, 2022
Publication date: November 17, 2022
Inventors: Guy Satat, Michael Quinlan, Sean Kirmani, Anelia Angelova, Ariel Gordon
-
Publication number: 20220366257
Abstract: Generally, the present disclosure is directed to a neural architecture search process for finding small and fast video processing networks for understanding of video data. The neural architecture search process can automatically design networks that provide comparable video processing performance at a fraction of the computational and storage cost of larger existing models, thereby conserving computing resources such as memory and processor usage.
Type: Application
Filed: September 16, 2020
Publication date: November 17, 2022
Inventors: Anthony J. Piergiovanni, Anelia Angelova, Michael Sahngwon Ryoo
-
Publication number: 20220335624
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a neural network to predict optical flow. One of the methods includes obtaining a batch of one or more training image pairs; for each of the pairs: processing the first training image and the second training image using the neural network to generate a final optical flow estimate; generating a cropped final optical flow estimate from the final optical flow estimate; and training the neural network using the cropped optical flow estimate.
Type: Application
Filed: April 14, 2022
Publication date: October 20, 2022
Inventors: Daniel Rudolf Maurer, Austin Charles Stone, Alper Ayvaci, Anelia Angelova, Rico Jonschkowski
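The cropping step itself is simple to picture: discard a border of the dense flow estimate before the loss is computed, so unreliable edge predictions do not drive training. The margin width below is an illustrative choice, not a value from the application.

```python
import numpy as np

def crop_flow(flow, margin):
    """Drop a `margin`-pixel border from an (H, W, 2) flow field before
    it is used in the training loss."""
    return flow[margin:flow.shape[0] - margin,
                margin:flow.shape[1] - margin]

flow = np.ones((8, 8, 2))     # toy dense flow estimate
cropped = crop_flow(flow, 2)  # interior 4x4 region survives
```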
-
Publication number: 20220305647
Abstract: Techniques are disclosed that enable the generation of predicted sequences of terminals using a generator model portion of a prediction model. Various implementations include controlling actuators of a robot based on the predicted sequences of terminals. Additional or alternative implementations include jointly training the generator model portion of the prediction model using a discriminator model portion of the prediction model using, for example, stochastic adversarial based sampling.
Type: Application
Filed: August 27, 2019
Publication date: September 29, 2022
Inventors: Anthony Jacob Piergiovanni, Anelia Angelova, Alexander Toshev, Michael Ryoo
-
Patent number: 11450018
Abstract: A method includes receiving a first depth map that includes a plurality of first pixel depths and a second depth map that includes a plurality of second pixel depths. The first depth map corresponds to a reference depth scale and the second depth map corresponds to a relative depth scale. The method includes aligning the second pixel depths with the first pixel depths. The method includes transforming the aligned region of the second pixel depths such that transformed second edge pixel depths of the aligned region are coextensive with first edge pixel depths surrounding the corresponding region of the first pixel depths. The method includes generating a third depth map. The third depth map includes a first region corresponding to the first pixel depths and a second region corresponding to the transformed and aligned region of the second pixel depths.
Type: Grant
Filed: December 24, 2019
Date of Patent: September 20, 2022
Assignee: X Development LLC
Inventors: Guy Satat, Michael Quinlan, Sean Kirmani, Anelia Angelova, Ariel Gordon
-
Publication number: 20220292701
Abstract: A system includes a neural network implemented by one or more computers, in which the neural network includes an image depth prediction neural network and a camera motion estimation neural network. The neural network is configured to receive a sequence of images. The neural network is configured to process each image in the sequence of images using the image depth prediction neural network to generate, for each image, a respective depth output that characterizes a depth of the image, and to process a subset of images in the sequence of images using the camera motion estimation neural network to generate a camera motion output that characterizes the motion of a camera between the images in the subset. The image depth prediction neural network and the camera motion estimation neural network have been jointly trained using an unsupervised learning technique.
Type: Application
Filed: May 27, 2022
Publication date: September 15, 2022
Inventors: Reza Mahjourian, Martin Wicke, Anelia Angelova
-
Publication number: 20220189154
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining one or more neural network architectures of a neural network for performing a video processing neural network task. In one aspect, a method comprises: at each of a plurality of iterations: selecting a parent neural network architecture from a set of neural network architectures; training a neural network having the parent neural network architecture to perform the video processing neural network task, comprising determining trained values of connection weight parameters of the parent neural network architecture; generating a new neural network architecture based at least in part on the trained values of the connection weight parameters of the parent neural network architecture; and adding the new neural network architecture to the set of neural network architectures.Type: Application
Filed: May 22, 2020
Publication date: June 16, 2022
Inventors: Michael Sahngwon Ryoo, Anthony Jacob Piergiovanni, Mingxing Tan, Anelia Angelova
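The iterative loop in this claim (select a parent, train it, derive a child from the trained weights, grow the population) has the shape of an evolutionary search. A toy sketch under that reading; the `train` and `mutate` callables are caller-supplied stand-ins for the real training and architecture-generation steps, and the "architecture" here is just an integer width.

```python
import random

def evolve(population, train, mutate, rounds, seed=0):
    """One-loop sketch of the claimed iteration: pick a parent
    architecture, train it to get fitness and trained weights, generate
    a child using those trained values, and add the child to the set."""
    rng = random.Random(seed)
    for _ in range(rounds):
        parent = rng.choice(population)
        fitness, weights = train(parent)   # trained values of the parent
        child = mutate(parent, weights)    # child derived from trained values
        population.append(child)
    return population

# Toy domain: an "architecture" is a layer width; training is simulated.
pop = evolve([4],
             train=lambda a: (-a, {"w": a}),
             mutate=lambda a, w: a + 1,
             rounds=3)
```

A real search would also use the fitness values to bias parent selection and to prune the population.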