Patents by Inventor Sarah Tariq
Sarah Tariq has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20210150278
Abstract: Techniques for training a machine learned (ML) model to determine depth data based on image data are discussed herein. Training can use stereo image data and depth data (e.g., lidar data). A first (e.g., left) image can be input to a ML model, which can output predicted disparity and/or depth data. The predicted disparity data can be used with second image data (e.g., a right image) to reconstruct the first image. Differences between the first and reconstructed images can be used to determine a loss. Losses may include pixel, smoothing, structural similarity, and/or consistency losses. Further, differences between the depth data and the predicted depth data and/or differences between the predicted disparity data and the predicted depth data can be determined, and the ML model can be trained based on the various losses. Thus, the techniques can use self-supervised training and supervised training to train a ML model.
Type: Application
Filed: November 14, 2019
Publication date: May 20, 2021
Inventors: Thomas Oscar Dudzik, Kratarth Goel, Praveen Srinivasan, Sarah Tariq
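The self-supervised signal in this abstract can be illustrated with a minimal sketch (not the patented implementation; images, disparities, and the clamping rule are hypothetical). On one rectified-stereo scanline, a left pixel at column x with disparity d should match the right-image pixel at column x - d, so the predicted disparities reconstruct the left image from the right, and the pixel-wise difference becomes the loss:

```python
# Illustrative sketch of the reconstruction loss, on a single 1-D scanline.
# For rectified stereo, left pixel x with disparity d matches right pixel x - d.

def reconstruct_left(right_row, disparities):
    """Rebuild the left scanline by sampling the right scanline at x - d."""
    recon = []
    for x, d in enumerate(disparities):
        src = min(max(x - d, 0), len(right_row) - 1)  # clamp to image bounds
        recon.append(right_row[src])
    return recon

def photometric_l1_loss(left_row, recon_row):
    """Mean absolute difference between the left image and its reconstruction."""
    return sum(abs(a - b) for a, b in zip(left_row, recon_row)) / len(left_row)

# A toy scene whose true disparity is a uniform 2 pixels:
left  = [10, 20, 30, 40, 50, 60]
right = [30, 40, 50, 60, 70, 80]          # same scene shifted by 2 columns

recon_good = reconstruct_left(right, [2] * 6)  # correct disparities
recon_bad  = reconstruct_left(right, [0] * 6)  # wrong disparities
```

With correct disparities the reconstruction matches the left image (up to the clamped border pixels), so its photometric loss is lower than with wrong disparities; the abstract's supervised terms would add lidar depth against the predicted depth on top of this.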
-
Publication number: 20210150279
Abstract: Techniques for training a machine learned (ML) model to determine depth data based on image data are discussed herein. Training can use stereo image data and depth data (e.g., lidar data). A first (e.g., left) image can be input to a ML model, which can output predicted disparity and/or depth data. The predicted disparity data can be used with second image data (e.g., a right image) to reconstruct the first image. Differences between the first and reconstructed images can be used to determine a loss. Losses may include pixel, smoothing, structural similarity, and/or consistency losses. Further, differences between the depth data and the predicted depth data and/or differences between the predicted disparity data and the predicted depth data can be determined, and the ML model can be trained based on the various losses. Thus, the techniques can use self-supervised training and supervised training to train a ML model.
Type: Application
Filed: November 14, 2019
Publication date: May 20, 2021
Inventors: Thomas Oscar Dudzik, Kratarth Goel, Praveen Srinivasan, Sarah Tariq
-
Publication number: 20210142078
Abstract: Techniques are disclosed for implementing a neural network that outputs embeddings. Furthermore, techniques are disclosed for using sensor data to train a neural network to learn such embeddings. In some examples, the neural network may be trained to learn embeddings. The embeddings may be used for object identification, object matching, object classification, and/or object tracking in various examples.
Type: Application
Filed: November 3, 2020
Publication date: May 13, 2021
Inventors: Bryce A. Evans, James William Vaisey Philbin, Sarah Tariq
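One of the uses listed in the abstract, object matching, can be sketched without any network at all (the embeddings below are made-up 2-D vectors; a real model would produce them): detections from two frames are matched by nearest-neighbor distance in embedding space.

```python
# Illustrative sketch of embedding-based object matching (hypothetical data):
# each detection carries an embedding, and detections across frames are paired
# by smallest squared distance between embeddings.

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def match(embeddings_t0, embeddings_t1):
    """For each t0 detection, the index of its closest t1 embedding."""
    return [min(range(len(embeddings_t1)),
                key=lambda j: sq_dist(e, embeddings_t1[j]))
            for e in embeddings_t0]

t0 = [(0.0, 1.0), (1.0, 0.0)]
t1 = [(0.9, 0.1), (0.1, 0.9)]   # same two objects, observed slightly moved
assignment = match(t0, t1)       # t0[0] pairs with t1[1], t0[1] with t1[0]
```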
-
Patent number: 10984290
Abstract: Training a machine-learning (ML) architecture to determine three or more outputs at a rate of 30 or more frames per second on consumer grade hardware may comprise jointly training components of the ML using loss(es) determined across the components and/or consistency losses determined between outputs of two or more components. The ML architecture discussed herein may comprise one or more sets of neural network layers and/or respective components for determining a two and/or three-dimensional region of interest, semantic segmentation, direction logits, depth data, and/or instance segmentation associated with an object in an image.
Type: Grant
Filed: December 31, 2019
Date of Patent: April 20, 2021
Assignee: Zoox, Inc.
Inventors: Kratarth Goel, James William Vaisey Philbin, Praveen Srinivasan, Sarah Tariq
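The loss bookkeeping described here can be sketched in a few lines (a hedged illustration only; the head names, weights, and paired outputs are invented): per-component task losses are summed, and a consistency term penalizes disagreement between two heads that should describe the same quantity.

```python
# Hypothetical sketch of joint training with a cross-head consistency loss.

def consistency_loss(out_a, out_b):
    """Squared disagreement between two heads describing the same quantity."""
    return (out_a - out_b) ** 2

def joint_loss(task_losses, paired_outputs, consistency_weight=0.1):
    """Sum per-head task losses plus weighted consistency terms."""
    total = sum(task_losses.values())
    for a, b in paired_outputs:
        total += consistency_weight * consistency_loss(a, b)
    return total

losses = {"roi_2d": 0.4, "semantic_seg": 0.3, "depth": 0.2}
# e.g. depth implied by a 3-D ROI (12.0 m) vs. the depth head's estimate (11.5 m):
loss = joint_loss(losses, paired_outputs=[(12.0, 11.5)])
```

Backpropagating one scalar like this through all components is what makes the training joint rather than per-head.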
-
Patent number: 10963709
Abstract: The techniques discussed herein may comprise refining a classification of an object detected as being represented in sensor data. For example, refining the classification may comprise determining a sub-classification of the object.
Type: Grant
Filed: January 2, 2019
Date of Patent: March 30, 2021
Assignee: Zoox, Inc.
Inventors: Kratarth Goel, Sarah Tariq
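The refinement step can be illustrated minimally (the class hierarchy and scores below are hypothetical, not from the patent): after a detector assigns a coarse label, a second stage selects the best-scoring sub-class permitted under that label.

```python
# Hypothetical sketch of classification refinement via sub-classification.

COARSE_TO_SUB = {
    "vehicle": ["car", "truck", "motorcycle"],
    "pedestrian": ["adult", "child"],
}

def refine_classification(coarse_label, sub_scores):
    """Pick the highest-scoring sub-class allowed under the coarse label."""
    allowed = COARSE_TO_SUB[coarse_label]
    return max(allowed, key=lambda s: sub_scores.get(s, 0.0))

# "child" scores highest overall but is not a valid sub-class of "vehicle":
sub = refine_classification("vehicle", {"car": 0.2, "truck": 0.7, "child": 0.9})
```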
-
Patent number: 10936922
Abstract: Improved techniques for training a machine learning (ML) model are discussed herein. Training the ML model can be based on a subset of examples. In particular, the training can include identifying a reference region associated with an area of the image representing an object, and selecting, based at least in part on a first confidence score associated with a first bounding box, a first hard example for inclusion in the subset of examples. In some cases, the first confidence score and the first bounding box can be associated with a first portion of the feature map. Next, the training can include determining that a first degree of alignment of the first bounding box to the reference region is above a threshold degree of alignment, and in response, replacing the first hard example with a second hard example.
Type: Grant
Filed: June 20, 2018
Date of Patent: March 2, 2021
Assignee: Zoox, Inc.
Inventors: Sarah Tariq, James William Vaisey Philbin, Kratarth Goel
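The selection rule can be sketched as follows (an illustration under stated assumptions: IoU stands in for the abstract's "degree of alignment", and the candidates and threshold are invented). A low-confidence candidate whose box nonetheless aligns well with the reference region is rejected as a hard example and replaced by the next candidate:

```python
# Hedged sketch of hard-example replacement using IoU as alignment.

def iou(box_a, box_b):
    """Intersection-over-union of two (x0, y0, x1, y1) boxes."""
    x0, y0 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    x1, y1 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, x1 - x0) * max(0, y1 - y0)
    def area(b):
        return (b[2] - b[0]) * (b[3] - b[1])
    union = area(box_a) + area(box_b) - inter
    return inter / union if union else 0.0

def select_hard_example(candidates, reference, align_threshold=0.5):
    """candidates: (confidence, box) tuples, sorted ascending by confidence.
    Return the first candidate whose box is NOT well aligned with reference."""
    for conf, box in candidates:
        if iou(box, reference) <= align_threshold:
            return conf, box
    return candidates[-1]  # all candidates were aligned; keep the last one

reference = (0, 0, 10, 10)
candidates = [
    (0.1, (0, 0, 10, 10)),    # low confidence but perfectly aligned: skipped
    (0.2, (20, 20, 30, 30)),  # misaligned: kept as the hard example
]
picked = select_hard_example(candidates, reference)
```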
-
Publication number: 20210049776
Abstract: A machine-learning (ML) architecture may comprise a first ML model and/or an optical flow model that receive, as input, a first image and a second image. The first ML model may output a first feature map corresponding to the first image and a second feature map corresponding to the second image. The optical flow model may output an estimated optical flow. A deformation component may modify the second feature map, as a deformed feature map, based at least in part on the estimated optical flow. The deformed feature map and the first feature map may be concatenated together as a concatenated feature map, which may be provided to a second ML model. The second ML model may be trained to output an output ROI and/or a track in association with an object represented in the first image.
Type: Application
Filed: November 2, 2020
Publication date: February 18, 2021
Inventors: Qijun Tan, Sarah Tariq
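The deformation step can be shown on a 1-D toy feature map (an illustrative sketch only; real feature maps are 2-D tensors and the warp would use bilinear sampling): features from the second image are shifted by the estimated flow so they line up with the first image's features, then the two maps are concatenated per cell.

```python
# Illustrative 1-D sketch of flow-based feature deformation and concatenation.

def deform(feature_map, flow):
    """Warp a 1-D feature map: output[i] samples feature_map[i + flow[i]],
    clamped to the map's bounds."""
    n = len(feature_map)
    return [feature_map[min(max(i + flow[i], 0), n - 1)] for i in range(n)]

def concat_maps(map_a, map_b):
    """Channel-wise concatenation: each cell becomes an (a, b) pair."""
    return list(zip(map_a, map_b))

feat_t0 = [1.0, 2.0, 3.0, 4.0]
feat_t1 = [0.0, 1.0, 2.0, 3.0]      # the scene shifted by one cell
flow    = [1, 1, 1, 1]              # estimated flow aligning t1 cells to t0

warped = deform(feat_t1, flow)      # now lines up with feat_t0 (except border)
fused  = concat_maps(feat_t0, warped)
```

Aligning the maps before concatenation is the design point: the downstream model sees corresponding features of the same object at the same cell, instead of having to learn the motion offset itself.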
-
Publication number: 20210049778
Abstract: Techniques are discussed for determining a velocity of an object in an environment from a sequence of images (e.g., two or more). A first image of the sequence is transformed to align the object with an image center. Additional images in the sequence are transformed by the same amount to form a sequence of transformed images. Such sequence is input into a machine learned model trained to output a scaled velocity of the object (a relative object velocity (ROV)) according to the transformed coordinate system. The ROV is then converted to the camera coordinate system by applying an inverse of the transformation. Using a depth associated with the object and the ROV of the object in the camera coordinate frame, an actual velocity of the object in the environment is determined relative to the camera.Type: Application
Filed: November 3, 2020
Publication date: February 18, 2021
Inventors: Vasiliy Karasev, Sarah Tariq
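The coordinate bookkeeping in this abstract can be sketched with a 2-D rotation standing in for the full camera geometry (a hedged illustration: the angle, ROV value, and depth are invented, and the real transform would involve camera intrinsics): the model's ROV, expressed in the centered frame, is mapped back by the inverse transform and scaled by depth.

```python
# Hypothetical sketch: inverse-transform a relative object velocity (ROV)
# and scale by depth to recover a metric velocity in the camera frame.

import math

def rotate(v, theta):
    c, s = math.cos(theta), math.sin(theta)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1])

def velocity_in_camera_frame(rov, theta, depth):
    """Undo the centering rotation, then scale by depth for metric units."""
    return tuple(depth * component for component in rotate(rov, -theta))

theta = math.pi / 2            # rotation that centered the object in the image
rov = (0.0, 0.1)               # model output in the transformed frame
v = velocity_in_camera_frame(rov, theta, depth=10.0)
```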
-
Patent number: 10922574
Abstract: Techniques are disclosed for implementing a neural network that outputs embeddings. Furthermore, techniques are disclosed for using sensor data to train a neural network to learn such embeddings. In some examples, the neural network may be trained to learn embeddings for instance segmentation of an object based on an embedding for a bounding box associated with the object being trained to match pixel embeddings for pixels associated with the object. The embeddings may be used for object identification, object matching, object classification, and/or object tracking in various examples.
Type: Grant
Filed: December 10, 2018
Date of Patent: February 16, 2021
Assignee: Zoox, Inc.
Inventor: Sarah Tariq
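The box-to-pixel matching idea can be illustrated at inference time (a sketch with made-up 2-D embeddings and a made-up radius; a trained network would supply the embeddings): pixels whose embeddings lie near the box's embedding are assigned to that box's instance, yielding a mask from embeddings alone.

```python
# Illustrative sketch: derive an instance mask by thresholding the distance
# between each pixel embedding and the bounding box's embedding.

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mask_from_embeddings(pixel_embeddings, box_embedding, radius=0.5):
    """1 where a pixel embedding falls within `radius` of the box embedding."""
    return [1 if sq_dist(p, box_embedding) < radius ** 2 else 0
            for p in pixel_embeddings]

box_emb = (1.0, 0.0)
pixels = [(0.9, 0.1), (1.1, -0.1), (0.0, 1.0)]  # two object pixels, one background
mask = mask_from_embeddings(pixels, box_emb)
```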
-
Publication number: 20200410281
Abstract: Using detection of low variance regions for improving detection is described. In an example, sensor data can be received from a sensor associated with a vehicle. The sensor data can represent an environment. An indication of a low variance region associated with the sensor data can be determined and an indication of a high variance region associated with the sensor data can be determined based at least in part on the indication of the low variance region. The vehicle can be controlled based on at least one of the sensor data or the indication of the high variance region.
Type: Application
Filed: December 6, 2019
Publication date: December 31, 2020
Inventors: Kratarth Goel, James William Vaisey Philbin, Sarah Tariq
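The region split can be sketched on toy per-cell sensor readings (the threshold and data are hypothetical): cells whose readings vary little are flagged low-variance, and the high-variance region is then derived as the complement, mirroring the abstract's "determined based at least in part on the indication of the low variance region".

```python
# Illustrative sketch: partition sensor cells into low- and high-variance regions.

def variance(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

def split_regions(cells, threshold=0.1):
    """cells: list of per-cell reading lists. Return (low_idx, high_idx)."""
    low = [i for i, c in enumerate(cells) if variance(c) <= threshold]
    high = [i for i in range(len(cells)) if i not in low]  # complement of low
    return low, high

cells = [[1.0, 1.0, 1.0],   # perfectly steady readings
         [0.0, 2.0, 4.0],   # widely varying readings
         [5.0, 5.1, 4.9]]   # nearly steady readings
low, high = split_regions(cells)
```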
-
Publication number: 20200410225
Abstract: Low variance detection training is described herein. In an example, annotated data can be determined based on sensor data received from a sensor associated with a vehicle. The annotated data can comprise an annotated low variance region and/or an annotated high variance region. The sensor data can be input into a model, and the model can determine an output comprising a high variance output and a low variance output. In an example, a difference between the annotated data and the output can be determined and one or more parameters associated with the model can be altered based at least in part on the difference. The model can be transmitted to a vehicle configured to be controlled by another output of the model.
Type: Application
Filed: December 6, 2019
Publication date: December 31, 2020
Inventors: Kratarth Goel, James William Vaisey Philbin, Sarah Tariq
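The update loop in this abstract ("a difference ... can be determined and one or more parameters ... can be altered") is ordinary gradient-style training; a toy sketch with a single scalar parameter standing in for the model (everything here is illustrative, not the patented method):

```python
# Toy sketch of the training step: the annotation/output difference drives a
# parameter update (one scalar "model" stands in for a real network).

def train_step(param, annotated, lr=0.25):
    """One gradient step on the squared error between output and annotation."""
    output = param                  # toy model: the output IS the parameter
    diff = output - annotated       # difference between output and annotation
    return param - lr * 2 * diff    # derivative of diff**2 w.r.t. param

p = 0.0
for _ in range(20):
    p = train_step(p, annotated=1.0)   # parameter converges toward annotation
```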
-
Patent number: 10832418
Abstract: Techniques are discussed for determining a velocity of an object in an environment from a sequence of images (e.g., two or more). A first image of the sequence is transformed to align the object with an image center. Additional images in the sequence are transformed by the same amount to form a sequence of transformed images. Such sequence is input into a machine learned model trained to output a scaled velocity of the object (a relative object velocity (ROV)) according to the transformed coordinate system. The ROV is then converted to the camera coordinate system by applying an inverse of the transformation. Using a depth associated with the object and the ROV of the object in the camera coordinate frame, an actual velocity of the object in the environment is determined relative to the camera.
Type: Grant
Filed: May 9, 2019
Date of Patent: November 10, 2020
Assignee: Zoox, Inc.
Inventors: Vasiliy Karasev, Sarah Tariq
-
Patent number: 10832062
Abstract: Techniques are disclosed for implementing a neural network that outputs embeddings. Furthermore, techniques are disclosed for using sensor data to train a neural network to learn such embeddings. In some examples, the neural network may be trained to learn embeddings. The embeddings may be used for object identification, object matching, object classification, and/or object tracking in various examples.
Type: Grant
Filed: September 28, 2018
Date of Patent: November 10, 2020
Assignee: Zoox, Inc.
Inventors: Bryce A. Evans, James William Vaisey Philbin, Sarah Tariq
-
Patent number: 10825188
Abstract: A machine-learning (ML) architecture may comprise a first ML model and/or an optical flow model that receive, as input, a first image and a second image. The first ML model may output a first feature map corresponding to the first image and a second feature map corresponding to the second image. The optical flow model may output an estimated optical flow. A deformation component may modify the second feature map, as a deformed feature map, based at least in part on the estimated optical flow. The deformed feature map and the first feature map may be concatenated together as a concatenated feature map, which may be provided to a second ML model. The second ML model may be trained to output an output ROI and/or a track in association with an object represented in the first image.
Type: Grant
Filed: March 8, 2019
Date of Patent: November 3, 2020
Assignee: Zoox, Inc.
Inventors: Qijun Tan, Sarah Tariq
-
Patent number: 10817740
Abstract: Techniques for using instance segmentation with machine learning (ML) models are discussed herein. An image can be provided as input to a ML model, which can generate, as an output from the ML model, a feature map comprising a plurality of features. Each feature of the plurality of features can comprise a confidence score, classification information, and a region of interest (ROI) determined in accordance with a non-maximal suppression (NMS) technique. Individual ROIs that are similar can be associated together for segmentation purposes. That is, instead of requiring a second ML model and/or a second operation to segment the image (e.g., identify which pixels correspond with the detected object, for example, by outputting a mask or set of lines and/or curves), the techniques discussed herein substantially simultaneously detect an object (e.g., determine an ROI) and segment the image.
Type: Grant
Filed: June 20, 2018
Date of Patent: October 27, 2020
Assignee: Zoox, Inc.
Inventors: Sarah Tariq, James William Vaisey Philbin, Kratarth Goel
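The "ROIs that are similar can be associated together" step can be sketched with greedy IoU grouping (a hedged illustration: the similarity measure, threshold, and boxes are invented; the patent's actual association criterion may differ): each box joins the group of the strongest overlapping box already kept, so detection and a coarse grouping for segmentation fall out of one pass.

```python
# Illustrative sketch: greedily associate overlapping ROIs into groups.

def iou(a, b):
    x0, y0 = max(a[0], b[0]), max(a[1], b[1])
    x1, y1 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, x1 - x0) * max(0, y1 - y0)
    def area(r):
        return (r[2] - r[0]) * (r[3] - r[1])
    union = area(a) + area(b) - inter
    return inter / union if union else 0.0

def group_rois(features, thr=0.5):
    """features: (confidence, box) tuples. Each box joins the group led by the
    highest-confidence box it overlaps; otherwise it starts a new group."""
    groups = []  # each group: list of boxes, led by its strongest box
    for conf, box in sorted(features, reverse=True):
        for g in groups:
            if iou(g[0], box) > thr:
                g.append(box)
                break
        else:
            groups.append([box])
    return groups

feats = [(0.9, (0, 0, 10, 10)),   # strong detection
         (0.8, (1, 1, 11, 11)),   # overlaps the first: same object
         (0.7, (50, 50, 60, 60))] # elsewhere: a second object
groups = group_rois(feats)
```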
-
Publication number: 20200272148
Abstract: Techniques for determining and/or predicting a trajectory of an object by using the appearance of the object, as captured in an image, are discussed herein. Image data, sensor data, and/or a predicted trajectory of the object (e.g., a pedestrian, animal, and the like) may be used to train a machine learning model that can subsequently be provided to, and used by, an autonomous vehicle for operation and navigation. In some implementations, predicted trajectories may be compared to actual trajectories and such comparisons are used as training data for machine learning.
Type: Application
Filed: February 21, 2019
Publication date: August 27, 2020
Inventors: Vasiliy Karasev, Tencia Lee, James William Vaisey Philbin, Sarah Tariq, Kai Zhenyu Wang
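The comparison of predicted and actual trajectories mentioned at the end of the abstract is commonly scored as a mean displacement error; a minimal sketch (trajectories are invented 2-D waypoints):

```python
# Illustrative sketch: score a predicted trajectory against the trajectory
# the agent actually followed, point by point.

import math

def mean_displacement_error(predicted, actual):
    """Average Euclidean distance between corresponding trajectory points."""
    dists = [math.dist(p, a) for p, a in zip(predicted, actual)]
    return sum(dists) / len(dists)

predicted = [(0, 0), (1, 0), (2, 0)]   # model expected straight-line motion
actual    = [(0, 0), (1, 1), (2, 2)]   # the pedestrian actually turned
err = mean_displacement_error(predicted, actual)
```

An error like this, accumulated over many observed objects, is the kind of comparison the abstract describes feeding back as training data.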
-
Publication number: 20200210721
Abstract: The techniques discussed herein may comprise refining a classification of an object detected as being represented in sensor data. For example, refining the classification may comprise determining a sub-classification of the object.
Type: Application
Filed: January 2, 2019
Publication date: July 2, 2020
Inventors: Kratarth Goel, Sarah Tariq
-
Publication number: 20190392268
Abstract: Improved techniques for training a machine learning (ML) model are discussed herein. Training the ML model can be based on a subset of examples. In particular, the training can include identifying a reference region associated with an area of the image representing an object, and selecting, based at least in part on a first confidence score associated with a first bounding box, a first hard example for inclusion in the subset of examples. In some cases, the first confidence score and the first bounding box can be associated with a first portion of the feature map. Next, the training can include determining that a first degree of alignment of the first bounding box to the reference region is above a threshold degree of alignment, and in response, replacing the first hard example with a second hard example.
Type: Application
Filed: June 20, 2018
Publication date: December 26, 2019
Inventors: Sarah Tariq, James William Vaisey Philbin, Kratarth Goel
-
Publication number: 20190391578
Abstract: Techniques for utilizing multiple scales of images as input to machine learning (ML) models are discussed herein. Operations can include providing an image associated with a first scale to a first ML model. An output of the first ML model can include a first bounding box indicative of a first region of the image representing a first object, with the first bounding box falling within a first range of sizes. Next, a scaled image can be generated by scaling the image. The scaled image can be provided to a second ML model, which can output a second bounding box indicative of a second region of the image representing a second object, the second bounding box falling within a second range of sizes. Thus, inputting a scaled image to a same ML model (or to different ML models) can result in different detected features in the images.
Type: Application
Filed: June 20, 2018
Publication date: December 26, 2019
Inventors: Sarah Tariq, James William Vaisey Philbin, Kratarth Goel
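The effect of feeding the same image at multiple scales can be shown with a toy binary image and a stand-in "detector" (everything below is illustrative; the real models, scale factors, and size ranges are not specified here): the same object yields a differently sized box at each scale, which is why each scale's output can be restricted to its own range of box sizes.

```python
# Illustrative sketch: the same object detected at two input scales.

def downscale(image, factor):
    """Naive subsampling: keep every `factor`-th row and column."""
    return [row[::factor] for row in image[::factor]]

def detect(image):
    """Stand-in detector: one box spanning the nonzero region, if any."""
    pts = [(x, y) for y, row in enumerate(image)
           for x, v in enumerate(row) if v]
    if not pts:
        return None
    xs, ys = zip(*pts)
    return (min(xs), min(ys), max(xs) + 1, max(ys) + 1)

image = [[0] * 8 for _ in range(8)]
for y in range(4, 8):
    for x in range(4, 8):
        image[y][x] = 1                # a 4x4 object in an 8x8 image

box_full = detect(image)               # object is 4x4 at full scale
box_half = detect(downscale(image, 2)) # the same object is 2x2 at half scale
```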
-
Publication number: 20190392242
Abstract: Techniques for using instance segmentation with machine learning (ML) models are discussed herein. An image can be provided as input to a ML model, which can generate, as an output from the ML model, a feature map comprising a plurality of features. Each feature of the plurality of features can comprise a confidence score, classification information, and a region of interest (ROI) determined in accordance with a non-maximal suppression (NMS) technique. Individual ROIs that are similar can be associated together for segmentation purposes. That is, instead of requiring a second ML model and/or a second operation to segment the image (e.g., identify which pixels correspond with the detected object, for example, by outputting a mask or set of lines and/or curves), the techniques discussed herein substantially simultaneously detect an object (e.g., determine an ROI) and segment the image.
Type: Application
Filed: June 20, 2018
Publication date: December 26, 2019
Inventors: Sarah Tariq, James William Vaisey Philbin, Kratarth Goel