Patents by Inventor Ehsan ADELI-MOSABBEB
Ehsan ADELI-MOSABBEB has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12254691
Abstract: Systems and methods for multi-view cooperative contrastive self-supervised learning may include receiving a plurality of video sequences, the video sequences comprising a plurality of image frames; applying selected images of a first and second video sequence of the plurality of video sequences to a plurality of different encoders to derive a plurality of embeddings for different views of the selected images of the first and second video sequences; determining distances of the derived plurality of embeddings for the selected images of the first and second video sequences; detecting inconsistencies in the determined distances; and predicting semantics of a future image based on the determined distances.
Type: Grant
Filed: December 3, 2020
Date of Patent: March 18, 2025
Assignees: TOYOTA RESEARCH INSTITUTE, INC., THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventors: Nishant Rai, Ehsan Adeli Mosabbeb, Kuan-Hui Lee, Adrien Gaidon, Juan Carlos Niebles
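A minimal sketch of the cross-view contrastive idea in this abstract, assuming two hypothetical view encoders (RGB and optical flow), toy tensor shapes, and an InfoNCE-style temperature; it is an illustration, not the patented implementation.

# Sketch only: contrastive alignment of embeddings from two assumed view encoders.
import torch
import torch.nn.functional as F

rgb_encoder = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 64 * 64, 128))
flow_encoder = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(2 * 64 * 64, 128))

rgb_frames = torch.randn(8, 3, 64, 64)   # batch of selected RGB frames (toy data)
flow_frames = torch.randn(8, 2, 64, 64)  # corresponding optical-flow frames (toy data)

z_rgb = F.normalize(rgb_encoder(rgb_frames), dim=1)
z_flow = F.normalize(flow_encoder(flow_frames), dim=1)

# Pairwise cosine distances between the two sets of view embeddings.
distances = 1.0 - z_rgb @ z_flow.t()     # (8, 8); the diagonal holds matching clips

# InfoNCE-style objective: matching views should be closer than mismatched ones.
logits = -distances / 0.07               # temperature value is an assumption
labels = torch.arange(len(logits))
loss = F.cross_entropy(logits, labels)
print(loss.item())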
-
Patent number: 11918370
Abstract: Many embodiments of the invention include systems and methods for evaluating motion from a video. The method includes identifying a target individual in a set of one or more frames in a video, analyzing the set of frames to determine a set of pose parameters, generating a 3D body mesh based on the pose parameters, identifying joint positions for the target individual in the set of frames based on the generated 3D body mesh, predicting a motion evaluation score based on the identified joint positions, and providing an output based on the motion evaluation score.
Type: Grant
Filed: May 19, 2021
Date of Patent: March 5, 2024
Assignee: The Board of Trustees of the Leland Stanford Junior University
Inventors: Ehsan Adeli-Mosabbeb, Mandy Lu, Kathleen Poston, Juan Carlos Niebles
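A minimal sketch of the scoring stage of this pipeline, assuming per-frame 3D joint positions have already been recovered from the fitted body mesh; the joint count, sequence length, and GRU-based regressor are illustrative assumptions rather than the patented method.

# Sketch only: regress a motion-evaluation score from a sequence of 3D joints.
import torch
import torch.nn as nn

num_joints, seq_len = 24, 60
joints = torch.randn(1, seq_len, num_joints * 3)  # (batch, frames, flattened xyz), toy data

class MotionScorer(nn.Module):
    def __init__(self, in_dim, hidden=128):
        super().__init__()
        self.temporal = nn.GRU(in_dim, hidden, batch_first=True)
        self.head = nn.Linear(hidden, 1)          # scalar motion-evaluation score

    def forward(self, x):
        _, h = self.temporal(x)                   # summarize the joint trajectory
        return self.head(h[-1])

score = MotionScorer(num_joints * 3)(joints)
print(score.item())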
-
Publication number: 20240021018
Abstract: Systems and methods of capturing privacy protected images and performing machine vision tasks are described. An embodiment includes a system that includes an optical component and an image processing application configured to capture distorted video using the optical component, where the optical component includes a set of optimal camera lens parameters θ*o learned using machine learning, perform a machine vision task on the distorted video, where the machine vision task includes a set of optimal action recognition parameters θ*c learned using the machine learning, and generate a classification based on the machine vision task, where the machine learning is jointly trained to optimize the optical element and the machine vision task.
Type: Application
Filed: July 13, 2023
Publication date: January 18, 2024
Applicants: The Board of Trustees of the Leland Stanford Junior University, Universidad Industrial de Santander
Inventors: Juan Carlos Niebles, Carlos Hinojosa, Henry Arguello, Miguel Marquez, Ehsan Adeli-Mosabbeb, Fei-Fei Li
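A minimal sketch of the joint optics-plus-vision training idea above, in which a learnable blur kernel stands in for the camera lens parameters and a toy classifier stands in for the action-recognition network; both stand-ins and the loss are assumptions for illustration only, not the published optics model.

# Sketch only: one loss updates a stand-in "lens" and a stand-in classifier together.
import torch
import torch.nn as nn
import torch.nn.functional as F

class LearnableLens(nn.Module):           # hypothetical stand-in for the optical element
    def __init__(self, k=9):
        super().__init__()
        self.kernel = nn.Parameter(torch.rand(1, 1, k, k))
    def forward(self, x):
        k = F.softmax(self.kernel.flatten(), dim=0).view_as(self.kernel)
        k = k.expand(x.shape[1], 1, -1, -1)
        return F.conv2d(x, k, padding=k.shape[-1] // 2, groups=x.shape[1])

lens = LearnableLens()
classifier = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
optimizer = torch.optim.Adam(list(lens.parameters()) + list(classifier.parameters()), lr=1e-3)

frames = torch.randn(4, 3, 32, 32)        # toy video frames
labels = torch.randint(0, 10, (4,))
logits = classifier(lens(frames))         # distorted capture feeds the vision task
loss = F.cross_entropy(logits, labels)    # gradients reach optics and classifier jointly
loss.backward()
optimizer.step()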
-
Patent number: 11604936
Abstract: A method for scene perception using video captioning based on a spatio-temporal graph model is described. The method includes decomposing the spatio-temporal graph model of a scene in input video into a spatial graph and a temporal graph. The method also includes modeling a two-branch framework having an object branch and a scene branch according to the spatial graph and the temporal graph to learn object interactions between the object branch and the scene branch. The method further includes transferring the learned object interactions from the object branch to the scene branch as privileged information. The method also includes captioning the scene by aligning language logits from the object branch and the scene branch according to the learned object interactions.
Type: Grant
Filed: March 23, 2020
Date of Patent: March 14, 2023
Assignees: TOYOTA RESEARCH INSTITUTE, INC., THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventors: Boxiao Pan, Haoye Cai, De-An Huang, Kuan-Hui Lee, Adrien David Gaidon, Ehsan Adeli-Mosabbeb, Juan Carlos Niebles Duque
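A minimal sketch of the logit-alignment step mentioned above, treating the object branch as a teacher whose caption logits guide the scene branch; the vocabulary size, tensor shapes, and KL-divergence formulation are assumptions, not the patent's exact loss.

# Sketch only: align scene-branch caption logits to privileged object-branch logits.
import torch
import torch.nn.functional as F

vocab = 1000
object_logits = torch.randn(4, 12, vocab)                      # (batch, caption length, vocab), teacher
scene_logits = torch.randn(4, 12, vocab, requires_grad=True)   # student branch

align_loss = F.kl_div(
    F.log_softmax(scene_logits, dim=-1),
    F.softmax(object_logits.detach(), dim=-1),  # privileged branch is not updated here
    reduction="batchmean",
)
align_loss.backward()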
-
Publication number: 20220180101
Abstract: Systems and methods for multi-view cooperative contrastive self-supervised learning may include receiving a plurality of video sequences, the video sequences comprising a plurality of image frames; applying selected images of a first and second video sequence of the plurality of video sequences to a plurality of different encoders to derive a plurality of embeddings for different views of the selected images of the first and second video sequences; determining distances of the derived plurality of embeddings for the selected images of the first and second video sequences; detecting inconsistencies in the determined distances; and predicting semantics of a future image based on the determined distances.
Type: Application
Filed: December 3, 2020
Publication date: June 9, 2022
Inventors: Nishant Rai, Ehsan Adeli Mosabbeb, Kuan-Hui Lee, Adrien Gaidon, Juan Carlos Niebles
-
Patent number: 11205082
Abstract: A system and method for predicting pedestrian intent is provided. A prediction circuit comprising a plurality of gated recurrent units (GRUs) receives a sequence of images captured by a camera. The prediction circuit parses each frame of the sequence of images to identify one or more pedestrians and one or more objects. Using the parsed data, the prediction circuit generates a pedestrian-centric spatiotemporal graph, the parsed data comprising the one or more identified pedestrians and the one or more identified objects. The prediction circuit uses the pedestrian-centric graph to determine a probability of one or more pedestrians crossing a street for each frame of the sequence of images.
Type: Grant
Filed: October 8, 2019
Date of Patent: December 21, 2021
Assignees: TOYOTA RESEARCH INSTITUTE, INC., The Board of Trustees of the Leland Stanford Junior University
Inventors: Ehsan Adeli-Mosabbeb, Kuan Lee, Adrien Gaidon, Bingbin Liu, Zhangjie Cao, Juan Carlos Niebles
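A minimal sketch of the per-frame crossing prediction described above, assuming pedestrian and object features have already been extracted from the parsed frames; the feature sizes, the crude mean-pooling used as graph aggregation, and the single GRU are illustrative assumptions.

# Sketch only: GRU over aggregated pedestrian/object features yields a per-frame crossing probability.
import torch
import torch.nn as nn

frames, ped_dim, obj_dim = 30, 64, 64
ped_feats = torch.randn(1, frames, ped_dim)     # pedestrian node features (toy data)
obj_feats = torch.randn(1, frames, 5, obj_dim)  # 5 surrounding object nodes (toy data)

# Crude graph aggregation: average the object nodes and concatenate with the pedestrian node.
graph_feats = torch.cat([ped_feats, obj_feats.mean(dim=2)], dim=-1)

gru = nn.GRU(ped_dim + obj_dim, 128, batch_first=True)
head = nn.Linear(128, 1)

hidden, _ = gru(graph_feats)                    # temporal edges of the spatiotemporal graph
crossing_prob = torch.sigmoid(head(hidden))     # per-frame probability of crossing
print(crossing_prob.shape)                      # (1, 30, 1)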
-
Publication number: 20210386359
Abstract: Many embodiments of the invention include systems and methods for evaluating motion from a video. The method includes identifying a target individual in a set of one or more frames in a video, analyzing the set of frames to determine a set of pose parameters, generating a 3D body mesh based on the pose parameters, identifying joint positions for the target individual in the set of frames based on the generated 3D body mesh, predicting a motion evaluation score based on the identified joint positions, and providing an output based on the motion evaluation score.
Type: Application
Filed: May 19, 2021
Publication date: December 16, 2021
Applicant: The Board of Trustees of the Leland Stanford Junior University
Inventors: Ehsan Adeli-Mosabbeb, Mandy Lu, Kathleen Poston, Juan Carlos Niebles
-
Publication number: 20210295093
Abstract: A method for scene perception using video captioning based on a spatio-temporal graph model is described. The method includes decomposing the spatio-temporal graph model of a scene in input video into a spatial graph and a temporal graph. The method also includes modeling a two-branch framework having an object branch and a scene branch according to the spatial graph and the temporal graph to learn object interactions between the object branch and the scene branch. The method further includes transferring the learned object interactions from the object branch to the scene branch as privileged information. The method also includes captioning the scene by aligning language logits from the object branch and the scene branch according to the learned object interactions.
Type: Application
Filed: March 23, 2020
Publication date: September 23, 2021
Applicants: TOYOTA RESEARCH INSTITUTE, INC., THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventors: Boxiao PAN, Haoye CAI, De-An HUANG, Kuan-Hui LEE, Adrien David GAIDON, Ehsan ADELI-MOSABBEB, Juan Carlos NIEBLES DUQUE
-
Patent number: 11074438
Abstract: A method for predicting spatial positions of several key points on a human body in the near future in an egocentric setting is described. The method includes generating a frame-level supervision for human poses. The method also includes suppressing noise and filling missing joints of the human body using a pose completion module. The method further includes splitting the poses into a global stream and a local stream. Furthermore, the method includes combining the global stream and the local stream to forecast future human locomotion.
Type: Grant
Filed: October 1, 2019
Date of Patent: July 27, 2021
Assignees: TOYOTA RESEARCH INSTITUTE, INC., THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventors: Karttikeya Mangalam, Ehsan Adeli-Mosabbeb, Kuan-Hui Lee, Adrien Gaidon, Juan Carlos Niebles Duque
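A minimal sketch of the global/local split described above, assuming completed 2D poses are available: the root-joint trajectory forms the global stream, root-relative joint offsets form the local stream, and placeholder GRUs forecast each stream before the two are recombined; network sizes and horizons are assumptions.

# Sketch only: forecast a global root stream and a local offset stream, then recombine.
import torch
import torch.nn as nn

num_joints, past, future = 17, 30, 15
poses = torch.randn(1, past, num_joints, 2)  # completed past poses (x, y), toy data

root = poses[:, :, :1, :]                    # global stream: root trajectory
local = poses - root                         # local stream: root-relative offsets

def forecast(stream, dim):
    gru = nn.GRU(dim, 64, batch_first=True)
    head = nn.Linear(64, dim * future)
    _, h = gru(stream.flatten(2))
    return head(h[-1]).view(1, future, -1, 2)

future_root = forecast(root, 2)              # where the person will be
future_local = forecast(local, num_joints * 2)  # how the body will move
future_poses = future_local + future_root    # recombine the two streams
print(future_poses.shape)                    # (1, 15, 17, 2)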
-
Publication number: 20210103742
Abstract: A system and method for predicting pedestrian intent is provided. A prediction circuit comprising a plurality of gated recurrent units (GRUs) receives a sequence of images captured by a camera. The prediction circuit parses each frame of the sequence of images to identify one or more pedestrians and one or more objects. Using the parsed data, the prediction circuit generates a pedestrian-centric spatiotemporal graph, the parsed data comprising the one or more identified pedestrians and the one or more identified objects. The prediction circuit uses the pedestrian-centric graph to determine a probability of one or more pedestrians crossing a street for each frame of the sequence of images.
Type: Application
Filed: October 8, 2019
Publication date: April 8, 2021
Inventors: Ehsan Adeli-Mosabbeb, Kuan Lee, Adrien Gaidon, Bingbin Liu, Zhangjie Cao, Juan Carlos Niebles
-
Publication number: 20210097266
Abstract: A method for predicting spatial positions of several key points on a human body in the near future in an egocentric setting is described. The method includes generating a frame-level supervision for human poses. The method also includes suppressing noise and filling missing joints of the human body using a pose completion module. The method further includes splitting the poses into a global stream and a local stream. Furthermore, the method includes combining the global stream and the local stream to forecast future human locomotion.
Type: Application
Filed: October 1, 2019
Publication date: April 1, 2021
Applicants: TOYOTA RESEARCH INSTITUTE, INC., THE BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY
Inventors: Karttikeya MANGALAM, Ehsan ADELI-MOSABBEB, Kuan-Hui LEE, Adrien GAIDON, Juan Carlos NIEBLES DUQUE