Patents by Inventor Phuc Xuan Nguyen

Phuc Xuan Nguyen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11881022
    Abstract: Systems and methods for a weakly supervised action localization model are provided. Example models according to example aspects of the present disclosure can localize and/or classify actions in untrimmed videos using machine-learned models, such as convolutional neural networks. The example models can predict the temporal intervals of human actions given only video-level class labels, with no temporal localization annotations required. The example models can recognize actions and identify a sparse set of keyframes associated with those actions through adaptive temporal pooling of video frames, wherein the loss function of the model combines a classification error with a sparsity penalty on frame selection. Following action recognition with sparse keyframe attention, temporal action proposals can be extracted using temporal class activation mappings, and final time intervals corresponding to the target actions can be estimated. (A minimal code sketch of this pooling-and-loss scheme follows this entry.)
    Type: Grant
    Filed: March 10, 2023
    Date of Patent: January 23, 2024
    Assignee: GOOGLE LLC
    Inventors: Ting Liu, Gautam Prasad, Phuc Xuan Nguyen, Bohyung Han
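
The abstract above outlines a concrete training and inference recipe: attention-weighted temporal pooling of per-segment video features, a loss combining classification error with a sparsity term on frame selection, and interval extraction via temporal class activation mappings. Below is a minimal sketch of that recipe, assuming per-segment features from a pretrained CNN and using PyTorch; the module names, layer sizes, loss weight, and threshold are illustrative assumptions, not details taken from the patent.

```python
# Minimal sketch of the weakly supervised localization scheme described in the
# abstract. PyTorch is an assumption (the patent names no framework), and all
# names and hyperparameters here are illustrative placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F


class SparseTemporalPooling(nn.Module):
    """Attention-weighted (adaptive) temporal pooling over segment features."""

    def __init__(self, feat_dim: int, num_classes: int):
        super().__init__()
        # Per-segment attention: how strongly each frame/segment contributes.
        self.attention = nn.Sequential(
            nn.Linear(feat_dim, 256), nn.ReLU(), nn.Linear(256, 1)
        )
        # Video-level classifier applied to the pooled feature.
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, feats: torch.Tensor):
        # feats: (T, feat_dim) -- one feature vector per temporal segment.
        attn = torch.sigmoid(self.attention(feats))           # (T, 1), in [0, 1]
        pooled = (attn * feats).sum(0) / (attn.sum() + 1e-6)  # adaptive pooling
        logits = self.classifier(pooled)                      # video-level scores
        return logits, attn


def loss_fn(logits, attn, labels, sparsity_weight=1e-4):
    # Classification error on video-level labels (multi-label form assumed)
    # plus an L1 penalty that drives the attention toward a sparse set of
    # keyframes, matching the "classification error + sparsity of frame
    # selection" composition in the abstract.
    cls_loss = F.binary_cross_entropy_with_logits(logits, labels)
    sparsity_loss = attn.abs().mean()
    return cls_loss + sparsity_weight * sparsity_loss


def temporal_proposals(feats, model, class_idx, threshold=0.5):
    # Temporal class activation mapping: per-segment class scores modulated
    # by the learned attention, then thresholded into candidate intervals.
    logits_t = model.classifier(feats)                  # (T, num_classes)
    _, attn = model(feats)
    tcam = attn.squeeze(-1) * torch.sigmoid(logits_t[:, class_idx])
    active = (tcam > threshold).tolist()
    intervals, start = [], None
    for t, on in enumerate(active):
        if on and start is None:
            start = t
        elif not on and start is not None:
            intervals.append((start, t))                # [start, end) segments
            start = None
    if start is not None:
        intervals.append((start, len(active)))
    return intervals


# Usage sketch: 100 segments of 1024-d features, 20 action classes.
feats = torch.randn(100, 1024)
model = SparseTemporalPooling(1024, 20)
labels = torch.zeros(20); labels[3] = 1.0              # video-level label only
logits, attn = model(feats)
loss = loss_fn(logits, attn, labels)
loss.backward()
print(temporal_proposals(feats, model, class_idx=3))
```

Note that training needs only video-level class labels, which is the point of the weak supervision: the sparsity penalty, rather than frame-level annotation, selects the keyframes. The threshold and the sparsity weight are tuning choices in this sketch, not values specified in the patent.
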
  • Publication number: 20230215169
    Abstract: Systems and methods for a weakly supervised action localization model are provided. Example models according to example aspects of the present disclosure can localize and/or classify actions in untrimmed videos using machine-learned models, such as convolutional neural networks. The example models can predict the temporal intervals of human actions given only video-level class labels, with no temporal localization annotations required. The example models can recognize actions and identify a sparse set of keyframes associated with those actions through adaptive temporal pooling of video frames, wherein the loss function of the model combines a classification error with a sparsity penalty on frame selection. Following action recognition with sparse keyframe attention, temporal action proposals can be extracted using temporal class activation mappings, and final time intervals corresponding to the target actions can be estimated.
    Type: Application
    Filed: March 10, 2023
    Publication date: July 6, 2023
    Inventors: Ting Liu, Gautam Prasad, Phuc Xuan Nguyen, Bohyung Han
  • Patent number: 11640710
    Abstract: Systems and methods for a weakly supervised action localization model are provided. Example models according to example aspects of the present disclosure can localize and/or classify actions in untrimmed videos using machine-learned models, such as convolutional neural networks. The example models can predict the temporal intervals of human actions given only video-level class labels, with no temporal localization annotations required. The example models can recognize actions and identify a sparse set of keyframes associated with those actions through adaptive temporal pooling of video frames, wherein the loss function of the model combines a classification error with a sparsity penalty on frame selection. Following action recognition with sparse keyframe attention, temporal action proposals can be extracted using temporal class activation mappings, and final time intervals corresponding to the target actions can be estimated.
    Type: Grant
    Filed: November 5, 2018
    Date of Patent: May 2, 2023
    Assignee: GOOGLE LLC
    Inventors: Ting Liu, Gautam Prasad, Phuc Xuan Nguyen, Bohyung Han
  • Publication number: 20200272823
    Abstract: Systems and methods for a weakly supervised action localization model are provided. Example models according to example aspects of the present disclosure can localize and/or classify actions in untrimmed videos using machine-learned models, such as convolutional neural networks. The example models can predict the temporal intervals of human actions given only video-level class labels, with no temporal localization annotations required. The example models can recognize actions and identify a sparse set of keyframes associated with those actions through adaptive temporal pooling of video frames, wherein the loss function of the model combines a classification error with a sparsity penalty on frame selection. Following action recognition with sparse keyframe attention, temporal action proposals can be extracted using temporal class activation mappings, and final time intervals corresponding to the target actions can be estimated.
    Type: Application
    Filed: November 5, 2018
    Publication date: August 27, 2020
    Inventors: Ting Liu, Gautam Prasad, Phuc Xuan Nguyen, Bohyung Han