Patents by Inventor Hexiang Hu

Hexiang Hu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Hierarchical Video Encoders

Publication number: 20240114158

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representati

Type: Application

Filed: December 5, 2023

Publication date: April 4, 2024

Inventors: Vihan Jain, Joonseok Lee, Ming Zhao, Sheide Chammas, Hexiang Hu, Bowen Zhang, Fei Sha, Tze Way Eugene Ie
Hierarchical video encoders

Patent number: 11876986

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representati

Type: Grant

Filed: November 29, 2022

Date of Patent: January 16, 2024

Assignee: GOOGLE LLC

Inventors: Vihan Jain, Joonseok Lee, Ming Zhao, Sheide Chammas, Hexiang Hu, Bowen Zhang, Fei Sha, Tze Way Eugene Ie
Hierarchical Video Encoders

Publication number: 20230103148

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representati

Type: Application

Filed: November 29, 2022

Publication date: March 30, 2023

Inventors: Vihan Jain, Joonseok Lee, Ming Zhao, Sheide Chammas, Hexiang Hu, Bowen Zhang, Fei Sha, Tze Way Eugene Ie
Compressed content object and action detection

Patent number: 11568545

Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.

Type: Grant

Filed: December 27, 2019

Date of Patent: January 31, 2023

Assignee: A9.com, Inc.

Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
Hierarchical video encoders

Patent number: 11533495

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representati

Type: Grant

Filed: January 29, 2021

Date of Patent: December 20, 2022

Assignee: GOOGLE LLC

Inventors: Vihan Jain, Joonseok Lee, Ming Zhao, Sheide Chammas, Hexiang Hu, Bowen Zhang, Fei Sha, Tze Way Eugene Ie
Hierarchical Video Encoders

Publication number: 20220256175

Abstract: A computer-implemented method for generating video representations utilizing a hierarchical video encoder includes obtaining a video, wherein the video includes a plurality of frames, processing each of the plurality of frames with a machine-learned frame-level encoder model to respectively generate a plurality of frame representations for the plurality of frames, the plurality of frame representations respective to the plurality of frames determining a plurality of segment representations representative of a plurality of video segments including one or more of the plurality of frames, the plurality of segment representations based at least in part on the plurality of frame representations, processing the plurality of segment representations with a machine-learned segment-level encoder model to generate a plurality of contextualized segment representations, determining a video representation based at least in part on the plurality of contextualized segment representations, and providing the video representati

Type: Application

Filed: January 29, 2021

Publication date: August 11, 2022

Inventors: Vihan Jain, Joonseok Lee, Ming Zhao, Sheide Chammas, Hexiang Hu, Bowen Zhang, Fei Sha, Tze Way Eugene Ie
COMPRESSED CONTENT OBJECT AND ACTION DETECTION

Publication number: 20210342924

Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.

Type: Application

Filed: December 27, 2019

Publication date: November 4, 2021

Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
COMPRESSED CONTENT OBJECT AND ACTION DETECTION

Publication number: 20200143457

Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.

Type: Application

Filed: December 27, 2019

Publication date: May 7, 2020

Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
Compressed content object and action detection

Patent number: 10528819

Abstract: Various embodiments of a framework which allow, as an alternative to resource-taxing decompression, efficient computation of feature maps using a compressed content data subset, such as video, by exploiting the motion information, such as a motion vector, present in the compressed video. This framework allows frame-specific object recognition and action detection algorithms to be applied to compressed video and other media files by executing only on I-frames in a Group of Pictures and linearly interpolating the results. Training and machine learning increases recognition accuracy. Yielding significant computational gains, this approach accelerates frame-wise feature extraction I-frame/P-frame/P-frame videos as well as I-frame/P-frame/B-frame videos. The present techniques may also be used for segmentation to identify and label respective regions for objects in a video.

Type: Grant

Filed: November 20, 2017

Date of Patent: January 7, 2020

Assignee: A9.COM, INC.

Inventors: R. Manmatha, Hexiang Hu, Deva Ramanan
Step light

Patent number: D1012342

Type: Grant

Filed: September 30, 2023

Date of Patent: January 23, 2024

Inventor: Hexiang Hu