Patents by Inventor Xiaohan Nie

Xiaohan Nie has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Searchability and discoverability of contextually relevant frames within digital content

Patent number: 12518560

Abstract: Systems, devices, and methods are provided for searchability and discoverability of contextually relevant frames within digital content. Digital content, such as videos, may be segmented to identify a plurality of shots. Discoverability may be performed by identifying key frames of the digital content and using a contrastive language-image pre-training (CLIP) model to determine contextual relevance of a frame or shot to textual information associated with the digital content. Searchability may be performed by receiving search parameters and applying various filters to digital content to identify frames or shots that satisfy a user's search query.

Type: Grant

Filed: September 14, 2021

Date of Patent: January 6, 2026

Assignee: Amazon Technologies, Inc.

Inventors: Honey Gupta, Prabhakar Gupta, Dongqing Zhang, Shixing Chen, Xiaohan Nie, Muhammad Raffay Hamid
Systems and methods for video-based sports field registration

Patent number: 12211222

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Grant

Filed: October 4, 2023

Date of Patent: January 28, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
DEPTH-GUIDED STRUCTURE-FROM-MOTION TECHNIQUES

Publication number: 20240346686

Abstract: Systems, devices, and methods are provided for depth-guided structure from motion. A system may obtain a plurality of image frames from a digital content item that corresponds to a scene and determine, based at least in part on a correspondence search, a set of 2-D keypoints for the plurality of image frames. A depth estimator may be used to determine a plurality of dense depth map for the plurality of image frames. The set of 2-D keypoints and the plurality of dense depth maps may be used to determine a corresponding set of depth priors. Initialization and/or depth-regularized optimization may be performed using the keypoints and depth priors.

Type: Application

Filed: June 20, 2024

Publication date: October 17, 2024

Applicant: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Michael Thomas Pecchia, Leo Chan, Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Sheng Liu
Client side augmented reality overlay

Patent number: 12101529

Abstract: Techniques are described for facilitating client-side augmented reality overlay of secondary content during live events. Regions for overlaying secondary content are identified along with attributes for each region. A client device may then used the attributes to overlay secondary content in each region prior to playback.

Type: Grant

Filed: September 17, 2021

Date of Patent: September 24, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Yongjun Wu, Zaixi Shang, Sriram Sethuraman, Hai Wei, Xiaohan Nie
Contrastive learning of scene representation guided by video similarities

Patent number: 12067779

Abstract: A plurality of similar video pairs may be determined based on one or more similarity information types. Each video pair of the plurality of similar video pairs may include a first respective video and a second respective video. For each video pair, one or more similar scene pairs may be determined. Each of the one or more similar scene pairs may include a respective first scene from the first respective video and a second respective scene from the second respective video. An encoder may be trained using a contrastive learning model that contrasts a plurality of similar scene pairs with a plurality of random scenes. The plurality of similar scene pairs may include the one or more scene pairs for each video pair. One or more scene features of one or more other scenes of one or more other videos may be determined using the encoder.

Type: Grant

Filed: February 9, 2022

Date of Patent: August 20, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Shixing Chen, Xiang Hao, Xiaohan Nie, Muhammad Raffay Hamid
Depth-guided structure-from-motion techniques

Patent number: 12046002

Abstract: Systems, devices, and methods are provided for depth guided structure from motion. A system may obtain a plurality of image frames from a digital content item that corresponds to a scene and determine, based at least in part on a correspondence search, a set of 2-D keypoints for the plurality of image frames. A depth estimator may be used to determine a plurality of dense depth map for the plurality of image frames. The set of 2-D keypoints and the plurality of dense depth maps may be used to determine a corresponding set of depth priors. Initialization and/or depth-regularized optimization may be performed using the keypoints and depth priors.

Type: Grant

Filed: March 1, 2022

Date of Patent: July 23, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Michael Thomas Pecchia, Leo Chan, Ahmed Aly Saad Ahmed, Muhammad Raffay Hamid, Sheng Liu
Computer-implemented methods of an automated framework for virtual product placement in video frames

Patent number: 12041278

Abstract: Techniques for a computer-implemented service for virtual product placement in video frames are described. According to some embodiments, a computer-implemented method includes receiving, at a virtual product placement service, a request to place a two-dimensional image of a virtual product into a video, identifying, by a machine learning model of the virtual product placement service, a surface depicted in the video for insertion of the two-dimensional image of the virtual product, inserting, by the virtual product placement service, of the two-dimensional image of the virtual product into one or more frames of the video onto the surface to generate a video including the virtual product, and transmitting the video including the virtual product to a viewer device or a storage location.

Type: Grant

Filed: June 29, 2022

Date of Patent: July 16, 2024

Assignee: Amazon Technologies, Inc.

Inventors: V Divya Bhargavi, Karan Sindwani, Siavash Gholami, Xiaohan Nie, Ahmed Aly Saad Ahmed, David Kuo, Yash Chaturvedi, Vidya Sagar Ravipati
Systems and methods of obstacle detection for automated delivery apparatus

Patent number: 11967161

Abstract: The present disclosure generally relates to a system of a delivery device for combining sensor data from various types of sensors to generate a map that enables the delivery device to navigate from a first location to a second location to deliver an item to the second location. The system obtains data from RGB, LIDAR, and depth sensors and combines this sensor data according to various algorithms to detect objects in an environment of the delivery device, generate point cloud and pose information associated with the detected objects, and generates object boundary data for the detected objects. The system further identifies object states for the detected object and generates the map for the environment based on the detected object, the generated object proposal data, the labeled point cloud data, and the object states. The generated map may be provided to other systems to navigate the delivery device.

Type: Grant

Filed: June 26, 2020

Date of Patent: April 23, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Hakan Boyraz, Baoyuan Liu, Xiaohan Nie, Sheng Chen
SYSTEMS AND METHODS FOR VIDEO-BASED SPORTS FIELD REGISTRATION

Publication number: 20240029278

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Application

Filed: October 4, 2023

Publication date: January 25, 2024

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
Systems and methods for video-based sports field registration

Patent number: 11816849

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Grant

Filed: September 30, 2022

Date of Patent: November 14, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
Shot contras five self-supervised learning of a plurality of machine learning models for video analysis applications

Patent number: 11748988

Abstract: Techniques for automatic scene change detection in a video are described. As one example, a computer-implemented method includes extracting features of a query shot and its neighboring shots of a first set of shots without labels with a query model, determining a key shot of the neighboring shots which is most similar to the query shot based at least in part on the features of the query shot and its neighboring shots, extracting features of the key shot with a key model, training the query model into a trained query model based at least in part on a comparison of the features of the query shot and the features of the key shot, extracting features of a second set of shots with labels with the trained query model, and training a temporal model into a trained temporal model based at least in part on the features extracted from the second set of shots and the labels of the second set of shots.

Type: Grant

Filed: April 21, 2021

Date of Patent: September 5, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Shixing Chen, Xiaohan Nie, David Jiatian Fan, Dongqing Zhang, Vimal Bhat, Muhammad Raffay Hamid
SYSTEMS AND METHODS FOR VIDEO-BASED SPORTS FIELD REGISTRATION

Publication number: 20230023419

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Application

Filed: September 30, 2022

Publication date: January 26, 2023

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
Systems and methods for generating comic books from video and images

Patent number: 11532111

Abstract: Techniques for a comic book feature are described herein. A visual data stream of a video may be parsed into a plurality of frames. Scene boundaries may be determined to generate a scene using the plurality of frames where a scene includes a subset of frames. A key frame may be determined for the scene using the subset of frames. An audio portion of an audio data stream of the video may be identified that maps to the subset of frames based on time information. The key frame may be converted to a comic image based on an algorithm. First dimensions and placement for a data object may be determined for the comic image. The data object may include the audio portion for the comic image. A comic panel may be generated for the comic image that incorporates the data object using the determined first dimensions and the placement.

Type: Grant

Filed: June 10, 2021

Date of Patent: December 20, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Dongqing Zhang, Muhammad Raffay Hamid, Xiaohan Nie, Shixing Chen
Systems and methods for video-based sports field registration

Patent number: 11468578

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Grant

Filed: September 14, 2020

Date of Patent: October 11, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
SYSTEMS AND METHODS FOR VIDEO-BASED SPORTS FIELD REGISTRATION

Publication number: 20220084222

Abstract: Methods and systems are described for registering a sports field to a video. Video of a live event may feature participants at a venue. A template of the venue, including virtual markings that represent real markings on the venue, may be obtained. A homographic transformation between an image plane and a ground plane may be determined by matching virtual markings to corresponding real markings captured in at least one frame of the video. The determined homographic transformation may be used in the automated analysis of sports statistics and in improving inserted annotations and visualizations.

Type: Application

Filed: September 14, 2020

Publication date: March 17, 2022

Inventors: Xiaohan Nie, Muhammad Raffay Hamid
SYSTEMS AND METHODS OF OBSTACLE DETECTION FOR AUTOMATED DELIVERY APPARATUS

Publication number: 20210405638

Abstract: The present disclosure generally relates to a system of a delivery device for combining sensor data from various types of sensors to generate a map that enables the delivery device to navigate from a first location to a second location to deliver an item to the second location. The system obtains data from RGB, LIDAR, and depth sensors and combines this sensor data according to various algorithms to detect objects in an environment of the delivery device, generate point cloud and pose information associated with the detected objects, and generates object boundary data for the detected objects. The system further identifies object states for the detected object and generates the map for the environment based on the detected object, the generated object proposal data, the labeled point cloud data, and the object states. The generated map may be provided to other systems to navigate the delivery device.

Type: Application

Filed: June 26, 2020

Publication date: December 30, 2021

Inventors: Hakan Boyraz, Baoyuan Liu, Xiaohan Nie, Sheng Chen
Navigation directly from perception data without pre-mapping

Patent number: 11175664

Abstract: An autonomous delivery robot system to enable delivery of a product to a customer is described. One autonomous ground vehicle (AGV) includes a processing device that receives a delivery request comprising a route divided into multiple navigation segments and computes a navigable space from perception data stored in a perception map. The perception map is a robot-centered local map that stores the perception data indicative of the surroundings of the AGV. The processing device computes a cost inflation from the perception data stored in the perception map, determines a sub-goal that is on the navigable space and reachable by the AGV using the navigable space and the cost inflation, and determines a path to achieve the sub-goal using the using the navigable space and the cost inflation. The processing device controls one or more actuators to move along the path.

Type: Grant

Filed: January 15, 2019

Date of Patent: November 16, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Hakan Boyraz, Marshall Tappen, Baoyuan Liu, Xiaohan Nie, Sheng Chen, Christopher Brown