Patents by Inventor Bingxin HOU

Bingxin HOU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

OBJECT DETECTION USING DEEP LEARNING

Publication number: 20260154957

Abstract: In various examples, techniques for optimizing object detection models are described herein. Systems and methods are disclosed that process sensor data using a backbone of a machine learning model(s) in order to generate feature maps at different resolutions. The systems and methods then use the machine learning model(s) to generate a vector based at least in part on one or more of the feature maps. For example, if the backbone generates four feature maps, then the machine learning model(s) may generate the vector using two feature maps from the four feature maps. The systems and methods then process the vector using a transformer of the machine learning model(s) in order to generate data representing a class label(s) for an object(s) depicted by an image represented by the sensor data and/or a location(s) of the object(s) within the image.

Type: Application

Filed: January 23, 2026

Publication date: June 4, 2026

Applicant: NVIDIA Corporation

Inventors: Dahjung CHUNG, Farzin AGHDASI, Parthasarathy SRIRAM, Bingxin HOU
Object detection using deep learning for real-time streaming applications

Patent number: 12536783

Abstract: In various examples, techniques for optimizing object detection models are described herein. Systems and methods are disclosed that process sensor data using a backbone of a machine learning model(s) in order to generate feature maps at different resolutions. The systems and methods then use the machine learning model(s) to generate a vector based at least in part on one or more of the feature maps. For example, if the backbone generates four feature maps, then the machine learning model(s) may generate the vector using two feature maps from the four feature maps. The systems and methods then process the vector using a transformer of the machine learning model(s) in order to generate data representing a class label(s) for an object(s) depicted by an image represented by the sensor data and/or a location(s) of the object(s) within the image.

Type: Grant

Filed: August 25, 2022

Date of Patent: January 27, 2026

Assignee: NVIDIA Corporation

Inventors: Dahjung Chung, Farzin Aghdasi, Parthasarathy Sriram, Bingxin Hou
OBJECT DETECTION USING DEEP LEARNING FOR REAL-TIME STREAMING APPLICATIONS

Publication number: 20240071064

Abstract: In various examples, techniques for optimizing object detection models are described herein. Systems and methods are disclosed that process sensor data using a backbone of a machine learning model(s) in order to generate feature maps at different resolutions. The systems and methods then use the machine learning model(s) to generate a vector based at least in part on one or more of the feature maps. For example, if the backbone generates four feature maps, then the machine learning model(s) may generate the vector using two feature maps from the four feature maps. The systems and methods then process the vector using a transformer of the machine learning model(s) in order to generate data representing a class label(s) for an object(s) depicted by an image represented by the sensor data and/or a location(s) of the object(s) within the image.

Type: Application

Filed: August 25, 2022

Publication date: February 29, 2024

Inventors: Dahjung CHUNG, Farzin AGHDASI, Parthasarathy SRIRAM, Bingxin HOU
3D SEPARABLE DEEP CONVOLUTIONAL NEURAL NETWORK FOR MOVING OBJECT DETECTION

Publication number: 20220164630

Abstract: A method for detecting moving objects in video frames, an apparatus and a non-transitory computer-readable storage medium thereof are provided. The method includes that: an encoder in a 3-dimenional (3D) separable convolutional neural network with multi-input multi-output (3DS_MM) receives a first input including multiple video frames, where the encoder includes a plurality of encoder layers including 3D separable convolutional neural network (CNN) layers; the encoder generates a first encoder output; and a decoder in the 3DS_MM receives the first encoder output and generates a first output including multiple first binary masks related to the first input, where the decoder includes a plurality of decoder layers comprising 3D separable transposed CNN layers.

Type: Application

Filed: November 22, 2021

Publication date: May 26, 2022

Applicants: KWAI INC., SANTA CLARA UNIVERSITY

Inventors: Bingxin HOU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU

OBJECT DETECTION USING DEEP LEARNING

Object detection using deep learning for real-time streaming applications

OBJECT DETECTION USING DEEP LEARNING FOR REAL-TIME STREAMING APPLICATIONS

3D SEPARABLE DEEP CONVOLUTIONAL NEURAL NETWORK FOR MOVING OBJECT DETECTION