Patents by Inventor Bingxin HOU

Bingxin HOU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20260154957
    Abstract: In various examples, techniques for optimizing object detection models are described herein. Systems and methods are disclosed that process sensor data using a backbone of a machine learning model(s) in order to generate feature maps at different resolutions. The systems and methods then use the machine learning model(s) to generate a vector based at least in part on one or more of the feature maps. For example, if the backbone generates four feature maps, then the machine learning model(s) may generate the vector using two feature maps from the four feature maps. The systems and methods then process the vector using a transformer of the machine learning model(s) in order to generate data representing a class label(s) for an object(s) depicted by an image represented by the sensor data and/or a location(s) of the object(s) within the image.
    Type: Application
    Filed: January 23, 2026
    Publication date: June 4, 2026
    Applicant: NVIDIA Corporation
    Inventors: Dahjung CHUNG, Farzin AGHDASI, Parthasarathy SRIRAM, Bingxin HOU
  • Patent number: 12536783
    Abstract: In various examples, techniques for optimizing object detection models are described herein. Systems and methods are disclosed that process sensor data using a backbone of a machine learning model(s) in order to generate feature maps at different resolutions. The systems and methods then use the machine learning model(s) to generate a vector based at least in part on one or more of the feature maps. For example, if the backbone generates four feature maps, then the machine learning model(s) may generate the vector using two feature maps from the four feature maps. The systems and methods then process the vector using a transformer of the machine learning model(s) in order to generate data representing a class label(s) for an object(s) depicted by an image represented by the sensor data and/or a location(s) of the object(s) within the image.
    Type: Grant
    Filed: August 25, 2022
    Date of Patent: January 27, 2026
    Assignee: NVIDIA Corporation
    Inventors: Dahjung Chung, Farzin Aghdasi, Parthasarathy Sriram, Bingxin Hou
  • Publication number: 20240071064
    Abstract: In various examples, techniques for optimizing object detection models are described herein. Systems and methods are disclosed that process sensor data using a backbone of a machine learning model(s) in order to generate feature maps at different resolutions. The systems and methods then use the machine learning model(s) to generate a vector based at least in part on one or more of the feature maps. For example, if the backbone generates four feature maps, then the machine learning model(s) may generate the vector using two feature maps from the four feature maps. The systems and methods then process the vector using a transformer of the machine learning model(s) in order to generate data representing a class label(s) for an object(s) depicted by an image represented by the sensor data and/or a location(s) of the object(s) within the image.
    Type: Application
    Filed: August 25, 2022
    Publication date: February 29, 2024
    Inventors: Dahjung CHUNG, Farzin AGHDASI, Parthasarathy SRIRAM, Bingxin HOU
  • Publication number: 20220164630
    Abstract: A method for detecting moving objects in video frames, an apparatus and a non-transitory computer-readable storage medium thereof are provided. The method includes that: an encoder in a 3-dimenional (3D) separable convolutional neural network with multi-input multi-output (3DS_MM) receives a first input including multiple video frames, where the encoder includes a plurality of encoder layers including 3D separable convolutional neural network (CNN) layers; the encoder generates a first encoder output; and a decoder in the 3DS_MM receives the first encoder output and generates a first output including multiple first binary masks related to the first input, where the decoder includes a plurality of decoder layers comprising 3D separable transposed CNN layers.
    Type: Application
    Filed: November 22, 2021
    Publication date: May 26, 2022
    Applicants: KWAI INC., SANTA CLARA UNIVERSITY
    Inventors: Bingxin HOU, Ying LIU, Nam LING, Lingzhi LIU, Yongxiong REN, Ming Kai HSU