Patents by Inventor Junhua Mao

Junhua Mao has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220156965
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for estimating a 3-D pose of an object of interest from image and point cloud data. In one aspect, a method includes obtaining an image of an environment; obtaining a point cloud of a three-dimensional region of the environment; generating a fused representation of the image and the point cloud; and processing the fused representation using a pose estimation neural network and in accordance with current values of a plurality of pose estimation network parameters to generate a pose estimation network output that specifies, for each of multiple keypoints, a respective estimated position in the three-dimensional region of the environment.
    Type: Application
    Filed: October 20, 2021
    Publication date: May 19, 2022
    Inventors: Jingxiao Zheng, Xinwei Shi, Alexander Gorban, Junhua Mao, Andre Liang Cornman, Yang Song, Ting Liu, Ruizhongtai Qi, Yin Zhou, Congcong Li, Dragomir Anguelov
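For readers who want a concrete picture of the fusion step described in publication 20220156965, the following Python sketch pairs each point with the image feature at its projected pixel and regresses keypoint positions from the pooled result. The helper names, shapes, and random stand-in data are illustrative assumptions, not the claimed network.

```python
import numpy as np

def fuse_image_and_points(points_xyz, pixel_uv, image_features):
    """Append the image feature at each point's projected pixel to its 3-D coordinates."""
    u = pixel_uv[:, 0].clip(0, image_features.shape[1] - 1)
    v = pixel_uv[:, 1].clip(0, image_features.shape[0] - 1)
    per_point_img = image_features[v, u]            # (N, C) image feature per point
    return np.concatenate([points_xyz, per_point_img], axis=1)   # (N, 3 + C)

class KeypointRegressor:
    """Toy pose-estimation head: pooled fused features -> K estimated 3-D keypoints."""
    def __init__(self, in_dim, num_keypoints, seed=0):
        rng = np.random.default_rng(seed)
        self.w = rng.normal(scale=0.1, size=(in_dim, num_keypoints * 3))

    def __call__(self, fused):                      # fused: (N, in_dim)
        pooled = fused.mean(axis=0)                 # order-invariant pooling over points
        return (pooled @ self.w).reshape(-1, 3)     # (K, 3) keypoint positions

# Example with random stand-in data.
points = np.random.rand(128, 3)                     # points in the 3-D region
pixels = np.random.randint(0, 64, size=(128, 2))    # their projections into a 64x64 feature map
img_feat = np.random.rand(64, 64, 16)               # per-pixel image features (e.g. from a CNN)

fused = fuse_image_and_points(points, pixels, img_feat)
keypoints = KeypointRegressor(in_dim=3 + 16, num_keypoints=14)(fused)
print(keypoints.shape)                              # (14, 3)
```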
  • Publication number: 20210326609
    Abstract: Some aspects of the subject matter disclosed herein include a system implemented on one or more data processing apparatuses. The system can include an interface configured to obtain, from one or more sensor subsystems, sensor data describing an environment of a vehicle, and to generate, using the sensor data, (i) one or more first neural network inputs representing sensor measurements for a particular object in the environment and (ii) a second neural network input representing sensor measurements for at least a portion of the environment that encompasses the particular object and additional portions of the environment that are not represented by the one or more first neural network inputs; and a convolutional neural network configured to process the second neural network input to generate an output, the output including a plurality of feature vectors that each correspond to a different one of a plurality of regions of the environment.
    Type: Application
    Filed: April 7, 2021
    Publication date: October 21, 2021
    Inventors: Junhua Mao, Qian Yu, Congcong Li
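The context-feature idea in publication 20210326609 can be illustrated with a minimal sketch: a feature map computed over a wide context window is split into region-wise vectors, and the vector for the region containing the object is concatenated with the object's own features. The grid layout, pooling choice, and all names are assumptions for illustration only.

```python
import numpy as np

def region_feature_vectors(context_map, grid=4):
    """Split an (H, W, C) convolutional feature map into grid x grid regions,
    returning one pooled feature vector per region."""
    H, W, C = context_map.shape
    h, w = H // grid, W // grid
    feats = []
    for i in range(grid):
        for j in range(grid):
            cell = context_map[i*h:(i+1)*h, j*w:(j+1)*w]
            feats.append(cell.mean(axis=(0, 1)))    # (C,) vector for this region
    return np.stack(feats)                          # (grid*grid, C)

def object_with_context(object_feat, context_map, obj_xy, grid=4):
    """Concatenate an object's own features with the feature vector of the
    context region that contains the object's (x, y) location in [0, 1)."""
    regions = region_feature_vectors(context_map, grid)
    idx = int(obj_xy[1] * grid) * grid + int(obj_xy[0] * grid)
    return np.concatenate([object_feat, regions[idx]])

context = np.random.rand(32, 32, 8)    # stand-in for a CNN output over the wide context
obj = np.random.rand(5)                # stand-in per-object features
print(object_with_context(obj, context, obj_xy=(0.7, 0.2)).shape)   # (13,)
```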
  • Publication number: 20210294346
    Abstract: Aspects of the disclosure relate to training and using a model for identifying actions of objects. For instance, LIDAR sensor data frames including an object bounding box corresponding to an object as well as an action label for the bounding box may be received. Each sensor frame is associated with a timestamp and is sequenced with respect to other sensor frames. Each given sensor data frame may be projected into a camera image of the object based on the timestamp associated with the given sensor data frame in order to provide fused data. The model may be trained using the fused data such that, in response to receiving fused data, the model outputs an action label for each object bounding box of the fused data. This output may then be used to control a vehicle in an autonomous driving mode.
    Type: Application
    Filed: June 9, 2021
    Publication date: September 23, 2021
    Inventors: Junhua Mao, Congcong Li, Alper Ayvaci, Chen Sun, Kevin Murphy, Ruichi Yu
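A minimal sketch of the fusion step described in publication 20210294346: each labeled LiDAR bounding box is paired with a crop from the camera image whose timestamp is nearest to the LiDAR frame's. The pinhole projection, dictionary layout, and field names (`center_xyz`, `action_label`) are illustrative assumptions, not the patent's data model.

```python
import numpy as np

def project_to_image(point_xyz, intrinsics):
    """Pinhole projection of a 3-D point in the camera frame to pixel coordinates."""
    x, y, z = point_xyz
    u = intrinsics[0, 0] * x / z + intrinsics[0, 2]
    v = intrinsics[1, 1] * y / z + intrinsics[1, 2]
    return int(u), int(v)

def fuse_frame(lidar_frame, camera_images, intrinsics, patch=32):
    """Pair each labeled LiDAR bounding box with an image crop from the camera
    image whose timestamp is closest to the LiDAR frame's timestamp."""
    image, _ = min(camera_images, key=lambda im: abs(im[1] - lidar_frame["timestamp"]))
    fused = []
    for box in lidar_frame["boxes"]:
        u, v = project_to_image(box["center_xyz"], intrinsics)
        h, w = image.shape[:2]
        u, v = np.clip(u, patch, w - patch), np.clip(v, patch, h - patch)
        crop = image[v - patch:v + patch, u - patch:u + patch]
        fused.append({"image_crop": crop, "points": box["points"], "action": box["action_label"]})
    return fused    # training examples for the action-classification model

K = np.array([[500., 0., 320.], [0., 500., 240.], [0., 0., 1.]])
frame = {"timestamp": 10.00,
         "boxes": [{"center_xyz": (2.0, 0.5, 8.0), "points": np.zeros((64, 3)),
                    "action_label": "walking"}]}
cams = [(np.zeros((480, 640, 3)), 9.98), (np.zeros((480, 640, 3)), 10.05)]
print(fuse_frame(frame, cams, K)[0]["image_crop"].shape)   # (64, 64, 3)
```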
  • Publication number: 20210279465
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing data generated by a sensing system that rotationally senses an environment. In one aspect, a method comprises partitioning a predetermined period of time into a plurality of sub-periods, wherein the predetermined period of time is a period of time for which data generated by the sensing system constitutes a complete rotational sensing of the environment; for each sub-period: receiving current data generated by the sensing system during the sub-period and characterizing a respective partial scene of the environment; processing the current data using an object detection neural network to generate a current object detection output that is specific to the respective partial scene of the environment.
    Type: Application
    Filed: March 6, 2020
    Publication date: September 9, 2021
    Inventors: Jonathon Shlens, Vijay Vasudevan, Jiquan Ngiam, Wei Han, Zhifeng Chen, Brandon Chauloon Yang, Benjamin James Caine, Zhengdong Zhang, Christoph Sprunk, Ouais Alsharif, Junhua Mao, Chen Wu
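To make the sub-period idea in publication 20210279465 concrete, the sketch below buffers points from a rotating sensor and invokes a detector once per sub-period, so each detection output covers only the partial scene swept during that slice. The streaming interface and the toy detector are assumptions made for illustration.

```python
import numpy as np

def detect_streaming(point_stream, detector, period=0.1, num_subperiods=4):
    """Run the detector on each angular slice of a rotating sensor's sweep as it
    arrives, instead of waiting for the full-rotation point cloud."""
    sub = period / num_subperiods
    buffer, slice_start, outputs = [], None, []
    for t, point in point_stream:                   # (timestamp, xyz) in arrival order
        if slice_start is None:
            slice_start = t
        if t - slice_start >= sub:                  # this sub-period's partial scene is complete
            outputs.append(detector(np.array(buffer)))
            buffer, slice_start = [], t
        buffer.append(point)
    if buffer:
        outputs.append(detector(np.array(buffer)))
    return outputs                                  # one detection output per partial scene

# Toy detector: report the number of points and their centroid for the slice.
toy_detector = lambda pts: {"num_points": len(pts), "centroid": pts.mean(axis=0)}
stream = [(i * 0.0025, np.random.rand(3)) for i in range(40)]   # one 0.1 s sweep
print(len(detect_streaming(stream, toy_detector)))              # 4 partial-scene outputs
```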
  • Patent number: 11061406
    Abstract: Aspects of the disclosure relate to training and using a model for identifying actions of objects. For instance, LIDAR sensor data frames including an object bounding box corresponding to an object as well as an action label for the bounding box may be received. Each sensor frame is associated with a timestamp and is sequenced with respect to other sensor frames. Each given sensor data frame may be projected into a camera image of the object based on the timestamp associated with the given sensor data frame in order to provide fused data. The model may be trained using the fused data such that, in response to receiving fused data, the model outputs an action label for each object bounding box of the fused data. This output may then be used to control a vehicle in an autonomous driving mode.
    Type: Grant
    Filed: October 22, 2018
    Date of Patent: July 13, 2021
    Assignee: Waymo LLC
    Inventors: Junhua Mao, Congcong Li, Alper Ayvaci, Chen Sun, Kevin Murphy, Ruichi Yu
  • Publication number: 20210191395
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating vehicle intent predictions using a neural network. One of the methods includes obtaining an input characterizing one or more vehicles in an environment; generating, from the input, features of each of the vehicles; and for each of the vehicles: processing the features of the vehicle using each of a plurality of intent-specific neural networks, wherein each of the intent-specific neural networks corresponds to a respective intent from a set of intents, and wherein each intent-specific neural network is configured to process the features of the vehicle to generate an output for the corresponding intent.
    Type: Application
    Filed: December 20, 2019
    Publication date: June 24, 2021
    Inventors: Jiyang Gao, Junhua Mao, Yi Shen, Congcong Li, Chen Sun
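Publication 20210191395 describes one intent-specific network per intent in a fixed intent set, all consuming the same per-vehicle features. The sketch below mirrors that structure with trivial sigmoid heads; the intent names, feature dimension, and head design are illustrative assumptions.

```python
import numpy as np

INTENTS = ("go_straight", "turn_left", "turn_right", "lane_change")   # illustrative intent set

def make_head(in_dim, out_dim=1, seed=0):
    rng = np.random.default_rng(seed)
    w, b = rng.normal(scale=0.1, size=(in_dim, out_dim)), np.zeros(out_dim)
    return lambda x: 1.0 / (1.0 + np.exp(-(x @ w + b)))   # per-intent probability

class IntentPredictor:
    """One small intent-specific network per intent; all heads share the same
    per-vehicle feature vector as input."""
    def __init__(self, feature_dim):
        self.heads = {intent: make_head(feature_dim, seed=i)
                      for i, intent in enumerate(INTENTS)}

    def __call__(self, vehicle_features):
        return {intent: head(vehicle_features).item()
                for intent, head in self.heads.items()}

predictor = IntentPredictor(feature_dim=32)
features = np.random.rand(32)          # stand-in features for one vehicle
print(predictor(features))             # e.g. {'go_straight': 0.53, 'turn_left': 0.47, ...}
```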
  • Publication number: 20210150199
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for processing point cloud data using spatio-temporal-interactive networks.
    Type: Application
    Filed: November 16, 2020
    Publication date: May 20, 2021
    Inventors: Junhua Mao, Jiyang Gao, Yukai Liu, Congcong Li, Zhishuai Zhang, Dragomir Anguelov
  • Patent number: 10977501
    Abstract: Some aspects of the subject matter disclosed herein include a system implemented on one or more data processing apparatuses. The system can include an interface configured to obtain, from one or more sensor subsystems, sensor data describing an environment of a vehicle, and to generate, using the sensor data, (i) one or more first neural network inputs representing sensor measurements for a particular object in the environment and (ii) a second neural network input representing sensor measurements for at least a portion of the environment that encompasses the particular object and additional portions of the environment that are not represented by the one or more first neural network inputs; and a convolutional neural network configured to process the second neural network input to generate an output, the output including a plurality of feature vectors that each correspond to a different one of a plurality of regions of the environment.
    Type: Grant
    Filed: December 21, 2018
    Date of Patent: April 13, 2021
    Assignee: Waymo LLC
    Inventors: Junhua Mao, Qian Yu, Congcong Li
  • Patent number: 10909329
    Abstract: Embodiments of a multimodal question answering (mQA) system are presented to answer a question about the content of an image. In embodiments, the model comprises four components: a Long Short-Term Memory (LSTM) component to extract the question representation; a Convolutional Neural Network (CNN) component to extract the visual representation; an LSTM component for storing the linguistic context in an answer; and a fusing component to combine the information from the first three components and generate the answer. A Freestyle Multilingual Image Question Answering (FM-IQA) dataset was constructed to train and evaluate embodiments of the mQA model. The quality of the generated answers of the mQA model on this dataset is evaluated by human judges through a Turing Test.
    Type: Grant
    Filed: April 25, 2016
    Date of Patent: February 2, 2021
    Assignee: Baidu USA LLC
    Inventors: Haoyuan Gao, Junhua Mao, Jie Zhou, Zhiheng Huang, Lei Wang, Wei Xu
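A structural sketch of the four-component mQA layout from patent 10,909,329 follows; the question and answer LSTMs and the CNN are replaced by mean-pooling stand-ins, so only the component wiring (question encoder, visual encoder, answer state, fusing layer feeding a word distribution) reflects the abstract. The vocabulary, dimensions, and greedy decoding are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
VOCAB = {"<start>": 0, "<end>": 1, "what": 2, "color": 3, "is": 4, "the": 5,
         "cat": 6, "black": 7, "white": 8}
EMBED = rng.normal(size=(len(VOCAB), 16))           # shared word embeddings

def encode_question(tokens):                         # stand-in for the question LSTM
    return EMBED[[VOCAB[t] for t in tokens]].mean(axis=0)

def encode_image(image):                             # stand-in for the CNN visual encoder
    return image.reshape(-1, image.shape[-1]).mean(axis=0)

def fuse(question_vec, image_vec, answer_state, w):  # fusing component -> word scores
    return np.tanh(np.concatenate([question_vec, image_vec, answer_state]) @ w)

def generate_answer(tokens, image, max_len=5):
    """Greedy word-by-word decoding: the answer state (here, the mean embedding of
    the words emitted so far) is fused with the question and image representations."""
    q, v = encode_question(tokens), encode_image(image)
    w_fuse = rng.normal(scale=0.1, size=(q.size + v.size + EMBED.shape[1], len(VOCAB)))
    answer = ["<start>"]
    for _ in range(max_len):
        state = EMBED[[VOCAB[t] for t in answer]].mean(axis=0)
        word = max(VOCAB, key=lambda t: fuse(q, v, state, w_fuse)[VOCAB[t]])
        answer.append(word)
        if word == "<end>":
            break
    return answer[1:]

image = rng.random((8, 8, 3))                        # stand-in image / CNN feature map
print(generate_answer(["what", "color", "is", "the", "cat"], image))
```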
  • Patent number: 10867210
    Abstract: Aspects of the subject matter disclosed herein include methods, systems, and other techniques for training, in a first phase, an object classifier neural network with a first set of training data, the first set of training data including a first plurality of training examples, each training example in the first set of training data being labeled with a coarse-object classification; and training, in a second phase after completion of the first phase, the object classifier neural network with a second set of training data, the second set of training data including a second plurality of training examples, each training example in the second set of training data being labeled with a fine-object classification.
    Type: Grant
    Filed: December 21, 2018
    Date of Patent: December 15, 2020
    Assignee: Waymo LLC
    Inventors: Junhua Mao, Congcong Li, Yang Song
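The two-phase schedule in patent 10,867,210 can be sketched as: train on coarse labels, keep the learned feature layer, swap in a new output head sized for the fine label set, and continue training. The tiny two-layer network below is an illustrative stand-in, not the claimed classifier.

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

class TwoPhaseClassifier:
    """Tiny network with a shared feature layer and a swappable output head, so the
    coarse-label phase initializes features reused by the fine-label phase."""
    def __init__(self, in_dim, hidden=16, num_classes=2, lr=0.05, seed=0):
        self.rng = np.random.default_rng(seed)
        self.lr = lr
        self.w1 = self.rng.normal(scale=0.3, size=(in_dim, hidden))       # shared features
        self.w2 = self.rng.normal(scale=0.3, size=(hidden, num_classes))  # label head

    def reset_head(self, num_classes):
        self.w2 = self.rng.normal(scale=0.3, size=(self.w1.shape[1], num_classes))

    def train(self, examples):                      # examples: list of (x, label_index)
        for x, y in examples:
            h = np.tanh(x @ self.w1)
            p = softmax(h @ self.w2)
            p[y] -= 1.0                             # d(cross-entropy)/d(logits)
            grad_h = self.w2 @ p * (1 - h ** 2)
            self.w2 -= self.lr * np.outer(h, p)
            self.w1 -= self.lr * np.outer(x, grad_h)

model = TwoPhaseClassifier(in_dim=8, num_classes=3)
coarse = [(np.random.rand(8), np.random.randint(3)) for _ in range(200)]   # coarse labels
fine = [(np.random.rand(8), np.random.randint(7)) for _ in range(200)]     # fine labels

model.train(coarse)       # phase 1: coarse-object classification
model.reset_head(7)       # same features, new fine-grained label space
model.train(fine)         # phase 2: fine-object classification
```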
  • Publication number: 20200202168
    Abstract: Aspects of the subject matter disclosed herein include methods, systems, and other techniques for training, in a first phase, an object classifier neural network with a first set of training data, the first set of training data including a first plurality of training examples, each training example in the first set of training data being labeled with a coarse-object classification; and training, in a second phase after completion of the first phase, the object classifier neural network with a second set of training data, the second set of training data including a second plurality of training examples, each training example in the second set of training data being labeled with a fine-object classification.
    Type: Application
    Filed: December 21, 2018
    Publication date: June 25, 2020
    Inventors: Junhua Mao, Congcong Li, Yang Song
  • Publication number: 20200202145
    Abstract: Some aspects of the subject matter disclosed herein include a system implemented on one or more data processing apparatuses. The system can include an interface configured to obtain, from one or more sensor subsystems, sensor data describing an environment of a vehicle, and to generate, using the sensor data, (i) one or more first neural network inputs representing sensor measurements for a particular object in the environment and (ii) a second neural network input representing sensor measurements for at least a portion of the environment that encompasses the particular object and additional portions of the environment that are not represented by the one or more first neural network inputs; and a convolutional neural network configured to process the second neural network input to generate an output, the output including a plurality of feature vectors that each correspond to a different one of a plurality of regions of the environment.
    Type: Application
    Filed: December 21, 2018
    Publication date: June 25, 2020
    Inventors: Junhua Mao, Qian Yu, Congcong Li
  • Publication number: 20200202209
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for training a classifier to detect open vehicle doors. One of the methods includes obtaining a plurality of initial training examples, each initial training example comprising (i) a sensor sample from a collection of sensor samples and (ii) data classifying the sensor sample as characterizing a vehicle that has an open door; generating a plurality of additional training examples, comprising, for each initial training example: identifying, from the collection of sensor samples, one or more additional sensor samples that were captured less than a threshold amount of time before the sensor sample in the initial training example was captured; and training the machine learning classifier on first training data that includes the initial training examples and the additional training examples to generate updated weights for the machine learning classifier.
    Type: Application
    Filed: December 21, 2018
    Publication date: June 25, 2020
    Inventors: Junhua Mao, Lo Po Tsui, Congcong Li, Edward Stephen Walker, JR.
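A minimal sketch of the training-set expansion described in publication 20200202209: for each sample labeled as showing an open door, samples captured less than a threshold amount of time earlier are added as additional examples. The same-vehicle matching and the propagation of the label to the earlier samples are assumptions made for this illustration.

```python
from dataclasses import dataclass

@dataclass
class SensorSample:
    sample_id: str
    timestamp: float          # seconds
    track_id: str             # which observed vehicle the sample comes from

def expand_training_examples(labeled, collection, threshold_s=2.0):
    """For each sample labeled 'open door', also pull samples of the same vehicle
    captured less than `threshold_s` seconds earlier, on the assumption that the
    door was likely already open in those frames."""
    extra = []
    for sample, label in labeled:
        for cand in collection:
            if (cand.track_id == sample.track_id
                    and 0.0 < sample.timestamp - cand.timestamp < threshold_s):
                extra.append((cand, label))
    return labeled + extra    # combined first training data

collection = [SensorSample(f"s{i}", 0.5 * i, "veh_1") for i in range(10)]
labeled = [(collection[6], "open_door")]          # one hand-labeled positive at t = 3.0 s
print(len(expand_training_examples(labeled, collection)))   # 1 original + 3 earlier samples
```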
  • Publication number: 20200202196
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for searching an autonomous vehicle sensor data repository. One of the methods includes maintaining a collection of sensor samples and, for each sensor sample, an embedding of the sensor sample; receiving a request specifying a query sensor sample, wherein the query sensor sample characterizes a query environment region; and identifying, from the collection of sensor samples, a plurality of relevant sensor samples that characterize similar environment regions to the query environment region, comprising: processing the query sensor sample through the embedding neural network to generate a query embedding; and identifying, from sensor samples in a subset of the sensor samples in the collection, a plurality of sensor samples that have embeddings that are closest to the query embedding.
    Type: Application
    Filed: December 23, 2019
    Publication date: June 25, 2020
    Inventors: Zijian Guo, Nichola Abdo, Junhua Mao, Congcong Li, Edward Stephen Walker, JR.
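The retrieval flow in publication 20200202196 reduces to: embed every sample in the collection, embed the query sample with the same network, and return the samples with the closest embeddings. The sketch below uses a random projection as a stand-in embedding network and brute-force L2 search; a production index would presumably use an approximate-nearest-neighbor structure instead.

```python
import numpy as np

class SensorSampleIndex:
    """In-memory stand-in for the repository: keeps an embedding per sensor sample
    and returns the samples whose embeddings are closest to a query embedding."""
    def __init__(self, embed_fn):
        self.embed_fn = embed_fn          # the embedding network (here: any callable)
        self.ids, self.embeddings = [], []

    def add(self, sample_id, sensor_sample):
        self.ids.append(sample_id)
        self.embeddings.append(self.embed_fn(sensor_sample))

    def search(self, query_sample, k=5):
        query = self.embed_fn(query_sample)
        dists = np.linalg.norm(np.stack(self.embeddings) - query, axis=1)
        return [self.ids[i] for i in np.argsort(dists)[:k]]

# Toy embedding "network": project the flattened sample with a fixed random matrix.
rng = np.random.default_rng(0)
proj = rng.normal(size=(192, 32))
embed = lambda sample: sample.reshape(-1) @ proj

index = SensorSampleIndex(embed)
for i in range(100):
    index.add(f"sample_{i}", rng.random((8, 8, 3)))    # stand-in sensor samples
print(index.search(rng.random((8, 8, 3)), k=3))        # ids of the 3 most similar samples
```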
  • Publication number: 20200125112
    Abstract: Aspects of the disclosure relate to training and using a model for identifying actions of objects. For instance, LIDAR sensor data frames including an object bounding box corresponding to an object as well as an action label for the bounding box may be received. Each sensor frame is associated with a timestamp and is sequenced with respect to other sensor frames. Each given sensor data frame may be projected into a camera image of the object based on the timestamp associated with the given sensor data frame in order to provide fused data. The model may be trained using the fused data such that, in response to receiving fused data, the model outputs an action label for each object bounding box of the fused data. This output may then be used to control a vehicle in an autonomous driving mode.
    Type: Application
    Filed: October 22, 2018
    Publication date: April 23, 2020
    Inventors: Junhua Mao, Congcong Li, Alper Ayvaci, Chen Sun, Kevin Murphy, Ruichi Yu
  • Patent number: 10510427
    Abstract: The present invention relates to the technical field of integrated circuits. Disclosed is a one-time programmable memory with a high reliability and a low reading voltage, comprising: a first MOS transistor, a second MOS transistor, and an antifuse component. A gate terminal of the first MOS transistor is connected to a second connecting line (WS), a first connection terminal of the first MOS transistor is connected to the antifuse component, the antifuse component is connected to a first connecting line (WP), and a second connection terminal of the first MOS transistor is connected to a third connecting line (BL). A first connection terminal of the second MOS transistor is connected to a fourth connecting line (BR), and a second connection terminal of the second MOS transistor is connected to the third connecting line (BL). The invention further comprises a voltage limiting device with a control terminal and two connection terminals.
    Type: Grant
    Filed: February 18, 2016
    Date of Patent: December 17, 2019
    Assignee: SICHUAN KILOWAY ELECTRONICS INC.
    Inventors: Xuyang Liao, Junhua Mao, Jack Z. Peng
  • Patent number: 10504010
    Abstract: Described herein are systems and methods that address the task of learning novel visual concepts, and their interactions with other concepts, from a few images with sentence descriptions. Using linguistic context and visual features, embodiments are able to efficiently hypothesize the semantic meaning of new words and add them to model word dictionaries so that they can be used to describe images which contain these novel concepts. In the experiments, it was shown that the tested embodiments effectively learned novel visual concepts from a few examples without disturbing the previously learned concepts.
    Type: Grant
    Filed: January 27, 2017
    Date of Patent: December 10, 2019
    Assignee: Baidu USA LLC
    Inventors: Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang
  • Patent number: 10504908
    Abstract: A high-reliability one-time programmable memory adopting series high voltage partition, which relates to integrated circuit technology and comprises a first MOS tube, a second MOS tube and an anti-fuse element, wherein a gate end of the first MOS tube is connected to a second connecting line (WS), a first connecting end of the first MOS tube is connected to a gate end of the second MOS tube and a voltage limiting device, and a second connecting end of the first MOS tube is connected to a third connecting line (BL); a first connecting end of the second MOS tube is connected to a fourth connecting line (BR), a second connecting end of the second MOS tube is connected to the third connecting line (BL), and a gate end of the second MOS tube is connected to the voltage limiting device and the second connecting end of the first MOS tube.
    Type: Grant
    Filed: February 18, 2016
    Date of Patent: December 10, 2019
    Assignee: SICHUAN KILOWAY ELECTRONICS INC.
    Inventors: Jack Z. Peng, Junhua Mao, Xuyang Liao
  • Publication number: 20190370633
    Abstract: Presented herein are embodiments of a multimodal Recurrent Neural Network (m-RNN) model for generating novel image captions. In embodiments, it directly models the probability distribution of generating a word given a previous word or words and an image, and image captions are generated according to this distribution. In embodiments, the model comprises two sub-networks: a deep recurrent neural network for sentences and a deep convolutional network for images. In embodiments, these two sub-networks interact with each other in a multimodal layer to form the whole m-RNN model. The effectiveness of an embodiment of the model was validated on four benchmark datasets, and it outperformed the state-of-the-art methods. In embodiments, the m-RNN model may also be applied to retrieval tasks for retrieving images or captions.
    Type: Application
    Filed: August 19, 2019
    Publication date: December 5, 2019
    Applicant: BAIDU USA LLC
    Inventors: Junhua Mao, Wei Xu, Yi Yang, Jiang Wang, Zhiheng Huang
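A toy rendering of the m-RNN decoding loop from publication 20190370633: at each step a multimodal layer combines the current word embedding, a recurrent state, and the image representation to score the next word. The single-matrix recurrence, vocabulary, and greedy decoding are stand-ins for the deep recurrent and convolutional sub-networks described in the abstract.

```python
import numpy as np

rng = np.random.default_rng(1)
VOCAB = ["<start>", "<end>", "a", "dog", "cat", "on", "grass", "sofa"]
IDX = {w: i for i, w in enumerate(VOCAB)}
D = 16
embed = rng.normal(size=(len(VOCAB), D))            # word-embedding sub-network (stand-in)
w_rec = rng.normal(scale=0.3, size=(D, D))          # recurrent weights (stand-in for the RNN)
w_multi = rng.normal(scale=0.3, size=(3 * D, len(VOCAB)))   # multimodal layer -> word scores

def caption(image_feature, max_len=8):
    """Greedy m-RNN-style decoding: at each step the multimodal layer combines the
    current word embedding, the recurrent state, and the image feature."""
    words, h = ["<start>"], np.zeros(D)
    for _ in range(max_len):
        x = embed[IDX[words[-1]]]
        h = np.tanh(x @ w_rec + h)                  # simplified recurrent update
        multimodal = np.concatenate([x, h, image_feature])
        next_word = VOCAB[int(np.argmax(multimodal @ w_multi))]
        words.append(next_word)
        if next_word == "<end>":
            break
    return " ".join(words[1:])

image_feature = rng.normal(size=D)                  # stand-in for the deep CNN's image vector
print(caption(image_feature))
```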
  • Publication number: 20190341393
    Abstract: A high-reliability one-time programmable memory adopting series high voltage partition, which relates to integrated circuit technology and comprises a first MOS tube, a second MOS tube and an anti-fuse element, wherein a gate end of the first MOS tube is connected to a second connecting line (WS), a first connecting end of the first MOS tube is connected to a gate end of the second MOS tube and a voltage limiting device, and a second connecting end of the first MOS tube is connected to a third connecting line (BL); a first connecting end of the second MOS tube is connected to a fourth connecting line (BR), a second connecting end of the second MOS tube is connected to the third connecting line (BL), and a gate end of the second MOS tube is connected to the voltage limiting device and the second connecting end of the first MOS tube.
    Type: Application
    Filed: February 18, 2016
    Publication date: November 7, 2019
    Applicant: SICHUAN KILOWAY ELECTRONICS INC.
    Inventors: Jack Z. Peng, Junhua Mao, Xuyang Liao