Patents by Inventor Haibo Ding

Haibo Ding has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Oxygen reduction device in ion source region of inductively coupled plasma

Patent number: 12057291

Abstract: An oxygen reduction device in an ion source region of inductively coupled plasma is provided. The oxygen reduction device includes a torch and an inflation sleeve. An upper end of the inflation sleeve is sealed and sleeved outside the torch, and a lower end of the inflation sleeve and the torch are arranged at an interval to form an inflation gap. An inflation hole communicating with the inflation gap is formed in an outer side wall of the inflation sleeve. An outer side face of the lower end of the inflation sleeve is protruded outwards to form an annular gas guiding protrusion. The annular gas guiding protrusion is configured for being arranged opposite to a sampling cone base arranged below the torch, and a gas outlet gap is formed between the annular gas guiding protrusion and the sampling cone base.

Type: Grant

Filed: October 31, 2022

Date of Patent: August 6, 2024

Assignees: China University of Geosciences, Hangzhou PuYu Technology Development Co., Ltd.

Inventors: Yongsheng Liu, Xin Jiang, Xiaofeng Xia, Haibo Ding, Wengui Liu, Jie Lin, Chengyi Zhang, Shuimiao Lu, Lifei Chen
METHODS AND SYSTEMS FOR AUTOMATED CREATION OF ANNOTATED DATA AND TRAINING OF A MACHINE LEARNING MODEL THEREFROM

Publication number: 20240112014

Abstract: The systems and methods described herein are directed to a Co-Augmentation framework that may learn new rules and labels simultaneously from unlabeled data with a small set of seed rules and a few manually labeled training data. The augmented rules and labels are further used to train supervised neural network models. Specifically, the systems and methods described herein include two major components: a rule augmenter, and a label augmenter. The rule augmenter is directed to learning new rules, which can be used to obtain weak labels from unlabeled data. The label augmenter is directed to learning new labels from unlabeled data. The Co-Augmentation framework is an iterative learning process which generates and refines a high precision set. At each iteration, both the rule augmenter and label augmenter will contribute new and more accurate labels to the high precision set, which is in turn used to train both the rule augmenter and label augmenter.

Type: Application

Filed: September 26, 2022

Publication date: April 4, 2024

Inventors: Yuheng Wang, Haibo Ding, Bingqing Wang, Zhe Feng
System and method for domain-independent terminology linking

Patent number: 11907662

Abstract: An automatic terminology linking system includes a candidate generator configured to identify candidate nodes for each terminology that is to be linked to a node of the knowledge base. A pseudo-candidate generator is configured to identify pseudo-candidate nodes for candidate-less terminologies. A candidate scorer is configured to respectively score the candidate nodes and the pseudo-candidate nodes by collective inference using occurrence statistics and co-occurrence statistics for these nodes. The pseudo-candidate generator is configured to identify knowledge base nodes that are semantically-related to candidate-less terminology as the pseudo-candidate nodes for the candidate-less terminology.

Type: Grant

Filed: December 27, 2018

Date of Patent: February 20, 2024

Assignee: Robert Bosch GmbH

Inventors: Haibo Ding, Yifan He, Lin Zhao, Kui Xu, Zhe Feng
DOMAIN-SPECIFIC NAMED ENTITY RECOGNITION VIA GRAPH NEURAL NETWORKS

Publication number: 20240037339

Abstract: A computer-implemented method of Named Entity Recognition (NER) includes receiving an input, identifying a plurality of candidate entities corresponding to the input, assigning word embeddings to the input at an embedding layer, capturing sequential context of the word embeddings in an encoding layer to obtain encoded word embeddings, constructing an entity relation graph using global coreference relations and local dependency relations to obtain a coreference graph and a dependency graph, fusing the encoded word embeddings, coreference graph, and dependency graph, via a graphical neural network (GNN), to obtain updated word embeddings, and decoding the updated word embeddings via a decoding layer to obtain enriched entity predictions.

Type: Application

Filed: July 29, 2022

Publication date: February 1, 2024

Inventors: Pei Chen, Haibo Ding, Jun Araki, Ruihong Huang
Weakly supervised and explainable training of a machine-learning-based named-entity recognition (NER) mechanism

Patent number: 11775763

Abstract: Systems and methods for weakly-supervised training a machine-learning model to perform named-entity recognition. All possible entity candidates and all possible rule candidates are automatically identified in an input data set of unlabeled text. An initial training of the machine-learning model is performed using labels assigned to entity candidates by a set of seeding rules as a first set of training data. The trained machine-learning model is then applied to the unlabeled text and a subset of rules from the rule candidates is identified that produces labels that most accurately match the labels assigned by the trained machine-learning model. The machine-learning model is then retrained using the labels assigned by the identified subset of rules as the second set of training data. This process is iteratively repeated to further refine and improve the performance of the machine-learning model for named-entity recognition.

Type: Grant

Filed: February 25, 2021

Date of Patent: October 3, 2023

Assignee: Robert Bosch GmbH

Inventors: Jiacheng Li, Haibo Ding, Zhe Feng
Automatically labeling data using conceptual descriptions

Patent number: 11720748

Abstract: A system for automatically labeling data using conceptual descriptions. In one example, the system includes an electronic processor configured to generate unlabeled training data examples from one or more natural language documents and, for each of a plurality of categories, determine one or more concepts associated with a conceptual description of the category and generate a weak annotator for each of the one or more concepts. The electronic processor is also configured to apply each weak annotator to each training data example and, when a training data example satisfies a weak annotator, output a category associated with the weak annotator. For each training data example, the electronic processor determines a probabilistic distribution of the plurality of categories. For each training data example, the electronic processor labels the training data example with a category having the highest value in the probabilistic distribution determined for the training data example.

Type: Grant

Filed: April 27, 2020

Date of Patent: August 8, 2023

Assignee: Robert Bosch GmbH

Inventors: Haibo Ding, Zhe Feng
Graph-based labeling rule augmentation for weakly supervised training of machine-learning-based named entity recognition

Patent number: 11669740

Abstract: Systems and methods for training a machine-learning model for named-entity recognition. A rule graph is constructed including a plurality of nodes each corresponding to a different labeling rule of a set of labeling rules (including a set of seeding rules of known labeling accuracy and a plurality of candidate rules of unknown labeling accuracy). The nodes are coupled to other nodes based on which rules exhibit the highest sematic similarity. A labeling accuracy metric is estimated for each candidate rule by propagating a labeling confidence metric through the rule graph from the seeding rules to each candidate rule. A subset of labeling rules is then identified by ranking the rules by their labeling confidence metric. The identified subset of labeling rules is applied to unlabeled data to generate a set of weakly labeled named entities and the machine-learning model is trained based on the set of weakly labeled named entities.

Type: Grant

Filed: February 25, 2021

Date of Patent: June 6, 2023

Assignee: Robert Bosch GmbH

Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
Oxygen Reduction Device in Ion Source Region of Inductively Coupled Plasma

Publication number: 20230137483

Abstract: An oxygen reduction device in an ion source region of inductively coupled plasma is provided. The oxygen reduction device includes a torch and an inflation sleeve. An upper end of the inflation sleeve is sealed and sleeved outside the torch, and a lower end of the inflation sleeve and the torch are arranged at an interval to form an inflation gap. An inflation hole communicating with the inflation gap is formed in an outer side wall of the inflation sleeve. An outer side face of the lower end of the inflation sleeve is protruded outwards to form an annular gas guiding protrusion. The annular gas guiding protrusion is configured for being arranged opposite to a sampling cone base arranged below the torch, and a gas outlet gap is formed between the annular gas guiding protrusion and the sampling cone base.

Type: Application

Filed: October 31, 2022

Publication date: May 4, 2023

Inventors: Yongsheng LIU, Xin JIANG, Xiaofeng XIA, Haibo DING, Wengui LIU, Jie LIN, Chengyi ZHANG, Shuimiao LU, Lifei CHEN
WEAKLY SUPERVISED AND EXPLAINABLE TRAINING OF A MACHINE-LEARNING-BASED NAMED-ENTITY RECOGNITION (NER) MECHANISM

Publication number: 20220269862

Abstract: Systems and methods for weakly-supervised training a machine-learning model to perform named-entity recognition. All possible entity candidates and all possible rule candidates are automatically identified in an input data set of unlabeled text. An initial training of the machine-learning model is performed using labels assigned to entity candidates by a set of seeding rules as a first set of training data. The trained machine-learning model is then applied to the unlabeled text and a subset of rules from the rule candidates is identified that produces labels that most accurately match the labels assigned by the trained machine-learning model. The machine-learning model is then retrained using the labels assigned by the identified subset of rules as the second set of training data. This process is iteratively repeated to further refine and improve the performance of the machine-learning model for named-entity recognition.

Type: Application

Filed: February 25, 2021

Publication date: August 25, 2022

Inventors: Jiacheng Li, Haibo Ding, Zhe Feng
GRAPH-BASED LABELING RULE AUGMENTATION FOR WEAKLY SUPERVISED TRAINING OF MACHINE-LEARNING-BASED NAMED ENTITY RECOGNITION

Publication number: 20220269939

Abstract: Systems and methods for training a machine-learning model for named-entity recognition. A rule graph is constructed including a plurality of nodes each corresponding to a different labeling rule of a set of labeling rules (including a set of seeding rules of known labeling accuracy and a plurality of candidate rules of unknown labeling accuracy). The nodes are coupled to other nodes based on which rules exhibit the highest sematic similarity. A labeling accuracy metric is estimated for each candidate rule by propagating a labeling confidence metric through the rule graph from the seeding rules to each candidate rule. A subset of labeling rules is then identified by ranking the rules by their labeling confidence metric. The identified subset of labeling rules is applied to unlabeled data to generate a set of weakly labeled named entities and the machine-learning model is trained based on the set of weakly labeled named entities.

Type: Application

Filed: February 25, 2021

Publication date: August 25, 2022

Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
Weakly supervised semantic entity recognition using general and target domain knowledge

Patent number: 11423228

Abstract: Methods and systems for performing semantic entity recognition. The method includes accessing a document stored in a memory and selecting from a general knowledge data repository, target domain information based on a specified target domain. The method also includes generating a plurality of weak annotators for the document based upon the selected target domain information and expert knowledge from a domain-specific expert knowledge data repository and applying the plurality of weak annotators to the document to generate a plurality of weak labels. The method further includes selecting at least one weak label from the plurality of weak labels as training data and training a semantic entity prediction model using the training data.

Type: Grant

Filed: April 9, 2020

Date of Patent: August 23, 2022

Assignee: Robert Bosch GmbH

Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
AUTOMATICALLY LABELING DATA USING CONCEPTUAL DESCRIPTIONS

Publication number: 20210334596

Abstract: A system for automatically labeling data using conceptual descriptions. In one example, the system includes an electronic processor configured to generate unlabeled training data examples from one or more natural language documents and, for each of a plurality of categories, determine one or more concepts associated with a conceptual description of the category and generate a weak annotator for each of the one or more concepts. The electronic processor is also configured to apply each weak annotator to each training data example and, when a training data example satisfies a weak annotator, output a category associated with the weak annotator. For each training data example, the electronic processor determines a probabilistic distribution of the plurality of categories. For each training data example, the electronic processor labels the training data example with a category having the highest value in the probabilistic distribution determined for the training data example.

Type: Application

Filed: April 27, 2020

Publication date: October 28, 2021

Inventors: Haibo Ding, Zhe Feng
WEAKLY SUPERVISED SEMANTIC ENTITY RECOGNITION USING GENERAL AND TARGET DOMAIN KNOWLEDGE

Publication number: 20210319183

Abstract: Methods and systems for performing semantic entity recognition. The method includes accessing a document stored in a memory and selecting from a general knowledge data repository, target domain information based on a specified target domain. The method also includes generating a plurality of weak annotators for the document based upon the selected target domain information and expert knowledge from a domain-specific expert knowledge data repository and applying the plurality of weak annotators to the document to generate a plurality of weak labels. The method further includes selecting at least one weak label from the plurality of weak labels as training data and training a semantic entity prediction model using the training data.

Type: Application

Filed: April 9, 2020

Publication date: October 14, 2021

Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
System and Method for Domain-Independent Terminology Linking

Publication number: 20200342178

Abstract: An automatic terminology linking system includes a candidate generator configured to identify candidate nodes for each terminology that is to be linked to a node of the knowledge base. A pseudo-candidate generator is configured to identify pseudo-candidate nodes for candidate-less terminologies. A candidate scorer is configured to respectively score the candidate nodes and the pseudo-candidate nodes by collective inference using occurrence statistics and co-occurrence statistics for these nodes. The pseudo-candidate generator is configured to identify knowledge base nodes that are semantically-related to candidate-less terminology as the pseudo-candidate nodes for the candidate-less terminology.

Type: Application

Filed: December 27, 2018

Publication date: October 29, 2020

Inventors: Haibo Ding, Yifan He, Lin Zhao, Kui Xu, Zhe Feng