Patents by Inventor Haibo Ding

Haibo Ding has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240112014
    Abstract: The systems and methods described herein are directed to a Co-Augmentation framework that may learn new rules and labels simultaneously from unlabeled data with a small set of seed rules and a few manually labeled training data. The augmented rules and labels are further used to train supervised neural network models. Specifically, the systems and methods described herein include two major components: a rule augmenter, and a label augmenter. The rule augmenter is directed to learning new rules, which can be used to obtain weak labels from unlabeled data. The label augmenter is directed to learning new labels from unlabeled data. The Co-Augmentation framework is an iterative learning process which generates and refines a high precision set. At each iteration, both the rule augmenter and label augmenter will contribute new and more accurate labels to the high precision set, which is in turn used to train both the rule augmenter and label augmenter.
    Type: Application
    Filed: September 26, 2022
    Publication date: April 4, 2024
    Inventors: Yuheng Wang, Haibo Ding, Bingqing Wang, Zhe Feng
  • Patent number: 11907662
    Abstract: An automatic terminology linking system includes a candidate generator configured to identify candidate nodes for each terminology that is to be linked to a node of the knowledge base. A pseudo-candidate generator is configured to identify pseudo-candidate nodes for candidate-less terminologies. A candidate scorer is configured to respectively score the candidate nodes and the pseudo-candidate nodes by collective inference using occurrence statistics and co-occurrence statistics for these nodes. The pseudo-candidate generator is configured to identify knowledge base nodes that are semantically-related to candidate-less terminology as the pseudo-candidate nodes for the candidate-less terminology.
    Type: Grant
    Filed: December 27, 2018
    Date of Patent: February 20, 2024
    Assignee: Robert Bosch GmbH
    Inventors: Haibo Ding, Yifan He, Lin Zhao, Kui Xu, Zhe Feng
  • Publication number: 20240037339
    Abstract: A computer-implemented method of Named Entity Recognition (NER) includes receiving an input, identifying a plurality of candidate entities corresponding to the input, assigning word embeddings to the input at an embedding layer, capturing sequential context of the word embeddings in an encoding layer to obtain encoded word embeddings, constructing an entity relation graph using global coreference relations and local dependency relations to obtain a coreference graph and a dependency graph, fusing the encoded word embeddings, coreference graph, and dependency graph, via a graphical neural network (GNN), to obtain updated word embeddings, and decoding the updated word embeddings via a decoding layer to obtain enriched entity predictions.
    Type: Application
    Filed: July 29, 2022
    Publication date: February 1, 2024
    Inventors: Pei Chen, Haibo Ding, Jun Araki, Ruihong Huang
  • Patent number: 11775763
    Abstract: Systems and methods for weakly-supervised training a machine-learning model to perform named-entity recognition. All possible entity candidates and all possible rule candidates are automatically identified in an input data set of unlabeled text. An initial training of the machine-learning model is performed using labels assigned to entity candidates by a set of seeding rules as a first set of training data. The trained machine-learning model is then applied to the unlabeled text and a subset of rules from the rule candidates is identified that produces labels that most accurately match the labels assigned by the trained machine-learning model. The machine-learning model is then retrained using the labels assigned by the identified subset of rules as the second set of training data. This process is iteratively repeated to further refine and improve the performance of the machine-learning model for named-entity recognition.
    Type: Grant
    Filed: February 25, 2021
    Date of Patent: October 3, 2023
    Assignee: Robert Bosch GmbH
    Inventors: Jiacheng Li, Haibo Ding, Zhe Feng
  • Patent number: 11720748
    Abstract: A system for automatically labeling data using conceptual descriptions. In one example, the system includes an electronic processor configured to generate unlabeled training data examples from one or more natural language documents and, for each of a plurality of categories, determine one or more concepts associated with a conceptual description of the category and generate a weak annotator for each of the one or more concepts. The electronic processor is also configured to apply each weak annotator to each training data example and, when a training data example satisfies a weak annotator, output a category associated with the weak annotator. For each training data example, the electronic processor determines a probabilistic distribution of the plurality of categories. For each training data example, the electronic processor labels the training data example with a category having the highest value in the probabilistic distribution determined for the training data example.
    Type: Grant
    Filed: April 27, 2020
    Date of Patent: August 8, 2023
    Assignee: Robert Bosch GmbH
    Inventors: Haibo Ding, Zhe Feng
  • Patent number: 11669740
    Abstract: Systems and methods for training a machine-learning model for named-entity recognition. A rule graph is constructed including a plurality of nodes each corresponding to a different labeling rule of a set of labeling rules (including a set of seeding rules of known labeling accuracy and a plurality of candidate rules of unknown labeling accuracy). The nodes are coupled to other nodes based on which rules exhibit the highest sematic similarity. A labeling accuracy metric is estimated for each candidate rule by propagating a labeling confidence metric through the rule graph from the seeding rules to each candidate rule. A subset of labeling rules is then identified by ranking the rules by their labeling confidence metric. The identified subset of labeling rules is applied to unlabeled data to generate a set of weakly labeled named entities and the machine-learning model is trained based on the set of weakly labeled named entities.
    Type: Grant
    Filed: February 25, 2021
    Date of Patent: June 6, 2023
    Assignee: Robert Bosch GmbH
    Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
  • Publication number: 20230137483
    Abstract: An oxygen reduction device in an ion source region of inductively coupled plasma is provided. The oxygen reduction device includes a torch and an inflation sleeve. An upper end of the inflation sleeve is sealed and sleeved outside the torch, and a lower end of the inflation sleeve and the torch are arranged at an interval to form an inflation gap. An inflation hole communicating with the inflation gap is formed in an outer side wall of the inflation sleeve. An outer side face of the lower end of the inflation sleeve is protruded outwards to form an annular gas guiding protrusion. The annular gas guiding protrusion is configured for being arranged opposite to a sampling cone base arranged below the torch, and a gas outlet gap is formed between the annular gas guiding protrusion and the sampling cone base.
    Type: Application
    Filed: October 31, 2022
    Publication date: May 4, 2023
    Inventors: Yongsheng LIU, Xin JIANG, Xiaofeng XIA, Haibo DING, Wengui LIU, Jie LIN, Chengyi ZHANG, Shuimiao LU, Lifei CHEN
  • Publication number: 20220269939
    Abstract: Systems and methods for training a machine-learning model for named-entity recognition. A rule graph is constructed including a plurality of nodes each corresponding to a different labeling rule of a set of labeling rules (including a set of seeding rules of known labeling accuracy and a plurality of candidate rules of unknown labeling accuracy). The nodes are coupled to other nodes based on which rules exhibit the highest sematic similarity. A labeling accuracy metric is estimated for each candidate rule by propagating a labeling confidence metric through the rule graph from the seeding rules to each candidate rule. A subset of labeling rules is then identified by ranking the rules by their labeling confidence metric. The identified subset of labeling rules is applied to unlabeled data to generate a set of weakly labeled named entities and the machine-learning model is trained based on the set of weakly labeled named entities.
    Type: Application
    Filed: February 25, 2021
    Publication date: August 25, 2022
    Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
  • Publication number: 20220269862
    Abstract: Systems and methods for weakly-supervised training a machine-learning model to perform named-entity recognition. All possible entity candidates and all possible rule candidates are automatically identified in an input data set of unlabeled text. An initial training of the machine-learning model is performed using labels assigned to entity candidates by a set of seeding rules as a first set of training data. The trained machine-learning model is then applied to the unlabeled text and a subset of rules from the rule candidates is identified that produces labels that most accurately match the labels assigned by the trained machine-learning model. The machine-learning model is then retrained using the labels assigned by the identified subset of rules as the second set of training data. This process is iteratively repeated to further refine and improve the performance of the machine-learning model for named-entity recognition.
    Type: Application
    Filed: February 25, 2021
    Publication date: August 25, 2022
    Inventors: Jiacheng Li, Haibo Ding, Zhe Feng
  • Patent number: 11423228
    Abstract: Methods and systems for performing semantic entity recognition. The method includes accessing a document stored in a memory and selecting from a general knowledge data repository, target domain information based on a specified target domain. The method also includes generating a plurality of weak annotators for the document based upon the selected target domain information and expert knowledge from a domain-specific expert knowledge data repository and applying the plurality of weak annotators to the document to generate a plurality of weak labels. The method further includes selecting at least one weak label from the plurality of weak labels as training data and training a semantic entity prediction model using the training data.
    Type: Grant
    Filed: April 9, 2020
    Date of Patent: August 23, 2022
    Assignee: Robert Bosch GmbH
    Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
  • Publication number: 20210334596
    Abstract: A system for automatically labeling data using conceptual descriptions. In one example, the system includes an electronic processor configured to generate unlabeled training data examples from one or more natural language documents and, for each of a plurality of categories, determine one or more concepts associated with a conceptual description of the category and generate a weak annotator for each of the one or more concepts. The electronic processor is also configured to apply each weak annotator to each training data example and, when a training data example satisfies a weak annotator, output a category associated with the weak annotator. For each training data example, the electronic processor determines a probabilistic distribution of the plurality of categories. For each training data example, the electronic processor labels the training data example with a category having the highest value in the probabilistic distribution determined for the training data example.
    Type: Application
    Filed: April 27, 2020
    Publication date: October 28, 2021
    Inventors: Haibo Ding, Zhe Feng
  • Publication number: 20210319183
    Abstract: Methods and systems for performing semantic entity recognition. The method includes accessing a document stored in a memory and selecting from a general knowledge data repository, target domain information based on a specified target domain. The method also includes generating a plurality of weak annotators for the document based upon the selected target domain information and expert knowledge from a domain-specific expert knowledge data repository and applying the plurality of weak annotators to the document to generate a plurality of weak labels. The method further includes selecting at least one weak label from the plurality of weak labels as training data and training a semantic entity prediction model using the training data.
    Type: Application
    Filed: April 9, 2020
    Publication date: October 14, 2021
    Inventors: Xinyan Zhao, Haibo Ding, Zhe Feng
  • Publication number: 20200342178
    Abstract: An automatic terminology linking system includes a candidate generator configured to identify candidate nodes for each terminology that is to be linked to a node of the knowledge base. A pseudo-candidate generator is configured to identify pseudo-candidate nodes for candidate-less terminologies. A candidate scorer is configured to respectively score the candidate nodes and the pseudo-candidate nodes by collective inference using occurrence statistics and co-occurrence statistics for these nodes. The pseudo-candidate generator is configured to identify knowledge base nodes that are semantically-related to candidate-less terminology as the pseudo-candidate nodes for the candidate-less terminology.
    Type: Application
    Filed: December 27, 2018
    Publication date: October 29, 2020
    Inventors: Haibo Ding, Yifan He, Lin Zhao, Kui Xu, Zhe Feng
  • Patent number: D1016176
    Type: Grant
    Filed: February 16, 2022
    Date of Patent: February 27, 2024
    Assignee: ZHEJIANG TAOTAO VEHICLES CO., LTD.
    Inventors: Haibo Zhu, Jianbing Wu, Hongfeng Tian, Dupu Ding, Guofu Zou