Patents by Inventor Dingcheng Li

Dingcheng Li has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Meta cooperative training paradigms

Patent number: 12293284

Abstract: Generative adversarial models have several benefits; however, due to mode collapse, these generators face a quality-diversity trade-off (i.e., the generator models sacrifice generation diversity for increased generation quality). Presented herein are embodiments that improve the performance of adversarial content generation by decelerating mode collapse. In one or more embodiments, a cooperative training paradigm is employed where a second model is cooperatively trained with the generator and helps efficiently shape the data distribution of the generator against mode collapse. Moreover, embodiments of a meta learning mechanism may be used, where the cooperative update to the generator serves as a high-level meta task and which helps ensures the generator parameters after the adversarial update stay resistant against mode collapse. In experiments, tested employments demonstrated efficient slowdown of mode collapse for the adversarial text generators.

Type: Grant

Filed: December 29, 2020

Date of Patent: May 6, 2025

Assignee: Baidu USA, LLC

Inventors: Dingcheng Li, Haiyan Yin, Xu Li, Ping Li
Disentangle syntax and semantics in sentence representation with decomposable variational autoencoder

Patent number: 12039270

Abstract: Described herein are embodiments of a framework named decomposable variational autoencoder (DecVAE) to disentangle syntax and semantics by using total correlation penalties of Kullback-Leibler (KL) divergences. KL divergence term of the original VAE is decomposed such that the hidden variables generated may be separated in a clear-cut and interpretable way. Embodiments of DecVAE models are evaluated on various semantic similarity and syntactic similarity datasets. Experimental results show that embodiments of DecVAE models achieve state-of-the-art (SOTA) performance in disentanglement between syntactic and semantic representations.

Type: Grant

Filed: August 5, 2020

Date of Patent: July 16, 2024

Assignee: Baldu USA LLC

Inventors: Dingcheng Li, Shaogang Ren, Ping Li
Video recommendation with multi-gate mixture of experts soft actor critic

Patent number: 11922287

Abstract: Described herein are embodiments of a reinforcement learning based large-scale multi-objective ranking system. Embodiments of the system may be used for optimizing short-video recommendation on a video sharing platform. Multiple competing ranking objective and implicit selection bias in user feedback are the main challenges in real-world platform. In order to address those challenges, multi-gate mixture of experts (MMoE) and soft actor critic (SAC) are integrated together into a MMoE_SAC system. Experiment results demonstrate that embodiments of the MMoE_SAC system may greatly reduce a loss function compared to systems only based on single strategies.

Type: Grant

Filed: July 15, 2020

Date of Patent: March 5, 2024

Assignees: Baidu USA, LLC, Baidu.com Times Technology (Beijing) Co., Ltd.

Inventors: Dingcheng Li, Xu Li, Jun Wang, Ping Li
Learning latent structural relations with segmentation variational autoencoders

Patent number: 11816533

Abstract: Learning disentangled representations is an important topic in machine learning for a wide range of applications. Disentangled latent variables represent interpretable semantic information and reflect separate factors of variation in data. Although generative models may learn latent representations and generate data samples as well, existing models may ignore the structural information among latent representations. Described in the present disclosure are embodiments to learn disentangled latent structural representations from data using decomposable variational auto-encoders, which simultaneously learn component representations and encode component relationships. Embodiments of a novel structural prior for latent representations are disclosed to capture interactions among different data components. Embodiments are applied to data segmentation and latent relation discovery among different data components. Experiments on several datasets demonstrate the utility of the present model embodiments.

Type: Grant

Filed: November 18, 2020

Date of Patent: November 14, 2023

Assignee: Baidu USA LLC

Inventors: Shaogang Ren, Hongliang Fei, Dingcheng Li, Ping Li
Estimating the implicit likelihoods of generative adversarial networks

Patent number: 11783198

Abstract: The thriving of deep models and generative models provides approaches to model high dimensional distributions. Generative adversarial networks (GANs) can approximate data distributions and generate data samples from the learned data manifolds as well. Presented herein are embodiments to estimate the implicit likelihoods of GAN models. In one or more embodiments, a stable inverse function of the generator is learned with the help of a variance network of the generator. The local variance of the sample distribution may be approximated by the normalized distance in the latent space. Simulation studies and likelihood testing on data sets validate embodiments, which outperformed several baseline methods in these tasks. An embodiment was also applied to anomaly detection. Experiments show that the embodiments herein can achieve state-of-the-art anomaly detection performance.

Type: Grant

Filed: April 3, 2020

Date of Patent: October 10, 2023

Assignee: Baidu USA LLC

Inventors: Dingcheng Li, Shaogang Ren, Zhixin Zhou, Ping Li
Systems and methods for large scale semantic indexing with deep level-wise extreme multi-label learning

Patent number: 11748613

Abstract: Described herein are embodiments for a deep level-wise extreme multi-label learning and classification (XMLC) framework to facilitate the semantic indexing of literatures. In one or more embodiments, the Deep Level-wise XMLC framework comprises two sequential modules, a deep level-wise multi-label learning module and a hierarchical pointer generation module. In one or more embodiments, the first module decomposes terms of domain ontology into multiple levels and builds a special convolutional neural network for each level with category-dependent dynamic max-pooling and macro F-measure based weights tuning. In one or more embodiments, the second module merges the level-wise outputs into a final summarized semantic indexing. The effectiveness of Deep Level-wise XMLC framework embodiments is demonstrated by comparing it with several state-of-the-art methods of automatic labeling on various datasets.

Type: Grant

Filed: May 10, 2019

Date of Patent: September 5, 2023

Assignee: Baidu USA LLC

Inventors: Dingcheng Li, Jingyuan Zhang, Ping Li
Total correlation variational autoencoder strengthened with attentions for segmenting syntax and semantics

Patent number: 11748567

Abstract: Described herein are embodiments of a framework named as total correlation variational autoencoder (TC_VAE) to disentangle syntax and semantics by making use of total correlation penalties of KL divergences. One or more Kullback-Leibler (KL) divergence terms in a loss for a variational autoencoder are discomposed so that generated hidden variables may be separated. Embodiments of the TC_VAE framework were examined on semantic similarity tasks and syntactic similarity tasks. Experimental results show that better disentanglement between syntactic and semantic representations have been achieved compared with state-of-the-art (SOTA) results on the same data sets in similar settings.

Type: Grant

Filed: July 10, 2020

Date of Patent: September 5, 2023

Assignee: Baidu USA LLC

Inventors: Dingcheng Li, Shaogang Ren, Ping Li
Knowledge-graph-embedding-based question answering

Patent number: 11727243

Abstract: Described herein are embodiments for question answering over knowledge graph using a Knowledge Embedding based Question Answering (KEQA) framework. Instead of inferring an input questions' head entity and predicate directly, KEQA embodiments target jointly recovering the question's head entity, predicate, and tail entity representations in the KG embedding spaces. In embodiments, a joint distance metric incorporating various loss terms is used to measure distances of a predicated fact to all candidate facts. In embodiments, the fact with the minimum distance is returned as the answer. Embodiments of a joint training strategy are also disclosed for better performance. Performance evaluation on various datasets demonstrates the effectiveness of the disclosed systems and methods using the KEQA framework.

Type: Grant

Filed: January 30, 2019

Date of Patent: August 15, 2023

Assignee: Baidu USA LLC

Inventors: Jingyuan Zhang, Dingcheng Li, Ping Li, Xiao Huang
Integration of knowledge graph embedding into topic modeling with hierarchical Dirichlet process

Patent number: 11636355

Abstract: Leveraging domain knowledge is an effective strategy for enhancing the quality of inferred low-dimensional representations of documents by topic models. Presented herein are embodiments of a Bayesian nonparametric model that employ knowledge graph (KG) embedding in the context of topic modeling for extracting more coherent topics; embodiments of the model may be referred to as topic modeling with knowledge graph embedding (TMKGE). TMKGE embodiments are hierarchical Dirichlet process (HDP)-based models that flexibly borrow information from a KG to improve the interpretability of topics. Also, embodiments of a new, efficient online variational inference method based on a stick-breaking construction of HDP were developed for TMKGE models, making TMKGE suitable for large document corpora and KGs. Experiments on datasets illustrate the superior performance of TMKGE in terms of topic coherence and document classification accuracy, compared to state-of-the-art topic modeling methods.

Type: Grant

Filed: May 30, 2019

Date of Patent: April 25, 2023

Assignee: Baidu USA LLC

Inventors: Dingcheng Li, Jingyuan Zhang, Ping Li, Siamak Zamani Dadaneh
Systems and methods for end-to-end deep reinforcement learning based coreference resolution

Patent number: 11630953

Abstract: Described herein are embodiments for end-to-end reinforcement learning based coreference resolution models to directly optimize coreference evaluation metrics. Embodiments of a reinforced policy gradient model are disclosed to incorporate reward associated with a sequence of coreference linking actions. Furthermore, maximum entropy regularization may be used for adequate exploration to prevent a model embodiment from prematurely converging to a bad local optimum. Experiments on datasets compared with state-of-the-art methods verified the effectiveness of embodiments.

Type: Grant

Filed: July 25, 2019

Date of Patent: April 18, 2023

Assignees: Baidu USA LLC, Baidu.com Times Technology (Beijing) Co., Ltd.

Inventors: Hongliang Fei, Xu Li, Dingcheng Li, Ping Li
Representation learning for input classification via topic sparse autoencoder and entity embedding

Patent number: 11615311

Abstract: Described herein are embodiments of a unified neural network framework to integrate Topic modeling, Word embedding and Entity Embedding (TWEE) for representation learning of inputs. In one or more embodiments, a novel topic sparse autoencoder is introduced to incorporate discriminative topics into the representation learning of the input. Topic distributions of inputs are generated from a global viewpoint and are utilized to enable autoencoder to learn topical representations. A sparsity constraint may be added to ensure that the most discriminative representations are related to topics. In addition, both words and entity related information may be embedded into the network to help learn a more comprehensive input representation. Extensive empirical experiments show that embodiments of the TWEE framework outperform the state-of-the-art methods on different datasets.

Type: Grant

Filed: November 21, 2019

Date of Patent: March 28, 2023

Assignee: Baidu USA LLC

Inventors: Dingcheng Li, Jingyuan Zhang, Ping Li
Automated and efficient personal transportation vehicle sharing

Patent number: 11593735

Abstract: A method, computer program product, and a system where a processor(s) determines generates a cognitive user profile representing patterns of usage of each of a plurality of users of the transportation resource sharing system, a cognitive resource profile for each resource of the plurality of resources, a cognitive route profile for each route traversed by at least one resource of the plurality of resources, and a cognitive station profile for each station of the plurality of stations. The processor(s) assigns one or more specific resources of the plurality of resources to one or more specific users of the plurality of users and the one or more specific resources of the plurality of resources to one or more specific stations of the plurality of stations.

Type: Grant

Filed: May 21, 2019

Date of Patent: February 28, 2023

Assignee: International Business Machines Corporation

Inventors: Su Liu, Yu Gu, Dingcheng Li, Kai Liu
Systems and methods for mutual learning for topic discovery and word embedding

Patent number: 11568266

Abstract: Described herein are embodiments for systems and methods for mutual machine learning with global topic discovery and local word embedding. Both topic modeling and word embedding map documents onto a low-dimensional space, with the former clustering words into a global topic space and the latter mapping word into a local continuous embedding space. Embodiments of Topic Modeling and Sparse Autoencoder (TMSA) framework unify these two complementary patterns by constructing a mutual learning mechanism between word co-occurrence based topic modeling and autoencoder. In embodiments, word topics generated with topic modeling are passed into auto-encoder to impose topic sparsity for the autoencoder to learn topic-relevant word representations. In return, word embedding learned by autoencoder is sent back to topic modeling to improve the quality of topic generations. Performance evaluation on various datasets demonstrates the effectiveness of the disclosed TMSA framework in discovering topics and embedding words.

Type: Grant

Filed: March 15, 2019

Date of Patent: January 31, 2023

Assignee: Baidu USA LLC

Inventors: Dingcheng Li, Jingyuan Zhang, Ping Li
LEARNING LATENT STRUCTURAL RELATIONS WITH SEGMENTATION VARIATIONAL AUTOENCODERS

Publication number: 20220156612

Abstract: Learning disentangled representations is an important topic in machine learning for a wide range of applications. Disentangled latent variables represent interpretable semantic information and reflect separate factors of variation in data. Although generative models may learn latent representations and generate data samples as well, existing models may ignore the structural information among latent representations. Described in the present disclosure are embodiments to learn disentangled latent structural representations from data using decomposable variational auto-encoders, which simultaneously learn component representation and encodes component relationships. Embodiments of a novel structural prior for latent representations are disclosed to capture interactions among different data components. Embodiments are applied to data segmentation and latent relation discovery among different data components. Experiments on several datasets demonstrate the utility of the present model embodiments.

Type: Application

Filed: November 18, 2020

Publication date: May 19, 2022

Applicant: Baidu USA LLC

Inventors: Shaogang REN, Hongliang FEI, Dingcheng LI, Ping LI
Transforming data for a target schema

Patent number: 11249960

Abstract: Embodiments generally relate transforming data for a target schema. In some embodiments, a method includes receiving input data, where the input data includes a plurality of segments, and where the segments include a plurality of source fields containing target data. The method further includes characterizing the input data based at least in part on a plurality of predetermined metrics, where the predetermined metrics determine a structure of the input data. The method further includes mapping the target data in the source fields of the segments to a plurality of target fields of a target schema based at least in part on the characterizing. The method further includes populating the target fields of the target schema with the target data from the source fields based at least in part on the mapping.

Type: Grant

Filed: June 11, 2018

Date of Patent: February 15, 2022

Assignee: International Business Machines Corporation

Inventors: Daniel Dean, Checed A. Rodgers, Dingcheng Li, Pei Ni Liu, Xiao Xi Liu, Hui Lei, Yu Gu, Jing Min Xu, Yaoping Ruan
DISENTANGLE SYNTAX AND SEMANTICS IN SENTENCE REPRESENTATION WITH DECOMPOSABLE VARIATIONAL AUTOENCODER

Publication number: 20220043975

Abstract: Described herein are embodiments of a framework named decomposable variational autoencoder (DecVAE) to disentangle syntax and semantics by using total correlation penalties of Kullback-Leibler (KL) divergences. KL divergence term of the original VAE are decomposed such that the hidden variables generated may be separated in a clear-cut and interpretable way. Embodiments of DecVAE models are evaluated on various semantic similarity and syntactic similarity datasets. Experimental results show that embodiments of DecVAE models achieve state-of-the-art (SOTA) performance in disentanglement between syntactic and semantic representations.

Type: Application

Filed: August 5, 2020

Publication date: February 10, 2022

Applicant: Baidu USA LLC

Inventors: Dingcheng LI, Shaogang REN, Ping LI
VIDEO RECOMMENDATION WITH MULTI-GATE MIXTURE OF EXPERTS SOFT ACTOR CRITIC

Publication number: 20220019878

Abstract: Described herein are embodiments of a reinforcement learning based large-scale multi-objective ranking system. Embodiments of the system may be used for optimizing short-video recommendation on a video sharing platform. Multiple competing ranking objective and implicit selection bias in user feedback are the main challenges in real-world platform. In order to address those challenges, multi-gate mixture of experts (MMoE) and soft actor critic (SAC) are integrated together into a MMoE_SAC system. Experiment results demonstrate that embodiments of the MMoE_SAC system may greatly reduce a loss function compared to systems only based on single strategies.

Type: Application

Filed: July 15, 2020

Publication date: January 20, 2022

Applicants: Baidu USA LLC, Baidu.com Times Technology (Beijing) Co., Ltd.

Inventors: Dingcheng LI, Xu LI, Jun WANG, Ping LI
TOTAL CORRELATION VARIATIONAL AUTOENCODER STRENGTHENED WITH ATTENTIONS FOR SEGMENTING SYNTAX AND SEMANTICS

Publication number: 20220012425

Abstract: Described herein are embodiments of a framework named as total correlation variational autoencoder (TC_VAE) to disentangle syntax and semantics by making use of total correlation penalties of KL divergences. One or more Kullback-Leibler (KL) divergence terms in a loss for a variational autoencoder are discomposed so that generated hidden variables may be separated. Embodiments of the TC_VAE framework were examined on semantic similarity tasks and syntactic similarity tasks. Experimental results show that better disentanglement between syntactic and semantic representations have been achieved compared with state-of-the-art (SOTA) results on the same data sets in similar settings.

Type: Application

Filed: July 10, 2020

Publication date: January 13, 2022

Applicant: Baidu USA LLC

Inventors: Dingcheng LI, Shaogang REN, Ping LI
Entity model establishment

Patent number: 11188819

Abstract: Disclosed aspects relate to entity model establishment using an infinite mixture topic modeling (IMTM) technique. A set of event data which corresponds to a set of events may be detected. Using the IMTM technique, the set of event data which corresponds to the set of events may be analyzed. Based on analyzing the set of event data using the IMTM technique, a set of entity models for the set of events may be determined. Based on the set of entity models for the set of events, a subset of the set of entity models for the set of events may be established.

Type: Grant

Filed: May 10, 2017

Date of Patent: November 30, 2021

Assignee: International Business Machines Corporation

Inventors: Yu Gu, Dingcheng Li, Kai Liu, Su Liu
ESTIMATING THE IMPLICIT LIKELIHOODS OF GENERATIVE ADVERSARIAL NETWORKS

Publication number: 20210319302

Abstract: The thriving of deep models and generative models provides approaches to model high dimensional distributions. Generative adversarial networks (GANs) can approximate data distributions and generate data samples from the learned data manifolds as well. Presented herein are embodiments to estimate the implicit likelihoods of GAN models. In one or more embodiments, a stable inverse function of the generator is learned with the help of a variance network of the generator. The local variance of the sample distribution may be approximated by the normalized distance in the latent space. Simulation studies and likelihood testing on data sets validate embodiments, which outperformed several baseline methods in these tasks. An embodiment was also applied to anomaly detection. Experiments show that the embodiments herein can achieve state-of-the-art anomaly detection performance.

Type: Application

Filed: April 3, 2020

Publication date: October 14, 2021

Applicant: Baidu USA LLC

Inventors: Dingcheng LI, Shaogang REN, Zhixin ZHOU, Ping LI

1 2 next