Patents by Inventor Shuncheng He

Shuncheng He has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Multi-agent coordination method and apparatus

Patent number: 11948079

Abstract: The present disclosure discloses a multi-agent coordination method. The method includes: performing multiple data collections on N agents to collect E sets of data, where N and E are integers greater than 1; and optimizing neural networks of the N agents using reinforcement learning based on the E sets of data. Each data collection includes: randomly selecting a first coordination pattern from multiple predetermined coordination patterns; obtaining N observations after the N agents act on an environment in the first coordination pattern; determining a first probability and a second probability that a current coordination pattern is the first coordination pattern based on the N observations; and determining a pseudo reward based on the first probability and the second probability. The E sets of data include: a first coordination pattern label indicating the first coordination pattern, the N observations, and the pseudo reward.

Type: Grant

Filed: October 19, 2020

Date of Patent: April 2, 2024

Inventors: Xiangyang Ji, Shuncheng He
Unsupervised reinforcement learning method and apparatus based on Wasserstein distance

Patent number: 11823062

Abstract: The present disclosure discloses an unsupervised reinforcement learning method and apparatus based on Wasserstein distance. The method includes: obtaining a state distribution in a trajectory obtained with guidance of a current policy of an agent; calculating a Wasserstein distance between the state distribution and a state distribution in a trajectory obtained with another historical policy, and calculating a pseudo reward of the agent based on the Wasserstein distance, replacing a reward fed back from an environment in a target reinforcement learning framework with the pseudo reward, and guiding the current policy of the agent to keep a large distance from the other historical policy. The method uses Wasserstein distance to encourage an algorithm in an unsupervised reinforcement learning framework to obtain diverse policies and skills through training.

Type: Grant

Filed: March 21, 2023

Date of Patent: November 21, 2023

Assignee: TSINGHUA UNIVERSITY

Inventors: Xiangyang Ji, Shuncheng He, Yuhang Jiang
MULTI-AGENT COORDINATION METHOD AND APPARATUS

Publication number: 20220121920

Abstract: The present disclosure discloses a multi-agent coordination method. The method includes: performing multiple data collections on N agents to collect E sets of data, where N and E are integers greater than 1; and optimizing neural networks of the N agents using reinforcement learning based on the E sets of data. Each data collection includes: randomly selecting a first coordination pattern from multiple predetermined coordination patterns; obtaining N observations after the N agents act on an environment in the first coordination pattern; determining a first probability and a second probability that a current coordination pattern is the first coordination pattern based on the N observations; and determining a pseudo reward based on the first probability and the second probability. The E sets of data include: a first coordination pattern label indicating the first coordination pattern, the N observations, and the pseudo reward.

Type: Application

Filed: October 19, 2020

Publication date: April 21, 2022

Inventors: Xiangyang JI, Shuncheng He

Multi-agent coordination method and apparatus

Unsupervised reinforcement learning method and apparatus based on Wasserstein distance

MULTI-AGENT COORDINATION METHOD AND APPARATUS