Patent number: 12149078
Abstract: A method for intelligently adjusting a power flow based on a Q-learning algorithm includes: converting a variable, an action, and a goal in a power grid to a state, an action, and a reward in the algorithm, respectively; selecting an action from an action space, giving an immediate reward based on a result of power flow calculation, and correcting a next state; forwardly observing a next exploration action based on a strategy in the Q-learning algorithm; updating a Q value in a corresponding position in a Q-value table based on the obtained reward; if a final state is not reached, going back to step 2; otherwise, increasing the number of iterations by 1; if the number of iterations does not reach predetermined value K, that is, Episode<K, going back to step 2; otherwise, that is, Episode=K, outputting the Q-value table; and outputting an optimal unit combination.
Type:
Grant
Filed:
October 11, 2020
Date of Patent:
November 19, 2024
Assignees:
STATE GRID ZHEJIANG ELECTRIC POWER CO., LTD., TAIZHOU POWER SUPPLY COMPANY
Inventors:
Jian Yang, Dongbo Zhang, Xinjian Chen, Yilun Zhu, Jie Yu, Daojian Hong, Zhouhong Wang, Chenghuai Hong, Zihuai Zheng, Huiying Gao, Minyan Xia, Bingren Wang, Guode Ying, Yizhi Zhu