To solve the optimal power flow(OPF)problem,reinforcement learning(RL)emerges as a promising new approach.However,the RL-OPF literature is strongly divided regarding the exact formulation of the OPF problem as an RL e...To solve the optimal power flow(OPF)problem,reinforcement learning(RL)emerges as a promising new approach.However,the RL-OPF literature is strongly divided regarding the exact formulation of the OPF problem as an RL environment.In this work,we collect and implement diverse environment design decisions from the literature regarding training data,observation space,episode definition,and reward function choice.In an experimental analysis,we show the significant impact of these environment design options on RL-OPF training performance.Further,we derive some first recommendations regarding the choice of these design decisions.The created environment framework is fully open-source and can serve as a benchmark for future research in the RL-OPF field.展开更多
A distributed active and reactive power control(DARPC)strategy based on the alternating direction method of multipliers(ADMM)is proposed for regional AC transmission system(TS)with wind farms(WFs).The proposed DARPC s...A distributed active and reactive power control(DARPC)strategy based on the alternating direction method of multipliers(ADMM)is proposed for regional AC transmission system(TS)with wind farms(WFs).The proposed DARPC strategy optimizes the power distribution among the WFs to minimize the power losses of the AC TS while tracking the active power reference from the transmission system operator(TSO),and minimizes the voltage deviation of the buses inside the WF from the rated voltage as well as the power losses of the WF collection system.The optimal power flow(OPF)of the TS is relaxed by using the semidefinite programming(SDP)relaxation while the branch flow model is used to model the WF collection system.In the DARPC strategy,the large-scale strongly-coupled optimization problem is decomposed by using the ADMM,which is solved in the regional TS controller and WF controllers in parallel without loss of the global optimality.The boundary information is exchanged between the regional TS controller and WF controllers.Compared with the conventional OPF method of the TS with WFs,the optimality and accuracy of the system operation can be improved.Moreover,the proposed strategy efficiently reduces the computation burden of the TS controller and eliminates the need of a central controller.The protection of the information privacy can be enhanced.A modified IEEE 9-bus system with two WFs consisting of 64 wind turbines(WTs)is used to validate the proposed DARPC strategy.展开更多
以统一潮流控制器(unified power flow controller,UPFC)为代表的灵活交流输电技术(flexible AC transmission system,FACTS)可实现传输功率的合理分布、优化系统资源,提高系统的稳定性和可靠性。该文基于内点优化方法,提出计及UPFC的...以统一潮流控制器(unified power flow controller,UPFC)为代表的灵活交流输电技术(flexible AC transmission system,FACTS)可实现传输功率的合理分布、优化系统资源,提高系统的稳定性和可靠性。该文基于内点优化方法,提出计及UPFC的无功优化模型,以系统有功网损最小为目标函数,采用UPFC电压源模型,将其作用等效为一系列电压和功率的约束,直接放到内点法的约束中,在不同的负荷运行方式下进行优化分析。在IEEE-30节点系统测试中发现,引入UPFC后系数矩阵的维数会有所增加,但不会影响其收敛性。算例就系统网损和电压指标对装设UPFC前后进行比较,并给出最优控制方案下UPFC的参数值。结果表明该方法是可行的、有效的,取得很好的效果。展开更多
文摘To solve the optimal power flow(OPF)problem,reinforcement learning(RL)emerges as a promising new approach.However,the RL-OPF literature is strongly divided regarding the exact formulation of the OPF problem as an RL environment.In this work,we collect and implement diverse environment design decisions from the literature regarding training data,observation space,episode definition,and reward function choice.In an experimental analysis,we show the significant impact of these environment design options on RL-OPF training performance.Further,we derive some first recommendations regarding the choice of these design decisions.The created environment framework is fully open-source and can serve as a benchmark for future research in the RL-OPF field.
基金supported in part by Technical University of Denmark(DTU)in part by China Scholarship Council(No.201806130202)。
文摘A distributed active and reactive power control(DARPC)strategy based on the alternating direction method of multipliers(ADMM)is proposed for regional AC transmission system(TS)with wind farms(WFs).The proposed DARPC strategy optimizes the power distribution among the WFs to minimize the power losses of the AC TS while tracking the active power reference from the transmission system operator(TSO),and minimizes the voltage deviation of the buses inside the WF from the rated voltage as well as the power losses of the WF collection system.The optimal power flow(OPF)of the TS is relaxed by using the semidefinite programming(SDP)relaxation while the branch flow model is used to model the WF collection system.In the DARPC strategy,the large-scale strongly-coupled optimization problem is decomposed by using the ADMM,which is solved in the regional TS controller and WF controllers in parallel without loss of the global optimality.The boundary information is exchanged between the regional TS controller and WF controllers.Compared with the conventional OPF method of the TS with WFs,the optimality and accuracy of the system operation can be improved.Moreover,the proposed strategy efficiently reduces the computation burden of the TS controller and eliminates the need of a central controller.The protection of the information privacy can be enhanced.A modified IEEE 9-bus system with two WFs consisting of 64 wind turbines(WTs)is used to validate the proposed DARPC strategy.
文摘以统一潮流控制器(unified power flow controller,UPFC)为代表的灵活交流输电技术(flexible AC transmission system,FACTS)可实现传输功率的合理分布、优化系统资源,提高系统的稳定性和可靠性。该文基于内点优化方法,提出计及UPFC的无功优化模型,以系统有功网损最小为目标函数,采用UPFC电压源模型,将其作用等效为一系列电压和功率的约束,直接放到内点法的约束中,在不同的负荷运行方式下进行优化分析。在IEEE-30节点系统测试中发现,引入UPFC后系数矩阵的维数会有所增加,但不会影响其收敛性。算例就系统网损和电压指标对装设UPFC前后进行比较,并给出最优控制方案下UPFC的参数值。结果表明该方法是可行的、有效的,取得很好的效果。