电器与能效管理技术 ›› 2026, Vol. 0 ›› Issue (4): 8-16.doi: 10.16628/j.cnki.2095-8188.2026.04.002

• 研究与分析 • 上一篇    下一篇

基于深度强化学习与条件扩散模型的短期负荷预测场景生成技术

储琳琳1, 张宇俊1, 宗明1, 朱夏1, 陈妍君1, 杨智翔2, 贾雅君2   

  1. 1 国网上海市电力公司市南供电公司, 上海 200030
    2 上海君世电气科技有限公司, 上海 200240
  • 收稿日期:2025-11-08 出版日期:2026-04-30 发布日期:2026-05-18
  • 作者简介:储琳琳(1978—),女,正高级工程师,主要从事电力系统规划设计工作。|张宇俊(1975—),女,高级经济师,主要从事电力系统规划设计工作。|宗 明(1969—),男,正高级工程师,主要从事电力系统规划设计工作。
  • 基金资助:
    国家电网有限公司科技项目(52992424001A)

Technology of Short-Term Load Forecasting Scenario Generation Based on Deep Reinforcement Learning and Conditional Diffusion Models

CHU Linlin1, ZHANG Yujun1, ZONG Ming1, ZHU Xia1, CHEN Yanjun1, YANG Zhixiang2, JIA Yajun2   

  1. 1 State Grid Shanghai Municipal Electric Power Company, Shinan Power Supply Company, Shanghai 200030, China
    2 Shanghai Junshi Electric Technology Co., Ltd., Shanghai 200240, China
  • Received:2025-11-08 Online:2026-04-30 Published:2026-05-18

摘要:

随着以新能源为主体的新型电力系统建设的推进,电力负荷不确定性显著增强,对电网安全稳定与经济运行构成严峻挑战。在此背景下,概率预测、区间预测等不确定性负荷预测方法受到广泛关注。场景生成技术通过建模并模拟负荷、气象与新能源出力等多源不确定性因素,为预测模型提供关键输入。提出一种融合深度强化学习(DRL)与条件扩散模型(CDM)的短期负荷预测场景生成方法。针对负荷、气象等多元时序数据的复杂耦合与动态特性,设计了结合双向长短期记忆(Bi-LSTM)网络、自注意力机制及季节性分解层的条件扩散模型,以精准学习数据内在条件概率分布,生成高保真度未来场景。同时,为解决超参数整定难题,构建了基于DRL的优化框架,将超参数寻优建模作为马尔可夫决策过程,通过智能体与环境交互实现参数自适应配置。基于我国某地区实际负荷与气象数据的实验表明,所提方法在各项评价指标上均优于基准模型。

关键词: 短期负荷预测, 场景生成, 条件扩散模型, 深度强化学习, 不确定性量化

Abstract:

With the construction of a new power system dominated by new energy sources, the uncertainty of power load has increased significantly, posing severe challenges to the safe, stable and economic operation of power grids. Against this background, uncertain load forecasting methods such as probabilistic forecasting and interval forecasting have attracted extensive attention. Scenario technology provides key inputs for forecasting models by modeling and simulating multi-source uncertainties including load, meteorology and new energy output. This paper proposes a short-term load forecasting scenario generation method that integrates deep reinforcement learning(DRL) and conditional diffusion model(CD). Aiming at the complex coupling and dynamic characteristics of multivariate time-series data such as load and meteorology, a conditional diffusion model combined with bidirectional long short-term memory(Bi-LST) network, self-attention mechanism and seasonal decomposition layer is designed to accurately leamn the intrinsic conditional probability distribution of data and generate high-fidelity fiture scenarios. Meanwhile, to address the difficulty of hyperparameter tuning, an optimization framework based on DRL, is constructed, which formulates hyperparameter optimization as a lMarkov decision process and realizes adaptive parameter configuration through the interaction between agents and the environment. Experiments based on actual load and meteorological lata from a regton in China show that the proposed method outperforms benchmark models in various evaluation indicators.

Key words: short-term load forecasting, scenario generation, conditional diffusion model, deep reinforcement learning, uncertainty quantification

中图分类号: