CN118798584B

CN118798584B - A method, system, medium and device for robust optimization scheduling of building microgrid

Info

Publication number: CN118798584B
Application number: CN202411266152.6A
Authority: CN
Inventors: 王新立; 张程浩; 贾磊; 尹晓红; 林晨; 耿房; 王雷
Original assignee: Shandong University
Current assignee: Shandong University
Priority date: 2024-09-11
Filing date: 2024-09-11
Publication date: 2025-02-07
Anticipated expiration: 2044-09-11
Also published as: CN118798584A

Abstract

The present invention proposes a method, system, medium and device for robust optimization scheduling of a building microgrid, which belongs to the technical field of building microgrid optimization scheduling, including: generating a number of source load samples based on the day-ahead source load prediction value, representing the possible realization value of the source load on the next day, thereby constructing a source load uncertainty set; establishing a day-ahead scheduling layer, including: dividing the source load uncertainty set into a larger part of the source load uncertainty set, a smaller part of the source load uncertainty set and establishing a three-stage day-ahead robust optimization scheduling model; on the basis of the day-ahead scheduling layer, establishing an intraday real-time optimization scheduling layer based on model reinforcement learning, and outputting an intraday real-time optimal scheduling plan.

Description

Building micro-grid robust optimal scheduling method, system, medium and device

Technical Field

The invention belongs to the technical field of building micro-grid optimal scheduling, and particularly relates to a building micro-grid robust optimal scheduling method and system.

Background

The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.

At present, the building energy structure is relatively single and mainly takes fossil energy, and along with the low-carbon development of the building, the building energy structure tends to be diversified, and renewable energy sources gradually take the dominant role. In the process of continuously and synchronously expanding the renewable energy power generation and the building power consumption scale, a building micro-grid is generated.

To ensure efficient and safe operation of the building microgrid, the key is an efficient scheduling strategy and sufficient adjustable resources. However, when new energy power generation such as photovoltaic power generation and wind power discharge occupies a large proportion in the system, scheduling work of the micro-grid can encounter some difficulties. Firstly, the power demand presents randomness due to the uncertainty of the power consumption habit of a user, and secondly, the yield of new energy power generation such as photovoltaic power generation, wind power discharge and the like is also influenced by natural conditions such as sunlight intensity, wind power size and the like, and the power generation system has instability.

At present, the artificial intelligence technology has limitations in accurately predicting new energy power generation and power requirements such as photovoltaic power generation, wind power discharge and the like, which causes prediction deviation and increases the problem of uncertainty of source load, and the method has direct influence on scheduling planning of a micro-grid. And secondly, with large-scale grid connection of new energy power generation such as photovoltaic power generation and wind power generation, the capacity of the traditional adjustable power generation equipment is relatively reduced, so that adjustable resources for balancing new energy power generation fluctuation and power demand of photovoltaic power generation and wind power generation in a micro-grid become insufficient, and the influence of the source load uncertainty problem is further aggravated.

Through retrieval, related technical schemes of the building micro-grid exist in the prior art, but the technical problem is that the traditional building micro-grid optimization scheduling generally comprises two layers, namely, two-stage robust optimization scheduling before the day and real-time optimization scheduling in the day. In day-ahead scheduling, the construction of the set of source load uncertainties does not take full advantage of the correlation between the source load prediction bias and the predicted value, which results in the set of constructed uncertainties being too conservative. Furthermore, two-phase robust optimized scheduling does not take into account the multi-time scale characteristics of source-load devices and uncertainties in building micro-grids, which makes scheduling plans for those source-load side devices with short time scale characteristics too conservative. In the real-time scheduling layer in the day, the randomness of the source load prediction deviation cannot be fully considered by the traditional method based on mathematical programming, so that the influence on the source load prediction deviation is difficult to effectively influence.

Disclosure of Invention

In order to overcome the defects of the prior art, the invention provides a robust optimal scheduling method for a building micro-grid, which is based on a daily real-time optimal scheduling model for model reinforcement learning, and calculates the daily real-time scheduling result by utilizing the daily scheduling result so as to ensure the stable and efficient operation of the building micro-grid system.

To achieve the above object, one or more embodiments of the present invention provide the following technical solutions:

in a first aspect, a robust optimal scheduling method for a building micro-grid is disclosed, comprising:

Generating a plurality of source load samples based on the day-ahead source load predicted value, wherein the source load samples represent possible realization values of the source load in the next day, so as to construct a source load uncertainty set;

The method comprises the steps of dividing a source load uncertainty set into a larger part of source load uncertainty set and a smaller part of source load uncertainty set, and establishing a three-stage robust optimal scheduling model before the day, wherein the three-stage robust optimal scheduling model before the day comprises the following steps:

determining a scheduling plan under a source load predicted value nominal scene;

The second stage is to calculate the pursuit plan of the heating, ventilation, air conditioning and electric vehicle lumped load of the gas turbine, energy storage battery and load aggregator with different response speed with the aim of maximizing the minimum pursuit cost of the building micro-grid in the worst scene of the uncertainty concentration of the larger part of source load;

The second-stage pursuit plan is overlapped on the basis of the first-stage nominal plan, the third stage aims at maximizing the minimum pursuit cost of the building micro-grid in the condition of uncertainty concentration of small partial source load and worst, and calculates the pursuit plan of the lumped load of the energy storage battery and the electric vehicle of the load aggregator, which has higher response speed;

on the basis of the day-ahead scheduling layer, the day-ahead real-time optimal scheduling layer is built based on model reinforcement learning, and the day-ahead real-time optimal scheduling plan is output.

As a further technical solution, the specific process of constructing the source load uncertainty set is as follows:

acquiring the latest day-ahead predicted value;

acquiring an expected value and a covariance matrix of conditional correlation between a source load predicted value and a true value;

sampling the conditional correlation between the predicted value and the true value of the source load to generate a plurality of samples, and then obtaining the samples of the true value of the source load;

Based on The sample obtained by time sampling and the latest day-ahead predicted value establish a source load uncertainty set.

As a further technical scheme, the source load uncertainty set is decomposed into an uncertainty set with larger fluctuation amplitude and long time scale characteristics and an uncertainty set with smaller fluctuation amplitude and short time scale characteristics.

The method comprises the following steps of determining a scheduling plan under a source load predicted value nominal scene, wherein the scheduling plan comprises a first-stage optimization target, constraint conditions and decision variables;

The first stage constraints include power balance constraints, gas turbine operating constraints, grid constraints, energy storage battery operating constraints, and load aggregator constraints.

As a further technical scheme, the optimization target of the second stage is the second stage additional cost, and specifically comprises the additional cost of the gas turbine, the additional cost of the energy storage battery and the compensation cost of the building micro-grid on the lumped load regulation behavior of the heating ventilation air conditioner of the load aggregator;

The second stage is a new constraint relative to the first stage, which is an upper limit value and a lower limit value of the user thermal comfort temperature of the heating ventilation air conditioner in the building, and the corresponding constraint condition aims to limit the indoor temperature to the thermal comfort range of the user.

As a further technical scheme, the third-stage additional cost comprises additional cost of an energy storage battery and compensation cost of a building micro-grid on lumped load adjustment behavior of the electric automobile of the load aggregator;

The third stage is to newly add upper and lower limit values of the lumped target charge quantity of the electric vehicle to the second stage, and Upper and lower limits of the deviation of the target charge amount tolerable to the individual electric car users.

As a further technical scheme, the method for establishing the intra-day real-time optimized scheduling layer based on model reinforcement learning comprises the following steps:

Constructing a real-time daily optimization scheduling problem as a Markov decision process, and defining states, actions and rewards;

Learning a state transition model of the building micro-grid system by utilizing LSTM, wherein the input of the state transition model is state and action, and the output is state variation;

Interacting the actor network for learning the deterministic strategy with the learned state transition model of the building micro-grid system in a mode of optimizing a limited prediction interval, and continuously updating actor and critic network model parameters by using accumulated rewards in the limited prediction interval obtained by interaction as an updating target of the critic network to finally obtain an optimal control strategy;

and after the daily load predicted value is obtained at set time intervals, the scheduling center obtains a daily real-time scheduling plan by utilizing actor network optimal control strategies.

In a second aspect, a building micro-grid robust optimal scheduling system is disclosed, comprising:

a source load uncertainty set construction module configured to generate a plurality of source load samples based on the day-ahead source load predicted value, representing possible realization values of the source load on the next day, thereby constructing a source load uncertainty set;

The day-ahead scheduling layer establishing module is configured to establish a day-ahead scheduling layer and comprises the steps of dividing a source load uncertainty set into a larger part of source load uncertainty set, a smaller part of source load uncertainty set and establishing a day-ahead three-stage robust optimal scheduling model, wherein the day-ahead three-stage robust optimal scheduling model comprises:

And the intra-day real-time optimal scheduling module is configured to establish an intra-day real-time optimal scheduling layer based on the model reinforcement learning on the basis of the pre-day scheduling layer and output an intra-day real-time optimal scheduling plan.

The one or more of the above technical solutions have the following beneficial effects:

In order to overcome the limitation of the traditional single-layer deterministic optimization scheduling method, the technical scheme of the invention creatively provides a multi-layer building micro-grid management framework. The framework integrates a data-driven uncertainty set modeling technology and a three-stage robust optimization scheduling strategy with multi-time scale characteristics, and ensures the comprehensive treatment of source load uncertainty in a day-ahead scheduling stage. Meanwhile, the intra-day real-time scheduling method based on model reinforcement learning is combined, and the response capability of the system to prediction deviation and the scheduling flexibility are further improved.

In order to accurately describe source load uncertainty, aiming at the correlation characteristic between source load prediction deviation and a predicted value, the technical scheme of the invention provides a modeling method for a data driving source load uncertainty set. Secondly, in a day-ahead scheduling layer, aiming at the multi-time scale characteristics of source load equipment and source load uncertainty of a building micro-grid, the invention provides a day-ahead scheduling layer based on three-stage robust optimal scheduling, firstly, a source load uncertainty set is decomposed into two parts of large fluctuation amplitude, long time scale, small fluctuation amplitude and short time scale, secondly, a nominal scheduling plan of all source load equipment is calculated in a first stage, a chase scheduling plan of the source load equipment with the long time scale characteristics and the short time scale characteristics is calculated in a second scheduling stage and a third scheduling stage respectively based on the source load uncertainty set with the long time scale and the short time scale characteristics, and finally day-ahead scheduling is summarized. In a real-time scheduling layer in the day, aiming at the random characteristic presented by the source load prediction deviation, the invention provides a real-time optimization scheduling model in the day based on model reinforcement learning, and the real-time scheduling result in the day is calculated by utilizing the scheduling result before the day so as to ensure the stable and efficient operation of a building micro-grid system.

Additional aspects of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the invention.

FIG. 1 is a schematic overall flow chart of a method according to an embodiment of the present invention;

FIG. 2 is a schematic flow chart of a method according to an embodiment of the invention.

Detailed Description

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the invention. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of exemplary embodiments according to the present invention.

Embodiments of the invention and features of the embodiments may be combined with each other without conflict.

Example 1

The embodiment discloses a robust optimal scheduling method for a building micro-grid, which can effectively cope with source load uncertainty in the building micro-grid and comprises the following steps:

step one, establishing a source load uncertainty set with correlation information based on data driving.

And capturing the conditional correlation of the prediction deviation and the prediction value by using the probability information of the source load historical data and the corresponding prediction deviation. And establishing a source load uncertainty set based on a conditional normal connection method according to the latest source load predicted value by utilizing the extracted correlation information, so that the source load uncertainty set has stronger data adaptability to the change of a predicted result.

In an embodiment, in order to effectively solve the problem of uncertainty of source load in the optimized dispatching of the building micro-grid, a robust optimized dispatching method is necessary. In this process, a set of source load uncertainties needs to be constructed first. However, conventional methods based on probabilistic statistics and optimization theory, while taking into account worst-case source load uncertainties, tend to be too conservative, may not be adequate to meet the flexibility and efficiency requirements of practical applications. Because of a certain correlation between the source load prediction bias and the predicted value, the correlation is characterized in that the prediction bias is smaller and distributed intensively at low prediction power, and the prediction bias is larger and distributed decentrally at high prediction power.

For this feature, the present embodiment proposes a data-driven source load uncertainty set modeling method. According to the method, the probability information of the source load historical data is utilized to capture the condition correlation between the prediction deviation and the prediction value, so that the adaptability of a source load uncertainty set to the change of the prediction result is improved, and the conservation is reduced.

First, collecting the historical true value of the source loadPredicted value predicted by source load prediction model based on depth learningThe method comprises the steps of collecting historical true values and corresponding predicted values of load power values of photovoltaic power generation, wind power generation, electric vehicle batteries, heating ventilation air conditioning equipment and the like, and obtaining combined accumulated distribution of source load prediction deviation and the predicted values thereof according to a normal connection theory.

(1)

Wherein, The method comprises the steps of predicting the joint cumulative distribution of the deviation and the predicted value of the source load; Is the source load prediction bias, where ;AndThe intermediate variables are normally connected and are obtained from the source load history true values and the corresponding predicted values respectively; And Respectively obtaining accumulated distribution functions from the source load historical true value and the corresponding source load predicted value; is a standard normal cumulative distribution function obtained from the source load history true value and the corresponding source load predicted value, Is an inverse function thereof; And Probability density functions obtained from the source load historical true value and the corresponding source load predicted value respectively; For normally connecting intermediate variables AndProbability density functions of a standard multivariate normal distribution of (c),A covariance matrix for the probability density function.

The source load data refer to load power values of photovoltaic power generation, wind power generation, an electric vehicle battery, heating ventilation air conditioning equipment and the like, wherein the photovoltaic power generation and the wind power generation are source sides, and the load power values of the electric vehicle battery, the heating ventilation air conditioning equipment and the like are load sides.

Definition of the definition,,,Vector composed of normal connected intermediate variables obtained from source load history real value and corresponding predicted value, andI.e. obeying the covariance matrix asIs a normal distribution of (c).

(2)

(3)

Wherein the method comprises the steps ofAndThe linear correlation coefficient and the spearman correlation coefficient of normal connected intermediate variables are used for extracting the correlation between the source load history true value and the predicted value,AndComputing the same as。

And secondly, establishing a conditional probability relation between the source load prediction deviation and the source load prediction value through a normal connection model.

(4)

Wherein, A conditional probability density function between the source load prediction bias and the source load prediction value; The distribution characteristics of the source load predicted values are described; is a conditional distribution of a multielement normal distribution, meets the following conditions Conditional correlation between source load predicted values and true values is described.

(5)

Wherein, As a mean value vector of the data set,In the form of a covariance matrix,Describing the distribution characteristics between the source charge realism values and the predicted values,Describing the distribution characteristics of the source load predictors,Describing the distribution characteristics of the source charge realism values.

A conditional correlation between a source load predictor and a true value is described, with which a number of source load samples can be generated based on the latest day-ahead source load predictor, representing possible realization values of the source load on the next day, thereby constructing a set of source load uncertainties. The specific process is as follows:

1) Obtaining the latest day-ahead source load predicted value to obtain ;

2) Acquisition by means of (5)Is used for the expected value and covariance matrix;

3) By aligning Sampling to generate a plurality ofIs then passed throughA sample of the source charge real value is obtained.

Based onTime sampling acquired sampleAnd the latest day-ahead predictive value(Is the day-ahead scheduling period), a set of source load uncertainties is established.

(6)

Wherein the sampled vectorThe method has the distribution characteristics of prediction deviation under different day-ahead predicted values, so that the constructed source load uncertainty set has strong data adaptability to the change of the predicted result, and the uncertainty of the source load can be described more accurately.In order to predict the magnitude of the fluctuation of the deviation,Is a variable which is 0 to 1,AndTogether describe the degree of conservation of the source charge uncertainty set.

And step two, establishing a day-ahead scheduling layer based on three-stage robust optimal scheduling.

Firstly, dividing a source load uncertainty set into two parts of large fluctuation amplitude, a long time scale and a short time scale, and reducing conservation, wherein the large fluctuation amplitude and the small fluctuation amplitude refer to the large fluctuation amplitude of the prediction deviation, and the long and short time scales refer to the long and short time scale characteristics of source load side equipment. The method comprises the steps of establishing a three-stage robust optimization scheduling model in the first stage, wherein the first stage aims at minimizing the running cost of a building micro-grid under a source load predicted value, calculating a nominal scheduling plan for all source load devices, the second stage aims at maximizing the minimum cost of the building micro-grid in a source load uncertain set with large fluctuation amplitude and long time scale based on the nominal scheduling plan, and obtaining a top-down scheduling plan of the source load devices with long time scale characteristics, and the third stage aims at maximizing the minimum cost of the building micro-grid in the source load uncertain set with small fluctuation amplitude and short time scale based on the nominal scheduling plan and the top-down scheduling plan of the second stage. Finally, summarizing the scheduling plans of the three stages, and sending the summarized scheduling plans to an underlying intra-day real-time optimized scheduling layer as a pre-day scheduling plan.

In a real-time two-layer building micro-grid management framework in the day-ahead and day with the cooperation of multiple time scales, a robust optimal scheduling method is applied in the day-ahead stage to pre-process possible source load fluctuation. In the day-ahead dispatching layer, aiming at the multi-time scale characteristic of source load side equipment and source load uncertainty of a building micro-grid, the characteristic cannot be effectively utilized by the traditional day-ahead two-stage robust optimization dispatching. Firstly, dividing a source load uncertainty set into two parts of large fluctuation amplitude, long time scale, small fluctuation amplitude and short time scale, and reducing conservation. And secondly, establishing a three-stage robust optimal scheduling model before the day.

In a building micro-grid, the response speed of source load side equipment shows obvious multi-time scale characteristics. For example, gas turbines and hvac equipment have relatively slow response speeds due to thermal inertia and regulatory process limitations, with long time scale characteristics. Meanwhile, the energy storage battery, the electric automobile battery, the photovoltaic battery and the wind turbine unit show quick response capability, can quickly adapt to load change no matter in low power or high power output, and have the characteristic of short time scale. The accuracy of the source load prediction is inversely proportional to the prediction time scale, namely the longer the prediction time is, the larger the prediction deviation is, the shorter the prediction time is, the smaller the deviation is, and the management strategy of the building micro-grid needs to adjust the processing of the source load prediction deviation according to the response speed of the equipment. Specifically, for slower responding devices, strategies may be designed to handle larger prediction bias, while for faster responding devices, the response may be fast and adjusted to accommodate smaller prediction bias. Based on the method, the source load uncertainty set is further subdivided into two parts, wherein one part is in the case of large fluctuation amplitude and long time scale, and the other part is in the case of small fluctuation amplitude and short time scale. This subdivision helps to more accurately manage the uncertainty of the source load side equipment, optimize the overall scheduling strategy of the building micro-grid, and equations (7) and (8) are larger and smaller sets of source load uncertainties, respectively.

(7)

(8)

、Larger and smaller source load uncertainty sets, respectively; And The greater and lesser source load uncertainty concentrations, respectivelyThe moment source load value vector is the load power value vector of photovoltaic power generation, wind power generation, electric automobile batteries, heating ventilation air conditioning equipment and the like; Is that The predicted value vector of the source load at moment, namely the predicted value vector of the load power of photovoltaic power generation, wind power generation, electric automobile batteries, heating ventilation air conditioning equipment and the like; the fluctuation amplitude for describing the source load prediction deviation in the larger and smaller source load uncertainty sets is respectively in the range of ;For 0-1 variable, in the larger and smaller source load uncertainty sets, respectively, for determining the uncertainty setWhether positive and negative source load prediction deviation exists at any moment; Uncertainty parameters in the larger and smaller source load uncertainty sets represent the total time period number of the source load prediction deviation, and the value range is ,;AndThe conservation degree of the source load uncertainty set is described together, and the larger the value is, the worse the source load uncertainty scene is, so that the more conservative the scheduling plan is.

The optimization objective of the three-stage robust optimization scheduling model before day is given by the following equation (9). In the first stage, we determine a scheduling plan under the source load predictor nominal scenario, whose optimization objectives, constraints, and decision variables are defined by equations (10) - (12), respectively.

(9)

(10)

(11)

(12)

Wherein, Optimizing the targets for the first, second and third stages,Is a first, a second and a third stage decision vector,For the second and third stages in larger and smaller part of source charge uncertainty setIs a source load vector in (a).

First stage operation costComprises a gas turbine start-stop cost, a gas turbine operation and maintenance cost, an energy storage battery operation and maintenance cost and a power purchase cost,Is the cost of single start-stop of the gas turbine,AndThe unit operation and maintenance cost for the gas turbine and the energy storage battery,Is thatThe electricity selling price of the power grid at any moment,Is the first stageThe starting and stopping state of the gas turbine is carried out at the moment,Is the first stageThe output power of the gas turbine at the moment,Is the first stageThe charging and discharging power of the energy storage battery is always equal,Is the first stagePurchase power to the power grid at any time; Time scales are scheduled for the day before.

The first stage constraints include power balance constraints, gas turbine operating constraints, grid constraints, energy storage battery operating constraints, and load aggregator constraints, including power balance constraints and gas turbine operating constraints in equation (11) above,Is the first stageThe output power of the photovoltaic power generation is at the moment,Is the first stageThe wind power generation output power at the moment,Is the first stageThe electric power is used by the load at the moment; is the first stage Heating ventilation and air conditioning of a moment load aggregator and lumped load of an electric automobile. Equation (12) is a first stage decision vector.

And in the second stage, the minimum pursuit cost of the building micro-grid in the worst scene of the uncertainty concentration of the larger part of source load is used as a target, and the pursuit plans of the lumped loads of the gas turbines, the energy storage batteries, the heating ventilation air conditioner of the load aggregator and the electric automobile with different response speeds are calculated. Equations (13) - (15) below are second stage optimization objectives, new constraints and decision variables, respectively, relative to the first stage.

(13)

(14)

(15)

Wherein, the second stage recoils the costComprises the following cost of a gas turbine, the following cost of an energy storage battery and the compensation cost of a building micro-grid on the lumped load adjustment behavior of a heating ventilation air conditioner of a load aggregator,Is thatThe unit power compensation cost of the building micro-grid to the total load of the heating ventilation air conditioner of the load aggregator is realized at any time,、、、In the second stage for each device of source loadThe second stage is to utilize the dispatching capability of the load aggregation heating, ventilation and air conditioning aggregate load, the second stage will not exactly meet the initial temperature setting of the user, so the formula (14) is the newly added constraint of the second stage relative to the first stage,The method is characterized in that the method comprises the steps of (1) a first stage decision variable is adopted in the formula (15), and the first stage decision variable is adopted in the formula (15), wherein the first stage decision variable is used for adjusting the starting and stopping states of a gas turbine and the purchase power of a power grid.

And on the basis that the second-stage chase plan is overlapped with the first-stage nominal plan, the third-stage chase plan for calculating the lumped load of the energy storage battery and the load aggregator electric automobile with higher response speed is calculated by taking the minimum chase cost of the maximum building micro-grid under the condition of the uncertainty concentration worst of the small partial source load as a target, and the following formulas (16) - (18) are respectively the third-stage optimization target, the newly added constraint relative to the second-stage and the decision variable.

(16)

(17)

(18)

Wherein, the cost of the pursuit in the third stageComprises the following cost of an energy storage battery and the compensation cost of a building micro-grid to the lumped load adjustment behavior of the electric automobile of a load aggregator,Is thatThe unit power compensation cost of the building micro-grid to the lumped load of the electric automobile of the load aggregator is realized at any time,、、In the third stage for each device of source loadThe third stage is to utilize the dispatching capability of the load aggregation electric automobile to aggregate load, and the third stage will not exactly meet all the target charge amount, so the formula (17) is to increase the constraint of the third stage relative to the second stage,、An upper limit value and a lower limit value of a lumped target charge amount of the electric vehicle;、 Is the first Upper and lower limits of the deviation of the target charge amount tolerable to the individual electric car users. Equation (18) is a second stage decision variable, and the third stage will not adjust the output power of the gas turbine and the electric power for the lumped load of the heating ventilation air conditioner.

And thirdly, establishing a real-time daily optimal scheduling layer based on model reinforcement learning.

Firstly, constructing a real-time daily optimization scheduling problem as a Markov decision process, learning a building micro-grid system state transition model by utilizing LSTM, and secondly, learning a real-time daily optimization scheduling optimal strategy based on actor-critic network and fusing a model predictive control algorithm by utilizing the daily front scheduling plan obtained in the step two as a reference to compensate the actual source load prediction deviation.

Because the source load prediction deviation is reduced along with the shortening of the prediction time, the deviation of a day-ahead scheduling plan occurs when the day-ahead scheduling plan operates in the day, and therefore different source load side devices need to be adjusted to ensure the stable operation of the building micro-grid. Because of the random characteristic presented by the source load prediction deviation, the traditional method based on mathematical programming is insufficient to cope with the random characteristic, so that the running deviation cannot be effectively eliminated, the reinforcement learning is based on the randomness of the Markov decision process to the environment effectively.

Firstly, the real-time daily optimization scheduling problem is constructed into a Markov decision process, and states, actions and rewards are defined. The state vector is defined as the following formula (19).

(19)

Wherein, The schedule time variable is optimized for real-time during the day,The schedule time scale is optimized for real-time throughout the day,,,Is thatThe real-time prediction value of the time source lotus in the day,The initial electric quantity value is lumped for the electric automobile; Is that The energy stored by the energy storage battery is stored at any time,Results are scheduled for the day before.

Defining motion vectorsIs of the formula (20):

(20)

Wherein, For each device of source loadAnd the corresponding power adjustment amount at the moment.

The defined reward function is the following formula (21):

(21)

Wherein, And (3) withRespectively isPunishment cost of electricity purchase violations of a unit power grid and punishment of reduction of unit photovoltaic power generation,Is thatThe unit power compensation cost of the building micro-grid to the lumped load of the electric automobile of the load aggregator is realized at any time,Is thatThe unit power compensation cost of the building micro-grid to the total load of the heating ventilation air conditioner of the load aggregator is realized at any time,Is thatAnd the electricity selling price of the power grid at moment.

Learning building micro-grid system state transition model by LSTMThe input to the state transition model is the stateAnd actionsOutput is state change quantity。

(22)

Fusion model predictive control algorithm concept, and learning deterministic strategyThe actor network and the learned building micro-grid system state transition model interact in a mode of optimizing a limited prediction interval, accumulated rewards in the limited prediction interval obtained by interaction are used as an updating target of the critic network, network model parameters actor and critic are continuously updated, and finally an optimal control strategy is obtained。

(23)

(24)

(25)

Wherein, For critic networks, equation (23) is the cumulative rewards within the predicted interval of the H-step model prediction, equation (24) is the loss function of critic networks, and equation (25) is the deterministic policy gradient of actor networks.

Finally, after obtaining the predicted daily load value every 5 minutes, the dispatching center uses actor networkAnd obtaining a real-time scheduling plan in the day.

(26)

Wherein, For real-time optimization of the scheduling plan during the day,For a day-ahead dispatch plan,And outputting a real-time optimal scheduling plan in the day for actor networks.

The reinforcement learning state vector (formula 19) includes the day-ahead schedule of the second stage, and the day-ahead schedule obtained by formula 26 is the sum of the day-ahead schedule and the result obtained by the day-ahead real-time optimization.

In the embodiment, a more accurate uncertainty set is established based on data driving, and the embodiment establishes a source load uncertainty set based on a data driving method, captures the condition correlation between a prediction error and a prediction value by utilizing probability information and prediction deviation of historical data, so that the source load uncertainty set can be more flexibly adapted to the change of a source load prediction result, the uncertainty set is more accurate, and the conservation type of the uncertainty set established by the traditional method is reduced.

In the embodiment, the robustness of the system is improved based on the day-ahead dispatching of three-stage robust optimization, a source load uncertainty set is decomposed into two parts with large fluctuation amplitude and small fluctuation amplitude aiming at the multi-time scale characteristics of source load side equipment and source load uncertainty of a building micro-grid, and a day-ahead three-stage robust optimal dispatching model is established based on the source load uncertainty set, so that compared with the traditional day-ahead two-stage robust optimal dispatching, the method and the system utilize the multi-time scale characteristics of the source load side equipment and the source load uncertainty of the building micro-grid, greatly reduce conservation and improve the robustness.

In the embodiment, the scheduling deviation is eliminated based on the intra-day real-time optimization scheduling of the model reinforcement learning, and the scheduling efficiency is improved. Because the source load prediction deviation is reduced along with the shortening of the prediction time scale, the deviation of the day-ahead scheduling plan occurs when the day-ahead scheduling plan operates in the day, and therefore different source load side devices need to be adjusted to ensure the stable operation of the building micro-grid. Because of the random characteristic presented by the source load prediction deviation, the traditional method based on mathematical programming is insufficient to cope with the random characteristic, so that the running deviation cannot be effectively eliminated, the reinforcement learning is based on the randomness of the Markov decision process to the environment effectively.

Example two

It is an object of the present embodiment to provide a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, which processor implements the steps of the above method when executing the program.

Example III

An object of the present embodiment is to provide a computer-readable storage medium.

A computer readable storage medium having stored thereon a computer program which when executed by a processor performs the steps of the above method.

Example IV

The purpose of this embodiment is to provide a robust optimization scheduling system for a micro-grid of a building, including:

Example five

It is an object of the present embodiments to provide a computer program product comprising instructions which, when run on a computer, cause the computer to perform the methods and functions involved in any of the embodiments described above.

The steps involved in the apparatus of the above embodiment correspond to those of the first embodiment of the method, and the detailed description of the embodiment refers to the relevant description of the first embodiment. The term "computer-readable storage medium" shall be taken to include a single medium or multiple media that includes one or more sets of instructions, and shall also be taken to include any medium that is capable of storing, encoding or carrying a set of instructions for execution by the processor and that cause the processor to perform any one of the methodologies of the present invention.

It will be appreciated by those skilled in the art that the modules or steps of the invention described above may be implemented by general-purpose computer means, alternatively they may be implemented by program code executable by computing means, whereby they may be stored in storage means for execution by computing means, or they may be made into individual integrated circuit modules separately, or a plurality of modules or steps in them may be made into a single integrated circuit module. The present invention is not limited to any specific combination of hardware and software.

While the foregoing description of the embodiments of the present invention has been presented in conjunction with the drawings, it should be understood that it is not intended to limit the scope of the invention, but rather, it is intended to cover all modifications or variations within the scope of the invention as defined by the claims of the present invention.

Claims

1. A robust optimization scheduling method for a building microgrid, characterized by comprising:

Based on the day-ahead source load forecast value, several source load samples are generated to represent the possible realized value of the source load on the next day, thereby constructing a source load uncertainty set;

Establishing a day-ahead scheduling layer includes: dividing the source-load uncertainty set into a larger source-load uncertainty set and a smaller source-load uncertainty set, and establishing a day-ahead three-stage robust optimization scheduling model, wherein the day-ahead three-stage robust optimization scheduling model includes:

Phase 1: Determine the dispatch plan under the nominal scenario of source load forecast value;

Phase II: To maximize the minimum recourse cost of the building microgrid under the worst scenario of a larger source-load uncertainty concentration, the recourse plan of the gas turbine, energy storage battery, and the HVAC and electric vehicle aggregate loads of the load aggregator with different response speeds is calculated;

The second-stage recourse plan is superimposed on the nominal plan of the first stage. In the third stage, the recourse plan of the energy storage battery with faster response speed and the electric vehicle aggregate load of the load aggregator is calculated with the goal of maximizing the minimum recourse cost of the building microgrid in the worst scenario of uncertain concentration of a smaller part of the source and load. Finally, the dispatch plans of the three stages are summarized as the day-ahead dispatch plan and sent to the lower-level intraday real-time optimization dispatch layer.

On the basis of the day-ahead scheduling layer, the intraday real-time optimization scheduling layer is established based on model reinforcement learning to output the intraday real-time optimal scheduling plan;

The specific process of constructing the source-load uncertainty set is as follows:

Get the latest day-ahead forecast value;

Obtain the expected value and covariance matrix of the conditional correlation between the source load prediction value and the true value;

The conditional correlation between the source load prediction value and the true value is sampled to generate a number of samples, and then a sample of the source load true value is obtained;

based on The samples obtained by sampling at the moment and the latest day-ahead forecast value establish a source load uncertainty set;

Among them, the first stage: determine the dispatch plan under the nominal scenario of source load forecast value, including the first stage optimization objectives, constraints and decision variables;

The constraints in the first stage include power balance constraints, gas turbine operation constraints, grid constraints, energy storage battery operation constraints, and load aggregator constraints;

The optimization target of the second stage is the second stage recovery cost, which specifically includes the gas turbine recovery cost, the energy storage battery recovery cost and the compensation cost of the building microgrid for the load aggregator's HVAC aggregate load regulation behavior;

The new constraints added in the second stage compared to the first stage are the upper and lower limits of thermal comfort temperature for HVAC users in the building. The corresponding constraints are intended to limit the indoor temperature to the thermal comfort range of users;

The third stage recourse costs include the energy storage battery recourse costs and the compensation costs of the building microgrid for the load aggregator's electric vehicle aggregate load regulation behavior;

Compared with the second stage, the third stage adds the upper and lower limits of the aggregate target charging capacity of electric vehicles; The upper and lower limits of the deviation from the target charging amount that an electric vehicle user can tolerate;

Based on model reinforcement learning, a real-time optimization scheduling layer is established within the day, including:

The intraday real-time optimization scheduling problem is constructed as a Markov decision process, defining states, actions, and rewards;

The state transition model of the building microgrid system is learned using LSTM. The input of the state transition model is the state and action, and the output is the state change.

The actor network that learns deterministic strategies interacts with the learned building microgrid system state transition model in a way that optimizes the finite prediction interval. The accumulated reward within the finite prediction interval obtained by the interaction is used as the update target of the critic network. The actor and critic network model parameters are continuously updated to finally obtain the optimal control strategy.

After obtaining the daily source-load forecast value at set intervals, the dispatch center will use the actor network optimal control strategy to obtain the daily real-time dispatch plan.

2. A method for robust optimization scheduling of a building microgrid as described in claim 1, characterized in that the source-load uncertainty set is decomposed into an uncertainty set with a large fluctuation amplitude and long-time scale characteristics and an uncertainty set with a small fluctuation amplitude and short-time scale characteristics.

3. A building microgrid robust optimization dispatching system, characterized by comprising:

The source load uncertainty set construction module is configured to: generate a number of source load samples based on the day-ahead source load prediction value, representing the possible realization value of the source load on the next day, thereby constructing a source load uncertainty set;

The day-ahead scheduling layer establishment module is configured to: establish the day-ahead scheduling layer, including: dividing the source-load uncertainty set into a larger source-load uncertainty set and a smaller source-load uncertainty set, and establishing a day-ahead three-stage robust optimization scheduling model, wherein the day-ahead three-stage robust optimization scheduling model includes:

The intraday real-time optimization scheduling module is configured as follows: on the basis of the day-ahead scheduling layer, the intraday real-time optimization scheduling layer is established based on model reinforcement learning, and the intraday real-time optimal scheduling plan is output;

Get the latest day-ahead forecast value;

4. A computer device comprising a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the processor implements the steps of the method described in any one of claims 1 to 2 when executing the program.

5. A computer-readable storage medium having a computer program stored thereon, wherein when the program is executed by a processor, the steps of the method described in any one of claims 1 to 2 are executed.