[go: up one dir, main page]

CN112632677B - Bridge full-life maintenance strategy optimization method based on half Markov decision process - Google Patents

Bridge full-life maintenance strategy optimization method based on half Markov decision process Download PDF

Info

Publication number
CN112632677B
CN112632677B CN202011561626.1A CN202011561626A CN112632677B CN 112632677 B CN112632677 B CN 112632677B CN 202011561626 A CN202011561626 A CN 202011561626A CN 112632677 B CN112632677 B CN 112632677B
Authority
CN
China
Prior art keywords
bridge
reliability index
maintenance
state
decision
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011561626.1A
Other languages
Chinese (zh)
Other versions
CN112632677A (en
Inventor
陶伟峰
王乃玉
汪英俊
林陪晖
王俊彦
黄秀兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Straits Innovation Internet Co ltd
Zhejiang Strait Innovation Technology Co ltd
Zhejiang University ZJU
Original Assignee
Straits Innovation Internet Co ltd
Zhejiang Strait Innovation Technology Co ltd
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Straits Innovation Internet Co ltd, Zhejiang Strait Innovation Technology Co ltd, Zhejiang University ZJU filed Critical Straits Innovation Internet Co ltd
Priority to CN202011561626.1A priority Critical patent/CN112632677B/en
Publication of CN112632677A publication Critical patent/CN112632677A/en
Application granted granted Critical
Publication of CN112632677B publication Critical patent/CN112632677B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/10Geometric CAD
    • G06F30/13Architectural design, e.g. computer-aided architectural design [CAAD] related to design of buildings, bridges, landscapes, production plants or roads
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/23Design optimisation, verification or simulation using finite element methods [FEM] or finite difference methods [FDM]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/20Administration of product repair or maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/08Construction
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2111/00Details relating to CAD techniques
    • G06F2111/06Multi-objective optimisation, e.g. Pareto optimisation using simulated annealing [SA], ant colony algorithms or genetic algorithms [GA]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2111/00Details relating to CAD techniques
    • G06F2111/08Probabilistic or stochastic CAD
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/02Reliability analysis or reliability optimisation; Failure analysis, e.g. worst case scenario performance, failure mode and effects analysis [FMEA]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2119/00Details relating to the type or aim of the analysis or the optimisation
    • G06F2119/04Ageing analysis or optimisation against ageing

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Geometry (AREA)
  • Marketing (AREA)
  • Evolutionary Computation (AREA)
  • Computer Hardware Design (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • Operations Research (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Educational Administration (AREA)
  • Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Architecture (AREA)
  • Civil Engineering (AREA)
  • Structural Engineering (AREA)
  • Computational Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Pure & Applied Mathematics (AREA)
  • Bridges Or Land Bridges (AREA)
  • Testing Resistance To Weather, Investigating Materials By Mechanical Methods (AREA)

Abstract

The invention discloses a method for optimizing a bridge full-life maintenance strategy based on a half Markov decision process, which comprises the following steps: s1, determining the annual failure probability of the bridge and the corresponding reliability index, and defining the bridge state according to the reliability index; s2, only considering the reliability index degradation caused by corrosion, and setting the degradation process to accord with the gamma process; s3, calculating the failure probability of the bridge in the decision-making interval; s4, performing health detection on the bridge every year, judging the degradation condition of the bridge protection layer, determining the state of the bridge, and determining the adopted decision according to the state of the bridge, wherein the decision problem adopts a half Markov decision process model; and S5, solving the half Markov decision process model to obtain the optimal full-life maintenance strategy of the bridge. The method is used for uniformly optimizing a preventive maintenance strategy and a necessary maintenance strategy based on the reliability index of the bridge, and meanwhile, the randomness in the performance degradation process of the bridge and the time-varying reliability of the bridge in a decision-making interval are considered.

Description

基于半马尔科夫决策过程的桥梁全寿命维护策略优化方法Optimization method of bridge life-cycle maintenance strategy based on semi-Markov decision process

技术领域technical field

本发明涉及桥梁工程技术领域,具体涉及一种基于半马尔科夫决策过程的桥梁全寿命维护策略优化方法。The invention relates to the technical field of bridge engineering, in particular to a method for optimizing a bridge life-cycle maintenance strategy based on a semi-Markov decision process.

背景技术Background technique

桥梁在服役期间不可避免会受到各种环境因子的作用,如碳化、氯离子侵蚀及车辆疲劳荷载,导致强度逐渐下降,从而影响安全性。若维护不当,老旧桥梁容易在极端车辆荷载下发生倒塌,造成严重的社会影响。另一方面,过度维护则会造成不必要的资源浪费。因此,如何优化桥梁的全寿命维护策略,在维护费用与倒塌风险之间达到一种权衡,是桥梁管理部门关心的重要问题。桥梁的维护措施可分为预防性维护与必要性维护两大类,前者通过修复桥梁保护层以延缓锈蚀的进一步发展,后者则是将桥梁修复至完好状态,所需的费用也更高。目前,常用的优化方法对预防性维护策略和必要性维护策略进行了区别对待,分别基于时间和性能对两者进行优化,而不是在一个统一的框架内进行优化。Bridges will inevitably be affected by various environmental factors during their service, such as carbonization, chloride ion erosion, and vehicle fatigue loads, resulting in a gradual decrease in strength, thereby affecting safety. If not maintained properly, old bridges are prone to collapse under extreme vehicle loads, causing serious social impacts. On the other hand, over-maintenance will cause unnecessary waste of resources. Therefore, how to optimize the life-cycle maintenance strategy of bridges and achieve a trade-off between maintenance costs and collapse risk is an important issue that bridge management departments are concerned about. The maintenance measures of bridges can be divided into two categories: preventive maintenance and essential maintenance. The former is to repair the bridge protective layer to delay the further development of corrosion, and the latter is to restore the bridge to a good state, and the cost is also higher. Currently, commonly used optimization methods treat preventive maintenance strategies and necessary maintenance strategies differently, optimizing both based on time and performance, respectively, rather than optimizing within a unified framework.

发明内容SUMMARY OF THE INVENTION

本发明为了克服以上技术的不足,提供了一种基于半马尔科夫决策过程的桥梁全寿命维护策略优化方法。In order to overcome the deficiencies of the above technologies, the present invention provides an optimization method for bridge life-cycle maintenance strategy based on a semi-Markov decision process.

本发明克服其技术问题所采用的技术方案是:The technical scheme adopted by the present invention to overcome its technical problems is:

一种基于半马尔科夫决策过程的桥梁全寿命维护策略优化方法,包括步骤:An optimization method for bridge life-cycle maintenance strategy based on semi-Markov decision process, including steps:

S1、基于退化后的桥梁强度和车辆荷载的年极值分布,计算桥梁的年失效概率及对应的可靠度指标,并根据可靠度指标定义桥梁状态;S1. Based on the annual extreme value distribution of the degraded bridge strength and vehicle load, calculate the annual failure probability of the bridge and the corresponding reliability index, and define the bridge state according to the reliability index;

S2、仅考虑锈蚀引起的可靠度指标退化,且设定退化过程符合伽马过程;S2. Only the reliability index degradation caused by corrosion is considered, and the degradation process is set to conform to the gamma process;

S3、采用时变可靠度方法计算决策区间内桥梁的失效概率;其中,步骤S1中所述的年失效概率是指桥梁在一年内发生失效的概率,而决策区间内桥梁的失效概率是指桥梁在相邻两个决策时刻之间发生失效的概率;S3. The time-varying reliability method is used to calculate the failure probability of the bridge in the decision interval; wherein, the annual failure probability described in step S1 refers to the probability that the bridge will fail within one year, and the failure probability of the bridge in the decision interval refers to the bridge The probability of failure between two adjacent decision moments;

S4、分别定义桥梁的状态空间和桥梁维护的动作空间,每年对桥梁进行一次健康检测,根据检测结果判断桥梁保护层的退化情况并确定桥梁状态,根据桥梁状态确定采取的决策,该决策问题采用半马尔科夫决策过程模型;S4. Define the state space of the bridge and the action space of the bridge maintenance, respectively, conduct a health inspection of the bridge every year, judge the degradation of the bridge protective layer and determine the bridge state according to the inspection results, and determine the decision to be taken according to the bridge state. Semi-Markov decision process model;

S5、采用Q学习算法求解步骤S4所述的半马尔科夫决策过程模型,获得桥梁最优全寿命维护策略。S5 , using the Q-learning algorithm to solve the semi-Markov decision process model described in step S4 to obtain the optimal full-life maintenance strategy for the bridge.

进一步地,所述步骤S1中,退化后的桥梁强度的获取方式如下:Further, in the step S1, the method of obtaining the degraded bridge strength is as follows:

对桥梁进行健康检测,获得关于桥梁锈蚀程度的数据,然后通过有限元分析或截面强度公式计算得出退化后的桥梁强度;所述有限元分析和截面承载力公式都是计算桥梁强度的两种常用手段,前者工作量大但精度较高,后者简单易行但精度相对低一些。Perform health inspection on bridges to obtain data on the degree of bridge corrosion, and then calculate the degraded bridge strength through finite element analysis or section strength formula; the finite element analysis and section bearing capacity formula are both two types of bridge strength calculations Commonly used methods, the former has a large workload but high precision, and the latter is simple and easy to implement but has relatively low precision.

进一步地,所述步骤S1中,车辆荷载的年极值分布根据实测数据统计得出或参考桥梁设计规范得出。Further, in the step S1, the annual extreme value distribution of the vehicle load is obtained statistically according to the measured data or obtained with reference to the bridge design specification.

进一步地,所述步骤S1中,计算可靠度指标及定义桥梁状态,具体包括如下:Further, in the step S1, calculating the reliability index and defining the bridge state specifically includes the following:

依次设桥梁的年失效概率为

Figure BDA0002860851190000021
可靠度指标为β,所述可靠度指标β的计算为:In turn, the annual failure probability of the bridge is set as
Figure BDA0002860851190000021
The reliability index is β, and the calculation of the reliability index β is:

Figure BDA0002860851190000022
Figure BDA0002860851190000022

上式中,Φ(·)为标准正态分布函数,由β可以得到可靠度指标退化系数D:In the above formula, Φ( ) is the standard normal distribution function, and the reliability index degradation coefficient D can be obtained from β:

Figure BDA0002860851190000023
Figure BDA0002860851190000023

上式中,β0为桥梁的初始可靠度指标;In the above formula, β 0 is the initial reliability index of the bridge;

将可靠度指标退化系数从0到1均匀离散为N等份,从而得到N+1个离散值,对应N+1个桥梁状态,分别为s0、s1、……、sN,其中,s0和sN分别对应完好状态和可靠度指标为0的状态,s1至sN-1处于完好状态与可靠度指标为0的状态之间,且对应的可靠度指标依次降低。The reliability index degradation coefficient is uniformly dispersed into N equal parts from 0 to 1, so as to obtain N+1 discrete values, corresponding to N+1 bridge states, which are respectively s 0 , s 1 , ..., s N , among which, s 0 and s N correspond to the intact state and the state where the reliability index is 0 respectively, and s 1 to s N-1 are between the intact state and the state where the reliability index is 0, and the corresponding reliability indexes decrease in turn.

进一步地,所述步骤S2中,仅考虑锈蚀引起的可靠度指标退化,且设定退化过程符合伽马过程,即退化系数D(t)满足以下条件:Further, in the step S2, only the reliability index degradation caused by corrosion is considered, and the degradation process is set to conform to the gamma process, that is, the degradation coefficient D(t) satisfies the following conditions:

1)D(0)=0依概率1成立;1) D(0)=0 is established according to probability 1;

2)D(t)为独立增量过程;2) D(t) is an independent incremental process;

3)对任意t2>t1≥0,D(t2)-D(t1)服从形状参数为v(t2)-v(t1)、尺度参数为u的伽马分布;3) For any t 2 >t 1 ≥0, D(t 2 )-D(t 1 ) obeys a gamma distribution with shape parameter v(t 2 )-v(t 1 ) and scale parameter u;

形状参数为v、尺度参数为u的伽马分布的概率密度函数如下:The probability density function of a gamma distribution with shape parameter v and scale parameter u is as follows:

Figure BDA0002860851190000031
Figure BDA0002860851190000031

上式中,

Figure BDA0002860851190000032
为伽马函数,x为伽马分布概率密度函数的自变量,y为伽马函数中的积分变量;In the above formula,
Figure BDA0002860851190000032
is the gamma function, x is the independent variable of the probability density function of the gamma distribution, and y is the integral variable in the gamma function;

进一步设形状参数随时间线性增加,即,It is further assumed that the shape parameter increases linearly with time, that is,

v(t)=ζtv(t)=ζt

上式中,ζ为环境常数。In the above formula, ζ is the environmental constant.

进一步地,所述步骤S3中,设相邻两个决策时刻之间的时间间隔为Δt年,且先后两个决策时刻桥梁的可靠度指标分别为β0(1-D1)和β0(1-D2),为便于计算,设该决策区间内可靠度指标逐年阶梯下降,则第i年的可靠度指标为:Further, in the step S3, it is assumed that the time interval between two adjacent decision-making moments is Δt years, and the reliability indexes of the bridges at the two successive decision-making moments are respectively β 0 (1-D 1 ) and β 0 ( 1-D 2 ), for the convenience of calculation, suppose that the reliability index in the decision-making interval decreases year by year, then the reliability index of the i-th year is:

Figure BDA0002860851190000033
Figure BDA0002860851190000033

故,桥梁在Δt年内的失效概率

Figure BDA0002860851190000034
为:Therefore, the failure probability of the bridge in Δt years
Figure BDA0002860851190000034
for:

Figure BDA0002860851190000035
Figure BDA0002860851190000035

进一步地,所述步骤S4中,设桥梁的状态空间为S={s0,…,sN},桥梁维护的动作空间为A={a0,a1,a2},其中,a0、a1、a2分别代表无维护、预防性维护、必要性维护;每年对桥梁进行一次健康检测,当发现桥梁保护层完全失效或桥梁的退化程度提高一级,则决定是否需要采取预防性维护措施或必要性维护措施。Further, in the step S4, the state space of the bridge is set to be S={s 0 ,...,s N }, and the action space of bridge maintenance is A={a 0 , a 1 , a 2 }, where a 0 , a 1 , and a 2 represent no maintenance, preventive maintenance, and necessary maintenance, respectively; a health inspection is carried out on the bridge every year. When it is found that the protective layer of the bridge has completely failed or the degradation degree of the bridge has increased by one level, it is decided whether to take preventive measures. Maintenance measures or necessary maintenance measures.

进一步地,所述步骤S4中,半马尔科夫决策过程模型如下:Further, in the step S4, the semi-Markov decision process model is as follows:

Figure BDA0002860851190000041
Figure BDA0002860851190000041

Figure BDA0002860851190000042
Figure BDA0002860851190000042

上式中,Vt *(s)为最优值函数,表示t时刻之后的最小总期望贴现成本;πt(s)为由策略πt和状态s确定的维护措施;Π为桥梁全寿命维护策略空间;In the above formula, V t * (s) is the optimal value function, representing the minimum total expected discounted cost after time t; π t (s) is the maintenance measure determined by the policy π t and the state s; Π is the full life of the bridge maintain policy space;

s′为桥梁的下一个可能状态,其值取决于当前状态和当前维护措施,具体是:1)若当前不采取维护措施,则s′比s高一个等级但不超过最高等级,2)若当前采取预防性维护措施,则s′=s,3)若当前采取必要性维护措施,则s′=s0s' is the next possible state of the bridge, and its value depends on the current state and the current maintenance measures, specifically: 1) if no maintenance measures are currently taken, s' is one level higher than s but does not exceed the highest level, 2) if If preventive maintenance measures are currently taken, then s'=s, 3) If necessary maintenance measures are currently taken, then s'=s 0 ;

Δt为相邻两个决策时刻之间的时间间隔,其概率分布p(Δt|πt(s))由当前维护措施决定,具体是:1)若当前不采取维护措施,则Δt为桥梁在当前状态上的停留时间,其概率分布由可靠度指标退化过程确定,2)若当前采取预防性维护措施或必要性维护措施,则Δt为锈蚀起始时间,且服从正态分布;Δt is the time interval between two adjacent decision moments, and its probability distribution p(Δt|π t (s)) is determined by the current maintenance measures, specifically: 1) If no maintenance measures are currently taken, then Δt is the bridge The probability distribution of the residence time in the current state is determined by the degradation process of the reliability index. 2) If preventive maintenance measures or necessary maintenance measures are currently taken, Δt is the start time of rust and obeys a normal distribution;

T为桥梁设计使用年限;λ为贴现率;c0为桥梁维护费用,由桥梁状态和维护措施决定;c1为时间间隔Δt内因桥梁倒塌造成的期望贴现间接损失,可按下式计算:T is the design life of the bridge; λ is the discount rate; c 0 is the bridge maintenance cost, which is determined by the bridge state and maintenance measures; c 1 is the expected discounted indirect loss caused by bridge collapse within the time interval Δt, which can be calculated as follows:

Figure BDA0002860851190000043
Figure BDA0002860851190000043

上式中,Cf为因桥梁失效造成的间接经济损失;i和j都为表示时间的求和变量。In the above formula, C f is the indirect economic loss caused by bridge failure; i and j are both summation variables representing time.

本发明的有益效果是:The beneficial effects of the present invention are:

该方法基于桥梁的可靠度指标对预防性维护策略和必要性维护策略进行统一优化,同时考虑了桥梁性能退化过程中的随机性以及决策区间内桥梁的时变可靠度,便于桥梁管理部门根据桥梁的检测结果采取最优维护措施,使得桥梁的全寿命维护费用与期望贴现间接损失之和最小。This method optimizes the preventive maintenance strategy and the necessary maintenance strategy in a unified manner based on the reliability index of the bridge. At the same time, the randomness of the bridge performance degradation process and the time-varying reliability of the bridge in the decision interval are considered, which is convenient for the bridge management department according to the bridge. The optimal maintenance measures are taken according to the detection results of the bridge, so that the sum of the whole-life maintenance cost of the bridge and the expected discounted indirect loss is minimized.

附图说明Description of drawings

图1为本发明实施例所述桥梁可靠度指标退化过程样本。FIG. 1 is a sample of the degradation process of the bridge reliability index according to the embodiment of the present invention.

图2为本发明实施例所述的Q学习算法的流程图。FIG. 2 is a flowchart of a Q-learning algorithm according to an embodiment of the present invention.

图3为本发明实施例所述Q值收敛过程。FIG. 3 is a Q value convergence process according to an embodiment of the present invention.

图4为本发明实施例所述不同状态、不同维护措施对应的Q值。FIG. 4 shows Q values corresponding to different states and different maintenance measures according to the embodiment of the present invention.

图5为本发明实施例所述不同服役时间、不同维护措施对应的Q值。FIG. 5 shows Q values corresponding to different service times and different maintenance measures according to the embodiment of the present invention.

图6为本发明实施例所述最优全寿命维护策略示意图。FIG. 6 is a schematic diagram of an optimal full-life maintenance strategy according to an embodiment of the present invention.

具体实施方式Detailed ways

为了便于本领域人员更好的理解本发明,下面结合附图和具体实施例对本发明做进一步详细说明,下述仅是示例性的不限定本发明的保护范围。In order to facilitate those skilled in the art to better understand the present invention, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments. The following are only exemplary and do not limit the protection scope of the present invention.

本实施例以一个虚拟的钢筋混凝土简支梁桥为例详细说明一下本发明所述的基于半马尔科夫决策过程的桥梁全寿命维护策略优化方法。In this embodiment, a virtual reinforced concrete simply supported girder bridge is used as an example to describe in detail the method for optimizing the bridge lifetime maintenance strategy based on the semi-Markov decision process according to the present invention.

参数设置如下:设计使用年限为T=100年;贴现率为λ=0.05;桥梁的初始可靠度指标为β0=4.5;桥梁状态数量取为11个,对应的可靠度指标退化系数D分别为0、10%、20%、…、100%;桥梁的初始建造费用取为Ccon=1×107元;假设桥梁倒塌造成的间接经济损失为桥梁初始建造费用的三倍,即Cf=3×107元;预防性维护措施的费用取为桥梁初始建造费用的5%,即CPM=5×105元;必要性维护措施的费用取决于桥梁的当前状态,按下式计算:The parameters are set as follows: the design service life is T = 100 years; the discount rate is λ = 0.05; the initial reliability index of the bridge is β 0 = 4.5; the number of bridge states is taken as 11, and the corresponding reliability index degradation coefficients D are 0, 10%, 20%, ..., 100%; the initial construction cost of the bridge is taken as C con = 1×10 7 yuan; it is assumed that the indirect economic loss caused by the collapse of the bridge is three times the initial construction cost of the bridge, that is, C f = 3 × 10 7 yuan; the cost of preventive maintenance measures is taken as 5% of the initial construction cost of the bridge, that is, C PM = 5 × 10 5 yuan; the cost of necessary maintenance measures depends on the current state of the bridge, and is calculated as follows:

CEM=min(1,0.05+0.02D2)·Ccon C EM =min(1,0.05+0.02D 2 )·C con

锈蚀起始时间的均值和标准差分别为15年和1年;锈蚀伽马过程的环境常数和尺度参数分别为ζ=1.0、u=0.01。作为示意,图1给出了桥梁服役后100年内的100条可靠度指标退化过程样本,可以看到,由于保护层的作用,服役初期桥梁的可靠度指标没有出现退化;随着服役时间的增加,可靠度指标逐渐退化。如果服役期间没有采取任何维护措施,桥梁可靠度指标甚至可能降至0。The mean and standard deviation of the onset time of corrosion were 15 years and 1 year, respectively; the environmental constants and scale parameters of the corrosion gamma process were ζ=1.0, u=0.01, respectively. As an illustration, Figure 1 shows 100 samples of the degradation process of reliability indicators within 100 years after the bridge was in service. It can be seen that due to the effect of the protective layer, the reliability indicators of the bridge did not degrade in the initial stage of service; with the increase of service time , the reliability index gradually degrades. If no maintenance measures are taken during service, the bridge reliability index may even drop to 0.

用1×105个迭代序列的Q学习算法优化全寿命维护策略,算法流程如图2所示。学习率α和贪婪参数ε都采用指数衰减模型,衰减常数取为1×10-4。为了检验计算结果的收敛性,图3给出了桥梁服役50年后,桥梁状态-维护措施对(s3,无维护)对应的Q值的迭代过程。可以看到,经过约4×104个序列的迭代,Q值便已经收敛。图4给出了桥梁服役50年后处于不同状态时采取无维护、预防性维护和必要性维护措施对应的Q值。通过对比Q值,可以确定每个状态下的最优维护措施。具体而言,若桥梁处于状态s0、s1或s2,则不需要采取任何维护措施;若桥梁处于状态s3或s4,则应采取预防性维护措施修复其保护层;对于其余状态,则应采取必要性维护措施将桥梁修复至完好状态。图5给出了桥梁服役不同时间后处于某一状态(以s5为例)时采取无维护、预防性维护和必要性维护措施对应的Q值。由于桥梁服役50年之后才可能出现状态s5(参考图1),因此图5中横坐标从50年开始。同样地,对比Q值可以确定不同服役时间下的最优维护措施。具体而言,若桥梁的剩余寿命为30年、40年或50年,应采取必要性维护措施将其修复至完好状态;若桥梁仅剩余10年或20年的寿命,则仅需采取预防性维护措施修复其保护层。最后,图6给出了完整的最优全寿命维护策略,其中,三种颜色由浅至深分别代表无维护、预防性维护和必要性维护。给定桥梁的任一状态和服役时间组合,便可以确定对应的最优维护措施。The whole-life maintenance strategy is optimized with the Q-learning algorithm of 1×10 5 iterative sequences. The algorithm flow is shown in Figure 2. Both the learning rate α and the greedy parameter ε adopt an exponential decay model, and the decay constant is taken as 1×10 -4 . In order to check the convergence of the calculation results, Fig. 3 shows the iterative process of the Q value corresponding to the bridge state-maintenance measure pair (s 3 , no maintenance) after the bridge has been in service for 50 years. It can be seen that the Q value has converged after about 4×10 4 sequence iterations. Figure 4 shows the Q-values corresponding to the no-maintenance, preventive maintenance and necessary maintenance measures when the bridge is in different states after 50 years of service. By comparing the Q values, the optimal maintenance measures for each state can be determined. Specifically, if the bridge is in state s 0 , s 1 or s 2 , no maintenance action is required; if the bridge is in state s 3 or s 4 , preventive maintenance measures should be taken to repair its protective layer; for the remaining states , the necessary maintenance measures should be taken to restore the bridge to a good state. Figure 5 shows the Q value corresponding to no maintenance, preventive maintenance and necessary maintenance measures when the bridge is in a certain state (taking s 5 as an example) after different periods of service. Since the state s 5 (refer to Figure 1) may not appear until the bridge has been in service for 50 years, the abscissa in Figure 5 starts from 50 years. Similarly, comparing the Q value can determine the optimal maintenance measures under different service times. Specifically, if the remaining life of the bridge is 30, 40 or 50 years, necessary maintenance measures should be taken to restore it to a good state; if the bridge has only 10 or 20 years of life remaining, only preventive measures should be taken. Maintenance measures repair its protective layer. Finally, Figure 6 presents the complete optimal life-cycle maintenance strategy, where the three colors from light to dark represent no maintenance, preventive maintenance and essential maintenance, respectively. Given any combination of bridge state and service time, the corresponding optimal maintenance measures can be determined.

以上仅描述了本发明的基本原理和优选实施方式,本领域人员可以根据上述描述做出许多变化和改进,这些变化和改进应该属于本发明的保护范围。The above only describes the basic principles and preferred embodiments of the present invention, and those skilled in the art can make many changes and improvements based on the above description, and these changes and improvements should belong to the protection scope of the present invention.

Claims (7)

1.一种基于半马尔科夫决策过程的桥梁全寿命维护策略优化方法,其特征在于,包括步骤:1. a bridge full-life maintenance strategy optimization method based on a semi-Markov decision process, is characterized in that, comprises the steps: S1、基于退化后的桥梁强度和车辆荷载的年极值分布,计算桥梁的年失效概率及对应的可靠度指标,并根据可靠度指标定义桥梁状态;S1. Based on the annual extreme value distribution of the degraded bridge strength and vehicle load, calculate the annual failure probability of the bridge and the corresponding reliability index, and define the bridge state according to the reliability index; S2、仅考虑锈蚀引起的可靠度指标退化,且设定退化过程符合伽马过程;S2. Only the reliability index degradation caused by corrosion is considered, and the degradation process is set to conform to the gamma process; S3、采用时变可靠度方法计算决策区间内桥梁的失效概率;S3. Use the time-varying reliability method to calculate the failure probability of the bridge within the decision interval; S4、分别定义桥梁的状态空间和桥梁维护的动作空间,每年对桥梁进行一次健康检测,根据检测结果判断桥梁保护层的退化情况并确定桥梁状态,根据桥梁状态确定采取的决策,该决策问题采用半马尔科夫决策过程模型;S4. Define the state space of the bridge and the action space of the bridge maintenance, respectively, conduct a health inspection of the bridge every year, judge the degradation of the bridge protective layer and determine the bridge state according to the inspection results, and determine the decision to be taken according to the bridge state. Semi-Markov decision process model; 半马尔科夫决策过程模型如下:The semi-Markov decision process model is as follows:
Figure FDA0003508329940000011
Figure FDA0003508329940000011
上式中,
Figure FDA0003508329940000012
为最优值函数,表示t时刻之后的最小总期望贴现成本;πt(s)为由策略πt和状态s确定的维护措施;Π为桥梁全寿命维护策略空间;
In the above formula,
Figure FDA0003508329940000012
is the optimal value function, representing the minimum total expected discounted cost after time t; π t (s) is the maintenance measure determined by the policy π t and the state s; Π is the bridge life-cycle maintenance strategy space;
s′为桥梁的下一个可能状态,其值取决于当前状态和当前维护措施,具体是:1)若当前不采取维护措施,则s′比s高一个等级但不超过最高等级,2)若当前采取预防性维护措施,则s′=s,3)若当前采取必要性维护措施,则s′=s0,s0为桥梁完好状态;s' is the next possible state of the bridge, and its value depends on the current state and the current maintenance measures, specifically: 1) if no maintenance measures are currently taken, s' is one level higher than s but does not exceed the highest level, 2) if If preventive maintenance measures are currently taken, then s'=s, 3) If necessary maintenance measures are currently taken, then s'=s 0 , and s 0 is the intact state of the bridge; Δt为相邻两个决策时刻之间的时间间隔,其概率分布p(Δt|πt(s))由当前维护措施决定,具体是:1)若当前不采取维护措施,则Δt为桥梁在当前状态上的停留时间,其概率分布由可靠度指标退化过程确定,2)若当前采取预防性维护措施或必要性维护措施,则Δt为锈蚀起始时间,且服从正态分布;Δt is the time interval between two adjacent decision moments, and its probability distribution p(Δt|π t (s)) is determined by the current maintenance measures, specifically: 1) If no maintenance measures are currently taken, then Δt is the bridge The probability distribution of the residence time in the current state is determined by the degradation process of the reliability index. 2) If preventive maintenance measures or necessary maintenance measures are currently taken, Δt is the start time of rust and obeys a normal distribution; T为桥梁设计使用年限;λ为贴现率;c0为桥梁维护费用,由桥梁状态和维护措施决定;c1为时间间隔Δt内因桥梁倒塌造成的期望贴现间接损失,可按下式计算:T is the design life of the bridge; λ is the discount rate; c 0 is the bridge maintenance cost, which is determined by the bridge state and maintenance measures; c 1 is the expected discounted indirect loss caused by bridge collapse within the time interval Δt, which can be calculated as follows:
Figure FDA0003508329940000013
Figure FDA0003508329940000013
上式中,Cf为因桥梁失效造成的间接经济损失;i和j都为表示时间的求和变量;βi和βj分别表示第i和第j年的可靠度指标;In the above formula, C f is the indirect economic loss caused by bridge failure; i and j are both summation variables representing time; β i and β j represent the reliability index of the i-th and j-th years, respectively; S5、采用Q学习算法求解步骤S4所述的半马尔科夫决策过程模型,获得桥梁最优全寿命维护策略。S5 , using the Q-learning algorithm to solve the semi-Markov decision process model described in step S4 to obtain the optimal full-life maintenance strategy for the bridge.
2.根据权利要求1所述的方法,其特征在于,所述步骤S1中,退化后的桥梁强度的获取方式如下:2. The method according to claim 1, wherein, in the step S1, the method of obtaining the degraded bridge strength is as follows: 对桥梁进行健康检测,获得关于桥梁锈蚀程度的数据,然后通过有限元分析或截面承载力公式计算得出退化后的桥梁强度。Perform health inspection on bridges to obtain data on the degree of bridge corrosion, and then calculate the degraded bridge strength through finite element analysis or section bearing capacity formula. 3.根据权利要求1所述的方法,其特征在于,所述步骤S1中,车辆荷载的年极值分布根据实测数据统计得出或参考桥梁设计规范得出。3 . The method according to claim 1 , wherein, in the step S1 , the annual extreme value distribution of the vehicle load is obtained statistically according to the measured data or obtained with reference to the bridge design specification. 4 . 4.根据权利要求1所述的方法,其特征在于,所述步骤S1中,计算可靠度指标及定义桥梁状态,具体包括如下:4. The method according to claim 1, wherein, in the step S1, calculating the reliability index and defining the bridge state, specifically including the following: 依次设桥梁的年失效概率为
Figure FDA0003508329940000021
可靠度指标为β,所述可靠度指标β的计算为:
In turn, the annual failure probability of the bridge is set as
Figure FDA0003508329940000021
The reliability index is β, and the calculation of the reliability index β is:
Figure FDA0003508329940000022
Figure FDA0003508329940000022
上式中,Φ(·)为标准正态分布函数,由β可以得到可靠度指标退化系数D:In the above formula, Φ( ) is the standard normal distribution function, and the reliability index degradation coefficient D can be obtained from β:
Figure FDA0003508329940000023
Figure FDA0003508329940000023
上式中,β0为桥梁的初始可靠度指标;In the above formula, β 0 is the initial reliability index of the bridge; 将可靠度指标退化系数从0到1均匀离散为N等份,从而得到N+1个离散值,对应N+1个桥梁状态,分别为s0、s1、……、sN,其中,s0和sN分别对应完好状态和可靠度指标为0的状态,s1至sN-1处于完好状态与可靠度指标为0的状态之间,且对应的可靠度指标依次降低。The reliability index degradation coefficient is uniformly dispersed into N equal parts from 0 to 1, so as to obtain N+1 discrete values, corresponding to N+1 bridge states, which are respectively s 0 , s 1 , ..., s N , among which, s 0 and s N correspond to the intact state and the state where the reliability index is 0 respectively, and s 1 to s N-1 are between the intact state and the state where the reliability index is 0, and the corresponding reliability indexes decrease in turn.
5.根据权利要求4所述的方法,其特征在于,所述步骤S2中,仅考虑锈蚀引起的可靠度指标退化,且设定退化过程符合伽马过程,即退化系数D(t)满足以下条件:5. The method according to claim 4, characterized in that, in the step S2, only the reliability index degradation caused by corrosion is considered, and the degradation process is set to conform to the gamma process, that is, the degradation coefficient D(t) satisfies the following condition: 1)D(0)=0依概率1成立;1) D(0)=0 is established according to probability 1; 2)D(t)为独立增量过程;2) D(t) is an independent incremental process; 3)对任意t2>t1≥0,D(t2)-D(t1)服从形状参数为v(t2)-v(t1)、尺度参数为u的伽马分布;3) For any t 2 >t 1 ≥0, D(t 2 )-D(t 1 ) obeys a gamma distribution with shape parameter v(t 2 )-v(t 1 ) and scale parameter u; 形状参数为v、尺度参数为u的伽马分布的概率密度函数如下:The probability density function of a gamma distribution with shape parameter v and scale parameter u is as follows:
Figure FDA0003508329940000031
Figure FDA0003508329940000031
上式中,
Figure FDA0003508329940000032
为伽马函数,x为伽马分布概率密度函数的自变量,y为伽马函数中的积分变量;
In the above formula,
Figure FDA0003508329940000032
is the gamma function, x is the independent variable of the probability density function of the gamma distribution, and y is the integral variable in the gamma function;
进一步设形状参数随时间线性增加,即,It is further assumed that the shape parameter increases linearly with time, that is, v(t)=ζtv(t)=ζt 上式中,ζ为环境常数。In the above formula, ζ is the environmental constant.
6.根据权利要求5所述的方法,其特征在于,所述步骤S3中,设相邻两个决策时刻之间的时间间隔为Δt年,且先后两个决策时刻桥梁的可靠度指标分别为β0(1-D1)和β0(1-D2),设该决策区间内可靠度指标逐年阶梯下降,则第i年的可靠度指标为:6. The method according to claim 5, wherein, in the step S3, the time interval between two adjacent decision-making moments is set to be Δt years, and the reliability indexes of the bridges at the two successive decision-making moments are respectively β 0 (1-D 1 ) and β 0 (1-D 2 ), assuming that the reliability index in the decision-making interval decreases year by year, the reliability index of the i-th year is:
Figure FDA0003508329940000033
Figure FDA0003508329940000033
故,桥梁在Δt年内的失效概率
Figure FDA0003508329940000034
为:
Therefore, the failure probability of the bridge in Δt years
Figure FDA0003508329940000034
for:
Figure FDA0003508329940000035
Figure FDA0003508329940000035
7.根据权利要求6所述的方法,其特征在于,所述步骤S4中,设桥梁的状态空间为S={s0,…,sN},桥梁维护的动作空间为A={a0,a1,a2},其中,a0、a1、a2分别代表无维护、预防性维护、必要性维护;每年对桥梁进行一次健康检测,当发现桥梁保护层完全失效或桥梁的退化程度提高一级,则决定是否需要采取预防性维护措施或必要性维护措施。7. The method according to claim 6, wherein in the step S4, the state space of the bridge is set to be S={s 0 ,...,s N }, and the action space of the bridge maintenance is A={a 0 , a 1 , a 2 }, where a 0 , a 1 , and a 2 represent no maintenance, preventive maintenance, and necessary maintenance, respectively; a health inspection is carried out on the bridge every year. If the level is increased by one level, it is determined whether preventive maintenance measures or necessary maintenance measures are required.
CN202011561626.1A 2020-12-25 2020-12-25 Bridge full-life maintenance strategy optimization method based on half Markov decision process Active CN112632677B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011561626.1A CN112632677B (en) 2020-12-25 2020-12-25 Bridge full-life maintenance strategy optimization method based on half Markov decision process

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011561626.1A CN112632677B (en) 2020-12-25 2020-12-25 Bridge full-life maintenance strategy optimization method based on half Markov decision process

Publications (2)

Publication Number Publication Date
CN112632677A CN112632677A (en) 2021-04-09
CN112632677B true CN112632677B (en) 2022-05-10

Family

ID=75324899

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011561626.1A Active CN112632677B (en) 2020-12-25 2020-12-25 Bridge full-life maintenance strategy optimization method based on half Markov decision process

Country Status (1)

Country Link
CN (1) CN112632677B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117057256B (en) * 2023-10-11 2024-03-15 中交公路长大桥建设国家工程研究中心有限公司 Bridge structure maintenance strategy optimization method, device, equipment and readable storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9581570B2 (en) * 2011-02-10 2017-02-28 University Of South Carolina Determination of the remaining life of a structural system based on acoustic emission signals
CN107194476B (en) * 2017-05-26 2019-12-10 中国南方电网有限责任公司超高压输电公司天生桥局 Method for Transformer Aging Prevention and Maintenance Strategy Based on Semi-Markov Chain
CN107958310B (en) * 2017-12-07 2019-03-19 北京航空航天大学 A kind of optimal Maintenance Design method of existing structure based on section time-varying reliability
CN111368863A (en) * 2018-12-26 2020-07-03 上海海事大学 An intelligent method for evaluating the health status of key structures of quay cranes
CN110852610A (en) * 2019-11-08 2020-02-28 上海市城市建设设计研究总院(集团)有限公司 Road, bridge and tunnel health state and maintenance cost measuring and calculating method based on Markov model

Also Published As

Publication number Publication date
CN112632677A (en) 2021-04-09

Similar Documents

Publication Publication Date Title
Frangopol et al. Maintenance and management of civil infrastructure based on condition, safety, optimization, and life-cycle cost
Liu et al. Condition-based maintenance for systems with aging and cumulative damage based on proportional hazards model
CN105975735B (en) A kind of modeling method for power equipment health state evaluation
CN103793854B (en) The overhead transmission line operation risk informatization evaluation method that Multiple Combination is optimized
CN110287543B (en) Method for predicting service life of relay protection device
CN111832939B (en) Method for evaluating overhaul quality of main equipment of extra-high voltage direct current transmission system
Val Effect of different limit states on life-cycle cost of RC structures in corrosive environment
CN111582542A (en) Power load prediction method and system based on abnormal restoration
CN114154716B (en) Enterprise energy consumption prediction method and device based on graph neural network
CN112632677B (en) Bridge full-life maintenance strategy optimization method based on half Markov decision process
Frangopol et al. Structural performance indicators for bridges
CN118095086A (en) Intelligent decision method for bridge full life cycle maintenance based on reinforcement learning
CN109559019A (en) A kind of Electrical Power System Dynamic safety evaluation method based on risk index
CN109359742B (en) Method for generating preventive maintenance period of subway subsystem
CN113919204B (en) Comprehensive importance analysis method for availability of multi-state manufacturing system
CN110852610A (en) Road, bridge and tunnel health state and maintenance cost measuring and calculating method based on Markov model
CN114154298A (en) Bridge maintenance scheme multi-objective optimization method based on time-varying reliability
Brüske et al. Value of pre‐construction proof loading information for structural design
CN110990916B (en) Integration method for evaluating and predicting long-term operation safety of dam by considering hysteresis effect
CN118246246A (en) Transformer life prediction method and system based on multi-source data fusion
CN111341396A (en) Method and system for evaluating material corrosion safety in atmospheric environment
Li et al. A comprehensive reliability assessment method for existing railway bridges based on Bayesian theory
CN110674951A (en) A state maintenance method for distribution automation terminal
CN115221731A (en) Transformer life evaluation method based on data fusion and Wiener model
CN114004414B (en) Method for predicting energy load of enterprise coupling Copula and LSTM

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant