[go: up one dir, main page]

CN113055230B - Network reconstruction method and system for information physical system - Google Patents

Network reconstruction method and system for information physical system Download PDF

Info

Publication number
CN113055230B
CN113055230B CN202110255858.2A CN202110255858A CN113055230B CN 113055230 B CN113055230 B CN 113055230B CN 202110255858 A CN202110255858 A CN 202110255858A CN 113055230 B CN113055230 B CN 113055230B
Authority
CN
China
Prior art keywords
node
test
network
nodes
causal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110255858.2A
Other languages
Chinese (zh)
Other versions
CN113055230A (en
Inventor
何斌
严刚
程茜
衣鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tongji University
Original Assignee
Tongji University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tongji University filed Critical Tongji University
Priority to CN202110255858.2A priority Critical patent/CN113055230B/en
Publication of CN113055230A publication Critical patent/CN113055230A/en
Application granted granted Critical
Publication of CN113055230B publication Critical patent/CN113055230B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/12Discovery or management of network topologies

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

本发明公开了一种面向信息物理系统的网络重构方法和系统,利用PC算法初步构建该网络中各节点的因果关系,并结合MCI算法对PC算法中重构出来的连边进行甄别,从而提升重构网络的真实性与可靠性。

Figure 202110255858

The invention discloses a network reconstruction method and system oriented to an information physical system. The PC algorithm is used to preliminarily construct the causal relationship of each node in the network, and the connection edges reconstructed from the PC algorithm are screened in combination with the MCI algorithm. Improve the authenticity and reliability of the reconstructed network.

Figure 202110255858

Description

一种面向信息物理系统的网络重构方法和系统A cyber-physical system-oriented network reconfiguration method and system

技术领域technical field

本发明属于网络分析与重构技术领域,涉及一种面向信息物理系统的网络推断与重构方法。The invention belongs to the technical field of network analysis and reconstruction, and relates to a network inference and reconstruction method oriented to an information physical system.

背景技术Background technique

信息物理系统(cyber physical systems,CPS)是集成计算、通信与控制于一体的下一代智能系统。由于该系统是通过人机交互接口来实现和物理进程的交互,因此人们可以使用网络化空间来进行实时而安全可靠地远程操控系统中的各个物理实体。CPS通常可由三个子系统组成:自主体数据采集与诊断系统——针对物理实体的自身控制,全域信息交互实验系统——针对人机交互接口的开发,以及智能逻辑分析系统——针对物理实体与人机交互接口间数据网络的分析。其中,智能逻辑分析子系统作为实现系统远程控制的关键部分,其整合了CPS系统的全局信息流和逻辑链条以推断因果关系,并能够构建数据网络完整的因果图谱,建立基于大数据统计的多智能体算法模型,从而为智能算法和控制芯片的开发和设计提供科学决策工具。因此,科学合理地实现对CPS系统网络的重构是该系统运作可靠性的重要保障。Cyber physical systems (CPS) are next-generation intelligent systems that integrate computing, communication and control. Since the system realizes the interaction with the physical process through the human-computer interaction interface, people can use the networked space to remotely control various physical entities in the system in real time and safely and reliably. CPS is usually composed of three subsystems: autonomous data acquisition and diagnosis system - for the self-control of physical entities, global information interaction experiment system - for the development of human-computer interaction interfaces, and intelligent logic analysis system - for physical entities and Analysis of data networks between human-computer interaction interfaces. Among them, the intelligent logic analysis subsystem is a key part to realize the remote control of the system. It integrates the global information flow and logic chain of the CPS system to infer the causal relationship, and can build a complete causal graph of the data network, and establish a multi-dimensional system based on big data statistics. The intelligent body algorithm model provides scientific decision-making tools for the development and design of intelligent algorithms and control chips. Therefore, the scientific and rational realization of the reconfiguration of the CPS system network is an important guarantee for the reliability of the system's operation.

针对网络重构问题,国内外研究学者对其做过深入研究,并提出了许多方法。Mantegna提出的相关性方法可根据变量间的变化关系,利用皮尔逊相关系数来构建变量间的生成树结构,从而对变量结构进行划分重组,但该方法只能构建无向图结构,因此并不能对网络节点间的因果性进行推断;Granger为了重构因果网络提出了一种格兰杰因果检验方法,利用“两个事件发生的先后顺序”的统计显著性来进行因果推断,但该方法只能用于线性系统,而无法分析实际系统中普遍存在的非线性关系;Sugihara提出的收敛交叉映射方法很好地解决了非线性系统中的网络重构问题,但对于高维(网络节点数目较多)的重构问题所需的计算量较大,求解效率有待提升。Aiming at the problem of network reconstruction, domestic and foreign researchers have done in-depth research on it, and proposed many methods. The correlation method proposed by Mantegna can use the Pearson correlation coefficient to build a spanning tree structure between variables according to the changing relationship between variables, so as to divide and reorganize the variable structure, but this method can only build an undirected graph structure, so it cannot Infer the causality between network nodes; in order to reconstruct the causal network, Granger proposed a Granger causality test method, which uses the statistical significance of "the sequence of two events" to conduct causal inference, but this method only It can be used in linear systems, but cannot analyze the nonlinear relationship ubiquitous in actual systems; the convergent cross-mapping method proposed by Sugihara can solve the problem of network reconstruction in nonlinear systems, but for high-dimensional (the number of network nodes is relatively small) A large amount of computation is required for reconstruction problems with multiple), and the solution efficiency needs to be improved.

PC算法作为近年新兴的因果网络推断算法,可以有效地解决高维的网络重构问题。其思路为:首先将该网络视为全连通图,而后针对任意一个节点,在迭代过程中通过条件独立性测试来不断剔除不相关(分离)节点的连边,最后在满足终止条件时可以同时构建出该节点完整的因果网络。由于不需要两两节点逐个比对,因而该方法有效地提高了网络重构的计算效率。As an emerging causal network inference algorithm in recent years, PC algorithm can effectively solve the problem of high-dimensional network reconstruction. The idea is: first treat the network as a fully connected graph, then for any node, through the conditional independence test in the iterative process to continuously eliminate the edges of irrelevant (separate) nodes, and finally when the termination condition is satisfied, it can be simultaneously Construct the complete causal network of the node. This method effectively improves the computational efficiency of network reconstruction because there is no need to compare pairs of nodes one by one.

一般来说,网络重构方法(包括PC算法)不可避免地会重构出虚假的,即实际上不存在的连边。瞬态条件独立性(momentary conditional independence,MCI)检验算法可以对重构出的连边进行进一步校验,通过识别由于间接影响或共同驱动因素而产生的虚假连边,对其进行排除,从而进一步提升网络重构的性能与可靠性。In general, network reconstruction methods (including PC algorithms) inevitably reconstruct spurious, that is, connections that do not actually exist. The transient conditional independence (MCI) test algorithm can further verify the reconstructed edges by identifying spurious edges caused by indirect effects or common driving factors, and eliminating them, thereby further Improve the performance and reliability of network reconfiguration.

针对上述方法的特点与局限性,CPS系统的网络重构问题而言亟需一种更加有效的因果推断方法,以真实地还原CPS系统中无人自主体间的相互作用结构。In view of the characteristics and limitations of the above methods, a more effective causal inference method is urgently needed for the network reconstruction of the CPS system, so as to truly restore the interaction structure between the unmanned and autonomous agents in the CPS system.

发明内容SUMMARY OF THE INVENTION

有鉴于现有技术的上述缺陷,本发明提出一种PCMCI方法,利用PC算法初步构建该网络中各节点的因果关系,并结合MCI算法对PC算法中重构出来的连边进行甄别,从而提升重构网络的真实性与可靠性。In view of the above-mentioned defects of the prior art, the present invention proposes a PCMCI method, which uses the PC algorithm to initially construct the causal relationship of each node in the network, and combines the MCI algorithm to screen the reconstructed edges in the PC algorithm, thereby improving the performance of the network. Reconstruct the authenticity and reliability of the network.

为实现上述目的,本发明在第一方面提供了一种面向信息物理系统的网络重构方法,包括步骤:In order to achieve the above object, the present invention provides, in a first aspect, a cyber-physical system-oriented network reconfiguration method, comprising the steps of:

根据收集到的网络中的节点的时序状态信息,构建全连通的时序网络图;According to the collected timing state information of nodes in the network, construct a fully connected timing network diagram;

针对任意一个节点,将与该节点不相关的节点进行剔除;For any node, the nodes that are not related to the node are eliminated;

进行迭代过程,在每次迭代过程中更新检验条件,并根据条件独立性检验算法不断剔除与该节点相独立的其他节点;Carry out an iterative process, update the test conditions in each iteration process, and continuously eliminate other nodes independent of the node according to the conditional independence test algorithm;

在满足终止条件时跳出迭代过程;Jump out of the iterative process when the termination condition is met;

根据瞬态条件独立性检验算法对各连边进行评估,并删除虚假的连边。Each connected edge is evaluated according to a transient conditional independence test algorithm and spurious connections are removed.

进一步地,包括以下具体步骤:Further, the following specific steps are included:

(1)初始化PC算法的参数:设置输入状态时间序列X=(X1,X2,...,XN)、最大因果滞后时间τmax、最大可存在因果关系的父节点数目(最大迭代轮数)Nc,mac、条件独立性检验次数NCI以及终止条件阈值αPC(1) Initialize the parameters of the PC algorithm: set the input state time series X=(X 1 , X 2 ,..., X N ), the maximum causal lag time τ max , the maximum number of parent nodes that can have a causal relationship (maximum iteration number of rounds) N c,mac , number of conditional independence tests N CI and termination condition threshold α PC ;

(2)任意选取某一节点

Figure BDA0002967072310000021
进行因果关系重构:选取某一节点
Figure BDA0002967072310000022
进行初始化,设置其父节点
Figure BDA0002967072310000023
为所有其他节点,设置其与任意其他节点
Figure BDA0002967072310000024
间的独立检验统计量
Figure BDA0002967072310000025
设置初始迭代轮数Nc=0;(2) Arbitrarily select a node
Figure BDA0002967072310000021
Perform causal reconstruction: pick a node
Figure BDA0002967072310000022
Initialize, set its parent node
Figure BDA0002967072310000023
For all other nodes, set it to be the same as any other node
Figure BDA0002967072310000024
independent test statistic between
Figure BDA0002967072310000025
Set the initial iteration round number N c =0;

(3)进行第Nc轮迭代:从

Figure BDA0002967072310000026
中选取一个在本轮迭代过程中未进行CI检测的节点
Figure BDA0002967072310000027
Figure BDA0002967072310000028
在条件
Figure BDA0002967072310000029
下进行CI检验,其中
Figure BDA00029670723100000210
且满足
Figure BDA00029670723100000211
得到假设检验的P值与独立性大小分别记为(p,I);(3) Perform the N- th round of iteration: from
Figure BDA0002967072310000026
Select a node that has not undergone CI detection in this round of iterations
Figure BDA0002967072310000027
right
Figure BDA0002967072310000028
in condition
Figure BDA0002967072310000029
The CI test is carried out under the
Figure BDA00029670723100000210
and satisfy
Figure BDA00029670723100000211
The P value and independence of the hypothesis test are recorded as (p, I) respectively;

(4)若p>αPC,则从

Figure BDA00029670723100000212
中剔除
Figure BDA00029670723100000213
的节点,并根据
Figure BDA00029670723100000214
Figure BDA00029670723100000215
进行降序排序;否则返回步骤(3);(4) If p>α PC , then from
Figure BDA00029670723100000212
medium culling
Figure BDA00029670723100000213
node, and according to
Figure BDA00029670723100000214
Give
Figure BDA00029670723100000215
Sort in descending order; otherwise, return to step (3);

(5)若

Figure BDA00029670723100000216
则节点
Figure BDA00029670723100000217
的因果关系重构完成;否则令Nc=Nc+1,并返回执行步骤(3)进行新一轮迭代;(5) If
Figure BDA00029670723100000216
then the node
Figure BDA00029670723100000217
The reconstruction of the causal relationship is completed; otherwise, let N c =N c +1, and return to step (3) for a new round of iteration;

(6)判断是否所有节点的因果关系均已完成重构,若完成则进入步骤(7);否则返回步骤(2);(6) Judging whether the causal relationship of all nodes has been reconstructed, if completed, enter step (7); otherwise, return to step (2);

(7)对重构的因果网络进行MCI检验,剔除虚假连边。(7) MCI test is performed on the reconstructed causal network, and false connections are eliminated.

进一步地,步骤(3)中采用高斯过程回归与距离相关系数来进行独立性检验,其具体过程如下:Further, in step (3), the Gaussian process regression and the distance correlation coefficient are used to carry out the independence test, and the specific process is as follows:

(3a)利用贝叶斯非参数回归模型对

Figure BDA0002967072310000031
分别进行回归分析,并计算其P值,其模型如下式所示:(3a) Using a Bayesian nonparametric regression model to
Figure BDA0002967072310000031
Regression analysis was carried out respectively, and its P value was calculated, and its model was as follows:

Figure BDA0002967072310000032
Figure BDA0002967072310000032

(3b)计算在该模型下

Figure BDA0002967072310000033
的残差为:(3b) Calculated under this model
Figure BDA0002967072310000033
The residuals are:

Figure BDA0002967072310000034
Figure BDA0002967072310000034

Figure BDA0002967072310000035
Figure BDA0002967072310000035

(3c)对于

Figure BDA0002967072310000036
Figure BDA0002967072310000037
两者间的独立性可通过
Figure BDA0002967072310000038
Figure BDA0002967072310000039
来计算,若I=0则表示两变量是独立的:(3c) For
Figure BDA0002967072310000036
and
Figure BDA0002967072310000037
The independence between the two can be achieved by
Figure BDA0002967072310000038
and
Figure BDA0002967072310000039
To calculate, if I=0, it means that the two variables are independent:

Figure BDA00029670723100000310
Figure BDA00029670723100000310

其中

Figure BDA00029670723100000311
的定义分别如下:in
Figure BDA00029670723100000311
are defined as follows:

Figure BDA00029670723100000312
Figure BDA00029670723100000312

Figure BDA00029670723100000313
Figure BDA00029670723100000313

Figure BDA00029670723100000314
Figure BDA00029670723100000314

其中du、dv分别表示向量u、v的维数;where d u and d v represent the dimensions of the vectors u and v, respectively;

(3d)更新

Figure BDA00029670723100000315
(3d) Update
Figure BDA00029670723100000315

进一步地,步骤(7)中对任意一个节点

Figure BDA00029670723100000316
的连边检验具体过程如下:Further, in step (7), to any node
Figure BDA00029670723100000316
The specific process of the edge connection test is as follows:

(7a)对任意节点

Figure BDA00029670723100000317
计算
Figure BDA00029670723100000318
(7a) For any node
Figure BDA00029670723100000317
calculate
Figure BDA00029670723100000318

(7b)选取任意节点

Figure BDA00029670723100000319
定义
Figure BDA00029670723100000320
Figure BDA00029670723100000321
的前pX个节点构成的集合;(7b) Select any node
Figure BDA00029670723100000319
definition
Figure BDA00029670723100000320
for
Figure BDA00029670723100000321
The set consisting of the first p X nodes of ;

(7c)对

Figure BDA00029670723100000322
在条件
Figure BDA00029670723100000323
下进行CI检验,其过程同第三步中所述方法;(7c) Yes
Figure BDA00029670723100000322
in condition
Figure BDA00029670723100000323
The CI test is carried out below, and its process is the same as that described in the third step;

(7d)根据CI检验结果,剔除虚假连边;(7d) According to the CI test result, remove the false connection;

(7f)选取其他节点重复步骤(7b)~(7d),若所有节点均已MCI检验完毕,则获得最终PCMCI重构的因果网络。(7f) Select other nodes and repeat steps (7b) to (7d). If all nodes have completed the MCI check, the final PCMCI-reconstructed causal network is obtained.

本发明在第二方面提供了一种面向信息物理系统的网络重构系统,包括In a second aspect, the present invention provides a cyber-physical system-oriented network reconfiguration system, comprising:

时序网络图构建模块,用于根据收集到的网络中的节点的时序状态信息,构建全连通的时序网络图;The timing network diagram building module is used to construct a fully connected timing network diagram according to the collected timing state information of the nodes in the network;

因果网络重构模块,用于针对任意一个节点,将与该节点不相关的节点进行剔除;进行迭代过程,在每次迭代过程中更新检验条件,并根据条件独立性检验算法不断剔除与该节点相独立的其他节点;在满足终止条件时跳出迭代过程;The causal network reconstruction module is used to eliminate any node that is not related to the node; iterative process is performed, the test conditions are updated in each iteration process, and the node is continuously eliminated according to the conditional independence test algorithm Other nodes that are independent of each other; jump out of the iterative process when the termination condition is met;

因果网络检验模块,用于根据瞬态条件独立性检验算法对各连边进行评估,并删除虚假的连边。The causal network checking module is used to evaluate each connected edge according to the transient conditional independence test algorithm and remove false connected edges.

进一步地,时序网络图构建模块被设置为执行如下步骤:Further, the timing network diagram building block is configured to perform the following steps:

(1)接收初始化PC算法的参数设置:状态时间序列X=(X1,X2,...,XN)、最大因果滞后时间τmax、最大可存在因果关系的父节点数目(最大迭代轮数)Nc,max、条件独立性检验次数NCI以及终止条件阈值αPC(1) Receive parameter settings for initializing the PC algorithm: state time series X=(X 1 , X 2 ,..., X N ), maximum causal lag time τ max , maximum number of parent nodes that can have causal relationships (maximum iteration number of rounds) N c,max , number of conditional independence tests N CI and termination condition threshold α PC ;

因果网络重构模块被设置为执行如下步骤:The causal network reconstruction module is set up to perform the following steps:

(2)任意选取某一节点

Figure BDA0002967072310000041
进行因果关系重构:选取某一节点
Figure BDA0002967072310000042
进行初始化,设置其父节点
Figure BDA0002967072310000043
为所有其他节点,设置其与任意其他节点
Figure BDA0002967072310000044
间的独立检验统计量
Figure BDA0002967072310000045
设置初始迭代轮数Nc=0;(2) Arbitrarily select a node
Figure BDA0002967072310000041
Perform causal reconstruction: pick a node
Figure BDA0002967072310000042
Initialize, set its parent node
Figure BDA0002967072310000043
For all other nodes, set it to be the same as any other node
Figure BDA0002967072310000044
independent test statistic between
Figure BDA0002967072310000045
Set the initial iteration round number N c =0;

(3)进行第Nc轮迭代:从

Figure BDA0002967072310000046
中选取一个在本轮迭代过程中未进行CI检测的节点
Figure BDA0002967072310000047
Figure BDA0002967072310000048
在条件
Figure BDA0002967072310000049
下进行CI检验,其中
Figure BDA00029670723100000410
且满足
Figure BDA00029670723100000411
得到假设检验的P值与独立性大小分别记为(p,I);(3) Perform the N- th round of iteration: from
Figure BDA0002967072310000046
Select a node that has not undergone CI detection in this round of iterations
Figure BDA0002967072310000047
right
Figure BDA0002967072310000048
in condition
Figure BDA0002967072310000049
The CI test is carried out under the
Figure BDA00029670723100000410
and satisfy
Figure BDA00029670723100000411
The P value and independence of the hypothesis test are recorded as (p, I) respectively;

(4)若p>αPC,则从

Figure BDA00029670723100000412
中剔除
Figure BDA00029670723100000413
的节点,并根据
Figure BDA00029670723100000414
Figure BDA00029670723100000415
进行降序排序;否则返回步骤(3);(4) If p>α PC , then from
Figure BDA00029670723100000412
medium culling
Figure BDA00029670723100000413
node, and according to
Figure BDA00029670723100000414
Give
Figure BDA00029670723100000415
Sort in descending order; otherwise, return to step (3);

(5)若

Figure BDA00029670723100000416
则节点
Figure BDA00029670723100000417
的因果关系重构完成;否则令Nc=Nc+1,并返回执行步骤(3)进行新一轮迭代;(5) If
Figure BDA00029670723100000416
then the node
Figure BDA00029670723100000417
The reconstruction of the causal relationship is completed; otherwise, let N c =N c +1, and return to step (3) for a new round of iteration;

(6)判断是否所有节点的因果关系均已完成重构,若完成则进入步骤(7);否则返回步骤(2);(6) Judging whether the causal relationship of all nodes has been reconstructed, if completed, enter step (7); otherwise, return to step (2);

因果网络检验模块被设置为执行如下步骤:The causal network inspection module is set up to perform the following steps:

(7)对重构的因果网络进行MCI检验,剔除虚假连边。(7) MCI test is performed on the reconstructed causal network, and false connections are eliminated.

进一步地,因果网络重构模块被设置为在执行步骤(3)时,采用高斯过程回归与距离相关系数来进行独立性检验,其具体过程如下:Further, the causal network reconstruction module is set to use Gaussian process regression and distance correlation coefficient to carry out the independence test when performing step (3), and the specific process is as follows:

(3a)利用贝叶斯非参数回归模型对

Figure BDA00029670723100000418
分别进行回归分析,并计算其P值,其模型如下式所示:(3a) Using a Bayesian nonparametric regression model to
Figure BDA00029670723100000418
Regression analysis was carried out respectively, and its P value was calculated, and its model was as follows:

Figure BDA00029670723100000419
Figure BDA00029670723100000419

(3b)计算在该模型下

Figure BDA00029670723100000420
的残差为:(3b) Calculated under this model
Figure BDA00029670723100000420
The residuals are:

Figure BDA00029670723100000421
Figure BDA00029670723100000421

Figure BDA00029670723100000422
Figure BDA00029670723100000422

(3c)对于

Figure BDA00029670723100000423
Figure BDA00029670723100000424
两者间的独立性可通过
Figure BDA00029670723100000425
Figure BDA00029670723100000426
来计算,若I=0则表示两变量是独立的:(3c) For
Figure BDA00029670723100000423
and
Figure BDA00029670723100000424
The independence between the two can be achieved by
Figure BDA00029670723100000425
and
Figure BDA00029670723100000426
To calculate, if I=0, it means that the two variables are independent:

Figure BDA00029670723100000427
Figure BDA00029670723100000427

其中

Figure BDA00029670723100000428
的定义分别如下:in
Figure BDA00029670723100000428
are defined as follows:

Figure BDA00029670723100000429
Figure BDA00029670723100000429

Figure BDA00029670723100000430
Figure BDA00029670723100000430

Figure BDA0002967072310000051
Figure BDA0002967072310000051

其中du、dv分别表示向量u、v的维数;where d u and d v represent the dimensions of the vectors u and v, respectively;

(3d)更新

Figure BDA0002967072310000052
(3d) Update
Figure BDA0002967072310000052

进一步地,因果网络检验模块被设置为在执行步骤(7)时,对任意一个节点

Figure BDA0002967072310000053
的连边检验具体过程如下:Further, the causal network checking module is set to perform step (7), for any node
Figure BDA0002967072310000053
The specific process of the edge connection test is as follows:

(7a)对任意节点

Figure BDA0002967072310000054
计算
Figure BDA0002967072310000055
(7a) For any node
Figure BDA0002967072310000054
calculate
Figure BDA0002967072310000055

(7b)选取任意节点

Figure BDA0002967072310000056
定义
Figure BDA0002967072310000057
Figure BDA0002967072310000058
的前pX个节点构成的集合;(7b) Select any node
Figure BDA0002967072310000056
definition
Figure BDA0002967072310000057
for
Figure BDA0002967072310000058
The set consisting of the first p X nodes of ;

(7c)对

Figure BDA0002967072310000059
在条件
Figure BDA00029670723100000510
下进行CI检验,其过程同第三步中所述方法;(7c) Yes
Figure BDA0002967072310000059
in condition
Figure BDA00029670723100000510
The CI test is carried out below, and its process is the same as that described in the third step;

(7d)根据CI检验结果,剔除虚假连边;(7d) According to the CI test result, remove the false connection;

(7f)选取其他节点重复步骤(7b)~(7d),若所有节点均已MCI检验完毕,则获得最终PCMCI重构的因果网络。(7f) Select other nodes and repeat steps (7b) to (7d). If all nodes have completed the MCI check, the final PCMCI-reconstructed causal network is obtained.

本发明针对CPS系统中无人自主体节点数量多、网络复杂度高的情景,设计了一种基于PCMCI算法的因果网络推断方法。该方法既能够解决非线性系统的因果网络重构问题,同时在高维的网络场景下也具有一定的计算速度优势。针对网络重构问题中经常面临的两难窘境——虚假连边数量一般正比于重构出的真实因果关系数量,为了保证在尽可能完整地重构出真实因果网络的条件下减少虚假连边的数量,本发明采用MCI检验对虚假连边进一步加以剔除,弥补了PC算法虚假连边较多的缺陷,解决了CPS系统的复杂网络重构问题。该方法同时也具有可移植性,适用于其它网络情景下的因果推断问题。The present invention designs a causal network inference method based on the PCMCI algorithm for the situation that the number of unmanned autonomous nodes is large and the network complexity is high in the CPS system. This method can not only solve the problem of causal network reconstruction of nonlinear systems, but also has certain computational speed advantages in high-dimensional network scenarios. Aiming at the dilemma often faced in network reconstruction problems - the number of false connections is generally proportional to the number of reconstructed true causal relationships, in order to ensure that the true causal network is reconstructed as completely as possible to reduce the number of false connections. The invention adopts MCI test to further eliminate false connection edges, which makes up for the defect of PC algorithm that there are many false connection edges, and solves the problem of complex network reconstruction of CPS system. The method is also portable and suitable for causal inference problems in other network scenarios.

本发明重点关注CPS系统的因果网络推断问题,从非线性系统以及高维网络结构出发,开展了网络重构方法研究。该方法对于复杂网络的因果推断与重构具有重要的现实意义。The present invention focuses on the causal network inference problem of the CPS system, and starts from the nonlinear system and the high-dimensional network structure, and conducts research on the network reconstruction method. This method has important practical significance for causal inference and reconstruction of complex networks.

以下将结合附图对本发明的构思、具体结构及产生的技术效果作进一步说明,以充分地了解本发明的目的、特征和效果。The concept, specific structure and technical effects of the present invention will be further described below in conjunction with the accompanying drawings, so as to fully understand the purpose, characteristics and effects of the present invention.

附图说明Description of drawings

图1是现有技术中因果网络图模型图示例;1 is an example of a causal network graph model diagram in the prior art;

图2是本发明采用的因果网络时序图示例;Fig. 2 is an example of a causal network sequence diagram adopted by the present invention;

图3是本发明的一个较佳实施例中的因果网络重构后的时序图;Fig. 3 is the sequence diagram after causal network reconstruction in a preferred embodiment of the present invention;

图4是本发明的一个较佳实施例中的PCMCI方法流程图。FIG. 4 is a flow chart of a PCMCI method in a preferred embodiment of the present invention.

具体实施方式Detailed ways

以下参考说明书附图介绍本发明的多个优选实施例,使其技术内容更加清楚和便于理解。本发明可以通过许多不同形式的实施例来得以体现,本发明的保护范围并非仅限于文中提到的实施例。The following describes several preferred embodiments of the present invention with reference to the accompanying drawings, so as to make its technical content clearer and easier to understand. The present invention can be embodied in many different forms of embodiments, and the protection scope of the present invention is not limited to the embodiments mentioned herein.

下文中各符号的说明:Description of each symbol below:

X:状态时间序列集合;X: state time series collection;

Xi:无人自主体;X i : Unmanned autonomous body;

Figure BDA0002967072310000061
无人自主体在时序图中对应的节点;
Figure BDA0002967072310000061
The node corresponding to the unmanned autonomous body in the sequence diagram;

Figure BDA0002967072310000062
时序图中t时刻之前的所有时序节点集合;
Figure BDA0002967072310000062
The set of all timing nodes before time t in the timing diagram;

Figure BDA0002967072310000063
Figure BDA0002967072310000064
存在“因”关系的父节点;
Figure BDA0002967072310000063
right
Figure BDA0002967072310000064
There is a parent node with a "cause"relationship;

I:独立检验统计量;I: independent test statistic;

Imin(·):独立检验统计量的最小值;I min ( ): the minimum value of the independent test statistic;

Figure BDA0002967072310000065
独立检验条件;
Figure BDA0002967072310000065
independent inspection conditions;

p:独立检验P值;p: independent test P value;

αPC:独立检验显著性阈值;α PC : independent test significance threshold;

τmax:最大因果滞后时间;τ max : maximum causal lag time;

Nc,max:最大父节点数目;N c,max : the maximum number of parent nodes;

NCI:条件独立性检验次数;N CI : the number of conditional independence tests;

Nc:迭代轮数;N c : the number of iteration rounds;

本发明提供了一种面向信息物理系统的网络推断与重构方法。在本方法当中,首先根据收集到的无人自主体(网络中的节点)的时序状态信息,构建全连通的时序网络图;其次针对任意一个节点,将与该节点不相关的节点进行剔除;接下来进行迭代过程,在每次迭代过程中更新检验条件,并根据条件独立性检验算法不断剔除与该节点相独立的其他节点;在满足终止条件时跳出迭代过程,此时剩余的连边初步构成了该节点与其他节点间的因果关系网络;最后根据瞬态条件独立性检验算法对各连边进行评估,并删除虚假的连边,以完成更加可靠的因果网络重构。The invention provides a network inference and reconstruction method oriented to an information physical system. In this method, firstly, a fully connected time sequence network diagram is constructed according to the collected time sequence state information of unmanned autonomous entities (nodes in the network); secondly, for any node, the nodes that are not related to the node are eliminated; Next, the iterative process is carried out, the test conditions are updated in each iteration process, and other nodes independent of the node are continuously eliminated according to the conditional independence test algorithm; when the termination conditions are met, the iterative process is jumped out, and the remaining edges are initially connected. The causal relationship network between the node and other nodes is formed; finally, each connection is evaluated according to the transient conditional independence test algorithm, and the false connection is deleted to complete a more reliable causal network reconstruction.

基于PCMCI的因果网络推断方法,包括如下具体步骤:The causal network inference method based on PCMCI includes the following specific steps:

步骤1:初始化PC算法的参数,设置输入状态时间序列X=(X1,X2,...,XN)、最大因果滞后时间τmax、最大可存在因果关系的父节点数目(最大迭代轮数)Nc,max、条件独立性检验次数NCI以及终止条件阈值αPCStep 1: Initialize the parameters of the PC algorithm, set the input state time series X=(X 1 , X 2 ,..., X N ), the maximum causal lag time τ max , the maximum number of parent nodes that can have causal relationships (maximum iteration number of rounds) N c,max , number of conditional independence tests N CI and termination condition threshold α PC .

步骤2:选取某一节点

Figure BDA0002967072310000066
进行初始化,设置其父节点
Figure BDA0002967072310000067
为所有其他节点,设置其与任意其他节点
Figure BDA0002967072310000068
间的独立检验统计量
Figure BDA0002967072310000069
设置初始迭代轮数Nc=0。Step 2: Select a node
Figure BDA0002967072310000066
Initialize, set its parent node
Figure BDA0002967072310000067
For all other nodes, set it to be the same as any other node
Figure BDA0002967072310000068
independent test statistic between
Figure BDA0002967072310000069
Set the initial iteration round number N c =0.

步骤3:第Nc轮迭代。根据PC1算法,对

Figure BDA00029670723100000610
在不同的
Figure BDA00029670723100000611
条件下进行条件独立性检验。Step 3: N c round of iteration. According to the PC 1 algorithm, the
Figure BDA00029670723100000610
in different
Figure BDA00029670723100000611
Conditional independence tests were performed.

步骤4:根据假设检验的显著性水平,对与

Figure BDA00029670723100000612
独立的
Figure BDA00029670723100000613
从其父节点
Figure BDA00029670723100000614
中进行剔除。Step 4: According to the significance level of the hypothesis test, compare the
Figure BDA00029670723100000612
independent
Figure BDA00029670723100000613
from its parent node
Figure BDA00029670723100000614
removed in.

步骤5:若

Figure BDA0002967072310000071
则执行步骤6;否则返回步骤3进行一轮迭代。Step 5: If
Figure BDA0002967072310000071
Then go to step 6; otherwise, return to step 3 for one round of iteration.

步骤6:若所有节点的因果关系均已完成重构则执行步骤7;否则返回执行步骤2。Step 6: If the causal relationship of all nodes has been reconstructed, go to Step 7; otherwise, return to Step 2.

步骤7:通过MCI检验节点间的瞬态条件独立性,剔除由于间接影响或共同驱动因素而产生的虚假连边,得到最终重构的因果网络。Step 7: Check the transient conditional independence between nodes through MCI, remove false connections due to indirect influences or common driving factors, and obtain the final reconstructed causal network.

具体而言,本发明提供一种面向信息物理系统(CPS)的网络推断与重构方法,考虑的问题是:在CPS系统中,保证重构无人自主体间的实际存在的因果关系的同时,尽可能地减少虚假连边/虚假关系的存在。图1所示为一个CPS中无人自主体间可能存在的因果关系示意图,无人自主体以节点的形式表示,其状态记为Xi,节点间的因果关系用带箭头的实线表示,且若存在由Xi指向Xj的箭头,则我们称无人自主体i为无人自主体j的“因”。由于在实际的物理系统当中,信号的传递是需要时间的,因此节点间的因果关系也是与时间信息挂钩的。图1的图模型并不能表现该因果关系的时间滞后影响,因此在本方法中,我们采用如图2的时序图来进行描述,与图模型不同的是,时序图同时包括节点信息与时间信息,因此可以更为完整地描述网络的因果关系。Specifically, the present invention provides a cyber-physical system (CPS)-oriented network inference and reconstruction method. The problem to be considered is: in the CPS system, while ensuring the reconstruction of the actual causal relationship between unmanned and autonomous entities , to reduce the existence of false connections/false relationships as much as possible. Figure 1 shows a schematic diagram of the causal relationship that may exist between unmanned and autonomous agents in a CPS. Unmanned and autonomous agents are represented in the form of nodes, and their states are denoted as X i , and the causal relationship between nodes is represented by solid lines with arrows. And if there is an arrow pointing from X i to X j , then we call unmanned subject i the "cause" of unmanned subject j. In the actual physical system, the transmission of signals takes time, so the causal relationship between nodes is also linked to time information. The graph model in Figure 1 cannot express the time lag effect of the causal relationship. Therefore, in this method, we use the sequence diagram as shown in Figure 2 to describe it. Unlike the graph model, the sequence diagram includes both node information and time information. , so the causal relationship of the network can be described more completely.

第一步,初始化PC算法的参数,具体包括以下几点:1、网络中无人自主体(节点)的总数N;2、状态时间序列X=(X1,X2,...,XN),即观测到的各节点的时序状态信息;3、最大因果滞后时间τmax,即某一节点发出的物理影响因素对另一节点产生作用所需的最大传递时间;4、最大可存在因果关系的父节点数目(最大迭代轮数)Nc,max,即理论上最多可对某一节点产生影响的节点个数,在本方法中设为网络中包含的节点总数N;5、条件独立性(conditional independence,CI)检验次数NCI,即在每次迭代过程中,条件独立性检验需要检验的条件个数,在本方法中设为1,表明每次迭代只需进行一次CI检验;6、终止条件阈值αPC,即条件独立性假设检验的显著性阈值。The first step is to initialize the parameters of the PC algorithm, including the following points: 1. The total number N of unmanned autonomous entities (nodes) in the network; 2. The state time series X=(X 1 , X 2 ,...,X N ), that is, the observed time-series state information of each node; 3. The maximum causal lag time τ max , that is, the maximum transfer time required for the physical influencing factors emitted by a node to have an effect on another node; 4. The maximum can exist The number of parent nodes of the causal relationship (maximum number of iteration rounds) N c,max , that is, the number of nodes that can theoretically affect a node at most, in this method, it is set as the total number of nodes included in the network N; 5. Conditions The number of tests of independence (conditional independence, CI) N CI , that is, the number of conditions that need to be tested for conditional independence test in each iteration process, is set to 1 in this method, indicating that only one CI test needs to be performed in each iteration 6. Termination condition threshold α PC , that is, the significance threshold of conditional independence hypothesis test.

第二步,任意选取某一节点

Figure BDA0002967072310000072
进行因果关系重构。首先对其进行初始化,设置其父节点为所有其他节点(全连通),即满足下式:The second step is to arbitrarily select a node
Figure BDA0002967072310000072
Reconstruct causality. First initialize it, set its parent node to all other nodes (fully connected), that is, to satisfy the following formula:

Figure BDA0002967072310000073
Figure BDA0002967072310000073

其中

Figure BDA0002967072310000074
表示节点
Figure BDA0002967072310000075
的父节点,
Figure BDA0002967072310000076
表示当前时刻t之前时刻的所有时序状态节点的集合,由于实际的物理信息传递需要时间,因此本方法认为同时刻下的其他节点
Figure BDA0002967072310000077
与节点
Figure BDA0002967072310000078
间无因果关系。in
Figure BDA0002967072310000074
represents a node
Figure BDA0002967072310000075
the parent node of ,
Figure BDA0002967072310000076
Represents the set of all timing state nodes before the current time t. Since the actual physical information transmission takes time, this method considers other nodes at the same time
Figure BDA0002967072310000077
with node
Figure BDA0002967072310000078
There is no causal relationship between them.

同时,初始化节点

Figure BDA0002967072310000079
与任意其他节点
Figure BDA00029670723100000710
间的独立检验统计量为一个足够大的量,即:At the same time, initialize the node
Figure BDA0002967072310000079
with any other node
Figure BDA00029670723100000710
The independent test statistic between is a large enough quantity, namely:

Figure BDA00029670723100000711
Figure BDA00029670723100000711

最后,初始化当前迭代次数Nc=0。Finally, the current iteration number N c =0 is initialized.

第三步,进行第Nc轮迭代。从

Figure BDA00029670723100000712
中选取一个在本轮迭代过程中未进行CI检测的节点
Figure BDA00029670723100000713
Figure BDA00029670723100000714
在条件
Figure BDA00029670723100000715
下进行CI检验,其中
Figure BDA00029670723100000716
且满足
Figure BDA0002967072310000081
得到假设检验的P值与独立性大小分别记为(p,I)。在本方法中,采用高斯过程回归(Gaussian process regression,GPR)与距离相关系数(distance correlation,DC)来进行独立性检验,其具体过程如下:The third step is to perform the N c round of iterations. from
Figure BDA00029670723100000712
Select a node that has not undergone CI detection in this round of iterations
Figure BDA00029670723100000713
right
Figure BDA00029670723100000714
in condition
Figure BDA00029670723100000715
The CI test is carried out under the
Figure BDA00029670723100000716
and satisfy
Figure BDA0002967072310000081
The P value and the degree of independence of the hypothesis test were obtained as (p, I), respectively. In this method, Gaussian process regression (GPR) and distance correlation coefficient (distance correlation, DC) are used to test independence. The specific process is as follows:

1、利用贝叶斯非参数回归模型对

Figure BDA0002967072310000082
分别进行回归分析,并计算其P值,其模型如下式所示:1. Use a Bayesian nonparametric regression model to
Figure BDA0002967072310000082
Regression analysis was carried out respectively, and its P value was calculated, and its model was as follows:

Figure BDA0002967072310000083
Figure BDA0002967072310000083

2、计算在该模型下

Figure BDA0002967072310000084
的残差为:2. Calculated under this model
Figure BDA0002967072310000084
The residuals are:

Figure BDA0002967072310000085
Figure BDA0002967072310000085

Figure BDA0002967072310000086
Figure BDA0002967072310000086

3、对于

Figure BDA0002967072310000087
Figure BDA0002967072310000088
两者间的独立性可通过
Figure BDA0002967072310000089
Figure BDA00029670723100000810
来计算,若I=0则表示两变量是独立的:3. For
Figure BDA0002967072310000087
and
Figure BDA0002967072310000088
The independence between the two can be achieved by
Figure BDA0002967072310000089
and
Figure BDA00029670723100000810
To calculate, if I=0, it means that the two variables are independent:

Figure BDA00029670723100000811
Figure BDA00029670723100000811

其中

Figure BDA00029670723100000812
的定义分别如下:in
Figure BDA00029670723100000812
are defined as follows:

Figure BDA00029670723100000813
Figure BDA00029670723100000813

Figure BDA00029670723100000814
Figure BDA00029670723100000814

Figure BDA00029670723100000815
Figure BDA00029670723100000815

其中du、dv分别表示向量u、v的维数。where d u and d v represent the dimensions of the vectors u and v, respectively.

4、更新

Figure BDA00029670723100000816
4. Update
Figure BDA00029670723100000816

第四步、若p>αPC,则从

Figure BDA00029670723100000817
中剔除
Figure BDA00029670723100000818
的节点,并根据
Figure BDA00029670723100000819
Figure BDA00029670723100000820
进行降序排序;否则返回执行第三步。The fourth step, if p > α PC , then from
Figure BDA00029670723100000817
medium culling
Figure BDA00029670723100000818
node, and according to
Figure BDA00029670723100000819
Give
Figure BDA00029670723100000820
Sort in descending order; otherwise, return to the third step.

第五步,若

Figure BDA00029670723100000821
则节点
Figure BDA00029670723100000822
的因果关系重构完成;否则令Nc=Nc+1,并返回执行第三步进行新一轮迭代。The fifth step, if
Figure BDA00029670723100000821
then the node
Figure BDA00029670723100000822
The reconstruction of the causal relationship is completed; otherwise, let N c =N c +1, and return to the third step for a new round of iteration.

第六步,判断是否所有节点的因果关系均已完成重构,若完成则执行第七步;否则返回执行第二步。The sixth step is to judge whether the causal relationship of all nodes has been reconstructed, and if so, execute the seventh step; otherwise, return to the second step.

第七步,对重构的因果网络进行MCI检验。重构好的网络局部(针对节点

Figure BDA00029670723100000823
)示意图如附图3所示,其中黑色实线表示对于
Figure BDA00029670723100000824
实际存在因果关系的连边,灰色实线表示对于
Figure BDA0002967072310000091
实际不存在因果关系的连边,不同深度颜色的点分别表示不同迭代轮数中被剔除的点,颜色越深表示越晚被剔除,而黑色的点表示最终保留下来的点。可以看到被倒三角符号标识的点并不是实际与
Figure BDA0002967072310000092
存在因果关系的点,因此需要通过MCI对其进行识别与剔除。针对任意一个节点
Figure BDA0002967072310000093
的连边甄别具体过程如下:The seventh step is to perform MCI test on the reconstructed causal network. The reconstructed network part (for nodes
Figure BDA00029670723100000823
) schematic diagram as shown in Figure 3, wherein the black solid line represents for
Figure BDA00029670723100000824
There is actually a causal connection, and the gray solid line indicates that for
Figure BDA0002967072310000091
There is actually no causal connection. Points with different depth colors represent points that are removed in different iteration rounds. The darker the color, the later the points are removed, and the black points represent the points that are finally retained. It can be seen that the point identified by the inverted triangle symbol is not actually related to the
Figure BDA0002967072310000092
There is a causal relationship, so it needs to be identified and eliminated by MCI. for any node
Figure BDA0002967072310000093
The specific process of edge screening is as follows:

1、对任意节点

Figure BDA0002967072310000094
计算
Figure BDA0002967072310000095
1. For any node
Figure BDA0002967072310000094
calculate
Figure BDA0002967072310000095

2、选取任意节点

Figure BDA0002967072310000096
定义
Figure BDA0002967072310000097
Figure BDA0002967072310000098
的前pX个节点构成的集合;2. Select any node
Figure BDA0002967072310000096
definition
Figure BDA0002967072310000097
for
Figure BDA0002967072310000098
The set consisting of the first p X nodes of ;

3、对

Figure BDA0002967072310000099
在条件
Figure BDA00029670723100000910
下进行CI检验,其过程同第三步中所述方法;3. Yes
Figure BDA0002967072310000099
in condition
Figure BDA00029670723100000910
The CI test is carried out below, and its process is the same as that described in the third step;

4、根据CI检验结果,剔除虚假连边;4. According to the CI test results, eliminate false connections;

5、选取其他节点重复步骤2~4,若所有节点均已MCI检验完毕,则获得最终PCMCI重构的因果网络。5. Select other nodes and repeat steps 2-4. If all nodes have completed the MCI check, the final PCMCI-reconstructed causal network is obtained.

整个发明的方法可以用图4流程图来说明。The entire inventive method can be illustrated by the flowchart of FIG. 4 .

以上详细描述了本发明的较佳具体实施例。应当理解,本领域的普通技术无需创造性劳动就可以根据本发明的构思作出诸多修改和变化。因此,凡本技术领域中技术人员依本发明的构思在现有技术的基础上通过逻辑分析、推理或者有限的实验可以得到的技术方案,皆应在由权利要求书所确定的保护范围内。The preferred embodiments of the present invention have been described in detail above. It should be understood that many modifications and changes can be made according to the concept of the present invention by those skilled in the art without creative efforts. Therefore, all technical solutions that can be obtained by those skilled in the art through logical analysis, reasoning or limited experiments on the basis of the prior art according to the concept of the present invention shall fall within the protection scope determined by the claims.

Claims (6)

1.一种面向信息物理系统的网络重构方法,其特征在于,包括步骤:1. a cyber-physical system-oriented network reconstruction method, is characterized in that, comprises the steps: 根据收集到的网络中的节点的时序状态信息,构建全连通的时序网络图;According to the collected timing state information of nodes in the network, construct a fully connected timing network diagram; 针对任意一个节点,将与该节点不相关的节点进行剔除;For any node, the nodes that are not related to the node are eliminated; 进行迭代过程,在每次迭代过程中更新检验条件,并根据条件独立性检验算法不断剔除与该节点相独立的其他节点;Carry out an iterative process, update the test conditions in each iteration process, and continuously eliminate other nodes independent of the node according to the conditional independence test algorithm; 在满足终止条件时跳出迭代过程;Jump out of the iterative process when the termination condition is met; 根据瞬态条件独立性检验算法对各连边进行评估,并删除虚假的连边;Evaluate each connected edge according to the transient conditional independence test algorithm, and delete false connections; 其中,包括以下具体步骤:Among them, the following specific steps are included: (1)初始化PC算法的参数:设置输入状态时间序列X=(X1,X2,...,XN)、最大因果滞后时间τmax、最大可存在因果关系的父节点数目(最大迭代轮数)Nc,max、条件独立性检验次数NCI以及终止条件阈值αPC(1) Initialize the parameters of the PC algorithm: set the input state time series X = (X 1 , X 2 , . number of rounds) N c , max , number of conditional independence tests N CI and termination condition threshold α PC ; (2)任意选取某一节点
Figure FDA0003348874180000011
进行因果关系重构:选取某一节点
Figure FDA0003348874180000012
进行初始化,设置其父节点
Figure FDA0003348874180000013
为所有其他节点,设置其与任意其他节点
Figure FDA0003348874180000014
间的独立检验统计量
Figure FDA0003348874180000015
设置初始迭代轮数Nc=0;
(2) Arbitrarily select a node
Figure FDA0003348874180000011
Perform causal reconstruction: pick a node
Figure FDA0003348874180000012
Initialize, set its parent node
Figure FDA0003348874180000013
For all other nodes, set it to be the same as any other node
Figure FDA0003348874180000014
independent test statistic between
Figure FDA0003348874180000015
Set the initial iteration round number N c =0;
(3)进行第Nc轮迭代:从
Figure FDA0003348874180000016
中选取一个在本轮迭代过程中未进行CI检测的节点
Figure FDA0003348874180000017
Figure FDA0003348874180000018
在条件
Figure FDA0003348874180000019
下进行CI检验,其中
Figure FDA00033488741800000110
且满足
Figure FDA00033488741800000111
得到假设检验的P值与独立性大小分别记为(p,I);
(3) Perform the N- th round of iteration: from
Figure FDA0003348874180000016
Select a node that has not undergone CI detection in this round of iterations
Figure FDA0003348874180000017
right
Figure FDA0003348874180000018
in condition
Figure FDA0003348874180000019
The CI test is carried out under the
Figure FDA00033488741800000110
and satisfy
Figure FDA00033488741800000111
The P value and independence of the hypothesis test are recorded as (p, I) respectively;
(4)若p>αPC,则从
Figure FDA00033488741800000112
中剔除
Figure FDA00033488741800000113
的节点,并根据
Figure FDA00033488741800000114
Figure FDA00033488741800000115
进行降序排序;否则返回步骤(3);
(4) If p>α PC , then from
Figure FDA00033488741800000112
medium culling
Figure FDA00033488741800000113
node, and according to
Figure FDA00033488741800000114
Give
Figure FDA00033488741800000115
Sort in descending order; otherwise, return to step (3);
(5)若
Figure FDA00033488741800000116
则节点
Figure FDA00033488741800000117
的因果关系重构完成;否则令Nc=Nc+1,并返回执行步骤(3)进行新一轮迭代;
(5) If
Figure FDA00033488741800000116
then the node
Figure FDA00033488741800000117
The reconstruction of the causal relationship is completed; otherwise, let N c =N c +1, and return to step (3) for a new round of iteration;
(6)判断是否所有节点的因果关系均已完成重构,若完成则进入步骤(7);否则返回步骤(2);(6) Judging whether the causal relationship of all nodes has been reconstructed, if completed, enter step (7); otherwise, return to step (2); (7)对重构的因果网络进行MCI检验,剔除虚假连边。(7) MCI test is performed on the reconstructed causal network, and false connections are eliminated.
2.如权利要求1所述的面向信息物理系统的网络重构方法,其中,步骤(3)中采用高斯过程回归与距离相关系数来进行独立性检验,其具体过程如下:2. the network reconstruction method for cyber-physical systems as claimed in claim 1, wherein, in step (3), adopt Gaussian process regression and distance correlation coefficient to carry out independence test, and its concrete process is as follows: (3a)利用贝叶斯非参数回归模型对
Figure FDA0003348874180000021
分别进行回归分析,并计算其P值,其模型如下式所示:
(3a) Using a Bayesian nonparametric regression model to
Figure FDA0003348874180000021
Regression analysis was carried out respectively, and its P value was calculated, and its model was as follows:
Figure FDA0003348874180000022
Figure FDA0003348874180000022
(3b)计算在该模型下
Figure FDA0003348874180000023
的残差为:
(3b) Calculated under this model
Figure FDA0003348874180000023
The residuals are:
Figure FDA0003348874180000024
Figure FDA0003348874180000024
Figure FDA0003348874180000025
Figure FDA0003348874180000025
(3c)对于
Figure FDA0003348874180000026
Figure FDA0003348874180000027
两者间的独立性可通过
Figure FDA0003348874180000028
Figure FDA0003348874180000029
来计算,若I=0则表示两变量是独立的:
(3c) For
Figure FDA0003348874180000026
and
Figure FDA0003348874180000027
The independence between the two can be achieved by
Figure FDA0003348874180000028
and
Figure FDA0003348874180000029
To calculate, if I=0, it means that the two variables are independent:
Figure FDA00033488741800000210
Figure FDA00033488741800000210
其中
Figure FDA00033488741800000211
的定义分别如下:
in
Figure FDA00033488741800000211
are defined as follows:
Figure FDA00033488741800000212
Figure FDA00033488741800000212
Figure FDA00033488741800000213
Figure FDA00033488741800000213
Figure FDA00033488741800000214
Figure FDA00033488741800000214
其中du、dv分别表示向量u、v的维数;where d u and d v represent the dimensions of the vectors u and v, respectively; (3d)更新
Figure FDA00033488741800000215
(3d) Update
Figure FDA00033488741800000215
3.如权利要求2所述的面向信息物理系统的网络重构方法,其中,步骤(7)中对任意一个节点
Figure FDA00033488741800000216
的连边检验具体过程如下:
3. The cyber-physical system-oriented network reconfiguration method as claimed in claim 2, wherein, in step (7), to any node
Figure FDA00033488741800000216
The specific process of the edge connection test is as follows:
(7a)对任意节点
Figure FDA00033488741800000217
计算
Figure FDA00033488741800000218
(7a) For any node
Figure FDA00033488741800000217
calculate
Figure FDA00033488741800000218
(7b)选取任意节点
Figure FDA00033488741800000219
定义
Figure FDA00033488741800000220
Figure FDA00033488741800000221
的前pX个节点构成的集合;
(7b) Select any node
Figure FDA00033488741800000219
definition
Figure FDA00033488741800000220
for
Figure FDA00033488741800000221
The set of the first pX nodes of ;
(7c)对
Figure FDA00033488741800000222
在条件
Figure FDA00033488741800000223
下进行CI检验,其过程同第三步中所述方法;
(7c) Yes
Figure FDA00033488741800000222
in condition
Figure FDA00033488741800000223
The CI test is carried out below, and its process is the same as that described in the third step;
(7d)根据CI检验结果,剔除虚假连边;(7d) According to the CI test result, remove the false connection; (7f)选取其他节点重复步骤(7b)~(7d),若所有节点均已MCI检验完毕,则获得最终PCMCI重构的因果网络。(7f) Select other nodes and repeat steps (7b) to (7d). If all nodes have completed the MCI check, the final PCMCI-reconstructed causal network is obtained.
4.一种面向信息物理系统的网络重构系统,其特征在于,包括4. A cyber-physical system-oriented network reconfiguration system is characterized in that, comprising 时序网络图构建模块,用于根据收集到的网络中的节点的时序状态信息,构建全连通的时序网络图;The timing network diagram building module is used to construct a fully connected timing network diagram according to the collected timing state information of the nodes in the network; 因果网络重构模块,用于针对任意一个节点,将与该节点不相关的节点进行剔除;进行迭代过程,在每次迭代过程中更新检验条件,并根据条件独立性检验算法不断剔除与该节点相独立的其他节点;在满足终止条件时跳出迭代过程;The causal network reconstruction module is used to eliminate any node that is not related to the node; iterative process is performed, the test conditions are updated in each iteration process, and the node is continuously eliminated according to the conditional independence test algorithm Other nodes that are independent of each other; jump out of the iterative process when the termination condition is met; 因果网络检验模块,用于根据瞬态条件独立性检验算法对各连边进行评估,并删除虚假的连边;The causal network test module is used to evaluate each connection according to the transient conditional independence test algorithm, and delete the false connection; 其中,时序网络图构建模块被设置为执行如下步骤:where the timing network diagram building block is set up to perform the following steps: (1)接收初始化PC算法的参数设置:状态时间序列X=((X1,X2,...,XN)、最大因果滞后时间τmax、最大可存在因果关系的父节点数目(最大迭代轮数)Nc,max、条件独立性检验次数NCI以及终止条件阈值αPC;(1) Receive parameter settings for initializing the PC algorithm: state time series X = ((X 1 , X 2 , . The number of iteration rounds) N c, max , the number of conditional independence tests N CI and the termination condition threshold αPC; 因果网络重构模块被设置为执行如下步骤:The causal network reconstruction module is set up to perform the following steps: (2)任意选取某一节点
Figure FDA0003348874180000031
进行因果关系重构:选取某一节点
Figure FDA0003348874180000032
进行初始化,设置其父节点
Figure FDA0003348874180000033
为所有其他节点,设置其与任意其他节点
Figure FDA0003348874180000034
间的独立检验统计量
Figure FDA0003348874180000035
设置初始迭代轮数Nc=0;
(2) Arbitrarily select a node
Figure FDA0003348874180000031
Perform causal reconstruction: pick a node
Figure FDA0003348874180000032
Initialize, set its parent node
Figure FDA0003348874180000033
For all other nodes, set it to be the same as any other node
Figure FDA0003348874180000034
independent test statistic between
Figure FDA0003348874180000035
Set the initial iteration round number N c =0;
(3)进行第Nc轮迭代:从
Figure FDA0003348874180000036
中选取一个在本轮迭代过程中未进行CI检测的节点
Figure FDA0003348874180000037
Figure FDA0003348874180000038
在条件
Figure FDA0003348874180000039
下进行CI检验,其中
Figure FDA00033488741800000310
且满足
Figure FDA00033488741800000311
得到假设检验的P值与独立性大小分别记为(p,I);
(3) Perform the N- th round of iteration: from
Figure FDA0003348874180000036
Select a node that has not undergone CI detection in this round of iterations
Figure FDA0003348874180000037
right
Figure FDA0003348874180000038
in condition
Figure FDA0003348874180000039
The CI test is carried out under the
Figure FDA00033488741800000310
and satisfy
Figure FDA00033488741800000311
The P value and independence of the hypothesis test are recorded as (p, I) respectively;
(4)若p>αPC,则从
Figure FDA00033488741800000312
中剔除
Figure FDA00033488741800000313
的节点,并根据
Figure FDA00033488741800000314
Figure FDA00033488741800000315
进行降序排序;否则返回步骤(3);
(4) If p>α PC , then from
Figure FDA00033488741800000312
medium culling
Figure FDA00033488741800000313
node, and according to
Figure FDA00033488741800000314
Give
Figure FDA00033488741800000315
Sort in descending order; otherwise, return to step (3);
(5)若
Figure FDA00033488741800000316
则节点
Figure FDA00033488741800000317
的因果关系重构完成;否则令Nc=Nc+1,并返回执行步骤(3)进行新一轮迭代;
(5) If
Figure FDA00033488741800000316
then the node
Figure FDA00033488741800000317
The reconstruction of the causal relationship is completed; otherwise, let N c =N c +1, and return to step (3) for a new round of iteration;
(6)判断是否所有节点的因果关系均已完成重构,若完成则进入步骤(7);否则返回步骤(2);(6) Judging whether the causal relationship of all nodes has been reconstructed, if completed, enter step (7); otherwise, return to step (2); 因果网络检验模块被设置为执行如下步骤:The causal network inspection module is set up to perform the following steps: (7)对重构的因果网络进行MCI检验,剔除虚假连边。(7) MCI test is performed on the reconstructed causal network, and false connections are eliminated.
5.如权利要求4所述的面向信息物理系统的网络重构系统,其中,因果网络重构模块被设置为在执行步骤(3)时,采用高斯过程回归与距离相关系数来进行独立性检验,其具体过程如下:5. The cyber-physical system-oriented network reconstruction system as claimed in claim 4, wherein the causal network reconstruction module is set to perform independence test by using Gaussian process regression and distance correlation coefficient when performing step (3). , the specific process is as follows: (3a)利用贝叶斯非参数回归模型对
Figure FDA0003348874180000041
分别进行回归分析,并计算其P值,其模型如下式所示:
(3a) Using a Bayesian nonparametric regression model to
Figure FDA0003348874180000041
Regression analysis was carried out respectively, and its P value was calculated, and its model was as follows:
Figure FDA0003348874180000042
Figure FDA0003348874180000042
(3b)计算在该模型下
Figure FDA0003348874180000043
的残差为:
(3b) Calculated under this model
Figure FDA0003348874180000043
The residuals are:
Figure FDA0003348874180000044
Figure FDA0003348874180000044
Figure FDA0003348874180000045
Figure FDA0003348874180000045
(3c)对于
Figure FDA0003348874180000046
Figure FDA0003348874180000047
两者间的独立性可通过
Figure FDA0003348874180000048
Figure FDA0003348874180000049
来计算,若I=0则表示两变量是独立的:
(3c) For
Figure FDA0003348874180000046
and
Figure FDA0003348874180000047
The independence between the two can be achieved by
Figure FDA0003348874180000048
and
Figure FDA0003348874180000049
To calculate, if I=0, it means that the two variables are independent:
Figure FDA00033488741800000410
Figure FDA00033488741800000410
其中
Figure FDA00033488741800000411
的定义分别如下:
in
Figure FDA00033488741800000411
are defined as follows:
Figure FDA00033488741800000412
Figure FDA00033488741800000412
Figure FDA00033488741800000413
Figure FDA00033488741800000413
Figure FDA00033488741800000414
Figure FDA00033488741800000414
其中du、dv分别表示向量u、v的维数;where d u and d v represent the dimensions of the vectors u and v, respectively; (3d)更新
Figure FDA00033488741800000415
(3d) Update
Figure FDA00033488741800000415
6.如权利要求5所述的面向信息物理系统的网络重构系统,其中,因果网络检验模块被设置为在执行步骤(7)时,对任意一个节点的连边检验具体过程如下:6. The cyber-physical system-oriented network reconfiguration system as claimed in claim 5, wherein the causal network checking module is set to when performing step (7), and the specific process of checking the connection of any node is as follows: (7a)对任意节点
Figure FDA00033488741800000416
计算
Figure FDA00033488741800000417
(7a) For any node
Figure FDA00033488741800000416
calculate
Figure FDA00033488741800000417
(7b)选取任意节点
Figure FDA00033488741800000418
定义
Figure FDA00033488741800000419
Figure FDA00033488741800000420
的前pX个节点构成的集合;
(7b) Select any node
Figure FDA00033488741800000418
definition
Figure FDA00033488741800000419
for
Figure FDA00033488741800000420
The set of the first pX nodes of ;
(7c)对
Figure FDA0003348874180000051
在条件
Figure FDA0003348874180000052
下进行CI检验,其过程同第三步中的方法;
(7c) Yes
Figure FDA0003348874180000051
in condition
Figure FDA0003348874180000052
The CI test is carried out below, and the process is the same as the method in the third step;
(7d)根据CI检验结果,剔除虚假连边;(7d) According to the CI test result, remove the false connection; (7f)选取其他节点重复步骤(7b)~(7d),若所有节点均已MCI检验完毕,则获得最终PCMCI重构的因果网络。(7f) Select other nodes and repeat steps (7b) to (7d). If all nodes have completed the MCI check, the final PCMCI-reconstructed causal network is obtained.
CN202110255858.2A 2021-03-09 2021-03-09 Network reconstruction method and system for information physical system Active CN113055230B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110255858.2A CN113055230B (en) 2021-03-09 2021-03-09 Network reconstruction method and system for information physical system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110255858.2A CN113055230B (en) 2021-03-09 2021-03-09 Network reconstruction method and system for information physical system

Publications (2)

Publication Number Publication Date
CN113055230A CN113055230A (en) 2021-06-29
CN113055230B true CN113055230B (en) 2022-04-19

Family

ID=76510817

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110255858.2A Active CN113055230B (en) 2021-03-09 2021-03-09 Network reconstruction method and system for information physical system

Country Status (1)

Country Link
CN (1) CN113055230B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113806452B (en) 2021-09-17 2022-10-25 北京百度网讯科技有限公司 Information processing method, device, electronic device and storage medium
CN118228853A (en) * 2023-11-08 2024-06-21 国网上海市电力公司 A method for predicting electricity consumption based on causal discovery and deep learning

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6909696B1 (en) * 2000-01-21 2005-06-21 Verizon Corporate Services Group Inc. Systems and methods for visualizing a communications network
CN101827413A (en) * 2009-03-05 2010-09-08 赵欣 Dynamic multipath routing algorithm based on mobility prediction
CN111866914A (en) * 2020-06-29 2020-10-30 同济大学 A method for configuring large-scale IoT working nodes in 5G networks

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6909696B1 (en) * 2000-01-21 2005-06-21 Verizon Corporate Services Group Inc. Systems and methods for visualizing a communications network
CN101827413A (en) * 2009-03-05 2010-09-08 赵欣 Dynamic multipath routing algorithm based on mobility prediction
CN111866914A (en) * 2020-06-29 2020-10-30 同济大学 A method for configuring large-scale IoT working nodes in 5G networks

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
优化增量型随机权神经网络及应用;姜乐等;《化工学报》;20191231(第12期);全文 *
基于贝叶斯网络的量化信任评估方法;林青等;《计算机技术与发展》;20161231(第12期);全文 *

Also Published As

Publication number Publication date
CN113055230A (en) 2021-06-29

Similar Documents

Publication Publication Date Title
CN115208680B (en) Dynamic network risk prediction method based on graph neural network
CN111585948B (en) Intelligent network security situation prediction method based on power grid big data
CN113055230B (en) Network reconstruction method and system for information physical system
CN109597401A (en) A kind of equipment fault diagnosis method based on data-driven
CN107220540A (en) Intrusion detection method based on intensified learning
CN109214456A (en) A kind of network anomaly detection method, system and electronic equipment
CN113886225B (en) A fuzzy testing system and method for unknown industrial control protocols
CN114500004A (en) An Anomaly Detection Method Based on Conditional Diffusion Probability Generation Model
CN112822184B (en) Unsupervised autonomous attack detection method in endogenous security system
CN116520806A (en) Intelligent fault diagnosis system and method for industrial system
CN117580046A (en) Deep learning-based 5G network dynamic security capability scheduling method
CN111353160B (en) Software bug abnormity intelligent detection system and method
CN118411589A (en) A method and device for detecting anomalies in semiconductor wafer manufacturing based on spatiotemporal graph neural network
CN108683658A (en) An abnormal identification method for industrial control network traffic based on multi-RBM network construction benchmark model
CN115567305A (en) Sequential network attack prediction and analysis method based on deep learning
CN117879907A (en) A network environment anomaly detection method based on graph convolution behavior feature extraction
CN118193954A (en) A method and system for detecting abnormal data in distribution network based on edge computing
CN118138428A (en) Intelligent perception-based fault diagnosis method and system for Internet of things equipment
CN117454145A (en) Aeroengine fault diagnosis method based on multi-mode sensor
CN117111464A (en) Self-adaptive fault diagnosis method under multiple working conditions
CN115356599A (en) Multi-mode urban power grid fault diagnosis method and system
CN112465150A (en) Real data enhancement-based multi-element time sequence data filling method
CN118675658B (en) Sewage dosing effect prediction method and system based on big data
CN117640342B (en) A method, device, equipment and medium for detecting abnormality of a power monitoring system
CN119004238B (en) Gear box fault diagnosis method and device based on mutual information entropy diagram

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant