CN111627553A - Method for constructing individualized prediction model of first-onset schizophrenia - Google Patents
- Publication number
- CN111627553A (application number CN202010454990.1A)
- Authority
- CN
- China
- Prior art keywords
- network
- schizophrenia
- node
- brain
- constructing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/20—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for computer-aided diagnosis, e.g. based on medical expert systems
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H20/00—ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance
- G16H20/70—ICT specially adapted for therapies or health-improving plans, e.g. for handling prescriptions, for steering therapy or for monitoring patient compliance relating to mental therapies, e.g. psychological therapy or autogenous training
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Public Health (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Data Mining & Analysis (AREA)
- Primary Health Care (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Pathology (AREA)
- Child & Adolescent Psychology (AREA)
- Developmental Disabilities (AREA)
- Hospice & Palliative Care (AREA)
- Psychiatry (AREA)
- Psychology (AREA)
- Social Psychology (AREA)
- Magnetic Resonance Imaging Apparatus (AREA)
Abstract
The invention belongs to the fields of psychiatry, neuroimaging and artificial intelligence, and discloses a method for constructing an individualized prediction model of first-episode schizophrenia, which addresses the low accuracy of auxiliary diagnosis based on existing schizophrenia (SCH) brain structural network models. The method comprises the following steps: A. acquiring diffusion tensor images of patients with first-episode schizophrenia; B. preprocessing the acquired diffusion tensor images; C. constructing sparse brain structural networks from the preprocessed images; D. using a similarity network fusion method to construct, for each subject, a multi-threshold fused brain structural network from the sparsified networks; E. extracting topological attribute features of the multi-threshold fused brain structural network and then performing feature screening; F. training a classifier on the screened features to obtain the individualized prediction model of first-episode schizophrenia; G. verifying and evaluating the performance of the trained model.
Description
Technical field
The invention belongs to the fields of psychiatry, neuroimaging and artificial intelligence, and in particular relates to a method for constructing an individualized prediction model of first-episode schizophrenia.
Background
Schizophrenia (SCH) is a highly disabling and potentially fatal mental disorder. The World Health Organization lists it among the ten leading diseases in the global burden of disease ranking. However, its brain mechanisms have not been fully clarified, diagnosis lacks objective criteria, and the cure rate is low. Finding an objective, effective, convenient and feasible biomarker for early individualized classification, diagnosis and treatment of SCH has become an urgent clinical problem. Alterations of the brain structural network are an important biological basis of the neuroanatomical abnormalities in SCH. Machine learning methods, as data-driven prediction and analysis tools, can make full use of the structural information inherent in biomarker data to build individualized brain structural network models of SCH.
Current SCH brain structural network models have the following limitations: 1) structural connection values are extracted directly from the original network as features, which admits spurious low-weight connections; 2) sparsifying the network with a single fixed threshold leaves noise at different levels, and the choice of that single threshold is subjective; 3) the raw structural connection values are used directly as low-level structural features, ignoring important properties of the brain's topologically complex network.
Therefore, it is very difficult for existing SCH brain structural network models to find sensitive biomarkers of schizophrenia from brain structural network features, and the accuracy of auxiliary diagnosis with these models is low.
Summary of the invention
The technical problem to be solved by the invention is to provide a method for constructing an individualized prediction model of first-episode schizophrenia that addresses the low accuracy of auxiliary diagnosis based on existing SCH brain structural network models.
The technical solution adopted by the invention to solve this problem is as follows:
A method for constructing an individualized prediction model of first-episode schizophrenia, comprising the following steps:
A. acquiring diffusion tensor images of patients with first-episode schizophrenia;
B. preprocessing the acquired diffusion tensor images;
C. constructing sparse brain structural networks from the preprocessed images;
D. using a similarity network fusion method to construct, for each subject, a multi-threshold fused brain structural network from the sparsified networks;
E. extracting topological attribute features of the multi-threshold fused brain structural network and then performing feature screening;
F. training a classifier on the screened features to obtain the individualized prediction model of first-episode schizophrenia;
G. verifying and evaluating the performance of the trained individualized prediction model of first-episode schizophrenia.
As a further optimization, in step A, the diffusion tensor images of patients with first-episode schizophrenia are acquired on a magnetic resonance imaging scanner using single-shot echo-planar imaging (EPI).
As a further optimization, in step B, preprocessing the acquired diffusion tensor images specifically comprises:
B1. converting the diffusion tensor images from DICOM format to NIFTI format with MRI convert (a magnetic resonance imaging conversion tool);
B2. applying eddy current correction and head motion correction to the converted diffusion tensor images;
B3. applying the FSL Brain Extraction Tool to strip the skull and remove non-brain tissue from the images.
As a further optimization, in step C, constructing the sparse brain structural networks from the preprocessed images specifically comprises:
C1. In diffusion tensor space, register each subject's preprocessed brain images to the b0 image using linear registration with rotation and translation; then register the b0 image to the T1 image in standard MNI space; invert the resulting transformation matrix and use the inverse to transform the AAL template from MNI space into diffusion tensor space, which yields 90 brain-region network nodes per subject as defined by the AAL template;
C2. Using probabilistic tractography, perform sampling-based Bayesian estimation of the diffusion parameters with the BEDPOSTX tool: Markov Chain Monte Carlo (MCMC) sampling builds a distribution of the diffusion parameters at each voxel, a crossing-fiber model is assumed at every brain voxel, and the number of crossing fiber populations is determined automatically;
C3. Reconstruct fiber tracts by probabilistic tracking with the PROBTRACKX tool: the distribution of the principal diffusion direction at each voxel is sampled repeatedly, a streamline is generated from each local sample, and the repeated sampling builds up the posterior distribution of streamline locations, which gives the distribution of structural connection probabilities between every pair of brain regions; the weight of each edge is defined as the fiber-tract connection probability between the two node regions, so that each subject obtains a symmetric 90×90 weighted network matrix of fiber-tract connection probabilities;
C4. Set a threshold on the fiber-tract connection probability; two brain regions whose connection probability exceeds the threshold are considered structurally connected. The influence of different sparsity thresholds on the fusion result is tested, and a relatively narrow threshold range (5% to 40%, in steps of 1%) is used to construct the sparse structural networks.
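Step C4 can be sketched as follows. This is only an illustration under one assumption that the text does not spell out: a sparsity of s is read as "keep the strongest fraction s of all possible edges". The function name `threshold_by_sparsity` and the variable `sparsities` are hypothetical.

```python
import numpy as np

def threshold_by_sparsity(W, sparsity):
    """Keep the strongest `sparsity` fraction of edges of a symmetric weighted matrix."""
    n = W.shape[0]
    iu = np.triu_indices(n, k=1)             # each undirected edge counted once
    weights = W[iu]
    n_keep = int(round(sparsity * weights.size))
    out = np.zeros_like(W, dtype=float)
    if n_keep == 0:
        return out
    keep = np.argsort(weights)[::-1][:n_keep]  # indices of the largest weights
    rows, cols = iu[0][keep], iu[1][keep]
    out[rows, cols] = W[rows, cols]
    out[cols, rows] = W[rows, cols]            # restore symmetry
    return out

# 5% to 40% in steps of 1% gives 36 sparsity levels
sparsities = np.arange(5, 41) / 100.0
```

Applying `threshold_by_sparsity` once per entry of `sparsities` yields one sparse network per threshold for each subject, which is the input the fusion step expects.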
As a further optimization, in step D, using the similarity network fusion method to construct each subject's multi-threshold fused brain structural network specifically comprises:
D1. The sparsified structural connectivity matrix is defined as the full kernel matrix W_i^j. For the full kernel matrix W_i^j of the i-th subject under the j-th threshold, a corresponding sparse kernel matrix is further constructed:
Let δ_u be the k nearest neighbors of node u in the full kernel matrix W_i^j (including node u itself); the sparse kernel matrix S_i^j is then defined as:
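The defining formula is not reproduced in this text. Assuming the method follows the standard similarity network fusion construction, which is an assumption rather than something the source states, the sparse kernel would keep each node's k nearest neighbors and renormalize the row (the summation index t is introduced here only for illustration):

```latex
S_i^j(u,v) =
\begin{cases}
\dfrac{W_i^j(u,v)}{\sum_{t \in \delta_u} W_i^j(u,t)}, & v \in \delta_u \\[2ex]
0, & \text{otherwise}
\end{cases}
```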
D2. Iteratively update each full kernel matrix using its corresponding sparse kernel matrix:
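The update formula is likewise missing from this text. Under the same assumption of the standard similarity network fusion scheme, and using the symbols defined for this step, it would take the form:

```latex
\left(W_i^j\right)^{(m+1)} = S_i^j \times
\left( \frac{1}{N-1} \sum_{c \neq j} \left(W_i^c\right)^{(m)} \right)
\times \left(S_i^j\right)^{\mathsf{T}}
```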
where (W_i^c)^(m) denotes the full kernel matrix of the i-th subject under the c-th threshold at the m-th iteration, (W_i^j)^(m+1) denotes the full kernel matrix at the (m+1)-th iteration, and N is the total number of sparsity thresholds;
D3. Check whether the convergence condition is satisfied; if so, go to step D4, otherwise continue iterating;
where the convergence condition is: ||(W_i^j)^(m+1) - (W_i^j)^(m)|| ≤ 0.01;
D4. Average the updated full kernel matrices W_i^j over the N thresholds to construct an averaged full kernel matrix for each subject: W_i = (1/N) Σ_j W_i^j;
D5. Normalize the elements of W_i to the interval [0, 1] to generate the final fused network for each subject.
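Steps D1 to D5 can be sketched in NumPy as below. This is a minimal illustration, not the patented implementation: it assumes the standard similarity-network-fusion update, the Frobenius norm for the convergence check, min-max scaling for the final normalization, and an iteration cap added as a safeguard. The function names and the default `k` are hypothetical.

```python
import numpy as np

def sparse_kernel(W, k):
    """Row-normalized kernel over each node's k nearest neighbors (node itself included)."""
    n = W.shape[0]
    S = np.zeros((n, n))
    for u in range(n):
        row = W[u].astype(float).copy()
        row[u] = -np.inf                        # rank the other nodes only
        nbrs = np.argsort(row)[::-1][:k - 1]    # k-1 strongest neighbors
        idx = np.append(nbrs, u)                # plus node u itself
        total = W[u, idx].sum()
        if total > 0:                           # isolated nodes keep an all-zero row
            S[u, idx] = W[u, idx] / total
    return S

def fuse_thresholds(W_list, k=10, tol=0.01, max_iter=50):
    """Fuse the N thresholded networks of one subject into a single [0, 1] network."""
    W = [w.astype(float) for w in W_list]
    S = [sparse_kernel(w, k) for w in W]        # D1: one sparse kernel per threshold
    N = len(W)
    for _ in range(max_iter):
        total = sum(W)
        # D2: diffuse the mean of the other N-1 networks through S_j
        new = [S[j] @ ((total - W[j]) / (N - 1)) @ S[j].T for j in range(N)]
        delta = max(np.linalg.norm(a - b) for a, b in zip(new, W))
        W = new
        if delta <= tol:                        # D3: convergence check
            break
    fused = sum(W) / N                          # D4: average over thresholds
    lo, hi = fused.min(), fused.max()
    if hi == lo:
        return np.zeros_like(fused)
    return (fused - lo) / (hi - lo)             # D5: scale to [0, 1]
```

Because each update has the form S M Sᵀ with a symmetric M, the fused matrix stays symmetric, so it can be passed directly to graph-theoretical analysis.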
As a further optimization, in step E, extracting the topological attribute features of the multi-threshold fused brain structural network specifically comprises:
Using graph-theoretical analysis, compute the AUC values over all thresholds of 8 global topological attributes and 3 nodal topological attributes of the fused network, and use them as the initial features for subsequent classification;
The 8 global topological attributes are the network strength S_p, global efficiency E_glob, local efficiency E_loc, shortest path length L_p, clustering coefficient C_p, normalized shortest path length λ, normalized clustering coefficient γ, and small-world property σ:
The network strength S_p is calculated as:
where S(i) is the sum of the weights of the edges connected to the i-th node, and N is the number of brain regions in the whole-brain network;
The shortest path length L_p is calculated as:
where L_ij is the shortest path between node i and node j, and L_p is the shortest path length of the whole network G;
The global efficiency E_glob is calculated as:
where E_glob_i(G) is the global efficiency of node i, and the global efficiency of network G is the average of the global efficiencies of all nodes in the network;
The local efficiency E_loc is calculated as:
where L_jk is the shortest path length between region j and region k, G_i is the sub-network formed by the nodes connected to region i, and N_Gi is the number of brain regions in sub-network G_i; E_loc_i(G) is the local efficiency of node i, and the local efficiency E_loc(G) of network G is the average of the local efficiencies of all nodes in the network;
The clustering coefficient C_p is calculated as:
where C(i) is the clustering coefficient of node i, and the clustering coefficient of network G is the average of the clustering coefficients of all nodes;
The normalized clustering coefficient γ and the normalized shortest path length λ are calculated as:
where C_p^rand and L_p^rand are the averages of the clustering coefficient and the shortest path length, respectively, over 100 random networks;
The small-world property σ is calculated as:
σ=γ/λ;
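Several of the global attributes can be sketched with plain NumPy. The formulas themselves are not reproduced in this text, so the code follows common graph-theory conventions as assumptions: edge length is taken as 1/weight, L_p is the mean shortest path over reachable node pairs, E_glob is the mean inverse shortest path, and γ and λ are ratios to the random-network averages. All function names are hypothetical.

```python
import numpy as np

def shortest_paths(W):
    """All-pairs shortest path lengths; edge length is assumed to be 1 / weight."""
    n = W.shape[0]
    D = np.full((n, n), np.inf)
    connected = W > 0
    D[connected] = 1.0 / W[connected]
    np.fill_diagonal(D, 0.0)
    for k in range(n):                       # Floyd-Warshall relaxation via node k
        D = np.minimum(D, D[:, [k]] + D[[k], :])
    return D

def global_metrics(W):
    """Network strength, shortest path length and global efficiency (zero diagonal assumed)."""
    n = W.shape[0]
    off_diag = ~np.eye(n, dtype=bool)
    D = shortest_paths(W)
    reachable = off_diag & np.isfinite(D)
    inv = np.zeros_like(D)
    inv[reachable] = 1.0 / D[reachable]      # unreachable pairs contribute 0
    return {
        "Sp": float(W.sum(axis=1).mean()),
        "Lp": float(D[reachable].mean()) if reachable.any() else float("inf"),
        "Eglob": float(inv[off_diag].mean()),
    }

def small_world(cp, lp, cp_rand, lp_rand):
    """Return (gamma, lambda, sigma) given network and random-network averages."""
    gamma = cp / cp_rand
    lam = lp / lp_rand
    return gamma, lam, gamma / lam
```

For a three-node chain with unit weights, `global_metrics` gives S_p = 4/3, L_p = 4/3 and E_glob = 5/6, which matches a hand calculation.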
The 3 nodal topological attributes are the nodal degree D_nodal(i), nodal efficiency E_nodal(i), and nodal betweenness centrality B_nodal(i), defined respectively as follows:
where e_st is the number of all shortest paths from node s to node t in network G, and e_sit is the number of those shortest paths that pass through node i.
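The initial features are described as AUC values of each attribute across all thresholds. A minimal sketch of that integration with the trapezoidal rule is given below; it assumes the attribute has already been evaluated at every sparsity level, and the function name `auc_over_thresholds` is hypothetical.

```python
import numpy as np

def auc_over_thresholds(values, thresholds):
    """Area under a metric-versus-threshold curve, using the trapezoidal rule."""
    v = np.asarray(values, dtype=float)
    t = np.asarray(thresholds, dtype=float)
    # mean height of each adjacent pair times the width of the interval
    return float(np.sum((v[1:] + v[:-1]) / 2.0 * np.diff(t)))
```

Summarizing each attribute by a single area removes the need to pick one threshold at which to report it.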
As a further optimization, in step E, the feature screening specifically comprises:
A support vector machine based Recursive Feature Elimination (RFE) algorithm performs feature selection by repeatedly training the classifier and removing the feature dimensions with the smallest weights, as follows:
① initialize the selected feature set to contain all candidate features;
② train the classifier with the feature set as input to obtain the classification performance and the weight of each feature;
③ remove the feature with the smallest weight to form a new feature set;
④ repeat ② and ③ until the feature set with the best classification performance is found.
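The loop in ① to ④ corresponds closely to recursive feature elimination with cross-validation as provided by scikit-learn. The sketch below is one possible realization, not the patent's own code: it assumes a linear-kernel SVM (so that per-feature weights exist), accuracy as the performance measure, and 5-fold stratified cross-validation. `select_features` is a hypothetical name.

```python
import numpy as np
from sklearn.feature_selection import RFECV
from sklearn.model_selection import StratifiedKFold
from sklearn.svm import SVC

def select_features(X, y, n_splits=5, seed=0):
    """Drop the lowest-weight feature one at a time; keep the best-scoring subset."""
    cv = StratifiedKFold(n_splits=n_splits, shuffle=True, random_state=seed)
    selector = RFECV(
        estimator=SVC(kernel="linear"),   # linear kernel exposes coef_ as feature weights
        step=1,                           # remove one feature per iteration
        cv=cv,
        scoring="accuracy",
    )
    selector.fit(X, y)
    # support_: boolean mask of kept features; ranking_: 1 marks a kept feature
    return selector.support_, selector.ranking_
```

To avoid optimistic bias, the selection would normally be run inside each training fold rather than on the full data set before evaluation.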
As a further optimization, in step F, the classifier is an SVM classifier with a radial basis function (RBF) kernel, a logistic regression classifier, or an ensemble of multiple classifiers; during classification training, several pattern recognition classification methods are applied to find the optimal classifier model and to screen the key features of the multi-threshold fused brain structural network.
As a further optimization, in step G, cross-validation is used to evaluate the performance of the candidate prediction models on the schizophrenia training and test sets, including accuracy, sensitivity, specificity, the ROC curve and the AUC value. By testing feature screening and classification algorithms and applying permutation tests, the system optimizes brain imaging feature selection and the individual subtype prediction model and identifies the core subtype-related brain imaging features, so as to improve the accuracy of individualized prediction.
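The evaluation in steps F and G can be illustrated with scikit-learn. The sketch below makes several assumptions not stated in the source: 5-fold stratified cross-validation, feature standardization before each classifier, patients coded as label 1 and controls as label 0, and accuracy as the permutation-test statistic. The function names are hypothetical.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix, roc_auc_score
from sklearn.model_selection import (StratifiedKFold, cross_val_predict,
                                     permutation_test_score)
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

def candidate_models():
    """Two of the classifier types named in step F."""
    return {
        "svm_rbf": make_pipeline(StandardScaler(), SVC(kernel="rbf")),
        "logreg": make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000)),
    }

def evaluate(model, X, y, seed=0):
    """Cross-validated accuracy, sensitivity, specificity and AUC for binary labels 0/1."""
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=seed)
    scores = cross_val_predict(model, X, y, cv=cv, method="decision_function")
    pred = (scores > 0).astype(int)          # positive score -> class 1
    tn, fp, fn, tp = confusion_matrix(y, pred, labels=[0, 1]).ravel()
    return {
        "accuracy": (tp + tn) / len(y),
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "auc": roc_auc_score(y, scores),
    }

def permutation_p_value(model, X, y, n_permutations=100, seed=0):
    """P-value of the cross-validated accuracy against label-shuffled data."""
    cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=seed)
    _, _, p_value = permutation_test_score(
        model, X, y, cv=cv, scoring="accuracy",
        n_permutations=n_permutations, random_state=seed,
    )
    return p_value
```

Calling `evaluate` on each entry of `candidate_models()` gives a like-for-like comparison, and `permutation_p_value` indicates whether the best score could plausibly arise by chance.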
The beneficial effects of the invention are:
(1) A multi-threshold fused network is constructed by integrating the complementary information that the original network provides under different topological views, so that the topological attribute features of a fused network that does not depend on any single threshold serve as the initial features for classification; classification training is then carried out after feature screening. This approach improves classification accuracy while taking the contribution of structural indicators into account, which makes it interpretable;
(2) The artificial intelligence model built on fused multi-threshold brain structural network data can automatically acquire data to update itself and establish objective biomarkers for the early diagnosis of schizophrenia; the model classifies patients with first-episode schizophrenia with very high reliability and high stability.
Description of drawings
FIG. 1 is a flowchart of the method for constructing the individualized prediction model of first-episode schizophrenia according to the invention.
Detailed description
The invention aims to provide a method for constructing an individualized prediction model of first-episode schizophrenia that addresses the low accuracy of auxiliary diagnosis based on existing SCH brain structural network models. Its core idea is: acquire single-shot echo-planar diffusion tensor images of patients with first-episode schizophrenia; preprocess the diffusion tensor images; construct sparse brain structural networks from the preprocessed images; use the similarity network fusion method to construct each subject's multi-threshold structural network from the sparsified networks; extract the topological attribute features of the fused multi-threshold brain structural network and, after feature screening, carry out classification training to obtain the individualized prediction model of first-episode schizophrenia; finally, verify and evaluate the performance of the trained model.
Networks obtained under different sparsity thresholds can be regarded as different feature representations of the same subject's brain network. By fusing the networks from multiple sparsity thresholds, the invention obtains richer topological information, which benefits the subsequent classification. It also avoids two problems of sparsifying the network with a single fixed threshold: the noise left at different levels and the subjectivity of choosing that threshold. In addition, the network topological attributes obtained through graph-theoretical analysis reflect high-level properties of the network; using them as classification features gives better classification results than using the raw network connection values, which reflect only low-level information.
In a specific implementation, the flow of the method for constructing the individualized prediction model of first-episode schizophrenia is shown in FIG. 1 and comprises the following steps:
1. Acquire diffusion tensor images of patients with first-episode schizophrenia;
In this step, Philips 3.0T and GE 3.0T magnetic resonance imaging scanners are used to acquire the data that serve as the data sets for the training module and the test module. As a specific implementation, the scan parameters are as follows: 32 axial-plane directions, TR = 10295 ms, TE = 91 ms, FOV = 128 × 128 mm², flip angle = 90°, slice thickness = 4 mm, matrix = 256 × 256, voxel size = 2 × 2 × 2 mm³, b = 1000 s/mm². 3D T1 structural images are also acquired to improve registration of the DTI data, with the following scan parameters: TR = 8.4 ms, TE = 3.8 ms, FOV = 256 × 256 mm², flip angle = 90°, slice thickness = 1 mm, contiguous slices with no gap, matrix = 256 × 256, voxel size = 1 × 1 × 1 mm³; 188 slices are acquired across the whole brain.
2. Preprocess the acquired diffusion tensor images;
In this step, as a specific implementation, the preprocessing is as follows:
① convert the diffusion tensor images from DICOM format to NIFTI format with MRI convert;
② apply eddy current correction and head motion correction to the converted diffusion tensor images;
③ apply the FSL Brain Extraction Tool to strip the skull and remove non-brain tissue from the images.
3、基于预处理后的图像构建稀疏脑结构网络;3. Build a sparse brain structure network based on the preprocessed images;
本步骤中,作为具体实施手段,构建稀疏脑结构网络的过程如下:In this step, as a specific implementation method, the process of constructing a sparse brain structure network is as follows:
①在弥散张量空间用旋转和平移的线性配准法将各被试预处理后的脑图像配准到b0像;然后将配准后的b0像配准到标准MNI空间中的T1像;通过转换矩阵求逆,利用得到的逆矩阵将AAL模板从MNI空间变换到弥散张量空间,获得每个被试基于AAL模板划分的90个脑区网络节点;①Register the preprocessed brain image of each subject to the b0 image in the diffusion tensor space using the linear registration method of rotation and translation; then register the registered b0 image to the T1 image in the standard MNI space; By inverting the transformation matrix, using the obtained inverse matrix to transform the AAL template from the MNI space to the diffusion tensor space, and obtaining 90 brain area network nodes divided by the AAL template for each subject;
② Using probabilistic tractography, run the BEDPOSTX tool to obtain a sampling-based Bayesian estimate of the diffusion parameters: Markov chain Monte Carlo sampling builds a distribution over the diffusion parameters at each voxel, every voxel in the brain is modeled with a crossing-fiber model, and the number of fiber populations crossing in each voxel is determined automatically;
③ Reconstruct the fiber tracts by probabilistic tracking with the PROBTRACKX tool: repeatedly sample from the distribution of principal diffusion directions at each voxel, generate a streamline from each set of local samples, and accumulate the samples into a posterior distribution of streamline locations, which gives the distribution of structural connection probability between every pair of brain regions; the weight of each edge is defined as the fiber connection probability between the two node regions, so each subject obtains a symmetric 90×90 weighted network matrix of fiber connection probabilities;
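One way to turn tractography output into the symmetric weighted matrix of step ③ is sketched below. It assumes, beyond what the text states, that the connection probability from region i to region j is the number of streamlines seeded in i that reach j divided by the total number seeded in i, and that the two directions are averaged to obtain symmetry. The inputs `streamline_counts` and `seeds_per_region` are hypothetical.

```python
import numpy as np


def connection_probability(streamline_counts: np.ndarray,
                           seeds_per_region: np.ndarray) -> np.ndarray:
    """Symmetric matrix of connection probabilities from streamline counts.

    streamline_counts[i, j]: streamlines seeded in region i that reach region j.
    seeds_per_region[i]: total number of streamlines seeded in region i.
    """
    prob = streamline_counts / seeds_per_region[:, None]  # directed probability i -> j
    sym = (prob + prob.T) / 2.0                           # average both directions
    np.fill_diagonal(sym, 0.0)                            # no self-connections
    return sym


# Toy example with 3 regions instead of 90.
counts = np.array([[0, 500, 50],
                   [300, 0, 0],
                   [100, 200, 0]], dtype=float)
seeds = np.array([1000.0, 1000.0, 2000.0])
W = connection_probability(counts, seeds)
print(W)
```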
④ Set a threshold on the fiber connection probability; two brain regions whose connection probability exceeds the threshold are considered structurally connected. To test how the sparsity threshold affects the fusion result, construct the sparse structural networks over a relatively narrow range of sparsity thresholds (5% to 40% in steps of 1%).
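The multi-threshold sparsification of step ④ can be sketched as follows. The sketch assumes that "sparsity" means the fraction of strongest edges retained out of all possible edges, which is an interpretation of the text; the function name `sparsify` and the random test matrix are hypothetical.

```python
import numpy as np


def sparsify(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Keep the strongest `sparsity` fraction of edges of a symmetric matrix."""
    n = weights.shape[0]
    iu = np.triu_indices(n, k=1)                  # visit each undirected edge once
    edge_weights = weights[iu]
    n_keep = int(round(sparsity * edge_weights.size))
    out = np.zeros_like(weights)
    if n_keep > 0:
        keep = np.argsort(edge_weights)[::-1][:n_keep]   # strongest edges first
        rows, cols = iu[0][keep], iu[1][keep]
        out[rows, cols] = weights[rows, cols]
        out[cols, rows] = weights[rows, cols]            # restore symmetry
    return out


# Sparsity thresholds from 5% to 40% in steps of 1%.
sparsities = [s / 100 for s in range(5, 41)]

rng = np.random.default_rng(0)
a = rng.random((90, 90))
full = np.triu(a, 1) + np.triu(a, 1).T            # random symmetric 90x90 test matrix
networks = [sparsify(full, s) for s in sparsities]
print(len(networks), np.count_nonzero(networks[0]) // 2)
```

With 90 nodes there are 4005 possible edges, so the 5% network keeps about 200 of them and each subject ends up with one network per threshold.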
4. Construct each subject's multi-threshold fused brain structural network from the sparsified networks using the similarity network fusion method;
In this step, as a specific implementation, the multi-threshold fused brain structural network is constructed as follows:
① Define the sparsified structural connectivity matrix as the full kernel matrix W_i^j. For the full kernel matrix W_i^j of the i-th subject at the j-th threshold, construct the corresponding sparse kernel matrix, which encodes the strong connections that remain after the network is sparsified:
Let δ_u denote the k nearest neighbors of node u in the full kernel matrix W_i^j (including node u itself); the sparse kernel matrix S_i^j is then defined as:
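In the standard similarity network fusion formulation, which this description appears to follow, the sparse kernel keeps only the entries of each node's k nearest neighbors and normalizes over them. The form below is given under that assumption, not as the patent's own formula; u and v index nodes and q runs over the neighbors of u:

$$
S_i^j(u,v) =
\begin{cases}
\dfrac{W_i^j(u,v)}{\sum_{q \in \delta_u} W_i^j(u,q)}, & v \in \delta_u \\[2ex]
0, & \text{otherwise}
\end{cases}
$$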
② Iteratively update each full kernel matrix using its corresponding sparse kernel matrix:
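A form consistent with the symbols defined next, assuming the usual cross-diffusion step of similarity network fusion in which each network is updated from the average of all the other networks, is:

$$
(W_i^j)^{(m+1)} = S_i^j \times \left( \frac{\sum_{c \neq j} (W_i^c)^{(m)}}{N-1} \right) \times (S_i^j)^{T}
$$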
where (W_i^c)^(m) is the full kernel matrix of the i-th subject at the c-th threshold in the m-th iteration, (W_i^j)^(m+1) is the full kernel matrix in the (m+1)-th iteration, and N is the total number of sparsity thresholds;
By interacting with every threshold network other than itself, the full kernel matrix W_i^j integrates the complementary information that the original network provides under other topological views. Meanwhile, the sparse kernel matrix S_i^j steers the iteration through the strongest connections of the corresponding full kernel matrix W_i^j, which effectively suppresses noise. Viewed as matrix multiplication, the update means that the connection value between any two nodes in W_i^j also depends on the k nearest neighbors of those nodes in the other threshold networks. In particular, if the k nearest neighbors of two nodes are strongly connected in the other threshold networks, the connection between the two nodes is strengthened by the update (even if it was originally weak), and vice versa.
③ Check whether the convergence condition is satisfied; if so, go to step ④, otherwise continue iterating;
The convergence condition is: ||(W_i^j)^(m+1) - (W_i^j)^(m)|| ≤ 0.01;
④ Average the updated full kernel matrices W_i^j over the N thresholds to obtain an averaged full kernel matrix for each subject:
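Assuming a plain arithmetic mean over the N thresholds, the averaged matrix W_i can be written as:

$$
W_i = \frac{1}{N} \sum_{j=1}^{N} W_i^j
$$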
⑤ Normalize the elements of W_i to the interval [0, 1], which gives the final fused network for each subject.
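Steps ① to ⑤ can be sketched end to end as below. This is an illustration under stated assumptions, not the patent's implementation: the sparse kernel is row-normalized over each node's neighbors, the update averages the other networks, the norm in the convergence test is the Frobenius norm, the final scaling is min-max, and `k`, `max_iter`, and the toy input are made up.

```python
import numpy as np


def sparse_kernel(w: np.ndarray, k: int) -> np.ndarray:
    """Keep each node's k nearest neighbors (itself included), then row-normalize."""
    n = w.shape[0]
    s = np.zeros_like(w, dtype=float)
    for u in range(n):
        others = [v for v in np.argsort(w[u])[::-1] if v != u][:k - 1]
        nbrs = [u] + others                       # delta_u contains u itself
        s[u, nbrs] = w[u, nbrs]
    row_sums = s.sum(axis=1, keepdims=True)
    # Rows without any retained connection stay zero instead of dividing by zero.
    return np.divide(s, row_sums, out=np.zeros_like(s), where=row_sums > 0)


def fuse(networks, k=3, tol=0.01, max_iter=50):
    """Fuse two or more thresholded networks of one subject into one matrix."""
    w = [np.asarray(x, dtype=float) for x in networks]
    s = [sparse_kernel(x, k) for x in w]          # step 1: sparse kernels
    n = len(w)
    for _ in range(max_iter):
        total = sum(w)
        # Step 2: update each network from the average of all the others.
        new_w = [s[j] @ ((total - w[j]) / (n - 1)) @ s[j].T for j in range(n)]
        change = max(np.linalg.norm(a - b) for a, b in zip(new_w, w))
        w = new_w
        if change <= tol:                         # step 3: convergence test
            break
    fused = sum(w) / n                            # step 4: average over thresholds
    lo, hi = fused.min(), fused.max()
    # Step 5: min-max scaling to [0, 1].
    return (fused - lo) / (hi - lo) if hi > lo else np.zeros_like(fused)


rng = np.random.default_rng(0)
a = rng.random((6, 6))
base = np.triu(a, 1) + np.triu(a, 1).T
# Three thresholded views of one toy network, standing in for the sparsity thresholds.
views = [np.where(base >= t, base, 0.0) for t in (0.2, 0.4, 0.6)]
fused = fuse(views, k=3)
print(fused.round(2))
```

The `max_iter` cap is a safeguard added for the sketch; the text itself only specifies the 0.01 convergence criterion.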
5. Extract topological features from the multi-threshold fused brain structural network, then perform feature selection;
In this step, as a specific implementation, graph-theoretical analysis is used to compute 8 global topological attributes and 3 nodal topological attributes of the fused network, and the area under the curve (AUC) of each attribute across all thresholds is taken as an initial feature for subsequent classification. The 8 global topological attributes are network strength (node degree) S_p, global efficiency E_glob, local efficiency E_loc, shortest path length L_p, clustering coefficient C_p, normalized shortest path length λ, normalized clustering coefficient γ, and small-worldness σ. They are defined as follows:
Network strength (node degree) S_p reflects important network evolution characteristics. The degree of a node is defined as the sum of the weights of the edges directly connected to it; the larger a node's degree, the more connections it has and the more important its position in the network. It is defined as:
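A standard form consistent with these definitions, given here as an assumption and with w_{ij} introduced as the weight of the edge between nodes i and j, is:

$$
S_p = \frac{1}{N} \sum_{i \in G} S(i), \qquad S(i) = \sum_{j \neq i} w_{ij}
$$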
where S(i) is the sum of the weights of the edges connected to the i-th node and N is the number of brain regions in the whole-brain network. The strength of the network is the average degree over all of its nodes.
Shortest path length L_p: the shortest path length of a network is the average of the shortest path lengths between all pairs of nodes, and it reflects the operating efficiency of the whole network. Information is transmitted faster along shortest paths, which saves system resources. It is defined as:
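The usual form, assumed here and averaging over all ordered pairs of distinct nodes among the N nodes of the network, is:

$$
L_p = \frac{1}{N(N-1)} \sum_{i \neq j \in G} L_{i,j}
$$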
where L_{i,j} is the shortest path length between node i and node j, and L_p is the shortest path length of the whole network G. The computation of L_p therefore requires a connected network: if node i cannot reach node j by any route, L_{i,j} is undefined or infinite, and L_p is undefined as well.
Global efficiency E_glob describes the efficiency of information transmission within the network. The global efficiency of node i is defined in terms of the shortest path length by the following formula:
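In the usual formulation, assumed here, the efficiency of node i is the mean inverse shortest path length from i to every other node:

$$
E_{glob}(i) = \frac{1}{N-1} \sum_{j \neq i \in G} \frac{1}{L_{i,j}}
$$

Because an unreachable node contributes 1/∞ = 0, this quantity remains defined on disconnected networks, unlike L_p.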
The formula shows that the shorter a node's shortest paths, the faster information passes between it and the other nodes, that is, the higher the node's global efficiency.
The global efficiency E_glob of network G is defined as the average of the global efficiencies of all nodes in the network:
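Written out, and assuming the nodal efficiency is the mean inverse shortest path length from a node to all others, this average is:

$$
E_{glob} = \frac{1}{N} \sum_{i \in G} E_{glob}(i) = \frac{1}{N(N-1)} \sum_{i \neq j \in G} \frac{1}{L_{i,j}}
$$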
Local efficiency E_loc measures how compact the "cliques" formed by neighboring nodes in the network are, and it is also an important indicator of the network's redundancy and of its tolerance to external attack. The local efficiency of node i and the local efficiency of network G are defined as:
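A standard form that matches the symbols explained next, given as an assumption, is:

$$
E_{loc}(i) = \frac{1}{N_{G_i}(N_{G_i}-1)} \sum_{j \neq k \in G_i} \frac{1}{L_{jk}}, \qquad
E_{loc} = \frac{1}{N} \sum_{i \in G} E_{loc}(i)
$$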
where L_{jk} is the shortest path length between region j and region k, G_i is the sub-network formed by the nodes connected to region i, N_{G_i} is the number of brain regions in sub-network G_i, and N is the number of nodes in the whole-brain network G.
The clustering coefficient C_p is an important measure of the cliquishness and the degree of local interconnectivity of a network. The clustering coefficient C(i) of node i is the ratio of the number of edges that exist among the nodes directly connected to node i in network G to the maximum possible number of edges among those nodes, as defined by the following formula. The clustering coefficient C_p of network G is defined as the average of the clustering coefficients of all nodes.
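For an unweighted network this ratio is usually written as below, where k_i (the number of neighbors of node i) and e_i (the number of edges among those neighbors) are symbols introduced here for illustration:

$$
C(i) = \frac{2 e_i}{k_i (k_i - 1)}, \qquad C_p = \frac{1}{N} \sum_{i \in G} C(i)
$$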
A network that has both a high clustering coefficient and a short shortest path length is considered to be small-world. To decide quantitatively whether a network is small-world, its clustering coefficient and shortest path length are generally compared with the corresponding properties of random networks.
The normalized clustering coefficient γ and the normalized shortest path length λ are computed with the following formulas:
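The customary ratios, assumed here with the superscript "real" marking the network under study (a label introduced for illustration), are:

$$
\gamma = \frac{C_p^{real}}{C_p^{rand}}, \qquad \lambda = \frac{L_p^{real}}{L_p^{rand}}
$$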
where C_p^rand and L_p^rand are the mean clustering coefficient and the mean shortest path length of 100 random networks.
Small-worldness σ is defined as σ = γ/λ. If γ > 1 and λ ≈ 1, so that σ > 1, the network is judged to be small-world.
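The small-world test can be illustrated on a toy unweighted network. The sketch below assumes binary graphs and uses random graphs with the same number of nodes and edges as the reference (degree-preserving rewiring is a common alternative); the ring-lattice example and all function names are hypothetical.

```python
import itertools
import random


def shortest_paths(adj):
    """All-pairs shortest path lengths of an unweighted graph (Floyd-Warshall)."""
    n = len(adj)
    inf = float("inf")
    d = [[0 if i == j else (1 if adj[i][j] else inf) for j in range(n)]
         for i in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d


def path_length(adj):
    """L_p: mean shortest path length over all ordered pairs of distinct nodes."""
    n = len(adj)
    d = shortest_paths(adj)
    return sum(d[i][j] for i in range(n) for j in range(n) if i != j) / (n * (n - 1))


def clustering(adj):
    """C_p: mean over nodes of existing / possible edges among a node's neighbors."""
    n = len(adj)
    total = 0.0
    for i in range(n):
        nbrs = [j for j in range(n) if adj[i][j]]
        k = len(nbrs)
        if k >= 2:
            e = sum(adj[a][b] for a, b in itertools.combinations(nbrs, 2))
            total += 2 * e / (k * (k - 1))
    return total / n


def random_graph(n, m, rng):
    """Random graph with n nodes and m edges, resampled until it is connected."""
    pairs = list(itertools.combinations(range(n), 2))
    while True:
        adj = [[0] * n for _ in range(n)]
        for i, j in rng.sample(pairs, m):
            adj[i][j] = adj[j][i] = 1
        if path_length(adj) < float("inf"):
            return adj


n, rng = 20, random.Random(0)
# Ring lattice: every node is linked to its two nearest neighbors on each side.
ring = [[1 if 0 < min((i - j) % n, (j - i) % n) <= 2 else 0 for j in range(n)]
        for i in range(n)]
m = sum(map(sum, ring)) // 2
randoms = [random_graph(n, m, rng) for _ in range(100)]
c_rand = sum(clustering(g) for g in randoms) / len(randoms)
l_rand = sum(path_length(g) for g in randoms) / len(randoms)
gamma = clustering(ring) / c_rand
lam = path_length(ring) / l_rand
sigma = gamma / lam
print(round(gamma, 2), round(lam, 2), round(sigma, 2))
```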
The 3 nodal topological attributes are nodal degree D_nodal(i), nodal efficiency E_nodal(i), and nodal betweenness centrality B_nodal(i), which are defined as follows:
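Standard forms of the three measures, given as an assumption and with w_{ij} introduced as the weight of the edge between nodes i and j, are:

$$
D_{nodal}(i) = \sum_{j \neq i \in G} w_{ij}, \qquad
E_{nodal}(i) = \frac{1}{N-1} \sum_{j \neq i \in G} \frac{1}{L_{i,j}}, \qquad
B_{nodal}(i) = \sum_{s \neq i \neq t \in G} \frac{e_{sit}}{e_{st}}
$$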
Betweenness centrality defines how central a node is from the perspective of information flow. In the formula, e_st is the number of shortest paths from node s to node t in network G, and e_sit is the number of those shortest paths that pass through node i.
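The AUC features mentioned at the start of this step summarize each attribute over the whole threshold range in a single number. A minimal sketch using the trapezoidal rule follows; the rule itself and the linear example values are assumptions made for illustration.

```python
def auc_over_thresholds(thresholds, values):
    """Trapezoidal area under a metric-versus-threshold curve."""
    area = 0.0
    for k in range(1, len(thresholds)):
        width = thresholds[k] - thresholds[k - 1]
        area += width * (values[k] + values[k - 1]) / 2.0
    return area


# Sparsity thresholds from 5% to 40% in steps of 1%.
sparsities = [s / 100 for s in range(5, 41)]
# Hypothetical values of one attribute (for example global efficiency), one per threshold.
efficiency = [0.2 + 0.5 * s for s in sparsities]
feature = auc_over_thresholds(sparsities, efficiency)
print(round(feature, 6))
```

Using the area rather than the value at a single threshold avoids having to pick one threshold arbitrarily.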
In this step, feature selection specifically comprises:
The support-vector-machine-based recursive feature elimination (RFE) algorithm selects features by repeatedly training the classifier and removing the feature dimensions with the smallest weights, as follows:
① Initialize the candidate feature set to contain all features;
② Train the classifier with the feature set as input, obtaining the classification performance and the weight of each feature;
③ Remove the feature with the smallest weight to form a new feature set;
④ Repeat ② and ③, and select the feature set that gives the best classification performance.
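The elimination loop of steps ① to ④ can be sketched as below. To keep the example dependent on NumPy only, a least-squares linear classifier stands in for the linear SVM, and performance is measured on the training data; both are simplifications for illustration, and in practice performance would be estimated by cross-validation. All names and the synthetic data are hypothetical.

```python
import numpy as np


def fit_linear(x, y):
    """Least-squares linear classifier; a stand-in for the linear SVM of SVM-RFE."""
    xb = np.hstack([x, np.ones((x.shape[0], 1))])        # append a bias column
    coef, *_ = np.linalg.lstsq(xb, y, rcond=None)
    acc = float(np.mean(np.sign(xb @ coef) == y))
    return coef[:-1], acc


def rfe(x, y):
    """Recursive feature elimination: drop the smallest-|weight| feature each round."""
    remaining = list(range(x.shape[1]))                  # step 1: all features
    best_set, best_acc = list(remaining), -1.0
    while remaining:
        weights, acc = fit_linear(x[:, remaining], y)    # step 2: train, get weights
        if acc >= best_acc:                              # ties favor the smaller set
            best_set, best_acc = list(remaining), acc
        remaining.pop(int(np.argmin(np.abs(weights))))   # step 3: drop weakest feature
    return best_set, best_acc                            # step 4: best set seen


rng = np.random.default_rng(0)
x = rng.normal(size=(200, 5))
y = np.sign(x[:, 0] + 2.0 * x[:, 2])     # labels depend on features 0 and 2 only
selected, acc = rfe(x, y)
print(selected, round(acc, 3))
```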
6. Train a classifier on the selected features to obtain the individualized prediction model of first-episode schizophrenia;
In this step, as a specific implementation, the classifier is an SVM classifier with a radial basis function (RBF) kernel, a logistic regression classifier, or an ensemble of multiple classifiers. During training, several pattern-recognition classification methods are applied to find the optimal classifier model and to identify the key features of the multi-threshold fused brain structural network.
7. Verify and evaluate the performance of the trained individualized prediction model of first-episode schizophrenia.
In this step, as a specific implementation, cross-validation is used to evaluate the performance of the various prediction models on the schizophrenia training and test sets, in terms of accuracy, sensitivity, specificity, ROC curve, and AUC. By systematically testing feature-selection and classification algorithms and applying permutation tests, the brain-imaging feature selection and the individual subtype prediction model are optimized and the core subtype-related brain-imaging features are identified, which improves individualized prediction accuracy.
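The evaluation measures named above can be computed from predicted labels and scores as in the following sketch. The label coding (1 for patient, 0 for control), the 0.5 decision threshold, and the example values are assumptions for illustration; the ROC AUC is computed as the probability that a randomly chosen patient receives a higher score than a randomly chosen control.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity and specificity for binary labels (1 = patient)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "sensitivity": tp / (tp + fn),    # true positive rate
        "specificity": tn / (tn + fp),    # true negative rate
    }


def roc_auc(y_true, scores):
    """Area under the ROC curve via pairwise comparison of scores."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical held-out labels and classifier scores.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.4, 0.3, 0.2, 0.1, 0.35, 0.6, 0.75]
y_pred = [1 if s >= 0.5 else 0 for s in scores]

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```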
In summary, the invention applies artificial intelligence and machine learning to analyze and mine structural brain magnetic resonance imaging data from first-episode schizophrenia, and constructs a robust individualized prediction model of first-episode schizophrenia for the early diagnosis and identification of the disorder, enabling accurate and objective auxiliary diagnosis and improving treatment outcomes.
Claims (9)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202010454990.1A CN111627553A (en) | 2020-05-26 | 2020-05-26 | Method for constructing individualized prediction model of first-onset schizophrenia |
Publications (1)
| Publication Number | Publication Date |
|---|---|
| CN111627553A true CN111627553A (en) | 2020-09-04 |
Family
ID=72260015
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202010454990.1A Pending CN111627553A (en) | 2020-05-26 | 2020-05-26 | Method for constructing individualized prediction model of first-onset schizophrenia |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN111627553A (en) |
Patent Citations (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH01238696A (en) * | 1988-03-19 | 1989-09-22 | Nozomi Hoshimiya | Collation and recognition system of time series pattern |
| US20130231552A1 (en) * | 2012-03-05 | 2013-09-05 | Siemens Corporation | Method and System for Diagnosis of Attention Deficit Hyperactivity Disorder from Magnetic Resonance Images |
| CN103425983A (en) * | 2013-07-25 | 2013-12-04 | 电子科技大学 | Brain network topology difference fast extracting method based on network synchronicity |
| US20150339570A1 (en) * | 2014-05-22 | 2015-11-26 | Lee J. Scheffler | Methods and systems for neural and cognitive processing |
| WO2016168724A1 (en) * | 2015-04-15 | 2016-10-20 | Sync-Think, Inc. | System and method for concussion detection and quantification |
| CN105726026A (en) * | 2016-01-28 | 2016-07-06 | 电子科技大学 | Mild cognitive impairment disease classifying method based on brain network and brain structure information |
| US20170319123A1 (en) * | 2016-05-06 | 2017-11-09 | The Board Of Trustees Of The Leland Stanford Junior University | Systems and Methods for Using Mobile and Wearable Video Capture and Feedback Plat-Forms for Therapy of Mental Disorders |
| CN106096616A (en) * | 2016-06-08 | 2016-11-09 | 四川大学华西医院 | Magnetic resonance image feature extraction and classification method based on deep learning |
| CN106845137A (en) * | 2017-02-27 | 2017-06-13 | 浙江工业大学 | A kind of sacred disease analysis method based on brain network structure |
| US20190090749A1 (en) * | 2017-09-26 | 2019-03-28 | Washington University | Supervised classifier for optimizing target for neuromodulation, implant localization, and ablation |
| CN109376751A (en) * | 2018-08-21 | 2019-02-22 | 北京工业大学 | A classification method of human brain function network based on convolutional neural network |
| CN110084381A (en) * | 2019-04-22 | 2019-08-02 | 天津工业大学 | A kind of brain network class method based on weight characteristic attribute fusion and the novel kernel of graph |
| CN110298364A (en) * | 2019-06-27 | 2019-10-01 | 安徽师范大学 | Based on the feature selection approach of multitask under multi-threshold towards functional brain network |
Non-Patent Citations (7)
| Title |
|---|
| HONGWEI WEN ET AL.: "Multi-threshold white matter structural networks fusion for accurate diagnosis of early tourette syndrome children" * |
| 余敏;陶玲;钱志余;朱威州;: "基于弥散张量成像的脑结构网络参数研究", no. 05 * |
| 吴斌: "《复杂网络和科技文献知识发现》", 科学技术文献出版社, pages: 28 - 29 * |
| 杜欣;李嘉慧;熊冬生;潘智林;吴逢春;宁玉萍;陈军;吴凯;: "认知负载状态下的精神分裂症患者脑电图特异性研究", no. 01 * |
| 温洪;郭浩;李越;陈俊杰;: "重度抑郁症患者脑功能网络的分类研究", no. 08 * |
| 胡茂林;郑俊杰;唐劲松;廖艳辉;李宗昌;贺莹;陈华富;陈晓岗;宗小芬;: "首发未服药精神分裂症全脑及局部白质脑网络拓扑改变", no. 02 * |
| 薛绍伟;唐一源;李健;张兰华;曹宸;: "一种基于fMRI数据的脑功能网络构建方法", no. 11 * |
Cited By (13)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN112190269B (en) * | 2020-12-04 | 2024-03-12 | 兰州大学 | Construction method of depression auxiliary identification model based on multi-source EEG data fusion |
| CN112190269A (en) * | 2020-12-04 | 2021-01-08 | 兰州大学 | Construction method of depression auxiliary identification model based on multi-source electroencephalogram data fusion |
| CN113283465A (en) * | 2021-04-02 | 2021-08-20 | 电子科技大学 | Diffusion tensor imaging data analysis method and device |
| CN113283465B (en) * | 2021-04-02 | 2022-04-29 | 电子科技大学 | Diffusion tensor imaging data analysis method and device |
| CN113261973B (en) * | 2021-04-13 | 2022-09-02 | 山东师范大学 | Method and system for identifying atrial fibrillation electrocardiosignals through weighted multi-scale finite-crossing visual image |
| CN113261973A (en) * | 2021-04-13 | 2021-08-17 | 山东师范大学 | Method and system for identifying atrial fibrillation electrocardiosignals through weighted multi-scale finite-crossing visual image |
| CN113197578A (en) * | 2021-05-07 | 2021-08-03 | 天津医科大学 | Schizophrenia classification method and system based on multi-center model |
| CN114334130B (en) * | 2021-12-25 | 2023-08-22 | 浙江大学 | Brain symmetry-based PET molecular image computer-aided diagnosis system |
| CN114334130A (en) * | 2021-12-25 | 2022-04-12 | 浙江大学 | PET molecular image computer-aided diagnosis system based on brain symmetry |
| CN114882261A (en) * | 2022-04-20 | 2022-08-09 | 韩少强 | Clustering division method and system suitable for magnetic resonance image and magnetic resonance image information processing equipment |
| CN116486067A (en) * | 2023-04-11 | 2023-07-25 | 南通大学 | A sparse bidirectional Spark method for image feature selection of schizophrenia medical records |
| CN118335319A (en) * | 2024-06-12 | 2024-07-12 | 四川大学 | Early prediction method for common major diseases based on virtual person simulation |
| CN119742081A (en) * | 2025-03-03 | 2025-04-01 | 杭州市第七人民医院(杭州市心理危机研究与干预中心) | A method for classifying neurotransmitter subtypes in schizophrenia |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | RJ01 | Rejection of invention patent application after publication | Application publication date: 20200904 |