[go: up one dir, main page]

CN113341044B - A method for identifying drowning based on metabolomics markers and its application - Google Patents

A method for identifying drowning based on metabolomics markers and its application Download PDF

Info

Publication number
CN113341044B
CN113341044B CN202110682946.0A CN202110682946A CN113341044B CN 113341044 B CN113341044 B CN 113341044B CN 202110682946 A CN202110682946 A CN 202110682946A CN 113341044 B CN113341044 B CN 113341044B
Authority
CN
China
Prior art keywords
metabolites
model
drowned
acid
metabolomics
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110682946.0A
Other languages
Chinese (zh)
Other versions
CN113341044A (en
Inventor
赵锐
王林林
张富源
官大威
王鹏飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Medical University
Original Assignee
China Medical University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Medical University filed Critical China Medical University
Priority to CN202110682946.0A priority Critical patent/CN113341044B/en
Publication of CN113341044A publication Critical patent/CN113341044A/en
Application granted granted Critical
Publication of CN113341044B publication Critical patent/CN113341044B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/04Preparation or injection of sample to be analysed
    • G01N30/06Preparation
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/86Signal analysis
    • G01N30/8675Evaluation, i.e. decoding of the signal into analytical information
    • G01N30/8686Fingerprinting, e.g. without prior knowledge of the sample components
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01NINVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
    • G01N30/00Investigating or analysing materials by separation into components using adsorption, absorption or similar phenomena or using ion-exchange, e.g. chromatography or field flow fractionation
    • G01N30/02Column chromatography
    • G01N30/88Integrated analysis systems specially adapted therefor, not covered by a single one of the groups G01N30/04 - G01N30/86

Landscapes

  • Analytical Chemistry (AREA)
  • Immunology (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Chemical & Material Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Biochemistry (AREA)
  • Pathology (AREA)
  • General Physics & Mathematics (AREA)
  • Engineering & Computer Science (AREA)
  • Library & Information Science (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
  • Investigating Or Analysing Biological Materials (AREA)

Abstract

本发明属于法医学和生物医药技术领域,尤其涉及基于代谢组学筛选的代谢物在判别早期水中尸体是否为溺死的应用。所述用于判别早期水中尸体是否为溺死的标志物,其特征在于所述标志物包括下述14种小分子代谢产物中的任意一种或一种以上的组合:乳酸、甲羟戊酸、苯乙酰甘氨酸、十四烷二酸、肌苷、苹果酸、β羟基丁酸酯、十一烷酸、甘氨酰亮氨酸、皮质酮、硫胺素、乙酰肉碱、丙氨酰亮氨酸及犬尿氨酸。本发明提供的一种基于代谢组学标志物鉴定溺死的方法基于大量生物标志物综合分析比使用一个或几个生物标志物的方法具有更高的预测能力,通过代谢组学技术检测样本中若干内源性目标生物标志物的含量,结合机器学习算法能够方便准确地判别新鲜溺死尸体,有望为法医学实践中案件侦破提供更有利的帮助。

Figure 202110682946

The invention belongs to the technical fields of forensic science and biomedicine, and in particular relates to the application of metabolites screened based on metabolomics in judging whether an early water corpse is drowned. The marker for judging whether an early water corpse is drowned is characterized in that the marker includes any one or a combination of more than one of the following 14 small molecule metabolites: lactic acid, mevalonic acid, Phenylacetylglycine, tetradecanedioic acid, inosine, malic acid, beta hydroxybutyrate, undecanoic acid, glycylleucine, corticosterone, thiamine, acetylcarnitine, alanylleucine acid and kynurenine. A method for identifying drowning based on metabolomics markers provided by the present invention has a higher predictive ability based on comprehensive analysis of a large number of biomarkers than the method using one or a few biomarkers. The content of endogenous target biomarkers, combined with machine learning algorithms, can easily and accurately identify fresh drowned corpses, which is expected to provide more favorable assistance for case detection in forensic practice.

Figure 202110682946

Description

一种基于代谢组学标志物鉴定溺死的方法及其应用A method for identifying drowning based on metabolomics markers and its application

技术领域technical field

本发明属于法医学和生物医药技术领域,涉及基于代谢组学筛选的代谢物在判别早期水中尸体是否为溺死的应用。The invention belongs to the technical field of forensic science and biomedicine, and relates to the application of metabolites screened based on metabolomics in judging whether an early water corpse is drowned.

背景技术Background technique

溺死是机体落水后,由于溺液阻塞呼吸道及肺泡,阻碍气体交换,造成体内缺乏氧气及二氧化碳潴留而发生的窒息性死亡,又称作淹死。据世界卫生组织统计,全世界每年约有36万人因溺水而死亡,溺死已经成为造成人群意外伤害死亡的第三大原因。在法医学司法实践中,利用其他手段杀死受害人后抛尸入水伪造溺死的案例屡有发生,因此,准确鉴定水中尸体是否为溺死对进一步的案件调查至关重要。Drowning is the suffocation death caused by the lack of oxygen and carbon dioxide retention in the body due to the obstruction of the respiratory tract and alveoli by the drowning liquid after the body falls into the water. According to the statistics of the World Health Organization, about 360,000 people die from drowning every year in the world, and drowning has become the third leading cause of accidental death among people. In the judicial practice of forensic medicine, there are frequent cases of killing victims by other means and throwing their corpses into the water to fake drowning. Therefore, it is very important to accurately identify whether the corpse in the water is drowned or not for further case investigation.

机体在溺水过程中,自然界水体中的硅藻、叶绿素、微生物等可通过受损的肺泡毛细血管进入到血液循环中,进而播散到其他器官中。目前,法医学实践中应用硅藻检验辅助判断溺死,硅藻的检验已经成为溺死鉴定的“金标准”,即在全身多个脏器(如肺、肝、肾等)中检测到硅藻可提示死者生前吸入了大量的溺液,并为鉴定溺死提供参考。但一方面,有学者在非溺死尸体中检测到了硅藻,其可能是与土壤甚至空气中存在的硅藻有关,这种假阳性的硅藻检验结果对硅藻在溺死诊断中的特异性提出了质疑。另一方面,在硅藻含量低或没有硅藻的水中发生的溺水而死亡,应用现有方法也可能在获得的多个脏器检材中检测不到硅藻。由于硅藻的检验结果存在假阳性和假阴性的情况,一定程度上限制了硅藻在溺死诊断中的应用,因此,法医学实践中亟需发现新的具有较高特异性及灵敏性的检验方法和指标来辅助鉴定水中尸体是否为溺死。During the drowning process of the body, diatoms, chlorophyll, microorganisms, etc. in natural water can enter the blood circulation through damaged alveolar capillaries, and then spread to other organs. At present, in the practice of forensic medicine, diatom detection is used to assist in the judgment of drowning. The detection of diatoms has become the "gold standard" for drowning identification, that is, the detection of diatoms in multiple organs (such as lungs, liver, kidneys, etc.) The deceased inhaled a large amount of drowning fluid during his lifetime, which provided a reference for identification of drowning. But on the one hand, some scholars have detected diatoms in non-drowning corpses, which may be related to diatoms existing in the soil or even in the air. questioned. On the other hand, in the case of drowning in water with low or no diatom content, diatoms may not be detected in multiple organ specimens obtained using existing methods. Due to the existence of false positives and false negatives in the test results of diatoms, the application of diatoms in the diagnosis of drowning is limited to a certain extent. Therefore, it is urgent to find new testing methods with high specificity and sensitivity in forensic practice. And indicators to assist in identifying whether the corpse in the water is drowned.

近些年研究表明机体在溺死过程中经历了复杂的过程和机制,包括对溺水的恐惧、潜水反应、上呼吸道反射、吞咽、呕吐、溺液吸入肺泡内随气血交换进入血液、血细胞破裂、电解质紊乱等,而深入了解这些机制的相互作用及多条代谢通路的变化可以帮助我们理解溺死的过程,进而寻找具有较高特异性与灵敏性的溺死辅助诊断指标。随着相应仪器性能的提高和代谢组学方法的发展和建立,使得从整体角度分析机体在短时间内代谢过程的变化成为可能。代谢组学是一种新兴的系统生物学方法,通过检测生物体液或组织中各种小分子量代谢物的含量并进行数学建模分析,可获得含有丰富信息的高维数据,为研究疾病和外部刺激对机体的影响提供了新的方向。通过将先进的分析技术(如:GC-MS、LC-MS及NMR)和高通量生物信息学工具相结合,代谢组学已被尝试解决临床医学及法医学领域的科学问题。与传统的统计分析方法相比,机器学习方法往往更适合于解决涉及大量潜在指标的分类问题。随机森林是众多机器学习方法之一。它使用随机挑选的特征子集来建立多棵决策树,并将这些决策树进行组合形成森林,以预测分类问题的结果。通过随机挑选子集并综合多个决策树的分析结果可以最大限度地减少由偏差和方差引起的错误,同时随机森林应用于高维数据集时表现出了很好的泛化能力和鲁棒性,提示我们基于代谢组学高通量数据的随机森林分析方法可能为判别水中尸体是否为溺死提供新的思路。In recent years, studies have shown that the body has experienced complex processes and mechanisms during drowning, including fear of drowning, diving responses, upper airway reflexes, swallowing, vomiting, drowning liquid inhaled into the alveoli and entering the blood with air and blood exchange, blood cell rupture, Electrolyte disorders, etc., and an in-depth understanding of the interaction of these mechanisms and changes in multiple metabolic pathways can help us understand the process of drowning, and then look for auxiliary diagnostic indicators for drowning with high specificity and sensitivity. With the improvement of the performance of corresponding instruments and the development and establishment of metabolomics methods, it is possible to analyze the changes in the metabolic process of the body in a short period of time from an overall perspective. Metabolomics is an emerging method of systems biology. By detecting the content of various small-molecular-weight metabolites in biological fluids or tissues and performing mathematical modeling and analysis, high-dimensional data with rich information can be obtained for the study of diseases and external The effects of stimuli on the organism provide new directions. By combining advanced analytical techniques (such as GC-MS, LC-MS, and NMR) with high-throughput bioinformatics tools, metabolomics has been attempted to solve scientific problems in the fields of clinical medicine and forensic science. Compared with traditional statistical analysis methods, machine learning methods are often better suited to solving classification problems involving a large number of latent indicators. Random forest is one of many machine learning methods. It uses a randomly selected subset of features to build multiple decision trees and combines these decision trees to form a forest to predict the outcome of a classification problem. By randomly selecting subsets and combining the analysis results of multiple decision trees, the errors caused by bias and variance can be minimized. At the same time, random forests show good generalization ability and robustness when applied to high-dimensional data sets. , suggesting that our random forest analysis method based on metabolomics high-throughput data may provide a new idea for judging whether a corpse in water is drowned.

综上所述,判别早期水中尸体是否为溺死尚缺乏具有较高特异性和灵敏性的指标,基于代谢组学数据筛选特征性指标有望为判别水中尸体是否为溺死提供新的依据。In summary, there is still a lack of indicators with high specificity and sensitivity for judging whether an early water corpse is drowned. Screening characteristic indicators based on metabolomics data is expected to provide a new basis for judging whether a water corpse is drowned.

发明内容Contents of the invention

针对上述问题,本发明提供一种基于代谢组学筛选的标志物鉴定溺死的方法及其应用,本发明证明了代谢组学技术在判别新鲜水中尸体是否为溺死的可行性,并应用筛选出的若干小分子代谢物指标建立数学模型,用于快速准确判别早期水中尸体是否为溺死。In view of the above problems, the present invention provides a method for identifying drowning based on metabolomics screening markers and its application. The present invention proves the feasibility of metabolomics technology in judging whether a corpse in fresh water is drowned, and applies the screened A number of small molecule metabolite indicators were used to establish mathematical models to quickly and accurately determine whether the early water corpses were drowned.

为了实现上述目的,本发明提供了如下技术方案。In order to achieve the above object, the present invention provides the following technical solutions.

本发明提供了一种用于判别早期水中尸体是否为溺死的标志物,其特征在于,所述标志物包括下述14种小分子代谢产物中的任意一种或一种以上的组合:乳酸、甲羟戊酸、苯乙酰甘氨酸、十四烷二酸、肌苷、苹果酸、β羟基丁酸酯、十一烷酸、甘氨酰亮氨酸、皮质酮、硫胺素、乙酰肉碱、丙氨酰亮氨酸及犬尿氨酸。The present invention provides a marker for judging whether an early corpse in water is drowned, characterized in that the marker includes any one or a combination of more than one of the following 14 small molecule metabolites: lactic acid, Mevalonate, Phenylacetylglycine, Tetradecanedioic Acid, Inosine, Malate, Beta Hydroxybutyrate, Undecanoic Acid, Glycylleucine, Corticosterone, Thiamine, Acetylcarnitine, Alanyl Leucine and Kynurenine.

本发明还提供了上述标志物的筛选方法,所述筛选方法具体步骤如下:The present invention also provides a screening method for the above-mentioned markers, and the specific steps of the screening method are as follows:

步骤1、将分别从溺死组大鼠和死后抛尸入水组大鼠尸体提取的心血样本随机分为训练集和验证集;Step 1. Randomly divide the heart blood samples extracted from the dead bodies of the rats in the drowning group and the dead bodies into the water after death into a training set and a verification set;

步骤2、采用超高效液相色谱串联质谱UHPLC-MS/MS系统,进行代谢组学检测,得到各种小分子代谢物的代谢指纹图谱;Step 2, using ultra-high performance liquid chromatography tandem mass spectrometry UHPLC-MS/MS system to perform metabolomics detection, and obtain metabolic fingerprints of various small molecule metabolites;

步骤3、将训练集数据导入到R(version 3.6.1)语言中,借助randomForest程序包以样本分组为因变量,以各种代谢物为自变量,有放回地随机抽取训练集样本建立随机森林分类器模型,最终得到自变量的相对重要程度和模型的预测精度。以平均准确度下降为指标来表示自变量的相对重要程度,该值指将某变量的取值变为随机数时,随机森林分类器模型预测准确率的降低程度,越大表示该变量的重要性越大,以模型在验证集上的受试者工作特征曲线下面积及准确率来表示模型的预测精度,准确率为正确分类的验证样本数量占所有验证样本数量的比例,AUC及准确率越接近于1说明模型越可靠;Step 3. Import the training set data into the R (version 3.6.1) language, use the randomForest program package to take the sample grouping as the dependent variable and various metabolites as the independent variable, and randomly select the training set samples with replacement to establish a random The forest classifier model finally obtains the relative importance of independent variables and the prediction accuracy of the model. The relative importance of the independent variable is expressed by the average accuracy decline as an indicator. This value refers to the degree of reduction in the prediction accuracy of the random forest classifier model when the value of a variable is changed to a random number. The greater the value, the greater the importance of the variable. The greater the accuracy, the prediction accuracy of the model is represented by the area under the receiver operating characteristic curve and the accuracy rate of the model on the verification set. The accuracy rate is the ratio of the number of correctly classified verification samples to the number of all verification samples, AUC and accuracy rate The closer to 1, the more reliable the model is;

步骤4、对以所有代谢物为自变量建立的随机森林分类器模型进行交叉检验,以步骤3中建立的分类器模型交叉检验误差95%可信区间上限为参考,以相同交叉检验误差为标准,拟建立包含更少代谢物的简单分类器模型;Step 4. Cross-check the random forest classifier model established with all metabolites as independent variables, use the upper limit of the 95% confidence interval of the cross-check error of the classifier model established in step 3 as a reference, and use the same cross-check error as a standard , to build a simple classifier model containing fewer metabolites;

步骤5、根据各代谢物在步骤3中所述的随机森林分类器模型中的相对重要性及PLS-DA模型中的变量投影重要性值,挑选并确定14种对判别新鲜水中尸体是否为溺死的标志性代谢物。Step 5. According to the relative importance of each metabolite in the random forest classifier model described in step 3 and the variable projection importance value in the PLS-DA model, select and determine 14 pairs of pairs for judging whether the corpse in fresh water is drowned marked metabolites.

本发明还提供了一种如上所述的14种代谢标志物构建简单分类器模型,并用于判别新鲜水中尸体是否为溺死的模型的构建方法,其特征在于,所述构建方法的具体步骤如下:The present invention also provides a method for constructing a simple classifier model for the above-mentioned 14 metabolic markers, and for judging whether a dead body in fresh water is drowned. It is characterized in that the specific steps of the construction method are as follows:

步骤1、提取训练集每个样本中14种标志物含量及死因分组数据并导入到R中,借助randomForest程序包以样本分组为因变量,以14种标志物为自变量,有放回地随机抽取训练集样本建立随机森林简单分类器模型;Step 1. Extract the content of 14 markers and the grouping data of the cause of death in each sample of the training set and import them into R. With the help of the randomForest package, the sample grouping is used as the dependent variable, and the 14 markers are used as the independent variable. Extract training set samples to establish a random forest simple classifier model;

步骤2、将验证集样本中对应的14种标志物含量信息输入到构建的随机森林简单分类器模型中,得到该模型对各样本分组的预测结果,并进行分析。Step 2. Input the content information of the corresponding 14 markers in the verification set samples into the constructed random forest simple classifier model, obtain the prediction results of the model for each sample group, and analyze them.

与现有技术相比本发明的有益效果。Compared with the prior art, the present invention has beneficial effects.

(1)本发明通过在自然河水(淡水)中建立动物模型,模拟水中环境发现的人体尸体,提供了一种基于代谢组学标志物鉴定溺死的方法,有利于成果的转化。(1) The present invention establishes animal models in natural river water (fresh water) to simulate human corpses found in the water environment, and provides a method for identifying drowning based on metabolomics markers, which is conducive to the transformation of results.

(2)本发明提供了用于判别早期水中尸体是否为溺死的标志物,所述14种标志物具有与水中尸体死亡原因密切相关、受死后变化影响相对较小、平行样本之间差异性小、在随机森林机器学习算法上对模型预测精度贡献度高等特点。(2) The present invention provides markers for judging whether an early corpse in water is drowned. The 14 markers are closely related to the cause of death of corpses in water, are relatively less affected by postmortem changes, and have differences among parallel samples. It has the characteristics of small size and high contribution to model prediction accuracy in the random forest machine learning algorithm.

(3)本发明提供的一种基于代谢组学标志物鉴定溺死的方法,代谢组学技术能够同时检测多种代谢物,这些代谢物能够解释复杂多变的过程,帮助人们深入了解溺死过程及死后早期降解过程中的微观变化。(3) The present invention provides a method for identifying drowning based on metabolomics markers. Metabolomics technology can detect multiple metabolites at the same time. Microscopic changes during early postmortem degradation.

(4)本发明提供的一种基于代谢组学标志物鉴定溺死的方法基于大量生物标志物综合分析比使用一个或几个生物标志物的方法具有更高的预测能力,通过代谢组学技术检测样品中若干内源性目标生物标志物的含量,结合机器学习算法能够方便准确地判别新鲜溺死尸体,有望为法医学实践中案件侦破提供更有利的帮助。(4) A method for identifying drowning based on metabolomics markers provided by the present invention is based on a comprehensive analysis of a large number of biomarkers, which has a higher predictive ability than the method using one or a few biomarkers. The content of several endogenous target biomarkers in the sample, combined with machine learning algorithms, can conveniently and accurately identify fresh drowned corpses, which is expected to provide more favorable assistance for case detection in forensic practice.

附图说明Description of drawings

图1是训练集中不同分组的尸体心血样品中代谢特征观察结果。Figure 1 shows the observation results of metabolic characteristics in different groups of cadaver heart blood samples in the training set.

图2是以训练集样本建立的PLS-DA模型。Figure 2 is the PLS-DA model built with training set samples.

图3是对PLS-DA模型进行了200次置换检验结果。Figure 3 shows the results of 200 permutation tests on the PLS-DA model.

图4是以所有代谢物建立的随机森林分类器模型在验证集中的ROC曲线。Figure 4 is the ROC curve of the random forest classifier model built for all metabolites in the validation set.

图5是对初步建立的随机森林分类器模型进行5重10折交叉检验的结果。Figure 5 is the result of the 5-fold 10-fold cross-validation of the initially established random forest classifier model.

图6是是14种标志物在早期水中尸体中不同死因间的含量差异与变化箱式图。Figure 6 is a box diagram of the content differences and changes of 14 markers among different causes of death in early water corpses.

图7是基于14种标志物含量差异构建的随机森林分类器模型在验证集中的ROC曲线。Figure 7 is the ROC curve in the verification set of the random forest classifier model constructed based on the differences in the contents of 14 markers.

具体实施方式Detailed ways

下面结合附图和实施例对本发明的具体实施方式作进一步详细描述。以下实施例详细说明了本发明,但不用于限制本发明的范围。本实施例采用常规实验技术,这些均是本技术领域人员所熟悉的,可以按照本实施例使用材料厂商所提供的说明书即可进行。The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and examples. The following examples illustrate the invention in detail, but are not intended to limit the scope of the invention. This embodiment adopts conventional experimental techniques, which are familiar to those skilled in the art, and can be carried out according to the instructions provided by the material manufacturer according to this embodiment.

1.研究对象及分组。1. Research objects and groups.

训练集:取雄性SD大鼠80只,随机分为溺死组(D组)和死后入水组(PS组),每组40只。溺死组大鼠浸入河水中溺死,死后入水组大鼠采取CO2气体窒息处死后浸入河水中,实验期间河水温度为20-25℃。于死后不同的时间点(0h、6h、12h、18h及24h),每组取8只大鼠尸体,提取心血(约200μL),取材后将样本立即放入液氮中,随后-80℃保存备检。Training set: 80 male SD rats were randomly divided into a drowning group (group D) and a postmortem submersion group (group PS), with 40 rats in each group. The rats in the drowning group were immersed in river water and drowned, and the rats in the water-entrance group were killed by asphyxiation with CO 2 gas and then immersed in river water. The temperature of the river water was 20-25°C during the experiment. At different time points after death (0h, 6h, 12h, 18h and 24h), 8 dead rats were taken from each group, and heart blood (about 200 μL) was extracted. Save for future inspection.

验证集:取雄性SD大鼠20只,随机分为溺死组和死后入水组,每组10只。分别按照上述方法处死后浸入河水中,于死后0h、6h、12h、18h及24h,从每组各取2只大鼠提取心血,之后将样本置于-80℃保存备检。Verification set: 20 male SD rats were randomly divided into a drowning group and a water submersion group after death, with 10 rats in each group. They were sacrificed according to the above method and immersed in river water. At 0h, 6h, 12h, 18h and 24h after death, 2 rats in each group were taken to extract heart blood, and then the samples were stored at -80°C for future inspection.

2.代谢物提取:取心血样品100μL置于新EP管中,加入400μL质谱级甲醇沉淀蛋白,涡旋震荡,冰浴静置5 min,15000 g、4℃离心10 min,取100μL的上清液加质谱级水稀释至甲醇含量为53%,并置于离心管中15000g、4℃离心10min,收集上清,上机进行分析。2. Metabolite extraction: Take 100 μL of heart blood sample and put it in a new EP tube, add 400 μL of mass spectrometry grade methanol to precipitate protein, vortex, let stand in ice bath for 5 min, centrifuge at 15000 g, 4°C for 10 min, and take 100 μL of supernatant The solution was diluted with mass spectrometry-grade water to a methanol content of 53%, and placed in a centrifuge tube at 15,000 g, centrifuged at 4°C for 10 min, and the supernatant was collected for analysis on the machine.

从每个实验样本中取等体积样本混匀作为QC样本,用于平衡色谱-质谱系统和监测仪器状态,在整个实验过程中对系统稳定性进行评价。以53%甲醇水溶液为blank样本,处理过程与实验样本相同。Take an equal volume sample from each experimental sample and mix it as a QC sample, which is used to balance the chromatography-mass spectrometry system and monitor the status of the instrument, and evaluate the system stability during the entire experiment. Take 53% methanol aqueous solution as the blank sample, and the treatment process is the same as that of the experimental sample.

3.代谢谱检测。3. Metabolic profile detection.

使用超高效液相色谱串联质谱(UHPLC-MS/MS)系统检测样品中各种小分子代谢物的含量。UHPLC-MS/MS system was used to detect the content of various small molecule metabolites in the samples.

LC-MS/MS分析使用Vanquish UHPLC系统(Thermo Fisher)和Orbitrap QExactive系列质谱仪(Thermo Fisher)。LC-MS/MS analysis was performed using a Vanquish UHPLC system (Thermo Fisher) and an Orbitrap QExactive series mass spectrometer (Thermo Fisher).

(1)仪器条件。(1) Instrument conditions.

色谱条件:Chromatographic conditions:

色谱柱:Hyperil Gold column(C18)Chromatographic column: Hyperil Gold column (C18)

柱温:40 ℃Column temperature: 40°C

流速:0.2 mL/minFlow rate: 0.2 mL/min

正离子模式:流动相 A:0.1%甲酸Positive ion mode: mobile phase A: 0.1% formic acid

流动相 B:甲醇 Mobile Phase B: Methanol

负离子模式:流动相 A:5mM醋酸铵,pH 9.0Negative ion mode: mobile phase A: 5mM ammonium acetate, pH 9.0

流动相 B:甲醇。 Mobile Phase B: Methanol.

色谱梯度洗脱程序如表1所示。The chromatographic gradient elution program is shown in Table 1.

表1. 色谱梯度洗脱程序Table 1. Chromatographic gradient elution program

Figure 242367DEST_PATH_IMAGE001
Figure 242367DEST_PATH_IMAGE001
.

质谱条件:扫描范围选择m/z 100-1500;ESI源的设置如下:喷雾电压:3.2 kV; 鞘气流速:35 arb; 辅助气流速:10 arb; 离子传输管温度:320 ℃;极性: 正离子模式;负离子模式;MS/MS二级扫描为数据依赖性扫描。Mass spectrometry conditions: scan range selection m/z 100-1500; ESI source settings are as follows: spray voltage: 3.2 kV; sheath gas flow rate: 35 arb; auxiliary gas flow rate: 10 arb; ion transfer tube temperature: 320 ℃; polarity: Positive ion mode; negative ion mode; MS/MS secondary scan is a data-dependent scan.

代谢物鉴定:将下机数据(.raw)文件导入Compound Discoverer 3.1(CD)搜库软件中,进行保留时间、质荷比等参数的简单筛选,然后对不同样本根据保留时间偏差0.2min和质量偏差5ppm 进行峰对齐,使鉴定更准确,随后根据设置的质量偏差5ppm、信号强度偏差30%、信噪比3、最小信号强度100000、加和离子等信息进行峰提取,同时对峰面积进行定量,再整合目标离子,然后通过分子离子峰和碎片离子进行分子式的预测并与mzCloud、mzVault和MassList数据库进行比对,用blank样本去除背景离子,并对定量结果进行归一化,最后得到各种代谢物的定性定量结果。Metabolite identification: Import the off-machine data (.raw) file into Compound Discoverer 3.1 (CD) library search software, perform simple screening of parameters such as retention time and mass-to-charge ratio, and then analyze different samples according to the retention time deviation of 0.2min and mass Perform peak alignment with a deviation of 5ppm to make the identification more accurate, and then perform peak extraction according to the set mass deviation of 5ppm, signal intensity deviation of 30%, signal-to-noise ratio of 3, minimum signal intensity of 100000, and summation ions, etc., and quantify the peak area at the same time , and then integrate the target ions, then predict the molecular formula through the molecular ion peak and fragment ion and compare it with the mzCloud, mzVault and MassList databases, use the blank sample to remove the background ions, and normalize the quantitative results, and finally get various Qualitative and quantitative results of metabolites.

4.新鲜水中尸体心血样本代谢谱差异。4. Differences in metabolic profiles of cadaver blood samples in fresh water.

经过代谢谱检测及数据预处理,共检测到601种代谢物。首先利用鉴定到的所有代谢物进行主成分分析(Principal Component Analysis,PCA),以观察不同死因尸体心血样品中代谢特征。After metabolic profile detection and data preprocessing, a total of 601 metabolites were detected. Firstly, Principal Component Analysis (PCA) was performed using all the identified metabolites to observe the metabolic characteristics in heart blood samples from cadavers with different causes of death.

结果如图1所示,QC样本聚集良好,表明仪器稳定,检测结果可用于深入分析。在得分图中各时间组有明显的分离趋势说明不同死后浸没时间(Postmortem SubmersionInterval,PMSI)的心血样品中代谢谱具有明显的差异。但不同死因间区分效果不理想,这可能是因为不同死因间的代谢水平差异被不同时间的差异所掩盖,需进一步探索不同死因间的代谢谱差异。The results are shown in Figure 1. The QC samples were well aggregated, indicating that the instrument was stable and the test results could be used for in-depth analysis. In the score chart, there is an obvious separation trend in each time group, indicating that the metabolic profiles of heart blood samples with different postmortem submersion intervals (Postmortem Submersion Interval, PMSI) have obvious differences. However, the effect of distinguishing between different causes of death is not ideal, which may be because the differences in metabolic levels between different causes of death are covered by the differences at different times. Further exploration of the differences in metabolic profiles between different causes of death is needed.

随后以训练集样本建立偏最小二乘法判别分析(Partial Least SquaresDiscriminant Analysis,PLS-DA)模型,如图2所示,进一步区分不同死因,结果显示两种死因明显分离,且模型具有较好的解释性和预测性(R2Y=0.939,Q2Y=0.834),说明不同死因的新鲜水中尸体具有明显的差异。为验证该模型是否存在过拟合现象,本发明专利对模型进行了置换检验(200次),如图3所示,结果表明Q2截距为负,该PLS-DA模型未过拟合。同时得到了各代谢物的VIP值,其可以衡量各代谢物差异对各组样本分类判别的影响强度和解释能力。Then, a Partial Least Squares Discriminant Analysis (PLS-DA) model was established using the training set samples, as shown in Figure 2, to further distinguish different causes of death. The results showed that the two causes of death were clearly separated, and the model had a better explanation and predictability (R2Y=0.939, Q2Y=0.834), indicating that fresh water corpses with different causes of death have significant differences. In order to verify whether the model has overfitting phenomenon, the patent of the present invention has carried out permutation tests on the model (200 times), as shown in Figure 3, the result shows that the Q2 intercept is negative, and the PLS-DA model is not overfitted. At the same time, the VIP value of each metabolite was obtained, which can measure the impact strength and explanatory power of each metabolite difference on the classification and discrimination of samples in each group.

5. 新鲜溺死尸体判别模型的建立与验证。5. Establishment and validation of a discriminant model for freshly drowned corpses.

以上的结果表明溺死及死后抛尸尸体在死后早期(24h内)代谢谱始终存在差异性。猜想这种差异能够为判别新鲜水中尸体是否为溺死提供重要的参考信息。因此,根据多种代谢物的含量差异并结合随机森林算法建立溺死尸体判别模型,并使用验证集样本对该模型进行验证。在训练集中以溺死及死后抛尸为因变量,以各种代谢物为自变量,有放回地随机抽取训练集样本做随机森林分类,最终得到模型的误差及各自变量的相对重要程度排序。模型的误差以袋外数据(Out-of-bag,OOB)误差表示。OOB是指对训练集数据进行随机有放回地抽样时,未被抽到的那部分数据(约占1/3)。OOB误差是指用抽取样本构建的模型在袋外数据里进行验证时,模型错误识别的个数占所有袋外数据个数的百分比。代谢物相对重要程度以模型平均准确度下降程度评估。为避免模型不同参数设置带来的影响,以默认参数建立模型并验证,结果表明模型在验证集中AUC为1,验证集20个样品中19个样品能够被准确鉴别死因,该分类器模型预测准确率可达95%,如图4和表2所示。说明代谢组学可用于鉴别溺死尸体,对法医学实践具有重要的参考价值。The above results show that there are always differences in metabolic profiles in the early postmortem period (within 24 hours) between drowned and postmortem corpses. It is conjectured that this difference can provide important reference information for judging whether the corpse in fresh water is drowned. Therefore, based on the content differences of various metabolites and combined with the random forest algorithm, a drowned corpse discrimination model was established, and the model was verified using the validation set samples. In the training set, drowning and post-mortem dumping are used as dependent variables, and various metabolites are used as independent variables, and the training set samples are randomly selected with replacement for random forest classification, and finally the error of the model and the relative importance of each variable are sorted . The error of the model is represented by Out-of-bag (OOB) error. OOB refers to the part of the data (about 1/3) that is not sampled when the training set data is randomly sampled with replacement. The OOB error refers to the percentage of the number of model misidentifications to the number of all out-of-bag data when the model constructed by drawing samples is verified in the out-of-bag data. The relative importance of metabolites was assessed by the decrease in the average accuracy of the model. In order to avoid the impact of different parameter settings of the model, the model was established and verified with default parameters. The results showed that the AUC of the model in the verification set was 1, and 19 samples out of 20 samples in the verification set could accurately identify the cause of death. The classifier model predicted accurately The rate can reach 95%, as shown in Figure 4 and Table 2. It shows that metabolomics can be used to identify drowned corpses, which has important reference value for forensic practice.

表2以全部代谢物建立的死因判别模型对验证集样本分组预测结果Table 2 Prediction results of the grouping of validation set samples by the cause of death discriminant model established with all metabolites

Figure 791160DEST_PATH_IMAGE002
Figure 791160DEST_PATH_IMAGE002
.

6.判别溺死尸体的生物标志物筛选、模型建立与验证。6. Biomarker screening, model establishment and verification for identifying drowned corpses.

通过分析整个代谢谱获得了大量有效的信息,并证明了代谢组学在新鲜溺死尸体判别中的可行性,但研究整个代谢谱需要大量的成本,以上结果不便于实践中应用。接下来进一步筛选对于判别溺死具有重要作用的指标体系并以此构建数学模型,以便于实际应用。对之前建立的随机森林死因判别模型进行5重10折交叉检验,图5所示,结果显示该模型的预测误差随着构建模型变量(代谢物)数目的增加呈现明显下降后略微波动的趋势。经交叉检验,发现以14种代谢物构建模型时,误差与以所有代谢物构建的模型误差不具有统计学差异,因此根据各代谢物的重要性及VIP值筛选对区分不同死因有重要意义的14种潜在生物标志物,见表3。根据14种潜在标志物的箱式图可发现,一些代谢物在不同死因的死亡过程中表现出明显的差异,而另一些代谢物在死后早期降解过程中表现出不同的代谢模式,如图6所示。在死亡过程中,溺死组皮质酮水平明显升高反映出机体下丘脑-垂体-肾上腺轴激活,产生应激反应。此外,溺水过程中机体冷觉感受器对皮肤温度突然下降产生动态反应,伴随肌肉张力增加,从而增加了代谢率,在水中剧烈挣扎进一步增加了氧气供应中断情况下的耗氧量,导致溺死组乳酸含量较高。而大多数代谢物在两种死因间表现出不同的死后变化模式,可能是因为在早期降解过程中机体内微生物组成不同,不同的微生物对底物有不同的偏好。A large amount of effective information has been obtained by analyzing the entire metabolic profile, and the feasibility of metabolomics in the identification of freshly drowned corpses has been proved. However, the study of the entire metabolic profile requires a lot of cost, and the above results are not convenient for practical application. Next, further screen the index system that plays an important role in judging drowning and build a mathematical model based on it for practical application. A 5-fold 10-fold cross-test was carried out on the previously established random forest cause of death discrimination model, as shown in Figure 5. The results showed that the prediction error of the model showed a trend of decreasing obviously and then fluctuating slightly with the increase of the number of model variables (metabolites). After cross-checking, it was found that when the model was built with 14 metabolites, the error was not statistically different from that of the model built with all metabolites. Therefore, screening according to the importance and VIP value of each metabolite is of great significance for distinguishing different causes of death. The 14 potential biomarkers are listed in Table 3. According to the box plot of 14 potential markers, it can be found that some metabolites show obvious differences in the death process of different causes of death, while other metabolites show different metabolic patterns in the early postmortem degradation process, as shown in Fig. 6. During the death process, the level of corticosterone in the drowning group increased significantly, reflecting the activation of the hypothalamus-pituitary-adrenal axis in the body, resulting in a stress response. In addition, during the drowning process, the body's cold sensory receptors dynamically respond to the sudden drop in skin temperature, accompanied by an increase in muscle tension, which increases the metabolic rate, and severe struggle in the water further increases the oxygen consumption in the event of interruption of oxygen supply, resulting in lactic acid in the drowning group. The content is higher. Most of the metabolites showed different postmortem change patterns between the two causes of death, which may be due to the different composition of microorganisms in the body during the early degradation process, and different microorganisms have different preferences for substrates.

表3 14种生物标志物Table 3 14 biomarkers

Figure DEST_PATH_IMAGE001
Figure DEST_PATH_IMAGE001
.

筛选到的14种潜在生物标志物包含了丰富的信息,基于这14种代谢物在不同死因间的差异重新建立随机森林模型,并检验其有效性。结果表明以14种潜在生物标志物建立的简化模型准确性为95%,见表4,与以全部代谢物建立的模型相一致,AUC=0.95,如图7所示,模型简便且效果优异。The 14 potential biomarkers screened contained rich information, and the random forest model was rebuilt based on the differences of these 14 metabolites among different causes of death, and its validity was tested. The results show that the accuracy of the simplified model established with 14 potential biomarkers is 95%, as shown in Table 4, which is consistent with the model established with all metabolites, AUC=0.95, as shown in Figure 7, the model is simple and effective.

表4以14种生物标志物建立的死因判别模型对验证集样本分组预测结果Table 4 Prediction results of the grouping of validation set samples by the cause of death discriminant model established with 14 biomarkers

Figure 848033DEST_PATH_IMAGE004
Figure 848033DEST_PATH_IMAGE004
.

通过对早期水中尸体心血样本进行代谢组学分析,本发明证明了代谢组学技术在溺死尸体判别方面的可行性,进一步筛选了具有重要作用的生物标志物并验证了其准确性,这些结果能够为法医学实践提供参考,为进一步转化奠定了基础。By performing metabolomics analysis on the heart blood samples of early water cadavers, the present invention proves the feasibility of metabolomics technology in the discrimination of drowned corpses, and further screens important biomarkers and verifies their accuracy. These results can It provides a reference for forensic practice and lays the foundation for further transformation.

Claims (1)

1. A screening method for a marker for judging whether an early underwater corpse is drowned is characterized by comprising the following specific steps:
step 1, randomly dividing heart blood samples extracted from dead rats in a drowned group and dead rats in a post-mortem and water-entering group at 0h, 6h, 12h, 18h and 24h into a training set and a verification set respectively;
step 2, performing cardioblood metabonomics detection by adopting an ultra-high performance liquid chromatography tandem mass spectrometry UHPLC-MS/MS system to obtain metabolic fingerprint spectrums of various micromolecular metabolites;
step 3, importing the training set data into R language version 3.6.1, taking sample groups as dependent variables by means of a random forest program package, taking various metabolites as independent variables, and randomly extracting training set samples in a release manner to establish a random forest classifier model, so as to finally obtain the relative importance degree of the independent variables and the prediction precision of the model;
the relative importance degree of the independent variable is expressed by taking average accuracy reduction as an index, the value is the reduction degree of the prediction accuracy of a random forest classifier model when the value of a variable is changed into a random number, the greater the importance of the variable is, the prediction accuracy of the model is expressed by the area and the accuracy of the model under the working characteristic curve of a subject on a verification set, the accuracy is the proportion of the number of correctly classified verification samples to the number of all verification samples, and the more the AUC and the accuracy are close to 1, the more reliable the model is;
step 4, cross-checking the random forest classifier model established by taking all metabolites as independent variables, establishing a simple classifier model containing less metabolites by taking the upper limit of the 95% credible interval of the cross-checking error of the classifier model established in the step 3 as reference and the same cross-checking error as standard;
step 5, selecting and determining 14 marked metabolites for judging whether the carcasses in the water are drowned according to the relative importance of the metabolites in the random forest classifier model in the step 3 and the variable projection importance value in the PLS-DA model;
the marker metabolites are: lactic acid, mevalonic acid, phenylacetylglycine, tetradecanedioic acid, inosine, malic acid, beta-hydroxybutyrate, undecanoic acid, glycylleucine, corticosterone, thiamine, acetyl-carnitine, alanylleucine and kynurenine.
CN202110682946.0A 2021-06-18 2021-06-18 A method for identifying drowning based on metabolomics markers and its application Active CN113341044B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110682946.0A CN113341044B (en) 2021-06-18 2021-06-18 A method for identifying drowning based on metabolomics markers and its application

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110682946.0A CN113341044B (en) 2021-06-18 2021-06-18 A method for identifying drowning based on metabolomics markers and its application

Publications (2)

Publication Number Publication Date
CN113341044A CN113341044A (en) 2021-09-03
CN113341044B true CN113341044B (en) 2022-11-18

Family

ID=77477718

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110682946.0A Active CN113341044B (en) 2021-06-18 2021-06-18 A method for identifying drowning based on metabolomics markers and its application

Country Status (1)

Country Link
CN (1) CN113341044B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118090932A (en) * 2022-11-25 2024-05-28 中国科学院深圳先进技术研究院 Alzheimer's disease markers based on fecal metabolites and their applications

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102473208A (en) * 2009-07-02 2012-05-23 百奥科瑞茨生命科学公司 Method for normalization in metabolomics analysis methods with endogenous reference metabolites

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2249161B1 (en) * 2009-05-05 2020-05-13 InfanDx AG Method of diagnosing asphyxia

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102473208A (en) * 2009-07-02 2012-05-23 百奥科瑞茨生命科学公司 Method for normalization in metabolomics analysis methods with endogenous reference metabolites

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Postmortem biochemistry: Current applications;S.L.Belsey 等;《Journal of Forensic and Legal Medicine》;20160731;第41卷;第49-57页 *
SD大鼠血清心肌损伤标志物的死后变化及法医学应用;高军;《中国优秀硕士论文全文数据库》;20171015(第10期);全文 *
基于高通量测序的平均核苷酸一致性在1株弗朗西斯菌鉴定中的应用;张磊 等;《临床检验杂志》;20170731;第35卷(第7期);第499-502页 *
溺死诊断的研究进展;张巍 等;《法医学杂志》;20170228;第33卷(第1期);第42-47页 *

Also Published As

Publication number Publication date
CN113341044A (en) 2021-09-03

Similar Documents

Publication Publication Date Title
Turi et al. A review of metabolomics approaches and their application in identifying causal pathways of childhood asthma
CN105209909B (en) Biomarker relevant to renal function and its application method
JP6021187B2 (en) Metabolic biomarkers of autism
US20170358434A1 (en) Big-data analyzing method and mass spectrometric system using the same method
CN104204798A (en) Biomarkers for bladder cancer and methods using the same
CN112183616B (en) Diagnostic marker and kit for diagnosis of glioma, screening method and construction method of glioma diagnostic model
CN113341044B (en) A method for identifying drowning based on metabolomics markers and its application
CN109946411B (en) Biomarkers for the diagnosis of ossification of the ligamentum flavum of the thoracic spine and their screening methods
CN118671346B (en) Metabolic marker for gastric cancer diagnosis and application thereof
CN104297442B (en) The application in terms of quickly detection early stage cardiac toxicity of the endogenous small-molecule substance
CN103336914B (en) Method and the device of biomarker assembled in a kind of extraction
Anas Kamleh et al. Metabolic profiling in disease diagnosis, toxicology and personalized healthcare
WO2021084251A1 (en) Method to determine cognitive impairment
CN109946467B (en) A biomarker for the diagnosis of ossification of the ligamentum flavum of the thoracic spine
CN106706692A (en) Pesticide toxicity evaluation method based on metabonomical technique
Firdous et al. Integrating HRMAS-NMR Data and Machine Learning-Assisted Profiling of Metabolite Fluxes to Classify Low-and High-Grade Gliomas
Pelit et al. Identification of volatile biomarkers in exhaled breath by polythiophene solid phase microextraction fiber for disease diagnosis using GC–MS
US20240142455A1 (en) Method of diagnosing early-stage non-small cell lung cancer
CN113341043B (en) Method for deducing early death and post-death inundation time of corpse in water based on metabonomics marker and application thereof
CN105044343B (en) A method for constructing a diagnostic model of esophageal squamous cell carcinoma, the resulting diagnostic model, and a method for using the model
Gowda et al. Metabolomics
Yang et al. Metabolomic Profiling of Serum for Large Cohort Oral Squamous Cell Carcinoma Diagnosis
CN119673292A (en) Prognosis analysis marker, prognosis analysis system and application of diffuse large B cell lymphoma patient
CN118230955A (en) Model training method for depression detection, detection system, program, and storage medium
Borges Coutinho Gallo Biomarkers Related to the Hypotheses of the Pathophysiology of Schizophrenia Spectrum Disorders

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant