CN110047559B - Calculation method, system, device and medium for binding free energy of protein and drug - Google Patents
Calculation method, system, device and medium for binding free energy of protein and drug Download PDFInfo
- Publication number
- CN110047559B CN110047559B CN201910167720.XA CN201910167720A CN110047559B CN 110047559 B CN110047559 B CN 110047559B CN 201910167720 A CN201910167720 A CN 201910167720A CN 110047559 B CN110047559 B CN 110047559B
- Authority
- CN
- China
- Prior art keywords
- protein
- energy
- drug
- free energy
- term
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
Landscapes
- Bioinformatics & Cheminformatics (AREA)
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Genetics & Genomics (AREA)
- Biotechnology (AREA)
- Biophysics (AREA)
- Chemical & Material Sciences (AREA)
- Molecular Biology (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Bioinformatics & Computational Biology (AREA)
- Analytical Chemistry (AREA)
- Evolutionary Biology (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Theoretical Computer Science (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Abstract
本公开公开了蛋白质与药物结合自由能的计算方法、系统、设备及介质,构建蛋白质与药物的结合自由能的函数,所述蛋白质与药物结合自由能的函数为蛋白质与药物复合物的结合自由能与静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵的函数关系;采集训练集中每个蛋白质与药物复合物的各能项,利用训练集中每个蛋白质与药物复合物的各能量项和每个蛋白质与药物的实验值,对蛋白质与药物的结合自由能的函数的进行多元线性拟合,得到蛋白质与药物的结合自由能的函数;将待测试的蛋白质与药物复合物各能量项输入到训练好的蛋白质与药物的结合自由能的函数中,输出蛋白质与药物的结合自由能。
The present disclosure discloses a calculation method, system, equipment and medium for the binding free energy of proteins and drugs, and constructs a function of the binding free energy of proteins and drugs, and the function of the binding free energy of proteins and drugs is the binding free energy of protein and drug complexes. The functional relationship between energy and electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy; collect the energy terms of each protein-drug complex in the training set, use each protein in the training set Each energy term of the complex with the drug and the experimental value of each protein and drug, perform multivariate linear fitting on the function of the binding free energy of the protein and the drug to obtain the function of the binding free energy of the protein and the drug; The energy terms of the protein-drug complex are input into the trained function of the binding free energy of the protein and the drug, and the binding free energy of the protein and the drug is output.
Description
技术领域technical field
本公开涉及一种蛋白质与药物结合自由能的计算方法、系统、设备及介质。The present disclosure relates to a method, system, device and medium for calculating the binding free energy of proteins and drugs.
背景技术Background technique
本部分的陈述仅仅是提到了与本公开相关的背景技术,并不必然构成现有技术。The statements in this section merely mention background related to the present disclosure and do not necessarily constitute prior art.
生物分子之间的相互作用,如受体-配体、抗原-抗体、DNA-蛋白、糖-凝集素、RNA-核糖体等,在很大程度上决定着生物的生理功能,如自我复制、新陈代谢和信息处理等。因此,研究生物分子之间的识别原理在生物学中起着至关重要的作用。其中,蛋白质与药物的结合是药物设计和药物相互作用的核心,同时,它在生物医学中的实验领域和计算领域中具有直接的医学研究意义和应用价值。特别是在虚拟筛选中,通过研究靶标蛋白与药物之间的相互作用方式,利用已有的方法对药物的结合能力进行评价,最终挑选出结合能力较强的药物。The interactions between biomolecules, such as receptor-ligand, antigen-antibody, DNA-protein, sugar-lectin, RNA-ribosome, etc., largely determine the physiological functions of organisms, such as self-replication, metabolism and information processing, etc. Therefore, studying the recognition principle between biomolecules plays a crucial role in biology. Among them, the combination of protein and drug is the core of drug design and drug interaction. At the same time, it has direct medical research significance and application value in the experimental field and computing field in biomedicine. Especially in virtual screening, by studying the interaction between the target protein and the drug, the existing methods are used to evaluate the binding ability of the drug, and finally the drug with stronger binding ability is selected.
据发明人了解,一般而言,我们用结合自由能来描述两者之间的结合亲和力。目前,能够准确计算结合自由能的方法是自由能微扰(FEP)和热力学积分(TI),然而这两中方法所带来的计算成本往往令人望而却步,而且,由于上述方法需要模拟复合物的多种非物理中间态,导致所计算的结果很难达到收敛状态。线性相互作用能(LIE)方法也是一种经典的方法,它是利用相互作用能和可调参数来估算结合自由能,但这种方法仅适用于具有类似相互作用机制的复合物,因此它的普适性比较差,且计算成本非常的高。As far as the inventors know, in general, we use the binding free energy to describe the binding affinity between the two. At present, the methods that can accurately calculate the binding free energy are free energy perturbation (FEP) and thermodynamic integration (TI). There are many non-physical intermediate states of , which makes it difficult for the calculated results to reach a convergent state. The linear interaction energy (LIE) method is also a classical method, which uses the interaction energy and tunable parameters to estimate the binding free energy, but this method is only suitable for complexes with similar interaction mechanisms, so its The universality is relatively poor, and the computational cost is very high.
另一种较常见的方法是分子力学/泊松-波尔兹曼表面积(MM/PBSA),MM/PBSA方法是一种结合了分子力学与连续介质模型计算自由能的方法,与FEP和TI方法相比,这个方法仅仅需要对靶点和药物结合和非结合两个状态进行相空间遍历,而不需要考虑中间态,因此计算量大大减少,所以较为常用。在MM/PBSA中,通常用正则振动(Nmode)方法来近似计算熵变。该方法采用谐振子近似,这种近似将平移、转动和振动视为非耦合的,从而计算平移、转动和振动的贡献。由于计算量巨大,因此非谐贡献往往被忽略。Another more common method is Molecular Mechanics/Poisson-Boltzmann Surface Area (MM/PBSA). Compared with the method, this method only needs to traverse the phase space of the two states of target and drug binding and non-binding, and does not need to consider the intermediate state, so the calculation amount is greatly reduced, so it is more commonly used. In MM/PBSA, the normalized vibration (Nmode) method is usually used to approximate the entropy change. The method employs a harmonic oscillator approximation, which treats translation, rotation, and vibration as uncoupled, and calculates the contributions of translation, rotation, and vibration. Due to the huge computational cost, the anharmonic contributions are often ignored.
此外,有研究发现,在相同的轨迹中,振动熵的变化甚至达到5kcal/mol[Kuhn,B.;Kollman,P.A.Binding of a Diverse Set of Ligands to Avidin and Streptavidin:AnAccurate Quantitative Prediction of Their Relative Affinities by aCombination of Molecular Mechanics and Continuum SolventModels.J.Med.Chem.2000,43,3786-3791.]。因此,很多研究人员选择忽略熵对结合自由能的贡献,从而使得最终结果往往更加不令人信服。In addition, studies have found that in the same trajectory, the change in vibrational entropy even reaches 5 kcal/mol [Kuhn, B.; Kollman, P.A. Binding of a Diverse Set of Ligands to Avidin and Streptavidin: An Accurate Quantitative Prediction of Their Relative Affinities by aCombination of Molecular Mechanics and Continuum Solvent Models. J. Med. Chem. 2000, 43, 3786-3791.]. Therefore, many researchers choose to ignore the contribution of entropy to the binding free energy, making the end result often less convincing.
发明内容SUMMARY OF THE INVENTION
为了解决现有技术的不足,本公开提供了蛋白质与药物结合自由能的计算方法、系统、设备及介质;In order to solve the deficiencies of the prior art, the present disclosure provides a method, system, device and medium for calculating the binding free energy of proteins and drugs;
第一方面,本公开提供了蛋白质与药物结合自由能计算方法;In a first aspect, the present disclosure provides a method for calculating the binding free energy of proteins and drugs;
蛋白质与药物结合自由能计算方法,包括:Methods for calculating the free energy of protein-drug binding, including:
构建蛋白质与药物的结合自由能的函数,所述蛋白质与药物结合自由能的函数为蛋白质与药物复合物的结合自由能与静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵的函数关系;Construct the function of the binding free energy of the protein and the drug, and the function of the binding free energy of the protein and the drug is the binding free energy of the protein and the drug complex and the electrostatic term, the polar solvation energy term, the van der Waals term, the non-polar solvation term The functional relationship between energy and interaction entropy;
采集训练集中每个蛋白质与药物复合物的各能项,各能量项包括:静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵;利用训练集中每个蛋白质与药物复合物的各能量项和每个蛋白质与药物的实验值,对蛋白质与药物的结合自由能的函数的进行多元线性拟合,得到蛋白质与药物的结合自由能的函数;Collect the energy terms of each protein-drug complex in the training set, including: electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy; use each protein in the training set Each energy term of the complex with the drug and the experimental value of each protein and drug, perform multivariate linear fitting on the function of the binding free energy of the protein and the drug, and obtain the function of the binding free energy of the protein and the drug;
将待计算的蛋白质与药物复合物的静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵输入到拟合得到的蛋白质与药物的结合自由能的函数中,输出蛋白质与药物的结合自由能。Input the electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy of the protein-drug complex to be calculated into the fitted function of the binding free energy of protein and drug, Export the binding free energy of protein and drug.
第二方面,本公开还提供了蛋白质与药物结合自由能计算系统;In a second aspect, the present disclosure also provides a protein-drug binding free energy calculation system;
蛋白质与药物结合自由能计算系统,包括:Protein and drug binding free energy calculation system, including:
蛋白质与药物的结合自由能的函数构建模块,被配置为构建蛋白质与药物的结合自由能的函数,蛋白质与药物结合自由能的函数为蛋白质与药物复合物的结合自由能与静电项、极性溶剂化能、范德华项、非极性溶剂化能和相互作用熵的函数关系;The function building block of the binding free energy of protein and drug is configured to construct the function of binding free energy of protein and drug, and the function of binding free energy of protein and drug is the binding free energy of protein and drug complex and electrostatic term, polarity The functional relationship of solvation energy, van der Waals term, non-polar solvation energy and interaction entropy;
多元线性拟合模块,被配置为采集训练集中每个蛋白质与药物复合物的各能项,各能量项包括:静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵;利用训练集中每个蛋白质与药物复合物的各能量项和每个蛋白质与药物的实验值,对蛋白质与药物的结合自由能的函数的进行多元线性拟合,得到蛋白质与药物的结合自由能的函数;The multivariate linear fitting module is configured to collect the energy terms of each protein-drug complex in the training set. The energy terms include: electrostatic terms, polar solvation energy terms, van der Waals terms, non-polar solvation energy and mutual Action entropy; use the energy terms of each protein-drug complex in the training set and the experimental values of each protein and drug to perform multivariate linear fitting on the function of the binding free energy of protein and drug to obtain the binding of protein and drug. function of free energy;
结合自由能计算模块,被配置为将待计算的蛋白质与药物复合物的静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵输入到训练好的蛋白质与药物的结合自由能的函数中,输出蛋白质与药物的结合自由能。The binding free energy calculation module is configured to input the electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy of the protein-drug complex to be calculated into the trained protein-drug As a function of the binding free energy of , output the binding free energy of the protein and the drug.
第三方面,本公开还提供了一种电子设备,包括存储器和处理器以及存储在存储器上并在处理器上运行的计算机指令,所述计算机指令被处理器运行时,完成第一方面所述方法的步骤。In a third aspect, the present disclosure also provides an electronic device, including a memory, a processor, and computer instructions stored in the memory and executed on the processor, and when the computer instructions are executed by the processor, the first aspect is completed. steps of the method.
第四方面,本公开还提供了一种计算机可读存储介质,用于存储计算机指令,所述计算机指令被处理器执行时,完成第一方面所述方法的步骤。In a fourth aspect, the present disclosure further provides a computer-readable storage medium for storing computer instructions, which, when executed by a processor, complete the steps of the method in the first aspect.
与现有技术相比,本公开的有益效果是:Compared with the prior art, the beneficial effects of the present disclosure are:
本公开综合PBSA和IE的各自优缺点,通过PBSA和IE计算具有实验值的大量蛋白质与药物复合物的各能项,将计算的各能量项与实验值进行多元线性拟合,得到一组线性拟合函数,通过该拟合函数精确且简单的计算蛋白质与药物复合物的结合自由能。The present disclosure integrates the respective advantages and disadvantages of PBSA and IE, calculates various energy terms of a large number of protein and drug complexes with experimental values through PBSA and IE, and performs multivariate linear fitting between the calculated energy terms and experimental values to obtain a set of linear Fitting function, through which the binding free energy of the protein-drug complex is calculated accurately and simply.
由于该拟合函数的各能量项本身是由主流的方法计算得出,而且又通过大量复合物进行拟合,从最终计算结果来看,其计算的结合自由能在数值上极为接近实验值,整体的相关性也远优于常规方法。Since the energy terms of the fitting function are calculated by the mainstream method, and are also fitted by a large number of complexes, from the final calculation results, the calculated binding free energy is numerically very close to the experimental value, The overall correlation is also much better than conventional methods.
同时,从在测试集上进行的测试结果上看,其得到的规律与训练集上得到的规律一致。因此具有优良的准确性和普适性。再者,由于IE方法的主要优点(即计算速度快)在本公开中得到了充分的利用,本公开还具有计算成本低的优势。At the same time, from the test results on the test set, the rules obtained are consistent with the rules obtained on the training set. Therefore, it has excellent accuracy and universality. Furthermore, since the main advantage of the IE method (ie, fast computation speed) is fully utilized in the present disclosure, the present disclosure also has the advantage of low computational cost.
基于上述的优点,本公开将会在在描述蛋白-药物类系统的亲和力中大放光彩。尤其是在药物的虚拟筛选中,它将会为此提供一条计算可靠、适用广泛和成本低廉的新途径。Based on the above-mentioned advantages, the present disclosure will shine in describing the affinity of protein-drug-like systems. Especially in the virtual screening of drugs, it will provide a computationally reliable, widely applicable and low-cost new avenue for this.
附图说明Description of drawings
构成本申请的一部分的说明书附图用来提供对本申请的进一步理解,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。The accompanying drawings that form a part of the present application are used to provide further understanding of the present application, and the schematic embodiments and descriptions of the present application are used to explain the present application and do not constitute improper limitations on the present application.
图1为一个或多个实施方式的方法流程图。FIG. 1 is a method flow diagram of one or more embodiments.
具体实施方式Detailed ways
应该指出,以下详细说明都是示例性的,旨在对本申请提供进一步的说明。除非另有指明,本公开使用的所有技术和科学术语具有与本申请所属技术领域的普通技术人员通常理解的相同含义。It should be noted that the following detailed description is exemplary and intended to provide further explanation of the application. Unless otherwise defined, all technical and scientific terms used in this disclosure have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
需要注意的是,这里所使用的术语仅是为了描述具体实施方式,而非意图限制根据本申请的示例性实施方式。如在这里所使用的,除非上下文另外明确指出,否则单数形式也意图包括复数形式,此外,还应当理解的是,当在本说明书中使用术语“包含”和/或“包括”时,其指明存在特征、步骤、操作、器件、组件和/或它们的组合。It should be noted that the terminology used herein is for the purpose of describing specific embodiments only, and is not intended to limit the exemplary embodiments according to the present application. As used herein, unless the context clearly dictates otherwise, the singular is intended to include the plural as well, furthermore, it is to be understood that when the terms "comprising" and/or "including" are used in this specification, it indicates that There are features, steps, operations, devices, components and/or combinations thereof.
如图1所示,第一个实施例,本公开实施例提供了蛋白质与药物结合自由能计算方法;As shown in FIG. 1 , the first embodiment, an embodiment of the present disclosure provides a method for calculating the binding free energy of proteins and drugs;
蛋白质与药物结合自由能计算方法,包括:Methods for calculating the free energy of protein-drug binding, including:
构建蛋白质与药物的结合自由能的函数,所述蛋白质与药物结合自由能的函数为蛋白质与药物复合物的结合自由能与静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵的函数关系;Construct the function of the binding free energy of the protein and the drug, and the function of the binding free energy of the protein and the drug is the binding free energy of the protein and the drug complex and the electrostatic term, the polar solvation energy term, the van der Waals term, the non-polar solvation term The functional relationship between energy and interaction entropy;
采集训练集中每个蛋白质与药物复合物的各能项,各能量项包括:静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵;利用训练集中每个蛋白质与药物复合物的各能量项和每个蛋白质与药物的实验值,对蛋白质与药物的结合自由能的函数的进行多元线性拟合,得到蛋白质与药物的结合自由能的函数;Collect the energy terms of each protein-drug complex in the training set, including: electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy; use each protein in the training set Each energy term of the complex with the drug and the experimental value of each protein and drug, perform multivariate linear fitting on the function of the binding free energy of the protein and the drug, and obtain the function of the binding free energy of the protein and the drug;
将待计算的蛋白质与药物复合物的静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵输入到拟合得到的蛋白质与药物的结合自由能的函数中,输出蛋白质与药物的结合自由能。Input the electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy of the protein-drug complex to be calculated into the fitted function of the binding free energy of protein and drug, Export the binding free energy of protein and drug.
进一步地,所述方法,还包括:Further, the method also includes:
将测试集的蛋白质与药物复合物的静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵输入到拟合得到的蛋白质与药物的结合自由能的函数中,输出蛋白质与药物的结合自由能估计值;通过比较计算出的估计值与实验值的关系,评估该拟合函数的合理性、普适性以及稳定性。Input the electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy of the protein-drug complex of the test set into the fitted function of the binding free energy of protein and drug, The estimated value of binding free energy of protein and drug is output; by comparing the relationship between the calculated estimated value and the experimental value, the rationality, generality and stability of the fitting function are evaluated.
进一步地,所述构建蛋白质与药物的结合自由能的函数的具体步骤为:Further, the specific steps of constructing the function of the binding free energy of the protein and the drug are:
PBSA_IE=a(ΔEele+ΔGpb)+bΔEvdw+cΔGnp+dIE+ePBSA_IE=a(ΔE ele +ΔG pb )+bΔE vdw +cΔG np +dIE+e
其中,PBSA_IE表示蛋白质与药物的结合自由能;ΔGpb表示极性溶剂化能项;ΔEvdw表示蛋白质与药物范德华能量项;ΔGnp表示非极性溶剂化能项;IE表示蛋白质与药物复合物的相互作用熵;a、b、c、d、e分别表示待拟合参数。Among them, PBSA_IE represents the binding free energy of the protein and the drug; ΔG pb represents the polar solvation energy term; ΔE vdw represents the van der Waals energy term of the protein and the drug; ΔG np represents the non-polar solvation energy term; IE represents the protein-drug complex The interaction entropy of ; a, b, c, d, and e represent the parameters to be fitted, respectively.
进一步地,在进行多元线性拟合之前,对训练集和测试集进行预处理的步骤包括:Further, before performing multivariate linear fitting, the steps of preprocessing the training set and the test set include:
训练集和测试集均采用同样的预处理步骤,训练集和测试集的蛋白质与药物的蛋白库编号(PDB ID)均来源于PDB bind(网址:http://www.pdbbind-cn.org/)数据库中“Protein-ligand complexes:The general set minus refined set”数据集。我们从该数据集中按字母排列随机从1a~1u共1047个复合物中随机挑选的84个复合物作为训练集,从1v~2h共860个复合物中随机挑选44个复合物作为测试集。训练集和测试集不包含电荷数超过2或含有金属离子的复合物)。复合物的晶体结构来自于PDB(网址:https://www.rcsb.org/)数据库,实验值从PDB bind中查询。The same preprocessing steps were used for both the training set and the test set. The protein library IDs (PDB IDs) of the proteins and drugs in the training set and the test set were all derived from PDB bind (URL: http://www.pdbbind-cn.org/ ) in the "Protein-ligand complexes: The general set minus refined set" dataset. From this dataset, we randomly selected 84 complexes from a total of 1047 complexes from 1a to 1u in alphabetical order as a training set, and randomly selected 44 complexes from a total of 860 complexes from 1v to 2h as a test set. The training and test sets do not contain complexes with more than 2 charges or containing metal ions). The crystal structure of the complex was obtained from the PDB (website: https://www.rcsb.org/) database, and the experimental values were queried from PDB bind.
(110)药物预处理:(110) Drug pretreatment:
对药物利用Gaussian 03软件进行两步量子处理:Two-step quantum processing of drugs using Gaussian 03 software:
第一步:通过Gaussian 03软件的HF/6-31G**方法对药物进行优化处理,用以找到最优结构,即能量最低结构;The first step: optimize the drug through the HF/6-31G** method of the Gaussian 03 software to find the optimal structure, that is, the structure with the lowest energy;
第二步:在高斯基组B3LYP/cc-PVTZ上进行单点能计算,得到药物的静电势;通过约束静电势方法来拟合各原子的电荷;Step 2: Perform single-point energy calculation on the Gaussian group B3LYP/cc-PVTZ to obtain the electrostatic potential of the drug; fit the charge of each atom by the constrained electrostatic potential method;
(120)蛋白质预处理:(120) Protein pretreatment:
由于所获得的蛋白质晶体结构一般不含有氢原子的信息,因此,通过AMBER 12中LEaP模块将蛋白质缺失的氢原子进行补充;Since the obtained protein crystal structure generally does not contain the information of hydrogen atoms, the missing hydrogen atoms of the protein are supplemented by the LEaP module in AMBER 12;
(130)蛋白质与药物复合物预处理:(130) Pretreatment of protein and drug complexes:
将蛋白质与药物复合物置于AMBER ff12SB力场中,并放入具有周期性的TIP3P水盒子当中,溶质距水盒子边缘最小的距离为同时,根据每个体系的带电性,加入反向离子对体系进行电性平衡;The protein and drug complexes are placed in the AMBER ff12SB force field and placed in a periodic TIP3P water box. The minimum distance between the solute and the edge of the water box is At the same time, according to the electrification of each system, counter ions are added to balance the electrical properties of the system;
进一步地,训练集和测试集中每个体系均采用AMBER程序包进行MD模拟,具体模拟方式如下:Further, each system in the training set and test set uses the AMBER package for MD simulation. The specific simulation methods are as follows:
(140)蛋白质与药物复合体系能量优化:(140) Energy optimization of protein and drug complex system:
先用最陡下降法将蛋白质-药物复合物能量最小化,然后用共轭梯度最小化法直至体系的能量到达收敛;First use the steepest descent method to minimize the energy of the protein-drug complex, and then use the conjugate gradient minimization method until the energy of the system reaches convergence;
(150)复合体系升温至常温:(150) composite system is warming up to normal temperature:
进行300ps的限制性MD模拟,约束力常数为将每个体系的温度从0K逐步升高到300K,利用朗之万动力学调节温度,碰撞频率为1.0ps-1,所有涉及氢原子的键都通过SHAKE算法进行约束,模拟步长为2fs;A 300ps restricted MD simulation was performed with the constraining force constant of The temperature of each system was gradually increased from 0K to 300K, the temperature was adjusted by Langevin dynamics, the collision frequency was 1.0ps -1 , all bonds involving hydrogen atoms were constrained by the SHAKE algorithm, and the simulation step size was 2fs;
(160)复合体系非限制性MD模拟:(160) Non-restrictive MD simulation of composite system:
对蛋白质-药物体系进行时长为2ns步长为2fs的非限制性的MD模拟:Perform an unrestricted MD simulation of the protein-drug system with a duration of 2ns and steps of 2fs:
前1ns每隔2000步记录一次轨迹,该轨迹包含复合物、平衡离子和水盒子的水分子中所有原子位置信息,该段MD模拟的目的是让复合物运动到相对平衡的构象;For the first 1 ns, the trajectory is recorded every 2000 steps, and the trajectory contains the information of all atomic positions in the water molecules of the complex, counter ion and water box. The purpose of this MD simulation is to move the complex to a relatively balanced conformation;
后1ns每隔5步记录一次轨迹,该轨迹仅包含复合物的位置信息,结合自由能的各能量项是通过对该轨迹采样获得。The trajectory was recorded every 5 steps for the next 1 ns. The trajectory only contained the position information of the complex, and the energy terms of the combined free energy were obtained by sampling the trajectory.
进一步地,采集训练集和测试集中每个蛋白质与药物体系的各能量项的具体步骤为:Further, the specific steps for collecting the energy terms of each protein and drug system in the training set and the test set are as follows:
(170) (170)
其中,为复合物的静电能,为蛋白质的静电能,为药物的静电能;其通过如下公式得出:in, is the electrostatic energy of the complex, is the electrostatic energy of the protein, is the electrostatic energy of the drug; it is given by the following formula:
其中,N为总原子数,qi、qj为第i、j个原子带电量,Rij为第i个原子与第j个原子间的距离。Among them, N is the total number of atoms, q i and q j are the charge amounts of the i-th and j-th atoms, and R ij is the distance between the i-th atom and the j-th atom.
(180)蛋白质与药物范德华能量项的表达式为:(180) The expression of the van der Waals energy term for proteins and drugs is:
其中,和分别为复合物、蛋白质和药物的范德华势,其通过如下公式得出in, and are the van der Waals potentials of the complex, protein, and drug, respectively, which are given by
其中,Aij、Bij为第i个原子与第j个原子间兰纳-琼斯势,Aij、Bij来源于AMBERff12SB力场。Among them, A ij and B ij are the Lanna-Jones potentials between the ith atom and the j th atom, and A ij and B ij are derived from the AMBERff12SB force field.
(190)极性溶剂化能项:(190) Polar solvation energy term:
ΔGpb为极性溶剂化能项,由隐式溶剂模型求解泊松-玻尔兹曼(PB)方程得出:ΔG pb is the polar solvation energy term, obtained by solving the Poisson-Boltzmann (PB) equation from the implicit solvent model:
其中,ε(r)为r落在分子表面内部或外部时分子内或外的介电常数;k(r)表示溶剂区域中的德拜-休克尔常数;ρ(r)表示溶质分子的电荷分布函数;Φ(r)为r处的电势分布。在本方法中,该项由AMBER软件包中的PBSA程序求解PB方程求得,内部和外部的介电常数分别设置为1和80。where ε(r) is the dielectric constant inside or outside the molecule when r falls inside or outside the molecular surface; k(r) is the Debye-Huckel constant in the solvent region; ρ(r) is the charge of the solute molecule Distribution function; Φ(r) is the potential distribution at r. In this method, this term is obtained by solving the PB equation with the PBSA program in the AMBER software package, and the internal and external dielectric constants are set to 1 and 80, respectively.
(200)非极性溶剂化能项:(200) Non-polar solvation energy term:
其通过溶剂可及表面积SASA得出:It is derived from the solvent accessible surface area SASA:
ΔGnp=γSASA+βΔG np = γSASA+β
其中,经验常数γ和β分别设置为和0.92kcal/mol。where the empirical constants γ and β are respectively set as and 0.92kcal/mol.
(210)蛋白质与药物复合物的相互作用熵:(210) The interaction entropy of protein-drug complexes:
其中,K为玻尔兹曼常数,T为温度,为蛋白质与药物的相互作用能,其可以通过对MD模拟进行平均来评估:where K is the Boltzmann constant, T is the temperature, is the protein-drug interaction energy, which can be estimated by averaging the MD simulations:
则but
其中,N为总共取得的构象帧数,i为第i帧构象,β为常数,ti为第i帧的时间,<.>表示求平均值的数学符号;是蛋白质与药物相互作用能在平均能量附近波动的振幅;Among them, N is the total number of conformation frames obtained, i is the conformation of the i-th frame, β is a constant, t i is the time of the i-th frame, and <.> represents the mathematical symbol for averaging; is the amplitude of the protein-drug interaction energy fluctuation around the mean energy;
基于相互作用熵(IE)的概念,本公开用它来计算复合物的熵变,来代替常规的Nmode方法。它是由严格的理论公式推导得出,与近似推导而出的Normal mode方法相比,它具有更加令人信服的理论依据。Based on the concept of interaction entropy (IE), the present disclosure uses it to calculate the entropy change of the complex instead of the conventional Nmode method. It is derived from a strict theoretical formula, and it has a more convincing theoretical basis than the Normal mode method derived approximately.
其次,它直接通过蛋白质与药物复合物的动力学模拟构象轨迹来计算相互作用熵的平均值,因此每一帧模拟的“气相”中复合物的相互作用能都可用于IE的计算,因此在计算熵的稳定性上也要优于目前的计算熵方法。Second, it directly calculates the average value of the interaction entropy through the kinetically simulated conformational trajectory of the protein-drug complex, so the interaction energy of the complex in the "gas phase" of each frame of simulation can be used for the calculation of IE, so in The stability of computational entropy is also better than that of current computational entropy methods.
再者,由复合物、蛋白质和药物的绝对熵值作差的Normal mode方法,常常由于绝对值太大造成数值误差,而IE方法通过直接计算系统的熵变而完美避开了这个缺点,同时它又极大的加快了计算速度。因此该方法具有可靠性强和计算成本低的特点。Furthermore, the Normal mode method, which is based on the absolute entropy value of complexes, proteins and drugs, often causes numerical errors due to too large absolute values, while the IE method perfectly avoids this shortcoming by directly calculating the entropy change of the system. It also greatly speeds up the calculation. Therefore, the method has the characteristics of strong reliability and low computational cost.
由于PBSA自身的固有的缺陷性,其计算结果往往不准确和可靠。尽管结合PBSA与IE方法(即PBSA+IE方法)计算的理论值与实验值之间具有优良的相关性,但是在数值上存在着较大的绝对误差,最终得到的结合自由能往往被高估。Due to the inherent defects of PBSA, its calculation results are often inaccurate and reliable. Although there is an excellent correlation between the theoretical and experimental values calculated by combining the PBSA and IE methods (ie, PBSA+IE method), there is a large absolute error in the numerical value, and the final binding free energy is often overestimated. .
本公开属于基于PBSA和相互作用熵IE的打分函数的方法,通过由随机挑选的大量的蛋白质与药物复合物组成的训练集进行多元线性拟合,给各能量项赋予不同的权重,从而弥补了PBSA的固有缺陷,以此对其他蛋白质与药物复合物的结合自由能能进行精确的预测。同时,为了验证该方法的普适性,我们又随机挑选了与训练集无重复的多个蛋白质与药物复合物作为测试集对该打分函数进行评估。The present disclosure belongs to the scoring function method based on PBSA and interaction entropy IE. By performing multivariate linear fitting on a training set consisting of a large number of randomly selected protein and drug complexes, each energy term is given different weights, thereby making up for the PBSA is inherently flawed to accurately predict the binding free energies of other protein-drug complexes. At the same time, in order to verify the universality of the method, we randomly selected multiple protein-drug complexes with no repetition from the training set as the test set to evaluate the scoring function.
结合自由能各能量项计算Combining the energy terms of free energy to calculate
结合自由能通过焓项与熵之和进行表示:The binding free energy is expressed as the sum of the enthalpy term and entropy:
ΔG=ΔH-TΔS (1)ΔG=ΔH-TΔS (1)
由于本公开采用IE方法计算熵项,因此上式可写为:Since the present disclosure adopts the IE method to calculate the entropy term, the above formula can be written as:
ΔG=ΔH+IE (2)ΔG=ΔH+IE (2)
PBSA计算焓项PBSA calculates the enthalpy term
蛋白质与药物的自由能可以看成气相结合自由能和溶剂化能两部分:The free energy of protein and drug can be regarded as two parts: gas-phase binding free energy and solvation energy:
ΔH=ΔEMM+ΔGsol (3)ΔH=ΔE MM +ΔG sol (3)
其中ΔEMM为气相能,其形式如下:where ΔE MM is the gas phase energy, and its form is as follows:
ΔEMM=ΔEele+ΔEvdw (4)ΔE MM = ΔE ele + ΔE vdw (4)
其中ΔEele,其表达式为:where ΔE ele , its expression is:
其中,和分别为复合物、蛋白质和药物的静电能,其通过如下公式得出in, and are the electrostatic energies of the complex, protein and drug, respectively, which are obtained by the following formula
其中N为总原子数,qi、qj为第i、j个原子带电量,Rij为第i、j原子间的距离(下同)。式(4)中ΔEvdw为蛋白质与药物范德华能量项:Among them, N is the total number of atoms, q i and q j are the charges of the ith and jth atoms, and R ij is the distance between the ith and jth atoms (the same below). In formula (4), ΔE vdw is the van der Waals energy term of protein and drug:
其中,和分别为复合物、蛋白质和药物的范德华势,其通过如下公式得出:in, and are the van der Waals potentials of the complex, protein, and drug, respectively, which are given by:
其中Aij、Bij为第i个原子与第j个原子间兰纳-琼斯势,此两项来源AMBER ff12SB力场。Among them, A ij and B ij are the Lanna-Jones potentials between the ith atom and the jth atom, which are derived from the AMBER ff12SB force field.
式(3)中ΔGsol为复合物的溶剂化能,其可以分为如下两部分:In formula (3), ΔG sol is the solvation energy of the complex, which can be divided into the following two parts:
ΔGsol=ΔGpb+ΔGnp (9)ΔG sol = ΔG pb + ΔG np (9)
其中ΔGpb为极性溶剂化能项,由隐式溶剂模型求解泊松-玻尔兹曼(PB)方程得出:where ΔG pb is the polar solvation energy term, obtained by solving the Poisson-Boltzmann (PB) equation from the implicit solvent model:
其中ε(r)为r落在分子表面内部或外部时分子内或外的介电常数;k(r)表示溶剂区域中的德拜-休克尔常数;ρ表示溶质分子的电荷分布;Φ(r)为r处的电势分布。在本方法中,该项由AMBER软件包中的PBSA程序求解PB方程求得,内部和外部的介电常数分别设置为1和80。where ε(r) is the dielectric constant inside or outside the molecule when r falls inside or outside the molecular surface; k(r) is the Debye-Huckel constant in the solvent region; ρ is the charge distribution of the solute molecule; Φ( r) is the potential distribution at r. In this method, this term is obtained by solving the PB equation with the PBSA program in the AMBER software package, and the internal and external dielectric constants are set to 1 and 80, respectively.
式(9)中ΔGnp是非极性溶剂化能项,其通过经验溶剂可及表面积SASA得出:ΔG np in Eq. (9) is the non-polar solvation energy term, which is derived from the empirical solvent accessible surface area SASA:
ΔGnp=γSASA+β (11)ΔG np = γSASA+β (11)
在本申请实施例中,经验常数γ和β分别设置为和0.92kcal/mol。所计算的构象是从最后1ns中的构象空间中每隔1000帧取出共100帧构象用于计算PBSA。In the embodiment of the present application, the empirical constants γ and β are respectively set as and 0.92kcal/mol. The calculated conformations are taken from the conformation space in the last 1 ns every 1000 frames for a total of 100 frames for calculating PBSA.
IE计算熵项IE calculates the entropy term
在本申请实施采用的相互作用熵的方法中,结合自由能的气相部分是由如下公式推导出:In the method of interaction entropy adopted in the implementation of this application, the gas phase part of the binding free energy is derived by the following formula:
其中Ep、El和Ew分别为蛋白质、药物和水的内能,和分别为蛋白质与药物、蛋白质-水和药物-水的相互作用能。是蛋白质与药物相互作用能的平均值, 是蛋白质与药物相互作用能在平均能量附近波动的振幅。因此,我们将IE定义为:where E p , El and E w are the internal energies of protein, drug and water, respectively, and are the interaction energies of protein and drug, protein-water and drug-water, respectively. is the average value of protein-drug interaction energy, is the amplitude of the fluctuation of the protein-drug interaction energy around the mean energy. Therefore, we define IE as:
其中K为玻尔兹曼常数,T为温度,为蛋白质与药物的相互作用能,其可以通过对MD模拟进行平均来评估:where K is the Boltzmann constant, T is the temperature, is the protein-drug interaction energy, which can be estimated by averaging the MD simulations:
则but
N为总共取得的构象帧数,i为第i帧构象,β为常数,ti为第i帧的时间。N is the total number of conformation frames obtained, i is the conformation of the ith frame, β is a constant, and t i is the time of the ith frame.
多元线性拟合:PBSA_IEMultivariate Linear Fitting: PBSA_IE
根据结合自由能各能量项之间的本质关系,本公开采用如下方式进行线性拟合:According to the essential relationship between the energy terms of the binding free energy, the present disclosure adopts the following method to perform linear fitting:
PBSA_IE=a(ΔEele+ΔGpb)+bΔEvdw+cΔGnp+dIE+e (16)PBSA_IE=a(ΔE ele +ΔG pb )+bΔE vdw +cΔG np +dIE+e (16)
第二个实施例,本公开实施例还提供了蛋白质与药物结合自由能计算系统;In the second embodiment, an embodiment of the present disclosure also provides a protein-drug binding free energy calculation system;
蛋白质与药物结合自由能计算系统,包括:Protein and drug binding free energy calculation system, including:
蛋白质与药物的结合自由能的函数构建模块,被配置为构建蛋白质与药物的结合自由能的函数,蛋白质与药物结合自由能的函数为蛋白质与药物复合物的结合自由能与静电项、极性溶剂化能、范德华项、非极性溶剂化能和相互作用熵的函数关系;The function building block of the binding free energy of protein and drug is configured to construct the function of binding free energy of protein and drug, and the function of binding free energy of protein and drug is the binding free energy of protein and drug complex and electrostatic term, polarity The functional relationship of solvation energy, van der Waals term, non-polar solvation energy and interaction entropy;
多元线性拟合模块,被配置为采集训练集中每个蛋白质与药物复合物的各能项,各能量项包括:静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵;利用训练集中每个蛋白质与药物复合物的各能量项和每个蛋白质与药物的实验值,对蛋白质与药物的结合自由能的函数的进行多元线性拟合,得到蛋白质与药物的结合自由能的函数;The multivariate linear fitting module is configured to collect the energy terms of each protein-drug complex in the training set. The energy terms include: electrostatic terms, polar solvation energy terms, van der Waals terms, non-polar solvation energy and mutual Action entropy; use the energy terms of each protein-drug complex in the training set and the experimental values of each protein and drug to perform multivariate linear fitting on the function of the binding free energy of protein and drug to obtain the binding of protein and drug. function of free energy;
结合自由能计算模块,被配置为将待计算的蛋白质与药物复合物的静电项、极性溶剂化能项、范德华项、非极性溶剂化能和相互作用熵输入到训练好的蛋白质与药物的结合自由能的函数中,输出蛋白质与药物的结合自由能。The binding free energy calculation module is configured to input the electrostatic term, polar solvation energy term, van der Waals term, non-polar solvation energy and interaction entropy of the protein-drug complex to be calculated into the trained protein-drug As a function of the binding free energy of , output the binding free energy of the protein and the drug.
第三个实施例,本公开实施例还提供了一种电子设备,包括存储器和处理器以及存储在存储器上并在处理器上运行的计算机指令,所述计算机指令被处理器运行时,完成第一个实施例所述方法的步骤。In the third embodiment, the embodiment of the present disclosure also provides an electronic device, including a memory, a processor, and computer instructions stored in the memory and executed on the processor, and when the computer instructions are executed by the processor, the first step is completed. Steps of the method of an embodiment.
本公开还提供了一种电子设备,包括存储器和处理器以及存储在存储器上并在处理器上运行的计算机指令,所述计算机指令被处理器运行时,完成方法中的各个操作,为了简洁,在此不再赘述。The present disclosure also provides an electronic device, including a memory, a processor, and computer instructions stored in the memory and executed on the processor, and when the computer instructions are executed by the processor, each operation in the method is completed. For brevity, It is not repeated here.
应理解,在本公开中,该处理器可以是中央处理单元CPU,该处理器还算可以是其他通用处理器、数字信号处理器DSP、专用集成电路ASIC,现成可编程门阵列FPGA或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that in the present disclosure, the processor may be a central processing unit CPU, and the processor may also be other general-purpose processors, digital signal processors DSP, application-specific integrated circuits ASIC, off-the-shelf programmable gate array FPGA or other Programming logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
该存储器可以包括只读存储器和随机存取存储器,并向处理器提供指令和数据、存储器的一部分还可以包括非易失性随机存储器。例如,存储器还可以存储设备类型的信息。The memory may include read-only memory and random access memory and provide instructions and data to the processor, and a portion of the memory may also include non-volatile random access memory. For example, the memory may also store device type information.
第四个实施例,本公开实施例还提供了一种计算机可读存储介质,用于存储计算机指令,所述计算机指令被处理器执行时,完成第一个实施例所述方法的步骤。In the fourth embodiment, an embodiment of the present disclosure further provides a computer-readable storage medium for storing computer instructions, and when the computer instructions are executed by a processor, the steps of the method described in the first embodiment are completed.
在实现过程中,上述方法的各步骤可以通过处理器中的硬件的集成逻辑电路或者软件形式的指令完成。结合本公开所公开的方法的步骤可以直接体现为硬件处理器执行完成,或者用处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器、闪存、只读存储器、可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器,处理器读取存储器中的信息,结合其硬件完成上述方法的步骤。为避免重复,这里不再详细描述。本领域普通技术人员可以意识到,结合本公开中所公开的实施例描述的各示例的单元即算法步骤,能够以电子硬件或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。In the implementation process, each step of the above-mentioned method can be completed by a hardware integrated logic circuit in a processor or an instruction in the form of software. The steps of the method disclosed in conjunction with the present disclosure can be directly embodied as executed by a hardware processor, or executed by a combination of hardware and software modules in the processor. The software modules may be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other storage media mature in the art. The storage medium is located in the memory, and the processor reads the information in the memory, and completes the steps of the above method in combination with its hardware. To avoid repetition, detailed description is omitted here. Those of ordinary skill in the art can realize that the units, ie algorithm steps, of each example described in conjunction with the embodiments disclosed in the present disclosure can be implemented by electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of this application.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working process of the above-described systems, devices and units may refer to the corresponding processes in the foregoing method embodiments, which will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其他的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能的划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另外一点,所显示或讨论的相互之间的耦合或者直接耦合或者通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性、机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the units is only a division of a logical function. In actual implementation, there may be other division methods, for example, multiple units or components may be combined Either it can be integrated into another system, or some features can be omitted, or not implemented. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, which may be in electrical, mechanical or other forms.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机、服务器或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as separate products, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes .
综上,PBSA_IE方法能广泛的、稳定的以及精确的计算蛋白质与药物复合物的结合自由能。同时,也综合了PBSA和方法的各自优缺点,在计算成本上也具有良好的优势。In conclusion, the PBSA_IE method can extensively, stably and accurately calculate the binding free energy of protein-drug complexes. At the same time, the advantages and disadvantages of PBSA and the method are also integrated, and they also have good advantages in computational cost.
以上所述仅为本申请的优选实施例而已,并不用于限制本申请,对于本领域的技术人员来说,本申请可以有各种更改和变化。凡在本申请的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本申请的保护范围之内。The above descriptions are only preferred embodiments of the present application, and are not intended to limit the present application. For those skilled in the art, the present application may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of this application shall be included within the protection scope of this application.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910167720.XA CN110047559B (en) | 2019-03-06 | 2019-03-06 | Calculation method, system, device and medium for binding free energy of protein and drug |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910167720.XA CN110047559B (en) | 2019-03-06 | 2019-03-06 | Calculation method, system, device and medium for binding free energy of protein and drug |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110047559A CN110047559A (en) | 2019-07-23 |
CN110047559B true CN110047559B (en) | 2021-06-25 |
Family
ID=67274637
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910167720.XA Active CN110047559B (en) | 2019-03-06 | 2019-03-06 | Calculation method, system, device and medium for binding free energy of protein and drug |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110047559B (en) |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111161810B (en) * | 2019-12-31 | 2022-03-22 | 中山大学 | Free energy perturbation method based on constraint probability distribution function optimization |
CN111341391B (en) * | 2020-02-25 | 2023-12-01 | 深圳晶泰科技有限公司 | Free energy perturbation calculation scheduling method for heterogeneous cluster environment |
WO2022094870A1 (en) * | 2020-11-05 | 2022-05-12 | 深圳晶泰科技有限公司 | Relative free energy calculation method which is physically rigorous and which maximizes phase space overlap |
CN112216350B (en) * | 2020-11-05 | 2022-09-13 | 深圳晶泰科技有限公司 | Physical strict relative free energy calculation method with phase space overlapping maximization |
CN114360663B (en) * | 2021-12-30 | 2024-07-02 | 深圳晶泰科技有限公司 | Method, device and storage medium for determining relative binding free energy contribution |
WO2023123288A1 (en) * | 2021-12-30 | 2023-07-06 | 深圳晶泰科技有限公司 | Method and apparatus for determining contribution to relative binding free energy, and storage medium |
WO2023123396A1 (en) * | 2021-12-31 | 2023-07-06 | 深圳晶泰科技有限公司 | Enhanced sampling method, and method for calculating binding free energy of complex |
CN119108052A (en) * | 2024-08-15 | 2024-12-10 | 苏州予路乾行生物科技有限公司 | A method and system for calculating the solvation free energy of drug molecules |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2865627A1 (en) * | 2012-03-16 | 2013-09-19 | Robert J. Woods | Glycomimetics to inhibit pathogen-host interactions |
CN103333227A (en) * | 2013-06-07 | 2013-10-02 | 东南大学 | Metastatic tumor deletion protein small molecule cyclic peptide inhibitor and its preparation method and application |
CN106220707A (en) * | 2016-08-05 | 2016-12-14 | 孙非 | A kind of method for designing of antibody affinity ligand |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1629097A1 (en) * | 2003-05-22 | 2006-03-01 | University of California | Method for producing a synthetic gene or other dna sequence |
CN107862173B (en) * | 2017-11-15 | 2021-04-27 | 南京邮电大学 | A method and device for virtual screening of lead compounds |
-
2019
- 2019-03-06 CN CN201910167720.XA patent/CN110047559B/en active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA2865627A1 (en) * | 2012-03-16 | 2013-09-19 | Robert J. Woods | Glycomimetics to inhibit pathogen-host interactions |
CN103333227A (en) * | 2013-06-07 | 2013-10-02 | 东南大学 | Metastatic tumor deletion protein small molecule cyclic peptide inhibitor and its preparation method and application |
CN106220707A (en) * | 2016-08-05 | 2016-12-14 | 孙非 | A kind of method for designing of antibody affinity ligand |
Non-Patent Citations (2)
Title |
---|
JAK2蛋白热点氨基酸与配体相互作用理论计算预测及新方法的应用;周一凡;《中国优秀硕士学位论文全文数据库 工程科技I辑》;20190115;第14页 * |
蛋白配体结合自由能精确理论计算新方法研究;刘笑;《中国博士学位论文全文数据库 工程科技I辑》;20190115;第20-61页 * |
Also Published As
Publication number | Publication date |
---|---|
CN110047559A (en) | 2019-07-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110047559B (en) | Calculation method, system, device and medium for binding free energy of protein and drug | |
Bleiziffer et al. | Machine learning of partial charges derived from high-quality quantum-mechanical calculations | |
Raman et al. | Automated, accurate, and scalable relative protein–ligand binding free-energy calculations using lambda dynamics | |
Jensen et al. | The elephant in the room of density functional theory calculations | |
US11562808B2 (en) | Rational drug design with computational free energy difference calculation using a modified bond stretch potential | |
Aquilante et al. | Molcas 8: New capabilities for multiconfigurational quantum chemical calculations across the periodic table | |
Canongia Lopes et al. | CL&P: A generic and systematic force field for ionic liquids modeling | |
Shirts et al. | An introduction to best practices in free energy calculations | |
Giese et al. | A GPU-accelerated parameter interpolation thermodynamic integration free energy method | |
Constantin et al. | Modified fourth-order kinetic energy gradient expansion with Hartree potential-dependent coefficients | |
Hemmen et al. | Grand Canonical Monte Carlo Simulations Guided by an Analytic Equation of State Transferable Anisotropic Mie Potentials for Ethers | |
Kiss et al. | Efficient handling of Gaussian charge distributions: An application to polarizable molecular models | |
Sampson et al. | A “stepping stone” approach for obtaining quantum free energies of hydration | |
Tsai et al. | Validation of free energy methods in AMBER | |
Mecklenfeld et al. | Comparison of RESP and IPolQ-mod partial charges for solvation free energy calculations of various solute/solvent pairs | |
Moustafa et al. | Harmonically assisted methods for computing the free energy of classical crystals by molecular simulation: A comparative study | |
Cuendet et al. | Alchemical free energy differences in flexible molecules from thermodynamic integration or free energy perturbation combined with driven adiabatic dynamics | |
Visscher et al. | Deriving a polarizable force field for biomolecular building blocks with minimal empirical calibration | |
Zhang et al. | Accelerating PDE constrained optimization by the reducedbasis method: application to batch chromatography | |
Zhang et al. | Adaptive coupling of a deep neural network potential to a classical force field | |
König et al. | Efficient alchemical intermediate states in free energy calculations using λ-enveloping distribution sampling | |
Robo et al. | Fast free energy estimates from λ-dynamics with bias-updated Gibbs sampling | |
Lukashina et al. | Lipophilicity prediction with multitask learning and molecular substructures representation | |
Abella et al. | Hydration free energy from orthogonal space random walk and polarizable force field | |
Poltavsky et al. | Perturbed path integrals in imaginary time: Efficiently modeling nuclear quantum effects in molecules and materials |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CB03 | Change of inventor or designer information |
Inventor after: Duan Lili Inventor after: Huang Kaifang Inventor after: Cong Yalong Inventor after: Li Hao Inventor after: Zhong Susu Inventor after: Dong Shuheng Inventor before: Duan Lili Inventor before: Huang Kaifang Inventor before: Cong Yalong Inventor before: Li Hao Inventor before: Zhong Susu Inventor before: Dong Shuheng |
|
CB03 | Change of inventor or designer information | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220907 Address after: Room 2006, Building 1, Lushang Phoenix Plaza, Tangye Street, Jinan Area, China (Shandong) Free Trade Pilot Zone, Jinan City, Shandong Province, 250101 Patentee after: Shandong Zhibo Linghang Technology Innovation Co.,Ltd. Address before: No.1 Daxue Road, University Science Park, Changqing District, Jinan City, Shandong Province Patentee before: SHANDONG NORMAL University |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220916 Address after: A2-304, Dawangshan Civil Affairs Apartment, Nanshan District, Shenzhen, Guangdong 518000 Patentee after: Yi Jihui Address before: Room 2006, Building 1, Lushang Phoenix Plaza, Tangye Street, Jinan Area, China (Shandong) Free Trade Pilot Zone, Jinan City, Shandong Province, 250101 Patentee before: Shandong Zhibo Linghang Technology Innovation Co.,Ltd. |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20230728 Address after: D1101, Building 4, Software Industry Base, No. 19, 17, 18, Haitian 1st Road, Binhai Community, Yuehai Street, Nanshan District, Shenzhen, Guangdong, 518000 Patentee after: Shenzhen Xinrui Gene Technology Co.,Ltd. Address before: A2-304, Dawangshan Civil Affairs Apartment, Nanshan District, Shenzhen, Guangdong 518000 Patentee before: Yi Jihui |