Molecular Dynamics
Molecular Dynamics
Molecular Dynamics
A Short Introduction
Michel Cuendet
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
Plan
Introduction
The classical force field
Setting up a simulation
Connection to
statistical mechanics
Usage of MD simulation
Why we do simulation
In some cases, experiment is :
1.
impossible
Inside of stars
Weather forecast
2.
too dangerous
Flight simulation
Explosion simulation
3.
expensive
4.
blind
replace experiment
provoke experiment
explain experiment
aid in establishing intellectual property
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
Molecular modeling
What is Molecular Modeling?
Molecular Modeling is concerned with the description of the atomic
and molecular interactions that govern microscopic and macroscopic
behaviors of physical systems
Average of observable
over selected microscopic
states
Computational tools
Quantum Mechanics (QM)
Electronic structure (Schrdinger)
ACCURATE
EXPENSIVE small system
10-100 atoms
10-100 ps
104-105 atoms
10-100 ns
MM
QM
104-105 atoms
10-100 ps
5
Types of phenomena
Goal : simulate/predict processes such as
1.
2.
3.
4.
polypeptide folding
biomolecular association
partitioning between solvents
membrane/micelle formation
thermodynamic equilibria
governed by weak
(non-bonded) forces
5.
6.
7.
chemical transformations
governed by strong forces
characteristics (1-4):
degrees of freedom:
equations of motion:
governing theory:
characteristics (5-7):
degrees of freedom:
equations of motion:
governing theory:
classical MD
quantum MD
Micelle Formation
denatured
Complexation
bound
unbound
micelle
mixture
Partitioning
in membrane
in water in mixtures
Plan
Introduction
The classical force field
Setting up a simulation
Connection to
statistical mechanics
Usage of MD simulation
Degrees of freedom:
atoms as elementary
particles + topology
Forces or
interactions
between atoms
Boundary conditions
MOLECULAR
MODEL
Force field =
physico-chemical
knowledge
Methods to generate
configurations of
atoms: Newton
system
temperature
pressure
walls
external forces
10
Bonds
r
E bond = K b (r " r0 )
r0
Type: CT1-CT1-CT3
Angles
11
Dihedral angles
Type: OC-OC-CT1-CC
O
C
H2N
E improper = K" (# $ # 0 )
R
2
12
+$ # '12 $ # ' 6 .
+$ r '12 $ r ' 6 .
E VdW = 4" -& ) * & ) 0 = " -& m ) * 2& m ) 0
%r(/
,% r ( % r ( /
,% r (
Combination rule for two different atoms i, j :
rm = rm,i + rm, j
Type: CT3-CT3
" = "i" j
!
Repulsive :
Pauli
! exclusion principle
1
" 12
r
EVdW
Attractive:
induced dipole / induced dipole
!
rm
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
1
"# 6
r
13
Electrostatic interactions
Coulomb law
qi q j
E elec =
4 "#0# rij
14
Derived Interactions
Some interactions are often referred to as particular interactions, but they
result from the two interactions previously described, i.e. the electrostatic
and the van der Waals interactions.
H
d
2) Hydrophobic effect
- Collective effect resulting from the energetically
unfavorable surface of contact between the water
and an apolar medium (loss of water-water Hb)
- The apolar medium reorganizes to minimize the
water exposed surface (compaction, association... )
Water
Oil
15
# K b (r " r0 ) +
bonds
# K$ ($ " $0 ) +
angles
impropers
/) r ,12 ) r , 6 2
qiq j
m
m
+ #(1+ . " 2+ . 4 + #
* * r - 3 i> j 4 5(0( r
i> j 0 r
Introduction of cutoff
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
16
cutofnb
cutonnb cutnb
!
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
17
Effect of cutoff
No cutoff
8 cutoff
Elec
VdW
Total
Elec
VdW
Total
18
# K (r " r ) + #K ($ " $ )
b
bonds
angles
'
impropers
dihedrals
0* ) -12 * ) -6 3
qiq j
+ # 4(2, / " , / 5 + #
1+ r . + r . 4 i> j 46(0( r
i> j
Phase
Type of properties
Force field
parameters
structural data
(exp.)
Type of
system
small
molecules
crystalline solid
phase
r0, 0, 0
spectroscopic data
(exp.)
small
molecules
gas phase
molecular geometry:
bond lengths, angles
intra-molecular
vibrations: force
constants
small
molecules
gas phase
torsional-angle
rotational profiles
K, , n
small
molecules
gas phase
atom charges
charges qi (initial)
thermodynamic
data
(exp.)
molecules in
solution,
mixtures
condensed phase
heat of vaporisation,
density, free energy of
solvation
v. d. Waals : i, i
charges qi (final)
dielectric data
(exp.)
small
molecules
condensed phase
dielectric permittivity,
relaxation
charges
qi
transport data
(exp.)
small
molecules
condensed phase
transport coefficients:
diffusion, viscosity
v. d. Waals : i, i
charges qi
Type of data
quantum-chemical
calculations :
energy profiles
(theor.)
quantum-chemical
calculations :
electron densities
(theor.)
K b , K , K
19
Solvation
Fundamental influence on the structure, dynamics and
thermodynamics of biological molecules
Effect through:
Solvation of charge
Screening of charge - charge interactions
qi q j
E elec =
4 "#0#(r) r
=1
-
=80
20
21
Implicit solvation
Screening constant
" = Nr
!
Generalized born (GB) equation.
=80
Gelec = $
i> j
=1
qiq j
qi q j
1 & 1)
% (1% +$ $
4 "#0 r 2 ' # * i j r 2 + ai a j exp(% r 2 4ai a j )
ai : Born radius
solvation energy
22
Solvent accessible
Surface (SASA)
Van der Waals
Surface
Definitions:
- Van der Waals:
- Connolly:
- Solvent:
23
Connolly
(Contact)
Solvent accessible
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
24
Limitations of classical MD
Problems
Solutions
QM-MM
Full QM simulations
25
Map
to all-atom
configurations
Centre of mass
A1 A4
Coarse-grained model
4 atoms
Centre of mass
B1 B4
Centre of mass
C1 C4
Centre of mass
D1 D4
Centre of mass
W1 W4
B
C
D
26
Amber
amber.scripps.edu
GROMOS
www.igc,ethz.ch/GROMOS
Gromacs
www.gromacs.org
NAMD
www.ks.uiuc.edu/Research/namd
E = explicit solvent
I = implicit solvent
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
Plan
Introduction
The classical force field
Setting up a simulation
Connection to
statistical mechanics
Usage of MD simulation
28
4) Thermodynamical properties:
T0
NV
29
MD flowchart
Rescale velocities
Temp. Ok?
Structure Ok?
http://www.ch.embnet.org/MD_tutorial/
30
Coordinate files
Cartesian coordinates
(x,y,z)
ATOM
ATOM
ATOM
ATOM
ATOM
ATOM
ATOM
1
2
3
4
5
6
7
N
HT1
HT2
HT3
CA
HA
CB
VAL
VAL
VAL
VAL
VAL
VAL
VAL
1
1
1
1
1
1
1
-0.008
-0.326
-0.450
-0.172
1.477
1.777
2.038
-0.022
0.545
-0.956
0.566
-0.077
-0.598
-0.740
-0.030
0.778
-0.084
-0.876
0.073
0.971
-1.193
1.00
1.00
1.00
1.00
1.00
1.00
1.00
0.00
0.00
0.00
0.00
0.00
0.00
0.00
PEP
PEP
PEP
PEP
PEP
PEP
PEP
D
B
1
1
1
1
1
1
1
1
N
N
N
CG1
CG1
CA
HG11
HG11
1
1
1
1
1
1
1
1
C
C
CA
CA
CA
CB
CB
CB
1
1
1
1
1
1
1
1
*CA
*CA
CB
*CB
*CB
CG1
*CG1
*CG1
1
1
1
1
1
1
1
1
CB
HA
CG1
CG2
HB
HG11
HG12
HG13
1.4896
1.4896
1.4896
1.5421
1.5421
1.5353
1.1099
1.1099
105.11 117.88
105.11 -118.25
108.86 176.05
110.92 121.67
110.92 -118.15
110.92
56.96
111.11 -119.80
111.11 120.81
111.68
108.30
110.92
110.44
109.36
111.11
110.60
110.60
RCD
1.5353
1.0807
1.5421
1.5454
1.1177
1.1099
1.1134
1.1103
31
1
2
3
4
5
6
20
21
22
23
24
25
No.
H
HC
HA
HT
HP
HB
C
CA
CT1
CT2
CT3
CPH1
atom
type
1.00800
1.00800
1.00800
1.00800
1.00800
1.00800
H
H
H
H
H
H
!
!
!
!
!
!
polar H
N-ter H
nonpolar H
TIPS3P WATER HYDROGEN
aromatic H
backbone H
12.01100
12.01100
12.01100
12.01100
12.01100
12.01100
C
C
C
C
C
C
!
!
!
!
!
!
mass
90 atom types
atom
32
O
NH
NH
O
R
NH
NH
R
NH
R
Residue definition in
CHARMM
33
file : top_all22_prot.inp
Total charge
RESI ALA
0.00
GROUP
ATOM N
NH1
-0.47 !
|
ATOM HN
H
0.31 ! HN-N
ATOM CA
CT1
0.07 !
|
HB1
ATOM HA
HB
0.09 !
|
/
GROUP
! HA-CA--CB-HB2
ATOM CB
CT3
-0.27 !
|
\
ATOM HB1 HA
0.09 !
|
HB3
ATOM HB2 HA
0.09 !
O=C
ATOM HB3 HA
0.09 !
|
GROUP
!
ATOM C
C
0.51
ATOM O
O
-0.51
BOND CB CA N HN N CA
BOND C CA C +N CA HA CB HB1 CB HB2 CB HB3
DOUBLE O C
IMPR N -C CA HN C CA +N O
DONOR HN N
ACCEPTOR O C
IC -C
CA
*N
HN
1.3551 126.4900 180.0000
IC -C
N
CA
C
1.3551 126.4900 180.0000
IC N
CA
C
+N
1.4592 114.4400 180.0000
IC +N
CA
*C
O
1.3558 116.8400 180.0000
IC CA
C
+N
+CA
1.5390 116.8400 180.0000
IC N
C
*CA CB
1.4592 114.4400 123.2300
IC N
C
*CA HA
1.4592 114.4400 -120.4500
IC C
CA
CB
HB1
1.5390 111.0900 177.2500
IC HB1 CA
*CB HB2
1.1109 109.6000 119.1300
IC HB1 CA
*CB HB3
1.1109 109.6000 -119.5800
Atom name
Atom type
Atom charge
Bond
Improper
angle
Previous residue
Next residue
115.4200
114.4400
116.8400
122.5200
126.7700
111.0900
106.3900
109.6000
111.0500
111.6100
0.9996
1.5390
1.3558
1.2297
1.4613
1.5461
1.0840
1.1109
1.1119
1.1114
IC
34
Patching residues
Treat protein special features.
To make N- and C-termini:
PRES NTER
1.00 ! standard N-terminus
GROUP
! use in generate statement
ATOM N
NH3
-0.30 !
ATOM HT1 HC
0.33 !
HT1
ATOM HT2 HC
0.33 !
(+)/
ATOM HT3 HC
0.33 ! --CA--N--HT2
ATOM CA
CT1
0.21 !
|
\
ATOM HA
HB
0.10 !
HA
HT3
DELETE ATOM HN
BOND HT1 N HT2 N HT3 N
DONOR HT1 N
DONOR HT2 N
DONOR HT3 N
IC HT1 N
CA
C
0.0000 0.0000 180.0000 0.0000
IC HT2 CA
*N
HT1
0.0000 0.0000 120.0000 0.0000
IC HT3 CA
*N
HT2
0.0000 0.0000 120.0000 0.0000
0.0000
0.0000
0.0000
GROUP
ATOM 1CB CT2
ATOM 1SG SM
GROUP
ATOM 2SG SM
ATOM 2CB CT2
DELETE ATOM 1HG1
DELETE ATOM 2HG1
BOND 1SG 2SG
IC 1CA 1CB 1SG
IC 1CB 1SG 2SG
IC 1SG 2SG 2CB
2SG
2CB
2CA
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
180.0000
90.0000
180.0000
0.0000
0.0000
0.0000
0.0000
0.0000
0.0000
Etc...
35
Kb
r0
36
*
!------ Standard Topology and Parameters
OPEN UNIT 1 CARD READ NAME top_all22_prot.inp
READ RTF CARD UNIT 1
CLOSE UNIT 1
OPEN UNIT 1 CARD READ NAME par_all22_prot.inp
READ PARA CARD UNIT 1
CLOSE UNIT 1
! Generate actual topology
OPEN UNIT 1 READ CARD NAME 1aho-xray.pdb
READ SEQUENCE PDB UNIT 1
GENE 1S SETUP FIRST NTER LAST CTER
REWIND UNIT 1
READ COOR PDB UNIT 1
CLOSE UNIT 1
64
298
1753
169
118
37
38
output
*
CHARMM>
ENERGY
Delta-E
ANGLes
ELEC
--------0.00000
88.24015
-1285.42224
---------
GRMS
UREY-b
HBONds
--------4.26768
5.92868
0.00000
---------
DIHEdrals
ASP
---------
IMPRopers
USER
---------
239.13668
0.00000
---------
2.18868
0.00000
---------
39
Energy landscape
Landscape for / plane of dialanine
3N Spatial coordinates
40
Minimization
2
"
E
and
2 >0
"x i
Used to:
relieve strain in experimental conformations
! (energetically)
find
stable states
!
Find lower
energy mimima
41
Step-size
DIHEdrals
ASP
--------0.00000
6.34896
0.00000
--------0.00545
0.97825
0.00000
--------0.00279
1.87179
0.00000
---------
ENERgy
BONDs
VDWaals
---------28.09805
0.92928
6.19000
---------
Delta-E
ANGLes
ELEC
--------0.00062
3.80437
-41.86221
---------
GRMS
UREY-b
HBONds
--------0.00826
0.58415
0.00000
---------
0.15433
0.00000
--------0.05940
0.00000
--------0.09659
0.00000
---------
0 GROUP PAIRS
IMPRopers
USER
---------
0.00027
2.19377
0.00000
--------0.00005
2.18679
0.00000
---------
0.06973
0.00000
--------0.06957
0.00000
---------
200) exceeded.
Step-size
DIHEdrals
ASP
--------0.00005
2.18679
0.00000
---------
IMPRopers
USER
--------0.06957
0.00000
---------
42
dE i
dri
MD algorithm
F(t)
r(t + "t) = 2r(t) # r(t # "t) +
$ "t 2
m
calculate
forces F(t)
t = t + "t
!
update
r(t + "t)
t ~ 1 fs = 10-15 s
Propagation of time: position at time t+dt is a determined by position at time t and t-dt, and by
the acceleration at time t (i.e., the forces at time t)
The equations of motion are deterministic, e.g., the positions and the velocities at time zero
!
determine
the positions and velocities at all other times, t.
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
43
!------ Coordinates
OPEN UNIT 1 READ CARD NAME val.pdb
READ COOR PDB UNIT 1
CLOSE UNIT 1
!------ MD
OPEN WRITE
OPEN WRITE
OPEN WRITE
DYNA
simulation
UNIT 31 CARD NAME traj/dyna.rst
UNIT 32 FILE NAME traj/dyna.dcd
UNIT 33 CARD NAME traj/dyna.ene
VERLET
IUNWRI
IPRFRQ
FIRSTT
IEQFRQ
INBFRQ
Generated files:
Restart file (rst). To restart after
crash or to continue MD sim.
Collection of instantaneous
coordinates along trajectory (dcd)
! Restart file
! Coordinates file
! Energy file
START NSTEP 1000 TIMESTEP 0.001 31 IUNCRD 32 KUNIT 33 100 NPRINT 100 NSAVC 100 NSAVV 100 ISVFRQ 2000 300.0 FINALT 300.0 ICHEW 1 10 IASORS 0 ISCVEL 0 IASVEL 1 ISEED 8364127 10
Control output
Control temperature
Control non bonded list update
frequency
44
Output of MD simulation
DYNA DYN: Step
DYNA PROP:
DYNA INTERN:
DYNA EXTERN:
DYNA PRESS:
---------DYNA>
0
DYNA PROP>
DYNA INTERN>
DYNA EXTERN>
DYNA PRESS>
----------
Time
GRMS
BONDs
VDWaals
VIRE
--------0.00000
1.79744
0.49721
4.45183
0.00000
---------
TOTEner
HFCTote
ANGLes
ELEC
VIRI
---------6.35935
-6.35862
2.67684
-37.44124
2.21273
---------
TOTKe
HFCKe
UREY-b
HBONds
PRESSE
--------19.31479
19.31700
0.28823
0.00000
0.00000
---------
ENERgy
EHFCor
DIHEdrals
ASP
PRESSI
---------25.67414
0.00074
3.69866
0.00000
0.00000
---------
TEMPerature
VIRKe
IMPRopers
USER
VOLUme
--------381.16244
-3.31909
0.15433
0.00000
0.00000
---------
TOTKe
HFCKe
UREY-b
HBONds
PRESSE
--------16.21830
16.53681
3.70007
0.00000
0.00000
---------
ENERgy
EHFCor
DIHEdrals
ASP
PRESSI
---------8.40109
0.10617
3.31010
0.00000
0.00000
---------
TEMPerature
VIRKe
IMPRopers
USER
VOLUme
--------320.05568
-73.15251
0.21480
0.00000
0.00000
---------
[...]
DYNA DYN: Step
Time
TOTEner
DYNA PROP:
GRMS
HFCTote
DYNA INTERN:
BONDs
ANGLes
DYNA EXTERN:
VDWaals
ELEC
DYNA PRESS:
VIRE
VIRI
-------------------------DYNA>
10
0.01000
7.81721
DYNA PROP>
14.53782
7.92338
DYNA INTERN>
3.28801
14.20105
DYNA EXTERN>
9.54825
-42.66337
DYNA PRESS>
0.00000
48.76834
-------------------------UPDECI: Nonbond update at step
10
[...]
45
p
m
p = F(r)
r =
pi2
H (r ,p) = "
+ V(r ) =
i 2mi
cste
In practice, constant energy dynamics is not often used for two reasons :
System :
reservoir
cutoffs
constraints
barostat
46
Plan
Introduction
The classical force field
Setting up a simulation
Connection to
statistical mechanics
Usage of MD simulation
47
Thermodynamic ensembles
A macroscopic state is described by :
- number of particles :
- volume :
- energy :
N
V
E
- chemical potential :
- pressure :
- temperature :
P
T
fixed N, V, E
fixed N, V, T
fixed N, P, T
fixed , P, T
often used in MD
often used in MD
48
1 " #E i
Pi = e
Z
= 1/KBT
KB = Boltzmann constant
Z =$ e" #Ej
such that :
"P =1
i
!
Illustration:
If a system can have two unique states, state (1) and state (2),
then the ratio of systems in state (1) and (2) is
P 1 e__E
= _ _E = e_ __E _ E _= e_ __ E
P2 e
1
(1)
(2)
Cave: if state (1) and (2) are composed of several microscopic states, E G
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
49
Z = $ e" #E i
i
O =
1
# $E i
O
e
" i
Z i
...
Expectation Value
!
"
ln( Z ) = U
E =
"#
Internal Energy
# "ln( Z ) &
p = kT%
(
$ "V ' N,T
Pressure
A = "kT ln(Z)
50
Microscopic
E1, P1 ~ e-E1
Macroscopic
Expectation value
O =
Where
!
1
# $E
Oie i
"
Z
Z = $e
E2, P2 ~ e-E2
E3, P3 ~ e-E3
" #E i
E4, P4 ~ e-E4
E5, P5 ~ e-E5
51
Ergodic Hypothesis
The ergodic hypothesis is that the ensemble averages used to compute expectation
values can be replaced by time averages over the simulation.
O
1
Z
Ergodicity
ensemble
1
" #E(r, p )
drdp =
$ O(r, p)e
%
time
$ O(t)dt
0
Microcanonic ensemble
Canonical ensemble
Isothermic-isobaric ensemble
P = cst.
P(E) = e-E
P(E) = e-(E+PV)
" #E
52
ensemble
NVE simulation in a
local minimum
3N Spatial coordinates
1
=
Z
$ O(r, p)e
" #E (r, p )
drdp
?=
1
%
$ O(t)dt =
time
2)
NVE
NV
T
(infinite E
reservoir)
fixed P
3)
T
(infinite E
reservoir)
54
friction term
temperature regulation
the
canonical distribution is reproduced
for the physical variables
Conserved quantity :
Non-Hamiltonian dynamics...
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
55
(t)
ri =
pi
mi
This equation can for example simulate the friction and stochastic effect of the
solvent in implicit solvent simulations. The temperature is adjusted via and ,
using the dissipation-fluctuation
! theorem.
The stochastic
term can improve barrier crossing and hence sampling.
!
56
P(C ")
= e# $ (V "#V )
P(C)
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
57
Plan
Introduction
The classical force field
Setting up a simulation
Connection to
statistical mechanics
Usage of MD simulation
58
Statistical mechanics:
energy
U(x)
kBT
S(x1)
S(x2)
F = U - TS
U(x2)
x2 may have higher energy but lower
free energy than x1
U(x1)
x1
x2
coordinate x
60
Types of problems
Molecular dynamics simulations permit the study of complex, dynamic
processes that occur in biological systems. These include, for example:
Protein stability
Conformational changes
Protein folding
Molecular recognition: proteins, DNA, membranes, complexes
Ion transport in biological systems
http://www.ch.embnet.org/MD_tutorial/
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
61
Historical perspective
Theoretical milestones:
Newton (1643-1727):
Boltzmann(1844-1906):
Schrdinger (1887-1961):
Rahman (1964):
Proteins
Liquids
Wood (1957):
Alder (1957):
62
System sizes
Simulations in explicit solvent
BPTI (VAC), bovine pancreatic
trypsin inhibitor without solvent
BPTI, bovine pancreatic
trypsin inhibitor with solvent
RHOD, photosynthetic reaction
center of Rhodopseudomonas
viridis
HIV-1, HIV-1 protease
ES, estrogenDNA
STR, streptavidin
DOPC, DOPC lipid bilayer
RIBO, ribosome
Solid curve, Moores law
doubling every 28.2 months.
Dashed curve, Moores law
doubling every 39.6 months.
K.Y. Sanbonmatsu, C.-S. Tung, Journal of Structural Biology 157(3) : 470, 2007
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
63
1 million atoms
Simulation time : 50 ns
system size : 220 A
64
65
Exemple : Aquaporin
Selective translocation of water across a membrane
S. Hub and Bert L. de Groot. Mechanism of selectivity in aquaporins and aquaglyceroporins PNAS. 105:1198-1203 (2008)
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
66
Exemple : Aquaporin
Selective translocation of water across a membrane
Propose a model for
selectivity at the atomic
level
S. Hub and Bert L. de Groot. Mechanism of selectivity in aquaporins and aquaglyceroporins, PNAS 105:1198-1203 (2008)
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008
67