Collisionless Dynamics
Collisionless Dynamics
Collisionless Dynamics
Useful to Remember
A⃗×A ⃗=0
A⃗×B ⃗ = −B ⃗ ×A ⃗
A⃗ · (A
⃗ × B)
⃗ =0
A⃗ · (B
⃗ × C)
⃗ = B·(⃗ C ⃗ × A)⃗ = C·(
⃗ A ⃗ × B)
⃗
A⃗ × (B⃗ × C)
⃗ = B(⃗ A ⃗ · C)
⃗ − C(⃗ A⃗ · B)
⃗
(A⃗ × B)
⃗ · (C⃗ × D)
⃗ = (A ⃗ · C)(
⃗ B ⃗ · D)
⃗ − (A⃗ · D)(
⃗ B ⃗ · C)
⃗
⃗ =vector operator= ( ∂ ,
∇ ∂
, ∂
)
∂x ∂y ∂z
⃗
∇S = gradS = vector
⃗ ·A
∇ ⃗ = divA⃗ = scalar
⃗ ×A
∇ ⃗ = curlA⃗ = vector
Summary of Vector Calculus II
⃗ ·∇
⃗ = scalar operator = ∂2 ∂2 ∂2
Laplacian: ∇ = ∇
2
∂x2
+ ∂y 2
+ ∂z 2
∇2 S ⃗ · (∇S)
= ∇ ⃗ = scalar
⃗
∇2 A ⃗ · ∇)
= (∇ ⃗ A
⃗ = vector
⃗ ∇
∇( ⃗ · A)
⃗ ̸= ∇2 A⃗ = vector
⃗ × (∇S)
∇ ⃗ = 0 curl(gradS) = 0
⃗ · (∇
∇ ⃗ × A)
⃗ = 0 ⃗ =0
div(curl A)
⃗ × (∇
∇ ⃗ × A)
⃗ ⃗ ∇
= ∇( ⃗ · A)
⃗ − ∇2 A ⃗
⃗
∇(ST ) = ⃗ + T ∇S
S ∇T ⃗
⃗ · (S A)
∇ ⃗ = S(∇⃗ · A)
⃗ +A ⃗ · ∇S
⃗
⃗ × (S A)
∇ ⃗ = S(∇⃗ × A)
⃗ −A ⃗ × ∇S⃗
⃗ · (A
∇ ⃗ × B)
⃗ = ⃗ · (∇
B ⃗ × A)
⃗ −A ⃗ · (∇
⃗ × B)
⃗
Integral Theorems I
Gradient Theorem: Let γ be a curve running from ⃗ x0 to ⃗ x1 , d⃗l is the
x) is a scalar field then:
directed element of length along γ , and φ(⃗
!1
x
⃗ !1
x
⃗
⃗ · d⃗l =
∇φ dφ = φ(⃗
x1 ) − φ(⃗
x0 )
x0
⃗ x0
⃗
It follows that
"
⃗ · d⃗l = 0
∇φ
⃗ ×F
∇ ⃗ =0
We immediately infer that a conservative force is curl free, and that the
⃗ · d⃗
amount of work done (dW = F r ) is independent of the path taken.
⃗ =
∇ψ 1 ∂ψ
⃗
e
hi ∂qi i
The divergence:
$ %
⃗ ·A
∇ ⃗= 1 ∂
(h2 h3 A1 ) + ∂
(h3 h1 A2 ) + ∂
(h1 h2 A3 )
h1 h2 h3 ∂q1 ∂q2 ∂q3
The Laplacian:
$ & ' & ' & '%
1 ∂ h2 h3 ∂ψ ∂ h3 h1 ∂ψ ∂ h1 h2 ∂ψ
∇2 ψ = h1 h2 h3 ∂q1 h1 ∂q1
+ ∂q2 h2 ∂q2
+ ∂q3 h3 ∂q3
Cylindrical Coordinates
For cylindrical coordinates (R, φ, z) we have that
x = R cos φ y = R sin φ z=z
The scale factors of the metric are:
hR = 1 hφ = R hz = 1
x = R⃗
and the position vector is ⃗ eR + z⃗
ez
⃗ = AR ⃗
Let A eR + Aφ⃗
eφ + Az ⃗
ez an arbitrary vector, then
AR = Ax cos φ − Ay sin φ
Aφ = −Ax sin φ + Ay cos φ
Az = Az
v = Ṙ⃗
Velocity: ⃗ e˙ R + ż⃗
eR + R⃗ ez = Ṙ⃗
eR + Rφ̇⃗
eφ + ż⃗
ez
Gradient & Laplacian:
⃗ ·A
⃗= 1 ∂ 1 ∂Aφ ∂Az
∇ R ∂R
(RAR ) + R ∂φ
+ ∂z
& '
1 ∂ ∂ψ 1 ∂2ψ ∂2ψ
∇ ψ=
2
R ∂R
R ∂R + R2 ∂φ2
+ ∂z 2
Spherical Coordinates
For spherical coordinates (r, θ, φ) we have that
x = r sin θ cos φ y = r sin θ sin φ z = r cos θ
The scale factors of the metric are:
hr = 1 hθ = r hφ = r sin θ
x = r⃗
and the position vector is ⃗ er
⃗ = Ar ⃗
Let A er + Aθ ⃗
eθ + Aφ⃗
eφ an arbitrary vector, then
v = ṙ⃗
Velocity: ⃗ e˙ r = ṙ⃗
er + r ⃗ er + r θ̇⃗
eθ + r sin θ φ̇⃗
eφ
Gradient & Laplacian:
⃗ ·A
⃗= 1 ∂ 1 ∂ 1 ∂Aφ
∇ 2
r ∂r
(r 2
Ar ) + r sin θ ∂θ
(sin θAθ ) + r sin θ ∂φ
& ' & '
∂2ψ
∇ ψ=
2 1 ∂
r 2 ∂r
r 2 ∂ψ
∂r
+ 2
1 ∂
r sin θ ∂θ
sin θ ∂φ
∂θ
+ 1
r 2 sin2 θ ∂ψ 2
Introduction
COLLISIONLESS DYNAMICS: The study of the motion of large numbers of
point particles orbiting under the influence of their mutual self-gravity
MAIN GOALS
∇2 Φ = 4πGρ
For ρ = 0 this reduces to the Laplace equation: ∇2 Φ = 0.
see B&T p.31 for derivation of Poisson Equation
Gauss’s Theorem & Potential Theorem
If we integrate the Poisson Equation, we obtain
! ! !
4πG ρd x
3
⃗ = 4πGM = ∇ Φd x
2
⃗ = 3 ⃗
∇Φd 2
⃗
s
V S
!
Gauss’s Theorem: ⃗ d2 ⃗
∇Φ s = 4πGM
S
Gauss’s Theorem states that the integral of the normal component of the
g (⃗
gravitational field [⃗ ⃗ ] over any closed surface S is equal to 4πG
r) = ∇Φ
times the total mass enclosed by S .
!
cf. Electrostatics: ⃗ ·⃗
E n d 2
s=
⃗ Qint
S ϵ0
1
!
W = 2
x) Φ(⃗
ρ(⃗ x) d3 ⃗
x
(see B&T p.33 for derivation)
⃗i,j = Gmi mj
F |⃗ xj | 3
xi −⃗
(⃗
xi xj )
−⃗
dvk,i #
N
mj
dt
= G (xk,i −xk,j )2
j=1,j̸=i
dxk,i
dt
= vk,i (k = 1, 3)
This corresponds to a closed set of 6N equations, for a total of 6N
unknowns (x, y, z, vx , vy , vz )
Since N is typically very, very large, we can’t make progress studying the
dynamics of these systems by solving the 6N equations of motion.
Even with the most powerful computers to date, we can only run N -body
simulations with N ∼< 106
From Discrete to Smooth
The density distribution and gravitational potential of N -body system are:
#
N
ρN (⃗
x) = ⃗ i)
x−x
mi δ(⃗
i=1
#
N
Gmi
ΦN (⃗
x) = − |⃗
x−⃗
xi |
i=1
#
N
mj
⃗i
F = G (⃗
xj ⃗ i)
−x
|⃗ xi | 3
xj −⃗
j=1,j̸=i
#
N ! (⃗ xi )
xj −⃗
= G |⃗ xi | 3
xj −⃗
mj δ(⃗
xj ⃗ )d3 x
−x ⃗
j=1,j̸=i
! (⃗ xi )
xj −⃗
= G |⃗
xj −⃗
xi | 3 ρN (⃗
x )d 3
x
⃗
We will replace ρN (⃗
x) and ΦN (⃗
x) with smooth and continuous functions
x) and Φ(⃗
ρ(⃗ x)
From Discrete to Smooth
For systems with large N , it is useful to try to use statistical descriptions of
the system (cf. Thermodynamics)
Replacing a discrete density distribution by a continuous density distribution
is familiar to us from fluid dynamics and plasma physics
GRAVITATIONAL SYSTEM
• mean-free path of particles ≫ size of system
• No collisional pressure, although kinetic energy of particles act as a
source of ‘pressure’, balancing the potential energy in virial equilibrium.
• No equivalent of equation of state. Pressure follows from kinetic
energy, but kinetic energy follows from the actual orbits within
gravitational potential, which in turn follows from the spatial distribution
of the particles (Self-Consistency Problem)
The Self-Consistency Problem
Given a density distribution ρ(⃗x), the Poisson equation yields the
gravitational potential Φ(⃗
x). In this potential I can integrate orbits using
Newton’s equations of motion. The self-consistency problem is the problem
of finding that combination of orbits that reproduces ρ(⃗ x).
Poisson Eq.
Density Potential
aw
dl
2n
?
’s
on
wt
Orbits
Ne
Think of self-consistency problem as follows: Given Φ(⃗ x), integrate all
possible orbits Oi (⃗
x), and find the orbital weights wi such that
#
x) =
ρ(⃗ wi Oi (⃗
x). Here Oi (⃗x) is the density contributed to ⃗
x by orbit i.
Timescales for Collisions
Following fluid dynamics and plasma physics, we replace our discrete
ρN (⃗x) with a smooth, continuous ρ(⃗
x). Orbits are then integrated in the
corresponding smooth potential Φ(⃗x).
In reality, the true orbits will differ from these orbits, because the true
potential is not smooth.
density of bodies
λ 4πR3
( R )2 1
R
= 3 N 4πr 2 R
≃ r N
It takes a crossing time tcross ∼ R/v to cross the system, so that the time
scale for direct collisions is
( R )2 1
tcoll = r
t
N cross
Example: A Milky Way like galaxy has R = 10 kpc = 3.1 × 1017 km,
v ≃ 200 km s−1 , N ≃ 1010 , and r is roughly the radius of the Sun
(r = 6.9 × 105 km). This yields λ = 2 × 1013 R. In other words, a direct
collision occurs on average only once per 2000 billion crossings! The
crossing time is tcross = R/v = 5 × 107 yr, so that tcoll ≃ 1021 yr.
This is about 1011 times the age of the Universe!!!
Relaxation Time I
Now that we have seen that direct collisions are completely negligble, let’s
focus on encounters
Consider once again a system of size R consisting of N identical particles
of mass m. Consider one such particle crossing the system with velocity v .
As we will see later, a typical value for the velocity is
* *
GM GN m
v= R
= R
We want to calculate how long it takes before the cumulative effect of many
encounters has given our particle a kinetic energy Ekin ∝ v 2 in the
direction perpendicular to its original motion of the order of its its initial
kinetic energy.
Note that for a sufficiently close encounter, this may occur in a single
encounter. We will treat this case seperately, and call such an encounter a
close encounter.
Relaxation Time II
First consider a single encounter
x v
F b
θ
m
At any given time, the gravitational force in the direction perpendicular to the
direction of the particle is
2
$ ( vt )2 %−3/2
Gm2
F⊥ = G x2m+b2 cos θ = b2
1+ b
2Gm
bmin = v2
≃ R/N
(∆v⊥ )2 10lnN
v2
≃ N
N
trelax = t
10lnN cross
Summary of Time Scales
Let R be the size of the system, r the size of a particle (e.g., star), v the
typical velocity of the particles, and N the number of particles in the system.
*
3
tcross = 4πGρ̄
Dynamical time: the time required to travel halfway across the system.
*
3π π
tdyn = 16Gρ
= t
2 cross
Free-fall time: the time it takes a sphere with zero pressure to collapse to
a point.
* √
3π
tff = 32Gρ
= tdyn / 2
Orbital time: the time it takes to complete a (circular) orbit.
*
3π
torb = Gρ
= 2πtcross
NOTE: All these timescales are the same as the crossing time, except for
some pre-factors
tcross ∼
< t
ff ∼ tdyn ∼ torb
< <
Example of Time Scales
System Mass Radius Velocity N tcross trelax
M⊙ kpc km s−1 yr yr
Galaxy 1010 10 100 1010 108 > 1015
DM Halo 1012 200 200 > 1050 109 > 1060
Cluster 1014 1000 1000 103 109 ∼ 1010
Globular 104 0.01 2 104 5 × 106 5 × 108
• Dark Matter Haloes and Galaxies are collisionless
• Collisions may or may not be important in clusters of galaxies
• Relaxation is expected to have occured in (some) globular clusters
*
GM
NOTE: For a self-gravitating system, the typical velocities are v ≃ R
* *
R R3 3
For the crossing time this implies: tcross = v = GM
= 4πGρ
The DF f (⃗
x, ⃗
v , t) completely specifies a collisionless system
In the case of our smooth density distribution we define the 6 dimensional
phase-space density :
f (⃗ xd3⃗
v , t)d3 ⃗
x, ⃗ v
1
!
x, t)⟩ =
⟨Q(⃗ ρ(⃗x)
d3⃗v Q(⃗ v ) f (⃗
x, ⃗ x, ⃗
v , t)
1
! ! 3
⟨Q(t)⟩ = M
d ⃗
3
x d ⃗v Q(⃗ v ) f (⃗
x, ⃗ x, ⃗v , t)
EXAMPLES:
1
! !
RMS velocity: ⟨vi2 ⟩ = M
d ⃗
3
x d3⃗
v vi2 f (⃗
x, ⃗
v , t)
!!!
Velocity Profile: L(x, y, vz ) = f (⃗ v , t) dz dvx dvy
x, ⃗
!!!!
Surface Brightness: Σ(x, y) = f (⃗
x, ⃗v , t) dz dvx dvy dvz
The Distribution Function III
Each particle (star) follows a trajectory in the 6D phase-space (⃗ v ), which
x, ⃗
is completely governed by Newtonian Dynamics (for a collisionless system).
This trajectory projected in the 3D space ⃗
x is called the orbit of the particle.
As we will see later, the Lagrangian time-derivative of the DF, i.e. the
time-derivative of f (⃗x, ⃗
v , t) as seen when travelling through phase-space
along the particle’s trajectory, is
df
dt
=0
This simple equation is the single most important equation for collisionless
dynamics. It completely specifies the evolution of a collisionless system, and
is called the Collisionless Boltzmann Equation (C.B.E.) or Vlasov equation.
∂f
NOTE: Don’t confuse this with ∂t = 0!!!!
This is the Eulerian time-derivative as seen from a fixed phase-space
location, which is only equal to zero for a system in steady-state equilibrium.
Collisionless Dynamics in a Nutshell
!
x)
ρ(⃗ = f (⃗ v ) d3 ⃗
x, ⃗ v
∇2 Φ(⃗
x) = 4πGρ(⃗ x)
df
dt
= 0
x) is
The self-consistency problem of finding the orbits that reproduce ρ(⃗
equivalent to finding the DF f (⃗ v ) which yields ρ(⃗
x, ⃗ x).
E = 12 (ṙ 2 + r 2 θ̇ 2 ) + Φ(r)
J = r 2 θ̇
1 2 J2
2
ṙ + 2r 2
+ Φ(r) = E
Circular & Escape Velocities II
In the co-rotating frame, the equation of motion reduces to a
one-dimensional radial motion under influence of the effective potential
J2
U (r) = 2r 2
+ Φ(r). The ‘extra’ term arises due to the non-inertial nature
of the reference frame, and corresponds to the centrifugal force
& '
⃗cen = d J2 J2 vθ2
F − dr 2r 2
er =
⃗ ⃗
e
r3 r
= r r
⃗
e
⃗cen = −F
For a circular orbit we have that F ⃗grav , so that we obtain the
circular speed.
* *
GM (r)
vc (r) = r dΦ
dr
= r
Thus, rvc2 (r) measures the mass enclosed within radius r (in spherical
symmetry). Note that for a point mass vc (r) ∝ r −1/2 , which is called a
Keplerian rotation curve
Escape Speed: The speed a particle needs in order to ‘escape’ to infinity
+
vesc (r) = 2|Φ(r)|
z To observer
R r
!
∞ !
∞
Σ(R) = 2 ν(r)dz = 2 ν(r) √ r2dr
0 R r −R2
!
∞
Φ(r) = − GMr (r) − 4πG ρ(r ′ ) r ′ dr ′
r
Power-law Density Profiles I
Consider a spherical system with a simple power-law density distribution
& '−α
r
ρ(r) = ρ0 r0
!
∞
Σ(R) = 2 ρ(r) √ r2dr = ρ0 r0α B( α
2
− 1 1
,
2 2
) R 1−α
R r −R2
!r α
4πρ0 r0
M (< r) = 4π ρ(r ) r dr =
′ ′2 ′
3−α
r 3−α
(α < 3)
0
!
∞ α
4πρ0 r0
M (> r) = 4π ρ(r ) r dr =
′ ′2 ′
α−3
r 3−α
(α > 3)
r
NOTE: For α ≥ 3 the enclosed mass is infinite, while for α ≤ 3 the total
mass (r → ∞) is infinite: A pure power-law system can not exist in nature!
2
2
vesc (r) = α−2
v 2
c (r)
NOTE: For α > 3 you find that vc (r) falls off more rapidly than Keplerian.
How can this be? After all, a Keplerian RC corresponds to a delta-function
density distribution (point mass), which is the most concentrated mass
distribution possible....
Power-law Density Profiles II
The potential of a power-law density distribution is:
. α
4πGρ0 r0
r 2−α
if 2 < α < 3
Φ(r) = (α−3)(α−2)
∞ otherwise
2
2
vesc (r) = α−2
v 2
c (r)
NOTE: For α > 3 you find that vc (r) falls off more rapidly than Keplerian.
How can this be? After all, a Keplerian RC corresponds to a delta-function
density distribution (point mass), which is the most concentrated mass
distribution possible....
answer: the circular velocity is defined via the gradient of the potential. As shown above,
Φ is only defined for 2 < α < 3, and therefore so does vc
Power-law Density Profiles: Summary
It is very useful to remember the following scaling relations:
ρ(r) ∝ r −α
Σ(R) ∝ R1−α
Φ(r) ∝ r 2−α (2 < α < 3)
vc2 (r) ∝ r 2−α (2 < α < 3)
M (< r) ∝ r 3−α (α < 3)
M (> r) ∝ r 3−α (α > 3)
Double Power-law Density Profiles
As we have seen, no realistic system can have a density distribution that is
described by a single power-law. However, many often used density
distributions have a double power-law.
C
ρ(r) = r γ (1+r 1/α )(β−γ)α
#
3
x2
m =
2
a21 i
a2
a1 ≥ a 2 ≥ a 3
i
i=1
Note that we have taken the three principal axes to be aligned with our
Cartesian coordinate system (x, y, z). If a1 > a2 > a3 then the ellipsoid
is said to be triaxial
A body whose isodensity surfaces are concentric ellipsoids is called an
ellipsoidal body.
1−(a2 /a1 )2
Triaxiality Parameter: T ≡ 1−(a3 /a1 )2
Oblate Spheroid
d
oi
er
a3 /a1
ph
eS
at
ol
Pr
T=1 T=2/3 T=1/3 T=0
NOTE: the Homoeoid Theorem applies only to thin homoeoids. However, for
any homoeoid of any thickness we have:
Here
m2 R2 z2
a2
= τ +a2
+ τ +b2
0 0 0
√
with a0 any constant and b0 = 1 − e2 a0 , and
! m2
ψ(m) ≡ 0 ρ(m2 ) dm2
∂Φ
√ !R ρ(m2 ) m2 dm
vc2 (R) = R ∂R = 4πG 1− e2 √ 2 2 2
0 R −m e
Ellipsoids IV
Consider a spheroidal density distribution ρ(R, z) = ρ(m2 ) with
m2 = R2 + z 2 /q 2 , then the potential is:
!
∞
ψ(m) dτ
Φ(R, z) = −2πGq arcsine
e
ψ(∞) + πGq √
0 (τ +1) τ +q 2
+
Here e = 1 − q 2 is the eccentricity,
R2 z2
m =
2
τ +1
+ τ +q 2
and
! m2
ψ(m) ≡ 0
ρ(m′2 ) dm′2
∂Φ
!R ρ(m2 ) m2 dm
vc2 (R) = R ∂R = 4πGq √ 2 2 2
0 R −m e
Ellipsoids V
• In general one finds that vc (R) increases with larger flattening q :
Flatter systems with the same spheroidal, enclosed mass have larger
circular speeds at given R.
• Let ερ = 1 − q the ellipticity of the density distribution. One always
has that εΦ ≤ ερ . At a few characteristic radii, a reasonable rule of
thumb is that εΦ ∼ 1 ε
3 ρ
with
m2 #
3
x2
a2
= i
τ +a2
1 i
i=1
Multipole Expansion I
In order to calculate the potential of an arbitrary density distribution, it is
useful to consider a Multipole expansion.
Using separation of variables, Φ(r, θ, φ) = R(r) P (θ) Q(φ), one can
write
#
∞ #
l
Ylm (θ,φ)
Φ(r, θ, φ) = −4πG 2l+1
, l=0 m=−l -
1
!
r !
∞
dr ′
r (l+1) ρ lm (r ′
)r ′(l+2)
dr ′
+ r l
ρlm (r ′
) r ′(l−1)
0 r
Here
!π !
2π
ρlm (r) = sin θdθ Ŷlm (θ, φ)ρ(r, θ, φ)
0 0
and
*
2l+1 (l−|m|)! |m|
Ylm (θ, φ) = 4π (l+|m|)!
Pl (cos θ)eimφ
with Pl (x) the associated Legendre functions, and Ŷlm (θ, φ) the complex
conjugate of Ylm (θ, φ)
Multipole Expansion II
Monopole l = 0 1 term
Dipole l = 1 3 terms
Quadrupole l = 2 5 terms
Octopole l = 3 7 terms
Hexadecapole l = 4 9 terms
The monopole term describes the potential of a spherical system with
√ √
ρ(r, θ, φ) = ρ(r). Since Y00 (θ, φ) = 1/ 4π and ρ00 = 4πρ(r), the
(l = m = 0)-term of the multipole expansion is simply the equation for the
potential of a spherical system:
, -
1
!r !
∞
Φ(r) = −4πG r
ρ(r ′ ) r ′2 dr ′ + ρ(r ′ ) r ′ dr ′
0 r
! x′ ) 3 ′
ρ(⃗ !
∞ !
2π
dφ′
Φ(⃗
x) = −G |⃗
x−⃗x′ |
d ⃗
x = −G Σ(R ) R dR
′ ′ ′
|⃗
x−⃗x′ |
0 0
2G
!
∞ √
Φ(R, z) = −√R
K(k) k Σ(R ) ′
R′ dR′
0
∂φ G
!
∞ √
R ∂R (R, z) = √
R
dR k Σ(R ) R′ ×
′ ′
$ 0 & 2 '& ′ 2
' %
K(k) − 14 1−k
k
2
R
R
− R
R′
+ z
RR′
E(k)
with K(k) and E(k) so called complete elliptic integrals. In principle the
evaluation at z = 0 is complicated (contains integrable singularity); in
practice it often suffices to approximate the above at small z
Disk Potentials via Bessel Functions
The potential of a thin disk with surface density Σ(R) can be written as
!
∞
Φ(R, z) = S(k) J0 (kR) e−k|z| dk
0
with
!
∞
S(k) = −2πG J0 (kR) Σ(R) R dR
0
( ∂Φ ) !
∞
vc2 (R) = R ∂R z=0
= −R S(k) J1 (kR) k dk
0
This method is simple, and most of the time well behaved. For an exponential
disk with Σ(R) = Σ0 e−R/Rd one finds
⃗
er = − cos θ⃗
ex + sin θ⃗
ey
⃗
eθ = − sin θ⃗
ex + cos θ⃗
ey
d⃗
r d
dt
= dt
(r cos θ⃗ex + r sin θ⃗ey )
= ex − r θ̇ sin θ⃗
ṙ cos θ⃗ ex + ṙ sin θ⃗
ey + r θ̇ cos θ⃗
ey
= er + r θ̇⃗
ṙ⃗ eθ
and similarly one obtains that
d2 ⃗
r
dt 2 = (r̈ − r θ̇ 2 )⃗
er + (2ṙ θ̇ + r θ̈)⃗
eθ
Orbits in Central Force Fields II
We thus obtain the following set of equations of motions:
r̈ − r θ̇ 2 = F (r) = − dΦ
dr
2ṙθ̇ + r θ̈ = 0
Multiplying the second of these equations with r yields, after integration, that
d
dt
(r 2
θ̇) = 0. This simply expresses the conservation of the orbit’s angular
momentum L = r 2 θ̇ , i.e., the equations of motion can be written as
r̈ − r θ̇ 2 = − dΦ
dr
r 2 θ̇ = L = constant
ẍ = Fx = − ∂Φ
∂x
∂Φ
ÿ = Fy = − ∂y
Orbits in Central Force Fields III
As shown before, one can use the second equation of motion (in polar
coordinates) to eliminate θ̇ in the first, which yields the radial energy
equation
1 2 J2
2
ṙ + 2r 2
+ Φ(r) = E
which can be rewritten as
*
dr J2
dt
= ± 2[E − Φ(r)] − 2r 2
where the ± sign is required because r can both increase and decrease.
Solving for the turn-around points, where dr/dt = 0, yields
1 2[E−Φ(r)]
r2
= −J 2
Note that these stationary points are not necessarily minima. They may also
be maxima or sadle points.
y x(1)
δy y(x)
x(0)
x
Consider a small variation δy(x), which vanishes at the endpoints of the
integration interval: δy(x0 ) = δy(x1 ) = 0
Calculus of Variations II
Using that
∂f ∂f
δf = ∂y
δy + ∂ ẏ
δ ẏ
d
with δ ẏ = dx δy(x), the stationary values obey
!1 $
x
∂f ∂f d
%
δI = ∂y
δy + ∂ ẏ dx
δy dx = 0
x0
which are the Lagrange equations (one for each degree of freedom), which
represent the equations of motion according to Hamilton’s principle. Note
that they apply to any set of generalized coordinates
In addition to the generalized coordinates we also define the generalized
momenta pi (also called conjugate momenta) and the generalized forces Fi :
∂L ∂L
pi ≡ ∂ q̇i
Fi ≡ ∂qi
∂L d
( ∂L ) ∂Φ d
∂r
− dt ∂ ṙ
=0 ⇒ r θ̇ 2 − ∂r
− dt
(ṙ) =0 ⇒ r̈ − r θ̇ 2 = − ∂Φ
∂r
& '
∂L d ∂L d
∂θ
− dt ∂ θ̇
=0 ⇒ − dt (r 2 θ̇) = 0 ⇒ r 2 θ̇ = L = cst
Note that the Lagrangian formulation allows you to write down the equations
of motion much faster than using Newton’s second law!
The Hamiltonian Formulation I
The Hamiltonian H(qi , pi ) is related to the Lagrangian L(qi , q̇i ) via a
Legendre Transformation
In general, a Legendre Transformation is a transformation of a function
∂f ∂g
f (x, y) to g(u, y), where u = ∂x
and ∂u = x
g(u, y) = f − u x
NOTE: You might be familiar with Legendre Transformations from
Thermodynamics where they are used to compute different thermodynamic
potentials from the internal energy U = U (S, V ), such as
enthalpy: H = H(S, p) = U + p V
Helmholtz free energy: F = F (T, V ) = U − T S
#
n
H(⃗ ⃗, t) =
q, p pi q̇i (⃗ ⃗) − L(⃗
q, p ⃗˙(⃗
q, q ⃗), t)
q, p
i=1
∂H #
n
∂ q̇i #
n
∂L ∂ q̇i
∂pj
= q˙j + pi ∂p j
− ∂ q̇i ∂pj
i=1 i=1
The second and third terms vanish since pi = ∂L/∂ q̇i , so that we obtain
that ∂H/∂pj = q̇j . Similarly we obtain that
∂H #
n
∂ q̇i ∂L #
n
∂L ∂ q̇i
∂qj
= pi ∂q j
− ∂qj
− ∂ q̇i ∂qj
i=1 i=1
Here the first and third terms cancel, and since the Lagrange equations tell
us that ∂L/∂qj = ṗj , we obtain that ∂H/∂qj = −ṗj .
In the case of motion in a fixed potential, the Hamiltonian is equal to the total
energy, i.e., H = E
DEMONSTRATION: for a time-independent potential Φ = Φ(⃗
x) the
Lagrangian is equal to L = 1 ˙ 2 − Φ(⃗
x
⃗ x). Since p x˙ = ⃗
⃗ = ∂L/∂ ⃗ x˙ we
2
have that H = x⃗˙ · ⃗
x˙ − 12 ⃗
x˙ 2 + Φ(⃗ x˙ 2 + Φ(⃗
x) = 12 ⃗ x) = E
∂H p2 ∂Φ ∂H
∂r
= − r3θ + ∂r
= −ṗr ∂θ
= 0 = −ṗθ
∂H ∂H pθ
∂pr
= pr = ṙ ∂pθ
= r2
= θ̇
which reduce to
r̈ − r θ̇ 2 = − ∂Φ
∂r
pθ = r 2 θ̇ = cst
Let f = f (⃗
q, p
⃗, t) then
∂f ∂f ∂f
df = ∂qi
dqi + ∂pi
dpi + ∂t
dt
where we have used the summation convention. This differential of f ,
combined with Hamilton’s equations, allows us to write
df ∂f ∂f ∂H ∂f ∂H
dt
= ∂t
+ ∂qi ∂pi
− ∂pi ∂qi
which reduces to
df ∂f
dt
= ∂t
+ [f, H]
This is often called Poisson’s equation of motion. It shows that the
time-evolution of any dynamical variable is governed by the Hamiltonian
through the Poisson bracket of the variable with the Hamiltonian.
Poisson Brackets II
Using the Poisson brackets we can write
dH ∂H ∂H ∂L
dt
= ∂t
+ [H, H] = ∂t
= ∂t
With the help of the Poisson brackets we can write Hamilton’s equations in a
more compact form
ẇα = [wα , H]
which makes it explicit that they hold for any canonical coordinate system.
Note that the generalized coordinates and momenta (⃗ ⃗) form a canonical
q, p
coordinate system, since they obey the canonical commutation relations
Consider a transformation L → L′ = L + dF
dt
where F = F (⃗
q , t)
Under this transformation the action integral becomes
!t1 !t1 !t1 dF
I′ = L′ dt = Ldt + dt
dt = I + F (t1 ) − F (t0 )
t0 t0 t0
L(⃗q, p
⃗, t) = p
⃗·q⃗˙ − H(⃗q, p
⃗, t)
⃗ P
L′ (Q, ⃗ , t) = P
⃗ ·Q ⃗˙ − H′ (Q,
⃗ P ⃗ , t)
In order to transform (⃗ ⃗ P
⃗) → (Q,
q, p ⃗ ) one proceeds as follows:
• Find a function F (⃗ ⃗ so that pi = ∂F /∂qi . This yields Qi (qj , pj )
q , Q)
• Substitute Qi (qj , pj ) in Pi = ∂F /∂Qi to obtain Pi (qj , pj )
∂S ∂S ∂S ∂S
pi = ∂qi
Qi = ∂Pi ∂Qi
=0 H′ = H + ∂t
Note that the third of these rules implies that S = S(⃗ ⃗ , t) as intended.
q, P
Canonical Transformations V
The potential strength of canonical transformations becomes apparent from
the following: Suppose one can find a canonical transformation
(⃗ ⃗ P
⃗) → (Q,
q, p ⃗ ) such that H(⃗ ⃗) → H′ (P
q, p ⃗ ), i.e., such that the new
Hamiltonian does not explicitely depend on the new coordinates Qi .
Hamilton’s equation of motion then become
∂H′ ∂H′
∂Qi
= −Ṗi = 0 ∂Pi
= −Q̇i
Thus, we have that all the conjugate momenta Pi are constant, and this in
turn implies that none of Q̇i can depend on time either. The equations of
motion in our new, canonical coordinate system are therefore extremely
simple:
Qi (t) = Ωi t + ki Pi = constant
Here Ωi = ∂H′ /∂Pi are constants and ki are integration constants. Any
generalized coordinate whose conjugate momentum is a conserved quantity,
is called a cyclic variable. The question that remains now is how to find the
generator S(q, P, t) of the canonical transformation (⃗ ⃗ P
⃗) → (Q,
q, p ⃗)
which leads to only cyclic variables Qi .
The Hamilton-Jacobi Equation I
Recall the transformation rules for the generator S(⃗ ⃗ , t):
q, P
∂S ∂S ∂S
pi = ∂qi
Qi = ∂Pi
H′ = H + ∂t
Pi (t) = P
& i (0) '
′
∂H
Qi (t) = ∂Pi
t + ki
One might think at this point, that one has to take Pi = Ii . However, this
choice is not unique. Consider an integrable Hamiltonian with n = 2
degrees of freedom and let I1 and I2 be two isolating integrals of motion in
involution. Now define Ia = 1 2
(I 1 + I 2 ) and I b = 1
2
(I1 − I2 ), then it is
straightforward to proof that [Ia , Ib ] = 0, and thus that (Ia , Ib ) is also a
set of isolating integrals of motion in involution. In fact, one can construct
infinitelly many sets of isolating integral of motion in involution. Which one
should we choose, and in particular, which one yields the most meaningful
description of the invariant tori?
Answer: The Action-Angle variables
Action-Angle Variables I
ω2
ω2
Γ(t) .
A2
A1 .
γ1
ω1 γ2
Let’s be guided by the idea of our invariant tori. The figure illustrates a
2D-torus (in 4D-phase space), with a trajectory Γ(t) on its surface. One can
specify a location on this torus by the two position angles ω1 and ω2 . The
torus itself is characterized by the areas of the two (hatched) cross sections
labelled A1 and A2 . The action variables J1 and J2 are intimitally related to
A1 and A2 , which clearly are two integrals of motion.
The action variables are defined by:
1
"
Ji = 2π γi
⃗ · d⃗
p q
∂H′
Ωi ≡ ∂Ji
This implies that the motion along each of the n degrees of freedom, qi , is
periodic in time, and this can occur in two ways:
• Libration: motion between two states of vanishing kinetic energy
• Rotation: motion for which the kinetic energy never vanishes
The Pendulum
To get insight into libration and rotation consider a pendulum, which is a
integrable Hamiltonian system with one degree of freedom, the angle q ..
r
s l=libration
l
r=rotation
−π +π q s=separatrix
• They are are the only conjugate momenta that enjoy the property of
adiabatic invariance (to be discussed later)
• The angle-variables are the natural coordinates to label points on
invariant tori.
• They are ideally suited for perturbation analysis, which is used to
investigate near-integrable systems (see below)
• They are ideally suited to study the (in)-stability of a Hamiltonian system
Example: Central Force Field
As an example, to get familiar with action-angle variables, let’s consider once
again motion in a central force field.
As we have seen before, the Hamiltonian is
2
1 2 1 pθ
H= p
2 r
+ 2 r2
+ Φ(r)
where pr = ṙ and pθ = r 2 θ̇ = L.
In our planar description, we have two integrals of motion, namely energy
I1 = E = H and angular momentum I2 = L = pθ .
These are classical integrals of motion, as they are associated with
symmetries. Consequently, they are also isolating.
Let’s start by checking whether they are in involution
$ % $ %
∂I1 ∂I2 ∂I1 ∂I2 ∂I1 ∂I2 ∂I1 ∂I2
[I1 , I2 ] = ∂r ∂pr
− ∂pr ∂r
+ ∂θ ∂pθ
− ∂pθ ∂θ
1
!
2π
Jθ = 2π
I2 dθ = I2
0
Once we make a choise for the potential Φ(r) then Jr can be solved as
function of I1 and Jθ . Since I1 = H, this in turn allows us to write the
Hamiltonian as function of the actions: H(Jr , Jθ ).
Example: Central Force Field
As an example, let’s consider a potential of the form
β
Φ(r) = − α
r
− r2
with α and β two constants. Substituting this in the above, one finds:
& '1/2 +
1
Jr = α 2|I1 |
− Jθ2 − 2β
Inverting this for I1 = H yields
& + '−2
α2
H(Jr , Jθ ) = 2
Jr + Jθ2 − 2β
& + '−3
∂H Jθ
Ωθ = ∂J = −α 2
J r + J 2
θ − 2β √ 2
θ Jθ −2β
Example: Central Force Field
The ratio of these frequencies is
& '1/2
Ωr 2β
Ωθ
= 1− Jθ2
#
∞
x(t) =
⃗ X̃lmn exp [i(lΩ1 + mΩ2 + nΩ3 )t]
l,m,n=−∞
2π
The azimuthal period can thus be written as Tθ = ∆θ Tr . From this we see
that the orbit will be closed (resonant) if
Tθ 2π n
Tr
= ∆θ
= m
with n and m integers
Orbits in Spherical Potentials III
In general ∆θ/2π will not be a rational number. ◃ orbit not closed.
Instead, a typical orbit resembles a rosette and eventually passes through
every point in between the annuli bounded by the apo- and pericenter.
However, there are two special potentials for which all orbits are closed:
1 2 2
• Spherical Harmonic Oscillator Potential: Φ(r) = 2
Ω r
−Orbits are ellipses centered on center of attraction
− Tθ : Tr = 2 : 1
• Kepler Potential: Φ(r) = − GrM
−Orbits are ellipses with attracting center at one focal point
− Tθ : Tr = 1 : 1
Since galaxies are less centrally concentrated than point masses and more
centrally concentrated than homogeneous spheres, a typical star in a
spherical galaxy changes its angular coordinate by ∆θ during a radial
libration, where π < ∆θ < 2π
QUESTION: What is the resonance that corresponds to a circular orbit?
Orbits in Spherical Potentials III
In general ∆θ/2π will not be a rational number. ◃ orbit not closed.
Instead, a typical orbit resembles a rosette and eventually passes through
every point in between the annuli bounded by the apo- and pericenter.
However, there are two special potentials for which all orbits are closed:
1 2 2
• Spherical Harmonic Oscillator Potential: Φ(r) = 2
Ω r
−Orbits are ellipses centered on center of attraction
− Tθ : Tr = 2 : 1
• Kepler Potential: Φ(r) = − GrM
−Orbits are ellipses with attracting center at one focal point
− Tθ : Tr = 1 : 1
Since galaxies are less centrally concentrated than point masses and more
centrally concentrated than homogeneous spheres, a typical star in a
spherical galaxy changes its angular coordinate by ∆θ during a radial
libration, where π < ∆θ < 2π
QUESTION: What is the resonance that corresponds to a circular orbit?
ANSWER: Although closed, a circular orbit is NOT a resonance orbit
Orbits in Spherical Potentials IV
This reveals two major orbit families: The first is the family of box orbits,
which have no net sense of cirulation about the center, and which, in the
course of time, will pass arbitrarily close to the center of the potential
Note that the orbit completes a filled curve in the SOS, indicating that it
admits a second isolating integral of motion, I2 . This is not a classical
integral, as it is not associated with a symmetry of the system. We can, in
general, not express I2 in the phase-space coordinates.
Orbits in Planar Potentials V
The second main family is that of loop orbits. These do have a net sense of
circulation, and always maintain a minimum distance from the center of the
potential. Any star launched from R ≫ Rc in the tangential direction with a
speed of the order of v0 will follow such a loop orbit.
Once again, the fact that the orbit completes a filled curve in the SOS,
indicates that it admits a second, (non-classical) isolating integral of motion.
Since we don’t know what this integral is (in terms of the phase-space
coordinates) it is simply called I2 .
Orbits in Planar Potentials VI
In ΦL (x, y) there are two main orbit families: loop orbits and box orbits
Each family of orbits is closely associated with a corresponding closed orbit.
This closed orbit is called the parent of the orbit family. All closed orbits that
are parents to families are said to be stable, in that members of their family
that are initially close to them remain close to them at all times.
Unstable, closed orbits also exist, but they don’t parent an orbit family.
Modulo the irregular orbits, one can obtain a good consensus of the orbits in
a system by finding the stable periodic orbits at each energy.
For our planar, logarithmic potential, the parent of the loop orbits is the
closed loop orbit (which intersects SOS at a single point on ẋ = 0 axis).
The parent of the box orbits is the closed long-axis orbit. This is the orbit that
is confined to the x-axis with y = ẏ = 0. In the SOS (x vs. px ) this is the
orbit associated with the boundary curve which corresponds to
1 2
2
ẋ + ΦL (x, 0) = E i.e. y = ẏ = 0
Finally, there is the closed short-axis orbit. This is the orbit that is confined to
the y -axis x = ẋ = 0. In the SOS this is the orbit associated with a single
dot exactly at the center of the SOS. Clearly, this orbit is unstable: rather than
parenting an orbit family, it marks the transition between loop and box orbits.
Orbits in Planar Potentials VII
If we set the core radius Rc = 0 in our planar, logarithmic potential, we
remove the homogeneous core and introduce a singular R−2 cusp.
This singular logarithmic potential admits a number of new orbit families,
which are associated with resonant parents, which we ID by the frequency
ratio ωx : ωy .
The family associated with the 2 : 1 resonance are called the banana orbits
The family associated with the 3 : 2 resonance are called the fish orbits
The family associated with the 4 : 3 resonance are called the pretzel orbits
All these families together are called boxlet orbits.
The (singular) logarithmic potential is somewhat special in that it shows a
surprisingly regular orbit structure. Virtually the entire phase-space admits
two isolating integrals of motion in involution (E and I2 ). Clearly, the
logarithmic potential must be very near-integrable.
However, upon introducing a massive black hole in the center of the
logarithmic potential, many of the box-orbits become stochastic. One says
that the BH destroys the box-orbits. Recall that each box orbit comes
arbitrarily close to the BH.
Orbits in Singular Logarithmic Potentials
The closed boxlets comes in two kinds: centrophobic, which avoid the center
and are stable, and centrophilic, which go through the center and are
unstable. The centrophilic versions of the banana and fish orbits are called
the antibanana and antifish orbits, etc. Because they are unstable, they don’t
parent any families.
Orbits in Planar Potentials: Summary
If Φ(x, y) has a homogeneous core, at sufficiently small E potential is
indistinguishable from that of 2D harmonic oscillator
• x-axial and y -axial closed orbits are stable
• they parent a family of box orbits (filling a rectangular box)
If central BH is present, discrete scattering events can turn box orbits into
stochastic orbits
Orbits in Axisymmetric Potentials I
Axisymmetric potentials (oblate or prolate) are far more realistic examples to
consider in astronomy. Elliptical galaxies might well be spheroidal (but could
also be ellipsoidal), while disk galaxies almost certainly are axisymmetric
(though highly flattened).
For axisymmetric systems the coordinate system of choise are the
cylindrical coordinates (R, φ, z), and Φ = Φ(R, z).
Solving Newton’s equation of motion in cylindrical coordinates yields:
∂Φ
R̈ −
& R φ̇ 2
' = − ∂R
d
dt
R2 φ̇ = 0
z̈ = − ∂Φ
∂z
R̈ = − ∂Φeff
∂R
z̈ = − ∂Φeff
∂z
L2
with Φeff (R, z) = Φ(R, z) + z
2R2
the effective potential. The
L2z /R2 -term serves as a centrifugal barrier, only allowing orbits with
Lz = 0 near the symmetry-axis.
This allows us to reduce the 3D motion to 2D motion in Meridional Plane
(R, z), which rotates non-uniformly around the symmetry axis according to
φ̇ = Lz /R2 .
In addition to simplifying the problem, it also allows the use of
surfaces-of-section to investigate the orbital properties.
For the energy we can write
$ % & '
1 1
E= 2
Ṙ + (Rφ̇) + ż + Φ =
2 2 2
2
Ṙ + ż
2 2
+ Φeff
so that the orbit is restricted to the area in the meridional plane satisfying
E ≥ Φeff . The curve bounding this area is called the zero-velocity curve
(ZVC) (since for a point on it ⃗v = 0).
Epicycle Approximation I
L2
We have defined the effective potential Φeff = Φ + z
2R2
. This has a
minimum at (R, z) = (Rg , 0), where
∂Φeff ∂Φ L2
∂R
= ∂R
− z
R3
=0
The radius R = Rg corresponds to the radius of a circular orbit with energy
1 2 L2
E = Φ(Rg , 0) + v
2 c
= Φ(Rg , 0) + z
2R2
= Φeff .
g
where
( ∂Φeff ) & 2
' & 2
'
∂ Φeff ∂ Φeff
Φx = ∂x (Rg ,0)
Φxx = ∂x2
Φxy = ∂x∂y
(Rg ,0) (Rg ,0)
κ2 ≡ Φxx ν 2 ≡ Φyy
we thus have that, in the epicycle approximation,
ẍ = −κ2 x ÿ = −ν 2 y
Thus, the x- and y -motions are simple harmonic oscillations with the
epicycle frequency κ and the vertical frequency ν .
In addition, we have the circular frequency
* ( ∂Φ )
vc (R) 1 Lz
Ω(R) = R
= R ∂R (R,0)
= R2
R(t) = A cos(κt + a) + Rg
z(t) = B cos(νt + b)
2Ω A
φ(t) = Ωg t + φ0 − κRgg sin(κt + a)
Note that there are three frequencies (Ω, κ, ν) and also three isolating
integrals of motion in involution: (ER , Ez , Lz ) with ER = 1
2
(ẋ2
+ κ x )
2 2
and Ez = 1 2
(ż 2
+ ν z )
2 2
◃ all orbits are regular.
The motion in (R, φ) can be described as retrogate motion on an ellipse (the
epicycle), whose guiding center (or epicenter) is in prograde motion around
the center of the system.
Epicycle Approximation IV
An important question is: “When is the epicycle approximation valid?”
First consider the z -motion: The equation of motion, z̈ = −ν 2 z implies a
constant density in the z -direction. Hence, the epicycle approximation is
valid as long as ρ(z) is roughly constant. This is only approximately true
very close to equatorial plane. In general, however, epicycle approx. is poor
for motion in z -direction.
In the radial direction, we have to realize that the Taylor expansion is only
accurate sufficiently close to R = Rg . Hence, the epicycle approximation is
only valid for small librations around the guiding center; i.e., for orbits with
an angular momentum that is close to that of the corresponding circular
orbit.
Epicyclic Motion
Orbits in Axisymmetric Potentials III
thin tube
parent ZVC
The orbit shown on the previous page is a so-called short-axis tube orbit.
This is the main orbit family in oblate potentials, and is associated with
(parented by) the circular orbits in equatorial plane.
Orbits (c), (e) and (f) above are from the same orbit family. Orbits (a), (b) and
(d) are special in that Lz = 0.
Orbits in Axisymmetric Potentials V
Because of the centrifugal barrier only orbits with Lz = 0 will be able to
come arbitrarily close to the center.
However, not all orbits with Lz = 0 are box orbits. There is another family of
zero-angular momentum orbits, namely the two-dimensional loop orbits (e.g.,
orbit (d) on previous page). Their meriodional plane is stationary (i.e.,
φ̇ = 0) and their angular momentum vector is perpendicular to the z -axis.
Hence, I3 = L; note that [L, Lz ] = 0.
In triaxial Stäckel potentials all three integrals (E, I2 , I3 ) are analytical, and
the orbits are confined by contours of constant ellipsoidal coordinates (see
next page).
Although Stäckel are a very special class, the fact that they are separable
makes them ideally suited to get insight. Most triaxial potentials that do not
have a Stäckel form have orbital structures that are similar to that of Stäckel
potentials.
Stäckel Potentials III
box orbits
B = box orbits
S = short-axis tubes
I = inner long-axis tubes
O = outer long-axis tubes
Libration versus Rotation
Three-dimensional orbits
All tube orbits are build up from 2 librations and 1 rotation.
All box orbits are build up from 3 librations.
All boxlets are build up from 2 librations and 1 rotation.
Two-dimensional orbits
All loop orbits are build up from 1 libration and 1 rotation.
All box orbits are build up from 2 librations.
All boxlets are build up from 2 librations.
Rotating Potentials I
The figures of non-axisymmetric potentials may rotate with respect to inertial
space.
The example of interest for astronomy are barred potentials, which are
rotating with a certain pattern speed.
⃗ p = Ωp⃗
We express the pattern speed in angular velocity Ω ez
In what follows we denote by d⃗
a/dt the rate of change of a vector ⃗
a as
measured by an inertial observer, and by ⃗ a˙ the rate of change as measured
by an observer corotating with the figure.
It is straightforward to show that
d⃗
a
dt
a˙ + Ω
=⃗ ⃗p ×⃗
a
r˙ 2 + Φ(⃗
EJ ≡ 12 ⃗ ⃗p ×⃗
r ) − 12 |Ω r |2
Rotating Potentials III
The importance of EJ becomes apparent from the following:
& '
dEJ
dt
r˙ dt
= ⃗ d
r˙ +
⃗ dΦ
dt
− ( ⃗p ×⃗
Ω r ) · d ⃗
dt
(Ωp × ⃗ r)
$ %
r˙ ⃗
= ⃗ r¨ + (Ω⃗p ×⃗ r˙ ) + ∇Φ⃗ ·⃗ r˙ − (Ω ⃗p ×⃗ r˙ )
⃗p ×⃗
r ) · (Ω
r˙ · ⃗
= ⃗ r¨ + ⃗
r˙ · ∇Φ
⃗ − (Ω ⃗p ×⃗
r ) · (Ω⃗p ×⃗ r˙ )
⃗ · (A
Here we have used that A ⃗ × B)
⃗ = 0 and that dΩ
⃗ p /dt = 0. If we
r˙ we obtain that
multiply the equation of motion with ⃗
$ %
r˙ · ⃗
⃗ r¨ + ⃗
r˙ · ∇Φ r˙ · (Ω
⃗ + 2⃗ ⃗p ×⃗ r˙ ) + ⃗
r˙ · Ω⃗ p × (Ω
⃗p ×⃗
r) = 0
r˙ · ⃗
⇔ ⃗ r¨ + ⃗
r˙ · ∇Φ
⃗ + (Ω
⃗p ×⃗ r˙ × Ω
r ) · (⃗ ⃗ p) = 0
⃗ · (B
Where we have used that A ⃗ × C)
⃗ =C
⃗ · (A
⃗ × B)
⃗ . Since
⃗×B
A ⃗ = −B
⃗ ×A
⃗ we have that
dEJ
dt
=0
The Jacobi Integral is a conserved quantity, i.e. an integral of motion.
Rotating Potentials IV
For comparison, since Φ = Φ(t) the Hamiltonian is explictely
time-dependent; consequently, the total energy E is not a conserved
quantity (i.e., is not an integral of motion).
The angular momentum is given by
⃗ =⃗
L r× d⃗
r
dt
=⃗ r˙ + ⃗
r×⃗ ⃗p ×⃗
r × (Ω r)
This allows us to write
$ %
⃗p ·L
Ω ⃗ = Ω
⃗ p · (⃗
r×⃗r˙ ) + Ω
⃗p · ⃗
r × (Ω⃗p ×⃗
r)
r˙ · (Ω
= ⃗ ⃗p ×⃗ ⃗p ×⃗
r ) + |Ω r |2
from which we obtain that
⃗p ·L
EJ = E − Ω ⃗
Note that EJ is the sum of 1 ˙ 2 + Φ, which would be the energy if the frame
⃗
r
2
were not rotating, and the quantity − 1 | ⃗p ×⃗
Ω r | 2
= − 1 2 2
Ω R , which can
2 2 p
be thought of as the potential energy corresponding to the centrifugal force.
Rotating Potentials V
If we now define the effective potential
Φeff = Φ − 12 Ω2p R2
r¨ = −∇Φ
⃗ ⃗ eff − 2(Ω r˙ )
⃗b ×⃗
L5
L2 L3 L1
L4
Ωp
Ω+κ/2
Ω
Ω−κ/2
IILR OILR CR OLR Radius
Lindblad Resonances play important role for orbits in barred potentials.
Lindblad Resonances IV
As an example, we discus the orbital families in a planar, rotating,
logarithmic potential
⃗ = (⃗
w v ) = (w1 , w2 , ..., w6 )
x, ⃗
The velocity of the flow in phase-space is then
⃗˙ = (⃗
w x˙ , ⃗
v˙ ) = (⃗ ⃗
v , −∇Φ)
The mass flowing out of V through an area element d2 S per unit time is
given by ρ⃗ ⃗ with d2 S
v · d2 S ⃗ an outward pointing vector normal to the
surface S . Thus
!
dM
=− ⃗
v · d2 S
ρ⃗
dt S
so that we obtain ! !
∂ρ
d x⃗ + S ρ⃗
3 ⃗=0
v · d2 S
V ∂t
! !
⃗ ⃗
Using the divergence theorem V ∇ · F d x 3
⃗ = SF ⃗ · d2 S
⃗ we obtain
! $ ∂ρ %
+ ∇⃗ · (ρ⃗v ) d3
⃗ =0
x
V ∂t
Since this must hold for any volume V we obtain the continuity equation:
∂ρ ⃗ · (ρ⃗
+∇ v) = ∂ρ ⃗ ·⃗
+ ρ∇ v +⃗ ⃗ =0
v · ∇ρ
∂t ∂t
The Collisionless Boltzmann Equation I
Similarly, for our 6D flow in phase-space the continuity equation is given by
∂f
∂t
+∇ ⃗˙ = 0
⃗ · (f w)
or in vector notation
∂f
+⃗ ⃗ − ∇Φ
v · ∇f ⃗ · ∂f
=0
∂t ∂⃗
v
The Collisionless Boltzmann Equation II
Note: Since f = f (⃗
x, ⃗
v , t) we have that
∂f ∂f ∂f
df = ∂t
dt + ∂xi
dxi + ∂vi
dvi
and thus
df ∂f ∂f ∂Φ ∂f
dt
= ∂t
+ v
∂xi i
− ∂xi ∂vi
Using this we can write the CBE in its compact form:
df
dt
=0
v˙ = −∇Φ
In the presence of collisions it is no longer true that ⃗ ⃗ , and the
CBE no longer holds. Rather, collisions result in an additional collision term:
df
dt
= Γ(t)
with w(⃗ v ) some (properly normalized) kernel which rapidly falls to zero
x, ⃗
for |⃗
x| > ϵx and |⃗
v | > ϵv .
NOTE: the fine-grained DF does satisfy the CBE.
NOTE: the coarse-grained DF does not satisfy the CBE.
Moment Equations I
Although the CBE looks very simple (df /dt = 0), solving it for the DF is
virtually impossible. It is more practical to consider moment equations.
The resulting Stellar-Hydrodynamics Equations are obtained by multiplying
the CBE by powers of velocity and then integrating over all of velocity space.
Consider moment equations related to vil vjm vk
n
where the indices (i, j, k)
refer to one of the three generalized coordinates, and (l, m, n) are integers.
Recall that
! !
ρ= fd ⃗
3
v ρ⟨vil vjm vkn ⟩ = vil vjm vkn f d3⃗
v
The (l + m + n)th moment equation of the CBE is
! ! !
vil vjm vkn ∂f∂t
d ⃗
v+
3 ∂f
vil vjm vkn va ∂x d3 ∂Φ ∂f
vil vjm vkn ∂x
v−
⃗ a ∂va
d3
v=0
⃗
∂
! ∂
! l m n a
∂Φ
! l m n ∂f 3
⇔ ∂t
vil vjm vkn f d3⃗ v+ ∂xi
vi vj vk va f d3⃗
− ∂xa vi vj vk ∂va d ⃗
v v=0
$ % $ % ! l m n ∂f 3
∂ ∂ ∂Φ
⇔ ∂t
ρ⟨vil vjm vkn ⟩ + ∂x i
ρ⟨vi vj vk va ⟩ − ∂xa vi vj vk ∂va d ⃗
l m n
v=0
∂
1st term: integration range doesn’t depend on t so ∂t
may be taken outside
∂
2nd term: ∂xi
doesn’t depend on vi , so derivative may be taken outside.
∂Φ
3rd term: ∂xi
doesn’t depend on vi , so may be taken outside.
Moment Equations II
Let’s consider the zeroth moment: l = m = n = 0
∂ρ ∂(ρ⟨vi ⟩) ∂Φ
! ∂f 3
∂t
+ ∂xi
− ∂xi ∂vi
d ⃗
v =0
Using the divergence theorem we can write
! !
∂f 3
d ⃗
v = ⃗=0
f d2 S
∂vi
where the last equality follows from the fact that f → 0 if |v| → ∞. The
zeroth moment of the CBE therefore reduces to
∂ρ ∂(ρ⟨vi ⟩)
∂t
+ ∂xi
=0
Note that this is the continuity equation, identical to that of fluid dynamics.
Just a short remark regarding notation:
!
⟨vi ⟩ = vi f d3⃗
v
is used as short-hand for
!
⟨vi (⃗
x)⟩ = vi (⃗
x)f (⃗ v )d3⃗
x, ⃗ v
Thus, ⟨vi ⟩ is a local expectation value. For brevity we do not explicitely write
the ⃗
x-dependence.
Moment Equations III
Next we consider the first-order moment equations (l, m, n) = (1, 0, 0) or
(0, 1, 0) or(0, 0, 1)
! ! !
vj ∂f
∂t
d3
⃗
v + ∂f 3
vj vi ∂x i
d ⃗
v − ∂Φ ∂f 3
vj ∂x i ∂vi
d ⃗
v=0
∂(ρ⟨vj ⟩) ∂(ρ⟨vi vj ⟩) ∂Φ
! ∂f 3
⇔ ∂t
+ ∂xi
− ∂xi
vj ∂v i
d ⃗v=0
Using integration by parts we write
! ∂f 3
! ∂(vj f ) 3 ! ∂vj
vj ∂v d ⃗
v = ∂vi
d ⃗v − ∂vi f d3⃗
v
i ! !
= vj f d2 S − δij f d3⃗v
= −δij ρ
so that we obtain
∂(ρ⟨vj ⟩) ∂(ρ⟨vi vj ⟩) ∂Φ
∂t
+ ∂xi
+ ρ ∂x j
=0
These are called the momentum equations. Note that this represents a set of
three equations (for j = 1, 2, 3), and that a summation over i is implied.
The Jeans Equations I
We can obtain the so-called Jeans Equations by subtracting ⟨vj ⟩ times the
continuity equation from the momentum equations:
First we write ⟨vj ⟩ times the continuity equation:
∂(ρ⟨vi ⟩)
⟨vj ⟩ ∂ρ
∂t
+ ⟨v j ⟩ ∂xi
=0
∂(ρ⟨vj ⟩) ∂⟨vj ⟩ ∂(ρ⟨vi ⟩⟨vj ⟩) ∂⟨vj ⟩
⇔ ∂t
−ρ ∂t
+ ∂xi
− ρ⟨vi ⟩ ∂xi
=0
Subtracting this from the momentum equations yields
∂(ρ⟨vi vj ⟩) ∂Φ ∂⟨vj ⟩ ∂(ρ⟨vi ⟩⟨vj ⟩) ∂⟨vj ⟩
∂xi
+ ρ ∂x j
+ρ ∂t
− ∂xi
+ ρ⟨vi ⟩ ∂xi
=0
2
If we define σij = ⟨vi vj ⟩ − ⟨vi ⟩ · ⟨vj ⟩ then we obtain
2
∂⟨v ⟩ ∂⟨v ⟩ ∂Φ ∂(ρσij )
ρ ∂tj + ρ⟨vi ⟩ ∂xji = −ρ ∂x j
− ∂xi
These are called the Jeans Equations. Once again, this represents a set of
three equations (for j = 1, 2, 3), and a summation over i is implied.
The Jeans Equations II
We can derive a very similar equation for fluid dynamics. The equation of
motion of a fluid element in the fluid is
ρ d⃗
v
= − ⃗ − ρ∇Φ
∇P ⃗ ext
dt
with P the pressure and Φext some external potential.
v=⃗
Since ⃗ v (⃗ v = ∂⃗
x, t) we have that d⃗ v
∂t
dt + ∂⃗
v
∂xi
dxi , and thus
& '
d⃗
v
= ∂v
+ ⃗ ⃗ ⃗
v·∇ v
dt ∂t
These are the so-called Euler Equations. A comparison with the Jeans
2
Equations shows that ρσij has a similar effect as the pressure. However,
now it is not a scalar but a tensor.
2
ρσij is called the stress-tensor
The stress-tensor is manifest symmetric (σij = σji ) and there are thus 6
independent terms.
The Jeans Equations III
2
The stress tensor σij measures the random motions of the stars around the
streaming part ⟨vi ⟩⟨vj ⟩.
2
Note that the stress-tensor is a local quantity σij = σij
2
(⃗
x). At each point ⃗
x
it defines the velocity ellipsoid; an ellipsoid whose principal axes are defined
2
by the orthogonal eigenvectors of σij with lengths that are proportional to
the square roots of the respective eigenvalues.
The incompressible stellar fluid experiences anisotropic pressure-like forces.
Note that the Jeans Equations have 9 unknowns (3 streaming motions ⟨vi ⟩
and 6 terms of the stress-tensor). With only three equations, this is not a
closed set.
For comparison, in fluid dynamics there are only 4 unknowns (3 streaming
motions and the pressure). The 3 Euler Equations combined with the
Equation of State forms a closed set.
The Jeans Equations IV
One might think that adding higher-order moment equations of the CBE will
allow to obtain a closed set of equations. However, adding more equations
also adds more unknowns such as ⟨vi vj vk ⟩, etc. The set of CBE moment
equations never closes!
If, with this approach, a solution is found, the solution may not correspond to
a physical (i.e., everywhere positive) DF. Thus, although any real DF obeys
the Jeans equations, not every solution to the Jeans equations cprresponds
to a physical DF!!!
Cylindrically Symmetric Jeans Equations
As a worked out example we derive the Jeans equations under cylindrical
symmetry. We therefore write the Jeans equations in the cylindrical
coordinate system (R, φ, z).
The first step is to write the CBE in cylindrical coordinates
df ∂f ∂f ∂f
dt
= ∂t
+ Ṙ ∂R + φ̇ ∂φ + ż ∂f
∂z
+ v̇ R
∂f
∂vR
+ v̇ φ
∂f
∂vφ
+ v̇ z
∂f
∂vz
e˙ R = φ⃗
Using that ⃗ e˙ φ = −φ̇⃗
eφ , ⃗ e˙ z = 0 we have that
eR , and ⃗
$ % $ %
a = R̈ − Rφ̇2
⃗ eR + 2Ṙφ̇ + Rφ̈ ⃗
⃗ eφ + z̈⃗
ez
vR = Ṙ ⇒ v̇R = R̈
vφ = Rφ̇ ⇒ v̇φ = Ṙφ̇ + Rφ̈
vz = ż ⇒ v̇z = z̈
Cylindrically Symmetric Jeans Equations
This allows us to write
$ 2
vφ
% / vR vφ 0
a = v̇R −
⃗ R
eR +
⃗ R
+ v̇φ ⃗
eφ + v̇z ⃗
ez
⃗ ⃗ =
a = −∇Φ ∂Φ
⃗
e + 1 ∂Φ
⃗
e + ∂Φ
⃗
e
∂R R R ∂φ φ ∂z z
! 2 ∂f 3 ∂
! 2 ∂(ρ⟨vR2
⟩)
vR ∂R d ⃗v = ∂R vR f d ⃗ 3
v= ∂R
! !
∂f 3
vR vz ∂z d ⃗ ∂
v = ∂z vR vz f d3⃗ v = ∂(ρ⟨v∂z R vz ⟩)
! 2 $! 2 ! 2 % 2
vR vφ ∂f 1 ∂(v v f ) ∂(v v ) ⟨vφ ⟩
d 3
= d 3
d 3
=
R R
R ∂vR
⃗
v R ∂vR
φ
⃗
v − ∂vR
φ
f ⃗
v −ρ R
! $! ! %
∂Φ ∂f ∂Φ ∂(vR f ) 3 ∂vR ∂Φ
vR ∂R ∂vR
d 3
⃗
v = ∂R ∂vR
d ⃗
v − ∂vR
f d 3
⃗
v = −ρ ∂R
! 2
$! 2 ! 2
% 2
vR vφ ∂f 1 ∂(v v f ) ∂(v v ) ⟨vR ⟩
R ∂vφ
d3
⃗
v = R
R φ
∂vφ
d3
⃗
v − R φ
∂vφ
f d 3
⃗
v = −ρ R
! $! ! ∂vz %
∂Φ ∂f 3 ∂Φ ∂(vR f ) 3
vR ∂z ∂vz d ⃗ v = ∂z ∂vz
d ⃗v − ∂vR f d3⃗ v =0
Cylindrically Symmetric Jeans Equations
Working out the similar terms for the other Jeans equations we obtain the
Jeans Equations in Cylindrical Coordinates
2
$ 2
⟨vR ⟩−⟨vφ 2
⟩
%
∂(ρ⟨vR ⟩) ∂(ρ⟨vR ⟩) ∂(ρ⟨vR vz ⟩) ∂Φ
∂t
+ ∂R
+ ∂z
+ ρ R
+ ∂R
=0
∂(ρ⟨vφ ⟩) ∂(ρ⟨vR vφ ⟩) ∂(ρ⟨vφ vz ⟩) ⟨vR vφ ⟩
∂t
+ ∂R
+ ∂z
+ $2ρ R
= 0%
2
∂(ρ⟨vz ⟩) ∂(ρ⟨vR vz ⟩) ∂(ρ⟨vz ⟩) ⟨vR vz ⟩ ∂Φ
∂t
+ ∂R
+ ∂z
+ ρ R
+ ∂z
= 0
Note that there are indeed 9 unknowns in these 3 equations. Only if we make
additional assumptions can we solve these equations. In particular, one
often makes the following assumptions:
∂
(1) System is static ⇒ ∂t
-terms are zero and ⟨vR ⟩ = ⟨vz ⟩ = 0
Note that we have only 2 equations left: the system is still not closed.
If from the surface brightness we can estimate the mass density ρ(R, z)
and hence the potential Φ(R, z), we can solve the second of these Jeans
equations for the meridional velocity dispersion
1
!
∞
σ (R, z) =
2
ρ
ρ ∂Φ
∂z
dz
z
and the first Jeans equation then gives the mean square azimuthal velocity
2
⟨vφ ⟩ = ⟨vφ ⟩2 + σφ
2
2
∂Φ R ∂(ρσ )
2
⟨vφ ⟩(R, z) = σ (R, z) +
2
R ∂R + ρ ∂R
2
Thus, although ⟨vφ ⟩ is uniquely specified by the Jeans equations, we don’t
know how it splits in the actual azimuthal streaming ⟨vφ ⟩ and the azimuthal
2
dispersion σφ . Additional assumptions are needed for this.
Spherically Symmetric Jeans Equations
A similar analysis but for a spherically symmetric system, using the spherical
coordinate system (r, θ, φ), gives the following set of Jeans equations
$ %
∂(ρ⟨vr ⟩) ∂(ρ⟨vr2 ⟩) ρ
∂t
+ ∂r
+ r
2⟨vr2 ⟩ − ⟨vθ2 ⟩ − ⟨vφ
2
⟩ + ρ ∂Φ
∂r
=0
$ & ' %
∂(ρ⟨vθ ⟩) ∂(ρ⟨vr vθ ⟩) ρ
∂t
+ ∂r
+ r 3⟨vr vθ ⟩ + ⟨vθ2 ⟩ − ⟨vφ 2
⟩ cotθ = 0
∂(ρ⟨vφ ⟩) ∂(ρ⟨vr vφ ⟩) ρ
∂t
+ ∂r
+ r
[3⟨vr vφ ⟩ + 2⟨vθ vφ ⟩cotθ] = 0
If we now make the additional assumptions that the system is static and that
also the kinematic properties of the system are spherical symmetric then
there can be no streaming motions and all mixed second-order moments
vanish. Consequently, the stress tensor is diagonal with σθ2 = σφ
2
. Under
these assumptions only one of the three Jeans equations remains:
∂(ρσr2 ) 2ρ
/ 0
∂r
+ r
σr2 − σθ2 + ρ ∂Φ
∂r
=0
Notice once again how the spherical Jeans equation is not sufficient to
determine the dynamics: if the density and potential are presumed known, it
contains two unknown functions σr2 (r) and σθ2 (r) which can therefore not
be determined both.
Spherically Symmetric Jeans Equations
It is useful to define the anisotropy parameter
σθ2 (r)
β(r) ≡ 1 − σr2 (r)
Thus, if we can measure ρ(r), ⟨vr2 ⟩(r), and β(r), we can use the Jeans
equations to infer the mass profile M (r).
Consider an external, spherical galaxy. Observationally, we can measure the
projected surface brightness profile, Σ(R), which is related to the
luminosity density ν(r) = ρ(r)/Υ(r) as
!
∞
Σ(R) = 2 √ν r2 dr
R r −R2
α To Observer
R r
Note that the first term uses the luminosity density ν(r) rather than the
2
mass density ρ(r), because σp is weighted by light rather than mass.
Spherically Symmetric Jeans Equations
The mass-to-light ratio now follows from
Υ(r) = R rM (r)
4π 0 ν(r) r 2 dr
which can be used to investigate whether system contains dark matter halo
or central black hole, but always under assumption that system is isotropic.
EXAMPLE 2: Assume a constant mass-to-light ratio: Υ(r) = Υ0 . In this
case the luminosity density ν(r) immediately yields the enclosed mass:
!r
M (r) = 4πΥ0 ν(r) r 2 dr
0
We can now use the Jeans Equation to write β(r) in terms of M (r), ν(r)
and ⟨vr2 ⟩(r). Substituting this in the equation for Σ(R)σp
2
(R) allows a
solution for ⟨vr2 ⟩(r), and thus for β(r). As long as 0 ≤ β(r) ≤ 1 the
model is said to be self-consistent witin the context of the Jeans equations.
Almost always, radically different models (based on radically different
assumptions) can be constructed, that are all consistent with the data and
the Jeans equations. This is often referred to as the mass-anisotropy
degeneracy. Note, however, that none of these models need to be physical:
they can still have f < 0.
The Virial Equations I
We can obtain an important tensor equation relating global properties of the
system, by multiplying the CBE by both vj and xk and then integrating over
the entire phase-space.
The first step of this has already been performed in our derivation of the
Jeans equations, and yielded the momentum equations
∂(ρ⟨vj ⟩) ∂(ρ⟨vi vj ⟩) ∂Φ
∂t
+ ∂xi
+ ρ ∂x j
=0
Multiplying all terms with xk and integrating over real space yields
∂
! ! ∂(ρ⟨vi vj ⟩) 3 ! ∂Φ 3
∂t
ρxk ⟨vj ⟩d x = −
3
xk ∂xi
d ⃗x − ρ xk ∂xj
d ⃗
x
Using integration by parts the first term on the r.h.s. becomes
! ∂(ρ⟨vi vj ⟩) 3 !∂(ρxk ⟨vi vj ⟩) 3 !
xk ∂xi
d ⃗x = d x⃗− ρ⟨vi vj ⟩ ∂x k
d3
⃗
x
! ∂xi ∂xi
= − δki ρ⟨vi vj ⟩d3 x ⃗
!
= − ρ⟨vk vj ⟩d3 ⃗ x
= −2Kkj
where we have defined the kinetic energy tensor
1
!
Kij = 2
ρ⟨vi vj ⟩d3 x
⃗
The Virial Equations II
It is customary to split the kinetic energy tensor into contributions from
ordered and random motions:
2K + W + Sp = 0
with the surface pressure term
!
Sp = − ⟨v 2 ⟩⃗ ⃗
nd 2 S
r·⃗
As long as Sp ̸= 0 we thus expect that 2K/|W | ̸= 1.
See Shapiro et al. (astro-ph/0409173) for a detailed discussion.
The Virial Equations V
From a simple dimensional analysis one finds that |W | ∝ GM 2 /R with
M the system’s mass and R a characteristic radius.
A useful characteristic radius is the so-called gravitational radius defined by
GM 2
rg ≡ |W |
One can relate the gravitational radius to the half-mass radius rh , defined as
radius enclosing half the total mass. As shown by Spitzer (1969), typical
stellar systems have rg ≃ 2.5rh .
Combining this with the scalar virial theorem we can write that
rh ⟨v2 ⟩
M ≃ 2.5 G
which is a useful equation to obtain a (rough) estimate of the virial mass from
a measure of the half-mass radius and the rms motion
The Virial Equations VI
Using the scalar virial theorem we obtain
1
E = K + W = −K = 2
W
!
2π !
∞ !
∞
K= 3Υ
2
dφ dRRΣ(R)σp2 (R) = 3πΥ dRRΣ(R)σp2 (R) ≡ ΥJ
0 0 0
we obtain that
,r -2
!
∞ ! !
∞
W = −8Υ2 dr
r2
dr ′ r ′2 dΣ
dR
√ dR ≡ Υ2 J˜
0 0 r′ R −r 2
2
Υ = − 2J
J˜
Flattening of Oblate Spheroids I
As another application of the virial theorem we relate the flattening of an
oblate spheroid to its kinematics.
Consider an oblate system with it’s symmetry axis along the z -direction.
Because of symmetry considerations we have that
The usefulness of this equation lies in the fact that, for density distributions
that are constant on similar concentric spheroids, i.e., ρ = ρ(m2 ), the ratio
Wxx /Wzz depends only on the axis ratio c/a of the spheroids, and is
independent of the density profile! For an oblate body, to good approximation
Wxx
( c )−0.9
Wzz
≃ a
σ̃zz
( c )0.45
σ̃xx
≃ a
df #
n
∂f dIk
dt
= ∂Ik dt
=0
k=1
†
except for point masses and uniform spheres, which have five isolating
integrals of motion
Jeans Theorem & Spherical Systems
If the system is spherically symmetric in all its properties, then
⃗ : ie., the DF can only depend on
f = f (E, L2 ) rather than f = f (E, L)
the magnitude of the angular momentum vector, not on its direction.
Contrary to what one might naively expect, this is not true in general. In fact,
as beautifully illustrated by Lynden-Bell (1960), a spherical system can rotate
without being oblate.
r) + 12 [vr2 + vθ2 + vφ
Since E = Φ(⃗ 2
] we have that
! & '
1 1 2
⟨vr2 ⟩ = ρ
dvr dvθ dvφ vr2 f Φ + [v
2 r
+ vθ2 + vφ
2
]
! & '
1 1 2
⟨vθ2 ⟩ = ρ
dvr dvθ dvφ vθ f Φ + 2 [vr + vθ + vφ ]
2 2 2
! & '
1
2
⟨vφ ⟩= ρ
dvr dvθ dvφ vφ2
f Φ + 12 [vr2 + vθ2 + vφ
2
]
Since these equations differ only in the labelling of one of the variables of
integration, it is immediately evident that ⟨vr2 ⟩ = ⟨vθ2 ⟩ = ⟨vφ
2
⟩.
(note the minus sign in the Poisson equation), which can be written as
1 d
( ) !Ψ +
r 2 dr
r 2 dΨ
dr
= −16π G
2
f (E) 2(Ψ − E) dE
0
“from ρ to f ”
As an example, consider a stellar-dynamical system with a DF
& 2
'
ρ1 Ψ− 1
2v
f (E) = 2 )3/2
(2πσ0
exp σ02
Inspection shows that the solution for Ψ(r) and the corresponding ρ(r) are
2
σ0
Ψ(r) = −2σ02 lnr ρ(r) = 2πGr 2
σ 2 (r) = σ02
thus the local velocity dispersion, which is related to the “temperature”, is
independent of r .
Eddington’s Formula
“from f to ρ”
Using that Ψ is a monotonic function of r , so that ρ can be regarded as a
function of Ψ, we have
! !Ψ +
ρ(Ψ) = fd ⃗
v = 4π
3
f (E) 2(Ψ − E)dE
0
dρ
!Ψ f (E) dE
√1 = √
8π dΨ Ψ−E
0
which is an Abel integral equation, whose solution is
!E dρ
f (E) = √1 d √dΨ
8π 2 dE dΨ E−Ψ
0
This is called Eddington’s formula, which may also be written in the form
1 2
!E & '
d2 ρ dρ
f (E) = √1 2
√dΨ + √1
8π 2 dΨ E−Ψ E dΨ Ψ=0
0
Eddington’s Formula
Given a spherically symmetric density distribution, which can be written as
ρ = ρ(Ψ) (which is not always possible), Eddington’s formula yields a
corresponding DF f = f (E).
Note, however, that there is no guarantee that the solution for f (E) satisfies
the physical requirement that f ≥ 0 for all E .
Using Eddington’s formula
!E dρ
f (E) = √1 d √dΨ
8π dE
2 dΨ E−Ψ
0
is an increasing function of E .
If a density distribution ρ(r) does not satisfy this requirement, then the
model obtained by setting the anisotropy parameter β = 0 [i.e., by
assuming that f = f (E)] and solving the Jeans Equations is unphysical.
Anisotropic Spherical Models I
In the more general case, spherical systems (with spherical symmetry in all
their properties) have f = f (E, L2 ).
L2
Q=E− 2
2ra
Thus [ρQ (r), f (Q)] are similarly related as [ρ(r), f (E)] so that we may
use Eddington’s formula to write
!Q dρQ
f (Q) = √1 d √ dΨ
8π dQ
2 dΨ Q−Ψ
0
with L0 a constant, and Lc (E) the angular momentum of the circular orbit
with energy E .
Here g(E) controls the distribution of stars between energy surfaces, while
the circularity function h(x) describes the distribution of stars over orbits of
different angular momenta on surfaces of constant E .
Depending on the choise for h(x) one can construct models with different
anisotropies. If h(x) decreases with increasing x, the model will be radially
anisotropic, and vice versa.
Once a choise for h(x) is made, one can (numerically) obtain g(E) for a
given ρ(r).
All these various models are useful to explore how different orbital
anisotropies impact on observables, such as the line-of-sight velocity
distributions (LOSVDs).
Anisotropic Spherical Models IV
Velocity profiles, L(v), for the outer parts of spherical f (E, L2 ) models.
Results are shown for β = ∞ (circular orbits), −1, 0 (isotropic model), 0.5,
and 1 (radial orbits). The unit of velocity is the velocity dispersion, which is
different for each curve. (from: van der Marel & Franx 1993)
⟨vr2 ⟩(r)
β(r) = 1 − ⟨vr2 ⟩(r)
Spherical Models: Summary II
Many different models, with different orbital anisotropies, can correspond to
the same density distribution. Examples of models are:
• f (E, L2 ) = f (E) isotropic model, i.e., β = 0
Suppose I have measured the surface brightness profile Σ(R) and the
2
line-of-sight velocity dispersion σp (R). Depending on the assumption
regarding β(r) these data imply very different mass distributions M (r).
One can (partially) break this mass-anisotropy degeneracy by using
information regarding the LOSVD shapes.
Axisymmetric Models
Next we consider axisymmetric systems. If we only consider systems for
which most orbits are regular, then the strong Jeans Theorem states that, in
the most general case, f = f (E, Lz , I3 ).
From the symmetries of the individual orbits, it is evident that in this case
f = f (E, Lz ) =⇒ 2
⟨vR ⟩ = ⟨vz2 ⟩ and ⟨vR vz ⟩ = 0
2 2
Now we have two unknowns left, ⟨vR ⟩ and ⟨vφ ⟩, and the Jeans equations
reduce to
2
$ ⟨v2 ⟩−⟨v2 ⟩ %
∂(ρ⟨vR ⟩) ∂Φ
∂R
+ρ R
R
φ
+ ∂R
=0
2
∂(ρ⟨vz ⟩)
∂z
+ ρ ∂Φ
∂z
=0
which can be solved. Note, however, that the Jeans equations provide no
2
information regarding how ⟨vφ ⟩ splits in streaming and random motions.
In practice one often follows Satoh (1980), and writes that
$ %
⟨vφ ⟩2 = k ⟨vφ
2 2
⟩ − ⟨vR ⟩ . Here k is a free parameter, and the model is
isotropic for k = 1.
f (E, Lz ) Models III
Now let us return to our expression for the density ρ. If we change the
variables of integration from (vm , vφ ) to (E, Ψ), we obtain
2π
!Ψ !
ρ = 0
dE L2 <2(Ψ−E)R2
f (E, Lz )dLz
R z√
2π
!Ψ ! R 2(Ψ−E)
= R 0
dE 0 √
[f (E, Lz ) + f (E, −Lz )] dLz
4π
!Ψ ! R 2(Ψ−E)
= R 0
dE 0
f+ (E, Lz )dLz
We thus see that the density depends only on the even part of the DF (i.e., the
density contributed by a star does not depend on its sense of rotation). This
also implies that there are infinitely many DFs f (E, Lz ) that correspond to
exactly the same ρ(R, z), namely all those that only differ in f− (E, Lz ).
f (E, Lz ) Models IV
Thus, given a density distribution ρ(R, z), one can in principle obtain a
unique f+ (E, Lz ). In practice, the computation of f+ (E, Lz ) was
considered very difficult.
It was thought that one needs to (i) be able to express ρ explicitely as a
function of R and Ψ, and (ii) perform a complicated integral transform.
However, this situation changed drastically when Hunter & Qian (1993)
derived an axisymmetric analogue of Eddington’s Formula based on a
complex contour integral, which does not require explicit knowledge of
ρ(R, Ψ).
In addition, Evans (1993, 1994) has found a large and useful family of models
for which all relevant calculations can be done analytically. This is the family
of power-law models, which we already encoutered in the exersizes.
So for a wide range of ρ(R, z), we can compute the unique, corresponding
f+ (E, Lz ). But what about the odd part of the DF, f− (E, Lz )?
While f− (E, Lz ) has no influence on the density distribution, it specifies
the asymmetry between clockwise and counter-clockwise orbits. Hence, it is
responsible for the mean streaming velocity.
f (E, Lz ) Models V
In fact, it is straightforward to show that
4π
!Ψ ! R√2(Ψ−E)
⟨vφ ⟩ = ρR2 0
dE 0
f− (E, Lz ) Lz dLz
In principle, if we were to know ⟨vφ ⟩(R, z), we could solve for f− (E, Lz )
using the Hunter & Qian complex contour integral method, just like we can
recover ρ(R, z) from f+ (E, Lz ).
In practice, the observationally accessible quantities are Σ(x, y) and
vlos (x, y). Unless the system is seen edge-on, one can not uniquely
deproject these for ρ(R, z) and ⟨vφ ⟩(R, z).
f (E, Lz ) Models VI
Most f (E, Lz )-modelling therefore uses the following methodology:
(1a) Assume functional form for ν(R, z) and value for inclination angle i.
Project ν along line-of-sight and adjust free parameters by fitting Σ(x, y).
(1b) Assume value for i and deproject Σ(x, y) using some assumptions.
Examples are the Richardson-Lucy algorithm (Binney, Davies & Illingworth
1990) and the Multi-Gaussian-Expansion method (Emsellem, Monnet &
Bacon 1994). Warning, unless i = 90o these are not unique!
(2) Assume mass-to-light ratio, Υ(R, z), and compute Ψ(R, z) from
ρ(R, z) = Υ(R, z)ν(R, z) using the Poisson equation.
2 2
(3) Solve Jeans Equations for ⟨vR ⟩(R, z) and ⟨vφ ⟩(R, z).
2
(4) Make assumptions regarding split of ⟨vφ ⟩ in streaming motion and
random motion (i.e., pick a value for the Satoh k-parameter).
(5) Project model and compare resulting vlos (x, y) and σlos (x, y) to data
obtained from spectroscopy.
(6) Repeat analysis to constrain i, k and Υ(R, z).
For examples, see Binney, Davies & Illingworth (1990), van der Marel (1991),
Cretton & van den Bosch (1999)
f (E, Lz ) Models VII
The results of f (E, Lz ) modeling can be summarized as follows
Rotation velocities and velocity dispersions along major axis (WHT + HST)
k = 1, i = 90o
σp in center requires BH
Note: vrms is not well fitted
This is independent of the
assumed value for k, which
only determines how vrms
splits in vrot and σp
⇒ f = f (E, Lz , I3 )
VCC1010
NGC4365
νobs = (1 − β)γνem
where we have used the first term of the series expansion of ln(1 + x)
Let S(x) correspond to an observed spectrum, rebinned linearly in x. Then
we may write that
This method has the advantage that no assumption is made regarding the
functional form of B(x). However, the disadvantage is that one needs to
adopt some noise filtering which introduces correlations between different
points in the B(x) estimate. This complicates comparison with models.
The alternative is to use a parameterized method, by assuming a functional
form B̃(x) for the broadening function which has n free parameters. The
best-fit parameters are obtained by minimizing
! $ %2
χ2 = S(x) − B̃(x) ⊗ T (x) dx
Rather than talking about the broadening function B(x) one often prefers to
talk about the velocity profile L(v) ≡ B(v/c), also called the LOSVD
Kinematics IV
A typical functional form to assume for L(v) is a simple Gaussian
1 2
L(v) = √1 e −2w
2πσ
where Hl (x) are Hermite Polynomials of degree l, and hj are additional free
parameters that describe the deviation of L(v) from a pure Gaussian
(i.e., hj = 0 for a Gaussian LOSVD).
Typically one truncates the series expansion at N = 4 so that the LOSVD
has four free parameters: V , σ , h3 , and h4 . Their best-fit values are
determined by minimizing the χ2 defined above.
Kinematics V
For galaxies and DM haloes one always finds that trelax ≫ tHubble . As first
pointed out by Zwicky (1939), how is it possible that galaxies appear relaxed?
In particular, if galaxies did not form near equilibrium, two-body relaxation
certainly did not have the time to get them there. This would imply that
galaxies all form in (virial) equilibrium, which seems very contrived.
In a collisionless system
#
2N
λi = 0
i=1
which expresses the incompressibility of the flow (conservation of volume).
The Lyapunov Exponent II
x is regular then λi = 0 for i = 1, ..., 2N .
If the trajectory through ⃗
On the other hand, if the trajectory is stochastic then λ ≡ max λi > 0.
The inverse of the largest Lyapunov exponent is called the Lyapunov time,
and defines the characteristic e-folding time.
For a chaotic (stochastic) orbit the Lyapunov time is finite, while it is infinite
for regular orbits.
Chaotic Mixing I
In the parts of phase-space that are not filled with regular, but with stochastic
orbits, mixing occurs naturally due to the chaotic behavior of the obits.
Chaos implies a sensitivity to initial conditions: two stars intially close
together separate exponentially with time.
After a sufficiently long time, the group of stars that were initially close
together will have spread over the entire accessible phase-space (ie., the
Arnold web). As for phase-mixing, chaotic mixing thus smooths out (i.e.,
relaxes) the coarse-grained DF, but leaves the fine-grained DF invariant.
Chaotic Mixing II
Unlike for phase-mixing, chaotic mixing is irreversible, in the sense that an
infinitely precise fine-tuning of the phase-space coordinates is required to
undo its effects.
Chaotic mixing occurs on the Lyapunov timescale.
However, the mixing rate of chaotic ensembles typically falls below the
Lyapunov rate once the trajectories separate, because stochastic orbits are
often confined over long periods of time to restricted parts of phase-space.
The time scale on which the orbits are uniformly spread over their accessible
phase-space then becomes dependent on the efficiency of Arnold diffusion.
All in all, the effective rates of phase mixing and chaotic mixing might
therefore be comparable in real galaxies.
time
Violent Relaxation III
A few remarks regarding Violent Relaxation:
• During the collapse of a collisionless system the CBE is still valid, i.e., the
fine-grained DF does not evolve (df /dt = 0). However, unlike for a
‘steady-state’ system, ∂f /∂t ̸= 0.
• A time-varying potential does not guarantee violent relaxation. One can
construct oscillating models that exhibit no tendency to relax: although the
energies of the individual particles change as function of time, the relative
distribution of energies is invariant (cf. Sridhar 1989).
Although fairly artificial, this demonstrates that violent relaxation requires
both a time-varying potential and mixing to occur simultaneous.
• The time-scale for violent relaxation is
4 5−1/2 4 5−1/2
(dE/dt)2 (∂Φ/∂t)2 3
tvr = E2
= E2
= 4
⟨ Φ̇2
/Φ 2 −1/2
⟩
where the last step follows from the time-dependent virial theorem (see
Lynden-Bell 1967).
Note that tvr is thus of the order of the time-scale on which the potential
changes by its own amount, which is basically the collapse time. ◃ the
relaxation mechanism is very fast. Hence the name ‘violent relaxation’
Violent Relaxation IV
0 0.2 0.4 0 0.2 0.4 0 0.2 0.4
-2
-4
-2
-4
Violent relaxation leads to efficient fine-grained mixing of the DF, and erases
the system’s memory of its initial conditions.
For comparison: phase-mixing only leads to a relaxation of the
coarse-grained DF and is reversible.
Particles initially at A and D gain energy during the first quarter cycle of the
wave (i.e., their net velocity increases), while those at B and C loose energy.
Since there are more particles at A than at C and more at D than at B (see
distribution at left), the particles experience a net gain. Consequently, the
wave has to experience a net loss.
The End-State of Relaxation I
The various relaxation mechanism discussed will drive the system to an
equilibrium configuration. However, there are many different equilibrium
configurations for a collisionless system of mass M and energy E . So to
which of these configurations does a system settle?
This is a very complicated problem which is still largely unsolved.
One might expect the system to evolve to a most probable state.
This actually happens in a collisional gas, where the collisions cause a rapid
exchange of energy between the particles. The most probable distribution is
obtained by maximizing the entropy, which results in the Maxwell-Boltzmann
velocity distribution.
If one applies the same logic to collisionless systems, and (somewhat
naively) defines the systems entropy by
!
S=− f lnf d3 ⃗
xd3⃗
v
then one finds (not surprisingly) that the functional form of f which
maximizes S subject to given values of the system’s mass and energy is that
of a singular isothermal sphere, for which the DF again has a Maxwellian
velocity distribution (see Lynden-Bell 1967).
The End-State of Relaxation II
However, a singular isothermal sphere has infinite mass, and is thus
inconsistent with our constraint. Thus, no DF that is compatible with finite
M and E maximizes S !
In fact, one can show that one can always increase the system’s entropy (as
defined above) by increasing the systems central concentration (see
Tremaine et al. 1986).
Thus, violent relaxation will drive the system towards its equilibrium state by
making the system more concentrated, but can never reach it. In other
words, there is no end state. This seems unsatisfactory.
There are two possible ‘solutions’, both of which are probably correct:
(1) The relaxation is not complete.
(2) The expression for the entropy is not appropriate.
As an example of the latter, since gravitational systems do not obey
extensive statistics, a more appropriate definition for the entropy may be
!
Sq = − f q lnq f d3 ⃗
xd3⃗
v
where lnq (x) = (x1−q − 1)/(1 − q) with q ̸= 1 (see Hansen et al. 2004).
Note that for q = 1 one recovers the normal Boltzmann-Gibbs entropy S .
The End-State of Relaxation III
Despite the uncertainties regarding the definition of entropy, it has become
clear that relaxation is in general not complete. There are various reasons for
this:
• The time-scales for phase-mixing and chaotic mixing may not be small
enough compared to the Hubble time.
• Violent relaxation is only efficient as long as the potential fluctuates. This
requires global, coherent modes, which, however, as strongly damped by
Landau Damping. This occurs roughly on the collapse time-scale, which is
also the time-scale on which violent relaxation works. Thus it is not too
suprising if it is not complete.
• This is strongly supported by numerical simulations, which show that the
end-state of violent relaxation depends on the clumpiness of the initial
conditions (van Albada 1982). This illustrates that the final state is not
completely governed by statistical mechanics alone, but also by the details
of the collapse.
rAS
rAB
r0 rBS b
M
vM
⃗ d⃗
vM 4πG2 M 2
Fdf = M dt = − v2 lnΛ ρ(< vM )
M
Note that F⃗df ∝ M 2 : the amount of material that is deflected (i.e., the
‘mass’ of the wake) is proportional to M and the gravitational force that this
wake exerts on M is proportional to M times its own mass.
−2
⃗df ∝ v in the limit of large vM , but F ⃗df ∝ vM in the limit of
Note that F M
small vM [i.e., for sufficiently small vM one may replace f (vm ) with f (0)].
⃗df is independent of m!
Note that F
The Coulomb Logarithm
One has that Λ = bmax /bmin with bmin and bmax the minimum and
maximum impact parameters for which encounters can be considered
effective:
Encounters with b > bmax don’t cause a significant deflection, and these
therefore do not contribute significantly to the wake. Encounters with
b < bmin cause a very strong deflection so that these also do not contribute
to the wake.
We can estimate bmin as the impact parameter that corresponds to a close
encounter (see first lecture), and thus bmin ≃ GM
⟨v2 ⟩
with ⟨v 2 ⟩1/2 the rms
velocity of the background particles.
The maximum impact parameter, bmax , is much harder to estimate (see
White 1976), and one typically simply takes bmax ≃ L with L the size of the
system.
Typical values that one encounters for the Coulomb Logarithm are
3∼< lnΛ < 30.
∼
Dynamical Friction: Local vs. Global
Note that Chandrasekhar Dynamical Friction is a purely local phenomenon:
⃗df depends only on the local density ρ(< v),
The dynamical friction force F
and the backreaction owes to a local phenomenon, namely wake-creation
due to gravitational focussing.
However, a system A can also experience dynamical friction due to a system
B when it is located outside of B (Lin & Tremaine 1983). Clearly, this friction
can not arise from a wake. Instead, it arises from torques between A and
stars/particles in B that are in resonance with A (Tremaine & Weinberg
1984).
(Weinberg 1989)
The extent to which dynamical friction is a local (wake) versus a global
(resonant-coupling) effect is still being debated.
Orbital Decay I
Consider a singular, isothermal sphere with density and potential given by
Vc2
ρ(r) = 4πGr 2
Φ(r) = Vc2 lnr
If we further assume that this sphere has, at each point, an isotropic and
Maxwellian velocity distribution, then
$ 2
%
ρ(r) vm
f (vm ) = (2πσ 2 )3/2
exp − 2σ2
√
with σ = Vc / 2. Now consider a test-particle of mass M moving on a
circular orbit (i.e., vM = Vc ) through this sphere. The Chandrasekhar
dynamical friction that this particle experiences is
$ %
4πlnΛG2 M 2 ρ(r) GM 2
Fdf = − erf (1) − √2 e−1 ≃ −0.428 lnΛ
Vc2 π r2
Vc dr
dt
= −0.428 lnΛ GM
r
As an example, consider the LMC. Assume for simplicity that the LMC moves
on a circular orbit at ri = 50 kpc, that the mass of the LMC is
M = 2 × 1010 M⊙ , and that the MW can be approximated as a singular
isothermal sphere with Vc = 220 km s−1 and with a radius of
r = 200 kpc.
We then find that the LMC will reach the center of the MW halo after a time
tdf ≃ 7.26
lnΛ
Gyr. Using the approximation for Λ discussed before we find
that lnΛ ≃ 6, and thus tdf ≃ 1.2 Gyr.
Orbital Decay III
The derivation on the previous pages was for a circular orbit. We now focus
on the orbital decay of an eccentric orbit, whose eccentricity is defined as
r+ − r−
e= r+ + r−
where we have used that Lc (E) = rc (E)Vc . Using that L = rv⊥ , with
v⊥ the velocity in the direction perpendicular to the radial vector, we find that
dE
dt
= v dv
dt
dL
dt
= r dv⊥
dt
= L dv
v dt
Combining all the above we finally find that
, & '2 -
de η de v dv
dt
= v dη
1− Vc dt
where dv/dt = Fdf /M < 0 (see van den Bosch et al. 1999).
Orbital Decay V
de
At pericenter we have that v > Vc . Since η > 0, dη < 0, and dv
dt
< 0 we
thus have that de
dt
< 0; the eccentricity decreases and the orbit becomes
more circular.
However, at apocenter v < Vc and therefore de
dt
> 0: the orbit becomes
more eccentric during an apocentric passage.
The overall effect of dynamical friction on the orbit’s eccentricity, integrated
over an entire orbit, can not be obtained from inspection: numerical
simulations are required.
b
b’ R x
θ
S z
r’−R
r’
ϕ
R
From the above figure one can see that
⃗ 2 = (R − r ′ cos φ)2 + r ′2 sin2 φ = R2 − 2rR cos φ + r ′2
r ′ − R|
|⃗
The Impulse Approximation V
Using the series expansion √ 1 = 1 − 12 x + 1·3 2
2·4
x − 1·3·5 3
2·4·6
x + ... this
1+x
yields
, & ' & '2 -
1 1 1 r′ ′2
r 3 r′ r ′2
⃗
r ′ −R|
|⃗
= R
1− 2
−2 R cos φ + R2
+ 8
−2 R cos φ + R2
+ ...
These are related to the tidal forces in the (x, y, z) coordinate system:
x′ = x cos θ − y sin θ
y′ = −x sin θ + y cos θ
z′ = z
so that we obtain, after some algebra
dvx GMp / 2
0
Fx = = R3 /
x (2 − 3 sin θ) + 3 y sin θ cos θ
dt
dvy GMp 0
Fy = dt
= R3
y (2 − 3 cos θ) + 3 x sin θ cos θ
2
dvz GM z
Fz = dt
= − R3p
The Impulse Approximation VII
Integrating these equations over time yields the cumulative velocity changes
⃗
wrt the center of S . Using that R(t) = (b, vp t, 0), and thus cos θ = b/R
and sin θ = vp t/R we obtain
2GMp x 2GMp z
∆vx = vp b2
∆vy = 0 ∆vz = − vp b2
2GM
We thus have that ∆⃗
v = vp b2p (x, 0, −z), and
∆E = 1
2
(⃗ v )2 + Φ(⃗
v + ∆⃗ r ′ ) − 12 ⃗
v 2 − Φ(⃗
r′ ) = ⃗ v + 12 (∆v)2
v · ∆⃗
Note that, in the impulse approximation, the potential energy does not
change during the encounter.
We are interested in computing ∆Etot which is obtained by integrating ∆E
over the entire system S .
First we note that the integral of the first term of ∆E typically is equal to
zero, by symmetry. Therefore
1
!
∆Etot = 2
ρ(⃗ r ′
)|∆⃗ v | d ⃗
2 3 ′
r
2G2 Mp2 !
= 2
vp b 4 ρ(⃗ r ′
)(x 2
+ z 2 )d3⃗
r′
2G2 Mp2
= 2
vp b 4 M s ⟨x 2
+ z 2
⟩
The Impulse Approximation VIII
Assuming spherical symmetry for S , so that
2 2
⟨x2 + z 2 ⟩ = 3
⟨x2
+ y2 + z2⟩ = 3
⟨r 2
⟩ we finally obtain
& '2
4 2 Mp ⟨r 2 ⟩
∆Etot = 3
G Ms vp b4
As shown by Aguilar & White (1985), this derivation, which is originally due
to Spitzer (1958), is surprisingly accurate for encounters with
b∼ > 5max[r , r ] (with r and r the median radii of P and S ), even for
p s p s
relatively slow collisions with v∞ ≃ ⟨vs2 ⟩1/2 .
The above shows that fast encounters pump energy into the systems
involved. This energy originates from the kinetic energy associated with the
orbit of P wrt S . Note that ∆Etot ∝ b−4 , so that close encounters are far
more important that distant encounters.
As soon as the amount of energy pumped into S becomes comparable to its
binding energy, the system S will become tidally disrupted.
Some stars can be accelerated to velocities that exceed the local escape
velocity ◃ encounters, even those that do not lead to tidal disruption, may
cause mass loss of S . In this case, the first terms of ∆E is not zero, and the
above impulse approximation has to be handled with care.
Return to Equilibrium
As we have seen, a fast encounter transfers orbital energy to the two systems
involved in the encounter, whose kinetic energy has subsequently increased.
After the encounter the systems are therefore no longer in virial equilibrium.
The systems now need to readjust themselves to find a new virial
equilibrium. Interestingly, this process changes the internal kinetic energy
more than did the encounter itself.
Let the initial kinetic and total energies of a system be T0 and E0 ,
respectively. According to the virial theorem we have that E0 = −T0 .
Due to the encounter T0 → T0 + δT , and thus also E0 → E0 + δT .
Applying the virial theorem we obtain that after the relaxation the new kinetic
energy is
T1 = −E1 = −(E0 + δT ) = T0 − δT
Thus, the relaxation process decreases the kinetic energy by 2δT from
T0 + δT to T0 − δT0 .
Similarly, the gravitational energy becomes less negative:
W1 = 2E1 = 2E0 + 2δT = W0 + 2δT
Since the gravitational radius rg = GM 2 /|W | the system will expand!
Heat Capacity of Gravitating Systems
As we have seen on the previous page, by pumping energy (‘heat’) into the
system, it has actually grown ‘colder’. This is a consequence of the negative
heat capacity of gravitational systems.
By analogy with an ideal gas we defined the temperature of a self-gravitating
system as
1 3
2
m⟨v 2 ⟩ = k T
2 B
Unlike an isothermal gas, the temperature in a self-gravitating system is
typically a function of position. Therefore, we define the mean temperature as
1
!
⟨T ⟩ ≡ M
x)T (⃗
ρ(⃗ x)d3 ⃗
x
and the total kinetic energy of a system of N particles is then
K = 32 N kB ⟨T ⟩. According to the virial theorem we thus have that
E = − 32 N kB ⟨T ⟩.
This allows us to define the heat capacity of the system as
dE
C≡ d⟨T ⟩
= − 32 N kB