Professional Documents
Culture Documents
k
r
i
q
k
q
k
+
r
i
t
r
i
q
k
=
r
i
q
k
cancelling the dots.
This last step comes from taking the partial derivative with respect to q
k
.
6
4 The Lagrangian
SEE ALSO HAND AND FINCH, 1.61.11
Lagranges formulation of mechanics
We will assume:
1. Holonomic constraints
2. Constraining forces do no work
3. Applied forces are conservative, such that a scalar potential energy function exists.
These assumptions imply: no friction or viscous drag, reaction forces are normal to sur-
faces, strings are inextensible and always tight. Note that the potential function may
change with time.
DAlemberts Principle
Newtons 2nd law for particle i is
p
i
= F
i
.
This implies that
i
(F
i
p
i
) r
i
= 0,
where r
i
represents an arbitrary virtual displacement. A virtual displacement is a hypo-
thetical change of coordinates at one instant in time that is compatible with the con-
straints. This force includes both the constraint and applied forces, such that
F
i
= F
(c)
i
+ F
(a)
i
.
Given that we have supposed the constraining forces do no work, this gives us DAlemberts
Principle:
i
(F
(a)
i
p
i
) r
i
= 0. (1)
We have removed the constraint forces from this equation, but have yet to rewrite the
equation in terms of the generalised coordinates that are independent of one another.
7
Generalised Equations of Motion (EoM)
We would like to rewrite DAlemberts Principle (
i
(F
(a)
i
p
i
) r
i
= 0) in terms of the inde-
pendent, generalised coordinates, q
k
. Recall
v
i
=
d
dt
(r
i
) =
k
r
i
q
k
q
k
+
r
i
t
. (2)
Taking the partial derivative of this with respect to q
k
yields
v
i
q
k
=
r
i
q
k
. (3)
By analogy with Eq. (2), we can write
d
dt
_
r
i
q
j
_
=
2
r
i
q
j
q
k
q
k
+
2
r
i
q
j
t
. (4)
Taking the partial derivative of Eq. (2) with respect to q
j
gives
v
i
q
j
=
2
r
i
q
j
q
k
q
k
+
2
r
i
q
j
t
. (5)
Comparing Eqs. (4) and (5) leads to
d
dt
_
r
i
q
j
_
=
v
i
q
j
. (6)
The 1st termin Eq. (1), which represents the virtual work done by the applied force (nowwithout
the
(a)
, as we no longer need to worry about the constraint force) can be written as
W =
i
F
i
r
i
=
i
F
i
k
r
i
q
k
q
k
=
k
_
i
F
i
r
i
q
k
_
. .
Generalised forces F
k
=
W
q
k
q
k
=
k
F
k
q
k
.
The 2nd term in Eq. (1), can be written as
i
p
i
r
i
=
i
m
i
v
i
r
i
=
k
m
i
v
i
r
i
q
k
q
k
.
If we note that
v
i
r
i
q
k
=
d
dt
_
v
i
r
i
q
k
_
v
i
d
dt
_
r
i
q
k
_
then we can write
i
p
i
r
i
=
k
m
i
_
d
dt
_
v
i
r
i
q
k
_
v
i
d
dt
_
r
i
q
k
__
q
k
.
8
Using Eqs. (3) and (6), we can write
i
p
i
r
i
=
k
_
d
dt
_
m
i
v
i
v
i
q
k
_
m
i
v
i
v
i
q
k
_
q
k
.
Changing the order of summation gives
i
p
i
r
i
=
k
_
d
dt
_
q
k
_
i
1
2
m
i
v
2
i
__
q
k
_
i
1
2
m
i
v
2
i
__
q
k
.
Therefore, DAlemberts Principle can be written as
k
_
F
k
d
dt
_
T
q
k
_
+
T
q
k
_
q
k
= 0,
where the kinetic energy, T =
i
1
2
m
i
v
2
i
= T(q
1
, . . . , q
N
, q
1
, . . . , q
N
, t).
The q
k
are independent, so the generalised equations of motion are
F
k
d
dt
_
T
q
k
_
+
T
q
k
= 0. (7)
We are assuming that the applied forces are conservative, so F
i
=
i
V(r
1
, . . . , r
i
, . . . , r
M
).
Correspondingly,
F
k
=
i
V
r
i
q
k
=
V
q
k
(follows from the chain rule).
Substituting this into Eq. (7), and dening the Lagrangian by
L = T V,
gives the Euler-Lagrange equations:
d
dt
_
L
q
k
_
L
q
k
= 0.
These are the fundamental equations of Lagrangian mechanics. Note that L is a function
of all of the q
k
, q
k
and t.
Ignorable coordinates
If the time derivative of a coordinate appears in the Lagrangian, but the coordinate itself
does not, then this is an ignorable coordinate. As
L
q
k
= 0,
the Euler-Lagrange equation implies that
d
dt
_
L
q
k
_
= 0.
9
Thus, the canonically conjugate momentum to the coordinate, dened as
p
k
=
L
q
k
,
is a constant of the motion.
Example: a free particle in 1D with displacement x, has a Lagrangian L =
1
2
m x
2
. Thus,
the coordinate x is ignorable, and p =
L
x
= m x is constant.
Having constants of motion can greatly simplify nding the equation of motion of a
system. The canonically conjugate momentum plays a central role in the Hamiltonian
approach to quantum mechanics.
10
5 Variational Calculus
SEE ALSO HAND AND FINCH, 2.12.8
Calculus of Variations
What path y(x) leads to an extreme value of the functional
I[y] =
_
x
2
x
1
f (y, y
, x)dx, (8)
where y
/ =
.
dI
d
=
_
x
2
x
1
_
f
y
y
+
f
y
_
dx
=
_
x
2
x
1
_
f
y
+
f
y
_
dx
Integrating the second term by parts gives
_
x
2
x
1
f
y
d
dx
dx =
_
f
y
_
x
2
x
1
_
x
2
x
1
d
dx
_
f
y
_
dx.
The rst term is zero because = 0 at the ends of the path. Therefore
dI
d
= 0
_
x
2
x
1
_
d
dx
_
f
y
f
y
_
(x)dx = 0 (x).
Hence we arrive at Eulers equation
d
dx
_
f
y
f
y
= 0.
11
Recall the Euler-Lagrange equations
d
dt
_
L
q
k
_
L
q
k
= 0.
For the case where f does not explicitly depend upon x, ie f /x = 0, one can use
d f
dx
=
f
x
+
f
y
dy
dx
+
f
y
dy
dx
in conjunction with the Euler equation to show that
d
dx
_
f y
f
y
_
= 0. (9)
This is a useful alternative form of the Euler equation.
Hamiltons Principle of Least Action
The Euler-Lagrange equations are the solution to the calculus of variations problem: for
what route q(t) between points q
1
and q
2
at t
1
and t
2
is the integral
S[q(t)] =
_
t
2
t
1
L(q(t), q(t), t)dt
a minimum? Hamiltons Principle states that a mechanical system moves in just such a
way as to minimise this integral, which is also known as the action functional, S[q(t)]. The
argument of a function is a number, and a number is returned the argument of a
functional is a function, and a number is returned.
Hamiltons Principle can be written as S = 0, where the represents a change with
respect to the path between the end points of the integral. The variational derivative of
L with respect to q is written
L
q
L
q
d
dt
_
L
q
_
.
12
Lagrange multipliers
If a mechanical system has an applied constraint and the number of DoF reduces, then this
is usually dealt with by eliminating a dynamical variable and nding the equation of motion
for a smaller set of generalised coordinates. An alternative method involves using Lagrange
multipliers.
Consider a pendulum of length l. The natural generalised coordinate to choose to describe the
motion is the angle . Suppose you stubbornly and foolishly insist on using x and y, then you
also have the constraint x
2
+ y
2
= l
2
. The variation of the action is
S =
_
_
L
x
x +
L
y
y
_
dt.
Hamiltons Principle implies that S = 0, but the non-independence of x(t) and y(t) means that
we cannot simply say that each variational derivative is separately zero. Our constraint can be
written as
G(x, y) = x
2
+ y
2
= c a constant.
Now
G =
G
x
x +
G
y
y = 0
is how the variations in x and y are connected. The variation of the action can then be written as
S =
_
__
L
x
G
x
_
x +
_
L
y
G
y
_
y
_
dt.
where is an arbitrary function of the independent variable (t in this case) called a Lagrange
multiplier. If we choose such that the coefcient of x is set to zero, then S = 0 implies that
the coefcient of y must also vanish. It is as if we had independent variables x and y.
13
Workshop 1: Applications of the Lagrangian Approach
1. Consider a point mass (of mass m) falling in the Earths gravitational eld (take the acceler-
ation = g to be constant, i.e., independent of the distance of the point mass from the Earths
surface, and ignore air resistance).
(a) What is the kinetic energy T of the point mass? What is the potential energy V? Hence,
determine the Lagrangian L = T V.
(b) Use the Euler-Lagrange equation to determine the equation of motion for the vertical
motion (z direction) of the point mass.
(c) Solve the equation of motion, and express the solution in terms of the point masss
initial position z(0) and velocity z(0).
2. Consider a point mass (of mass m) attached to a spring (spring constant k), constrained
somehow to move only in the x direction, where x = 0 is taken to be the position of the
point mass when it is at rest. This is just a one-dimensional simple harmonic oscillator
(undamped), with characteristic oscillation frequency =
k/m.
(a) What is the kinetic energy T of the point mass? What is the potential energy V? Hence,
determine the Lagrangian L = T V.
(b) Use the Euler-Lagrange equation to determine the equation of motion of the point
mass.
(c) The equation of motion is a linear, homogeneous, second-order differential equation.
Solve this using the trial solution x e
t
.
(d) Find the general solution of the equation of motion, in terms of the point masss initial
position x(0) and velocity x(0).
(e) Two independent real constants A and B can equally well be represented in terms of
two other real constants A
and
, through A = A
sin
, and B = A
cos
. Use this,
along with the trigonometric identity sin( + ) = sin cos + cos sin , to show
your solution can also be expressed in the form
x(t) = A
sin(t +
),
and express A
and
frictionless surfaces
15
6 Linear Oscillators
SEE ALSO HAND AND FINCH, 3.13.3
Equilibrium
A mechanical system that remains at rest is in equilibrium. This occurs at points in cong-
uration space where all generalised forces F
k
vanish.
For a conservative system, this corresponds to congurations where the potential en-
ergy V(q
1
, . . . , q
N
) is stationary, i.e., its rst derivatives with respect to the q
k
vanish.
Lagrangian Near Static Equilibrium (1 DoF)
Carry out a 2nd order Taylor series expansion of some L(q, q) around q = q
S
[where q
S
is a
stationary point, i.e., V/q|
q=q
S
= 0] and q = q
S
= 0. Choose q
S
= 0, then
L L
approx
=L(q
S
, q
S
) + q
L
q
q
S
, q
S
+ q
L
q
q
S
, q
S
+
1
2
_
q
2
2
L
q
2
q
S
, q
S
+ 2q q
2
L
q q
q
S
, q
S
+ q
2
2
L
q
2
q
S
, q
S
_
= A + Bq + C q + Dq
2
+ Eq q + F q
2
.
Substitute the resulting L
approx
into the Euler-Lagrange equation (n.b., B = 0 by the denition of
equilibrium)
d
dt
(
L
approx
q
..
C + Eq + 2F q)
L
approx
q
..
(2Dq + E q) = 0.
This yields q (D/F)q = 0, which is identical to the EoM for a particle of mass m derived from
L =
m
2
_
q
2
+
D
F
q
2
_
. (10)
Simple Harmonic Oscillator (SHO)
The Lagrangian for a simple harmonic oscillator, such as a mass m attached to a spring
with spring constant k, can be written as
L =
1
2
m q
2
1
2
kq
2
.
If D/F < 0, then Eq. (10) is the Lagrangian for a simple harmonic oscillator, with k =
m(D/F).
Close to equilibrium one therefore observes simple harmonic motion with character-
istic frequency =
D/F. Hence, the motion is described as stable (if D/F > 0, it is
unstable).
q +
2
q = 0 is a linear, homogeneous differential equation, and can be solved to give
16
q(t) = q(0) cos(t) +
q(0)
sin(t).
Damping Force
A frictional force F
d
, acts to suppress motion, i.e., it must act in opposition to motion, and
vanish when the motion ceases.
The simplest such force is F
d
= q = (m/Q) q (expressed in convenient form for a
damped SHO), where Q is the dimensionless quality factor of the oscillator.
Damped Simple Harmonic Oscillator
We insert the SHO Lagrangian L = (m/2)( q
2
2
q
2
) into the Euler-Lagrange equation and
incorporate the (non-conservative) damping force separately
d
dt
_
L
q
_
L
q
= F
d
.
This corresponds to taking Eq. (7), grouping the conservative forces (in the form of a
potential energy function V) with T to make L, and keeping the non-conservative forces
separate.
The result is a linear, homogeneous differential equation, q + (/Q) q +
2
q = 0.
This is solved by setting q = e
t
and determining the allowed values of from the
resulting auxiliary equation. The solutions differ qualitatively depending on the value of Q.
17
7 Driven Oscillators
SEE ALSO HAND AND FINCH, 3.43.8
Oscillator Driven by an External Force
We consider an undamped oscillator, driven by a time-dependent external force F(t)
with no spatial dependence.
Whatever supplies the driving force is not considered to be part of the dynamical
system. In particular we do not consider any effect of forces the oscillator exerts on the
source of the external force.
The corresponding Lagrangian is given by
L =
m q
2
2
..
T
_
m
2
q
2
2
F(t)q
_
. .
V
,
which yields a linear, inhomogeneous differential equation:
q +
2
q = F(t)/m. (11)
Dirac -Functions
It is instructive to start by considering an impulsive (instantaneously applied and innitely
intense) force. For this we make use of -functions. Recall that the effect of (t t
) is
dened (necessarily under an integral) as
f (t
) =
_
(t t
) f (t)dt.
(t t
,
with unit area:
_
(t t
yields
_
t
+
t
( q +
2
q)dt =
_
t
+
t
K
m
(t t
)dt,
q(t
+ ) q(t
) +
2
_
t
+
t
q(t)dt
. .
0 as 0, unless q(t
)
=
K
m
.
Hence, there is an instantaneous change in the velocity: q(t
+
) = q(t
) + K/m, where
t
= lim
0
t
+
) = q(t
), velocity q(t
).
2. t = t
) = q(t
), q(t
) = q(t
) + K/m.
3. t > t
) cos([t t
]) +
q(t
) + K/m
sin([t t
]). (12)
(Causal) Greens Function G
Consider G(t t
) +
2
G(t t
) = (t t
).
We consider the oscillator to be at rest prior to application of the impulsive force.
Hence, by comparison with Eq. (12),
G(t t
) =0, t t
0,
G(t t
) =
1
sin([t t
]), t t
0.
(13)
General Driving Force F(t)
By denition of the -function, the equation of motion for the driven oscillator [Eq. (11)]
can be written as
q +
2
q = (1/m)
_
dt
F(t
)(t t
).
We can then eliminate (t t
dt
F(t
)[
G(t t
) +
2
G(t t
)].
Comparing terms, and substituting in the solution to G(t t
F(t
)G(t t
)dt
=
1
m
_
t
F(t
) sin([t t
])dt
.
19
8 Coupled Small Oscillations
SEE ALSO HAND AND FINCH, 9.19.2
Lagrangian formulation particularly useful for small oscillations of a set of coupled os-
cillators about equilibrium positions. Important in acoustics, molecular spectroscopy, me-
chanical systems and coupled electrical circuits.
We consider small oscillations around a stable equilibrium point of a mechanical sys-
tem. If this motion does not depart too far from a stable equilibrium point, then the system
resembles a set of coupled oscillators.
Two coupled pendulums
Consider a system with two pendulums of length l, connected half way down by a
spring with constant k, and unstretched length equal to the horizontal separation of the
pivot points. There are 2 DoF - use
1
and
2
as generalised coordinates. If the rigid
pendulums have all of their mass, m, contained in the bob, and the spring is massless,
then the kinetic energy is
T =
ml
2
2
_
2
1
+
2
2
_
.
Without coupling, and assuming small oscillations,
V =
mgl
2
(
2
1
+
2
2
).
The spring provides an extra potential energy,
V
coupling
=
k
2
_
l
2
_
2
(
2
1
)
2
.
Hence
L =
ml
2
2
(
2
1
+
2
2
)
1
2
_
mgl(
2
1
+
2
2
) + k
_
l
2
_
2
(
2
1
)
2
_
.
20
The
1
2
term makes life awkward, so transform to centre of mass
c
= (
1
+
2
)/2 and
relative
r
=
2
1
coordinates. Dening
0
=
_
g/l, and the coupling constant =
kl
4mg
,
L = ml
2
_
_
2
c
2
0
2
c
_
+
1
4
_
2
r
2
0
(1 + 2)
2
r
_
_
.
The Euler-Lagrange equations yield two separated harmonic oscillator EoMs:
C
+
2
0
C
= 0
and
r
+
2
0
(1 + 2)
r
= 0.
In a system of N coupled oscillators there always exist N generalised coordinates that
separate, yielding N independent, homogeneous, linear differential equations.
Those generalised coordinates undergoing simple harmonic motion are called normal
coordinates.
A more general recipe
1. Find an equilibrium conguration, i.e., values of the generalised coordinates where
all generalised forces F
k
= V/q
k
are = 0.
2. Taylor expand the Lagrangian L to second order in the generalised coordinates and
velocities, around values for the generalised coordinates q
k
given by the equilibrium
conguration, with the values of the generalised velocities q
k
set = 0.
3. For N DoF, the Taylor expansion must in principal be in 2N variables (all generalised
coordinates and velocities). In practice, expansions in fewer variables may be ade-
quate, e.g. if some terms are already in quadratic form.
Matrix form of L
If a
k
are the equilibrium values of an initially used set of generalised coordinates q
k
, then
we can dene a new set of generalised coordinates q
k
= q
k
a
k
.
These q
k
will have equilibrium values = 0, and L can then take the form
L = q
T
q
..
T
q
T
q
..
V
+c (a constant),
where q is a column vector, and its transpose q
T
= (q
1
, . . . , q
N
) is a row vector.
The matrices and are symmetric (i.e., equal to their transposes, so that e.g.
kj
=
jk
), with matrix elements given by
jk
=
1
2
2
T
q
j
q
k
q
j
, q
k
=0
,
jk
=
1
2
2
V
q
j
q
k
q
j
,q
k
=0
.
21
Equations of Motion
Note that partial differentiation of the Lagrangian L with respect to the generalised coor-
dinates and velocities yields
L
q
j
= 2
N
k=1
jk
q
k
= 2( q)
j
,
L
q
j
= 2
N
k=1
jk
q
k
= 2( q)
j
,
and that the Euler-Lagrange equations therefore yield a linear systemof differential equa-
tions: q + q = 0.
22
9 Normal Modes
SEE ALSO HAND AND FINCH, 9.39.5
The EoM for a system of coupled oscillators can be written compactly in matrix nota-
tion as q + q = 0.
Working Towards a Solution
Inserting a trial solution q = be
it
into the EoM yields
(
2
)b = 0. (14)
We also know (due to a theorem from linear algebra) that, unless b = 0, this can only be
fullled if |
2
+ | = 0.
(This strictly only represents an eigenvalue problem if is a multiple of the identity
matrix. If different masses are associated with the different generalised velocities, then
a variable transformation
i
=
m
i
q
i
recties this. In practice, we can treat the solutions
to Eq. (14) as eigenvectors with their associated eigenvalues.)
Calculating the determinant will yield an Nth order polynomial in
2
. Hence, there
are N generalised values
2
j
(possible values of
2
), each with its corresponding vector b
j
that will solve Eq. (14).
Returning to the example of the coupled pendulums, before we used our intuition to
convert from (
1
,
2
) to (
c
,
r
), we had
L =
ml
2
2
(
2
1
+
2
2
)
1
2
_
mgl(
2
1
+
2
2
) + k
_
l
2
_
2
(
2
1
)
2
_
=
mgl
2
_
1
2
0
(
2
1
+
2
2
) ((1 + )(
2
1
+
2
2
) (
1
2
+
2
1
))
_
,
where
0
=
_
g/l and =
kl
4mg
. We can thus infer that
2
=
mgl
2
_
0
0
_
,
where =
2
2
0
, and
=
mgl
2
_
1 +
1 +
_
The values of for which the system oscillates at a single frequency are thus
2
+
= (1 + )
2
2
= 0,
i.e.
(1)
= 1 and
(2)
= 1 + 2, which correspond to
(1)
=
0
and
(2)
=
0
_
1 + 2.
To nd the mode vectors associated with these modes of oscillation, substitute
(j)
back
into Eq. (14). This gives
b
(j)2
b
(j)1
=
1 +
,
from which, b
(1)1
= b
(1)2
and b
(2)1
= b
(2)2
.
23
Normal Coordinates
It is common to normalise the mode vectors b
(j)
, i.e., to have b
T
(j)
= (b
(j)1
, b
(j)2
, . . . , b
(j)N
)
such that
_
N
k=1
b
2
(j)k
= 1. Using the vector elements, we may dene a new set of sepa-
rated generalised normal coordinates r
j
via
r
j
=
N
k=1
b
(j)k
q
k
.
Note that, for the coupled pendulum example, r
1
(
1
+
2
) and r
2
(
2
1
).
If
2
j
> 0, the equilibrium conguration is stable, and the normal coordinate r
j
evolves
as r
j
(t) = r
j
(0) cos(
j
t) + ( r
j
(0)/
j
) sin(
j
t). Such motion is called a normal mode, where
all parts of the system oscillate with the same frequency. Motion in general can be de-
scribed by a superposition of normal modes.
If
2
j
< 0, the equilibrium is unstable, and if
2
j
= 0, the motion is also not oscillatory,
usually corresponding to e.g. rotation or translation of the centre of mass.
A normal mode is the motion of a normal coordinate.
Mode orthogonality
As we have effectively been nding eigenvectors of a real symmetric matrix, they will be
orthogonal, ie
b
T
i
b
j
= 0 if i = j.
1. If
i
s are all different, then b
i
s are unique.
2. If some
i
s are the same, then there is a choice of b
i
s, but can always nd or-
thogonal b
i
s (if b
i
and b
j
are eigenvectors with the same , then so is any linear
combination b
i
+ b
j
).
The b
i
s are the natural axes in N-dimensional space. With respect to them, the trans-
formed potential energy,
2
)L must = 0, i.e.,
L
x
1
+
L
x
2
= 0,
L
y
1
+
L
y
2
= 0,
L
z
1
+
L
z
2
= 0.
From the Euler-Lagrange equations for x
1
and x
2
(and equivalently for y and z):
d
dt
_
L
x
1
_
+
d
dt
_
L
x
2
_
=
L
x
1
+
L
x
2
= 0.
L/ x
1
= p
1x
denes the canonically conjugate momentum p
1x
to x
1
(here this is equal to the
mechanical momentum m x
1
, but this need not in general be so). Hence,
P = p
1
+ p
2
= 0, and
so the total momentum P is conserved.
Ignorable Centre of Mass (CoM) Motion and Conservation of Momentum
Conservation of the total momentum P implies that the CoM [coordinate R = (m
1
r
1
+
m
2
r
2
)/(m
1
+ m
2
)] moves like a free particle.
Rewrite L in terms of R and the relative coordinate r = r
1
r
2
:
L =
1
2
M
R
2
+
1
2
r
2
V
12
(r),
25
where M = m
1
+ m
2
(total mass) and = m
1
m
2
/(m
1
+ m
2
) (the reduced mass).
R does not appear in the Lagrangian L. Such coordinates are called ignorable. For
any ignorable generalised coordinate q, it follows from the Euler-Lagrange equation that
d(L/ q)/dt p
q
= 0, and hence that P is conserved in this case.
Rotational Invariance and Conservation of Angular Momentum
If we transform to the centre of mass frame (a frame moving with the CoM), then R =
R = 0, and L = (1/2) r
2
V
12
(r). Note that V
12
(r) is dependent on distance only, not
direction, and hence that L is rotationally invariant.
In terms of spherical coordinates (x = r cos sin , y = r sin sin , z = r cos ), we de-
duce that r
2
= x
2
+ y
2
+ z
2
= r
2
+r
2
2
+r
2
2
sin
2
. Hence,
L =
1
2
[ r
2
+r
2
2
+r
2
2
sin
2
] V
12
(r). (15)
is ignorable, as it does not appear in L. Therefore
p
= r
2
sin
2
= constant J
z
..
can show to be the z component of J = r r
Without loss of generality, we can choose the coordinates such that the total angular
momentum, J, lies along the z axis. Therefore J is always constant.
Motion Takes Place in a Plane
As J = r r, both p r and r must be perpendicular to J.
Hence, p and r must be in a plane perpendicular to J, and remain in that plane if J is
to remain constant.
Choose J to lie along the z direction, so that the angle of elevation = /2,
= 0,
and J = |J| = J
z
= r
2
.
Equal Areas are Swept out in Equal Times (Keplers Second Law)
d
r
r+dr
dr
{
height h
Consider motion in the = /2 plane. If the position vector changes incrementally
from r to r + dr, with the angle changing by d, then the area swept out, dA, is given by
the area of the triangle formed by r, dr, and r + dr.
By straightforward trigonometry dA = (1/2)|r + dr||r| sin(d). Keeping only rst-order
terms, dA = (1/2)r
2
d.
It follows that
dA
dt
=
1
2
r
2
=
J
2
. (16)
26
J is conserved, so the rate at which area is swept out is constant.
Keplers second law is therefore a consequence of angular momentum conservation,
itself a consequence of considering a central force, and has nothing specically to do
with gravity as such.
Equivalent 1D Problem
We can remove and entirely from the Lagrangian.
Following Eq. (15), we can write E = T + V = (1/2)[ r
2
+ r
2
2
+ r
2
2
sin
2
()] + V
12
(r),
which, if we make use of
= 0, = /2,
= J/r
2
, simplies to
E =
1
2
r
2
+
J
2
2r
2
+V
12
(r).
. .
V
eff
(r)
(17)
The effective potential V
eff
(r) combines the interaction potential with the so-called angu-
lar momentum barrier (responsible for the centrifugal force).
The total energy E is the same as that corresponding to an effective Lagrangian L
eff
=
(1/2) r
2
V
eff
, from which the Euler-Lagrange equation for r yields
r =
J
2
r
3
V
12
r
. (18)
27
11 Gravitational Attraction
SEE ALSO HAND AND FINCH, 4.1, 4.54.6
Gravitational Interaction Potential
The potential energy between two gravitating bodies of masses m
1
and m
2
is V
12
(r) =
k/r, where k = Gm
1
m
2
and G is the gravitational constant. The EoM is then
r =
J
2
r
3
k
r
2
We would like to solve this equation to determine the orbits of the bodies.
The u = 1/r Transformation
It is helpful to change the dependent variable from r to u = 1/r. From J = r
2
(d/dt), we can
determine that dt = [/(Ju
2
)]d and hence that
d
dt
=
Ju
2
d
d
dr
dt
=
1
u
2
du
dt
=
J
du
d
.
Expressing the energy in the CoM frame [Eq. (17)] in terms of u and , gives
E =
J
2
2
_
_
du
d
_
2
+ u
2
_
ku. (19)
Differentiating with respect to (note that E is constant, i.e., dE/d = 0),
J
2
2
_
2
du
d
d
2
u
d
2
+ 2u
du
d
_
k
du
d
= 0,
from which
d
2
u
d
2
+ u =
k
J
2
. (20)
This can be solved to give (1/r) u = (k/J
2
) + Bcos (B is an integration constant, and we
have set a second integration constant, equivalent to an arbitrary phase added to , to 0).
Elliptical Orbits (Keplers First Law)
Dene p J
2
/k and pB. Then putting the solution for u into Eq. (19) yields
E =
k
2
2J
2
(
2
1) =
k
2p
(
2
1).
If 0 < 1, the energy E is negative, implying a bound state.
The solution to Eq. (20) can be written as
p = r + r cos .
Expressed in cartesian coordinates, this becomes p =
_
x
2
+ y
2
+ x, or equivalently
28
(1
2
)x
2
+ 2px + y
2
p
2
= 0. (21)
Restricting ourselves to 0 < 1, Eq. (21) describes an ellipse (Keplers rst law):
1
a
2
_
x +
p
1
2
_
2
+
y
2
b
2
= 1, (22)
where the semimajor axis a = p/(1
2
) and the semiminor axis b = p/
1
2
(with
these quantities dened, the energy takes the simple form E = k/2a).
is the eccentricity of the elliptical orbit. = 0 for a circular orbit, = 1 describes a
parabolic trajectory and > 1 for a hyperbola.
Keplers Third Law
From Keplers second law dA/dt = J/(2) [Eq. (16)], the period of an elliptical orbit
follows from the total area A = ab of that orbit , i.e., = A2/J. A little algebra yields
= 2
1
G(m
1
+ m
2
)
a
3/2
Keplers third law states that
2
a
3
with the same proportionality constant for all the
planets. As the mass of the sun dominates the masses of the individual planets, this is
approximately true.
Bertrands Theorem
For force laws of the form
F cr
,
only = 2 or = 1 give rise to closed, non-circular orbits. (Note that a circular orbit is possible
for any .) A closed orbit is one for which the radius at a xed azimuthal angle, , is described
by a function r(). = 2 corresponds to the familiar inverse square law, where the orbit is an
ellipse and the central force originates from one focus. = 1 is Hookes law, which gives rise to
elliptical orbits that are centred on the source of the force.
The precession of the perihelion of Mercury was one hint that the inverse square law of gravity
was not the whole story. The General Theory of Relativity accurately predicts the size of the
effect.
Solving 1D Systems by Quadrature
Consider a 1D system such as a point mass in an external potential. The Lagrangian
L = (1/2)m q
2
V(q), and it has energy E = (1/2)m q
2
+V(q). This implies
q
dq
dt
=
_
2[E V(q)]
m
dt = dq
_
_
m
2[E V(q)]
_
.
The time t taken to move from q = 0 to q(t) is then
t =
_
m
2
_
q
0
dq
1
E V(q
)
. (23)
29
The elapsed time as a function of the coordinate, t(q), must then be inverted if we
wish to obtain q(t) (this may have to be carried out numerically).
A problem solved as an integral [i.e., put into integral form, as in Eq. (23)] is said to
be solved by quadrature.
30
Workshop 2: Central Forces/Rutherford Scattering
1. A particle of mass m moves in a central force eld with potential energy given by V(r). In
spherical polar coordinates, the Lagrangian of the particle can be written as
L =
1
2
m( r
2
+r
2
2
+r
2
sin
2
2
) V(r).
(a) Using the fact that the azimuthal angle, , is an ignorable coordinate, determine the as-
sociated constant of the motion, J. What does this constant represent? Express the total
energy of the particle as a function of the single variable, r, in a way that incorporates
the effective potential energy
V
eff
= V(r) +
J
2
2mr
2
and explain why no dependence is required.
(b) Assuming that V(r) = Ar
n+1
/(n + 1) where A > 0, for each of the cases n + 1 < 2,
2 < n + 1 < 0 and n + 1 > 0, sketch the radial variation of the effective potential.
In each case, mark the radius, r
0
, where the potential is stationary with respect to ra-
dius and a circular orbit exists, and state whether or not the orbit is stable or unstable.
Calculate an expression for the radius r
0
.
(c) By performing a Taylor series expansion of the effective potential about the point r = r
0
,
show that, for small perturbations away from a stable circular orbit, the radius of the
particle performs simple harmonic motion with an angular frequency
=
n + 3
J
mr
2
0
.
(d) By comparing this angular frequency with that for a circular orbit at radius r
0
, deter-
mine which values of n give rise to closed orbits. Describe the orbits for the cases
n = 2 and n = 1.
ADDITIONAL EXERCISE
2. Consider two positively charged point masses of (identical) mass m and charge c. Use the
Coulomb potential V
C
= [c
2
/(4
0
)](1/r) to model how they interact, and consider them
to be in free space (no external potentials).
(a) Determine the Lagrangian describing the motion of the relative coordinate r = r
1
r
2
.
Hence, determine an expression for the total energy E in the CoM frame as a function
of r = |r| and the angular momentum J.
(b) Dene an effective 1D Lagrangian L
eff
which yields the same value of E, but with
T = (1/4)m r
2
(where r = |r|). With the Euler-Lagrange equation, derive from L
eff
the equation of motion for r.
(c) Express the energy in the centre of mass frame in terms of u 1/r and the azimuthal
angle , and solve the differential equation (of u, differentiated with respect to ) re-
sulting from the expression for dE/d, to nd a relationship between r and .
(d) Express this solution in terms of cartesian coordinates x and y rather than polar coor-
dinates r and , and hence show that all trajectories must take the form of hyperbolae,
of the general form (x x
c
)
2
/a
2
y
2
/b
2
= 1, in the centre of mass frame.
31
(e) Use the identity cosh
2
sinh
2
= 1 to parametrize x and y in terms of , and hence
determine an equivalently parametrized form of r. Use this to solve the effective 1D
problem by quadrature and determine an equivalent to Keplers equation for this sys-
tem.
32
12 Noethers Theorem and Hamiltonian Mechanics
SEE ALSO HAND AND FINCH, 5.15.5
Invariance of L under Continuous Transformations
Consider a set of continuous transformations, i.e. depending on a continuously variable pa-
rameter, s (e.g. translations along or rotations about the z axis are single-variable families of
transformations).
When s = 0, this should yield the identity transformation, i.e., no change.
If q
k
(t) is a solution of the original EoM for a Lagrangian L, then Q
k
(s, t) will be a solution for the
Euler-Lagrange equations evaluated within a transformed set of coordinates, where Q
k
(0, t) =
q
k
(t).
Given that the Lagrangian L is invariant (independent of s) under this transformation,
L
L(Q
1
(s, t), . . . , Q
N
(s, t),
Q
1
(s, t), . . . ,
Q
N
(s, t), t)
=L(q
1
, . . . , q
N
, q
1
, . . . , q
N
, t).
As the Lagrangian is independent of s, dL
ds
=
N
k=1
_
L
Q
k
..
=
d
dt
_
L
Q
k
_
dQ
k
ds
+
L
Q
k
d
Q
k
ds
..
=
d
dt
_
dQ
k
ds
_
_
=
d
dt
_
N
k=1
L
Q
k
dQ
k
ds
_
= 0.
This implies the existence of a constant of the motion
I(q
1
, . . . , q
N
, q
1
, . . . , q
N
)
N
k=1
p
k
(dQ
k
/ds)|
s=0
,
which is evaluated at s = 0 for convenience.
For multiple parameters s
1
, s
2
, . . ., the procedure can be repeated for each to nd associated
constants I
1
, I
2
, . . ..
This procedure effectively describes Noethers theorem.
Noethers Theorem
If the Lagrangian is invariant under a continuous symmetry transformation, then there
are conserved quantities associated with that symmetry, one for each parameter of the
transformation. These can be found by differentiating each coordinate with respect to the
parameters of the transformation in the immediate neighbourhood of the identity trans-
formation, multiplying by the conjugate momentum, and summing over the degrees of
freedom.
Take the example of a point mass, m, moving in free space with L =
1
2
m q
2
. For trans-
formed coordinates Q = q + s, this gives
dQ
ds
s=0
= 1 I(q) = m q.
33
Legendre Transformations
Consider a function A(x, y) of two variables. Introduce a third variable z and dene B(x, y, z)
yz A(x, y). By application of the chain rule,
dB = zdy + ydz
A
x
y
dx
A
y
x
dy =
_
z
A
y
x
_
. .
vanishes, as we dene z = z(x, y)
A
y
x
dy + ydz
A
x
y
dx. (24)
For this choice of z, B has no explicit y dependence. Thus y can be written in terms of x and z
giving
B = B(x, y(x, z), z) = B(x, z) = yz A(x, y)
This is a Legendre transformation (cf thermodynamics). Furthermore, it follows from Eq. (24)
that
B
z
x
= y,
B
x
z
=
A
x
y
.
Note that the same Legendre transformation takes A(x, y) to B(x, z) as takes B(x, z) back to
A(x, y). x is called a passive variable, and y an active variable.
The Hamiltonian
A Legendre transformation of a (time-independent, 1 DoF) Lagrangian L(q, q), where the
active variable is the velocity q, the passive variable is the coordinate q, and the third
variable is the canonically conjugate momentum p, yields the Hamiltonian H:
H(q, p) p q L(q, q), where p
L
q
constant q
. (25)
Differentiating this equation and using the Euler-Lagrange equation, we nd Hamiltons
equations.
H
p
q
= q,
H
q
p
=
L
q
q
=
d
dt
L
q
q
= p.
Note that, while the Lagrangian formulation of mechanics produces N second order
ordinary differential equations (in the q
k
), the Hamiltonian treatment yields 2N rst order
ordinary differential equations (in q
k
and p
k
).
34
Hamiltons Principle
From Hamiltons principle, variations in the action S =
_
Ldt are given by S = S[q + q]
S[q] =
_
Ldt = 0. Using the relationship between L and H given by Eq. (25),
L = qp + p q
_
H
q
q +
H
p
p
_
. .
H
= qp +
d
dt
(pq) pq
H
q
q
H
p
p =
_
q
H
p
_
p
_
p +
H
q
_
q +
d
dt
(pq).
Note that d(pq)/dt vanishes on integration, as q = 0 at the endpoints of the physical path,
and that Hamiltons equations imply that the q, p terms also vanish. Hence,
_
Ldt = 0 and
independent innitesimal variations in q, p from the physical path in coordinate-momentum
space (called phase space as opposed to the conguration space of Lagrangian mechanics) do not
change the action, S.
Hamiltons Equations of Motion
For N DoF, H
N
k=1
p
k
q
k
L (where L may now be explicitly time-dependent). Therefore
include a time differential in
dH =
N
k=1
_
p
k
d q
k
+ q
k
dp
k
L
q
k
d q
k
L
q
k
dq
k
_
L
t
dt
=
N
k=1
( q
k
dp
k
p
k
dq
k
)
L
t
dt.
We can also consider time as a passive variable in the Legendre transformation, which
yields
H
t
q
1
,...,p
1
,...
=
L
t
q
1
,..., q
1
,...
dH
dt
=
H
t
=
L
t
.
If there is no explicit time dependence in L, H is a constant of the motion. If T is also a
quadratic form in the q
k
, H is the total energy E = T +V.
Finally, Hamiltons equations of motion in complete generality are:
q
k
=
H
p
k
, p
k
=
H
q
k
,
dH
dt
=
L
t
.
35
13 Canonical Transformations and Poisson Brackets
SEE ALSO HAND AND FINCH, 6.16.3
Canonical Transformations
The Lagrangian formulation of mechanics makes coordinate transformations (i.e. Q =
Q(q, t)) fairly straightforward. These types of change to new generalised coordinates and
their associated generalised velocities are commonly called point transformations.
Coordinate transformations take place in conguration space. In phase space one
can consider new coordinates Q and momenta P that are functions of the original coor-
dinates q and their momenta p (and possibly the time t):
Q = Q(q, p, t), P = P(q, p, t).
These are commonly called contact transformations.
A transformation is by denition canonical , if it preserves the structure of Hamiltons
equations for all dynamical systems. Hence, for a specic dynamical system, a new
Hamiltonian H
such that L
L
q
=
d
dt
_
q
_
dF
dt
__
q
_
dF
dt
_
=
d
dt
_
q
_
q
F
q
+
F
t
__
q
_
q
F
q
+
F
t
_
=
d
dt
_
F
q
_
_
q
2
F
q
2
+
2
F
qt
_
= 0,
i.e., the Euler-Lagrange equation derived from L. Hence, the descriptions of a physical system
provided by L and L
are equivalent.
We now consider instead a starting Lagrangian L(q, q) and a transformed Lagrangian L
(Q,
Q)
related by a similar F that is a function of both q and Q, and perhaps t, such that
L
(Q,
Q, t) = L(q, q, t)
dF(q, Q, t)
dt
(26)
( = 1 is associated with a change of units or scale transformation. We will always consider = 1)
We next integrate Eq. (26) with respect to t (and with = 1):
_
t
2
t
1
L
dt =
_
t
2
t
1
Ldt + F(q(t
1
), Q(t
1
), t
1
) F(q(t
2
), Q(t
2
), t
2
) (27)
36
Recalling Hamiltons principle, if we take variations of Eq. (27) [assuming that arbitrary vari-
ations q(t) imply arbitrary variations Q(t)] the variation of
_
t
2
t
1
Ldt vanishes. If we further
assume that F = 0 at the end points, then we can say that the variation of
_
t
2
t
1
L
dt vanishes,
and it follows that the two descriptions are equivalent.
The Generating Function
F(q, Q, t) is called a generating function. From the chain rule, the time derivative for the
generating function is dF/dt = (F/q) q + (F/Q)
Q + F/t
By construction, q does not appear explicitly in L
q
=
L
q
q
_
F
q
q +
F
Q
Q +
F
t
_
=
L
q
F
q
p =
F
q
. (28)
Similarly
L
Q
=
F
Q
P =
F
Q
. (29)
These are known as the implicit transformation equations.
With the implicit transformation equations we have two equations and two unknowns,
P(p, q) and Q(p, q). To nd an explicit form for the transformation, solve Eq. (28) to express
Q = Q(q, p, t), and then insert this relation into Eq. (29) to get P = P(p, q, t).
Extension to N > 1 DoF is in principle straightforward, although in practice the task of
determining 2N unknowns from 2N equations may be formidable.
For example, consider the generating function F = qQ, then
p =
F
q
= Q, P =
F
Q
= q.
This particular generating function essentially interchanges the coordinates and momenta.
The Transformed Hamiltonian
To nd the new Hamiltonian H
in terms of a Legendre
transformation [Eq. (25)]. Hence,
H
(Q, P, t) P
Q L
=
F
Q
Q L +
F
q
q +
F
Q
Q +
F
t
= p q L +
F
t
=H(q, p, t) +
F(q, Q, t)
t
(30)
where we have made use of both Eq. (29) and Eq. (28). Finally, p and q appearing in the
RHS of Eq. (30) must be expressed in terms of P and Q.
If there is no explicit time dependence to F, then the new Hamiltonian H
is simply the
original Hamiltonian H, rewritten by inserting the inverse of the transformation equations
expressing P and Q in terms of p and q.
37
Four Forms of Generating Function
The generating function need not be written in terms of q and Q. Any combination of
one original and one new dynamical variable may be employed, in conjunction with
appropriately modied implicit transformation equations:
F
1
(q, Q, t), p =
F
1
q
, P =
F
1
Q
;
F
2
(q, P, t), p =
F
2
q
, Q =
F
2
P
;
F
3
(p, Q, t), q =
F
3
p
, P =
F
3
Q
;
F
4
(p, P, t), q =
F
4
p
, Q =
F
4
P
.
(31)
The different forms of generating function may be related via Legendre transforma-
tions, or the implicit transformation equations can simply be determined independently
in a similar manner to that used for F = F
1
above.
Poisson Brackets
For N DoF, a Poisson bracket for two arbitrary functions F and G is dened as
{F, G}
N
k=1
_
F
q
k
G
p
k
F
p
k
G
q
k
_
For the case where N = 1,
{F, G}
F
q
G
p
F
p
G
q
.
This is just the Jacobian dened by
J
_
F, G
q, p
_
(F, G)
(q, p)
F
q
G
q
F
p
G
p
Clearly, {q, p} = 1. If we consider a Poission bracket of q and p dened with respect to the
canonically transformed dynamical variables Q and P, then [using Eq. (31) and the chain rule]
q
Q
p
P
q
P
p
Q
=
q
Q
2
F
2
Pq
+
2
F
4
Pp
p
Q
=
Q
q
q
Q
+
Q
p
p
Q
=
Q
Q
= 1,
i.e., the value of the Poisson bracket for q and p is independent of the representation. Conversely,
{Q, P} = 1 is a necessary and sufcient condition for a transformation to be canonical.
There is an equivalent chain rule for Jacobians whereby
(F, G)
(Q, P)
=
(F, G)
(q, p)
(q, p)
(Q, P)
,
38
so
F
Q
G
P
F
P
G
Q
=
F
q
G
p
F
p
G
q
.
We therefore do not need to specify a particular set of coordinates and conjugate momenta when
discussing Poisson brackets.
Poisson Bracket Formulation of Hamiltons Equations
Hamiltons equations of motion can be reformulated in terms of Poisson brackets:
q = {q, H}, p = {p, H}. For an arbitrary function F(q, p, t),
F = {F, H} +
F
t
.
39
14 Rotating Reference Frames
SEE ALSO HAND AND FINCH, 7.17.7
Accelerating Reference Frames
One can only specify the position r
S
of a point mass with respect to a chosen reference
frame S.
One can equally choose another reference frame B to specify the position r
B
of the
same point mass, displaced by R from S. The positions of the point mass relative to the
two frames are then related by r
B
= r
S
R.
If S is an inertial frame where Newtons laws hold (xed in Space), but
R = 0, then B
is not an inertial frame (attached to an accelerating Body). Hence, in frame S, m r
S
= F,
but in frame B,
m r
B
= m( r
S
R) = F
..
true force
m
R.
..
ctitious force
Rotated Reference Frames in 2D
(a)
x
S
y
S
x
B
y
B
r
(b)
x
B
y
B
r
(c)
x
y
r
S
r
B
(a) In frame S, the position vector r has positive x and y components. A new xed
reference frame B is dened with respect to an original reference frame S by an
anticlockwise rotation through angle .
(b) In frame B, r has a positive x component and a negative y component.
(c) An observer who changes their observation frame from S to B nds that r appears
to have rotated clockwise through an angle , from r
S
to r
B
.
40
Rotations in 3D
O
N
P
Q
n
N
P
Q
r
S
n
V
Figure 1: Left: Overall and Right: top views of a general clockwise rotation of a vector OP = r
S
through an angle about an axis dened by the unit vector n, to become OQ = r
B
. This is what
we would observe if the reference frame we were in rotated in an anticlockwise direction.
We wish to determine OQ = ON + NV +VQ. Firstly,
ON = (n r
S
)n (component of r
S
in n direction).
Noting that OP = ON + NP NP = r
S
(n r
S
)n, and that NQ has the same magnitude as
NP, it follows that
NV =
NP
|NP|
|NQ| cos() = [r
S
(n r
S
)n] cos()
We note that r
S
n has direction perpendicular to r
S
= OP and n (as does VQ), and magnitude
|r
S
n| = |NQ|. Hence,
VQ =
(r
S
n)
|r
S
n|
|NQ| sin() = (r
S
n) sin().
Gathering the terms calculated above, we have
OQ
..
r
B
=
ON
..
(n r
S
)n +
NV
..
[r
S
(n r
S
)n] cos() +
VQ
..
(r
S
n) sin(),
which is generally rearranged slightly to obtain the rotation formula:
r
B
= r
S
cos() + (n r
S
)n[1 cos()] + (r
S
n) sin(). (32)
41
Innitesimal Rotations
Consider a point xed in frame S at r
S
= r. If frame B is innitesimally rotated w.r.t. S by
d, then r
B
= r + dr ( dr = r
B
r
S
). Taking the small-angle limit (i.e. = d), the rotation
formula [Eq. (32)] transforms to r + dr = r + (r n)d.
Hence, (noting that for any two vectors U V = V U) the velocity of the point in
reference frame B relative to that in frame S is given by:
_
dr
dt
_
in B
=
_
n
d
dt
_
r =
..
angular velocity
r.
Note that even if r is constant in reference frame B, it is time dependent if viewed from
reference frame S.
Velocity and Acceleration
If the vector r does have a time-dependence in reference frame S (e.g. represents the
position of a moving particle), then this must also be accounted for:
_
dr
dt
_
in B
. .
velocity in B
=
_
dr
dt
_
in S
. .
velocity in S
r.
. .
velocity due to motion of B relative to S
This equation is completely general for any vector A, so
_
dA
dt
_
in B
=
_
dA
dt
_
in S
A.
We can use the operator
_
d
dt
_
in B
=
__
d
dt
_
in S
_
in conjunction with v
B
= v
S
r to determine the acceleration in the non-inertial frame
B.
_
dv
B
dt
_
in B
=
__
d
dt
_
in S
_
v
B
=
_
d
dt
_
in S
(v
S
r) v
B
= a
S
r v
S
v
B
= a
S
r (v
B
+ r) v
B
= a
S
2 v
B
( r) r
Inertial Forces in a Rotating Frame
If considering motion of a point mass, then multiplying
_
dv
B
dt
_
in B
by m and noting that
F = m(dv
S
/dt)
S
, we deduce that
m r = F 2m r
. .
coriolis force
m ( r)
. .
centrifugal force
m r.
. .
Euler force
42
15 Inertial Forces on Earth
SEE ALSO HAND AND FINCH, 7.77.10
Procedure for Determining a Local Coordinate Systems
x
z
y
z
y
Figure 2: Left: Overall view of a local coordinate system in the region of a point on the surface of
the spherical earth. The positive x direction is east, the positive y direction is north, the positive z
direction is out from the centre of the earth. Right: Cross-sectional view, showing the local y and z
axes only (the x axis points perpendicularly into the page). is the latitude, and the colatitude
(in spherical coordinates, the angle of elevation).
1. Model the Earth as a perfect, solid sphere, and require an appropriate coordinate
system to describe motion near a point on its surface. Start with a cartesian coordi-
nate systemwith the z axis running through the poles, and the x and y axes emerging
through the equator [in this frame = (0, 0, )].
2. Rotate this coordinate system around its x axis so that the z axis emerges at a chosen
point on the Earths surface. In these new coordinates, this looks like a rotation of
about the x axis, thus the angular velocity in the local frame is
=
_
_
1 0 0
0 cos sin
0 sin cos
_
_
_
_
0
0
_
_
=
_
_
0
sin
cos
_
_
3. Finally, we displace the origin of the coordinate system by R to the point where the
local z axis emerges from the Earths surface (where R = |R| is the radius of the
Earth).
43
Inertial Forces in the Local Coordinate System
Note that the derivation of centrifugal, coriolis, and Euler force terms assumes r to be
dened with respect to a point on the axis of rotation.
The nal displacement of the local coordinate system origin means the associated
position vector r
is the
position vector associated with the local coordinate system.
Centrifugal Force m ( r)
The direction of the centrifugal force can be determined quite generally: r is perpen-
dicular to , r, and ( r) is perpendicular to both and r.
Hence, the centrifugal force m ( r) points directly away from the axis of rota-
tion, meaning it can counteract gravity.
Effective Gravity on Earths Surface
(a) (b) (c)
(a) Direction and magnitude of the force due to earths gravitation on the surface of a
spherical earth model.
(b) Direction of the centrifugal force on the surface of a spherical earth model (magni-
tude relative to the gravitational force greatly exaggerated for effect).
(c) Effect of adding the gravitational and centrifugal forces together. The net force for
a static object does not point exactly to the centre of the earth except at the poles
and the equator.
Coriolis Force 2m r
In the local coordinate system dened for the surface of the Earth, the coriolis force is
given by
44
2m r
_
.
Use this to show that the force measured in the rotating frame satises
m r = F m ( r) 2m r m r,
where r represents the position measured in frame B, and F is the force in the inertial
frame.
In which directions do the three inertial forces point for the falling stone?
The rate at which the Earths rotation is decreasing is very small, so you should now
assume = 0.
(b) Ignoring inertial forces, nd expressions for the speed and height of the stone as a
function of time, and the time taken before the stone hits the ground, t
0
. What condition
must be satised for the inertial forces to be unimportant in determining t
0
?
(c) Using the Coriolis force and the approximation that it always acts in the same direction,
how far, in terms of t
0
, does the stone land from the point beneath the helicopter, and
in which direction?
(d) Solve the same problemby using the conservation of angular momentum, and compare
your answer with that from (c).
ADDITIONAL EXERCISES
2. A Foucault pendulum is a heavy weight suspended by a cable from a bearing that does not
restrict the motion (unlike e.g. a typical clock pendulum, the motion of which is constrained
to a plane), thus making it sensitive to the earths angular velocity .
If the oscillation amplitude is small compared to the length of the pendulum l (small angle
approximation), then we can neglect motion in the vertical (z) direction, and approximate
the relevant forces in the (x, y) plane tangential to the earths surface by F = m
2
0
(x +y ),
where
2
0
= g/l, at the earths surface.
(a) Using
m r = F m ( r) 2m r m r,
derive expressions for x and y for the Foucault pendulum in the appropriate local co-
ordinate system ( pointing east, pointing north,
k pointing out from the centre of the
earth).
(b) In what follows we will neglect the centrifugal force terms. This is acceptable if
0
is
signicantly larger than ||. Why?
46
(c) Determine the general solution of the motion of the pendulum bob in the plane tan-
gential to the earths surface (to the assumed level of approximation) in terms of the
initial position and velocity. The mathematics will be easier if you consider the time
evolution of the complex number (t) = x(t) + iy(t). Also, you may like to dene
= cos . Make sure that you consistently drop all terms of the same size or smaller
than the neglected centrifugal force terms (to do this you will need the approximate
identity
1 + 1 + /2 for small ).
(d) The pendulum is allowed to start oscillating after being held at an amplitude A due
east from the origin. What is the position of the pendulum bob after each complete os-
cillation? Determine how many such oscillations must elapse before the pendulum bob
(approximately) returns to its original position. How many hours does this correspond
to:
i. At the north pole;
ii. At the equator;
iii. In Durham ( latitude 54
47
north)?
3. What angular velocity would be required for your effective weight at the equator to be half
that at the north pole? (Take the earths radius to be 6378 km). How many minutes would a
full rotation of the earth take?
47
16 Rotational Inertia, Angular Momentum and Kinetic Energy
SEE ALSO HAND AND FINCH, 8.18.3
DoF of Rigid Bodies
A rigid body is a system of mass points (or a continuum) constrained so that distances
between all points are constant (e.g. connected by massless rigid rods). A system of 2
connected point masses has 5 DoF, and a system of 3 connected point masses has 6 DoF.
Connecting a further point mass (e.g. by massless rigid rods) to 3 original point
masses completely species its position relative to the rigid body, and contributes 3 coor-
dinates 3 constraints = 0 additional DoF. Any rigid body more complex than two con-
nected point masses has 6 DoF.
These 6 DoF correspond to a translation (3 DoF) and a rotation (2 DoF to dene a
rotation axis and 1 DoF for the rotation).
Moment of Inertia I about an Axis of Rotation
Consider N mass points, rotating with angular velocity anticlockwise about an axis
of rotation (assumed to pass through the origin), where the direction of the axis of rotation
is dened by the unit vector n (i.e., the angular velocity vector = n).
Each mass point has a moment of inertia that is the square of its distance from the axis
of rotation multiplied by its mass m. For N mass points, I is the sum of such terms. Hence,
I =
N
k=1
m
k
|(r
k
n)n r
k
|
2
=
k=1
m
k
[r
2
k
(r
k
n)
2
].
48
Angular Momentum J and Rotational Kinetic Energy
The angular momentum (about the origin) of a system of mass points is given by J =
N
k=1
r
k
p
k
=
N
k=1
m
k
r
k
r
k
, (if the point about which we calculate J is R = 0, then we replace r
k
, r
k
by
r
k
, r
k
, where r
k
= r
k
R).
If rigid body constraints are imposed, any velocity r
k
is due to rotation only.
r
k
is the velocity of the kth mass point due to its anticlockwise rotation.
Using the vector identity a (b c) = (a c)b (a b)c and recalling = n,
J =
N
k=1
m
k
[r
k
( r
k
)] =
N
k=1
m
k
[r
2
k
(r
k
)r
k
]
J n =
N
k=1
m
k
[r
2
k
(r
k
n)
2
] = I.
Similarly, the kinetic energy reduces to T =
1
2
N
k=1
m
k
r
2
k
=
2
2
N
k=1
m
k
|n r
k
|
2
. Noting that
|n r
k
| = |(r
k
n)n r
k
|, the rotational kinetic energy takes a very simple form:
T =
2
2
N
k=1
m
k
|(r
k
n)n r
k
|
2
=
I
2
2
.
Denition of the Inertia Tensor
I
From J =
N
k=1
m
k
[r
2
k
(r
k
)r
k
], we infer that
J
x
=
N
k=1
m
k
(r
2
k
x
2
k
)
x
m
k
x
k
y
k
y
m
k
x
k
z
k
z
,
J
y
=
N
k=1
m
k
(r
2
k
y
2
k
)
y
m
k
y
k
x
k
x
m
k
y
k
z
k
z
,
J
z
=
N
k=1
m
k
(r
2
k
z
2
k
)
z
m
k
z
k
x
k
x
m
k
z
k
y
k
y
,
and use the linear transformation relating the angular velocity vector to the angular
momentum vector J to dene the inertia tensor
I:
_
_
J
x
J
y
J
z
_
_
. .
J
=
_
_
I
xx
I
xy
I
xz
I
yx
I
yy
I
yz
I
zx
I
zy
I
zz
_
_
. .
I
_
_
z
_
_
. .
with I
=
N
k
m
k
(r
2
k
r
k,
r
k,
), where
=
_
1 if =
0 if =
For continuous rigid bodies I
=
_
volume
dxdydz(x, y, z)(r
2
), where is a mass
density.
49
Generalised Description of the Rotational Kinetic Energy
Using the vector identity a (b c) = b (c a) = c (a b), the rotational kinetic energy can
be phrased more generally in terms of the moment of inertia tensor as
T =
1
2
N
k=1
m
k
r
2
k
=
1
2
N
k=1
m
k
r
k
( r
k
) =
1
2
_
N
k=1
m
k
(r
k
r
k
)
_
.
. .
J =
I
Hence T = (1/2) (
I) = (
2
/2)n
T
In, where n
T
In is the moment of inertia I about the axis
dened by n.
Centre of Mass (CoM)
The centre of mass position and velocity for a system of N mass points (mass of kth point
= m
k
) are dened by (total mass M =
N
k=1
m
k
):
R
C
=
N
k=1
m
k
r
k
M
,
R
C
=
N
k=1
m
k
r
k
M
.
It immediately follows that the total momentum P =
N
k=1
m
k
r
k
= M
R
C
is equivalent to the
momentum of a single particle of mass M =
N
k=1
m
k
moving with the CoM.
Separating the CoM Angular Momentum and Kinetic Energy
The total angular momentum about some point (situated at the origin of the coordinate system)
is J =
N
k=1
m
k
(r
k
r
k
).
Let r
k
= r
k
+ R
C
. Hence,
J =
N
k=1
m
k
(r
k
+ R
C
) ( r
k
+
R
C
)
=
N
k=1
m
k
(r
k
r
k
+ R
C
r
k
+r
R
C
+ R
C
R
C
)
=
N
k=1
m
k
(r
k
r
k
) + MR
C
N
k=1
m
k
r
k
M
+
N
k=1
m
k
r
k
M
. .
CoM velocity and position with respect to CoM, i.e., = 0
M
R
C
+ MR
C
R
C
.
J =
N
k=1
m
k
(r
k
r
k
) + MR
C
R
C
can be expressed as the angular momentum about the CoM
plus the angular momentum of the CoM about the origin.
Similarly, the CoM kinetic energy, and that for motion with respect to the CoM can be separated:
50
T =
1
2
N
k=1
m
k
r
2
k
=
1
2
N
k=1
m
k
r
k
r
k
=
1
2
N
k=1
m
k
( r
k
+
R
C
) ( r
k
+
R
C
)
=
1
2
N
k=1
m
k
r
2
k
+
R
C
k=1
m
k
r
k
. .
= 0
+
1
2
M
R
2
C
.
In general, if the external forces acting on a rigid body act through the CoM (e.g. gravity), then
the CoM motion can be considered separately.
51
17 The Parallel and Principal Axis Theorems
SEE ALSO HAND AND FINCH, 8.2,8.7
Displaced Axis Theorem
In a similar fashion to the angular momentum and rotational kinetic energy, one can insert the
CoM position R
C
into the elements of the Inertia tensor I
. With r
k
= r
k
+ R
C
,
I
=
N
k=1
m
k
(r
2
k
r
k,
r
k,
) =
N
k=1
m
k
[|r
k
+ R
C
|
2
(r
k,
+ R
C,
)(r
k,
+ R
C,
)]
=
N
k=1
m
k
[(r
2
k
+ 2r
k
R
C
+ R
2
C
)
(r
k,
r
k,
+r
k,
R
C,
+ R
C,
r
k,
+ R
C,
R
C,
)]
=
N
k=1
m
k
(r
2
k
k,
r
k,
) + M(R
2
C
R
C,
R
C,
)
+ 2R
C
k=1
m
k
r
. .
=0
R
C,
N
k=1
m
k
r
k,
. .
=0
R
C,
N
k=1
m
k
r
k,
. .
=0
We have used the fact that the CoM position with respect to the CoM is = 0.
Hence, the inertia tensor of a rigid body, dened with respect to rotations about the
origin can be related to the inertia tensor of the same rigid body, dened with respect to
rotations about its CoM, through
I =
I
CoM
+ M
A,
where
A can be represented by a matrix, the elements of which are determined by the
elements of the CoM position vector: A
= R
2
C
R
C,
R
C,
. This is the displaced axis
theorem.
Parallel Axis Theorem
Consider a moment of inertia I
COM
about an axis of rotation passing through the CoM.
This is equivalent to considering a diagonal element of some representation of the inertia
tensor.
From the displaced axis theorem, the moment of inertia I about a parallel axis is
I = I
CoM
+ Md
2
,
where d is the distance of parallel axis from the axis of rotation passing through the CoM.
This is the parallel axis theorem.
52
Symmetric Nature of the Inertia Tensor
The inertia tensor
I is dened for a particular rigid body, and for rotations of that body
about a particular point (frequently the centre of mass).
In any representation (e.g. for some set of cartesian axes, conveniently chosen with
the origin at the point of rotation) the inertia tensor is represented by a symmetric matrix,
i.e., I
= I
. We say
I is a symmetric tensor.
The matrix elements depend on the chosen representation (the tensor is dened rep-
resentation free). We wish to know if there is a diagonal matrix representation of the
inertia tensor.
Orthogonal Matrices and Orthogonal Diagonalization
The inverse
P
1
of a matrix
P is dened such that
P
1
P =
P
P
1
=
1, the identity matrix. A
matrix
P for which an inverse
P
1
exists is called invertible.
A real invertible square matrix
P such that its inverse
P
1
=
P
T
, its transpose, is said to be an
orthogonal matrix. (So-called because both its rows and columns form orthonormal sets).
A general 2 2 orthogonal matrix has a single free parameter , and describes a 2-dimensional
rotation when acting on a 2-dimensional column vector:
P =
_
cos sin
sin cos
_
.
The inverse matrix describes a rotation back in the opposite direction.
Similarly, in 3 dimensions, one can consider separate rotations about each axis:
P
x
=
_
_
1 0 0
0 cos sin
0 sin cos
_
_
,
P
y
=
_
_
cos 0 sin
0 1 0
sin 0 cos
_
_
,
P
z
=
_
_
cos sin 0
sin cos 0
0 0 1
_
_
.
(
P
z
P
y
P
x
)
T
P
z
P
y
P
x
=
P
T
x
P
T
y
P
T
z
P
z
P
y
P
x
=
1, and the product
P
z
P
y
P
x
is a way of formulating a gen-
eral 3 3 orthogonal matrix, describing a general rotation in 3-dimensions.
A square matrix
A is called orthogonally diagonalizable if there is an invertible matrix
P such that
P
1
A
P =
P
T
A
P =
D is diagonal;
P orthogonally diagonalizes
A. If
A is an N N matrix, then
the following are equivalent. (a)
A is orthogonally diagonalizable; (b)
A has an orthonormal
set of N eigenvectors; (c)
A is symmetric.
Consider the 3 3 case. Let us say that a 3 3 matrix
A has 3 orthonormal eigenvectors p
1
=
(p
11
, p
21
, p
31
), p
2
= (p
12
, p
22
, p
32
), p
3
= (p
13
, p
23
, p
33
) with corresponding eigenvalues
1
,
2
,
3
.
Let
P be the matrix whose columns are these eigenvectors:
P =
_
_
p
11
p
12
p
13
p
21
p
22
p
23
p
31
p
32
p
33
_
_
This implies that
53
A
P =
_
_
1
p
11
2
p
12
3
p
13
1
p
21
2
p
22
3
p
23
1
p
31
2
p
32
3
p
33
_
_
=
_
_
p
11
p
12
p
13
p
21
p
22
p
23
p
31
p
32
p
33
_
_
. .
P
_
_
1
0 0
0
2
0
0 0
3
_
_
. .
D
.
Since the columns of
P are orthonormal,
P is an orthogonal matrix. Hence,
A
P =
P
D
P
1
A
P =
P
T
A
P =
D, i.e.,
A is orthogonally diagonalizable.
To help see that
A must be symmetric, i.e., that
A
T
=
A, we use
D =
P
1
A
P =
P
T
A
P
P
D
P
1
=
P
D
P
T
=
A. Hence,
A
T
= (
P
D
P
T
)
T
=
P
D
T
P
T
=
P
D
P
T
=
A.
Principal Axis Theorem
I has three eigenvalues (which may be repeated). These are the principal moments of
inertia, called I
1
, I
2
, I
3
.
I must also have three orthonormal eigenvectors (in the case of repeated eigenval-
ues, there is some freedom in choosing what form the corresponding eigenvectors should
take). The directions the eigenvectors point in determine the principal axes, labelled 1,
2, 3.
It is always possible to nd a rotated coordinate system such that the representation of
I is diagonal. The diagonal elements are the principal moments of inertia (eigenvalues of
J =
N
k=1
m
k
( r
k
r
k
)
. .
= 0, as a a = 0 for any vector a
+
N
k=1
m
k
(r
k
r
k
) =
N
k=1
r
k
F
k
.
. .
total torque N
Hence, we can say, viewing the rigid body from outside (space axes/coordinate system)
(dJ/dt)
S
= N.
Recalling that, for any vector a, (da/dt)
B
= (da/dt)
S
a, viewing the rigid body
while travelling with its rotational motion (body axes/coordinate system) we can say that
(dJ/dt)
B
+ J = N.
We assume a coordinate system dened by the principal axes, i.e., the decomposition
J = (I
1
1
, I
2
2
, I
3
3
), and thus derive Eulers equations of motion:
I
1
1
3
(I
2
I
3
) = N
1
,
I
2
2
1
(I
3
I
1
) = N
2
,
I
3
3
2
(I
1
I
2
) = N
3
.
55
Torque-Free Motion
If there are no external torques (e.g. a freely spinning object moving through the Earths
gravitational eld), then we can set N
1
= N
2
= N
3
= 0.
Note that the total angular momentum (about the CoM) and the total kinetic energy
(associated with motion about the CoM) are both conserved.
J
2
= J
2
1
+ J
2
2
+ J
2
3
describes a sphere in angular momentum space, and T = J
2
1
/2I
1
+
J
2
2
/2I
2
+ J
2
3
/2I
3
describes an ellipsoid.
Where the sphere and ellipsoid intersect shows the possible trajectories for the angular
momentum components J
k
=
k
I
k
.
Figure 4: Left: Contours of constant T on a sphere described by J
2
= 1, for a system where
I
1
> I
2
> I
3
. Rotation about principal axis 2 is unstable, rotation about principal axes 1 and 3 is
stable. Right: Equivalent diagram for the case where I
1
= I
2
> I
3
. Principal axes 1 and 2 can be
rotated freely about principal axis 3, and there are no points on the J
2
= 1 sphere about which
the rotation is unstable.
56
A Second-Order Ordinary Linear Differential Equations with
Constant Coefcients
A.1 Denition
A second-order homogeneous ordinary linear differential equation with constant coefcients has
the form:
a
d
2
x
dt
2
+ b
dx
dt
+ cx = 0 (33)
This has been written to appear to be a differential equation of a position variable with respect to
time, as this is most obviously relevant for classical mechanics.
A.2 Solution
A.2.1 Auxiliary Equation
The recipe for solving such a linear differential equation begins by substituting e
t
for x. Thus
a
2
e
t
+ be
t
+ ce
t
= 0 (34)
which, cancelling e
t
everywhere, becomes
a
2
+ b + c = 0. (35)
This is the auxiliary equation. One can then solve the auxiliary equation by nding the roots from
the quadratic formula:
=
b
b
2
4ac
2a
. (36)
Thus
(x
+
)(x
) = 0, (37)
i.e., assuming the roots are not repeated, there are 2 distinct values of solving the auxiliary
equation, which may be complex or imaginary.
A.2.2 General Solution (Roots not Repeated)
The general solution is then a linear superposition of the two particular solutions e
+
t
and e
t
,
i.e.,
x(t) = A
+
e
+
t
+ A
t
(38)
where the A
+
, A
(39)
and the initial velocity, through
x(t) = A
+
+
e
+
t
+ A
t
x(0) = A
+
+
+ A
. (40)
The roots
+
and
may be imaginary if this is the case and x is a real quantity (such as po-
sition) then A
+
and A
must have (complex) values such that the overall imaginary contribution
is zero.
I
A.3 Possible Complications
A.3.1 Repeated Roots
If
+
=
= , then e
t
solves the differential equation, and a second linearly independent
solution is given by te
t
. The general solution is thus given by
x(t) = Ae
t
+ Bte
t
. (41)
A.3.2 Additional Constant
If the differential equation has the form
a
d
2
x
dt
2
+ b
dx
dt
+ cx + k = 0 (42)
then this can be rephrased as
a
d
2
x
dt
2
+ b
dx
dt
+ c
_
x +
k
c
_
= 0. (43)
One can then dene a new variable x = x + k/c, for which it follows
a
d
2
x
dt
2
+ b
d x
dt
+ c x = 0. (44)
One then nds the general solution for x(t), as shown above, and the general solution for x(t)
from it, by x(t) = x(t) k/c.
A.4 Representing Position in a Plane by a Single Complex Number
If we have a situation where, for example, we have coupled motion in the x, y plane, i.e., the x
and y variables cannot be separated:
d
2
x
dt
2
+ a
x
dx
dt
+ b
x
dy
dt
+ c
x
x + d
x
y + k
x
= 0, (45)
d
2
y
dt
2
+ a
y
dx
dt
+ b
y
dy
dt
+ c
y
x + d
y
y + k
y
= 0, (46)
then we can try to simplify matters by considering the complex variable = x + iy, and adding
the bottomequation multiplied by i to the top equation. We then consider the differential equation
given by
d
2
dt
2
+ =
d
2
x
dt
2
+ + i
_
d
2
y
dt
2
+
_
. (47)
In general, this will produce an equation containing terms proportional to both and its complex
conjugate
dt
2
+
d
dt
+ + = 0, (48)
where , , are (in general) complex constant coefcients, i.e., there is no coupling to the complex
conjugate variable
. If this is the case, then the general solution for (t) is determined using
exactly the same methodology as described on the rst two pages. Having determined the general
II
solution for (t), one derives the general solution for x(t) by taking the real part of (t), and the
general solution for y(t) by taking the imaginary part.
The very substantial simplication of the mathematics offered by such an approach means it is
at least worth considering. Otherwise it is necessary to use methods developed for linear systems
of equations; this will also be necessary if motion in the x, y, and z directions cannot be sepa-
rated. Extension to higher-order linear differential equations is straightforward. See essentially
any textbook on ordinary differential equations for further details, if interested.
B Taylor Series
B.1 Taylors Theorem of the Mean
We consider a function of N variables f (x
1
, x
2
, . . . , x
N
). This could, for example, be a Lagrangian
describing the motion of n objects, which would be considered a function of n generalized coor-
dinates and n generalized velocities (which are considered independent variables), making for a
total of N = 2n variables. Or it could be just the potential energy term of such a Lagrangian, with
N = n variables.
If, within a region (i.e., some volume of n-dimensional space) all possible partial derivatives
of f exist, Taylors theorem of the mean implies
f (x
01
+ a
1
, . . . , x
0N
+ a
N
) =f (x
01
, . . . , x
0N
)
+
n
k=1
1
k!
_
a
1
x
1
+ + a
N
x
N
_
k
f (x
1
, . . . , x
N
)
x
01
,...,x
0N
+O(x
n+1
k
)
(49)
(Taylors theorem of the mean is in fact a more general result see essentially any moderately
advanced calculus text if interested). It is also assumed that the function f is well-dened over
the region of interest (in particular, there should be no innities), i.e., the above series converges
within this region. The vertical line at the right-hand-side of the derivative terms means that the
derivative is rst calculated, and then evaluated at x
1
= x
01
, x
2
= x
02
, etc..
B.2 Taylor Series in One Variable
If we restrict ourselves to f being a function of just one variable f (x), then
f (x
0
+ a) =f (x
0
) +
n
k=1
1
k!
a
k
k
x
k
f (x)
x
0
+O(x
n+1
).
(50)
If we write the above in a form where a = x x
0
, then we get the usual form of a single-variable
Taylor series:
f (x) =f (x
0
) +
n
k=1
1
k!
_
_
k
x
k
f (x)
x
0
_
_
(x x
0
)
k
+O(x
n+1
). (51)
Note that, for example, the function f (x) = x
1
cannot be expanded around x
0
= 0, as this point
is singular [ f (x) as x 0]. Any value of x
0
slightly different from zero is acceptable.
III
B.3 Taylor Series in Two Variables
It is not so easy to write down such a compact formulation for a Taylor series in two or more
variables, but the procedure is essentially the same. Bearing in mind that we are frequently only
interested in (at most) the rst- and second-order derivative terms, we write down
f (x
0
+ a, y
0
+ b) =f (x
0
, y
0
) +
_
a
x
+ b
y
_
f (x, y)
x
0
,y
0
+
1
2
_
a
2
2
x
2
+ 2ab
2
xy
+ b
2
2
y
2
_
f (x, y)
x
0
,y
0
+ .
(52)
Similarly to the case for a single-variable Taylor series, we set a = x x
0
and b = y y
0
, yielding
f (x, y) =f (x
0
, y
0
) +
__
x
f (x, y)
x
0
,y
0
_
(x x
0
) +
_
y
f (x, y)
x
0
,y
0
_
(y y
0
)
_
+
1
2
__
2
x
2
f (x, y)
x
0
,y
0
_
(x x
0
)
2
+ 2
_
2
xy
f (x, y)
x
0
,y
0
_
(x x
0
)(y y
0
)
+
_
2
y
2
f (x, y)
x
0
,y
0
_
(y y
0
)
2
_
+ .
(53)
B.4 Taylor Series in Three Variables
Proceeding in an exactly analogous fashion, we determine for a function of three variables, f (x, y, z),
that
f (x, y, z) =f (x
0
, y
0
, z
0
)
+
__
x
f (x, y, z)
x
0
,y
0
,z
0
_
(x x
0
) +
_
y
f (x, y, z)
x
0
,y
0
,z
0
_
(y y
0
)
+
_
z
f (x, y, z)
x
0
,y
0
,z
0
_
(z z
0
)
_
+
1
2
__
2
x
2
f (x, y, z)
x
0
,y
0
,z
0
_
(x x
0
)
2
+
_
2
y
2
f (x, y, z)
x
0
,y
0
,z
0
_
(y y
0
)
2
+
_
2
z
2
f (x, y, z)
x
0
,y
0
,z
0
_
(z z
0
)
2
+ 2
_
2
xy
f (x, y, z)
x
0
,y
0
,z
0
_
(x x
0
)(y y
0
)
+ 2
_
2
xz
f (x, y, z)
x
0
,y
0
,z
0
_
(x x
0
)(z z
0
)
+2
_
2
yz
f (x, y, z)
x
0
,y
0
,z
0
_
(y y
0
)(z z
0
)
_
+ .
(54)
This can be extended to functions of more than three variables, and to include higher-order
derivatives, as desired, in a reasonably straightforward fashion.
IV
C Determinants, Eigenvalues and Eigenvectors
C.1 Symmetric Matrices
In classical mechanics, we can usually restrict ourselves to considering real, symmetric matrices,
i.e., matrices with real entries that are equal to their own transpose. Formally, we say a matrix
A
is symmetric if
A =
A
T
. (55)
In the case of a general 2 2 matrix
B
B
T
=
_
a b
c d
_
T
=
_
a c
b d
_
. (56)
Hence, if
B is to equal
B
T
, it follows that b must equal c, i.e., the most general 2 2 symmetric
matrix has the form
B =
_
a b
b d
_
. (57)
C.2 Determinants
The determinant of the matrix
B is given by
det(
B) =
a b
c d
= ad bc. (58)
There is considerable freedom in the choice of steps to take when calculating the determinant of
a 3 3 matrix
C. The most conventional formulation is
det(
C) =
c
11
c
12
c
13
c
21
c
22
c
23
c
31
c
32
c
33
= c
11
c
22
c
23
c
32
c
33
c
12
c
21
c
23
c
31
c
33
+ c
13
c
21
c
22
c
31
c
32
. (59)
We thus have a sum of 3 terms involving determinants of 2 2 matrices, which we already know
how to calculate.
Similarly, the determinant of an n n matrix
D is given by a sum of n terms involving deter-
minants of (n 1) (n 1) matrices. These (n 1) (n 1) matrices are conventionally formed
by removing the 1st row and the kth column from the matrix
D. In this case, the determinant is
formulated by
det(
D) =
d
11
d
12
d
1n
d
21
d
22
d
2n
.
.
.
.
.
.
.
.
.
.
.
.
d
n1
d
n2
d
nn
=
n
k=1
(1)
1+k
d
1k
d
2,k1
d
2,k+1
.
.
.
.
.
.
.
.
.
.
.
.
d
n,k1
d
n,k+1
. (60)
C.3 The Conventional Eigenvalue Problem
If we take a matrix
A, a vector v, and a scalar , such that
Av = v, (61)
V
we say that v is an eigenvector of the matrix
A, with eigenvalue . It obviously follows that
Av v = 0 (
A
1)v = 0, (62)
where
1 is the identity matrix [i.e., the matrix with ones down the main (top-left-to-bottom-right)
diagonal and zeros everywhere else]. It so happens that this can only be possible if the determi-
nant of the matrix (
A
B =
_
a b
b d
_
. (63)
To determine the eigenvalues of
B, we calculate
a b
b d
= (a )(d ) b
2
=
2
(a + d) + (ad b
2
) = 0. (64)
The values of that solve this equation can in this case be generally determined fromthe quadratic
formula. There are two solutions,
+
and
, given by
=
(a + d)
_
(a d)
2
+ b
2
2
. (65)
The eigenvector v
+
associated with the eigenvalue
+
can then be determined from
_
a
+
b
b d
+
__
v
1+
v
2+
_
=
_
0
0
_
(66)
which implies
(
+
a)v
1+
=bv
2+
(67)
(
+
d)v
2+
=bv
1+
. (68)
It is common to choose v
1+
= 1, and then normalize the result. Hence, from Eq. (68)
v
+
=
1
_
1 + (
+
d)
2
/b
2
_
1
(
+
d)/b
_
, (69)
and equivalently, for the determining of v