Newton's laws of motion are often considered to be fundamental postulates
for describing the motion of particles in a gravitational field, at least
from our daily viewpoint. In a more general picture this is not so. Not
only are they just a result of the general theory of relativity, they can
also be derived from a more general principle, namely Hamilton's
principle. Newton's laws of motion are just one example of equations
that can be deduced from Hamilton's principle, the equations of motion for
galaxies in an expanding Universe is another. The following description of
this principle is mainly taken from Gold.
Hamilton's principle is an ``integral
principle'', which means that it considers the entire motion of a system
between time
and
. What is meant by this needs to be specified somewhat. The instantaneous
configuration of the system is described by the values of n
generalized coordinates
, and corresponds to a particular point in a Cartesian hyperspace where
the q-s form the n coordinate axes. This n-dimensional
space is known as the configuration space. As the time evolves, the system
point moves in this configuration space, tracing out a curve. This curve
describes the path of motion of the system. The configuration space can be
very different from the physical three-dimensional space, where only three
coordinates are needed to describe a position at any give time. For
example, a system that is being described both by the spatial coordinates
and the velocities would have a six-dimensional configuration space at any
given point in time.
Hamilton's principle is a version of the
integral principle which considers the motion of a mechanical system,
described by a scalar potential that may be a function of the coordinates,
velocities and time. The integral, often also referred to as the action,
is, in an essential one-dimensional form from
to
, given by
where
is the Lagrangian, given by
, T and V being the kinetic and potential energy,
respectively. The dot indicate derivative with respect to time. The
dependence of x on t is not fixed; that is, x(t)
is unknown. This means that although the integral is from
to
, the exact path of integration is not known. The correct path of motion
of the system is such that the action has a stationary value; i.e. the
integral along the given path has the same value to within first-order
infinitesimals as that along all neighbouring paths. The difference
between two paths for a given t is called the variation of
x,
, and is conventionally described by introducing a new function
to define the arbitrary deformation of the path and a scale factor
to give the magnitude of the variation. The function
is arbitrary except for two restrictions: firstly, it must satisfy the
boundary values,
; secondly, it must be twice differentiable. The paths can then be
described as

We have a stationary value of the action
when the derivative of A with respect to the scale factor
is zero:
The
-dependence of the integral is contained in
and
, thus
By inserting equation and integrating the
second term by parts, we get
The integrated part vanishes due to the fixed end-points (boundary
values). The condition for stationary values, equation, is therefore equal
to the following relation:
In order to arrive at the equation of motion, the fundamental theorem of
variational calculus is needed. It state that if the integral in equation
vanishes for every
continuously differentiable in the interval
, then the content of the brackets in the equation must identically
vanish in the same interval; that is, for
. It therefore follows that A can have stationary values only if
which is the familiar Euler-Lagrange differential equation. By inserting
the Lagrangian, one can then deduce the equation of motion for the
mechanical system.
What is the point of going through this
rather extensive deduction, just arrive at the well known Euler-Lagrange
differential equation? In this case there are at least three motivations.
Firstly, it shows that the Euler-Lagrange equations are in fact a result
of the very compact Hamilton's principle. Secondly, it will be needed in
the next section to justify the use of mixed boundary values. Thirdly, and
most importantly, the principle is applied directly in the numerical
implementation of the AVP.
The one-dimensional deduction presented in
this section can easily be extended to any multidimensional case. |