FROM NEWTON TO KEPLER

notes by Roberto Bigoni

(In the following text, vector quantities are represented by letters in bold; in the numbered equations vector quantities are topped with an arrow, the scalar (or dot) product is denoted by · and the vector cross product is indicated by ×.)

1. Position, velocity and acceleration in plane polar coordinates.

The position of a point P in a plane can be represented by choosing a polar reference system with pole at O and polar axis OX and by matching the point P with a vector r with magnitude equal to the measure of the segment OP and polar angle θ given by the measure of the oriented angle XOP.

If, with an infinitesimal change in position, the point P moves to P', the new position is represented by the vector r', ie the vector sum r+dr.

dr represents an infinitesimal displacement, which in turn can be expressed as the sum of two infinitesimal vectors, one of which is radial, that is directed as PO and with magnitude dr, and the other one is tangential, that is perpendicular to OP, with magnitude r dθ.

I we represent with u_r the radial unit vector and with u_θ the tangential unit vector, we can write

Since the velocity vector is the derivative with respect to time of the position vector, we obtain

The first term in the right hand side of (1.2) is the radial component of velocity or, more briefly, the radial velocity; the second term is the tangential component of velocity or, more briefly, the tangential velocity.

With a convention introduced by Newton, we show the first derivative of a quantity with respect to time by superimposing a dot to the symbol of the quantity. Adopting this notation we can rewrite the equation (1.2) in the following way

Moreover, often the derivative of the polar angle θ with respect to time, said angular velocity, is represented by the symbol ω.
Then we can also write the following version of of (1.2)

The square of the velocity vector is

By definition the acceleration vector is given by the derivative of the velocity vector with respect to time:

To obtain the acceleration we derive the equation (1.2.2)

To derive the radial and the tangential unit vector, we can represent them in a cartesian system of coordinates.

If we call i and j the units vectors of the cartesian axes, we have

The derivatives of the equations (1.5) with respect to θ are

then

Now, from the equations (1.7) and (1.4), we get

and finally

If the magnitude of the position r is constant, the motion is a circular motion centered on the pole O.
In this case, the radial velocity is zero so the velocity coincides with the tangential velocity.

Therefore, in the case of a circular motion, we obtain

If in a circular motion ω is constant, the motion is called uniform circular motion.

In a uniform circular motion, if T is the period, ie the duration of a full rotation, we have

and the expression of the acceleration, from the equation (1.9), is simply

that is, the acceleration has only the radial component which is constant and directed from the point to the pole.

In this case the acceleration is said to be centripetal.

2. The theorem of live forces.

The work of a force f which causes an infinitesimal displacement dr of its point of application, by definition, is

From the second law of motion

and from the equation (2.1) we have

Moreover, differentiating the square of speed, we obtain

Then the right hand side of the equation (2.3) becomes

where

K is called kinetic energy.

So, from the equation (2.1), we obtain

that is

3. Angular momentum and central forces.

If a body (which we assume to be point-like) with mass m moves with velocity v, its momentum p is given by p=mv.

If the body is in the position r with respect to the pole O, its angular momentum with respect to the pole O is

If we derive both the sides of the equation (3.1) with respect to time we get

The first vector product in the right hand side is zero, since the derivative of the position vector is the velocity vector that is parallel to the momentum.

The derivative of the momentum with respect to time, from the second law of motion

is the force acting on the body. Therefore, in general

In the equation (3.3) the vector τ, given by the vector product of position and force, is called moment of the force or torque.
The equation (1.3) can be seen as an extension of the second law of motion to the angular quantities: f matches τ, p matches l and τ is the derivative with respect to time of l as f is the derivative with respect to time of p.

If f is a central force, ie parallel to r, its moment is zero; so also the derivative of the angular momentum with respect to time is zero, that is the angular momentum of a central force is constant.

If a vector is constant, its direction and its magnitude are constant.
The direction of angular momentum, which is the vector product of r and p, is perpendicular to the plane of r and p: this means that r and p, during movement of the body, are always on the same plane; therefore the trajectory is a plane curve that is covered by the body always with the same sense of rotation.

As a first approximation, ie neglecting the forces due to the presence of other planets, according to the theory of Newton, the force of universal gravitation acting on a single planet is a central force directed from the planet toward the Sun,

therefore, according to this law, the planets follow orbits that are described by planar curves and are free of retrograde motion.

The magnitude of l is rp sinδ where δ is the angle between the vectors r and p, so p sinδ is the tangential component of the momentum, that is the product of the mass by the tangential velocity. The tangential velocity is ωr, then

ω is the derivative with respect to time of the polar angle θ, then

r²dθ can be geometrically interpreted as twice the area dA swept by the position vector

Therefore, ultimately

Since the right hand side is constant, also the left hand side, said areal velocity, is constant.

This is the second of Kepler's laws, obtained by him by accurately recording the positions of the planets visible to the naked eye over time, and traditionally formulated in the following way:

A line joining a planet and the Sun sweeps out equal areas during equal intervals of time.

This is the same as saying that the areal velocity is constant; as we have seen, this is an astronomical result, of the fact that, if a body is subjected to a central force, its angular momentum is constant.

4. The gravitational force is conservative.

By definition, the work of a force is given by the scalar product of the force and the displacement vector.

To calculate the work of the force of gravitation related to an infinitesimal displacement dr we need to perform the scalar product of force (3.4) for the infinitesimal displacement (1.1)

The right hand side in (4.1) is the opposite of the exact differential of the function

If we represent this function, said gravitational potential energy, with U, the equation (4.1) can be written as

By the theorem of live forces

If we subtract the equation (4.3) from (4.4) we obtain

So the total energy E, sum of potential and kinetic energy, of a body with mass m located in the gravitational field generated by a body with mass M (assumed at rest in the pole O) is constant over time

For this reason we say that the gravitational field is a conservative field.

5. Trajectories in gravitational fields.

From the equation (4.6) we obtain the square of the velocity

Using the expression (1.2.3) of the square of the velocity, we have

and after

From the equation (3.5) we have

so we can rewrite the equation (5.3) as

Using again the equation (3.5)

we have

Now we divide the equation (5.6) by the equation (5.5) and separate the variables

To simplify the differential expression of the right hand side, it is useful to let

and

With these substitutions the equation (5.7) can be rewritten as

The integration of the differential at right hand side is facilitated by writing the radicand as a difference of squares

Now we integrate

Remembering the value of s:

If we impose that r has its maximum when θ = 0, we get C = π and then

The equation (5.17) is the polar equation of a conic with semilatus rectum p and eccentricity e given by

Remembering the values of a and b we have

In particular, by examining the expression of the eccentricity, we can see that

if E is zero, the orbit is a parabola;
if E is negative, the orbit is an ellipse;
if E is positive, the orbit is a hyperbola.

This result is more general than that stated by Kepler's first law: 'The orbit of every planet is an ellipse with the Sun at one of the two foci', because it provides that the heavenly bodies in the gravitational field of the Sun or another star, can also move along parabolic or hyperbolic orbits.

6. The Kepler's third law.

For simplicity sake we will consider only circular orbits.

If the orbit is a circumference, the eccentricity is 0, so, from the equation (5.19)

Moreover the gravitational force is a centripetal force that can be expressed by the equation (1.12)

We can simplify:

The velocity can be expressed using the equation (1.10)

The total energy E=U+K is

From the equations (6.1) and (6.6) we obtain

Recovering l from the equation (3.5)

Recovering ω from equation (1.11)

The equation (6.12) expresses Kepler's third law: The square of the orbital period of a planet is directly proportional to the cube of the semi-major axis of its orbit.

7. Kepler's Equation.

In order to calculate the position of a planet as a function of time, referring to the diagram below, we can proceed as follows.

The Sun is at the focus F of the orbit.
We assume that the time starts when the planet is at point A where the distance from the Sun is highest (aphelion).
P is the position of the planet at a later time t.
The elliptical orbit has semi-major axis a, semi-minor axis b and eccentricity e.
We draw the circle with center O, coincident with the center of the orbit, and radius equal to a.
We draw the perpendicular from P to the major axis, which intersects the circumference at the point Q and the axis at the point X.
We represent the distance FP with ρ, the measure of the angle AFP with θ and the measure of the angle AOQ with ε. These angles are called, respectively, true anomaly and eccentric anomaly.

In the Cartesian system with origin O and axes coincident with the axes of the ellipse, the distance FO has measure ae and the abscissa x is common to the points P and Q.

In this system, the equation of the circle is then the ordinate y_Q of the point Q of the circle is .
In the same system the equation of the ellipse is , then the ordinate y_P of the point P of the ellipse is

so the lengths of the segments PX and QX are such that .
This implies that:

the area of the ellipse is ^b/_a of the area of the circle
the area of the region of ellipse with vertices XAP area is ^b/_a of the area of the region of circle with vertices XAQ.

To calculate the area of the region XAQ we make the difference between the area of the circular sector OAQ and that of the triangle OXQ. We get

and then

We get the area of the whole elliptical sector FAP (with a dark background in the figure) by adding the area of the triangle FXP and the area of the region XAP

This is the area 'swept' by the radius vector FP in time t. For the second Kepler's law, the ratio between this area and the time t is constant; in particular it is equal to the ratio between the whole area of the ellipse and the period T of revolution:

If we represent the mean angular velocity with ω, , we finally get

This equation, known as Kepler's equation, given the orbital period T of a planet (or satellite) and the eccentricity of its orbit, allows to compute the eccentric anomaly ε.

Having computed the eccentric anomaly ε, we can derive the true anomaly, for example, in the following way:

From the relation between sin θ e sin ε
Moreover
If we sum the equations (7.2) an (7.4) we have
Now we consider together the equations (7.3) and (7.5)

and divide side by side
Remembering the half-angle formulae
we can get

If θ is negative, we can increase it by 2π

After obtaining θ, we can calculate the radius ρ using the polar equation of the ellipse.

last revision: April 2017