Relativity:
The Special and General Theory
The
Principle of Relativity (In the Restricted Sense)
IN order to attain the greatest possible clearness, let
us return to our example of the railway carriage supposed
to be travelling uniformly. We call its motion a uniform
translation (“uniform” because it is of constant
velocity and direction, “translation” because
although the carriage changes its position relative to the
embankment yet it does not rotate in so doing). Let us imagine
a raven flying through the air in such a manner that its
motion, as observed from the embankment, is uniform and
in a straight line. If we were to observe the flying raven
from the moving railway carriage, we should find that the
motion of the raven would be one of different velocity and
direction, but that it would still be uniform and in a straight
line. Expressed in an abstract manner we may say: If a mass
m is moving uniformly in a straight line with respect to
a co-ordinate system K, then it will also be moving uniformly
and in a straight line relative to a second co-ordinate
system K', provided that the latter is executing a uniform
translatory motion with respect to K. In accordance with
the discussion contained in the preceding section, it follows
that:
If K is a Galileian co-ordinate system, then every other
co-ordinate system K' is a Galileian one, when, in relation
to K, it is in a condition of uniform motion of translation.
Relative to K' the mechanical laws of Galilei-Newton hold
good exactly as they do with respect to K.
We advance a step farther in our generalisation when we
express the tenet thus: If, relative to K, K' is a uniformly
moving co-ordinate system devoid of rotation, then natural
phenomena run their course with respect to K' according
to exactly the same general laws as with respect to K. This
statement is called the principle of relativity (in the
restricted sense).
As long as one was convinced that all natural phenomena
were capable of representation with the help of classical
mechanics, there was no need to doubt the validity of this
principle of relativity. But in view of the more recent
development of electrodynamics and optics it became more
and more evident that classical mechanics affords an insufficient
foundation for the physical description of all natural phenomena.
At this juncture the question of the validity of the principle
of relativity became ripe for discussion, and it did not
appear impossible that the answer to this question might
be in the negative.
Nevertheless, there are two general facts which at the outset
speak very much in favour of the validity of the principle
of relativity. Even though classical mechanics does not
supply us with a sufficiently broad basis for the theoretical
presentation of all physical phenomena, still we must grant
it a considerable measure of “truth,” since
it supplies us with the actual motions of the heavenly bodies
with a delicacy of detail little short of wonderful. The
principle of relativity must therefore apply with great
accuracy in the domain of mechanics. But that a principle
of such broad generality should hold with such exactness
in one domain of phenomena, and yet should be invalid for
another, is a priori not very probable.
We now proceed to the second argument, to which, moreover,
we shall return later. If the principle of relativity (in
the restricted sense) does not hold, then the Galileian
co-ordinate systems K, K', K'', etc., which are moving uniformly
relative to each other, will not be equivalent for the description
of natural phenomena. In this case we should be constrained
to believe that natural laws are capable of being formulated
in a particularly simple manner, and of course only on condition
that, from amongst all possible Galileian co-ordinate systems,
we should have chosen one (K0) of a particular state of
motion as our body of reference. We should then be justified
(because of its merits for the description of natural phenomena)
in calling this system “absolutely at rest,”
and all other Galileian systems K “in motion.”
If, for instance, our embankment were the system K0, then
our railway carriage would be a system K, relative to which
less simple laws would hold than with respect to K0. This
diminished simplicity would be due to the fact that the
carriage K would be in motion (i.e. “really”)
with respect to K0. In the general laws of natural which
have been formulated with reference to K, the magnitude
and direction of the velocity of the carriage would necessarily
play a part. We should expect, for instance, that the note
emitted by an organ-pipe placed with its axis parallel to
the direction of travel would be different from that emitted
if the axis of the pipe were placed perpendicular to this
direction. Now in virtue of its motion in an orbit round
the sun, our earth is comparable with a railway carriage
travelling with a velocity of about 30 kilometres per second.
If the principle of relativity were not valid we should
therefore expect that the direction of motion of the earth
at any moment would enter into the laws of nature, and also
that physical systems in their behaviour would be dependent
on the orientation in space with respect to the earth. For
owing to the alteration in direction of the velocity of
rotation of the earth in the course of a year, the earth
cannot be at rest relative to the hypothetical system K0
throughout the whole year. However, the most careful observations
have never revealed such anisotropic properties in terrestrial
physical space, i.e. a physical non-equivalence of different
directions. This is a very powerful argument in favour of
the principle of relativity.
On
the Idea of Time in Physics
LIGHTNING has struck the rails on our railway embankment
at two places A and B far distant from each other. I make
the additional assertion that these two lightning flashes
occurred simultaneously. If now I ask you whether there
is sense in this statement, you will answer my question
with a decided “Yes.” But if I now approach
you with the request to explain to me the sense of the statement
more precisely, you find after some consideration that the
answer to this question is not so easy as it appears at
first sight.
After some time perhaps the following answer would occur
to you: “The significance of the statement is clear
in itself and needs no further explanation; of course it
would require some consideration if I were to be commissioned
to determine by observations whether in the actual case
the two events took place simultaneously or not.”
I cannot be satisfied with this answer for the following
reason. Supposing that as a result of ingenious considerations
an able meteorologist were to discover that the lightning
must always strike the places A and B simultaneously, then
we should be faced with the task of testing whether or not
this theoretical result is in accordance with the reality.
We encounter the same difficulty with all physical statements
in which the conception “simultaneous” plays
a part. The concept does not exist for the physicist until
he has the possibility of discovering whether or not it
is fulfilled in an actual case. We thus require a definition
of simultaneity such that this definition supplies us with
the method by means of which, in the present case, he can
decide by experiment whether or not both the lightning strokes
occurred simultaneously. As long as this requirement is
not satisfied, I allow myself to be deceived as a physicist
(and of course the same applies if I am not a physicist),
when I imagine that I am able to attach a meaning to the
statement of simultaneity. (I would ask the reader not to
proceed farther until he is fully convinced on this point.)
After thinking the matter over for some time you then offer
the following suggestion with which to test simultaneity.
By measuring along the rails, the connecting line AB should
be measured up and an observer placed at the mid-point M
of the distance AB. This observer should be supplied with
an arrangement (e.g. two mirrors inclined at 90°) which
allows him visually to observe both places A and B at the
same time. If the observer perceives the two flashes of
lightning at the same time, then they are simultaneous.
I am very pleased with this suggestion, but for all that
I cannot regard the matter as quite settled, because I feel
constrained to raise the following objection: “Your
definition would certainly be right, if I only knew that
the light by means of which the observer at M perceives
the lightning flashes travels along the length A —>
M with the same velocity as along the length B —>
M. But an examination of this supposition would only be
possible if we already had at our disposal the means of
measuring time. It would thus appear as though we were moving
here in a logical circle.”
After further consideration you cast a somewhat disdainful
glance at me—and rightly so—and you declare:
“I maintain my previous definition nevertheless, because
in reality it assumes absolutely nothing about light. There
is only one demand to be made of the definition of simultaneity,
namely, that in every real case it must supply us with an
empirical decision as to whether or not the conception that
has to be defined is fulfilled. That my definition satisfies
this demand is indisputable. That light requires the same
time to traverse the path A —> M as for the path
B —> M is in reality neither a supposition nor
a hypothesis about the physical nature of light, but a stipulation
which I can make of my own freewill in order to arrive at
a definition of simultaneity.”
It is clear that this definition can be used to give an
exact meaning not only to two events, but to as many events
as we care to choose, and independently of the positions
of the scenes of the events with respect to the body of
reference 1 (here the railway embankment). We are thus led
also to a definition of “time” in physics. For
this purpose we suppose that clocks of identical construction
are placed at the points A, B and C of the railway line
(co-ordinate system), and that they are set in such a manner
that the positions of their pointers are simultaneously
(in the above sense) the same. Under these conditions we
understand by the “time” of an event the reading
(position of the hands) of that one of these clocks which
is in the immediate vicinity (in space) of the event. In
this manner a time-value is associated with every event
which is essentially capable of observation.
This stipulation contains a further physical hypothesis,
the validity of which will hardly be doubted without empirical
evidence to the contrary. It has been assumed that all these
clocks go at the same rate if they are of identical construction.
Stated more exactly: When two clocks arranged at rest in
different places of a reference-body are set in such a manner
that a particular position of the pointers of the one clock
is simultaneous (in the above sense) with the same position
of the pointers of the other clock, then identical “settings”
are always simultaneous (in the sense of the above definition).
Note 1. We suppose further that, when three events A, B
and C take place in different places in such a manner that,
if A is simultaneous with B, and B is simultaneous with
C (simultaneous in the sense of the above definition), then
the criterion for the simultaneity of the pair of events
A, C is also satisfied. This assumption is a physical hypothesis
about the law of propagation of light; it must certainly
be fulfilled if we are to maintain the law of the constancy
of the velocity of light in vacuo.
Experience
and the Special Theory of Relativity
TO what extent is the special theory of relativity supported
by experience? This question is not easily answered for
the reason already mentioned in connection with the fundamental
experiment of Fizeau. The special theory of relativity has
crystallised out from the Maxwell-Lorentz theory of electromagnetic
phenomena. Thus all facts of experience which support the
electromagnetic theory also support the theory of relativity.
As being of particular importance, I mention here the fact
that the theory of relativity enables us to predict the
effects produced on the light reaching us from the fixed
stars. These results are obtained in an exceedingly simple
manner, and the effects indicated, which are due to the
relative motion of the earth with reference to those fixed
stars, are found to be in accord with experience. We refer
to the yearly movement of the apparent position of the fixed
stars resulting from the motion of the earth round the sun
(aberration), and to the influence of the radial components
of the relative motions of the fixed stars with respect
to the earth on the colour of the light reaching us from
them. The latter effect manifests itself in a slight displacement
of the spectral lines of the light transmitted to us from
a fixed star, as compared with the position of the same
spectral lines when they are produced by a terrestrial source
of light (Doppler principle). The experimental arguments
in favour of the Maxwell-Lorentz theory, which are at the
same time arguments in favour of the theory of relativity,
are too numerous to be set forth here. In reality they limit
the theoretical possibilities to such an extent, that no
other theory than that of Maxwell and Lorentz has been able
to hold its own when tested by experience.
But there are two classes of experimental facts hitherto
obtained which can be represented in the Maxwell-Lorentz
theory only by the introduction of an auxiliary hypothesis,
which in itself—i.e. without making use of the theory
of relativity—appears extraneous.
It
is known that cathode rays and the so-called -rays emitted
by radioactive substances consist of negatively electrified
particles (electrons) of very small inertia and large velocity.
By examining the deflection of these rays under the influence
of electric and magnetic fields, we can study the law of
motion of these particles very exactly.
In the theoretical treatment of these electrons, we are
faced with the difficulty that electrodynamic theory of
itself is unable to give an account of their nature. For
since electrical masses of one sign repel each other, the
negative electrical masses constituting the electron would
necessarily be scattered under the influence of their mutual
repulsions, unless there are forces of another kind operating
between them, the nature of which has hitherto remained
obscure to us. 1 If we now assume that the relative distances
between the electrical masses constituting the electron
remain unchanged during the motion of the electron (rigid
connection in the sense of classical mechanics), we arrive
at a law of motion of the electron which does not agree
with experience. Guided by purely formal points of view,
H. A. Lorentz was the first to introduce the hypothesis
that the particles constituting the electron experience
a contraction in the direction of motion in consequence
of that motion, the amount of this contraction being proportional.
The
is not justifiable by any electrodynamical facts, supplies
us then with that particular law of motion which has been
confirmed with great precision in recent years.
The theory of relativity leads to the same law of motion,
without requiring any special hypothesis whatsoever as to
the structure and the behaviour of the electron. We arrived
at a similar conclusion in Section XIII in connection with
the experiment of Fizeau, the result of which is fore-told
by the theory of relativity without the necessity of drawing
on hypotheses as to the physical nature of the liquid.
The second class of facts to which we have alluded has reference
to the question whether or not the motion of the earth in
space can be made perceptible in terrestrial experiments.
We have already remarked in Section V that all attempts
of this nature led to a negative result. Before the theory
of relativity was put forward, it was difficult to become
reconciled to this negative result, for reasons now to be
discussed. The inherited prejudices about time and space
did not allow any doubt to arise as to the prime importance
of the Galilei transformation for changing over from one
body of reference to another. Now assuming that the Maxwell-Lorentz
equations hold for a reference-body K, we then find that
they do not hold for a reference-body K' moving uniformly
with respect to K, if we assume that the relations of the
Galileian transformation exist between the co-ordinates
of K and K'. It thus appears that of all Galileian co-ordinate
systems one (K) corresponding to a particular state of motion
is physically unique. This result was interpreted physically
by regarding K as at rest with respect to a hypothetical
æther of space. On the other hand, all co-ordinate
systems K' moving relatively to K were to be regarded as
in motion with respect to the æther. To this motion
of K' against the æther (“æther-drift”
relative to K') were assigned the more complicated laws
which were supposed to hold relative to K'. Strictly speaking,
such an æther-drift ought also to be assumed relative
to the earth, and for a long time the efforts of physicists
were devoted to attempts to detect the existence of an æther-drift
at the earth’s surface.
In one of the most notable of these attempts Michelson devised
a method which appears as though it must be decisive. Imagine
two mirrors so arranged on a rigid body that the reflecting
surfaces face each other. A ray of light requires a perfectly
definite time T to pass from one mirror to the other and
back again, if the whole system be at rest with respect
to the æther. It is found by calculation, however,
that a slightly different time T' is required for this process,
if the body, together with the mirrors, be moving relatively
to the æther. And yet another point: it is shown by
calculation that for a given velocity v with reference to
the æther, this time T' is different when the body
is moving perpendicularly to the planes of the mirrors from
that resulting when the motion is parallel to these planes.
Although the estimated difference between these two times
is exceedingly small, Michelson and Morley performed an
experiment involving interference in which this difference
should have been clearly detectable. But the experiment
gave a negative result—a fact very perplexing to physicists.
Lorentz and FitzGerald rescued the theory from this difficulty
by assuming that the motion of the body relative to the
æther produces a contraction of the body in the direction
of motion, the amount of contraction being just sufficient
to compensate for the difference in time mentioned above.
Comparison with the discussion in Section XII shows that
from the standpoint also of the theory of relativity this
solution of the difficulty was the right one. But on the
basis of the theory of relativity the method of interpretation
is incomparably more satisfactory. According to this theory
there is no such thing as a “specially favoured”
(unique) co-ordinate system to occasion the introduction
of the æther-idea, and hence there can be no æther-drift,
nor any experiment with which to demonstrate it. Here the
contraction of moving bodies follows from the two fundamental
principles of the theory without the introduction of particular
hypotheses; and as the prime factor involved in this contraction
we find, not the motion in itself, to which we cannot attach
any meaning, but the motion with respect to the body of
reference chosen in the particular case in point. Thus for
a co-ordinate system moving with the earth the mirror system
of Michelson and Morley is not shortened, but it is shortened
for a co-ordinate system which is at rest relatively to
the sun.
Note 1. The general theory of relativity renders it likely
that the electrical masses of an electron are held together
by gravitational forces.
Special
and General Principle of Relativity
THE BASAL principle, which was the pivot of all our previous
considerations, was the special principle of relativity,
i.e. the principle of the physical relativity of all uniform
motion. Let us once more analyse its meaning carefully.
It was at all times clear that, from the point of view of
the idea it conveys to us, every motion must only be considered
as a relative motion. Returning to the illustration we have
frequently used of the embankment and the railway carriage,
we can express the fact of the motion here taking place
in the following two forms, both of which are equally justifiable:
The carriage is in motion relative to the embankment.
The embankment is in motion relative to the carriage.
In (a) the embankment, in (b) the carriage, serves as the
body of reference in our statement of the motion taking
place. If it is simply a question of detecting or of describing
the motion involved, it is in principle immaterial to what
reference-body we refer the motion. As already mentioned,
this is self-evident, but it must not be confused with the
much more comprehensive statement called “the principle
of relativity,” which we have taken as the basis of
our investigations.
The principle we have made use of not only maintains that
we may equally well choose the carriage or the embankment
as our reference-body for the description of any event (for
this, too, is self-evident). Our principle rather asserts
what follows: If we formulate the general laws of nature
as they are obtained from experience, by making use of
the embankment as reference-body,
the railway carriage as reference-body,
then these general laws of nature (e.g. the laws of mechanics
or the law of the propagation of light in vacuo) have exactly
the same form in both cases. This can also be expressed
as follows: For the physical description of natural processes,
neither of the reference-bodies K, K' is unique (lit. “specially
marked out”) as compared with the other. Unlike the
first, this latter statement need not of necessity hold
a priori; it is not contained in the conceptions of “motion”
and “referencebody” and derivable from them;
only experience can decide as to its correctness or incorrectness.
Up to the present, however, we have by no means maintained
the equivalence of all bodies of reference K in connection
with the formulation of natural laws. Our course was more
on the following lines. In the first place, we started out
from the assumption that there exists a reference-body K,
whose condition of motion is such that the Galileian law
holds with respect to it: A particle left to itself and
sufficiently far removed from all other particles moves
uniformly in a straight line. With reference to K (Galileian
reference-body) the laws of nature were to be as simple
as possible. But in addition to K, all bodies of reference
K' should be given preference in this sense, and they should
be exactly equivalent to K for the formulation of natural
laws, provided that they are in a state of uniform rectilinear
and non-rotary motion with respect to K; all these bodies
of reference are to be regarded as Galileian reference-bodies.
The validity of the principle of relativity was assumed
only for these reference-bodies, but not for others (e.g.
those possessing motion of a different kind). In this sense
we speak of the special principle of relativity, or special
theory of relativity.
In contrast to this we wish to understand by the “general
principle of relativity” the following statement:
All bodies of reference K, K', etc., are equivalent for
the description of natural phenomena (formulation of the
general laws of nature), whatever may be their state of
motion. But before proceeding farther, it ought to be pointed
out that this formulation must be replaced later by a more
abstract one, for reasons which will become evident at a
later stage.
Since the introduction of the special principle of relativity
has been justified, every intellect which strives after
generalisation must feel the temptation to venture the step
towards the general principle of relativity. But a simple
and apparently quite reliable consideration seems to suggest
that, for the present at any rate, there is little hope
of success in such an attempt. Let us imagine ourselves
transferred to our old friend the railway carriage, which
is travelling at a uniform rate. As long as it is moving
uniformly, the occupant of the carriage is not sensible
of its motion, and it is for this reason that he can un-reluctantly
interpret the facts of the case as indicating that the carriage
is at rest, but the embankment in motion. Moreover, according
to the special principle of relativity, this interpretation
is quite justified also from a physical point of view.
If the motion of the carriage is now changed into a non-uniform
motion, as for instance by a powerful application of the
brakes, then the occupant of the carriage experiences a
correspondingly powerful jerk forwards. The retarded motion
is manifested in the mechanical behaviour of bodies relative
to the person in the railway carriage. The mechanical behaviour
is different from that of the case previously considered,
and for this reason it would appear to be impossible that
the same mechanical laws hold relatively to the non-uniformly
moving carriage, as hold with reference to the carriage
when at rest or in uniform motion. At all events it is clear
that the Galileian law does not hold with respect to the
non-uniformly moving carriage. Because of this, we feel
compelled at the present juncture to grant a kind of absolute
physical reality to non-uniform motion, in opposition to
the general principle of relativity. But in what follows
we shall soon see that this conclusion cannot be maintained.
The
Structure of Space According to the General Theory of Relativity
ACCORDING to the general theory of relativity, the geometrical
properties of space are not independent, but they are determined
by matter. Thus we can draw conclusions about the geometrical
structure of the universe only if we base our considerations
on the state of the matter as being something that is known.
We know from experience that, for a suitably chosen co-ordinate
system, the velocities of the stars are small as compared
with the velocity of transmission of light. We can thus
as a rough approximation arrive at a conclusion as to the
nature of the universe as a whole, if we treat the matter
as being at rest.
We already know from our previous discussion that the behaviour
of measuring-rods and clocks is influenced by gravitational
fields, i.e. by the distribution of matter. This in itself
is sufficient to exclude the possibility of the exact validity
of Euclidean geometry in our universe. But it is conceivable
that our universe differs only slightly from a Euclidean
one, and this notion seems all the more probable, since
calculations show that the metrics of surrounding space
is influenced only to an exceedingly small extent by masses
even of the magnitude of our sun. We might imagine that,
as regards geometry, our universe behaves analogously to
a surface which is irregularly curved in its individual
parts, but which nowhere departs appreciably from a plane:
something like the rippled surface of a lake. Such a universe
might fittingly be called a quasi-Euclidean universe. As
regards its space it would be infinite. But calculation
shows that in a quasi-Euclidean universe the average density
of matter would necessarily be nil. Thus such a universe
could not be inhabited by matter everywhere;
If we are to have in the universe an average density of
matter which differs from zero, however small may be that
difference, then the universe cannot be quasi-Euclidean.
On the contrary, the results of calculation indicate that
if matter be distributed uniformly, the universe would necessarily
be spherical (or elliptical). Since in reality the detailed
distribution of matter is not uniform, the real universe
will deviate in individual parts from the spherical, i.e.
the universe will be quasi-spherical. But it will be necessarily
finite. In fact, the theory supplies us with a simple connection
1 between the space-expanse of the universe and the average
density of matter in it.