New Phys.: Sae Mulli 2024; 74: 394-400
Published online April 30, 2024 https://doi.org/10.3938/NPSM.74.394
Copyright © New Physics: Sae Mulli.
Hongbin Kim1,2,3, Dong-han Yeom1,2*, Jong Hyun Kim1,2
1Department of Physics Education, Pusan National University, Busan 46241, Korea
2Research Center for Dielectric and Advanced Matter Physics, Pusan National University, Busan 46241, Korea
3Department of Physics Education, Seoul National University, Seoul 08826, Korea
Correspondence to: *innocent.yeom@pusan.ac.kr
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
In this paper, we present a new heuristic derivation of the gravitational deflection of light around the Sun at the undergraduate level. Instead of solving the geodesic equation directly, we compute the correct deflection angle by focusing on the acceleration term of null geodesics. With this heuristic derivation, we expect that undergraduate students who have not learned general relativity will be able to carry out this computation, which underlies one of the most remarkable pieces of evidence for general relativity.
Keywords: Gravitational deflection of light, Gravitational bending, General relativity education, Physics education
The gravitational deflection of light around the Sun is one of the most striking pieces of evidence for the general theory of relativity (GR). In 1919, on the island of Príncipe, an observational expedition led by Eddington confirmed that light passing near the Sun was deflected by 1.75 arcsec, as predicted by Einstein. Because the deflection of light was an important and dramatic observation supporting the validity of general relativity, it has become a standard example when introducing general relativity, not only at the college level but even at the high school level.
To teach this phenomenon to students who have not properly learned GR, qualitative explanations such as 'the metaphor of a trampoline'[1, 2] or intuitive models of curved space are used[3]. To obtain the quantitative result of 1.75 arcsec, students must be able to handle complex concepts such as the geodesic equation and the Christoffel symbols, which usually appear in GR textbooks[4, 5, 6, 7, 8, 9]. However, these are not easy concepts for undergraduate students in the usual college curriculum. In particular, even undergraduate students majoring in physics often experience a barrier between qualitative explanations and quantitative calculations in GR, even though they already know many of its basic concepts.
To overcome this difficulty, several simplified calculations have been suggested that give quantitative results for the gravitational deflection angle of light[10, 11, 12, 13]. For example, one study used dimensional analysis and obtained half of the observed value by assuming that light is accelerated only in a near zone whose size corresponds to the diameter of the Sun[10]. Another study derived the deflection angle by comparison with an accelerating spacecraft[11]. In that approach, the width of the spacecraft was chosen to be the diameter of the Sun, which also yields half of the observed value; it resembles the approach of Ref. 10 in that the parameter was chosen arbitrarily. Other approaches obtain the deflection angle from the fact that the effective refractive index varies with distance from the Sun[12, 13]. This is essentially the same approach that Einstein used in his 1916 paper[14]. Although these approaches can help undergraduate students focus on the physical meaning rather than on mathematical calculation alone, they still involve computational complexities such as expressing the metric in isotropic form. These are bound to be burdensome for average undergraduate students.
In this paper, we propose a new heuristic derivation of the deflection angle of light passing near the Sun. In Section 2, we examine attempts to explain the gravitational deflection of light in the Newtonian framework, together with Einstein's early approach. That section also shows how we approximate the calculations. In Section 3, we present a heuristic derivation at the undergraduate level. Finally, we summarize and discuss our results in Section 4.
Contrary to the popular belief that “light has no mass and therefore is not affected by gravity”, Isaac Newton himself seriously considered whether light would be affected by a massive body, and left the issue to be solved in the future. In 1704, Newton raised the following question in Opticks (Query 1 appears at the end of the book)[15]:
“Do not Bodies act upon Light at a distance, and by their action bend its Rays, and is not this action strongest at the least distance? (Query 1)”
According to Newtonian mechanics, every object with non-zero mass experiences an inverse-square force from the Sun. Moreover, the trajectories of objects are independent of their masses, provided that their initial velocities are the same. The reason is that the inertial mass and the gravitational mass are equal. Mathematically, if one takes the limit in which the mass of light goes to zero instead of setting it to zero from the start, it is not surprising that even a light ray bends around the Sun in the same way as other ponderable objects. It is natural that Newton, who thought that light was made up of tiny discrete particles (called corpuscles), had this idea. But Newton had to leave the question open because he had no way of confirming whether the mass of light was exactly zero.
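The trajectory of an object is determined by the inverse-square law of gravitation and Newton's second law of motion as follows:
$$ m\frac{d^2\vec{r}}{dt^2} = -\frac{GMm}{r^2}\hat{r}, \qquad (1) $$
where M is the mass of the Sun and m is the mass of the moving object.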
The above equation shows that the trajectory is independent of the mass of the moving object. Figure 1 shows how the object is deflected by the Sun through gravitation.
Because the initial speed
Meanwhile, the y-component of the change in velocity can be obtained by integrating the y-component of the acceleration as follows:
where b is the impact parameter, and
where
As early as 1804, Johann Georg von Soldner calculated the deflection angle of light passing around the Earth using the method introduced above (seemingly different, but essentially the same), and obtained one-half of the general relativistic value[16]. About 20 years earlier, around 1784, Henry Cavendish had obtained a similar result (not exactly the same as Soldner's, but the two agree at first order of approximation), although he never published it[17, 18].
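The Newtonian estimate described above can be checked numerically. The following is a minimal sketch, assuming the straight-line approximation x = ct, y = b for the unperturbed path and a ray grazing the Sun (b equal to the solar radius). The function name `newtonian_deflection_arcsec`, the integration window, and the rounded physical constants are illustrative choices, not taken from the paper.

```python
import math

# Rounded physical constants (SI units); illustrative values.
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30     # solar mass, kg
R_SUN = 6.957e8      # solar radius, m
C = 2.998e8          # speed of light, m/s
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def newtonian_deflection_arcsec(b=R_SUN, half_width_in_b=200.0, steps=40_000):
    """Integrate the transverse Newtonian acceleration along x = c t, y = b.

    Assumption: the path is treated as a straight line, so dt = dx / c.
    The integral over all x is truncated to |x| <= half_width_in_b * b.
    """
    x_max = half_width_in_b * b
    dx = 2.0 * x_max / steps
    delta_vy = 0.0
    for i in range(steps):
        x = -x_max + (i + 0.5) * dx          # midpoint rule
        r = math.hypot(x, b)
        a_r = G * M_SUN / r**2               # magnitude of inverse-square acceleration
        a_y = a_r * b / r                    # component toward the Sun (transverse)
        delta_vy += a_y * dx / C             # dt = dx / c
    # Small-angle approximation: deflection ~ |delta v_y| / c
    return (delta_vy / C) * ARCSEC_PER_RAD


if __name__ == "__main__":
    print(f"Newtonian deflection: {newtonian_deflection_arcsec():.3f} arcsec")
```

With these constants the sketch gives roughly 0.88 arcsec, which matches the closed form 2GM/(bc²) and is half of the general relativistic value.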
In 1911, almost one hundred years after Soldner's calculation, Albert Einstein obtained a similar result even before establishing the complete framework of GR[19]. Einstein considered the equivalence principle and his famous formula about the photon energy,
From a pedagogical point of view, it is helpful to introduce students to the above historical context, from Newton's original query to Einstein's approach, using only Newtonian mechanics and the special theory of relativity before introducing general relativistic corrections.
One of the biggest difficulties faced by students who have not yet learned general relativity is understanding that general relativity is a field theory. Although Newtonian mechanics also deals with continuous objects, it is basically a theory that describes the motion of point particles. That is, continuous objects can be reduced to collections of point particles in Newtonian mechanics. GR, on the other hand, is a field theory that describes gravitational phenomena using the ‘fields’ concept1 instead of ‘forces’. Although the particle concept also appears in field theory, and the interaction between particles and fields is sometimes of interest, the dynamical variables of a field theory are basically the fields. Moreover, fields are not reduced to particles in field theory. In GR, the gravitational field is expressed as a spacetime metric.
In a typical undergraduate physics curriculum, students learn Maxwell's electromagnetic theory, the prototype of classical field theory, so they can become acquainted with the field concept. However, unlike the electric and magnetic fields (which can usually be expressed as vector fields), the gravitational field is usually described by a tensor field of rank two. Of course, second-rank tensors are not completely unfamiliar to undergraduate students, because they also appear in electromagnetic theory (and even in Newtonian mechanics, for example, the tidal tensor or the strain tensor). But they are not the main dynamical variables in either electromagnetic theory or Newtonian mechanics. Therefore, it seems reasonable to assume that a second-rank tensor (as a field) is a relatively unfamiliar concept to average undergraduate students.
One way to overcome this difference between general relativity and Newtonian mechanics is to emphasize the role of spacetime metrics in the classical mechanics course. In other words, by simply introducing the spacetime metric in a classical mechanics course and explaining its role through a simple example, instead of introducing second-rank tensors in detail, the barrier that students feel in learning GR can be significantly lowered. The so-called geometrized Newtonian formulation is a good example of this strategy[8]. Students who have learned the Lagrangian formulation and the variational method are expected to know that one can derive the Newtonian equation of motion from the following action:
where m is the mass of a particle2, c is the speed of light, and ds is the line element (distance concept in spacetime) that determines the world-line of a particle in spacetime[20]. For a free particle, the line element is given as a Minkowski metric in the following form:
If the potential Φ is turned on, the equation of motion of a particle,
This is the first step of our approach. It is important for students to understand that the concept of a spacetime metric is not exclusive to general relativity. In other words, the spacetime metric can be introduced in an undergraduate-level classical mechanics course as another formalism of mechanics. In this framework, one can derive the equation of motion by applying the variational method to the action Eq. (5) with the line element Eq. (7), substituting -GM/r for Φ in the case of a point-like mass distribution.
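As a sketch of this first step, the expansion below shows how the Newtonian equation of motion emerges from the action. It assumes the signature convention $ds^2 = c^2dt^2 - dx^2 - dy^2 - dz^2$ and the weak-field line element written in the first line, which may differ in convention from Eqs. (5) to (7); it also assumes $v \ll c$ and $|\Phi| \ll c^2$, keeping only the leading terms.

```latex
\begin{align}
S &= -mc\int ds, \qquad
ds^2 = \left(1+\frac{2\Phi}{c^2}\right)c^2dt^2 - dx^2 - dy^2 - dz^2, \\
ds &= c\,dt\sqrt{1+\frac{2\Phi}{c^2}-\frac{v^2}{c^2}}
   \simeq c\,dt\left(1+\frac{\Phi}{c^2}-\frac{v^2}{2c^2}\right), \\
L &\simeq -mc^2 + \frac{1}{2}mv^2 - m\Phi, \\
m\frac{d^2\vec{r}}{dt^2} &= -m\nabla\Phi = -\frac{GMm}{r^2}\hat{r}
   \qquad \text{for } \Phi = -\frac{GM}{r}.
\end{align}
```

The constant term $-mc^2$ does not affect the Euler-Lagrange equations, so the familiar Lagrangian $\frac{1}{2}mv^2 - m\Phi$ is recovered at this order.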
The second step is to introduce the Schwarzschild metric as a solution to the field equation that describes the spacetime around the Sun. By substituting -GM/r in place of Φ in Eq. (7) and adding some explanations, we obtain the Schwarzschild metric as follows:3
Once the above Schwarzschild metric is obtained, it may be possible to visualize how the spacetime is curved by using embedding diagrams with
The third step is to introduce the trajectories of light in the geometrized Newtonian formulation. In general, defining a light ray is not an easy task, but in the context of general relativity, light can simply be defined as a physical object that satisfies the null condition,
where ˙ represents the derivative with respect to the affine parameter.
The fourth step is to apply the variational method. In order to use the Lagrangian formalism, we set the Lagrangian describing the path of light as follows:4
and after obtaining the equation of motion, we will substitute L=0 again. Students need to be reminded that the above Lagrangian is a function of six variables: t,
where, we formally introduced m, which has the mass dimension, and l, which has the angular momentum dimension, to treat this problem as if describing the motion of a massive particle. And the constant value E is defined as
At this stage, one could plot the effective potential experienced by light. Students could recognize Eq. (13) as a kind of energy conservation relation by comparing this with the energy conservation in the central-force problem.
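The effective potential mentioned here can be explored numerically. The following is a minimal sketch, assuming the potential takes the standard Schwarzschild form for light, V_eff(r) = (l²/2mr²)(1 − r_s/r) with r_s = 2GM/c²; the exact normalization in Eq. (13) may differ. The function names and the grid are hypothetical choices for illustration.

```python
# Rounded physical constants (SI units); illustrative values.
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30     # solar mass, kg
C = 2.998e8          # speed of light, m/s

R_S = 2.0 * G * M_SUN / C**2   # Schwarzschild radius of the Sun, about 3 km


def effective_potential(r, l=1.0, m=1.0):
    """Assumed effective potential for light: centrifugal term times (1 - r_s / r).

    l and m are the formal angular-momentum and mass parameters; their
    values only rescale the potential and do not move its maximum.
    """
    return l**2 / (2.0 * m * r**2) * (1.0 - R_S / r)


def peak_radius_in_rs(n=20_001, r_min_in_rs=1.0, r_max_in_rs=5.0):
    """Scan a uniform grid and return the radius of the maximum in units of r_s."""
    best_r, best_v = None, float("-inf")
    for i in range(n):
        r = R_S * (r_min_in_rs + (r_max_in_rs - r_min_in_rs) * i / (n - 1))
        v = effective_potential(r)
        if v > best_v:
            best_r, best_v = r, v
    return best_r / R_S


if __name__ == "__main__":
    print(f"r_s = {R_S:.0f} m, potential peaks near r = {peak_radius_in_rs():.3f} r_s")
```

Under this assumed form the potential peaks at r = 3GM/c² (1.5 r_s), far inside the Sun, so a ray grazing the solar surface stays in the weak-field region where the potential is dominated by the centrifugal term.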
Finally, by differentiating Eq. (13) with respect to the affine parameter, we obtain the equation of motion as follows:
where l(=mcb) is the angular momentum of light (See Fig. 1). Therefore, from the properties of the polar coordinates, the radial component of the acceleration is
Unlike the functional dependence of the acceleration on r in Eq. (1), this is
Hence, we finally obtain the deflection angle
This result agrees with the general relativistic prediction and with the observed value of about 1.75 arcsec for light grazing the Sun.
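The final integration can be checked numerically. The following is a minimal sketch, assuming that the radial acceleration has the standard Schwarzschild null-geodesic form a_r = −3GMb²/r⁴ (obtained with l = mcb), that the path is approximated by the straight line x = cλ, y = b, and that b equals the solar radius. The function names and rounded constants are illustrative, not taken from the paper.

```python
import math

# Rounded physical constants (SI units); illustrative values.
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30     # solar mass, kg
R_SUN = 6.957e8      # solar radius, m
C = 2.998e8          # speed of light, m/s
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def relativistic_deflection_arcsec(b=R_SUN, half_width_in_b=200.0, steps=40_000):
    """Integrate the transverse part of the assumed 1/r^4 radial acceleration.

    Assumptions: a_r = -3 G M b^2 / r^4 and a straight-line path, so the
    affine-parameter step is d(lambda) = dx / c.
    """
    x_max = half_width_in_b * b
    dx = 2.0 * x_max / steps
    delta_vy = 0.0
    for i in range(steps):
        x = -x_max + (i + 0.5) * dx               # midpoint rule
        r = math.hypot(x, b)
        a_r = 3.0 * G * M_SUN * b**2 / r**4       # magnitude of radial acceleration
        a_y = a_r * b / r                         # component toward the Sun
        delta_vy += a_y * dx / C
    # Small-angle approximation: deflection ~ |delta v_y| / c
    return (delta_vy / C) * ARCSEC_PER_RAD


def closed_form_arcsec(b=R_SUN):
    """Closed-form value 4 G M / (b c^2), converted to arcseconds."""
    return 4.0 * G * M_SUN / (b * C**2) * ARCSEC_PER_RAD


if __name__ == "__main__":
    print(f"numerical  : {relativistic_deflection_arcsec():.3f} arcsec")
    print(f"closed form: {closed_form_arcsec():.3f} arcsec")
```

Both numbers come out at about 1.75 arcsec. The factor of two relative to the Newtonian estimate arises because the integral of b³/(x² + b²)^(5/2) over the whole line equals 4/(3b), so the prefactor 3GM/c² yields 4GM/(bc²).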
In this approach, we tried to utilize the Newtonian type of equation of motion as much as possible and minimize the steps in which the approximations are used. As shown in Eq. (2), instead of considering the whole trajectory, we focused on obtaining the acceleration term (i.e., the change of the velocity) as the minimal information to obtain the deflection angle. A comparison to other approaches is shown in Fig. 2.
The approaches shown in typical GR textbooks fall largely into two groups: the geodesic equation approach[4, 5] and the energy integral approach[6, 7, 8, 9]. In the geodesic equation approach (a representative example is MTW's landmark book[4]), a second-order differential equation (a relativistic version of Binet's formula) that describes the whole trajectory of light is derived, and one then needs to solve the perturbed equations order by order. In the energy integral approach (a representative example is Weinberg's book[6]), the energy integral for radial motion, with an effective potential describing the whole trajectory of light, is obtained, and the deflection angle is then expressed in integral form. Here, a delicate power series expansion is necessary to make the expression integrable.
Both of these approaches involve calculations that are difficult for an elementary undergraduate course. For pedagogical purposes, hand-waving arguments have been suggested to avoid these complicated calculations. However, it is hard to say that these arguments let students experience a real calculation, because they insert arbitrary parameters by hand to double the final value[10, 11]. Our heuristic approach offers a third way that avoids both the hand-waving arguments and the complicated approximation techniques.
In this study, we presented a new heuristic derivation of the gravitational deflection angle of light around the Sun at the undergraduate level. From a pedagogical point of view, this approach is expected to provide a more plausible calculation than simply doubling the result of Newtonian gravity, as was done in several previous studies. Moreover, the computational details of this approach are much easier for undergraduate students to understand than directly solving the geodesic equation with complicated approximations.
Of course, even in this approach, requiring the Schwarzschild metric as a minimal prerequisite could be a heavy burden on undergraduate students. However, as described in Section 3, if the role of the spacetime metric is introduced to some extent through classical mechanics courses, the derivation presented in this study can be understood sufficiently well without a detailed understanding of GR as a field theory. Even for students who have no prior knowledge of the Lagrangian formalism or of the role of the spacetime metric in classical mechanics, the main calculation in this study can be introduced just by substituting the
One of the advantages of the approach presented in this study is that approximations are already used from the first stage of the computation. Although the higher-order terms are ignored, e.g., in Eq. (2), they do not affect the final result for the deflection angle; this is why our approach does not require complicated perturbative techniques. Another advantage is that it actively uses computational tools already familiar from classical mechanics, such as focusing on the acceleration term, the Lagrangian, cyclic coordinates, effective potentials, and familiar integrals of rational functions.
The authors expect that this study will be helpful to students with college physics-level knowledge and to instructors who have had difficulty explaining the gravitational deflection quantitatively at the introductory level. Similar attempts at undergraduate-level quantitative calculations can be made for other important pieces of observational evidence for GR, such as gravitational waves and the perihelion precession of Mercury. We leave these topics for future research.
The authors would like to thank the anonymous reviewers for their helpful comments. This work was supported by the 2-Year Research Grant of Pusan National University.
1The concept of 'fields' can carry various meanings depending on the context. In this paper, we use the term 'field' for a dynamical variable that is a function of space and time. In that sense, the so-called gravitational field in Newtonian mechanics is not referred to as a field in this article, since it describes only static situations.
2In this article, we consider the motion of light, i.e., a massless particle. Hence, in this case, m is to be understood as just a formal parameter of mass dimension. The value of m does not affect the final result.
3Of course, the coefficient of the
4This form of Lagrangian amounts to
5Of course, to be precise, one must follow the order of substituting the conserved quantities