Deriving Lorentz Transformation Using Graphical Method
Spacetime Diagram
I read this derivation in A First Course of General Relativity by Schutz. We are going to take light speed c=1, a dimensionaless constant. As before, we start off with the two special relativity postulates
- all laws of physics have the same form in all inertial frames
- light speed is constant for all inertial frames
Let O and O′ be two inertial frames in which O′ moves at a speed v in positive x direction relative to O. Let their origins be the same position initially. The t′-axis locates at where x′=0 in O′ frame. As O′ is moving at speed v, t′-axis should be a slope of v passing through O origin.
In O′, a light signal was sent at t′=−a at x′=0 to x′=a then reflected back to x′=0 at t′=a. As light travels at constand speed c=1 in all inertial frames, the worldlines have the same slopes in O frame. Their intersection should pass through the x′-axis at x′=a.
We would like to know the angle between the x′-axis and x-axis.
For the line PE, as we know the speed is c=1, slope of PE is 1. Then,
y=x+a(sinϕ−cosϕ).Similarly, for RP,
y=−x+a(cosϕ+sinϕ).Then, the coordinate of their intersection, E, will be
Px+a(sinϕ−cosϕ)=−Px+a(cosϕ+sinϕ)Px=acosϕPy=acosϕ+a(sinϕ−cosϕ)=asinϕ.Then,
tanϕ′=asinϕacosϕ=tanϕ. So, ϕ′=ϕ.We now know how to draw a spacetime diagram.
Space-time Interval
Define the spacetime interval as Δs2=−Δt2+Δx2+Δy2+Δz2. Similarly, the spacetime interval in O′ is Δs′2=−Δt′2+Δx′2+Δy′2+Δz′2. From the graphs, we know that Δx′α and Δxβ are related linearly, so the spaetime interval in O′ can be expressed as Δs′2=3∑α=03∑β=0Mαβ(Δxα)(Δxβ). For light, Δs2=0, Δt=Δr=[(Δx)2+(Δy)2+(Δz)2]1/2. In O′ frame, Δx′2=M00(Δr)2+2(3∑i=1M0iΔxi)Δr+3∑i=13∑j=1MijΔxiΔxj. Thus, in order for Δs′2=Δs2=0, we have M0i=0,i=1,2,3Mij=−M00δij,i,j=1,2,3. Therefore, Δs′2=M00(Δt2−Δx2−Δy2−Δz2). Define ϕ(→v)≡M00.
Suppose a rod lies on the y-axis. Let A be the event of the end of the rod at t=0 and B be the event of the start of the rod at t=0. Since y-axis is perpendicular to the motion of O′ frame, y-axis and y′-axis are parallel to each other. Hence event A and event B lie on the y′-axis as well, perpendicular to t′-axis. Hence, Δt′AB=ΔtAB=0. The spacetime interval in O′ frame is Δs′AB=(length of rod in O′)2. So, we have (length of rod in O′)2=ϕ(→v)(length of rod in O)2. As the same argument could be made for an inertial frame moving at −v relative to O positive x direction, ϕ(→v)=ϕ(v).
Let O″ be another inertial frame moving at −v with respect to x′ direction of O′. So, in fact, O″ is essentially identical to O frame. Δs″2=ϕ(v)Δs′2Δs′2=ϕ(v)Δs2. Substituting Δs′, we have Δs″2=ϕ(v)2Δs2=Δs2. Then, ϕ(v)=±1. As shown before, Δs′AB=(length of rod in O′)2=ϕ(→v)(length of rod in O)2. and length cannot be negative, ϕ(v)=1. We come to the conclusion that Δs′2=Δs2
Time Dilation
Consider events only on the tx plane. All events lie on the hyperbola −t2+x2=−1 have the same spacetime interval with respect to origin. In O′ frame, 1 unit of time stationary at x′=0 in O′ frame will have the spacetime interval −t′2=−1. As spacetime interval is the same in all inertail frames, as proven above, this event also lies on the hyperbola −t2+x2=−1. Thus, this event lies on the intersection of the hyperbola with the t′-axis (i.e. x′=0)
On that intersection, t is −t2+x2=−1, in which v=xt. So, 1 unit of time in O′ frame will be observed as t=1√1−v2 units of time in O frame. Thus, the duration in O′ observed in O will be Δt=Δt′√1−v2. This is the time dilation.
Length Contraction
At C, in O′, (x′,t′)=(l,0). The spacetime interval is −t2+x2=x′2=l2. The x′-axis is vx=t. So, we know at point C xC=l√1−v2tC=vl√1−v2. As we know the slope of BC is xC−xBtc−0=vxB=xC−vtC=l√1−v2−v2l√1−v2=l√1−v2. This is known as length contraction.
Lorentz Transformation
We know that x′α relates with xβ linearly. Let t′=αt+βxx′=γt+σxy′=yz′=z, where α, β, γ and sigma are constant. The t′-axis is a line with equation vt−x=0 and the x′-axis is a line with equation vx−t=0. At t′-axis (i.e. x′=0), t′=α(t+βαx)=0. At x′-axis (i.e. t′=0, t' &= \sigma\left(\frac{\gamma}{\sigma}t + x\right) = 0. So, we know that −βα=−γσ=v. Substitute into the equations, we have t′=α(t−vx)x′=σ(x−vt). By the invariance of the spacetime interval, −Δt′2+Δx2=−Δt2+Δx2α=±1√1−v2. When v=0, x′ was supposed to equal x, so the minus is rejected. Then we have the relation t′=1√1−v2(t−vx)x′=1√1−v2(x−vt)y′=yz′=z. This is known as the Lorentz transformation.
Comments
Post a Comment