Deriving Lorentz Transformation Using Graphical Method

Spacetime Diagram

I read this derivation in A First Course of General Relativity by Schutz. We are going to take light speed c=1, a dimensionaless constant. As before, we start off with the two special relativity postulates

  • all laws of physics have the same form in all inertial frames
  • light speed is constant for all inertial frames

Let O and O be two inertial frames in which O moves at a speed v in positive x direction relative to O. Let their origins be the same position initially. The t-axis locates at where x=0 in O frame. As O is moving at speed v, t-axis should be a slope of v passing through O origin.

In O, a light signal was sent at t=a at x=0 to x=a then reflected back to x=0 at t=a. As light travels at constand speed c=1 in all inertial frames, the worldlines have the same slopes in O frame. Their intersection should pass through the x-axis at x=a.

We would like to know the angle between the x-axis and x-axis.

For the line PE, as we know the speed is c=1, slope of PE is 1. Then,

y=x+a(sinϕcosϕ).

Similarly, for RP,

y=x+a(cosϕ+sinϕ).

Then, the coordinate of their intersection, E, will be

Px+a(sinϕcosϕ)=Px+a(cosϕ+sinϕ)Px=acosϕPy=acosϕ+a(sinϕcosϕ)=asinϕ.

Then,

tanϕ=asinϕacosϕ=tanϕ. So, ϕ=ϕ.

We now know how to draw a spacetime diagram.

Space-time Interval

Define the spacetime interval as Δs2=Δt2+Δx2+Δy2+Δz2. Similarly, the spacetime interval in O is Δs2=Δt2+Δx2+Δy2+Δz2. From the graphs, we know that Δxα and Δxβ are related linearly, so the spaetime interval in O can be expressed as Δs2=3α=03β=0Mαβ(Δxα)(Δxβ). For light, Δs2=0, Δt=Δr=[(Δx)2+(Δy)2+(Δz)2]1/2. In O frame, Δx2=M00(Δr)2+2(3i=1M0iΔxi)Δr+3i=13j=1MijΔxiΔxj. Thus, in order for Δs2=Δs2=0, we have M0i=0,i=1,2,3Mij=M00δij,i,j=1,2,3. Therefore, Δs2=M00(Δt2Δx2Δy2Δz2). Define ϕ(v)M00.

Suppose a rod lies on the y-axis. Let A be the event of the end of the rod at t=0 and B be the event of the start of the rod at t=0. Since y-axis is perpendicular to the motion of O frame, y-axis and y-axis are parallel to each other. Hence event A and event B lie on the y-axis as well, perpendicular to t-axis. Hence, ΔtAB=ΔtAB=0. The spacetime interval in O frame is ΔsAB=(length of rod in O)2. So, we have (length of rod in O)2=ϕ(v)(length of rod in O)2. As the same argument could be made for an inertial frame moving at v relative to O positive x direction, ϕ(v)=ϕ(v).

Let O be another inertial frame moving at v with respect to x direction of O. So, in fact, O is essentially identical to O frame. Δs2=ϕ(v)Δs2Δs2=ϕ(v)Δs2. Substituting Δs, we have Δs2=ϕ(v)2Δs2=Δs2. Then, ϕ(v)=±1. As shown before, ΔsAB=(length of rod in O)2=ϕ(v)(length of rod in O)2. and length cannot be negative, ϕ(v)=1. We come to the conclusion that Δs2=Δs2

Time Dilation

Consider events only on the tx plane. All events lie on the hyperbola t2+x2=1 have the same spacetime interval with respect to origin. In O frame, 1 unit of time stationary at x=0 in O frame will have the spacetime interval t2=1. As spacetime interval is the same in all inertail frames, as proven above, this event also lies on the hyperbola t2+x2=1. Thus, this event lies on the intersection of the hyperbola with the t-axis (i.e. x=0)

On that intersection, t is t2+x2=1, in which v=xt. So, 1 unit of time in O frame will be observed as t=11v2 units of time in O frame. Thus, the duration in O observed in O will be Δt=Δt1v2. This is the time dilation.

Length Contraction

At C, in O, (x,t)=(l,0). The spacetime interval is t2+x2=x2=l2. The x-axis is vx=t. So, we know at point C xC=l1v2tC=vl1v2. As we know the slope of BC is xCxBtc0=vxB=xCvtC=l1v2v2l1v2=l1v2. This is known as length contraction.

Lorentz Transformation

We know that xα relates with xβ linearly. Let t=αt+βxx=γt+σxy=yz=z, where α, β, γ and sigma are constant. The t-axis is a line with equation vtx=0 and the x-axis is a line with equation vxt=0. At t-axis (i.e. x=0), t=α(t+βαx)=0. At x-axis (i.e. t=0, t' &= \sigma\left(\frac{\gamma}{\sigma}t + x\right) = 0. So, we know that βα=γσ=v. Substitute into the equations, we have t=α(tvx)x=σ(xvt). By the invariance of the spacetime interval, Δt2+Δx2=Δt2+Δx2α=±11v2. When v=0, x was supposed to equal x, so the minus is rejected. Then we have the relation t=11v2(tvx)x=11v2(xvt)y=yz=z. This is known as the Lorentz transformation.

Comments

Popular Posts