Mascot image.
#Math#Vectors#Geometry

Solids, Planes, and Systems in Space

In Lesson 2AM we built the vector space R3\mathbb{R}^3, recovered the Euclidean metric through iterated Pythagoras, and worked out which collections of space vectors are linearly independent. What was still missing was any genuine use of that machinery against solid figures: tetrahedra, planes fixed by three non-aligned points, cevians viewed as segments inside a triangular slice of space. We now put the algebra to work. Each definition that follows is chosen so that the object it introduces earns its keep in the proof it enables, and each proof settles a question that space geometry had previously left open.

Displacement Vectors in Space

The first thing space geometry asks of us is a way to describe the relative position of two points without forcing the origin into every calculation. In Lesson 1PM we did exactly this in the plane by attaching the displacement ba\mathbf{b} - \mathbf{a} to the directed segment AB\overrightarrow{AB}. The construction transplants into R3\mathbb{R}^3 component by component.

Definition 28 (Displacement Vector in Space)

Let AA and BB be points in R3\mathbb{R}^3 with position vectors a\mathbf{a} and b\mathbf{b}. The displacement vector associated with the directed segment AB\overrightarrow{AB} is

AB=ba.\overrightarrow{AB} = \mathbf{b} - \mathbf{a}.

In coordinates, if A=(a1,a2,a3)A = (a_1, a_2, a_3) and B=(b1,b2,b3)B = (b_1, b_2, b_3), then

AB=[b1a1b2a2b3a3].\overrightarrow{AB} = \begin{bmatrix} b_1 - a_1 \\ b_2 - a_2 \\ b_3 - a_3 \end{bmatrix}.

Because the algebraic laws of addition and scalar multiplication are identical in R2\mathbb{R}^2 and R3\mathbb{R}^3, every theorem from Lesson 1PM whose proof used only those laws transfers without a single altered line. Three pieces will matter for the rest of this lesson:

  • Midpoint Formula: the midpoint MM of AB\overline{AB} has position vector m=12(a+b)\mathbf{m} = \tfrac{1}{2}(\mathbf{a} + \mathbf{b}).
  • Collinearity: three points AA, BB, CC are collinear if and only if AC=kAB\overrightarrow{AC} = k\,\overrightarrow{AB} for some scalar kk.
  • Parametric Line Equation: the line through distinct AA and BB consists of the points XX with x=(1t)a+tb\mathbf{x} = (1-t)\mathbf{a} + t\mathbf{b}, and the interpretation of tt fixed in Lesson 1PM is unchanged.

A triangle in space still lies in a unique plane, so any statement about a triangle proved purely from position vectors is inherited at once. In particular, the concurrency of medians at the centroid holds for a triangle placed anywhere in R3\mathbb{R}^3: the proof in Lesson 1PM never touched a second coordinate.

Example 32 (Displacements Between Space Points)

Let A=(1,2,1)A = (1, 2, -1), B=(4,0,3)B = (4, 0, 3), and C=(7,2,7)C = (7, -2, 7). Then

AB=[324],AC=[648]=2AB.\overrightarrow{AB} = \begin{bmatrix} 3 \\ -2 \\ 4 \end{bmatrix}, \qquad \overrightarrow{AC} = \begin{bmatrix} 6 \\ -4 \\ 8 \end{bmatrix} = 2\,\overrightarrow{AB}.

The collinearity criterion is satisfied, so AA, BB, CC lie on a common line in space, and BB is in fact the midpoint of AC\overline{AC} since the position vector of BB equals 12(a+c)\tfrac{1}{2}(\mathbf{a} + \mathbf{c}).

The Tetrahedron

With displacements and lines available, we can turn to a figure that cannot be drawn in the plane at all. A triangle is the smallest closed figure the plane admits; in space the smallest closed solid requires a fourth vertex lifted out of the plane of the first three.

Definition 29 (Tetrahedron)

A tetrahedron is the configuration determined by four points AA, BB, CC, DD in R3\mathbb{R}^3 that are not coplanar. The points are its vertices; the four triangles ABCABC, BCDBCD, CDACDA, DABDAB are its faces; the six segments ABAB, ACAC, ADAD, BCBC, BDBD, CDCD are its edges. Two edges are opposite when they share no vertex, and a vertex is opposite to a face when it is not one of the three vertices of that face.

The requirement that the four points not be coplanar is the spatial analogue of the non-collinearity demanded of the vertices of a triangle, and Lesson 2AM already supplies the exact test: the displacements AB\overrightarrow{AB}, AC\overrightarrow{AC}, AD\overrightarrow{AD} must be linearly independent. Every tetrahedron therefore comes with a built-in certificate of its own three-dimensionality.

A triangle has three medians. A tetrahedron carries two families of distinguished internal segments.

Definition 30 (Medians and Bimedians of a Tetrahedron)

Let ABCDABCD be a tetrahedron.

  • A median is the segment joining a vertex to the centroid of the opposite face.
  • A bimedian is the segment joining the midpoints of two opposite edges.

There are four medians (one per vertex) and three bimedians (one per pair of opposite edges). Seven internal segments is a lot to keep track of, and the next theorem is precisely what keeps the picture tractable.

Theorem 32 (Centroid of a Tetrahedron)

Let ABCDABCD be a tetrahedron with position vectors a,b,c,d\mathbf{a}, \mathbf{b}, \mathbf{c}, \mathbf{d}. The three bimedians and the four medians are all distinct, yet they pass through a single common point

m=14(a+b+c+d),\mathbf{m} = \tfrac{1}{4}(\mathbf{a} + \mathbf{b} + \mathbf{c} + \mathbf{d}),

called the centroid of the tetrahedron. The centroid divides each median in the ratio 3:13 : 1, measured from the vertex towards the opposite face.

Proof

Consider first the bimedian joining the midpoint UU of edge BCBC to the midpoint VV of the opposite edge ADAD. By the midpoint formula,

u=12(b+c),v=12(a+d).\mathbf{u} = \tfrac{1}{2}(\mathbf{b} + \mathbf{c}), \qquad \mathbf{v} = \tfrac{1}{2}(\mathbf{a} + \mathbf{d}).

The midpoint of the segment UV\overline{UV} is therefore

12(u+v)=14(a+b+c+d)=m.\tfrac{1}{2}(\mathbf{u} + \mathbf{v}) = \tfrac{1}{4}(\mathbf{a} + \mathbf{b} + \mathbf{c} + \mathbf{d}) = \mathbf{m}.

The expression on the right treats a,b,c,d\mathbf{a}, \mathbf{b}, \mathbf{c}, \mathbf{d} symmetrically, so the argument by symmetry from Lesson 1PM forces m\mathbf{m} to be the midpoint of the other two bimedians as well. All three bimedians therefore meet at MM.

Now fix the median from AA to the centroid EE of the opposite face BCDBCD. That face centroid is e=13(b+c+d)\mathbf{e} = \tfrac{1}{3}(\mathbf{b} + \mathbf{c} + \mathbf{d}), and we can rewrite m\mathbf{m} as

m=14a+34(b+c+d3)=14a+34e.\mathbf{m} = \tfrac{1}{4}\mathbf{a} + \tfrac{3}{4}\left(\frac{\mathbf{b} + \mathbf{c} + \mathbf{d}}{3}\right) = \tfrac{1}{4}\mathbf{a} + \tfrac{3}{4}\mathbf{e}.

By the parametric line equation, MM lies on the segment AE\overline{AE} at parameter t=3/4t = 3/4, so MM divides AEAE in the ratio 3:13 : 1 from vertex to face. Symmetry hands us the same conclusion for the medians issuing from BB, CC, and DD.

A tetrahedron ABCD showing a bimedian UV, a median from A to the centroid E of the opposite face, and their common intersection at the centroid M

The figure above fixes the labels used in the proof: UU is the midpoint of BCBC, VV the midpoint of ADAD, and EE the centroid of face BCDBCD. The point MM is where the bimedian UVUV and the median AEAE intersect, and the same intersection holds for every other pair of median and bimedian by symmetry.

Example 33 (Centroid of a Concrete Tetrahedron)

Let A=(0,0,0)A = (0, 0, 0), B=(4,0,0)B = (4, 0, 0), C=(0,6,0)C = (0, 6, 0), D=(0,0,8)D = (0, 0, 8). These four points form a tetrahedron because AB\overrightarrow{AB}, AC\overrightarrow{AC}, AD\overrightarrow{AD} are linearly independent (each lies along a distinct coordinate axis). Its centroid is

m=14((0,0,0)+(4,0,0)+(0,6,0)+(0,0,8))=(1,32,2).\mathbf{m} = \tfrac{1}{4}\bigl((0,0,0) + (4,0,0) + (0,6,0) + (0,0,8)\bigr) = \left(1,\, \tfrac{3}{2},\, 2\right).

The centroid EE of face BCDBCD is e=13(4,6,8)=(43,2,83)\mathbf{e} = \tfrac{1}{3}(4,6,8) = \left(\tfrac{4}{3},\, 2,\, \tfrac{8}{3}\right), and the identity 14a+34e=(1,32,2)=m\tfrac{1}{4}\mathbf{a} + \tfrac{3}{4}\mathbf{e} = \left(1,\, \tfrac{3}{2},\, 2\right) = \mathbf{m} confirms the 3:13 : 1 division along the median AEAE.

Problem 20

The vertices of a tetrahedron are A=(2,1,1)A = (2, 1, -1), B=(5,3,0)B = (5, 3, 0), C=(1,4,2)C = (1, 4, 2), D=(0,1,3)D = (0, -1, 3). Compute the centroid, verify that the midpoint of the bimedian joining the midpoints of ABAB and CDCD coincides with it, and locate the point at which the median from AA meets the opposite face.

Planes, Barycentric Coordinates, and Ceva’s Theorem

The proof above quietly used a fact that deserves to be isolated: the average of three vertex vectors is the centroid of the triangular face they span, and more generally every point of that face is a weighted average of the three vertices. This is the geometric motivation behind the parametric equation of a plane, which plays the same structural role in R3\mathbb{R}^3 as the parametric line equation did in R2\mathbb{R}^2.

The tetrahedron proof has already shown the first instance of this idea. When we wrote the centroid of face BCDBCD as

e=13(b+c+d),\mathbf{e} = \tfrac{1}{3}(\mathbf{b} + \mathbf{c} + \mathbf{d}),

we were using a weighted average of the three vertices of that face. That is not a special trick reserved for centroids. More generally, every point in the plane determined by three non-collinear points can be described by weighting those three vertices, provided the weights add to 11. This is the planar analogue of the parametric line equation: on a line through AA and BB, the coefficients of a\mathbf{a} and b\mathbf{b} add to 11; on a plane through AA, BB, and CC, the coefficients of a\mathbf{a}, b\mathbf{b}, and c\mathbf{c} do the same.

Theorem 33 (Parametric Equation of a Plane)

Let AA, BB, CC be three non-collinear points in R3\mathbb{R}^3 with position vectors a,b,c\mathbf{a}, \mathbf{b}, \mathbf{c}. A point XX with position vector x\mathbf{x} lies on the unique plane Π\Pi through AA, BB, CC if and only if there exist scalars rr, ss, tt satisfying

x=ra+sb+tcwithr+s+t=1.\mathbf{x} = r\mathbf{a} + s\mathbf{b} + t\mathbf{c} \qquad \text{with} \qquad r + s + t = 1.
Proof

XX lies on Π\Pi precisely when AX\overrightarrow{AX} is a linear combination of AB\overrightarrow{AB} and AC\overrightarrow{AC}, because AA, BB, CC are non-collinear and the two displacements are linearly independent. Expanding this condition,

xa=s(ba)+t(ca),\mathbf{x} - \mathbf{a} = s(\mathbf{b} - \mathbf{a}) + t(\mathbf{c} - \mathbf{a}),

and collecting terms gives

x=(1st)a+sb+tc.\mathbf{x} = (1 - s - t)\mathbf{a} + s\mathbf{b} + t\mathbf{c}.

Setting r=1str = 1 - s - t yields the stated form with r+s+t=1r + s + t = 1. The converse is immediate: such a combination rearranges back into xa=s(ba)+t(ca)\mathbf{x} - \mathbf{a} = s(\mathbf{b}-\mathbf{a}) + t(\mathbf{c}-\mathbf{a}), placing XX on Π\Pi.

The triple (r,s,t)(r, s, t) is called the set of barycentric coordinates of XX relative to ABC\triangle ABC. Physically, they are exactly the weights one would place at AA, BB, CC so that the centre of mass of the weighted system sits at XX. When r=s=t=1/3r = s = t = 1/3 the centre of mass is the centroid, which recovers the formula we already used for the face centroid in the tetrahedron proof. This is not a coincidence: the tetrahedron centroid was constructed out of face centroids, and the face centroid is the simplest barycentric point.

Once points in a triangle are written in this weighted form, ratios along cevians become algebraic relations among the coefficients. With that perspective, the next theorem becomes almost purely algebraic, even though it is one of the oldest and most elegant results about triangles.

Theorem 34 (Ceva's Theorem)

Let ABC\triangle ABC be a triangle, and let FF, GG, HH be points in the interiors of the sides BCBC, CACA, ABAB respectively. Set

λ=dr(B,C;F),μ=dr(C,A;G),ν=dr(A,B;H),\lambda = \mathrm{dr}(B, C;\, F), \qquad \mu = \mathrm{dr}(C, A;\, G), \qquad \nu = \mathrm{dr}(A, B;\, H),

using the directed ratio from Lesson 1PM. The cevians AFAF, BGBG, CHCH are concurrent if and only if

λμν=1.\lambda\,\mu\,\nu = 1.
Proof

Let XX be a point in the plane of ABC\triangle ABC, written in barycentric form as

x=αa+βb+γc,α+β+γ=1.\mathbf{x} = \alpha\mathbf{a} + \beta\mathbf{b} + \gamma\mathbf{c}, \qquad \alpha + \beta + \gamma = 1.

Since dr(B,C;F)=λ\mathrm{dr}(B, C;\, F) = \lambda, the directed-ratio formula from Lesson 1PM gives

f=b+λc1+λ.\mathbf{f} = \frac{\mathbf{b} + \lambda\mathbf{c}}{1+\lambda}.

Thus XX lies on the cevian AFAF if and only if

x=(1m)a+mf\mathbf{x} = (1-m)\mathbf{a} + m\mathbf{f}

for some scalar mm. Substituting the expression for f\mathbf{f} shows that the coefficients of b\mathbf{b} and c\mathbf{c} in such a point satisfy

γβ=λ.\frac{\gamma}{\beta} = \lambda.

In the same way,

XBG    αγ=μ,XCH    βα=ν.X \in BG \iff \frac{\alpha}{\gamma} = \mu, \qquad X \in CH \iff \frac{\beta}{\alpha} = \nu.

Suppose the three cevians meet at a common point. Then the barycentric weights of that point satisfy all three ratio conditions simultaneously, so

λμν=γβαγβα=1.\lambda\,\mu\,\nu = \frac{\gamma}{\beta} \cdot \frac{\alpha}{\gamma} \cdot \frac{\beta}{\alpha} = 1.

Conversely, assume λμν=1\lambda\,\mu\,\nu = 1. Because FF, GG, HH lie on the sides of the triangle, the directed ratios λ\lambda, μ\mu, ν\nu are positive. Define

x=λμa+b+λc1+λ+λμ.\mathbf{x} = \frac{\lambda\mu\,\mathbf{a} + \mathbf{b} + \lambda\,\mathbf{c}}{1 + \lambda + \lambda\mu}.

This is a genuine point of the plane because the denominator is positive. Its barycentric coordinates are proportional to (λμ,1,λ)(\lambda\mu,\, 1,\, \lambda), so

γβ=λ,αγ=μ,βα=1λμ=ν,\frac{\gamma}{\beta} = \lambda, \qquad \frac{\alpha}{\gamma} = \mu, \qquad \frac{\beta}{\alpha} = \frac{1}{\lambda\mu} = \nu,

where the last equality uses λμν=1\lambda\,\mu\,\nu = 1. Each ratio places XX on the corresponding cevian, so AFAF, BGBG, CHCH meet at XX.

Remark

This is the affine version of Ceva’s theorem for points chosen on the actual sides of the triangle. If one allows FF, GG, HH to range over the full lines determined by those sides, then the condition λμν=1\lambda\mu\nu = 1 also includes an exceptional case in which the three cevians are parallel. We do not pursue that extension here.

A triangle ABC with cevians AF, BG, CH meeting at a common interior point D
Remark

Setting λ=μ=ν=1\lambda = \mu = \nu = 1 forces FF, GG, HH to be the midpoints of their respective sides, turning the cevians into medians. Since 111=11 \cdot 1 \cdot 1 = 1, Ceva’s theorem instantly reproduces the concurrency of medians proved in Lesson 1PM. The centroid is not an isolated coincidence, but the simplest worked example of a general principle about cevians.

Example 34 (Enforcing Concurrency)

In a triangle ABCABC, place FF on BCBC with dr(B,C;F)=2\mathrm{dr}(B, C;\, F) = 2 and GG on CACA with dr(C,A;G)=3\mathrm{dr}(C, A;\, G) = 3. For the three cevians AFAF, BGBG, CHCH to pass through a common point, Ceva’s theorem demands

23ν=1,soν=16.2 \cdot 3 \cdot \nu = 1, \qquad \text{so} \qquad \nu = \tfrac{1}{6}.

Equivalently, HH divides AB\overline{AB} so that AH=16HB\overrightarrow{AH} = \tfrac{1}{6}\,\overrightarrow{HB}, i.e. HH sits one seventh of the way from AA to BB. Any other placement of HH produces cevians that fail to share a single intersection.

Problem 21

In a triangle ABCABC, let FF be the midpoint of BCBC and let GG divide CACA with dr(C,A;G)=2\mathrm{dr}(C, A;\, G) = 2. Determine the location of HH on line ABAB for which the cevians AFAF, BGBG, CHCH are concurrent, and state the result using the directed ratio convention from Lesson 1PM.

The Dot Product in Space

Everything to this point has been affine: positions, displacements, ratios along lines, concurrence. To say anything about lengths and angles inside a spatial figure, we need the dot product again, and we need it for three-component vectors. The extension is dictated by the component structure set up in Lesson 2AM.

Definition 31 (Dot Product in Space)

Let a=[a1a2a3]\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \\ a_3 \end{bmatrix} and b=[b1b2b3]\mathbf{b} = \begin{bmatrix} b_1 \\ b_2 \\ b_3 \end{bmatrix} be vectors in R3\mathbb{R}^3. Their dot product is the real number

ab=a1b1+a2b2+a3b3.\mathbf{a} \cdot \mathbf{b} = a_1 b_1 + a_2 b_2 + a_3 b_3.

The only change from Lesson 1PM is the presence of a third summand. As in the planar case, the self dot product recovers the square of the magnitude fixed in Lesson 2AM:

aa=a12+a22+a32=a2,\mathbf{a} \cdot \mathbf{a} = a_1^2 + a_2^2 + a_3^2 = \|\mathbf{a}\|^2,

so the Euclidean distance between space points can be written as AB=ba|AB| = \|\mathbf{b} - \mathbf{a}\|, bridging the dot product and the distance formula from Lesson 2AM.

Theorem 35 (Algebraic Properties of the Dot Product in Space)

For all a,b,cR3\mathbf{a}, \mathbf{b}, \mathbf{c} \in \mathbb{R}^3 and all rRr \in \mathbb{R}:

  1. Symmetry: ab=ba\mathbf{a} \cdot \mathbf{b} = \mathbf{b} \cdot \mathbf{a}.
  2. Bilinearity: (ra+b)c=r(ac)+bc(r\mathbf{a} + \mathbf{b}) \cdot \mathbf{c} = r(\mathbf{a} \cdot \mathbf{c}) + \mathbf{b} \cdot \mathbf{c}, and symmetrically in the second argument.
  3. Positive Definiteness: aa0\mathbf{a} \cdot \mathbf{a} \ge 0, with equality if and only if a=0\mathbf{a} = \mathbf{0}.
Proof

The component-wise verification from Lesson 1PM gains only one extra term per identity when the vectors carry three components instead of two, and each identity is preserved under that addition. No new argument is required.

Metric Identities in Space

Because the algebraic laws of the dot product are the same in R2\mathbb{R}^2 and R3\mathbb{R}^3, the entire chain of metric inequalities already proved in Lesson 1PM rebroadcasts into space with no further work.

Theorem 36 (Metric Identities in Space)

Let a,bR3\mathbf{a}, \mathbf{b} \in \mathbb{R}^3 and rRr \in \mathbb{R}. Then:

  1. Homogeneity: ra=ra\|r\mathbf{a}\| = |r|\,\|\mathbf{a}\|.
  2. Parallelogram Law: a+b2+ab2=2(a2+b2)\|\mathbf{a} + \mathbf{b}\|^2 + \|\mathbf{a} - \mathbf{b}\|^2 = 2\bigl(\|\mathbf{a}\|^2 + \|\mathbf{b}\|^2\bigr).
  3. Cauchy-Schwarz Inequality: abab|\mathbf{a} \cdot \mathbf{b}| \le \|\mathbf{a}\|\,\|\mathbf{b}\|.
  4. Triangle Inequality: a+ba+b\|\mathbf{a} + \mathbf{b}\| \le \|\mathbf{a}\| + \|\mathbf{b}\|.
  5. Law of Cosines: ab2=a2+b22(ab)\|\mathbf{a} - \mathbf{b}\|^2 = \|\mathbf{a}\|^2 + \|\mathbf{b}\|^2 - 2(\mathbf{a} \cdot \mathbf{b}).
Proof

Each item was proved in Lesson 1PM using nothing beyond symmetry, bilinearity, and positive definiteness of the dot product, together with the identity v=vv\|\mathbf{v}\| = \sqrt{\mathbf{v} \cdot \mathbf{v}}. All three ingredients hold in R3\mathbb{R}^3 by the preceding theorem, so the planar proofs apply verbatim.

The absence of any new work here is itself the punchline. Metric geometry in space inherits metric geometry in the plane once the algebraic framework is in place; and the reason is simple enough to state precisely. Any triangle drawn inside a spatial figure lies in a plane of its own, and every metric inequality above is a statement about pairs of vectors that collectively span at most a single plane.

Angles, Orthogonality, and Projection

Because the Cauchy-Schwarz inequality still holds, the quantity abab\dfrac{\mathbf{a} \cdot \mathbf{b}}{\|\mathbf{a}\|\,\|\mathbf{b}\|} remains in [1,1][-1, 1] for all non-zero a,bR3\mathbf{a}, \mathbf{b} \in \mathbb{R}^3. The definition of angle used in Lesson 1PM lifts to space without modification.

Definition 32 (Angle Between Space Vectors)

For non-zero a,bR3\mathbf{a}, \mathbf{b} \in \mathbb{R}^3, the angle θ[0,π]\theta \in [0, \pi] between them is the unique number satisfying

cosθ=abab.\cos\theta = \frac{\mathbf{a} \cdot \mathbf{b}}{\|\mathbf{a}\|\,\|\mathbf{b}\|}.

Two space vectors are orthogonal, written ab\mathbf{a} \perp \mathbf{b}, when ab=0\mathbf{a} \cdot \mathbf{b} = 0, which agrees with the planar convention from Lesson 1PM.

This is consistent with the geometric angle AOB\angle AOB formed by the segments OAOA and OBOB: the triangle OABOAB lies in the unique plane determined by a\mathbf{a} and b\mathbf{b}, and inside that plane the angle reduces to the planar one. The ambient third dimension plays no role.

Example 35 (A Generic Space Angle)

Let

u=[212],v=[112].\mathbf{u} = \begin{bmatrix} 2 \\ 1 \\ 2 \end{bmatrix}, \qquad \mathbf{v} = \begin{bmatrix} 1 \\ 1 \\ -2 \end{bmatrix}.

Then

uv=2+14=1,u=3,v=6.\mathbf{u} \cdot \mathbf{v} = 2 + 1 - 4 = -1, \qquad \|\mathbf{u}\| = 3, \qquad \|\mathbf{v}\| = \sqrt{6}.

Hence the angle θ\theta between them satisfies

cosθ=136,\cos\theta = -\frac{1}{3\sqrt{6}},

so

θ=arccos ⁣(136)97.8.\theta = \arccos\!\left(-\frac{1}{3\sqrt{6}}\right) \approx 97.8^\circ.

The negative cosine confirms that the angle is obtuse.

Example 36 (A Right-Angled Corner in a Tetrahedron)

Return to the tetrahedron with A=(0,0,0)A = (0, 0, 0), B=(4,0,0)B = (4, 0, 0), C=(0,6,0)C = (0, 6, 0), D=(0,0,8)D = (0, 0, 8). The edges AB\overrightarrow{AB} and AC\overrightarrow{AC} meet at vertex AA, and the angle between them satisfies

cosθ=ABACABAC=046=0,\cos\theta = \frac{\overrightarrow{AB} \cdot \overrightarrow{AC}}{\|\overrightarrow{AB}\|\,\|\overrightarrow{AC}\|} = \frac{0}{4 \cdot 6} = 0,

so θ=π/2\theta = \pi/2. A parallel computation shows that ABAD\overrightarrow{AB} \perp \overrightarrow{AD} and ACAD\overrightarrow{AC} \perp \overrightarrow{AD}, so the three edges meeting at AA form a right-angled corner, which is exactly the expected behaviour once each edge is placed along a distinct coordinate axis.

Example 37 (The Acute Angle Between Two Cube Diagonals)

Consider the unit cube in R3\mathbb{R}^3 with vertices (x,y,z)(x,y,z) satisfying 0x,y,z10 \le x,y,z \le 1. Take the diagonal from

P=(0,0,0)toQ=(1,1,1),P = (0,0,0) \qquad \text{to} \qquad Q = (1,1,1),

and the diagonal from

R=(0,1,1)toS=(1,0,0).R = (0,1,1) \qquad \text{to} \qquad S = (1,0,0).

The corresponding displacement vectors are

x=[111],y=[111].\mathbf{x} = \begin{bmatrix} 1 \\ 1 \\ 1 \end{bmatrix}, \qquad \mathbf{y} = \begin{bmatrix} 1 \\ -1 \\ -1 \end{bmatrix}.

Hence

xy=111=1,x=3,y=3.\mathbf{x} \cdot \mathbf{y} = 1 - 1 - 1 = -1, \qquad \|\mathbf{x}\| = \sqrt{3}, \qquad \|\mathbf{y}\| = \sqrt{3}.

So

cosθ=13.\cos\theta = \frac{-1}{3}.

This gives the obtuse angle between the chosen directed diagonals. The acute angle between the two diagonal lines is therefore

arccos ⁣(13)70.5.\arccos\!\left(\frac{1}{3}\right) \approx 70.5^\circ.
Problem 22

Find two non-zero vectors perpendicular to

[122]\begin{bmatrix} 1 \\ 2 \\ 2 \end{bmatrix}

that are not scalar multiples of each other. Then find one unit vector perpendicular to this vector, and describe geometrically the full set of all vectors perpendicular to it.

Remark (Optional: The Law of Sines from Dot Products)

The dot product machinery already gives a clean proof of the Law of Sines. Take a triangle and direct its three sides cyclically by vectors x\mathbf{x}, y\mathbf{y}, z\mathbf{z}, so that

x+y+z=0.\mathbf{x} + \mathbf{y} + \mathbf{z} = \mathbf{0}.

At the vertex where the sides x-\mathbf{x} and y\mathbf{y} meet, let the interior angle be θ\theta, and let the opposite side have length L=zL = \|\mathbf{z}\|. By the angle formula,

cosθ=(x)yxy=xyxy,\cos\theta = \frac{(-\mathbf{x}) \cdot \mathbf{y}}{\|\mathbf{x}\|\,\|\mathbf{y}\|} = -\frac{\mathbf{x} \cdot \mathbf{y}}{\|\mathbf{x}\|\,\|\mathbf{y}\|},

so

(sinθL)2=1cos2θz2=(xx)(yy)(xy)2(xx)(yy)(zz).\left(\frac{\sin\theta}{L}\right)^2 = \frac{1 - \cos^2\theta}{\|\mathbf{z}\|^2} = \frac{(\mathbf{x} \cdot \mathbf{x})(\mathbf{y} \cdot \mathbf{y}) - (\mathbf{x} \cdot \mathbf{y})^2}{(\mathbf{x} \cdot \mathbf{x})(\mathbf{y} \cdot \mathbf{y})(\mathbf{z} \cdot \mathbf{z})}.

Now use z=xy\mathbf{z} = -\mathbf{x} - \mathbf{y}. A direct expansion shows

(xx)(yy)(xy)2=(xy)(xz)+(yx)(yz)+(zx)(zy).(\mathbf{x} \cdot \mathbf{x})(\mathbf{y} \cdot \mathbf{y}) - (\mathbf{x} \cdot \mathbf{y})^2 = (\mathbf{x} \cdot \mathbf{y})(\mathbf{x} \cdot \mathbf{z}) + (\mathbf{y} \cdot \mathbf{x})(\mathbf{y} \cdot \mathbf{z}) + (\mathbf{z} \cdot \mathbf{x})(\mathbf{z} \cdot \mathbf{y}).

Hence

(sinθL)2=(xy)(xz)+(yx)(yz)+(zx)(zy)(xx)(yy)(zz).\left(\frac{\sin\theta}{L}\right)^2 = \frac{ (\mathbf{x} \cdot \mathbf{y})(\mathbf{x} \cdot \mathbf{z}) + (\mathbf{y} \cdot \mathbf{x})(\mathbf{y} \cdot \mathbf{z}) + (\mathbf{z} \cdot \mathbf{x})(\mathbf{z} \cdot \mathbf{y}) }{ (\mathbf{x} \cdot \mathbf{x})(\mathbf{y} \cdot \mathbf{y})(\mathbf{z} \cdot \mathbf{z}) }.

The right-hand side is unchanged by permuting x\mathbf{x}, y\mathbf{y}, z\mathbf{z}, so the same value is obtained from each of the three vertices of the triangle. Since each interior angle lies in (0,π)(0,\pi), its sine is positive, and therefore the ratio sinθ/L\sin\theta/L itself is the same at all three vertices. That is exactly the Law of Sines.

Orthogonal Projection

Resolving a vector into a component along a reference direction and a component perpendicular to it is another theorem that needs no new proof: the argument in Lesson 1PM depended only on bilinearity and positive definiteness. In space, the decomposition still takes place inside a plane, namely the plane spanned by the two vectors involved.

Theorem 37 (Orthogonal Projection in Space)

Let aR3\mathbf{a} \in \mathbb{R}^3 be a non-zero vector. For every bR3\mathbf{b} \in \mathbb{R}^3 there is a unique decomposition

b=ta+cwithca,\mathbf{b} = t\mathbf{a} + \mathbf{c} \qquad \text{with} \qquad \mathbf{c} \perp \mathbf{a},

in which the scalar is

t=baaa.t = \frac{\mathbf{b} \cdot \mathbf{a}}{\mathbf{a} \cdot \mathbf{a}}.

The vector projab=ta\mathrm{proj}_{\mathbf{a}}\mathbf{b} = t\mathbf{a} is called the orthogonal projection of b\mathbf{b} onto a\mathbf{a}.

Proof

Write c=bta\mathbf{c} = \mathbf{b} - t\mathbf{a} and impose ca=0\mathbf{c} \cdot \mathbf{a} = 0. This yields bat(aa)=0\mathbf{b} \cdot \mathbf{a} - t(\mathbf{a} \cdot \mathbf{a}) = 0. Since a0\mathbf{a} \neq \mathbf{0}, positive definiteness gives aa>0\mathbf{a} \cdot \mathbf{a} > 0, so tt is forced to equal (ba)/(aa)(\mathbf{b} \cdot \mathbf{a})/(\mathbf{a} \cdot \mathbf{a}). With tt fixed, the vector c=bta\mathbf{c} = \mathbf{b} - t\mathbf{a} is determined, and uniqueness of the decomposition follows.

Orthogonal projection of b onto a inside the plane they span, with the orthogonal component c drawn from the tip of the projection to the tip of b

The triangle with vertices OO, projab\mathrm{proj}_{\mathbf{a}}\mathbf{b}, BB lies in the plane spanned by a\mathbf{a} and b\mathbf{b} and is right-angled at the tip of the projection. Its hypotenuse has length b\|\mathbf{b}\| and the leg along a\mathbf{a} has length

projab=baa.\|\mathrm{proj}_{\mathbf{a}}\mathbf{b}\| = \frac{|\mathbf{b} \cdot \mathbf{a}|}{\|\mathbf{a}\|}.

Reading the inequality hypotenuse \ge leg out of this triangle gives a purely geometric proof of Cauchy-Schwarz, this time without a discriminant in sight. The equality cases of the metric inequalities now fall out immediately.

Corollary 2 (Equality Conditions)

Let a,bR3\mathbf{a}, \mathbf{b} \in \mathbb{R}^3.

  1. ab=ab|\mathbf{a} \cdot \mathbf{b}| = \|\mathbf{a}\|\,\|\mathbf{b}\| if and only if a\mathbf{a} and b\mathbf{b} are linearly dependent.
  2. a+b=a+b\|\mathbf{a} + \mathbf{b}\| = \|\mathbf{a}\| + \|\mathbf{b}\| if and only if one of the vectors is a non-negative scalar multiple of the other.
Proof

For (i), if a\mathbf{a} and b\mathbf{b} are dependent then either one of them is the zero vector, in which case both sides are zero, or b=ka\mathbf{b} = k\mathbf{a} for some scalar kk, in which case both sides equal ka2|k|\,\|\mathbf{a}\|^2.

Conversely, suppose

ab=ab.|\mathbf{a} \cdot \mathbf{b}| = \|\mathbf{a}\|\,\|\mathbf{b}\|.

If a=0\mathbf{a} = \mathbf{0} or b=0\mathbf{b} = \mathbf{0} then the vectors are trivially dependent, so assume both are non-zero. By the projection theorem,

b=ta+c,ca.\mathbf{b} = t\mathbf{a} + \mathbf{c}, \qquad \mathbf{c} \perp \mathbf{a}.

Then

ab=t(aa)=ta2,\mathbf{a} \cdot \mathbf{b} = t(\mathbf{a} \cdot \mathbf{a}) = t\|\mathbf{a}\|^2,

so the equality hypothesis gives

ta=b.|t|\,\|\mathbf{a}\| = \|\mathbf{b}\|.

Squaring,

t2a2=b2.t^2\|\mathbf{a}\|^2 = \|\mathbf{b}\|^2.

But orthogonality gives

b2=ta+c2=t2a2+c2,\|\mathbf{b}\|^2 = \|t\mathbf{a} + \mathbf{c}\|^2 = t^2\|\mathbf{a}\|^2 + \|\mathbf{c}\|^2,

so c2=0\|\mathbf{c}\|^2 = 0. Hence c=0\mathbf{c} = \mathbf{0} and therefore b=ta\mathbf{b} = t\mathbf{a}, proving dependence.

For (ii), square the supposed equality a+b=a+b\|\mathbf{a} + \mathbf{b}\| = \|\mathbf{a}\| + \|\mathbf{b}\| to obtain

a2+b2+2(ab)=a2+b2+2ab,\|\mathbf{a}\|^2 + \|\mathbf{b}\|^2 + 2(\mathbf{a} \cdot \mathbf{b}) = \|\mathbf{a}\|^2 + \|\mathbf{b}\|^2 + 2\|\mathbf{a}\|\,\|\mathbf{b}\|,

which reduces to ab=ab\mathbf{a} \cdot \mathbf{b} = \|\mathbf{a}\|\,\|\mathbf{b}\|. This is simultaneously the Cauchy-Schwarz equality (hence dependence, by (i)) and the statement that ab\mathbf{a} \cdot \mathbf{b} is non-negative, which for dependent vectors means one is a non-negative multiple of the other.

Example 38 (Projection in Space)

Let a=[122]\mathbf{a} = \begin{bmatrix} 1 \\ 2 \\ 2 \end{bmatrix} and b=[304]\mathbf{b} = \begin{bmatrix} 3 \\ 0 \\ 4 \end{bmatrix}. Then ab=3+0+8=11\mathbf{a} \cdot \mathbf{b} = 3 + 0 + 8 = 11 and aa=1+4+4=9\mathbf{a} \cdot \mathbf{a} = 1 + 4 + 4 = 9, so

projab=119[122]=[11/922/922/9].\mathrm{proj}_{\mathbf{a}}\mathbf{b} = \tfrac{11}{9}\begin{bmatrix} 1 \\ 2 \\ 2 \end{bmatrix} = \begin{bmatrix} 11/9 \\ 22/9 \\ 22/9 \end{bmatrix}.

The orthogonal component is

c=bprojab=[16/922/914/9],\mathbf{c} = \mathbf{b} - \mathrm{proj}_{\mathbf{a}}\mathbf{b} = \begin{bmatrix} 16/9 \\ -22/9 \\ 14/9 \end{bmatrix},

and the check ca=16/944/9+28/9=0\mathbf{c} \cdot \mathbf{a} = 16/9 - 44/9 + 28/9 = 0 confirms that the decomposition is correct.

Problem 23

Let a=[212]\mathbf{a} = \begin{bmatrix} 2 \\ -1 \\ 2 \end{bmatrix} and b=[431]\mathbf{b} = \begin{bmatrix} 4 \\ 3 \\ 1 \end{bmatrix}. Compute the angle between a\mathbf{a} and b\mathbf{b}, the orthogonal projection of b\mathbf{b} onto a\mathbf{a}, and the length of the resulting orthogonal component. Verify directly that the component you produce is perpendicular to a\mathbf{a}.

Planes in Space

The orthogonal projection just developed turns the dot product from an angle-measuring device into a way of writing down loci. A plane in R3\mathbb{R}^3 is the simplest such locus: fix a direction, and demand that every displacement from a chosen point be orthogonal to it. Unpacking that single sentence delivers the whole elementary theory of the plane as a set of points, and its results will recover in space everything Lesson 1PM established for the line in the plane.

Begin with the homogeneous linear equation

a1x+a2y+a3z=0,a_1 x + a_2 y + a_3 z = 0,

which the dot product collapses to ax=0\mathbf{a} \cdot \mathbf{x} = 0, where

a=[a1a2a3],x=[xyz].\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \\ a_3 \end{bmatrix}, \qquad \mathbf{x} = \begin{bmatrix} x \\ y \\ z \end{bmatrix}.

This is the set of position vectors orthogonal to the fixed vector a\mathbf{a}, which is precisely the plane through the origin with a\mathbf{a} as its normal. A general plane need not pass through the origin, but the inhomogeneous form

a1x+a2y+a3z+c=0a_1 x + a_2 y + a_3 z + c = 0

handles that case with no structural change. If QQ is any point on the plane, with position vector q\mathbf{q}, then aq+c=0\mathbf{a} \cdot \mathbf{q} + c = 0, so c=aqc = -\mathbf{a} \cdot \mathbf{q}, and substituting back rewrites the defining equation as

a(xq)=0.\mathbf{a} \cdot (\mathbf{x} - \mathbf{q}) = 0.

Read geometrically, the plane is the set of points XX whose displacement QX\overrightarrow{QX} is orthogonal to a\mathbf{a}. Every choice of QQ produces the same locus; only the algebraic label cc shifts.

Theorem 38 (Equation of a Plane)

A linear equation

a1x+a2y+a3z+c=0a_1 x + a_2 y + a_3 z + c = 0

with (a1,a2,a3)(0,0,0)(a_1, a_2, a_3) \neq (0, 0, 0) defines a plane in R3\mathbb{R}^3, and the vector

a=[a1a2a3]\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \\ a_3 \end{bmatrix}

is a normal vector to that plane.

Problem 24

Let

n=[123].\mathbf{n} = \begin{bmatrix} 1 \\ 2 \\ 3 \end{bmatrix}.

Find a non-zero vector perpendicular to n\mathbf{n}. Then show that the vectors perpendicular to n\mathbf{n} are exactly those whose coordinates satisfy

x+2y+3z=0.x + 2y + 3z = 0.

Finally, find a non-zero vector perpendicular to the plane

3x+4y+5z=0.3x + 4y + 5z = 0.
Example 39 (Intercept Form)

A plane that meets the coordinate axes at A=(a,0,0)A = (a, 0, 0), B=(0,b,0)B = (0, b, 0), and C=(0,0,c)C = (0, 0, c), with aa, bb, cc all non-zero, is described by the compact equation

xa+yb+zc=1.\frac{x}{a} + \frac{y}{b} + \frac{z}{c} = 1.

Each intercept satisfies this by inspection: at AA, the left-hand side is a/a+0+0=1a/a + 0 + 0 = 1, and the other two are analogous. Clearing denominators,

bcx+acy+abzabc=0,bc\,x + ac\,y + ab\,z - abc = 0,

so by the previous theorem the vector

n=[bcacab]\mathbf{n} = \begin{bmatrix} bc \\ ac \\ ab \end{bmatrix}

is a normal to the plane.

Distance from a Point to a Plane

The point-line distance problem in the plane was solved in Lesson 1PM by projecting a displacement onto the normal of the line. The same strategy works in space, and this is the first place the orthogonal projection theorem earns its keep against a problem with no honest two-dimensional version.

Theorem 39 (Distance to a Plane)

Let EE be the plane defined by ax+c=0\mathbf{a} \cdot \mathbf{x} + c = 0, and let PP be a point with position vector p\mathbf{p}. The perpendicular distance from PP to EE is

d(P,E)=ap+ca.d(P, E) = \frac{|\mathbf{a} \cdot \mathbf{p} + c|}{\|\mathbf{a}\|}.
Proof

Let QQ be any point on EE and let RR be the foot of the perpendicular from PP to EE. The displacement RP\overrightarrow{RP} is parallel to a\mathbf{a}, since both are orthogonal to every direction inside the plane, so RP\overrightarrow{RP} is exactly the orthogonal projection of QP=pq\overrightarrow{QP} = \mathbf{p} - \mathbf{q} onto a\mathbf{a}:

RP=proja(pq)=(pq)aaaa.\overrightarrow{RP} = \mathrm{proj}_{\mathbf{a}}(\mathbf{p} - \mathbf{q}) = \frac{(\mathbf{p} - \mathbf{q}) \cdot \mathbf{a}}{\mathbf{a} \cdot \mathbf{a}}\,\mathbf{a}.

Taking magnitudes,

d=RP=(pq)aa=apaqa.d = \|\overrightarrow{RP}\| = \frac{|(\mathbf{p} - \mathbf{q}) \cdot \mathbf{a}|}{\|\mathbf{a}\|} = \frac{|\mathbf{a} \cdot \mathbf{p} - \mathbf{a} \cdot \mathbf{q}|}{\|\mathbf{a}\|}.

Since QQ lies on EE, aq=c\mathbf{a} \cdot \mathbf{q} = -c, and the stated formula follows.

A plane E with a point P above it, its foot R on E, an arbitrary point Q on E, the displacement p minus q from Q to P, the normal vector a at P, and the perpendicular distance d from R to P

Dividing the defining equation through by a\|\mathbf{a}\| recasts the plane into its normal form

n^x+k=0,n^=1,\hat{\mathbf{n}} \cdot \mathbf{x} + k = 0, \qquad \|\hat{\mathbf{n}}\| = 1,

and in this form the distance formula collapses to d(P,E)=f(p)d(P, E) = |f(\mathbf{p})|, where ff denotes the linear polynomial on the left. The value of ff at any point is literally its signed distance from the plane, which is a small but extremely useful fact: wherever we have a plane written in normal form, the function ff doubles as a ruler.

Example 40 (Ratio of Division by a Plane)

Let f(x)=ax+cf(\mathbf{x}) = \mathbf{a} \cdot \mathbf{x} + c define a plane EE, and let PP, QQ be points on opposite sides of EE so that the segment PQ\overline{PQ} meets EE at a unique point RR. The position of RR on the segment, recorded through the directed ratio from Lesson 1PM, is

λ=dr(P,Q;R)=f(p)f(q).\lambda = \mathrm{dr}(P, Q;\, R) = -\frac{f(\mathbf{p})}{f(\mathbf{q})}.

To see this, observe that RR divides PQ\overline{PQ} in the ratio λ\lambda, so

r=p+λq1+λ.\mathbf{r} = \frac{\mathbf{p} + \lambda\,\mathbf{q}}{1 + \lambda}.

The condition f(r)=0f(\mathbf{r}) = 0 combined with the linearity of the dot product gives

ap+c+λ(aq+c)=0,i.e.f(p)+λf(q)=0,\mathbf{a} \cdot \mathbf{p} + c + \lambda(\mathbf{a} \cdot \mathbf{q} + c) = 0, \qquad \text{i.e.} \qquad f(\mathbf{p}) + \lambda\,f(\mathbf{q}) = 0,

from which λ=f(p)/f(q)\lambda = -f(\mathbf{p})/f(\mathbf{q}) is immediate. The signed values of ff at the endpoints therefore encode the division ratio directly, and this observation is the bridge we will need for sheaves of planes below.

Problem 25

Let EE be the plane

2xy+2z5=0.2x - y + 2z - 5 = 0.

(a) Find the distance from the point P=(1,1,2)P = (1,-1,2) to EE.

(b) Rewrite the equation of EE in normal form

n^x+k=0\hat{\mathbf{n}} \cdot \mathbf{x} + k = 0

with n^=1\|\hat{\mathbf{n}}\| = 1.

(c) Use the normal form to determine the signed distance of PP from EE.

Systems of Planes

A single plane is one linear condition. Two or more planes turn solid geometry into linear algebra in miniature: whether they meet, how they meet, and which further planes are generated by their intersection. The remainder of the lesson treats these questions with the tools already in hand.

Parallel Planes

Write two planes EE and FF as

f(x)=ax+c=0,g(x)=bx+d=0,f(\mathbf{x}) = \mathbf{a} \cdot \mathbf{x} + c = 0, \qquad g(\mathbf{x}) = \mathbf{b} \cdot \mathbf{x} + d = 0,

with respective normals a\mathbf{a} and b\mathbf{b}. The planes are parallel precisely when their normals are collinear, i.e. b=ra\mathbf{b} = r\mathbf{a} for some non-zero scalar rr. Dividing the equation for FF through by rr replaces its normal by a\mathbf{a} without altering the locus, so no information is lost by assuming from the outset that two parallel planes are presented with a common normal vector.

Theorem 40 (Distance Between Parallel Planes)

The distance between two parallel planes ax+c=0\mathbf{a} \cdot \mathbf{x} + c = 0 and ax+d=0\mathbf{a} \cdot \mathbf{x} + d = 0 is

dist(E,F)=cda.\mathrm{dist}(E, F) = \frac{|c - d|}{\|\mathbf{a}\|}.
Proof

Pick any point PP on EE. Then ap+c=0\mathbf{a} \cdot \mathbf{p} + c = 0, so ap=c\mathbf{a} \cdot \mathbf{p} = -c. The Distance to a Plane theorem applied to PP and FF gives

dist(P,F)=ap+da=c+da=cda.\mathrm{dist}(P, F) = \frac{|\mathbf{a} \cdot \mathbf{p} + d|}{\|\mathbf{a}\|} = \frac{|-c + d|}{\|\mathbf{a}\|} = \frac{|c - d|}{\|\mathbf{a}\|}.

Because PP was arbitrary and every point of EE is the same perpendicular distance from FF, this common value is dist(E,F)\mathrm{dist}(E, F).

Problem 26

Find the distance between the parallel planes

x2y+2z+1=0x - 2y + 2z + 1 = 0

and

2x4y+4z7=0.2x - 4y + 4z - 7 = 0.

First rewrite them with a common normal vector, and then apply the parallel-plane distance formula.

Intersecting Planes and Sheaves

Two planes that are not parallel share an entire line. Their normals a\mathbf{a} and b\mathbf{b} are linearly independent and together span the plane orthogonal to the intersection line LL, while the direction of LL itself is the unique space direction (up to sign) orthogonal to both. The natural numerical quantity attached to such a pair is the angle between them, defined through their normals:

cosθ=abab,θ[0,π/2].\cos\theta = \frac{|\mathbf{a} \cdot \mathbf{b}|}{\|\mathbf{a}\|\,\|\mathbf{b}\|}, \qquad \theta \in [0, \pi/2].

The absolute value pins the result to the dihedral angle along LL, ignoring the irrelevant orientation of the two normals.

The more interesting question is algebraic: which linear equations describe the remaining planes that contain LL? The answer is remarkably clean, and hinges on the observation that any linear combination of ff and gg still vanishes on LL.

Definition 33 (Sheaf of Planes)

Let EE and FF be two distinct planes meeting in a line LL. The sheaf (or pencil) of planes through LL is the set of all planes in R3\mathbb{R}^3 that contain LL.

Theorem 41 (Equation of a Sheaf)

Let f(x)=0f(\mathbf{x}) = 0 and g(x)=0g(\mathbf{x}) = 0 define two distinct planes meeting in the line LL. A plane HH belongs to the sheaf through LL if and only if it can be written as

rf(x)+sg(x)=0r\,f(\mathbf{x}) + s\,g(\mathbf{x}) = 0

for some pair of scalars (r,s)(0,0)(r, s) \neq (0, 0).

Proof

Suppose first that HH is given by rf(x)+sg(x)=0r\,f(\mathbf{x}) + s\,g(\mathbf{x}) = 0 for some (r,s)(0,0)(r, s) \neq (0, 0). Every point of LL satisfies f=0f = 0 and g=0g = 0 simultaneously, hence it also satisfies the combined equation, so LHL \subset H and HH lies in the sheaf.

Conversely, let HH be an arbitrary plane through LL, with normal e\mathbf{e} and constant term kk. The vector e\mathbf{e} is orthogonal to the direction of LL, so it lies in the two-dimensional subspace spanned by a\mathbf{a} and b\mathbf{b}; there are therefore scalars rr and ss with e=ra+sb\mathbf{e} = r\mathbf{a} + s\mathbf{b}. The linear form rf+sgr\,f + s\,g has leading part ex\mathbf{e} \cdot \mathbf{x} and vanishes on every point of LL, while h(x)=ex+kh(\mathbf{x}) = \mathbf{e} \cdot \mathbf{x} + k also vanishes on LL by construction. Two linear forms with the same leading part that agree on any single point are identical, so h=rf+sgh = r\,f + s\,g, and HH has the stated form.

The sheaf equation is, at bottom, the observation that containing LL is itself a linear condition on the space of linear equations. Its real utility is that it converts statements about the geometry of the sheaf into statements about pairs (r,s)(r, s), and in doing so exposes an invariant that is not at all apparent from the geometry alone. For that invariant we need one more classical notion, which is new to this course but built entirely out of quantities we have already met.

Definition 34 (Cross-Ratio of Four Collinear Points)

Let A1,A2,A3,A4A_1, A_2, A_3, A_4 be four distinct collinear points. Their cross-ratio is the quotient of directed ratios

cr(A1,A2;A3,A4)=dr(A1,A2;A3)dr(A1,A2;A4),\mathrm{cr}(A_1, A_2;\, A_3, A_4) = \frac{\mathrm{dr}(A_1, A_2;\, A_3)}{\mathrm{dr}(A_1, A_2;\, A_4)},

where dr\mathrm{dr} is the directed ratio along the common line from Lesson 1PM.

Theorem 42 (Cross-Ratio Invariance)

Let E1,E2,E3,E4E_1, E_2, E_3, E_4 be four distinct planes in a common sheaf with common line LL, and let \ell be a transversal line that meets each EiE_i in a unique point AiA_i, does not lie in any of the four planes, and does not meet LL. The cross-ratio cr(A1,A2;A3,A4)\mathrm{cr}(A_1, A_2;\, A_3, A_4) depends only on the four planes, not on the transversal line \ell.

Proof

Pick two planes of the sheaf defined by linear forms ff and gg, with the extra choice that the plane g(x)=0g(\mathbf{x}) = 0 is not one of E1,E2,E3,E4E_1, E_2, E_3, E_4. This is possible because the sheaf contains infinitely many planes. Then every EiE_i can be written in the affine chart

f(x)+tig(x)=0,i=1,2,3,4,f(\mathbf{x}) + t_i\,g(\mathbf{x}) = 0, \qquad i = 1, 2, 3, 4,

for finite scalars t1,t2,t3,t4t_1, t_2, t_3, t_4. Let \ell meet the four planes at A1,A2,A3,A4A_1, A_2, A_3, A_4 with position vectors a1,a2,a3,a4\mathbf{a}_1, \mathbf{a}_2, \mathbf{a}_3, \mathbf{a}_4. Since \ell does not meet LL, none of the points AiA_i lies on the auxiliary plane g(x)=0g(\mathbf{x}) = 0, so the values g(ai)g(\mathbf{a}_i) are all non-zero.

The plane E3E_3 is defined by the linear polynomial F3(x)=f(x)+t3g(x)F_3(\mathbf{x}) = f(\mathbf{x}) + t_3\,g(\mathbf{x}), and the Ratio of Division computation from the previous section, applied along the transversal line \ell, yields

dr(A1,A2;A3)=F3(a1)F3(a2)=f(a1)+t3g(a1)f(a2)+t3g(a2).\mathrm{dr}(A_1, A_2;\, A_3) = -\frac{F_3(\mathbf{a}_1)}{F_3(\mathbf{a}_2)} = -\frac{f(\mathbf{a}_1) + t_3\,g(\mathbf{a}_1)}{f(\mathbf{a}_2) + t_3\,g(\mathbf{a}_2)}.

Since A1A_1 lies on E1E_1 we have f(a1)+t1g(a1)=0f(\mathbf{a}_1) + t_1\,g(\mathbf{a}_1) = 0, so f(a1)=t1g(a1)f(\mathbf{a}_1) = -t_1\,g(\mathbf{a}_1), and similarly f(a2)=t2g(a2)f(\mathbf{a}_2) = -t_2\,g(\mathbf{a}_2). Substituting and cancelling the common factor of gg in numerator and denominator,

dr(A1,A2;A3)=(t1+t3)g(a1)(t2+t3)g(a2)=g(a1)g(a2)t3t1t3t2.\mathrm{dr}(A_1, A_2;\, A_3) = -\frac{(-t_1 + t_3)\,g(\mathbf{a}_1)}{(-t_2 + t_3)\,g(\mathbf{a}_2)} = -\frac{g(\mathbf{a}_1)}{g(\mathbf{a}_2)}\cdot\frac{t_3 - t_1}{t_3 - t_2}.

The same calculation with t3t_3 replaced by t4t_4 gives the directed ratio in which A4A_4 divides A1A2\overline{A_1 A_2}. Forming the cross-ratio as the quotient of these two directed ratios, the prefactor g(a1)/g(a2)g(\mathbf{a}_1)/g(\mathbf{a}_2) (which is the only place the transversal \ell enters the formula) cancels, leaving

cr(A1,A2;A3,A4)=(t3t1)(t4t2)(t3t2)(t4t1).\mathrm{cr}(A_1, A_2;\, A_3, A_4) = \frac{(t_3 - t_1)(t_4 - t_2)}{(t_3 - t_2)(t_4 - t_1)}.

The right-hand side is built entirely from the parameters tit_i of the four planes in the sheaf and contains no trace of \ell, so the cross-ratio is independent of the transversal.

Problem 27

Suppose four planes in a common sheaf are written in the affine chart

f(x)+tig(x)=0,t1=0, t2=1, t3=2, t4=5.f(\mathbf{x}) + t_i\,g(\mathbf{x}) = 0, \qquad t_1 = 0,\ t_2 = 1,\ t_3 = 2,\ t_4 = 5.

(a) Compute the cross-ratio

cr(A1,A2;A3,A4)\mathrm{cr}(A_1, A_2;\, A_3, A_4)

on any transversal line meeting the four planes.

(b) Explain why your answer is independent of the chosen transversal.