Mascot image.
#Math#Vectors#Geometry

Vectors and Plane Geometry

We have now developed sufficient algebraic machinery to revisit plane geometry through the lens of vectors. Rather than reformulating the entirety of Euclidean geometry, we establish a dictionary between the geometric properties of the plane and the algebraic structure of the vector space R2\mathbb{R}^2. This correspondence allows us to prove geometric theorems with algebraic rigour.

Position and Displacement Vectors

Definition 10 (Position Vector)

Let O=(0,0)O = (0,0) be the origin of the Cartesian plane. For any point A=(a1,a2)A = (a_1, a_2), the vector a=[a1a2]\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \end{bmatrix} represented by the directed segment OA\overrightarrow{OA} is called the position vector of AA.

Through this convention, every point in the plane is uniquely associated with a vector. The position vector of a point is simply the vector whose components are the coordinates of that point. While position vectors are “bound” to the origin, we also need vectors between arbitrary points.

Definition 11 (Displacement Vector)

Let AA and BB be distinct points with position vectors a\mathbf{a} and b\mathbf{b} respectively. The directed segment AB\overrightarrow{AB} is represented by the vector

AB=ba.\overrightarrow{AB} = \mathbf{b} - \mathbf{a}.

This vector is called the displacement vector from AA to BB.

Remark

This definition is consistent with the triangle rule for vector addition. The chain OA+AB=OB\overrightarrow{OA} + \overrightarrow{AB} = \overrightarrow{OB} implies a+AB=b\mathbf{a} + \overrightarrow{AB} = \mathbf{b}, so AB=ba\overrightarrow{AB} = \mathbf{b} - \mathbf{a}. Observe that reversing direction negates the vector: BA=ab=AB\overrightarrow{BA} = \mathbf{a} - \mathbf{b} = -\overrightarrow{AB}.

Position vectors a and b from the origin to points A and B, and the displacement vector b minus a from A to B

With position and displacement vectors in hand, fundamental geometric properties translate directly into vector notation.

  • Distance: The length of the segment ABAB is the magnitude of the displacement vector: d(A,B)=bad(A,B) = |\mathbf{b} - \mathbf{a}|.
  • Collinearity: Three distinct points A,B,CA, B, C are collinear if and only if the displacement vectors AB\overrightarrow{AB} and AC\overrightarrow{AC} are linearly dependent (Definition 9). That is, ba=k(ca)\mathbf{b}-\mathbf{a} = k(\mathbf{c}-\mathbf{a}) for some scalar kk.
  • Parallelograms: A quadrilateral ABCDABCD is a parallelogram if and only if AB=DC\overrightarrow{AB} = \overrightarrow{DC}, which in terms of position vectors reads ba=cd\mathbf{b} - \mathbf{a} = \mathbf{c} - \mathbf{d}.
Example 7

Let A=(1,3)A = (1, 3), B=(4,1)B = (4, 1), and C=(7,1)C = (7, -1). The displacement vectors are:

AB=[32],AC=[64]=2[32]=2AB.\overrightarrow{AB} = \begin{bmatrix} 3 \\ -2 \end{bmatrix}, \qquad \overrightarrow{AC} = \begin{bmatrix} 6 \\ -4 \end{bmatrix} = 2\begin{bmatrix} 3 \\ -2 \end{bmatrix} = 2\,\overrightarrow{AB}.

Since AC\overrightarrow{AC} is a scalar multiple of AB\overrightarrow{AB}, the three points are collinear. The distance from AA to BB is AB=9+4=13|\overrightarrow{AB}| = \sqrt{9 + 4} = \sqrt{13}.

Example 8

Let P=(2,1)P = (2, 1), Q=(5,3)Q = (5, 3), and R=(3,4)R = (3, 4). Are these three points collinear?

PQ=[32],PR=[13].\overrightarrow{PQ} = \begin{bmatrix} 3 \\ 2 \end{bmatrix}, \qquad \overrightarrow{PR} = \begin{bmatrix} 1 \\ 3 \end{bmatrix}.

If PR=kPQ\overrightarrow{PR} = k\,\overrightarrow{PQ}, then 1=3k1 = 3k and 3=2k3 = 2k, giving k=1/3k = 1/3 and k=3/2k = 3/2 respectively. These are inconsistent, so no such kk exists. Alternatively, the determinant (Theorem 5) is 3321=703 \cdot 3 - 2 \cdot 1 = 7 \neq 0. The three points are not collinear; they form a triangle.

The Midpoint Formula

Theorem 9 (Midpoint Formula)

Let AA and BB be points with position vectors a\mathbf{a} and b\mathbf{b}. The midpoint MM of the segment AB\overline{AB} has the position vector

m=12(a+b).\mathbf{m} = \frac{1}{2}(\mathbf{a} + \mathbf{b}).
Proof

Let MM be the midpoint of AB\overline{AB}. By definition, MM divides the segment into two equal parts, so the displacement from AA to MM equals the displacement from MM to BB:

AM=MB.\overrightarrow{AM} = \overrightarrow{MB}.

In terms of position vectors, ma=bm\mathbf{m} - \mathbf{a} = \mathbf{b} - \mathbf{m}. Adding m\mathbf{m} to both sides gives 2m=a+b2\mathbf{m} = \mathbf{a} + \mathbf{b}, whence m=12(a+b)\mathbf{m} = \frac{1}{2}(\mathbf{a} + \mathbf{b}).

The midpoint is therefore the average of the two position vectors.

The position vector of the midpoint M is the average of a and b
Example 9

For A=(1,3)A = (1, 3) and B=(4,1)B = (4, 1), the midpoint is

M=12([13]+[41])=12[54]=(52,  2).M = \frac{1}{2}\left(\begin{bmatrix} 1 \\ 3 \end{bmatrix} + \begin{bmatrix} 4 \\ 1 \end{bmatrix}\right) = \frac{1}{2}\begin{bmatrix} 5 \\ 4 \end{bmatrix} = \left(\frac{5}{2},\; 2\right).
Problem 6

Let A=(3,1)A = (3, -1) and M=(5,2)M = (5, 2), where MM is the midpoint of AB\overline{AB}. Determine the coordinates of BB.

Parallelogram Diagonals

Using the midpoint formula, we can prove a classical result about parallelograms with almost no computation.

Theorem 10 (Parallelogram Diagonals)

The diagonals of a parallelogram bisect each other.

Proof

Let ABCDABCD be a parallelogram with position vectors a,b,c,d\mathbf{a}, \mathbf{b}, \mathbf{c}, \mathbf{d}. Since ABCDABCD is a parallelogram, opposite sides are equal and parallel: AB=DC\overrightarrow{AB} = \overrightarrow{DC}. In terms of position vectors, ba=cd\mathbf{b} - \mathbf{a} = \mathbf{c} - \mathbf{d}, which rearranges to

a+c=b+d.\mathbf{a} + \mathbf{c} = \mathbf{b} + \mathbf{d}.

The midpoint of diagonal ACAC has position vector m1=12(a+c)\mathbf{m}_1 = \frac{1}{2}(\mathbf{a} + \mathbf{c}), and the midpoint of diagonal BDBD has position vector m2=12(b+d)\mathbf{m}_2 = \frac{1}{2}(\mathbf{b} + \mathbf{d}). Since a+c=b+d\mathbf{a} + \mathbf{c} = \mathbf{b} + \mathbf{d}, it follows that m1=m2\mathbf{m}_1 = \mathbf{m}_2. The midpoints of the diagonals coincide, so the diagonals bisect each other.

Example 10

Let A=(0,0)A = (0,0), B=(3,1)B = (3,1), D=(1,2)D = (1,2). For ABCDABCD to be a parallelogram, we need AB=DC\overrightarrow{AB} = \overrightarrow{DC}, so C=B+DA=(4,3)C = B + D - A = (4, 3). The midpoint of ACAC is 12(0+4,  0+3)=(2,  3/2)\frac{1}{2}(0+4,\;0+3) = (2,\; 3/2). The midpoint of BDBD is 12(3+1,  1+2)=(2,  3/2)\frac{1}{2}(3+1,\;1+2) = (2,\; 3/2). The diagonals bisect each other, as the theorem guarantees.

Problem 7

A quadrilateral ABCDABCD has vertices A=(1,0)A = (1, 0), B=(4,1)B = (4, 1), C=(5,4)C = (5, 4), D=(2,3)D = (2, 3). Verify that ABCDABCD is a parallelogram by checking AB=DC\overrightarrow{AB} = \overrightarrow{DC}. Then confirm that the midpoints of the diagonals coincide.

The Parametric Equation of a Line

We established above that a point XX lies on the line passing through AA and BB if and only if the vectors AX\overrightarrow{AX} and AB\overrightarrow{AB} are linearly dependent. The parametric equation makes this condition explicit by expressing every point on the line in terms of a single real parameter.

Theorem 11 (Parametric Line Equation)

Let AA and BB be distinct points with position vectors a\mathbf{a} and b\mathbf{b}. A point XX lies on the line LABL_{AB} if and only if its position vector x\mathbf{x} satisfies

x=(1t)a+tb\mathbf{x} = (1-t)\,\mathbf{a} + t\,\mathbf{b}

for some scalar tRt \in \mathbb{R}.

Proof

Suppose XX lies on LABL_{AB}. Then AX\overrightarrow{AX} is parallel to AB\overrightarrow{AB}, so AX=tAB\overrightarrow{AX} = t\,\overrightarrow{AB} for some scalar tt. In terms of position vectors:

xa=t(ba).\mathbf{x} - \mathbf{a} = t(\mathbf{b} - \mathbf{a}).

Rearranging: x=a+tbta=(1t)a+tb\mathbf{x} = \mathbf{a} + t\mathbf{b} - t\mathbf{a} = (1-t)\mathbf{a} + t\mathbf{b}.

Conversely, if x=(1t)a+tb\mathbf{x} = (1-t)\mathbf{a} + t\mathbf{b}, then xa=t(ba)\mathbf{x} - \mathbf{a} = t(\mathbf{b} - \mathbf{a}), so AX=tAB\overrightarrow{AX} = t\,\overrightarrow{AB}, which means AX\overrightarrow{AX} and AB\overrightarrow{AB} are linearly dependent and XX lies on LABL_{AB}.

The scalar tt is the parameter. The position of XX relative to AA and BB is determined entirely by tt:

  • t=0t = 0 gives x=a\mathbf{x} = \mathbf{a}, so X=AX = A.
  • t=1t = 1 gives x=b\mathbf{x} = \mathbf{b}, so X=BX = B.
  • 0<t<10 < t < 1 places XX strictly between AA and BB, on the segment AB\overline{AB}.
  • t>1t > 1 places XX beyond BB, so that BB lies between AA and XX.
  • t<0t < 0 places XX beyond AA in the opposite direction, so that AA lies between XX and BB.
Points on the line through A and B determined by the parameter t
Remark

The midpoint formula (Theorem 9) is a special case of the parametric line equation with t=1/2t = 1/2:

x=12a+12b=12(a+b).\mathbf{x} = \tfrac{1}{2}\mathbf{a} + \tfrac{1}{2}\mathbf{b} = \tfrac{1}{2}(\mathbf{a} + \mathbf{b}).
Example 11

Let A=(1,2)A = (1, 2) and B=(4,5)B = (4, 5). The point on LABL_{AB} with parameter t=2/3t = 2/3 is

x=13[12]+23[45]=[1/3+8/32/3+10/3]=[34].\mathbf{x} = \frac{1}{3}\begin{bmatrix} 1 \\ 2 \end{bmatrix} + \frac{2}{3}\begin{bmatrix} 4 \\ 5 \end{bmatrix} = \begin{bmatrix} 1/3 + 8/3 \\ 2/3 + 10/3 \end{bmatrix} = \begin{bmatrix} 3 \\ 4 \end{bmatrix}.

Since 0<2/3<10 < 2/3 < 1, this point lies on the segment AB\overline{AB}, two-thirds of the way from AA to BB.

Problem 8

Let A=(2,1)A = (2, -1) and B=(6,7)B = (6, 7). Find the parameter tt such that the point XX on LABL_{AB} has coordinates (5,5)(5, 5). Verify that this value of tt lies in the interval (0,1)(0, 1), confirming that XX is between AA and BB.

Concurrency and the Centroid

The parametric representation is particularly powerful for proving concurrency theorems: results asserting that several lines pass through a single common point.

Theorem 12 (Concurrency of Medians)

The three medians of a triangle are concurrent. Their common point is called the centroid.

Proof

Let ABCABC be a triangle with position vectors a,b,c\mathbf{a}, \mathbf{b}, \mathbf{c}. The midpoints of the sides BCBC, CACA, ABAB are, by Theorem 9:

d=12(b+c),e=12(c+a),f=12(a+b).\mathbf{d} = \frac{1}{2}(\mathbf{b}+\mathbf{c}), \qquad \mathbf{e} = \frac{1}{2}(\mathbf{c}+\mathbf{a}), \qquad \mathbf{f} = \frac{1}{2}(\mathbf{a}+\mathbf{b}).

Consider the median from AA to DD. By the parametric line equation (Theorem 11), any point on this median has position vector

x=(1t)a+td=(1t)a+t2(b+c).\mathbf{x} = (1-t)\,\mathbf{a} + t\,\mathbf{d} = (1-t)\,\mathbf{a} + \frac{t}{2}(\mathbf{b}+\mathbf{c}).

Choosing t=2/3t = 2/3 gives

g=13a+13b+13c=13(a+b+c).\mathbf{g} = \frac{1}{3}\mathbf{a} + \frac{1}{3}\mathbf{b} + \frac{1}{3}\mathbf{c} = \frac{1}{3}(\mathbf{a}+\mathbf{b}+\mathbf{c}).

The expression 13(a+b+c)\frac{1}{3}(\mathbf{a}+\mathbf{b}+\mathbf{c}) is completely symmetric in a\mathbf{a}, b\mathbf{b}, c\mathbf{c}. Repeating the calculation on the median from BB to EE with parameter t=2/3t = 2/3 from BB:

(123)b+23e=13b+13(c+a)=13(a+b+c)=g.(1-\tfrac{2}{3})\,\mathbf{b} + \tfrac{2}{3}\,\mathbf{e} = \frac{1}{3}\mathbf{b} + \frac{1}{3}(\mathbf{c}+\mathbf{a}) = \frac{1}{3}(\mathbf{a}+\mathbf{b}+\mathbf{c}) = \mathbf{g}.

The same holds for the median from CC to FF. Since g\mathbf{g} lies on all three medians, the medians are concurrent at GG, the centroid.

The medians of a triangle intersect at the centroid G, which divides each median in a 2:1 ratio
Remark

The parameter t=2/3t = 2/3 tells us that the centroid lies two-thirds of the way from each vertex to the opposite midpoint. Equivalently, the centroid divides each median in a 2:12:1 ratio, measured from vertex to midpoint.

Note (Symmetry of an Expression)

An expression in several variables is called symmetric if it is unchanged when the variables are permuted. The formula 13(a+b+c)\frac{1}{3}(\mathbf{a} + \mathbf{b} + \mathbf{c}) treats a\mathbf{a}, b\mathbf{b}, and c\mathbf{c} identically: swapping any two of them produces the same result. This means that any property we derive for one variable automatically holds for the others. In the proof above, once we showed that g\mathbf{g} lies on the median from AA, the symmetric form of g\mathbf{g} guaranteed the same conclusion for the medians from BB and CC without repeating the calculation. This reasoning pattern, known as argument by symmetry, appears frequently throughout mathematics: if the setup treats several objects identically, a conclusion established for one must hold for all.

Example 12

Let A=(0,0)A = (0, 0), B=(6,0)B = (6, 0), C=(3,6)C = (3, 6). The centroid is

G=13(0+6+3,  0+0+6)=(3,2).G = \frac{1}{3}(0+6+3,\; 0+0+6) = (3, 2).

We verify: the midpoint of BCBC is D=(9/2,3)D = (9/2, 3). On the median ADAD, the point with t=2/3t = 2/3 is

13(0,0)+23(9/2,3)=(3,2)=G.\frac{1}{3}(0,0) + \frac{2}{3}(9/2, 3) = (3, 2) = G. \quad \checkmark
Problem 9

Let A=(1,1)A = (1, 1), B=(5,3)B = (5, 3), C=(3,7)C = (3, 7). Compute the centroid GG and verify that GG divides the median from CC to the midpoint of ABAB in a 2:12:1 ratio.

Affine Dependence and Menelaus’ Theorem

To handle more advanced incidence theorems, we need a more flexible criterion for collinearity than linear dependence of displacement vectors. The following result characterises collinearity through a condition on position vectors directly.

Theorem 13 (Affine Dependence)

Three points X,Y,ZX, Y, Z with position vectors x,y,z\mathbf{x}, \mathbf{y}, \mathbf{z} are collinear if and only if there exist scalars u,v,wu, v, w, not all zero, such that

ux+vy+wz=0andu+v+w=0.u\mathbf{x} + v\mathbf{y} + w\mathbf{z} = \mathbf{0} \quad \text{and} \quad u + v + w = 0.
Proof

Suppose X,Y,ZX, Y, Z are collinear. Then ZZ lies on the line through XX and YY, so by the parametric line equation (Theorem 11), z=(1t)x+ty\mathbf{z} = (1-t)\mathbf{x} + t\mathbf{y} for some scalar tt. Rearranging:

(1t)x+tyz=0.(1-t)\mathbf{x} + t\mathbf{y} - \mathbf{z} = \mathbf{0}.

Setting u=1tu = 1-t, v=tv = t, w=1w = -1, we have ux+vy+wz=0u\mathbf{x} + v\mathbf{y} + w\mathbf{z} = \mathbf{0} and u+v+w=(1t)+t1=0u + v + w = (1-t) + t - 1 = 0. Since w=10w = -1 \neq 0, not all scalars are zero.

Conversely, suppose ux+vy+wz=0u\mathbf{x} + v\mathbf{y} + w\mathbf{z} = \mathbf{0} and u+v+w=0u + v + w = 0 with, say, w0w \neq 0. Then w=(u+v)w = -(u+v), so

ux+vy(u+v)z=0    (u+v)z=ux+vy.u\mathbf{x} + v\mathbf{y} - (u+v)\mathbf{z} = \mathbf{0} \implies (u+v)\mathbf{z} = u\mathbf{x} + v\mathbf{y}.

Since w0w \neq 0, we have u+v0u + v \neq 0, so we can divide:

z=uu+vx+vu+vy.\mathbf{z} = \frac{u}{u+v}\mathbf{x} + \frac{v}{u+v}\mathbf{y}.

Setting t=vu+vt = \frac{v}{u+v} gives 1t=uu+v1 - t = \frac{u}{u+v}, so z=(1t)x+ty\mathbf{z} = (1-t)\mathbf{x} + t\mathbf{y}. By Theorem 11, ZZ lies on the line through XX and YY.

We now apply this criterion to prove Menelaus’ theorem, one of the classical results in triangle geometry. We first need a notion that measures where a point falls on a line relative to two reference points.

Definition 12 (Directed Ratio)

Let AA and BB be distinct points. For any point ZZ on the line LABL_{AB} with ZBZ \neq B, the directed ratio of ZZ relative to A,BA, B, denoted dr(A,B;Z)\mathrm{dr}(A, B;\, Z), is the unique scalar rr such that

AZ=rZB.\overrightarrow{AZ} = r\,\overrightarrow{ZB}.
Remark

If ZZ lies between AA and BB, then AZ\overrightarrow{AZ} and ZB\overrightarrow{ZB} point in the same direction, so r>0r > 0. If ZZ lies outside the segment AB\overline{AB}, they point in opposite directions, so r<0r < 0. In terms of position vectors, za=r(bz)\mathbf{z} - \mathbf{a} = r(\mathbf{b} - \mathbf{z}), which rearranges to (1+r)z=a+rb(1+r)\mathbf{z} = \mathbf{a} + r\mathbf{b}.

Example 13

Let A=(0,0)A = (0, 0) and B=(6,0)B = (6, 0). The point Z=(2,0)Z = (2, 0) lies between AA and BB. We have AZ=[20]\overrightarrow{AZ} = \begin{bmatrix} 2 \\ 0 \end{bmatrix} and ZB=[40]\overrightarrow{ZB} = \begin{bmatrix} 4 \\ 0 \end{bmatrix}, so AZ=12ZB\overrightarrow{AZ} = \frac{1}{2}\overrightarrow{ZB} and dr(A,B;Z)=1/2>0\mathrm{dr}(A, B;\, Z) = 1/2 > 0. The point Z=(3,0)Z' = (-3, 0) lies outside AB\overline{AB}: AZ=[30]\overrightarrow{AZ'} = \begin{bmatrix} -3 \\ 0 \end{bmatrix} and ZB=[90]\overrightarrow{Z'B} = \begin{bmatrix} 9 \\ 0 \end{bmatrix}, so dr(A,B;Z)=1/3<0\mathrm{dr}(A, B;\, Z') = -1/3 < 0.

Theorem 14 (Menelaus' Theorem)

Let X,Y,ZX, Y, Z be points on the lines containing sides BCBC, CACA, ABAB of a triangle ABCABC, respectively. Then X,Y,ZX, Y, Z are collinear if and only if

dr(B,C;X)dr(C,A;Y)dr(A,B;Z)=1.\mathrm{dr}(B, C;\, X) \cdot \mathrm{dr}(C, A;\, Y) \cdot \mathrm{dr}(A, B;\, Z) = -1.
Proof

Let r=dr(B,C;X)r = \mathrm{dr}(B, C;\, X), s=dr(C,A;Y)s = \mathrm{dr}(C, A;\, Y), and t=dr(A,B;Z)t = \mathrm{dr}(A, B;\, Z). From the definition of directed ratio:

(1+r)x=b+rc,(1+s)y=c+sa,(1+t)z=a+tb.(1+r)\mathbf{x} = \mathbf{b} + r\mathbf{c}, \qquad (1+s)\mathbf{y} = \mathbf{c} + s\mathbf{a}, \qquad (1+t)\mathbf{z} = \mathbf{a} + t\mathbf{b}.

Assume rst=1rst = -1. We form a linear combination of x\mathbf{x}, y\mathbf{y}, z\mathbf{z} designed to satisfy the affine dependence criterion (Theorem 13). Consider

st(1+r)x+(1+s)ys(1+t)z.st(1+r)\,\mathbf{x} + (1+s)\,\mathbf{y} - s(1+t)\,\mathbf{z}.

Substituting the expressions above:

st(b+rc)+(c+sa)s(a+tb)st(\mathbf{b} + r\mathbf{c}) + (\mathbf{c} + s\mathbf{a}) - s(\mathbf{a} + t\mathbf{b})=stb+strc+c+sasastb=(str+1)c.= st\mathbf{b} + str\mathbf{c} + \mathbf{c} + s\mathbf{a} - s\mathbf{a} - st\mathbf{b} = (str + 1)\mathbf{c}.

Since rst=1rst = -1, we have str+1=0str + 1 = 0, so the entire combination equals 0\mathbf{0}.

We also check the sum of the coefficients:

st(1+r)+(1+s)s(1+t)=st+str+1+ssst=str+1=0.st(1+r) + (1+s) - s(1+t) = st + str + 1 + s - s - st = str + 1 = 0.

By Theorem 13, the points X,Y,ZX, Y, Z are collinear.

The converse follows by reversing the argument: if X,Y,ZX, Y, Z are collinear, Theorem 13 provides scalars u,v,wu, v, w with u+v+w=0u + v + w = 0 and ux+vy+wz=0u\mathbf{x} + v\mathbf{y} + w\mathbf{z} = \mathbf{0}. Expressing x,y,z\mathbf{x}, \mathbf{y}, \mathbf{z} in terms of a,b,c\mathbf{a}, \mathbf{b}, \mathbf{c} and comparing coefficients forces rst=1rst = -1.

Menelaus' theorem: points X, Y, Z on the sides of triangle ABC are collinear
Example 14

Consider the triangle with A=(0,0)A = (0, 0), B=(4,0)B = (4, 0), C=(3,3)C = (3, 3). Let the line \ell intersect side CACA at Y=(1.2,1.2)Y = (1.2,\, 1.2) and line ABAB at Z=(1.5,0)Z = (-1.5,\, 0). We compute the directed ratios and verify Menelaus’ condition.

For YY on CACA: CY=(1.8,1.8)\overrightarrow{CY} = (-1.8,\, -1.8) and YA=(1.2,1.2)\overrightarrow{YA} = (-1.2,\, -1.2), so s=dr(C,A;Y)=3/2s = \mathrm{dr}(C, A;\, Y) = 3/2.

For ZZ on ABAB: AZ=(1.5,0)\overrightarrow{AZ} = (-1.5,\, 0) and ZB=(5.5,0)\overrightarrow{ZB} = (5.5,\, 0), so t=dr(A,B;Z)=3/11t = \mathrm{dr}(A, B;\, Z) = -3/11.

Menelaus requires rst=1r \cdot s \cdot t = -1, giving r=1/(st)=1/((3/2)(3/11))=22/9r = -1/(st) = -1/((3/2)(-3/11)) = 22/9.

Problem 10

Let A=(0,0)A = (0, 0), B=(6,0)B = (6, 0), C=(2,4)C = (2, 4). The point XX lies on BCBC with dr(B,C;X)=1\mathrm{dr}(B, C;\, X) = 1 (i.e., XX is the midpoint of BCBC), and ZZ lies on line ABAB with dr(A,B;Z)=1/2\mathrm{dr}(A, B;\, Z) = -1/2. Use Menelaus’ theorem to find dr(C,A;Y)\mathrm{dr}(C, A;\, Y), and hence determine the coordinates of YY on line CACA.

The Dot Product

In the preceding sections, we employed vector algebra to investigate affine properties of plane geometry: parallelism, collinearity, and ratios of segments. These properties are independent of any specific unit of measurement. However, to discuss metric geometry (concepts involving lengths and angles), we require a stronger algebraic tool. This tool is the dot product.

Definition 13 (Dot Product)

Let a=[a1a2]\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \end{bmatrix} and b=[b1b2]\mathbf{b} = \begin{bmatrix} b_1 \\ b_2 \end{bmatrix} be vectors in R2\mathbb{R}^2. The dot product (also known as the scalar product or inner product) of a\mathbf{a} and b\mathbf{b} is the real number defined by:

ab=a1b1+a2b2.\mathbf{a} \cdot \mathbf{b} = a_1 b_1 + a_2 b_2.

It is crucial to observe that while the operation takes two vectors as input, the output is a scalar. By examining the definition alongside our earlier definition of magnitude (Definition 3), we immediately observe a fundamental link between the dot product and the magnitude of a vector:

aa=a12+a22=a2.\mathbf{a} \cdot \mathbf{a} = a_1^2 + a_2^2 = |\mathbf{a}|^2.

Thus, the magnitude of a vector is the square root of the dot product of the vector with itself: a=aa|\mathbf{a}| = \sqrt{\mathbf{a} \cdot \mathbf{a}}.

Example 15

Let a=[31]\mathbf{a} = \begin{bmatrix} 3 \\ -1 \end{bmatrix} and b=[24]\mathbf{b} = \begin{bmatrix} 2 \\ 4 \end{bmatrix}. Then ab=(3)(2)+(1)(4)=2\mathbf{a} \cdot \mathbf{b} = (3)(2) + (-1)(4) = 2. Also, aa=9+1=10=a2\mathbf{a} \cdot \mathbf{a} = 9 + 1 = 10 = |\mathbf{a}|^2, consistent with a=10|\mathbf{a}| = \sqrt{10} from Definition 3.

The dot product satisfies several fundamental algebraic laws which justify its manipulation in equations.

Theorem 15 (Algebraic Properties of the Dot Product)

For any vectors a,b,cR2\mathbf{a}, \mathbf{b}, \mathbf{c} \in \mathbb{R}^2 and any scalar rRr \in \mathbb{R}, the following hold:

  1. Symmetry: ab=ba\mathbf{a} \cdot \mathbf{b} = \mathbf{b} \cdot \mathbf{a}.
  2. Bilinearity: The dot product is linear in both arguments:
    • (ra+b)c=r(ac)+bc(r\mathbf{a} + \mathbf{b}) \cdot \mathbf{c} = r(\mathbf{a} \cdot \mathbf{c}) + \mathbf{b} \cdot \mathbf{c}.
    • a(rb+c)=r(ab)+ac\mathbf{a} \cdot (r\mathbf{b} + \mathbf{c}) = r(\mathbf{a} \cdot \mathbf{b}) + \mathbf{a} \cdot \mathbf{c}.
  3. Positive Definiteness: aa0\mathbf{a} \cdot \mathbf{a} \ge 0, with equality if and only if a=0\mathbf{a} = \mathbf{0}.
Proof

These properties follow directly from the properties of real numbers. Symmetry holds because a1b1+a2b2=b1a1+b2a2a_1 b_1 + a_2 b_2 = b_1 a_1 + b_2 a_2. Bilinearity is verified by expanding the components: (ra1+b1)c1+(ra2+b2)c2=r(a1c1+a2c2)+(b1c1+b2c2)(ra_1 + b_1)c_1 + (ra_2 + b_2)c_2 = r(a_1 c_1 + a_2 c_2) + (b_1 c_1 + b_2 c_2). The second bilinearity identity follows from symmetry and the first. Positive definiteness follows from the fact that a12+a220a_1^2 + a_2^2 \ge 0, with equality only when a1=a2=0a_1 = a_2 = 0.

Remark

The vector space R2\mathbb{R}^2 equipped with this specific dot product is often referred to as the Euclidean plane. This structure bridges the gap between abstract vector spaces and Euclidean geometry.

Metric Theorems

The interaction between the dot product and the magnitude leads to several powerful inequalities and identities.

Theorem 16 (Fundamental Metric Identities)

Let a\mathbf{a} and b\mathbf{b} be vectors in R2\mathbb{R}^2.

  1. Homogeneity: ra=ra|r\mathbf{a}| = |r|\,|\mathbf{a}|.
  2. Parallelogram Law: a+b2+ab2=2(a2+b2)|\mathbf{a} + \mathbf{b}|^2 + |\mathbf{a} - \mathbf{b}|^2 = 2(|\mathbf{a}|^2 + |\mathbf{b}|^2).
  3. Cauchy-Schwarz Inequality: abab|\mathbf{a} \cdot \mathbf{b}| \le |\mathbf{a}|\,|\mathbf{b}|.
  4. Triangle Inequality: a+ba+b|\mathbf{a} + \mathbf{b}| \le |\mathbf{a}| + |\mathbf{b}|.
  5. Law of Cosines: ab2=a2+b22(ab)|\mathbf{a} - \mathbf{b}|^2 = |\mathbf{a}|^2 + |\mathbf{b}|^2 - 2(\mathbf{a} \cdot \mathbf{b}).
Proof

(i) We have ra2=(ra)(ra)=r2(aa)=r2a2|r\mathbf{a}|^2 = (r\mathbf{a}) \cdot (r\mathbf{a}) = r^2(\mathbf{a} \cdot \mathbf{a}) = r^2 |\mathbf{a}|^2. Taking square roots gives ra=ra|r\mathbf{a}| = |r|\,|\mathbf{a}|.

(ii) We expand the norms using the dot product properties:

a+b2=(a+b)(a+b)=a2+2(ab)+b2|\mathbf{a} + \mathbf{b}|^2 = (\mathbf{a} + \mathbf{b}) \cdot (\mathbf{a} + \mathbf{b}) = |\mathbf{a}|^2 + 2(\mathbf{a} \cdot \mathbf{b}) + |\mathbf{b}|^2ab2=(ab)(ab)=a22(ab)+b2|\mathbf{a} - \mathbf{b}|^2 = (\mathbf{a} - \mathbf{b}) \cdot (\mathbf{a} - \mathbf{b}) = |\mathbf{a}|^2 - 2(\mathbf{a} \cdot \mathbf{b}) + |\mathbf{b}|^2

Adding these two equations yields the result. Geometrically, this states that the sum of the squares of the diagonals of a parallelogram equals the sum of the squares of its sides.

(iii) If a=0\mathbf{a} = \mathbf{0}, the inequality holds trivially. Assume a0\mathbf{a} \neq \mathbf{0}. Consider the vector v(x)=xa+b\mathbf{v}(x) = x\mathbf{a} + \mathbf{b} for any scalar xx. By positive definiteness, v(x)20|\mathbf{v}(x)|^2 \ge 0. Expanding:

xa+b2=x2a2+2x(ab)+b20.|x\mathbf{a} + \mathbf{b}|^2 = x^2|\mathbf{a}|^2 + 2x(\mathbf{a} \cdot \mathbf{b}) + |\mathbf{b}|^2 \ge 0.

This is a quadratic polynomial in xx. Since it is non-negative for all real xx, its discriminant must be non-positive:

Δ=4(ab)24a2b20    (ab)2a2b2.\Delta = 4(\mathbf{a} \cdot \mathbf{b})^2 - 4|\mathbf{a}|^2 |\mathbf{b}|^2 \le 0 \implies (\mathbf{a} \cdot \mathbf{b})^2 \le |\mathbf{a}|^2 |\mathbf{b}|^2.

Taking the square root yields abab|\mathbf{a} \cdot \mathbf{b}| \le |\mathbf{a}|\,|\mathbf{b}|.

(iv) Using the expansion from part (ii) and the Cauchy-Schwarz inequality:

a+b2=a2+b2+2(ab)a2+b2+2ab=(a+b)2.|\mathbf{a} + \mathbf{b}|^2 = |\mathbf{a}|^2 + |\mathbf{b}|^2 + 2(\mathbf{a} \cdot \mathbf{b}) \le |\mathbf{a}|^2 + |\mathbf{b}|^2 + 2|\mathbf{a}|\,|\mathbf{b}| = (|\mathbf{a}| + |\mathbf{b}|)^2.

Taking square roots gives a+ba+b|\mathbf{a} + \mathbf{b}| \le |\mathbf{a}| + |\mathbf{b}|.

(v) This is simply the expansion ab2=(ab)(ab)=a22(ab)+b2|\mathbf{a} - \mathbf{b}|^2 = (\mathbf{a} - \mathbf{b}) \cdot (\mathbf{a} - \mathbf{b}) = |\mathbf{a}|^2 - 2(\mathbf{a} \cdot \mathbf{b}) + |\mathbf{b}|^2.

The Parallelogram Law involves the diagonals a+b and a-b

Angle and Orthogonality

To appreciate why identity (v) in Theorem 16 is commonly referred to as the Cosine Law for vectors, consider a triangle OABOAB in the plane. Let θ\theta be the angle at the vertex OO. The classical Law of Cosines from trigonometry states:

AB2=OA2+OB22(OA)(OB)cosθ.AB^2 = OA^2 + OB^2 - 2(OA)(OB)\cos\theta.

Setting a=OA\mathbf{a} = \overrightarrow{OA} and b=OB\mathbf{b} = \overrightarrow{OB}, we have AB=abAB = |\mathbf{a} - \mathbf{b}|. Substituting:

ab2=a2+b22abcosθ.|\mathbf{a} - \mathbf{b}|^2 = |\mathbf{a}|^2 + |\mathbf{b}|^2 - 2|\mathbf{a}|\,|\mathbf{b}|\cos\theta.

Comparing this with identity (v), which gives ab2=a2+b22(ab)|\mathbf{a} - \mathbf{b}|^2 = |\mathbf{a}|^2 + |\mathbf{b}|^2 - 2(\mathbf{a} \cdot \mathbf{b}), we equate the angle-dependent terms:

ab=abcosθor equivalentlycosθ=abab.\mathbf{a} \cdot \mathbf{b} = |\mathbf{a}|\,|\mathbf{b}|\cos\theta \quad \text{or equivalently} \quad \cos\theta = \frac{\mathbf{a} \cdot \mathbf{b}}{|\mathbf{a}|\,|\mathbf{b}|}.

This allows us to define the angle between two vectors rigorously. For the definition to be valid, the ratio on the right must lie in [1,1][-1, 1]. The Cauchy-Schwarz inequality (identity (iii)) guarantees exactly this:

1abab1.-1 \le \frac{\mathbf{a} \cdot \mathbf{b}}{|\mathbf{a}|\,|\mathbf{b}|} \le 1.

Thus the value is always valid for the arccosine function.

Definition 14 (Angle Between Vectors)

Let a\mathbf{a} and b\mathbf{b} be non-zero vectors. The angle θ\theta between them is the unique number in the interval [0,π][0, \pi] such that

cosθ=abab.\cos \theta = \frac{\mathbf{a} \cdot \mathbf{b}}{|\mathbf{a}|\,|\mathbf{b}|}.
The angle theta between vectors a and b is defined by their dot product
Example 16 (Calculating Angles)

Let a=[10]\mathbf{a} = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, b=[11]\mathbf{b} = \begin{bmatrix} -1 \\ 1 \end{bmatrix}, and c=[553]\mathbf{c} = \begin{bmatrix} 5 \\ -5\sqrt{3} \end{bmatrix}. First, we compute the magnitudes: a=1|\mathbf{a}| = 1, b=2|\mathbf{b}| = \sqrt{2}, c=25+75=10|\mathbf{c}| = \sqrt{25 + 75} = 10.

(i) Angle between a\mathbf{a} and b\mathbf{b}: ab=1\mathbf{a} \cdot \mathbf{b} = -1, so cosθ=1/2\cos\theta = -1/\sqrt{2}, giving θ=3π/4\theta = 3\pi/4.

(ii) Angle between a\mathbf{a} and c\mathbf{c}: ac=5\mathbf{a} \cdot \mathbf{c} = 5, so cosθ=5/10=1/2\cos\theta = 5/10 = 1/2, giving θ=π/3\theta = \pi/3.

(iii) Angle between b\mathbf{b} and c\mathbf{c}: bc=553\mathbf{b} \cdot \mathbf{c} = -5 - 5\sqrt{3}, so cosθ=5(1+3)/(102)=(2+6)/4\cos\theta = -5(1+\sqrt{3})/(10\sqrt{2}) = -(\sqrt{2}+\sqrt{6})/4. This corresponds to θ=11π/12\theta = 11\pi/12.

The dot product also provides a simple algebraic test for perpendicularity. If vectors a\mathbf{a} and b\mathbf{b} are perpendicular, the angle between them is π/2\pi/2. Since cos(π/2)=0\cos(\pi/2) = 0, this implies ab=0\mathbf{a} \cdot \mathbf{b} = 0. We formalise this as orthogonality, extending the concept to include the zero vector.

Definition 15 (Orthogonality)

Two vectors a\mathbf{a} and b\mathbf{b} are said to be orthogonal if ab=0\mathbf{a} \cdot \mathbf{b} = 0. We denote this by ab\mathbf{a} \perp \mathbf{b}.

Note

The zero vector is orthogonal to every vector in R2\mathbb{R}^2, since 0b=0\mathbf{0} \cdot \mathbf{b} = 0 for all b\mathbf{b}.

Example 17

The vectors a=[32]\mathbf{a} = \begin{bmatrix} 3 \\ 2 \end{bmatrix} and b=[23]\mathbf{b} = \begin{bmatrix} -2 \\ 3 \end{bmatrix} are orthogonal, since ab=(3)(2)+(2)(3)=0\mathbf{a} \cdot \mathbf{b} = (3)(-2) + (2)(3) = 0. Notice that swapping the components and negating one always produces an orthogonal vector: if a=[a1a2]\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \end{bmatrix}, then a=[a2a1]\mathbf{a}^\perp = \begin{bmatrix} a_2 \\ -a_1 \end{bmatrix} satisfies aa=a1a2a2a1=0\mathbf{a} \cdot \mathbf{a}^\perp = a_1 a_2 - a_2 a_1 = 0.

Problem 11

Determine all vectors b=[b1b2]\mathbf{b} = \begin{bmatrix} b_1 \\ b_2 \end{bmatrix} that are simultaneously orthogonal to a=[12]\mathbf{a} = \begin{bmatrix} 1 \\ 2 \end{bmatrix} and have magnitude 5\sqrt{5}.

Orthogonal Projection

A fundamental problem in geometry and physics is the resolution of a vector into distinct components relative to a reference direction. Given a vector x\mathbf{x} and a non-zero reference vector a\mathbf{a}, we wish to decompose x\mathbf{x} into a sum x=p+b\mathbf{x} = \mathbf{p} + \mathbf{b}, where p\mathbf{p} is parallel to a\mathbf{a} and b\mathbf{b} is orthogonal to a\mathbf{a}.

One may approach this using elementary algebra. Let a=[a1a2]0\mathbf{a} = \begin{bmatrix} a_1 \\ a_2 \end{bmatrix} \neq \mathbf{0}. We seek a scalar tt and a scalar uu such that x=ta+ua\mathbf{x} = t\mathbf{a} + u\mathbf{a}^\perp, where a=[a2a1]\mathbf{a}^\perp = \begin{bmatrix} a_2 \\ -a_1 \end{bmatrix}. This leads to the system:

a1t+a2u=x1,a2ta1u=x2.a_1 t + a_2 u = x_1, \qquad a_2 t - a_1 u = x_2.

Multiplying the first equation by a1a_1 and the second by a2a_2, then adding, eliminates uu:

(a12+a22)t=a1x1+a2x2.(a_1^2 + a_2^2)\,t = a_1 x_1 + a_2 x_2.

Recognising the terms as dot products and magnitudes, we obtain ta2=xat\,|\mathbf{a}|^2 = \mathbf{x} \cdot \mathbf{a}, giving t=(xa)/a2t = (\mathbf{x} \cdot \mathbf{a})/|\mathbf{a}|^2. This algebraic result motivates the following vector-based formulation.

Theorem 17 (Orthogonal Decomposition)

Let aR2\mathbf{a} \in \mathbb{R}^2 be a non-zero vector. For any vector xR2\mathbf{x} \in \mathbb{R}^2, there exists a unique scalar tt and a unique vector b\mathbf{b} such that

x=ta+bandba.\mathbf{x} = t\mathbf{a} + \mathbf{b} \quad \text{and} \quad \mathbf{b} \perp \mathbf{a}.

The vector p=ta\mathbf{p} = t\mathbf{a} is called the orthogonal projection of x\mathbf{x} onto a\mathbf{a}, denoted projax\mathrm{proj}_{\mathbf{a}}\mathbf{x}.

Proof

We seek a scalar tt such that the vector b=xta\mathbf{b} = \mathbf{x} - t\mathbf{a} is orthogonal to a\mathbf{a}:

(xta)a=0    xat(aa)=0.(\mathbf{x} - t\mathbf{a}) \cdot \mathbf{a} = 0 \implies \mathbf{x} \cdot \mathbf{a} - t(\mathbf{a} \cdot \mathbf{a}) = 0.

Since a0\mathbf{a} \neq \mathbf{0}, we have aa0\mathbf{a} \cdot \mathbf{a} \neq 0, yielding the unique solution:

t=xaaa=xaa2.t = \frac{\mathbf{x} \cdot \mathbf{a}}{\mathbf{a} \cdot \mathbf{a}} = \frac{\mathbf{x} \cdot \mathbf{a}}{|\mathbf{a}|^2}.

The projection vector is therefore:

projax=(xaa2)a.\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \left(\frac{\mathbf{x} \cdot \mathbf{a}}{|\mathbf{a}|^2}\right)\mathbf{a}.

Graphically, the term b=xprojax\mathbf{b} = \mathbf{x} - \mathrm{proj}_{\mathbf{a}}\mathbf{x} represents the perpendicular distance from the tip of x\mathbf{x} to the line spanned by a\mathbf{a}.

The orthogonal decomposition of x onto the line generated by a
Corollary 1 (Special Cases)
  1. x\mathbf{x} and a\mathbf{a} are linearly dependent if and only if projax=x\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \mathbf{x} (i.e., b=0\mathbf{b} = \mathbf{0}).
  2. x\mathbf{x} and a\mathbf{a} are orthogonal if and only if projax=0\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \mathbf{0}.
Proof

Recall from Theorem 17 that projax=(xaa2)a\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \left(\frac{\mathbf{x} \cdot \mathbf{a}}{|\mathbf{a}|^2}\right)\mathbf{a} and b=xprojax\mathbf{b} = \mathbf{x} - \mathrm{proj}_{\mathbf{a}}\mathbf{x}.

(i) If x\mathbf{x} and a\mathbf{a} are linearly dependent, then x=ka\mathbf{x} = k\mathbf{a} for some scalar kk. Then xa=ka2\mathbf{x} \cdot \mathbf{a} = k|\mathbf{a}|^2, so projax=ka=x\mathrm{proj}_{\mathbf{a}}\mathbf{x} = k\mathbf{a} = \mathbf{x}, giving b=0\mathbf{b} = \mathbf{0}. Conversely, if projax=x\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \mathbf{x}, then x=(xaa2)a\mathbf{x} = \left(\frac{\mathbf{x} \cdot \mathbf{a}}{|\mathbf{a}|^2}\right)\mathbf{a}, which expresses x\mathbf{x} as a scalar multiple of a\mathbf{a}.

(ii) If xa\mathbf{x} \perp \mathbf{a}, then xa=0\mathbf{x} \cdot \mathbf{a} = 0, so projax=0\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \mathbf{0}. Conversely, if projax=0\mathrm{proj}_{\mathbf{a}}\mathbf{x} = \mathbf{0}, then xaa2=0\frac{\mathbf{x} \cdot \mathbf{a}}{|\mathbf{a}|^2} = 0, which implies xa=0\mathbf{x} \cdot \mathbf{a} = 0, so xa\mathbf{x} \perp \mathbf{a}.

The length of the projection vector is given by:

projax=xaa2a=xaa.|\mathrm{proj}_{\mathbf{a}}\mathbf{x}| = \left|\frac{\mathbf{x} \cdot \mathbf{a}}{|\mathbf{a}|^2}\right| |\mathbf{a}| = \frac{|\mathbf{x} \cdot \mathbf{a}|}{|\mathbf{a}|}.

This formula is particularly useful for finding the distance between points projected onto a line.

Example 18 (Projection of a Segment)

Let X=(1,3)X = (-1, 3), Y=(3,0)Y = (3, 0), A=(2,4)A = (2, 4), and B=(1,2)B = (1, -2). We wish to find the length of the orthogonal projection of the segment XYXY onto the line passing through AA and BB.

The displacement vectors are z=XY=[43]\mathbf{z} = \overrightarrow{XY} = \begin{bmatrix} 4 \\ -3 \end{bmatrix} and c=AB=[16]\mathbf{c} = \overrightarrow{AB} = \begin{bmatrix} -1 \\ -6 \end{bmatrix}. The length of the projection of segment XYXY onto line ABAB is:

zcc=(4)(1)+(3)(6)1+36=1437.\frac{|\mathbf{z} \cdot \mathbf{c}|}{|\mathbf{c}|} = \frac{|(4)(-1) + (-3)(-6)|}{\sqrt{1 + 36}} = \frac{14}{\sqrt{37}}.
Problem 12

Let a=[12]\mathbf{a} = \begin{bmatrix} 1 \\ 2 \end{bmatrix} and x=[71]\mathbf{x} = \begin{bmatrix} 7 \\ 1 \end{bmatrix}. Compute projax\mathrm{proj}_{\mathbf{a}}\mathbf{x} and the orthogonal component b=xprojax\mathbf{b} = \mathbf{x} - \mathrm{proj}_{\mathbf{a}}\mathbf{x}. Verify that ba\mathbf{b} \perp \mathbf{a}.

The Equation of a Straight Line

In elementary coordinate geometry, a straight line is defined as the locus of points X=(x,y)X = (x, y) satisfying a linear equation of the form

ax+by+c=0,ax + by + c = 0,

where aa and bb are not both zero. By translating this algebraic constraint into the language of vectors, we uncover the geometric significance of the coefficients aa and bb.

The Point-Normal Form

Let n=[ab]\mathbf{n} = \begin{bmatrix} a \\ b \end{bmatrix} and let x=[xy]\mathbf{x} = \begin{bmatrix} x \\ y \end{bmatrix} be the position vector of a point XX. The linear term ax+byax + by is precisely the dot product nx\mathbf{n} \cdot \mathbf{x}. Consequently, the equation of the line may be rewritten as

nx+c=0.\mathbf{n} \cdot \mathbf{x} + c = 0.

To interpret this geometrically, let PP be a fixed point on the line with position vector p\mathbf{p}. Since PP lies on the line, its coordinates satisfy the equation, so np+c=0\mathbf{n} \cdot \mathbf{p} + c = 0, which implies c=npc = -\mathbf{n} \cdot \mathbf{p}. Substituting this back into the general equation yields

nxnp=0    n(xp)=0.\mathbf{n} \cdot \mathbf{x} - \mathbf{n} \cdot \mathbf{p} = 0 \implies \mathbf{n} \cdot (\mathbf{x} - \mathbf{p}) = 0.

The vector xp\mathbf{x} - \mathbf{p} represents the displacement PX\overrightarrow{PX} along the line. The condition nPX=0\mathbf{n} \cdot \overrightarrow{PX} = 0 implies that n\mathbf{n} is orthogonal to the direction of the line.

Definition 16 (Normal Vector)

A non-zero vector n\mathbf{n} is called a normal vector to a straight line LL if n\mathbf{n} is orthogonal to the displacement vector between any two distinct points on LL.

This leads to the vector characterisation of a straight line.

Theorem 18 (Point-Normal Form)

The straight line passing through a specific point PP with position vector p\mathbf{p}, and perpendicular to a non-zero normal vector n\mathbf{n}, consists of all points XX with position vectors x\mathbf{x} satisfying

n(xp)=0.\mathbf{n} \cdot (\mathbf{x} - \mathbf{p}) = 0.
The line L through P with normal n, showing the displacement x minus p is orthogonal to n
Example 19 (Constructing a Line)

Find the equation of the line passing through A(3,2)A(3, 2) and perpendicular to the vector n=[21]\mathbf{n} = \begin{bmatrix} 2 \\ -1 \end{bmatrix}.

Using the point-normal form:

[21][x3y2]=0\begin{bmatrix} 2 \\ -1 \end{bmatrix} \cdot \begin{bmatrix} x - 3 \\ y - 2 \end{bmatrix} = 02(x3)1(y2)=02(x - 3) - 1(y - 2) = 02x6y+2=0    2xy4=0.2x - 6 - y + 2 = 0 \implies 2x - y - 4 = 0.

This formulation allows us to easily find the perpendicular bisector of a segment.

Example 20 (Perpendicular Bisector)

Let A=(1,3)A = (-1, 3) and B=(5,1)B = (5, 1). The perpendicular bisector of AB\overline{AB} passes through the midpoint MM of the segment and has AB\overrightarrow{AB} as a normal vector.

  1. Midpoint: m=12(a+b)=[1+523+12]=[22]\mathbf{m} = \tfrac{1}{2}(\mathbf{a} + \mathbf{b}) = \begin{bmatrix} \frac{-1+5}{2} \\ \frac{3+1}{2} \end{bmatrix} = \begin{bmatrix} 2 \\ 2 \end{bmatrix}.
  2. Normal vector: n=ba=[62]\mathbf{n} = \mathbf{b} - \mathbf{a} = \begin{bmatrix} 6 \\ -2 \end{bmatrix}. For simplicity, we may use the parallel vector [31]\begin{bmatrix} 3 \\ -1 \end{bmatrix}.
  3. Equation: [31][x2y2]=0    3(x2)(y2)=0    3xy4=0\begin{bmatrix} 3 \\ -1 \end{bmatrix} \cdot \begin{bmatrix} x - 2 \\ y - 2 \end{bmatrix} = 0 \implies 3(x - 2) - (y - 2) = 0 \implies 3x - y - 4 = 0.

Distance from a Point to a Line

The vector approach provides an elegant derivation for the distance from a point to a line, a result that is often tedious to prove using classical coordinates.

Theorem 19 (Distance to a Line)

Let LL be the line defined by nx+c=0\mathbf{n} \cdot \mathbf{x} + c = 0. The perpendicular distance from an arbitrary point X0X_0 (with position vector x0\mathbf{x}_0) to LL is given by

d(X0,L)=nx0+cn.d(X_0, L) = \frac{|\mathbf{n} \cdot \mathbf{x}_0 + c|}{|\mathbf{n}|}.
Proof

Let PP be any point on the line LL. Then np+c=0\mathbf{n} \cdot \mathbf{p} + c = 0, so c=npc = -\mathbf{n} \cdot \mathbf{p}. The distance from X0X_0 to LL is the length of the orthogonal projection of the displacement vector PX0\overrightarrow{PX_0} onto the normal vector n\mathbf{n}.

Using the projection formula from Theorem 17:

d=projn(x0p)=((x0p)nn2)n=(x0p)nn.d = |\mathrm{proj}_{\mathbf{n}}(\mathbf{x}_0 - \mathbf{p})| = \left|\left(\frac{(\mathbf{x}_0 - \mathbf{p}) \cdot \mathbf{n}}{|\mathbf{n}|^2}\right)\mathbf{n}\right| = \frac{|(\mathbf{x}_0 - \mathbf{p}) \cdot \mathbf{n}|}{|\mathbf{n}|}.

Expanding the dot product in the numerator:

d=nx0npn.d = \frac{|\mathbf{n} \cdot \mathbf{x}_0 - \mathbf{n} \cdot \mathbf{p}|}{|\mathbf{n}|}.

Substituting np=c-\mathbf{n} \cdot \mathbf{p} = c, we obtain

d=nx0+cn.d = \frac{|\mathbf{n} \cdot \mathbf{x}_0 + c|}{|\mathbf{n}|}.
The distance d is the magnitude of the projection of the displacement onto the normal n

This theorem suggests a canonical way to represent straight lines. If we divide the equation nx+c=0\mathbf{n} \cdot \mathbf{x} + c = 0 by the magnitude n|\mathbf{n}|, we obtain the Hessian normal form:

nnx+cn=0.\frac{\mathbf{n}}{|\mathbf{n}|} \cdot \mathbf{x} + \frac{c}{|\mathbf{n}|} = 0.

Defining the unit normal n^=n/n\hat{\mathbf{n}} = \mathbf{n}/|\mathbf{n}| and p=c/np = c/|\mathbf{n}|, the function f(x)=n^x+pf(\mathbf{x}) = \hat{\mathbf{n}} \cdot \mathbf{x} + p has the property that f(x0)|f(\mathbf{x}_0)| gives the exact distance from X0X_0 to the line.

Example 21 (Distance Calculation)

Find the distance from X0(2,1)X_0(2, -1) to the line 3x+4y12=03x + 4y - 12 = 0.

Here n=[34]\mathbf{n} = \begin{bmatrix} 3 \\ 4 \end{bmatrix}, so n=32+42=5|\mathbf{n}| = \sqrt{3^2 + 4^2} = 5.

d=3(2)+4(1)125=64125=105=2.d = \frac{|3(2) + 4(-1) - 12|}{5} = \frac{|6 - 4 - 12|}{5} = \frac{|-10|}{5} = 2.

Angle Between Lines

The angle between two straight lines is defined as the angle between their respective normal vectors. This definition is consistent with the geometric intuition that if two lines intersect, the angle between them is preserved if both are rotated by 90°90° (transforming the lines into their normals).

Theorem 20 (Angle Between Lines)

Let L1L_1 and L2L_2 be lines with normal vectors n1\mathbf{n}_1 and n2\mathbf{n}_2 respectively. The angle θ[0,π]\theta \in [0, \pi] between the lines is given by

cosθ=n1n2n1n2.\cos\theta = \frac{\mathbf{n}_1 \cdot \mathbf{n}_2}{|\mathbf{n}_1|\,|\mathbf{n}_2|}.

If we are interested in the acute angle ϕ\phi between the lines, we take the absolute value of the right-hand side: cosϕ=n1n2n1n2\cos\phi = \frac{|\mathbf{n}_1 \cdot \mathbf{n}_2|}{|\mathbf{n}_1|\,|\mathbf{n}_2|}.

Example 22 (Angle Between Two Lines)

Find the angle between the lines x2y+3=0x - 2y + 3 = 0 and 3x+y5=03x + y - 5 = 0.

The normal vectors are n1=[12]\mathbf{n}_1 = \begin{bmatrix} 1 \\ -2 \end{bmatrix} and n2=[31]\mathbf{n}_2 = \begin{bmatrix} 3 \\ 1 \end{bmatrix}.

n1n2=(1)(3)+(2)(1)=1.\mathbf{n}_1 \cdot \mathbf{n}_2 = (1)(3) + (-2)(1) = 1.n1=5,n2=10.|\mathbf{n}_1| = \sqrt{5}, \quad |\mathbf{n}_2| = \sqrt{10}.cosθ=1510=150=152.\cos\theta = \frac{1}{\sqrt{5}\,\sqrt{10}} = \frac{1}{\sqrt{50}} = \frac{1}{5\sqrt{2}}.
Problem 13

Find the equation of the line passing through A(1,4)A(1, 4) with normal vector n=[32]\mathbf{n} = \begin{bmatrix} 3 \\ 2 \end{bmatrix}. Then compute the perpendicular distance from B(5,1)B(5, -1) to this line.

Problem 14

Two lines are given by 2x+y1=02x + y - 1 = 0 and x3y+5=0x - 3y + 5 = 0. Determine the acute angle between them.