Newton’s Method
In Lesson 6AM we solved every “find from ” question by taking of both sides, so each unknown reduced cleanly to a closed-form expression in . The Picasso problem solved ; the doubling-time problem solved ; the matching-rate problem solved . Every equation in that lesson admitted an exact closed-form answer because the operations involved were already invertible in the toolkit we’d built up by Lesson 5PM.
Most equations we meet in practice are not so kind. The bookkeeping equation
which arises whenever the half-life of one process must equal the time constant of another, has no solution expressible using the functions in our current toolkit: , roots, , and . The polynomial equation
which arises in a structural-engineering load problem, does admit an exact root formula (the quartic formula), but the formula is too unwieldy for routine work. The single positive solution is approximately , and the closed form is several lines of nested radicals.
Both equations have the same shape: for an explicit, differentiable . Our closing application of MA0A turns the entire derivative toolkit toward a numerical method that produces such to as many decimals as desired, using only and . The key is the linear approximation that the derivative supplies us at each point.
The Tangent-Line Trick
We pick any starting guess and look at the tangent line to at the point . By the point-slope formula from Lesson 1AM, our tangent has equation
Setting in gives us the -intercept of the tangent:
The right-hand side is defined whenever . We call it .
The tangent line is the best linear approximation to near that the derivative supplies us. If the curve is close to its tangent over the interval between and the true root, then is closer to the root than was. Repeating the construction with in place of produces , then , and so on.
Let be a differentiable function and a starting guess with . The Newton iteration for from is the sequence defined by
provided at every step.
is the -intercept of the tangent line to at the point .
The tangent line to at has equation
by point-slope. Setting and dividing by gives , which is .
■
If the linear approximation is good, has overshot the root by less than did, and each iterate sits closer to the root than the previous one.
A Worked Example:
We rewrite equation as . By the product rule applied to and , with the derivative from Lesson 5AM,
Apply Newton’s iteration from and report the first few iterates to seven decimal places.
By with our explicit and above,
At : , , so
Continuing the iteration with a calculator at each step:
Our iterates have stabilised at to seven decimal places. The truncated value satisfies , with .
After only four iterations our digits stop changing. The number of correct decimals roughly doubles at each step: our gap between and the root has size , between and the root , between and the digits agree only at the first decimal, and by the agreement is at the seventh.
This doubling of correct decimals is the typical behaviour of Newton’s method when the starting guess is close enough to the root. A formal proof needs more of the linear-approximation theory than this lesson develops; the observation that each iteration roughly squares the previous error is enough to explain why we prefer Newton’s method to bisection in practice.
A Quartic Example
The polynomial has , and by the Power and sum rules from Lesson 2PM
Apply Newton’s iteration to from the starting guess , in search of the unique positive root near .
A direct evaluation gives us and , so
Continuing each step with a calculator:
Our iterates converge on after six rounds. The first few rounds spend their effort closing the gap between and the basin of fast convergence near the root; once we are close, the digits double per round, exactly as in the previous example.
Our starting guess is a poor choice numerically, since is far from zero. A starting guess closer to the root, say , would converge in fewer rounds: , , , and a few more rounds deliver the same accuracy.
Our two examples share the same arithmetic: every iteration is one evaluation of , one evaluation of , and one division. Both use the toolkit we assembled in Lessons 4 and 5; nothing new about the operations themselves.
A Cube Root in Closed Form
The classical demonstration of Newton’s method is the computation of to many decimals, using only , , , .
Solve by Newton’s method from , and report the iterates to seven decimal places.
We set and from the power rule of Lesson 2PM. Our iteration becomes
where the simplification splits from the first term and isolates the constant from the second.
The iterates from :
By our iterates have stabilised at , with on a calculator that carries fifteen digits.
Our closed-form simplification admits the reading “two-thirds of the previous estimate plus one-third of the exact fix”: each iterate is a weighted average that gives more weight to the previous estimate but pulls partly toward the value that would solve the equation if the curve were genuinely linear at .
Problem 180
Use Newton’s method from to compute to six decimal places. Show the simplified iteration formula in the form
and tabulate .
Problem 181
For the equation , Newton’s method from converges to one root and from converges to another. Compute the first three iterates from each starting point to four decimal places, and identify the two roots numerically. Use the formula for the derivative of from Lesson 5AM.
When the Tangent Is Horizontal
Our iteration involves a division by and breaks down whenever the tangent at is horizontal.
For , the derivative from vanishes at , since never vanishes and the factor does. Starting Newton’s iteration from produces
which is undefined. Geometrically, the tangent at is horizontal and never crosses the -axis.
Our remedy is a different starting guess. Any with avoids the break-down. For this , the only at which is ; every other starting choice keeps our iteration well-defined.
A horizontal tangent is also a danger nearby, even when the starting iterate avoids the exact zero of . The denominator in shrinks to nearly zero as approaches a critical number, blowing up the size of the correction and throwing the next iterate far from where it would otherwise land.
A Starting Point That Wanders Off
Our other failure mode is more subtle: the iteration is well-defined at every step but moves away from the root. The shape of the curve far from the root determines whether this happens.
The same equation has its unique real root near . Starting Newton’s iteration from :
so
The next iterate is far further from the root than the start. Continuing,
The iterates diverge toward .
The cause is visible in our curve : as , the term shrinks to zero (the exponential decay overwhelms the linear factor), so . The curve has the line as a horizontal asymptote on the left. Newton’s iteration, looking at the gentle slope of the asymptotic region and assuming the curve is approximately linear, projects the tangent to a faraway intercept; our next iterate lands in even gentler asymptotic territory; and the process accelerates outward instead of homing in.
The shallow slope of the tangent in the asymptotic region is the cause: dividing by a small produces a large correction. Newton’s method needs more than ; it needs a starting region where the curve is not too close to a horizontal asymptote.
The two failure cases just shown sit at opposite extremes:
- Local horizontal tangent at the iterate (): division by zero, the next iterate is undefined.
- Far-asymptotic region ( close to a horizontal asymptote): division by a tiny number, the next iterate is huge and farther from the root.
Both are symptoms of the same tension: a shallow tangent line cannot be trusted to predict where the true curve crosses zero.
A Function with No Real Root
Newton’s method assumes that has a solution. When it does not, our iteration must do something with each , and the resulting sequence cannot converge to anything sensible.
The function satisfies for every real , so there is no with . The Newton iteration formula
is, however, defined whenever . Starting from gives the sequence
Our iterates oscillate without settling. Some sit close to the origin, others land well outside the interval , and the magnitudes occasionally swing into the threes. No sub-pattern repeats.
A starting guess as close to the origin as already runs into trouble of a different kind: the next iterate is
and then forbids any further step. Different starting guesses produce different chaotic behaviours. The qualitative lesson: Newton’s method itself does not certify that a real solution exists; a graph or a separate sign check is still needed.
Problem 182
For the equation on , define . Compute using the derivative of from Lesson 5PM. Starting from , perform Newton’s method to four decimal places and report the iterates . Confirm the iterates are settling by comparing successive values.
Problem 183
Apply Newton’s method to on the real line. The cubic has three real roots, near , , and .
- Compute and identify the two values at which . These are the values to avoid as starting guesses.
- Starting from , , and , perform three Newton iterations from each. Confirm that each starting guess converges to the nearest of the three roots.
- Find a starting guess between and that converges to the negative root rather than the nearby root at , and explain in one sentence why such a starting guess exists. (Hint: pick a point at which the tangent is shallow but pointing leftward.)
Problem 184
A British project manager solves for the time at which a mixed savings account, with balance
first reaches £. The equation has no closed-form solution.
- Set . Compute using the formula for .
- Starting from , perform Newton’s iteration to four decimal places. Tabulate .
- Verify the answer by computing at the converged and confirming it is within £ of £.
Problem 185
(Harder.) For from , the iteration takes the form
Show that the right side simplifies to
and identify each summand. The first term comes from the difference ; the second comes from the constant . Use the simplified formula to recompute from and confirm .
Problem 186
(Harder.) A function is called monotonic on an interval if it is either strictly increasing or strictly decreasing on the interval. Suppose is differentiable, everywhere on an interval , and , so has a unique root in . Use the second derivative rule to argue that:
- If on (so is concave up there), then for every starting guess in with , the Newton iterates form a strictly decreasing sequence bounded below by .
- If on (so is concave down there), then for every starting guess in with , the Newton iterates form a strictly increasing sequence bounded above by .
(In each case the iterates approach the root from one side without overshoot. The general overshoot pattern of Newton’s method is therefore forbidden when the curve has the favourable concavity sign across the interval.)
Closing
Newton’s method is our last application of differentiation in this course. It uses every part of the toolkit we’ve built (the tangent-line construction from Lesson 2, the rules of differentiation from Lessons 4 and 5, the geometry of asymptotes from Lesson 3PM, and the rate-equation reading of from Lesson 6AM) to turn an equation that resists algebra into a numerical procedure that converges in a handful of arithmetic steps.
Our next major question, taken up in MA0B, asks for the cumulative total of a function over an interval rather than the slope at a point. The two questions are connected, and the answer to the second turns out to involve the answer to the first in a way that justifies our entire derivative apparatus once more, but the development belongs to the integration course.
| Tool | Form |
|---|---|
| Newton’s iteration | |
| Geometric reading | is the -intercept of the tangent at |
| Failure modes | , asymptotic region, no real root |