Preface


This is a tutorial made solely for the purpose of education and it was designed for students taking Applied Math 0330. It is primarily for students who have very little experience or have never used Mathematica before and would like to learn more of the basics for this computer algebra system. As a friendly reminder, don't forget to clear variables in use and/or the kernel.

Finally, the commands in this tutorial are all written in bold black font, while Mathematica output is in normal font. This means that you can copy and paste all commands into Mathematica, change the parameters, and run them. You, as the user, are free to use the scripts to learn the Mathematica program, and you may distribute this tutorial and refer to it as long as it is credited appropriately.


Fixed Point Iteration


Iteration is a fundamental principle in computer science. As the name suggests, it is a process that is repeated until an answer is obtained or the process is stopped. In this section, we study the process of iteration using repeated substitution.

More specifically, given a function g defined on the real numbers with real values and given a point x0 in the domain of g, the fixed point iteration is

\[ x_{i+1} = g(x_i ) \quad i =0, 1, 2, \ldots , \]
which gives rise to the sequence \( \left\{ x_i \right\}_{i\ge 0} . \) If this sequence converges to a point x, then one can prove that the obtained x is a fixed point of g, namely,
\[ x = g(x ) . \]
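The iteration itself is easy to generate in Mathematica with NestList. The following is a minimal sketch; the particular function g(x) = cos x and the starting value 1.0 are chosen only for illustration.
(* iterate the map x -> g(x) ten times, starting from x0 = 1.0 *)
g[x_] := Cos[x]
NestList[g, 1.0, 10]
The list returned by NestList contains the iterates x0, x1, ..., x10.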
When one wants to apply a function repeatedly until the result stops changing, Mathematica provides the dedicated commands FixedPoint and FixedPointList. We also present some alternative approaches; for instance, one can use the NSolve command, as the following example shows.
Example: Suppose we want to find all the fixed points of \( f(x)=x\, \sin (1/x) . \) Since it has infinitely many fixed points, there are in theory infinitely many outputs. Because \( \sin (1/x) \) is undefined at the origin, we let f(x) be the following piecewise function:
f[x_] := Piecewise[{{x Sin [1/x], -1 <= x < 0 || 0 < x <= 1}}, 0]
Then we apply the NSolve command:
NSolve[x == f[x], x]
{{x -> 0}, {x -> ConditionalExpression[2./(
3.14159 + 12.5664 C[1]), (C[1] \[Element] Integers &&
C[1] >= 0) || (C[1] \[Element] Integers && C[1] <= -1.)]}}
As we see, NSolve provides several answers. You can even get a nice analytical answer out of Solve:
Solve[x == f[x], x]
{{x -> 0}, {x -> ConditionalExpression[
2/(π + 4 π C[1]), (C[1] \[Element] Integers &&
C[1] >= 0) || (C[1] \[Element] Integers && C[1] <= -1)]}}
Example: Suppose we need to solve the polynomial equation \( x^3 + 4\,x^2 -10 =0 , \) which we rewrite as \( 4\, x^2 = 10 - x^3 . \) Taking the square root, we reduce our problem to the fixed point problem \( x = \frac{1}{2}\, \sqrt{10 - x^3} . \) One way to force Mathematica to use a given precision is to wrap the computation in Block and set $MinPrecision equal to $MaxPrecision. Then you can obtain your result as follows:
Block[{$MinPrecision = 10, $MaxPrecision = 10},
 FixedPointList[N[1/2 Sqrt[10 - #^3], 10] &, 1.5`10]]
Mathematical operations on numbers of a given precision cannot guarantee that the output maintains the precision of the input numbers. In general, precision will decrease; the amount of decrease depends on the operations, and some operations will cause the precision to drop significantly. If you want a specific precision in the final result, the working precision should be higher than the desired output precision. Frequently, the working precision is set to twice the desired output precision to allow for a significant loss of precision during complicated calculations. Sometimes machine-precision numbers will perform well enough.
result = FixedPointList[1/2 Sqrt[10 - #^3] &, 1.5];
{Length[result], result[[-1]] // InputForm}
{53, InputForm[1.3652300134140969`]}
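To observe the gradual loss of precision described above, one can map Precision over the iterates produced from an arbitrary-precision starting value; the initial precision 20 and the cap of ten iterations below are arbitrary choices.
(* precision of each iterate when starting from a 20-digit number *)
Precision /@ FixedPointList[1/2 Sqrt[10 - #^3] &, 1.5`20, 10]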
Example: Consider the convergent iteration
\[ p_0 = 0.5 \qquad \mbox{and} \qquad p_{k+1} = e^{-2p_k} \quad \mbox{for} \quad k=0,1,2,\ldots . \]
The first 10 terms are obtained by the calculations
\begin{align*} p_1 &= e^{-1} \approx 0.367879 , \\ p_2 &= e^{-2*p_1} \approx 0.479142 , \\ p_3 &= e^{-2*p_2} \approx 0.383551 , \\ \vdots & \qquad \vdots \\ p_9 &= e^{-2*p_8} \approx 0.409676 , \\ p_{10} &= e^{-2*p_9} \approx 0.440717 . \\ \end{align*}
The sequence is converging, and Mathematica reveals that
\[ \lim_{k\to \infty} p_k = 0.426302751 \ldots . \]
p[0] = 0.5
Do[Print["k= ", k, " " , "p= " , p[k] = Exp[-2*p[k - 1]]], {k, 1, 10}]
FindRoot[p == Exp[-2*p], {p, 0.5}]
Example: Consider \( g(x) = 10/ (x^3 -1) \) and the fixed point iterative scheme \( x_{i+1} = 10/ (x^3_i -1) ,\) with the initial guess x0 = 2.

If you repeat the same procedure, you may be surprised to find that the iteration does not converge: the iterates keep cycling indefinitely, as the sketch below demonstrates.
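Here is a minimal numerical experiment confirming this behavior; the number of displayed iterates is arbitrary.
(* the iterates of g(x) = 10/(x^3 - 1) starting from x0 = 2 keep jumping around and never settle down *)
g[x_] := 10/(x^3 - 1)
NestList[g, 2.0, 10]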

Theorem (Banach): Assume that the function g is continuous on the interval [a,b].
  • If the range of the mapping y = g(x) satisfies \( y \in [a,b] \) for all \( x \in [a,b] , \) then g has a fixed point in [a,b].
  • Furthermore, suppose that the derivative g'(x) is defined over (a,b) and that a positive constant (called Lipschitz constant) L < 1 exists with \( |g' (x) | \le L < 1 \) for all \( x \in (a,b) , \) then g has a unique fixed point P in [a,b]. ■
Stefan Banach (was born in 1892 in Kraków, Austria-Hungary and died 31 August 1945 in Lvov, Ukrainian SSR, Soviet Union) was a Polish mathematician who is generally considered one of the world's most important and influential 20th-century mathematicians. He was one of the founders of modern functional analysis, and an original member of the Lwów School of Mathematics. His major work was the 1932 book, Théorie des opérations linéaires (Theory of Linear Operations), the first monograph on the general theory of functional analysis.
Rudolf Otto Sigismund Lipschitz (1832--1903) was a German mathematician who made contributions to mathematical analysis (where he gave his name to the Lipschitz continuity condition) and differential geometry, as well as number theory, algebras with involution and classical mechanics.
Theorem: Assume that the function g and its derivative are continuous on the interval [a,b]. Suppose that \( g(x) \in [a,b] \) for all \( x \in [a,b] , \) and the initial approximation x0 also belongs to the interval [a,b].
  • If \( |g' (x) | \le K < 1 \) for all \( x \in (a,b) , \) then the iteration \( x_{i+1} = g(x_i ) \) will converge to the unique fixed point \( P \in [a,b] . \) In this case, P is said to be an attractive fixed point: \( P= g(P) . \)
  • If \( |g' (x) | > 1 \) for all \( x \in (a,b) , \) then the iteration \( x_{i+1} = g(x_i ) \) will not converge to P. In this case, P is said to be a repelling fixed point and the iteration exhibits local divergence. ■
In practice, it is often difficult to check the condition \( g([a,b]) \subset [a,b] \) given in the previous theorem. We now present a variant of it.
Theorem: Let P be a fixed point of g(x), that is, \( P= g(P) . \) Suppose g(x) is differentiable on \( \left[ P- \varepsilon , P+\varepsilon \right] \quad\mbox{for some} \quad \varepsilon > 0 \) and g(x) satisfies the condition \( |g' (x) | \le K < 1 \) for all \( x \in \left[ P - \varepsilon , P+\varepsilon \right] . \) Then the sequence \( x_{i+1} = g(x_i ) , \) with starting point \( x_0 \in \left[ P- \varepsilon , P+\varepsilon \right] , \) converges to P.

Remark: If g is invertible then P is a fixed point of g if and only if P is a fixed point of \( g^{-1} . \)

Remark: The above theorems provide only sufficient conditions. It is possible for a function to violate one or more of the hypotheses, yet still have a (possibly unique) fixed point. ■

The Banach theorem allows one to estimate the number of iterations needed to reach a given accuracy ε. It can be computed from the following a priori error estimate,

\[ \mbox{iterations}(\varepsilon ) \ge \frac{\ln \left( \frac{(1-L)\,\varepsilon}{|x_0 - x_1 |} \right)}{\ln L} , \]
and the a posteriori error estimate (where p is the fixed point):
\[ |x_k - p |\le \frac{L}{1-L} \left\vert x_k - x_{k-1} \right\vert . \]
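As a quick illustration, the a priori estimate can be evaluated directly in Mathematica. The values of L, ε, and |x0 - x1| below are hypothetical placeholders, not taken from any particular example.
(* smallest k with L^k/(1-L)*|x0 - x1| <= eps, i.e. k >= ln(eps (1-L)/|x0 - x1|)/ln L *)
With[{L = 0.4, eps = 10^-6, d = 0.5},  (* d plays the role of |x0 - x1| *)
 Ceiling[Log[eps (1 - L)/d]/Log[L]]]   (* yields 15 for these sample values *)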
Example: The function \( g(x) = 2x\left( 1-x \right) \) is continuous on the whole real line \( (-\infty , \infty ) , \) yet it violates the hypotheses of the theorem on, say, the interval \( [0.5, \infty ) : \) it does not map this interval into itself, and its derivative satisfies \( |g' (x)| \to + \infty \) as \( x \to \infty . \) Nevertheless, g(x) has fixed points at x = 0 and x = 1/2.
Example: Consider the equation
\[ x = 1 + 0.4\, \sin x , \qquad \mbox{with} \quad g(x) = 1 + 0.4\, \sin x . \]
Note that g(x) is a continuous function everywhere and \( 0.6 \le g(x) \le 1.4 \) for any \( x \in \mathbb{R} . \) Its derivative satisfies \( \left\vert g' (x) \right\vert = \left\vert 0.4\,\cos x \right\vert \le 0.4 < 1 . \) Therefore, we can apply the theorem and conclude that the fixed point iteration
\[ x_{k+1} = 1 + 0.4\, \sin x_k , \qquad k=0,1,2,\ldots \]
will converge for an arbitrary initial point from the interval [0.6, 1.4].
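A quick Mathematica check; the starting value and the tolerance are arbitrary choices.
(* the iterates settle to the unique fixed point, approximately 1.394 *)
FixedPointList[1 + 0.4 Sin[#] &, 1.0, 100, SameTest -> (Abs[#1 - #2] < 10^-8 &)]
FindRoot[x == 1 + 0.4 Sin[x], {x, 1}]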
Example: Consider a similar equation
\[ x = 1 + 2\, \sin x , \qquad \mbox{with} \quad g(x) = 1 + 2\, \sin x . \]
Since \( -1 \le g(x) \le 3 , \) we look for a fixed point in the interval [-1,3]. For its derivative we have
\[ g'(x) = 2\, \cos x \qquad \Longrightarrow \qquad \max_{x\in [-1,3]} \, \left\vert g' (x) \right\vert =2 > 1, \]
we do not expect convergence of the fixed point iteration \( x_{k+1} = 1 + 2\, \sin x_k . \)
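The lack of convergence is easy to observe numerically; the starting value and the number of iterates shown are arbitrary.
(* the iterates wander inside [-1, 3] without approaching the (repelling) fixed point *)
NestList[1 + 2 Sin[#] &, 1.0, 15]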

 

II. Aitken's Algorithm


Alexander Aitken.

Sometimes we can accelerate or improve the convergence of an algorithm with very little additional effort, simply by using the output of the algorithm to estimate some of the uncomputable quantities. One such acceleration was proposed by A. Aitken. Alexander Craig "Alec" Aitken was born in 1895 in Dunedin, Otago, New Zealand, and died in 1967 in Edinburgh, Scotland, where he had lived since 1925. Aitken had an incredible memory (he knew π to 2000 places) and could instantly multiply, divide, and take roots of large numbers. He played the violin and composed music to a very high standard.

Suppose that we have an iterative process that generates a sequence of numbers \( \{ x_n \}_{n\ge 0} \) that converges to α. We generate a new sequence \( \{ q_n \}_{n\ge 0} \) according to

\[ q_n = x_n - \frac{\left( x_{n+1} - x_n \right)^2}{x_{n+2} -2\, x_{n+1} + x_n} = x_n - \frac{\left( \Delta x_n \right)^2}{\Delta^2 x_n} , \qquad n=0, 1,2,\ldots , \]
where \( \Delta x_n = x_{n+1} - x_n \) is the forward finite difference.
Theorem (Aitken's Acceleration): Assume that the sequence \( \{ p_n \}_{n\ge 0} \) converges linearly to the limit p and that \( p_n \ne p \) for all \( n \ge 0. \) If there exists a real number A with \( |A| < 1 \) such that
\[ \lim_{n \to \infty} \, \frac{p- p_{n+1}}{p- p_n} =A, \]
then the sequence \( \{ q_n \}_{n\ge 0} \) defined by  
\[ q_n = p_n - \frac{\left( \Delta p_n \right)^2}{\Delta^2 p_n} = p_n - \frac{\left( p_{n+1} - p_n \right)^2}{p_{n+2} - 2p_{n+1} + p_n} \]
converges to p faster than \( \{ p_n \}_{n\ge 0} \) in the sense that \( \lim_{n \to \infty} \, \left\vert \frac{p - q_n}{p- p_n} \right\vert =0 . \) ■ 
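The Δ² transformation itself is straightforward to code. The following is a hedged sketch (the helper name aitkenList is ours, not a built-in command): it forms q_n from each consecutive triple of iterates, and the usage line applies it to the linearly convergent sequence p_{k+1} = e^{-2 p_k} considered earlier.
(* Aitken's Delta^2 transform: q_n = x_n - (x_{n+1} - x_n)^2/(x_{n+2} - 2 x_{n+1} + x_n) *)
aitkenList[x_List] :=
 Table[x[[n]] - (x[[n + 1]] - x[[n]])^2/(x[[n + 2]] - 2 x[[n + 1]] + x[[n]]),
  {n, 1, Length[x] - 2}]
(* accelerate the iterates of p -> Exp[-2 p] starting from 0.5 *)
aitkenList[NestList[Exp[-2 #] &, 0.5, 10]]
The accelerated values approach the limit 0.426302... noticeably faster than the original iterates.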

This algorithm was proposed by one of New Zealand's greatest mathematicians, Alexander Craig "Alec" Aitken (1895--1967). When it is applied to determine a fixed point of the equation \( x=g(x) , \) it consists of the following stages:

We assume that g is continuously differentiable, so according to the Mean Value Theorem there exists \( \xi_{n-1} \) between α (which is the root of \( \alpha = g(\alpha ) \) ) and \( x_{n-1} \) such that

\[ \alpha - x_n = g(\alpha ) - g(x_{n-1}) = g' (\xi_{n-1} )(\alpha - x_{n-1}) . \]
Now consider
\[ \alpha - x_n = \left( \alpha - x_{n-1} \right) + \left( x_{n-1} - x_n \right) = \frac{1}{g' (\xi_{n-1})} \,(\alpha - x_n ) + \left( x_{n-1} - x_n \right) , \]
so that solving for α implies
\[ \alpha = x_n + \frac{g' (\xi_{n-1} )}{1- g' (\xi_{n-1} )} \left( x_n - x_{n-1} \right) . \]

Since we are assuming that \( x_n \,\to\, \alpha , \) we also know that \( g' (\xi_n ) \, \to \,g' (\alpha ) ; \) thus, we can denote \( \gamma_n \approx g' (\xi_{n-1} ) \approx g' (\alpha ) . \) Using this notation, we get

\[ \alpha = x_n + \frac{\gamma_n}{1- \gamma_n} \left( x_n - x_{n-1} \right) , \]
which we use to define a new sequence of approximations:
\[ q_n = x_n + \frac{\gamma_n}{1- \gamma_n} \left( x_n - x_{n-1} \right) , \qquad \gamma_n = \frac{x_{n-1} - x_n}{x_{n-2} - x_{n-1}} . \]
Example: Consider \( g(x) = \frac{1}{3}\, e^{-x} , \) for which we solve the equation \( g(x) = x \) using the iteration
\[ x_n = g(x_{n-1}) , \qquad n = 1,2,\ldots . \]
Taking x0 = 1/3, we compute the first ordinary iterates as follows:
\begin{align*} x_1 &= g(x_0 ) = \frac{1}{3}\, e^{-1/3} \approx 0.238844 , \\ x_2 &= g(x_1 ) = \frac{1}{3}\, e^{-x_1} \approx 0.262513 , \\ x_3 &= g(x_2 ) = \frac{1}{3}\, e^{-x_2} \approx 0.256372 . \end{align*}
FindRoot[x == Exp[-x]/3, {x, 0}]
{x -> 0.257628}
x[0] = 1/3.0
0.333333
x[1] = Exp[-x[0]]/3
0.238844
x[2] = Exp[-x[1]]/3
0.262513
x[3] = Exp[-x[2]]/3
0.256372
x[4] = Exp[-x[3]]/3
0.257951
Now, starting with \( x_2 , \) we can begin to compute the accelerated values \( q_n : \)
\[ q_n = x_n + \frac{\gamma_n}{1- \gamma_n} \left( x_n - x_{n-1} \right) , \qquad \mbox{where} \quad \gamma_n = \frac{x_{n-1} - x_n}{x_{n-2} - x_{n-1}} . \]
Proceeding in this fashion, we get
\begin{align*} q_2 &= x_2 + \frac{\gamma_2}{1- \gamma_2} \left( x_2 - x_{1} \right) \approx 0.257771 , \\ q_3 &= x_3 + \frac{\gamma_3}{1- \gamma_3} \left( x_3 - x_{2} \right) \approx 0.257637 . \end{align*}
gamma[2] = (x[1] - x[2])/(x[0] - x[1])
-0.250492
q[2] = x[2] + gamma[2]*(x[2] - x[1])/(1 - gamma[2])
0.257771
gamma[3] = (x[2] - x[3])/(x[1] - x[2])
-0.25943
q[3] = x[3] + gamma[3]*(x[3] - x[2])/(1 - gamma[3])
0.257637
Now we present Version 1 of Aitken extrapolation, which does not take as much advantage of the acceleration as is possible.

Aitken Extrapolation, Version 1
x1 = g(x0)
x2 = g(x1)
for k=1 to n do
   ! guard against division by a vanishing denominator
   if (dabs(x1 - x0) > 1.d-20) then
      gamma = (x2 - x1)/(x1 - x0)
   else
      gamma = 0.0d0
   endif
   q = x2 + gamma*(x2 - x1)/(1 - gamma)
   if (dabs(q - x2) < error) then
      alpha = q
      stop
   endif
   x = g(x2)
   x0 = x1
   x1 = x2
   x2 = x
enddo
Since division by zero---or by a very small number---is possible in the algorithm's computation of γ, we include a conditional test: only if the denominator is greater than \( 10^{-20} \) do we carry out the division.
Example: Repeat the previous example according to version 1.
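A Mathematica transcription of Version 1 (this translation is our own sketch, not the author's code), applied to the earlier example g(x) = e^{-x}/3; the tolerance and the iteration cap are arbitrary, and the starting value 1/3 mirrors the computations above.
(* Aitken extrapolation, Version 1, for the fixed point of g(x) = Exp[-x]/3 *)
g[x_] := Exp[-x]/3;
aitkenV1[start_, tol_, nmax_] :=
 Module[{x0 = start, x1, x2, gamma, q, k = 0},
  x1 = g[x0]; x2 = g[x1]; q = x2;
  While[k < nmax,
   (* guard against division by a vanishing denominator *)
   gamma = If[Abs[x1 - x0] > 10^-20, (x2 - x1)/(x1 - x0), 0];
   q = x2 + gamma (x2 - x1)/(1 - gamma);
   If[Abs[q - x2] < tol, Break[]];
   x0 = x1; x1 = x2; x2 = g[x2]; k++];
  q]
aitkenV1[1/3., 10^-8, 100]   (* approaches the fixed point near 0.2576 *)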

 

III. Steffensen's Acceleration


Johan Steffensen.

Johan Frederik Steffensen (1873--1961) was a Danish mathematician, statistician, and actuary who did research in the fields of calculus of finite differences and interpolation. He was professor of actuarial science at the University of Copenhagen from 1923 to 1943. Steffensen's inequality and Steffensen's iterative numerical method are named after him.

When Aitken's process is combined with a fixed point iteration, such as the one arising in Newton's method, the result is called Steffensen's acceleration. Starting with p0, two steps of Newton's method are used to compute \( p_1 = p_0 - \frac{f(p_0 )}{f' (p_0 )} \) and \( p_2 = p_1 - \frac{f(p_1 )}{f' (p_1 )} , \) then Aitken's process is used to compute \( q_0 = p_0 -  \frac{\left( \Delta p_0 \right)^2}{\Delta^2 p_0} . \) To continue the iteration, set \( p_0 = q_0 \) and repeat the previous steps. For a general fixed-point function g, this means that we have the iteration:

\[ p_0 , \qquad p_1 = g(p_0 ), \qquad p_2 = g(p_1 ). \]
Then we compute
\[ q_0 = p_0 -  \frac{\left( \Delta p_0 \right)^2}{\Delta^2 p_0} = p_0 - \frac{\left( p_1 - p_0 \right)^2}{p_2 - 2p_1 +p_0} . \]
At this point, we ``restart'' the fixed point iteration by setting \( p_3 = q_0 , \) that is,
\[ p_3 = q_0 , \qquad p_4 = g(p_3 ), \qquad p_5 = g(p_4 ). \]
and compute
\[ q_3 = p_3 -  \frac{\left( \Delta p_3 \right)^2}{\Delta^2 p_3} = p_3 - \frac{\left( p_4 - p_3 \right)^2}{p_5 - 2p_4 +p_3} . \]
If at some point \( \Delta^2 p_n =0 \) (which appears in the denominator), then we stop and select the current value of \( p_{n+2} \) as our approximate answer.

Steffensen's acceleration is used to quickly find a solution of the fixed-point equation x = g(x) given an initial approximation p0. It is assumed that both g(x) and its derivative are continuous, \( | g' (x) | < 1, \) and that ordinary fixed-point iteration converges slowly (linearly) to p.

Now we present the pseudocode of the algorithm that provides faster convergence. Note that we check again for division by small numbers before computing γ.

Steffensen's acceleration
for k=1 to n do
   x1 = g(x0)
   x2 = g(x1)
   ! guard against division by a vanishing denominator
   if (dabs(x1 - x0) > 1.d-20) then
      gamma = (x2 - x1)/(x1 - x0)
   else
      gamma = 0.0d0
   endif
   x0 = x2 + gamma*(x2 - x1)/(1 - gamma)
   if (dabs(x0 - x2) < error) then
      alpha = x0
      stop
   endif
enddo
Example: Repeat the previous example according to version 2.
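A Mathematica transcription of Steffensen's loop (again our own sketch rather than the author's code), applied to the same test function g(x) = e^{-x}/3 with an arbitrary tolerance and iteration cap.
(* Steffensen's acceleration: each pass takes two fixed-point steps and restarts from the extrapolated value *)
g[x_] := Exp[-x]/3;
steffensen[start_, tol_, nmax_] :=
 Module[{x0 = start, x1, x2, gamma, k = 0},
  While[k < nmax,
   x1 = g[x0]; x2 = g[x1];
   gamma = If[Abs[x1 - x0] > 10^-20, (x2 - x1)/(x1 - x0), 0];
   x0 = x2 + gamma (x2 - x1)/(1 - gamma);
   If[Abs[x0 - x2] < tol, Break[]];
   k++];
  x0]
steffensen[1/3., 10^-8, 100]   (* approaches the fixed point near 0.2576 *)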

 

V. Wegstein's Method


To solve the fixed point problem \( g(x) =x , \) one can use the following algorithm, presented in
J.H. Wegstein, Accelerating convergence of iterative processes, Communications of the Association for Computing Machinery (ACM), 1, 9--13, 1958:
\[ x_{k+1} = \frac{x_{k-1} g(x_k ) - x_k g(x_{k-1})}{g(x_k ) + x_{k-1} -x_k - g(x_{k-1})} , \quad k=1,2,\ldots . \]
To some extent, Wegstein's method is a version of a predictor-corrector method because it can be broken into two stages:
\[ \begin{split} p^{(n+1)} = g \left( x^{(n)} \right) , \qquad x^{(n+1)} = q\, x^{(n)} + \left( 1-q \right) p^{(n+1)} , \\ q= \frac{b}{b-1} , \qquad b= \frac{x^{(n)} - p^{(n+1)}}{x^{(n-1)} - x^{(n)}} , \end{split} \]
for \( n=1,2,\ldots . \) Wegstein (1958) found that the iteration converges differently depending on the value of q:
  • q < 0 : convergence is monotone
  • 0 < q < 0.5 : convergence is oscillatory
  • 0.5 < q < 1 : convergence
  • q > 1 : divergence
Wegstein's method depends on the two first guesses x0 and x1, and a poor start may cause the process to fail, converge to another root, or, at best, require an unnecessary number of extra iterations.
Example: Repeat the previous example according to Wegstein's method.
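A sketch of Wegstein's method in Mathematica, implementing the update formula displayed at the beginning of this subsection; the test function g(x) = e^{-x}/3, the two starting guesses, and the tolerance are illustrative choices.
(* Wegstein's method for x = g(x), started from two initial guesses *)
g[x_] := Exp[-x]/3;
wegstein[xa_, xb_, tol_, nmax_] :=
 Module[{xold = xa, x = xb, xnew, k = 0},
  While[k < nmax,
   xnew = (xold g[x] - x g[xold])/(g[x] + xold - x - g[xold]);
   If[Abs[xnew - x] < tol, x = xnew; Break[]];
   xold = x; x = xnew; k++];
  x]
wegstein[0., 1., 10^-10, 100]   (* approaches the fixed point near 0.2576 *)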

 

 
