MATH661

Lecture 28: Linear Operator Splitting

A first example of problem-specific algorithms is afforded by consideration of the steady-state diffusion problem

- \nabla^{2} u = - Δ u = f,

(1)

where $u (𝒙), 𝒙 \in Ω \subseteq ℝ^{d}$ can be understood to denote the temperature in an infinitesimal volume positioned at $𝒙$ in $d$ -dimensional space, and $f$ is a local rate of heat generation. The above arises from setting the time derivative to zero in $\partial_{t} u - Δ u = f$ . Though often stated in this thermal language, the Poisson equation (1) is generally valid for unresolved transport by Brownian motion. The mathematical concept of an “infinitesimal volume” is interpreted as setting some length scale $ℓ$ much smaller than the length scale $L$ characterizing the size of the domain $Ω$ . As an example, for heat conduction in a column of water of length $L = 1$ m the length scale of a quasi-infinitesimal volume can be considered as, say, $ℓ = 1 μ$ m. There is no physical significance to the mathematical limit process $ℓ \to 0$ due to the discrete structure of matter, and for all practical purposes setting $ℓ = 1 μ$ m is an acceptable threshold to separate the phenomenon of interest from behavior that can be neglected. In this case the phenomenon of interest is the average transport of thermal energy in volumes of size greater than $𝒪 (ℓ^{d})$ . The detailed Brownian motion of the water molecules that occurs on length scales $s \approx 1 nm ≪ ℓ$ can be neglected and is said to be unresolved. The only observable effect of this motion is that temperature gradients lead to a heat flux as described by $𝒇 (u) = - α \nabla u$ (Fourier's law). The same equation (1) arises in epidemiology when $ℓ = 10$ m, an average separation between an infected and susceptible individual, $L = 10 km$ , the size of a large city, and $u$ is reinterpreted as the percentage of infected individuals in the population. Again, the detailed Brownian steps of size $s \approx 10 cm ≪ ℓ$ taken by individuals can be neglected.

1. Poisson equation discretization

Understanding the underlying unresolved Brownian motion is useful in constructing numerical solutions. For $f = 0$ , (1) becomes $\nabla^{2} u = 0,$ which states that there is no net heat flux in an infinitesimal volume, $\nabla \cdot (α \nabla u) = 0$ , colloquially: “what flows in on one side goes out on the other”. A function that satisfies Laplace's equation $\nabla^{2} u = 0$ is said to be harmonic. For $d = 1$ , the ordinary differential equation $\partial_{x}^{2} u = 0$ is obtained with solution $u (x) = a + b x$ , and the boundary conditions $u_{0}$ at $x = 0$ and $u_{1}$ at $x = 1$ give $u (x) = u_{0} + (u_{1} - u_{0}) x$ . A temperature difference $u_{0} \neq u_{1}$ at the boundaries induces diffusive fluxes that lead to the mean value $u (1 / 2) = (u_{0} + u_{1}) / 2$ at the midpoint. An analogous statement is made for $d > 1$ starting from Green's formula on a ball $B$ with spherical boundary $S$ of radius $R$

\int_{B} (u Δ v - v Δ u) d ω = \int_{S = \partial B} (u \frac{\partial v}{\partial n} - v \frac{\partial u}{\partial n}) d σ,

for scalar functions $u, v$ . For $d = 3$ and $v = 1 / r$ the mean value theorem

u (𝒙) = \frac{1}{4 π R^{2}} \int_{S = \partial B} u (𝒚) d σ (𝒚),

(2)

is obtained, which states that the value of a harmonic function is the average of the values on a surrounding sphere. For $d = 2$ the analogous statement is

u (𝒂) = \frac{1}{2 π} \int_{0}^{2 π} u (𝒂 + r \cos θ 𝒆_{x} + r \sin θ 𝒆_{y}) d θ .

(3)

Midpoint quadrature of (3) over four subintervals gives

u_{0} = \frac{u_{1} + u_{2} + u_{3} + u_{4}}{4} .

(4)

Apply the above on a grid covering some arbitrary domain $Ω$ with boundary $Σ = \partial Ω$ to obtain

u_{i, j} = \frac{1}{4} (u_{i, j - 1} + u_{i - 1, j} + u_{i + 1, j} + u_{i, j + 1}) .

(5)
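As a numerical illustration of (3) and (4), the following sketch averages a harmonic function over a circle by midpoint quadrature. The test function `uharm`, the helper name `circle_mean`, and the chosen center and radius are assumptions made for this example only.

```julia
# Circle average of u around (ax, ay) by midpoint quadrature with N subintervals.
function circle_mean(u, ax, ay, r, N)
    dθ = 2π / N
    s = 0.0
    for k in 1:N
        θ = (k - 0.5) * dθ                     # midpoint of subinterval k
        s += u(ax + r * cos(θ), ay + r * sin(θ))
    end
    return s / N                               # (1/2π) Σ u dθ = mean of the samples
end

uharm(x, y) = exp(x) * cos(y)                  # harmonic: u_xx + u_yy = 0

center = uharm(0.3, 0.2)
mean4  = circle_mean(uharm, 0.3, 0.2, 0.1, 4)  # four-point average as in (4)
mean64 = circle_mean(uharm, 0.3, 0.2, 0.1, 64) # finer quadrature of (3)
println(abs(mean4 - center), "  ", abs(mean64 - center))
```

The four-point average is not exact for a general harmonic function, but its deviation from the center value shrinks rapidly with the radius, which is what justifies (5) on a fine grid.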

Complications arise for general boundaries $Σ$ , but for a square domain $Ω = [0, 1] \times [0, 1]$ grid points $(x_{i} = i h, y_{j} = j h)$ align with the boundary, $h = 1 / (n + 1)$ . Instead of two indices, one for each spatial direction, organize the grid values $u$ through a single index $k = (j - 1) n + i$ , with $u_{k}$ denoting the value at the interior points $(x_{i}, y_{j})$ . The vector of interior grid values $𝒖 = [\begin{array}{lll} u_{1} & \dots & u_{m} \end{array}]$ has $m = n^{2}$ components, and the mean value theorem leads to the linear system


𝑨 𝒖 = 𝒃,

(6)

where $𝑨$ has a regular sparsity pattern induced by the uniform spacing of the grid

𝑨 = [\begin{array}{llllllllll} 4 & - 1 & 0 & \dots & 0 & - 1 & 0 & \dots & \dots & 0 \\ - 1 & 4 & - 1 & 0 & \dots & 0 & - 1 & 0 & \dots & 0 \\ ⋱ & ⋱ & ⋱ & ⋱ \end{array}] .

The vector $𝒃$ contains the boundary values, for example $b_{1} = u_{1, 0} + u_{0, 1}$ .
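The sparsity pattern of $𝑨$ can be inspected on a small grid. The sketch below assembles the matrix through Kronecker products of the one-dimensional second-difference matrix, a standard equivalent construction that is assumed here rather than taken from the lecture; the function name `laplace2d` is hypothetical.

```julia
using LinearAlgebra, SparseArrays

# Assemble the m×m matrix of (6), m = n^2, with ordering k = (j-1)n + i.
function laplace2d(n)
    T  = spdiagm(-1 => -ones(n - 1), 0 => 2 * ones(n), 1 => -ones(n - 1))
    Id = spdiagm(0 => ones(n))
    # kron(Id, T) couples neighbors in i; kron(T, Id) couples neighbors in j
    return kron(Id, T) + kron(T, Id)
end

n = 4
A = laplace2d(n)
println(Matrix(A)[1:2, 1:6])      # first two rows, to compare with the display of A
println(nnz(A), " nonzeros out of ", n^4, " entries")
```

Explicit assembly is only for inspection; as noted later in the text, the iterations themselves never need the stored matrix.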

Figure 1. Left: Mean value theorem leads to $u_{0} = (u_{1} + u_{2} + u_{3} + u_{4}) / 4$ . Middle: Five-point finite difference stencil for Laplace operator. Right: Structure of $𝑨$ matrix resulting from discretization of Laplace operator.

The system (6) was obtained by discretization of the mean-value integral (3). The same linear system is also obtained by discretization of the differential equation

- \nabla^{2} u = - u_{x x} - u_{y y} = 0,

where indices denote differentiation. The minus sign arises from compatibility with the unsteady form of the heat equation $\partial_{t} u - \nabla^{2} u = f$ . A centered finite difference approximation of the derivatives on the uniform grid leads to

u_{x x} (x_{i}, y_{j}) ≅ \frac{u_{i - 1, j} - 2 u_{i, j} + u_{i + 1, j}}{h^{2}}, u_{y y} (x_{i}, y_{j}) ≅ \frac{u_{i, j - 1} - 2 u_{i, j} + u_{i, j + 1}}{h^{2}},

and (6) is recovered. For the Poisson equation $f \neq 0$ the right hand side changes to

𝑨 𝒖 = 𝒃 + h^{2} 𝒇 = 𝒄

(7)

with $f_{k} = f (x_{i}, y_{j})$ . It is often the case that the same discrete system arises from both the differential and the integral formulation of a conservation law on a uniform grid.
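The centered difference used above is second-order accurate, which can be checked by halving $h$ and observing the error drop by a factor close to four. The following is a minimal sketch with $u = \sin x$ as an assumed test function and hypothetical helper names `d2` and `err`.

```julia
# Centered approximation of u''(x) with spacing h.
d2(u, x, h) = (u(x - h) - 2 * u(x) + u(x + h)) / h^2

# For u = sin the exact second derivative is -sin.
err(h) = abs(d2(sin, 1.0, h) + sin(1.0))

ratio = err(0.1) / err(0.05)      # close to 4 for a second-order approximation
println(ratio)
```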

2. Matrix splitting iteration

The above discussion of the underlying physics of the Poisson equation productively guides construction of numerical solution procedures. Solving the linear system (7) by general factorizations such as $𝑨 = 𝑳 𝑼$ or $𝑨 = 𝑸 𝑹$ is costly in terms of memory usage since the sparsity pattern is lost. For the uniform grid and square domain considered above the matrix $𝑨$ need not be explicitly stored at all since $a_{i, i} = 4$ , and $a_{i, j} = - 1$ when $| i - j | = 1$ or $| i - j | = n$ (apart from the $| i - j | = 1$ entries that would couple the end of one grid row to the start of the next, which are zero). The discrete mean value theorem (5) suggests that some approximation $𝒖^{(l)}$ can be improved by the iteration

u_{i, j}^{(l + 1)} = \frac{1}{4} (u_{i, j - 1}^{(l)} + u_{i - 1, j}^{(l)} + u_{i + 1, j}^{(l)} + u_{i, j + 1}^{(l)}) .

(8)

The above is known as Jacobi iteration, and can be stated in matrix form by expressing $𝑨$ as

𝑨 = 𝑳 + 𝑫 + 𝑼,

(9)

with $𝑳, 𝑼$ containing non-zero components of $𝑨$ below, above the diagonal, and $𝑫$ containing the diagonal of $𝑨$ . In contrast to the multiplicative decompositions considered up to now, the $Q R$ , $L U$ , SVD, eigen or Schur decompositions, the decomposition (9) is now additive. Note that in (9) $𝑳, 𝑼$ are strictly lower, upper triangular matrices with zeros on the diagonal, in contrast to the notation for the standard $L U$ factorization algorithm. Recall that the utility of matrix multiplication was associated with the representation of linear mapping composition. Additive decompositions such as (9) generally are useful when separating different aspects of a physical process, and are a simple example of operator splitting. For the discrete Poisson system Jacobi iteration can be expressed as

𝑨 𝒖 = 𝒄 \Rightarrow (𝑳 + 𝑫 + 𝑼) 𝒖 = 𝒄 \Rightarrow 𝒖^{(l + 1)} = 𝑫^{- 1} (𝒄 - 𝑳 𝒖^{(l)} - 𝑼 𝒖^{(l)}) .

(10)
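The componentwise form (8) can be sketched directly on a two-dimensional array. The layout below, an $(n + 2) \times (n + 2)$ array whose outer rows and columns hold the boundary values, and the names `jacobi_sweep` and `jacobi_demo` are assumptions made for illustration. The boundary data $u = x + y$ is chosen because it is harmonic, so the exact discrete solution is known.

```julia
# One Jacobi sweep (8) for the Laplace equation on an (n+2)×(n+2) array
# whose outer rows and columns hold the fixed boundary values.
function jacobi_sweep(u)
    unew = copy(u)                 # separate storage: only old values are read
    for j in 2:(size(u, 2) - 1), i in 2:(size(u, 1) - 1)
        unew[i, j] = (u[i, j - 1] + u[i - 1, j] + u[i + 1, j] + u[i, j + 1]) / 4
    end
    return unew
end

# Jacobi iteration from a zero interior guess with boundary data u = x + y.
function jacobi_demo(n, sweeps)
    h = 1 / (n + 1)
    exact = [i * h + j * h for i in 0:(n + 1), j in 0:(n + 1)]
    u = copy(exact)
    u[2:end - 1, 2:end - 1] .= 0.0
    for _ in 1:sweeps
        u = jacobi_sweep(u)
    end
    return maximum(abs.(u - exact))
end

println(jacobi_demo(8, 10), "  ", jacobi_demo(8, 400))
```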

Several variants of the idea can be pursued. The matrix splitting (9) is useful in theoretical convergence analysis, but implementations directly use (8) within loops over the $(i, j)$ indices. Updated values can be immediately utilized leading to either of the following iterations

𝒖^{(l + 1)} = 𝑫^{- 1} (𝒄 - 𝑳 𝒖^{(l + 1)} - 𝑼 𝒖^{(l)}), 𝒖^{(l + 1)} = 𝑫^{- 1} (𝒄 - 𝑳 𝒖^{(l)} - 𝑼 𝒖^{(l + 1)}),

(11)

depending on loop organization. These are known as Gauss-Seidel iterations. Convergence might be accelerated by extrapolation,

𝒖^{(l + 1)} = 𝒖^{(l)} + ω [𝑫^{- 1} (𝒄 - 𝑳 𝒖^{(l)} - 𝑼 𝒖^{(l + 1)}) - 𝒖^{(l)}] = (1 - ω) 𝒖^{(l)} + ω 𝑫^{- 1} (𝒄 - 𝑳 𝒖^{(l)} - 𝑼 𝒖^{(l + 1)}),

(12)

where the step from the current iterate toward the Gauss-Seidel update is scaled by the factor $ω$ . When $ω > 1$ this is known as successive over-relaxation (SOR) and goes further in the Gauss-Seidel direction. Choosing $0 < ω < 1$ leads to successive under-relaxation.
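The effect of the relaxation factor can be observed on the one-dimensional model system $- u_{i - 1} + 2 u_{i} - u_{i + 1} = c_{i}$ . The sketch below uses the forward loop ordering (newly computed values of lower index are reused), and the value $ω = 2 / (1 + \sin π h)$ is the classical optimum quoted for this model problem; both choices, as well as the names `sor!` and `residual_norm`, are assumptions of the example.

```julia
# Forward SOR sweeps for -u[i-1] + 2u[i] - u[i+1] = c[i], u[0] = u[m+1] = 0.
# ω = 1 reduces to Gauss-Seidel.
function sor!(u, c, ω, sweeps)
    m = length(u)
    for _ in 1:sweeps, i in 1:m
        left  = i > 1 ? u[i - 1] : 0.0       # already updated in this sweep
        right = i < m ? u[i + 1] : 0.0       # value from the previous sweep
        gs = (c[i] + left + right) / 2       # Gauss-Seidel value
        u[i] = (1 - ω) * u[i] + ω * gs       # extrapolate by factor ω
    end
    return u
end

# Euclidean norm of the residual c - A u.
function residual_norm(u, c)
    m = length(u)
    r = [c[i] - 2 * u[i] + (i > 1 ? u[i - 1] : 0.0) + (i < m ? u[i + 1] : 0.0)
         for i in 1:m]
    return sqrt(sum(abs2, r))
end

m = 50; h = 1 / (m + 1)
c = [h^2 * π^2 * sin(π * i * h) for i in 1:m]
ω_opt = 2 / (1 + sin(π * h))
r_gs  = residual_norm(sor!(zeros(m), c, 1.0, 200), c)
r_sor = residual_norm(sor!(zeros(m), c, ω_opt, 200), c)
println(r_gs, "  ", r_sor)
```

After the same number of sweeps the over-relaxed residual is smaller by many orders of magnitude than the Gauss-Seidel residual.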

3. Convergence analysis

Turning now from algorithm construction to analysis of its behavior, simplify notation by letting $𝒖_{k}$ denote the current iterate. The previous notation $𝒖^{(l)}$ was convenient since individual components were referenced as in $u_{i, j}^{(l)}$ , but convergence analysis is determined by the properties of the operator splitting and not of the current iterate. Introduce the error $𝜹_{k}$ at iteration $k$ as the difference between the exact solution $𝒖$ and the current iterate $𝒖_{k}$ , $𝜹_{k} = 𝒖 - 𝒖_{k}$ . Also introduce the residual $𝒓_{k} = 𝒄 - 𝑨 𝒖_{k} = 𝑨 𝜹_{k}$ , and the correction to the current iterate $𝒆_{k} = 𝒖_{k + 1} - 𝒖_{k}$

The above methods can be formulated as a residual correction algorithm through the steps:

residual computation, $𝒓_{k} = 𝒄 - 𝑨 𝒖_{k}$
correction computation, $𝒆_{k} = 𝑩 𝒓_{k}$
approximation update, $𝒖_{k + 1} = 𝒖_{k} + 𝒆_{k}$

When $𝑩 = 𝑨^{- 1}$ the exact solution is recovered in one step

𝒆_{k} = 𝑨^{- 1} (𝒄 - 𝑨 𝒖_{k}) = 𝒖 - 𝒖_{k} \Rightarrow 𝒖_{k + 1} = 𝒖_{k} + 𝒆_{k} = 𝒖 .

Iterative methods use some approximation of the (unknown) inverse, $𝑩 ≅ 𝑨^{- 1}$ . Jacobi iteration uses $𝑩 = 𝑫^{- 1}$ since

𝒖_{k + 1} = 𝒖_{k} + 𝑫^{- 1} [𝒄 - (𝑳 + 𝑫 + 𝑼) 𝒖_{k}] = 𝑫^{- 1} [𝒄 - (𝑳 + 𝑼) 𝒖_{k}],

recovering (10). Table 1 shows several common choices for $𝑩$ . Two key aspects govern the choice of the inverse approximant:

Computational efficiency stated as a requirement that each iteration cost either $𝒪 (m)$ or $𝒪 (m \log m)$ operations;
Capturing the essential aspects of $𝑨$ .

Method	$𝑩$	Method	$𝑩$
Jacobi	$𝑫^{- 1}$	Forward Gauss-Seidel	${(𝑫 + 𝑳)}^{- 1}$
Weighted Jacobi	$ω 𝑫^{- 1}$	Backward Gauss-Seidel	${(𝑫 + 𝑼)}^{- 1}$
SOR	$ω {(𝑫 + ω 𝑳)}^{- 1}$	Symmetric Gauss-Seidel	${(𝑫 + 𝑼)}^{- 1} 𝑫 {(𝑫 + 𝑳)}^{- 1}$
Symmetric SOR	$ω (2 - ω) {(𝑫 + ω 𝑼)}^{- 1} 𝑫 {(𝑫 + ω 𝑳)}^{- 1}$	Richardson	$ω 𝑰$

Table 1. Common iterative methods
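For a small matrix the quality of each inverse approximant can be compared through the spectral radius of $𝑰 - 𝑩 𝑨$ . The following sketch does so for three entries of Table 1 applied to the one-dimensional model matrix with stencil $[- 1, 2, - 1]$ ; the matrix size, the relaxation factor and the helper name `spectral_radius` are assumptions of the example.

```julia
using LinearAlgebra

# Spectral radius of the iteration matrix M = I - B*A.
spectral_radius(B, A) = maximum(abs, eigvals(I - B * A))

m = 20; h = 1 / (m + 1)
A = Matrix(SymTridiagonal(fill(2.0, m), fill(-1.0, m - 1)))
D = Diagonal(A)                    # diagonal part of A
L = tril(A, -1)                    # strictly lower triangular part of A

ω = 2 / (1 + sin(π * h))           # commonly quoted SOR factor for this matrix
ρ_jacobi = spectral_radius(inv(D), A)
ρ_gs     = spectral_radius(inv(D + L), A)
ρ_sor    = spectral_radius(ω * inv(D + ω * L), A)
println((ρ_jacobi, ρ_gs, ρ_sor))
```

Dense inverses are formed here only because the matrix is tiny; in an actual iteration the action of $𝑩$ is applied by a diagonal scaling or a triangular solve.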

The iteration converges to the solution if $|| 𝜹_{k} || \to 0$ for increasing $k$ . The error at iteration $k + 1$ is expressed as

𝜹_{k + 1} = 𝒖 - 𝒖_{k + 1} = 𝒖 - (𝒖_{k} + 𝑩 𝒓_{k}) = 𝜹_{k} - 𝑩 𝑨 (𝒖 - 𝒖_{k}) = (𝑰 - 𝑩 𝑨) 𝜹_{k} = {(𝑰 - 𝑩 𝑨)}^{k + 1} 𝜹_{0} .

(13)

The repeated matrix multiplication indicates that the eigenstructure of the iteration matrix $𝑴 = 𝑰 - 𝑩 𝑨$ determines iteration convergence. Indeed the above is simply power iteration for $𝑴$ and can be expected to converge as

𝜹_{k} \to μ_{1}^{k} c_{1} 𝒒_{1},

with $(μ_{1}, 𝒒_{1})$ the eigenpair that corresponds to the largest eigenvalue in absolute value, $𝑴 𝒒_{1} = μ_{1} 𝒒_{1}$ . The magnitude $| μ_{1} |$ is known as the spectral radius of $𝑴$ , denoted by $ρ (𝑴)$ . The above iterations exhibit linear order of convergence when $ρ (𝑴) < 1$ . The rate of convergence $s_{k}$ at iteration $k$ is estimated by the Rayleigh quotient

s_{k} = \frac{𝜹_{k}^{T} 𝑴 𝜹_{k}}{𝜹_{k}^{T} 𝜹_{k}} = \frac{𝜹_{k}^{T} 𝜹_{k + 1}}{𝜹_{k}^{T} 𝜹_{k}},

and is monitored in implementations of iterative methods. It is determined by the eigenvalues $μ = 1 - λ$ of $𝑴$ , where $λ$ denotes an eigenvalue of $𝑩 𝑨$ .
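The behavior of the Rayleigh quotient can be observed by propagating an error vector through $𝑴$ directly. The sketch below assumes the one-dimensional model matrix with stencil $[- 1, 2, - 1]$ and Jacobi iteration, $𝑴 = 𝑰 - 𝑨 / 2$ , and starts from an error made of two discrete sine modes; the name `rayleigh_history` and the initial error are hypothetical choices.

```julia
using LinearAlgebra

# Jacobi error propagation δ_{k+1} = M δ_k for the 1D model matrix, M = I - A/2.
function rayleigh_history(m, iters)
    h = 1 / (m + 1)
    A = Matrix(SymTridiagonal(fill(2.0, m), fill(-1.0, m - 1)))
    M = I - A / 2
    # initial error: a mix of the first and third discrete sine modes
    δ = [sin(π * j * h) + 0.5 * sin(3π * j * h) for j in 1:m]
    s = zeros(iters)
    for k in 1:iters
        δnext = M * δ
        s[k] = dot(δ, δnext) / dot(δ, δ)     # Rayleigh quotient s_k
        δ = δnext
    end
    return s
end

s = rayleigh_history(20, 100)
println(s[1], "  ", s[end], "  ", cos(π / 21))
```

As the faster-decaying mode dies out, $s_{k}$ settles at the dominant eigenvalue of $𝑴$ .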

Determining the eigenstructure of $𝑩 𝑨$ is difficult for arbitrary matrices $𝑨$ , but can be carried out when $𝑨$ has special structure induced by known physical phenomena. The relation between analytical and numerical formulations plays an essential role in convergence analysis. The discretized diffusion equation (5) leads to a symmetric matrix $𝑨$ , $𝑨 = 𝑨^{T}$ due to two aspects:

the chosen discretization is symmetric using centered finite differences;
the operator itself exhibits symmetry.

Insight into the above two aspects is most readily gained from the one-dimensional case $- u_{x x} = f$ with homogeneous Dirichlet boundary conditions $u (0) = u (1) = 0$ . The linear system $𝑨 𝒖 = 𝒇$ obtained from the centered finite difference discretization

- u_{i - 1} + 2 u_{i} - u_{i + 1} = h^{2} f_{i}, i = 1, \dots, m, h = 1 / (m + 1), u_{0} = u_{m + 1} = 0,

has a symmetric tridiagonal system matrix $𝑨 = diag ([\begin{array}{lll} - 1 & 2 & - 1 \end{array}])$ . The $𝑨^{T} = 𝑨$ symmetry can be expressed through scalar products in a way that generalizes to differential operators. Recall that a real-valued scalar product $(𝒖, 𝒗)$ must satisfy symmetry $(𝒗, 𝒖) = (𝒖, 𝒗)$ . For $𝒖, 𝒗 \in ℝ^{m}$ the standard inner product $(𝒖, 𝒗) = 𝒖^{T} 𝒗$ has this property. Consider the action of the operator $𝑨 \in ℝ^{m \times m}$ on the two terms. If $(𝑨 𝒖, 𝒗) = (𝒖, 𝑨 𝒗)$ the operator $𝑨$ is said to be symmetric. For the inner product

(𝑨 𝒖, 𝒗) = {(𝑨 𝒖)}^{T} 𝒗 = 𝒖^{T} 𝑨^{T} 𝒗, (𝒖, 𝑨 𝒗) = 𝒖^{T} 𝑨 𝒗,

and the two expressions are equal if $𝑨 = 𝑨^{T}$ . The same approach extends to the $d = 1$ diffusion operator $L = - \partial_{x x}$ using the scalar product

(f, g) = \int_{0}^{1} f (x) g (x) d x .

Applying integration by parts

(\partial_{x x} f, g) = \int_{0}^{1} f^{''} (x) g (x) d x = {[f^{'} g]}_{x = 0}^{x = 1} - \int_{0}^{1} f^{'} (x) g^{'} (x) d x = {[f^{'} g]}_{x = 0}^{x = 1} - {[f g^{'}]}_{x = 0}^{x = 1} + \int_{0}^{1} f (x) g^{''} (x) d x = b + (f, \partial_{x x} g) .

For homogeneous boundary conditions $f (0) = f (1) = g (0) = g (1) = 0$ , the boundary terms collected in $b$ vanish and the symmetry condition is satisfied. Note that symmetry involves both the operator and the boundary conditions of the problem. For $d = 2$ the scalar product

(u, v) = \int_{Ω} u (x, y) v (x, y) d x d y,

is defined on the unit square $Ω = [0, 1] \times [0, 1]$ , and for homogeneous Dirichlet boundary conditions two applications of Green's formula lead to $(\nabla^{2} u, v) = (u, \nabla^{2} v)$ , so the Laplace operator is symmetric.

For $d = 1$ , $x_{j} = j h$ , $h = 1 / (m + 1)$ , the eigenvalues $ν_{κ}$ of $𝑨 = diag ([\begin{array}{lll} - 1 & 2 & - 1 \end{array}])$ are inferred from those of the $- \partial_{x x}$ operator with homogeneous boundary conditions at $x = 0$ , $x = 1$

- \partial_{x x} \sin (κ π x) = {(κ π)}^{2} \sin (κ π x), κ \in ℕ .

Positing that an eigenvector $𝒒$ of $𝑨$ is the discretization of the continuum eigenfunction leads to

q_{j} = \sin (κ π x_{j}) = \sin (κ π j h) = \sin (\frac{j κ π}{m + 1}),

a hypothesis that is verified by the calculation of component $j$ of $𝑨 𝒒$

{(𝑨 𝒒)}_{j} = - \sin [\frac{(j - 1) κ π}{m + 1}] + 2 \sin [\frac{j κ π}{m + 1}] - \sin [\frac{(j + 1) κ π}{m + 1}] = 2 [1 - \cos (\frac{κ π}{m + 1})] \sin (\frac{j κ π}{m + 1}),

thereby obtaining the eigenvalue

ν_{κ} = 2 [1 - \cos (\frac{κ π}{m + 1})] = 4 \sin^{2} [\frac{κ π h}{2}],

which recovers the analytical eigenvalue in the $h \to 0$ limit

{lim}_{h \to 0} \frac{ν_{κ}}{h^{2}} = {(κ π)}^{2} .
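The eigenpair can also be verified numerically. The following minimal sketch forms the tridiagonal matrix with stencil $[- 1, 2, - 1]$ , samples the continuum eigenfunction, and evaluates both the residual of the eigenvalue relation and the scaled eigenvalue; the name `eigen_check` and the values $m = 200$ , $κ = 3$ are assumptions.

```julia
using LinearAlgebra

# Residual of the eigenpair (ν, q) for the m×m matrix tridiag(-1, 2, -1).
function eigen_check(m, κ)
    h = 1 / (m + 1)
    A = SymTridiagonal(fill(2.0, m), fill(-1.0, m - 1))
    q = [sin(κ * π * j * h) for j in 1:m]     # sampled continuum eigenfunction
    ν = 4 * sin(κ * π * h / 2)^2              # predicted eigenvalue
    return norm(A * q - ν * q), ν / h^2
end

res, scaled = eigen_check(200, 3)
println(res, "  ", scaled, "  ", (3π)^2)
```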

For Jacobi $𝑩 = 𝑫^{- 1}$ , so the eigenvalues of $𝑩 𝑨$ are

λ_{κ} = 2 \sin^{2} [\frac{κ π h}{2}], κ = 1, 2, \dots, m .

The eigenvalues of $𝑴 = 𝑰 - 𝑩 𝑨$ are therefore

μ_{κ} = 1 - 2 \sin^{2} [\frac{κ π h}{2}] = \cos (κ π h)

Substituting $h = 1 / (m + 1)$ , the largest eigenvalue is obtained for $κ = 1$ (the eigenvalue for $κ = m$ has the same magnitude and opposite sign)

μ_{max} = μ_{1} = \cos (\frac{π}{m + 1}) .

For large $m$ , $μ_{1} ⪅ 1$ , and slow convergence is expected as verified in the numerical experiment from Fig. 2.
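The consequence of $μ_{1}$ being close to one can be quantified: since the error contracts by roughly $μ_{1}$ per iteration, reducing it by a factor $ε$ takes about $\ln ε / \ln μ_{1}$ iterations. For $m = 100$ , a reduction by a factor of $e$ alone requires about $2 / (π h)^{2} \approx 2067$ iterations. A short sketch of this estimate, with the hypothetical helper name `jacobi_iterations`:

```julia
# Jacobi iterations needed to reduce the error by a factor tol
# when the contraction factor is μ_1 = cos(π/(m+1)).
jacobi_iterations(m, tol) = ceil(Int, log(tol) / log(cos(π / (m + 1))))

for m in (10, 100, 1000)
    println("m = ", m, ":  ", jacobi_iterations(m, 1e-6), " iterations")
end
```

The count grows like $m^{2}$ , so each tenfold grid refinement costs about a hundred times more iterations.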


Figure 2. Convergence of Jacobi iteration. Blue: exact solution. Orange, green, red: iterates after 1000, 2000, 3000 iterations.

∴	using PyPlot   # plotting package assumed from the clf/plot/xlabel calls below

∴	function ResidualCorrection(A, B, c, u, maxIter)
	    for k = 1:maxIter
	        r = c - A(u)    # residual
	        e = B(r)        # correction
	        u = u + e       # approximation update
	    end
	    return u
	end;

∴	function ALaplace1D(u)
	    m = length(u); v = zeros(m)
	    v[1] = 2u[1] - u[2]
	    for k = 2:m-1
	        v[k] = -u[k-1] + 2u[k] - u[k+1]
	    end
	    v[m] = -u[m-1] + 2u[m]
	    return v
	end;

∴	function BJacobiLaplace1D(r)
	    beta = 0.5    # Jacobi: B = inverse of the diagonal, D = 2I
	    return beta*r
	end;

∴	m=100; h=1.0/(m+1); x=(1:m)*h; c=(pi*h)^2*sin.(pi*x);

∴	u0=zeros(m); uex=sin.(pi*x);

∴	u1=ResidualCorrection(ALaplace1D,BJacobiLaplace1D,c,u0,1000);

∴	u2=ResidualCorrection(ALaplace1D,BJacobiLaplace1D,c,u0,2000);

∴	u3=ResidualCorrection(ALaplace1D,BJacobiLaplace1D,c,u0,3000);

∴	clf(); plot(x,uex,x,u1,x,u2,x,u3);

∴	grid("on"); xlabel(L"$x$"); ylabel(L"$u(x)$");

∴	title(L"Jacobi iteration convergence for $-u_{xx}=\pi^2 \sin(\pi x)$");

Jacobi iteration and its variants have limited practical utility by comparison to other iterative procedures due to slow convergence, but the concept of operator splitting has wide applicability. A more consequential example of operator splitting is to consider the advection-diffusion equation

q_{t} + 𝒗 \cdot \nabla q = α Δ q,

which can be interpreted as stating that the time evolution of $q$ is due to the effect of a diffusion operator $𝒜 = α Δ$ and an advection operator $ℬ = - 𝒗 \cdot \nabla$

q_{t} = (𝒜 + ℬ) q .

Suppose both operators are discretized leading to matrices $𝑨, 𝑩$ and the discrete system

𝒒_{t} = (𝑨 + 𝑩) 𝒒,

with initial condition $𝒒 (𝒙, t = 0) = 𝒒_{0}$ . Advancing the solution by a time step $Δ t$ can be written as

𝒒 (t + Δ t) = e^{Δ t (𝑨 + 𝑩)} 𝒒 (t),

and can be separated, exactly when $𝑨$ and $𝑩$ commute and otherwise with an error of $𝒪 (Δ t^{2})$ per step, into two stages

𝒒^{(l + 1)} = e^{Δ t 𝑨} e^{Δ t 𝑩} 𝒒^{(l)} .

The quantity $\tilde{𝒒} = e^{Δ t 𝑩} 𝒒^{(l)}$ captures advection effects and $𝒒^{(l + 1)} = e^{Δ t 𝑨} \tilde{𝒒}$ is the diffusion correction. Since advection and diffusion are markedly different physical effects describing resolved versus unresolved transport, it can be expected that matrices $𝑨, 𝑩$ have different properties that require specific solution procedures.
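The separation into two stages is exact only when $𝑨$ and $𝑩$ commute; otherwise a single step carries an error proportional to $Δ t^{2}$ . The sketch below checks this on small centered-difference matrices with homogeneous Dirichlet boundaries, for which the two matrices do not commute. The discretization, the parameter values and the name `splitting_error` are assumptions made for illustration.

```julia
using LinearAlgebra

# Difference between the exact propagator and the split propagator for one step.
function splitting_error(m, α, v, Δt)
    h = 1 / (m + 1)
    # diffusion: α times the centered second difference
    A = (α / h^2) * diagm(-1 => ones(m - 1), 0 => fill(-2.0, m), 1 => ones(m - 1))
    # advection: -v times the centered first difference
    B = (-v / (2h)) * diagm(-1 => -ones(m - 1), 1 => ones(m - 1))
    return norm(exp(Δt * (A + B)) - exp(Δt * A) * exp(Δt * B))
end

e1 = splitting_error(8, 1.0, 1.0, 1e-5)
e2 = splitting_error(8, 1.0, 1.0, 5e-6)
println(e1, "  ", e2, "  ratio = ", e1 / e2)
```

Halving the time step reduces the one-step difference by a factor close to four, consistent with a leading error term of order $Δ t^{2}$ .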

By contrast, the Jacobi iteration splitting $𝑨 = 𝑳 + 𝑫 + 𝑼$ does not separate physical effects and simply is suggested by the sparsity of $𝑨$ and computational efficiency per iteration. For example, the forward Gauss-Seidel iteration with $𝑩 = {(𝑫 + 𝑳)}^{- 1}$ leads to

𝒖_{k + 1} = 𝒖_{k} + {(𝑫 + 𝑳)}^{- 1} [𝒄 - (𝑳 + 𝑫 + 𝑼) 𝒖_{k}] = {(𝑫 + 𝑳)}^{- 1} (𝒄 - 𝑼 𝒖_{k}) \Rightarrow (𝑫 + 𝑳) 𝒖_{k + 1} = 𝒄 - 𝑼 𝒖_{k} .

(14)

The matrix $𝑫 + 𝑳$ is (non-strictly) lower-triangular and (14) is easily solved by forward substitution. The implementation is very simple to express while preserving two-dimensional indexing.

Algorithm (Componentwise forward Gauss-Seidel)

$for i = 1 : m_{x}$

$for j = 1 : m_{y}$

$u (i, j) = [c (i, j) + u (i + 1, j) + u (i - 1, j) + u (i, j + 1) + u (i, j - 1)] / 4$

$end$

$end$

In the above implementation a single memory space is used for $𝒖$ with new values taking the place of the old, leading to just four floating point additions and one multiplication per grid point. Since the algorithm essentially takes the current average of neighboring values, it is also known as a relaxation method, smoothing out spatial variations in $𝒖$ .
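A runnable version of the componentwise algorithm can be sketched as follows. The array layout is an assumption: $𝒖$ is stored with one extra row and column on each side holding the boundary values, so that $c$ carries only the $h^{2} f$ contribution and no index tests are needed in the loop. The name `gauss_seidel!` and the test problem with boundary data $u = x + y$ are likewise hypothetical.

```julia
# In-place forward Gauss-Seidel sweeps for 4u[i,j] - (four neighbors) = c[i,j].
# u is (mx+2)×(my+2) with boundary values in its outer rows and columns;
# c is indexed like u and only its interior entries are used.
function gauss_seidel!(u, c, sweeps)
    mx, my = size(u, 1) - 2, size(u, 2) - 2
    for _ in 1:sweeps, i in 2:(mx + 1), j in 2:(my + 1)
        u[i, j] = (c[i, j] + u[i + 1, j] + u[i - 1, j] + u[i, j + 1] + u[i, j - 1]) / 4
    end
    return u
end

# Laplace problem (c = 0) with boundary data u = x + y on the unit square.
n = 8; h = 1 / (n + 1)
exact = [i * h + j * h for i in 0:(n + 1), j in 0:(n + 1)]
u = copy(exact); u[2:end - 1, 2:end - 1] .= 0.0
gauss_seidel!(u, zeros(n + 2, n + 2), 200)
println(maximum(abs.(u - exact)))
```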

Though such a simple implementation is desirable, the non-physical splitting $𝑨 = 𝑳 + 𝑫 + 𝑼$ and the associated slow convergence usually outweigh the ease of coding and suggest looking for alternative approaches. The only scenarios where such simply implemented iterations find practical use are parallel execution and preliminary modification of the system prior to use of some other algorithm.