Linear System Solution

Synopsis. The traditional problem of linear algebra is to find the scaling coefficients of a linear combination that exactly represents a given vector. Methods with a long history of hand computation have been developed for this purpose, and they still offer insight into the properties of linear mappings and their associated matrices.

1. Orthogonal projectors and linear systems

Consider the linear system 𝑨𝒙=𝒃 with 𝑨∈ℝ^{m×n}, 𝒃∈ℝ^m given. The scaling coefficients 𝒙∈ℝ^n are sought, and are said to be a solution of the linear system when the equation 𝑨𝒙=𝒃 is satisfied. Orthogonal projectors and knowledge of the four fundamental matrix subspaces allow us to succinctly state whether there exist no solutions, a single solution, or an infinite number of solutions: if 𝒃∉C(𝑨) there is no solution; if 𝒃∈C(𝑨) and N(𝑨)={𝟎} there is a single solution; if 𝒃∈C(𝑨) and N(𝑨)≠{𝟎} there is an infinite number of solutions.

If a solution exists, it can be found from the QR factorization 𝑨=𝑸𝑹 by back-substitution solution of 𝑹𝒙=𝑸^T𝒃. If multiple solutions exist, an orthonormal basis 𝒁 is found for the null space N(𝑨), and the family of solutions is 𝒙+𝒁𝒚, with 𝒙 a particular solution and 𝒚 arbitrary.
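A minimal Julia sketch of this procedure, assuming the standard-library LinearAlgebra package and an illustrative full-rank system (the values of A and b are assumptions):

using LinearAlgebra
A = [1. 2 -1; 2 -1 1; 3 -1 -1]; b = [2., 2, 1]  # illustrative system
F = qr(A)                              # A = QR, Q orthogonal, R upper triangular
x = UpperTriangular(F.R) \ (F.Q'*b)    # back substitution in Rx = Qᵀb
Z = nullspace(A)                       # 3×0 here since N(A) = {0}; otherwise the
                                       # family of solutions is x .+ Z*y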

2. Gaussian elimination and row echelon reduction

Suppose now that 𝑨𝒙=𝒃 admits a unique solution. The QR factorization approach of reducing the problem to 𝑹𝒙=𝑸^T𝒃 is one procedure to compute it, and has the benefit of working with the orthonormal matrix 𝑸. Finding 𝑸 is, however, a significant computational expense. Recall that orthogonality implies linear independence; this suggests approaches that only impose linear independence, without orthogonality. Gaussian elimination is the principal such approach. Consider the system

\begin{cases}
x_1+2x_2-x_3 = 2\\
2x_1-x_2+x_3 = 2\\
3x_1-x_2-x_3 = 1
\end{cases}.

The idea is to combine equations so that successive equations contain one fewer unknown. Ask: by what number should the first equation be multiplied in order to eliminate x_1 from the sum of the first and second equations? This number is called a Gaussian multiplier, and is in this case -2. Repeat the question for eliminating x_1 from the third equation, with multiplier -3.

\begin{cases}
x_1+2x_2-x_3 = 2\\
2x_1-x_2+x_3 = 2\\
3x_1-x_2-x_3 = 1
\end{cases}
\Rightarrow
\begin{cases}
x_1+2x_2-x_3 = 2\\
-5x_2+3x_3 = -2\\
-7x_2+2x_3 = -5
\end{cases}.

Now ask: by what number should the second equation be multiplied to eliminate x_2 from the sum of the second and third equations? The multiplier is in this case -7/5.

\begin{cases}
x_1+2x_2-x_3 = 2\\
-5x_2+3x_3 = -2\\
-7x_2+2x_3 = -5
\end{cases}
\Rightarrow
\begin{cases}
x_1+2x_2-x_3 = 2\\
-5x_2+3x_3 = -2\\
-\tfrac{11}{5}x_3 = -\tfrac{11}{5}
\end{cases}.

Starting from the last equation we find x_3=1; substituting into the second gives -5x_2=-5, hence x_2=1; and substituting into the first equation gives x_1=1.

The above operations only involve the coefficients. A more compact notation is therefore to collect the coefficients into what is known as the bordered matrix, and carry out the same operations on its rows

[\boldsymbol{A}\ \boldsymbol{b}]=\left[\begin{array}{rrrr}
1 & 2 & -1 & 2\\
2 & -1 & 1 & 2\\
3 & -1 & -1 & 1
\end{array}\right]\Rightarrow
[\boldsymbol{A}_1\ \boldsymbol{b}_1]=\left[\begin{array}{rrrr}
1 & 2 & -1 & 2\\
0 & -5 & 3 & -2\\
0 & -7 & 2 & -5
\end{array}\right]\Rightarrow
[\boldsymbol{A}_2\ \boldsymbol{b}_2]=\left[\begin{array}{rrrr}
1 & 2 & -1 & 2\\
0 & -5 & 3 & -2\\
0 & 0 & -\tfrac{11}{5} & -\tfrac{11}{5}
\end{array}\right].

In Julia the above operations would be carried out as

A = [1. 2 -1 2; 2 -1 1 2; 3 -1 -1 1]  # bordered matrix [A b]
A[2,:] = A[2,:] - 2*A[1,:]            # multiplier -2 eliminates x1 from row 2
A[3,:] = A[3,:] - 3*A[1,:]            # multiplier -3 eliminates x1 from row 3
A

[ 1.0   2.0  -1.0   2.0
  0.0  -5.0   3.0  -2.0
  0.0  -7.0   2.0  -5.0 ]    (1)

A[3,:] = A[3,:] - (7/5)*A[2,:]        # multiplier -7/5 eliminates x2 from row 3
A

[ 1.0   2.0  -1.0                 2.0
  0.0  -5.0   3.0                -2.0
  0.0   0.0  -2.1999999999999993 -2.2 ]    (2)

Note the floating-point round-off in the (3,3) entry, whose exact value is -11/5 = -2.2.

Once the above triangular form has been obtained, the solution is found by back substitution: the identity matrix is formed in the first 3 columns, after which the solution appears in the last column.

\left[\begin{array}{rrrr}
1 & 2 & -1 & 2\\
0 & -5 & 3 & -2\\
0 & 0 & -\tfrac{11}{5} & -\tfrac{11}{5}
\end{array}\right]\Rightarrow
\left[\begin{array}{rrrr}
1 & 2 & -1 & 2\\
0 & -5 & 3 & -2\\
0 & 0 & 1 & 1
\end{array}\right]\Rightarrow
\left[\begin{array}{rrrr}
1 & 0 & 0 & 1\\
0 & 1 & 0 & 1\\
0 & 0 & 1 & 1
\end{array}\right].
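Continuing the Julia session above, the same sweep applied to the rows of the bordered matrix reads (the comments show the expected row after each operation):

A[3,:] = A[3,:]/A[3,3]                 # normalize third pivot: [0 0 1 1]
A[2,:] = A[2,:] - A[2,3]*A[3,:]        # eliminate x3 from row 2: [0 -5 0 -5]
A[2,:] = A[2,:]/A[2,2]                 # normalize second pivot: [0 1 0 1]
A[1,:] = A[1,:] - A[1,3]*A[3,:] - A[1,2]*A[2,:]  # eliminate x3, x2: [1 0 0 1]
x = A[:,4]                             # solution in the last column: [1, 1, 1]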

The operations arising in Gaussian elimination are successive linear combinations of rows that maintain the solution of the linear system. This idea is useful in identifying the fundamental subspaces associated with a matrix. The matrices arising at successive stages of the procedure are said to be row equivalent to one another,

\boldsymbol{A}\sim\boldsymbol{A}_1\sim\boldsymbol{A}_2\sim\cdots,

and since 𝑨_k is obtained by linear combinations of the rows of 𝑨_{k-1}, the row space is not changed:

C(\boldsymbol{A}^T)=C(\boldsymbol{A}_1^T)=C(\boldsymbol{A}_2^T)=\cdots.

During the procedure a pivot element is identified in the diagonal position. If a zero value is encountered, rows are permuted to bring a non-zero element to the pivot position. If a non-zero pivot cannot be found by row permutations, one is sought by column permutations as well. If a non-zero pivot cannot be found by either row or column permutations, the matrix is rank-deficient, r=rank(𝑨)<min(m,n), and has a non-trivial null space, as in the following examples

\boldsymbol{A}=\left[\begin{array}{rrr}
1 & 2 & 3\\ 0 & 1 & 1\\ 1 & 2 & 3
\end{array}\right]\in\mathbb{R}^{3\times 3},\quad
\boldsymbol{b}=\left[\begin{array}{r} 3\\ 1\\ 3 \end{array}\right]\in\mathbb{R}^{3},\quad
\boldsymbol{c}=\left[\begin{array}{r} 3\\ 1\\ 4 \end{array}\right]\in\mathbb{R}^{3}.

[\boldsymbol{A}\ \boldsymbol{b}]=\left[\begin{array}{rrrr}
1 & 2 & 3 & 3\\ 0 & 1 & 1 & 1\\ 1 & 2 & 3 & 3
\end{array}\right]\Rightarrow
[\boldsymbol{A}_1\ \boldsymbol{b}_1]=\left[\begin{array}{rrrr}
1 & 2 & 3 & 3\\ 0 & 1 & 1 & 1\\ 0 & 0 & 0 & 0
\end{array}\right]\Rightarrow
\begin{cases}
x_1+2x_2+3x_3 = 3\\ x_2+x_3 = 1\\ 0 = 0
\end{cases},

[\boldsymbol{A}\ \boldsymbol{c}]=\left[\begin{array}{rrrr}
1 & 2 & 3 & 3\\ 0 & 1 & 1 & 1\\ 1 & 2 & 3 & 4
\end{array}\right]\Rightarrow
[\boldsymbol{A}_1\ \boldsymbol{c}_1]=\left[\begin{array}{rrrr}
1 & 2 & 3 & 3\\ 0 & 1 & 1 & 1\\ 0 & 0 & 0 & 1
\end{array}\right]\Rightarrow
\begin{cases}
x_1+2x_2+3x_3 = 3\\ x_2+x_3 = 1\\ 0 = 1
\end{cases}.

The system 𝑨𝒙=𝒃 has an infinite number of solutions, while the system 𝑨𝒙=𝒄 has no solution. Note that 𝑨_1 has a row of zeros, hence the rows of 𝑨 must be linearly dependent and N(𝑨)≠{𝟎}. By the FTLA, when 𝒃∈C(𝑨) an infinite number of solutions is obtained, and for 𝒄∉C(𝑨) no solution is obtained.
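These conclusions can be checked in Julia by comparing the ranks of the bordered matrices with that of 𝑨, using the standard-library rank function:

using LinearAlgebra
A = [1. 2 3; 0 1 1; 1 2 3]; b = [3., 1, 3]; c = [3., 1, 4]
rank(A), rank([A b]), rank([A c])   # (2, 2, 3): b ∈ C(A) but c ∉ C(A)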

The rows with non-zero pivot elements are linearly independent, and reduction to the above row-echelon form is useful to identify the rank of a matrix. The first non-zero entry in a row is called either a pivot or a leading entry. A matrix is said to be brought to reduced row-echelon form when:

  1. all rows consisting entirely of zeros lie below the non-zero rows;

  2. the leading entry of each non-zero row is 1, and lies to the right of the leading entry of the row above it;

  3. each column containing a leading 1 has zeros in all its other entries.

In contrast to the Gram-Schmidt procedure, Gaussian elimination does not impose orthogonality between rows, nor that a row have unit norm. This leads to fewer computations, and is therefore well suited to hand computation on small systems.

The steps in Gaussian elimination can be precisely specified in a format suitable for direct computer coding.

Algorithm Gauss elimination without pivoting

for s = 1 to m-1
    for i = s+1 to m
        t = -a_{is} / a_{ss}
        for j = s+1 to m
            a_{ij} = a_{ij} + t a_{sj}
        b_i = b_i + t b_s
for s = m downto 1
    x_s = b_s / a_{ss}
    for i = 1 to s-1
        b_i = b_i - a_{is} x_s
return x
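A direct Julia transcription of the above might read as follows; the function name gauss_nopivot is an illustrative choice, and a square, full-rank system is assumed:

function gauss_nopivot(A, b)
    A = copy(A); b = copy(b); m = length(b)
    for s = 1:m-1                  # forward elimination
        for i = s+1:m
            t = -A[i,s]/A[s,s]     # Gaussian multiplier for row i
            for j = s+1:m
                A[i,j] += t*A[s,j]
            end
            b[i] += t*b[s]
        end
    end
    x = zeros(m)
    for s = m:-1:1                 # back substitution
        x[s] = b[s]/A[s,s]
        for i = 1:s-1
            b[i] -= A[i,s]*x[s]
        end
    end
    return x
end

gauss_nopivot([1. 2 -1; 2 -1 1; 3 -1 -1], [2., 2, 1])   # returns ≈ [1, 1, 1]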

The variant of the above algorithm that accounts for possible zeros arising in a diagonal position is known as Gauss elimination with pivoting.

Algorithm Gauss elimination with partial pivoting

p = 1:m (initialize row permutation vector)
for s = 1 to m-1
    piv = |a_{p(s),s}|
    for i = s+1 to m
        mag = |a_{p(i),s}|
        if mag > piv then
            piv = mag; k = p(s); p(s) = p(i); p(i) = k
    if piv < ε then stop ("Singular matrix")
    for i = s+1 to m
        t = -a_{p(i),s} / a_{p(s),s}
        for j = s+1 to m
            a_{p(i),j} = a_{p(i),j} + t a_{p(s),j}
        b_{p(i)} = b_{p(i)} + t b_{p(s)}
for s = m downto 1
    x_s = b_{p(s)} / a_{p(s),s}
    for i = 1 to s-1
        b_{p(i)} = b_{p(i)} - a_{p(i),s} x_s
return x
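A Julia sketch of the pivoted variant, tracking the permutation vector p rather than physically swapping rows (the function name gauss_pivot and the tolerance eps are illustrative choices):

function gauss_pivot(A, b; eps=1e-12)
    A = copy(A); b = copy(b); m = length(b)
    p = collect(1:m)                   # row permutation vector
    for s = 1:m-1
        piv = abs(A[p[s],s])           # search column s for the largest pivot
        for i = s+1:m
            mag = abs(A[p[i],s])
            if mag > piv
                piv = mag; p[s], p[i] = p[i], p[s]
            end
        end
        piv < eps && error("Singular matrix")
        for i = s+1:m                  # eliminate below the pivot row
            t = -A[p[i],s]/A[p[s],s]
            for j = s+1:m
                A[p[i],j] += t*A[p[s],j]
            end
            b[p[i]] += t*b[p[s]]
        end
    end
    x = zeros(m)
    for s = m:-1:1                     # back substitution in permuted order
        x[s] = b[p[s]]/A[p[s],s]
        for i = 1:s-1
            b[p[i]] -= A[p[i],s]*x[s]
        end
    end
    return x
end

gauss_pivot([0. 1; 1 1], [1., 2])   # zero pivot handled by permutation: [1, 1]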

3. LU-factorization

The operations arising in Gaussian elimination correspond to a matrix factorization, analogous to how the Gram-Schmidt procedure can be stated as the QR factorization. Revisiting the previous example

\begin{cases}
x_1+2x_2-x_3 = 2\\
2x_1-x_2+x_3 = 2\\
3x_1-x_2-x_3 = 1
\end{cases}
\Leftrightarrow
\boldsymbol{A}\boldsymbol{x}=\boldsymbol{b},\quad
\boldsymbol{A}=\left[\begin{array}{rrr}
1 & 2 & -1\\ 2 & -1 & 1\\ 3 & -1 & -1
\end{array}\right],\quad
\boldsymbol{b}=\left[\begin{array}{r} 2\\ 2\\ 1 \end{array}\right],

the idea is to express linear combinations of rows as a matrix multiplication. Recall that 𝑨𝒙 is a linear combination of the columns of 𝑨, and 𝑨𝑿 expresses multiple such column combinations. Linear combinations of columns are expressed as products in which the first factor contains the columns and the second contains the scaling coefficients. Analogously, linear combinations of rows are expressed by products 𝑳𝑨, where now the left factor contains the scaling coefficients entering into a linear combination of the rows of 𝑨. For example, the first stage of Gaussian elimination for the above system can be expressed as

\boldsymbol{L}_1\boldsymbol{A}=\left[\begin{array}{rrr}
1 & 0 & 0\\ -2 & 1 & 0\\ -3 & 0 & 1
\end{array}\right]\left[\begin{array}{rrr}
1 & 2 & -1\\ 2 & -1 & 1\\ 3 & -1 & -1
\end{array}\right]=\left[\begin{array}{rrr}
1 & 2 & -1\\ 0 & -5 & 3\\ 0 & -7 & 2
\end{array}\right].

The next stage is also expressed as a matrix multiplication, after which an upper triangular matrix 𝑼 is obtained

\boldsymbol{L}_2\boldsymbol{L}_1\boldsymbol{A}=\left[\begin{array}{rrr}
1 & 0 & 0\\ 0 & 1 & 0\\ 0 & -7/5 & 1
\end{array}\right]\left[\begin{array}{rrr}
1 & 2 & -1\\ 0 & -5 & 3\\ 0 & -7 & 2
\end{array}\right]=\left[\begin{array}{rrr}
1 & 2 & -1\\ 0 & -5 & 3\\ 0 & 0 & -11/5
\end{array}\right]=\boldsymbol{U}.
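The two stages can be checked numerically in Julia:

A = [1. 2 -1; 2 -1 1; 3 -1 -1]
L1 = [1. 0 0; -2 1 0; -3 0 1]   # multipliers -2, -3 placed in column 1
L2 = [1. 0 0; 0 1 0; 0 -7/5 1]  # multiplier -7/5 placed in column 2
U = L2*L1*A                     # upper triangular, U[3,3] = -11/5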

For a general matrix 𝑨∈ℝ^{m×m} the sequence of operations is

\boldsymbol{L}_{m-1}\cdots\boldsymbol{L}_2\boldsymbol{L}_1\boldsymbol{A}=\boldsymbol{U}.

Definition. The matrix

\boldsymbol{L}_k=\begin{pmatrix}
1 & & & & &\\
& \ddots & & & &\\
& & 1 & & &\\
& & -l_{k+1,k} & 1 & &\\
& & \vdots & & \ddots &\\
& & -l_{m,k} & & & 1
\end{pmatrix},

with l_{i,k}=a_{i,k}^{(k)}/a_{k,k}^{(k)}, and 𝑨^{(k)}=(a_{i,j}^{(k)}) the matrix obtained after step k of row echelon reduction (or, equivalently, Gaussian elimination), is called a Gaussian multiplier matrix.

The inverse of a Gaussian multiplier matrix is

\boldsymbol{L}_k^{-1}=\begin{pmatrix}
1 & & & & &\\
& \ddots & & & &\\
& & 1 & & &\\
& & l_{k+1,k} & 1 & &\\
& & \vdots & & \ddots &\\
& & l_{m,k} & & & 1
\end{pmatrix}=\boldsymbol{I}-(\boldsymbol{L}_k-\boldsymbol{I}).

From (\boldsymbol{L}_{m-1}\boldsymbol{L}_{m-2}\cdots\boldsymbol{L}_2\boldsymbol{L}_1)\boldsymbol{A}=\boldsymbol{U} obtain

\boldsymbol{A}=(\boldsymbol{L}_{m-1}\boldsymbol{L}_{m-2}\cdots\boldsymbol{L}_2\boldsymbol{L}_1)^{-1}\boldsymbol{U}
=\boldsymbol{L}_1^{-1}\boldsymbol{L}_2^{-1}\cdots\boldsymbol{L}_{m-1}^{-1}\boldsymbol{U}
=\boldsymbol{L}\boldsymbol{U}.
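Continuing the Julia check above, the factor 𝑳 collects the multipliers with flipped signs, and the product recovers 𝑨:

using LinearAlgebra
L = inv(L1)*inv(L2)   # equals [1 0 0; 2 1 0; 3 7/5 1]
L*U ≈ A               # true: A = LU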

The above is known as an LU factorization, short for lower-upper factorization. Solving a linear system by LU-factorization consists of the steps:

  1. Find the factorization LU=A

  2. Insert the factorization into Ax=b to obtain (LU)x=L(Ux)=Ly=b, where the notation y=Ux has been introduced. The system

    Ly=b

    is easy to solve by forward substitution to find y for given b

  3. Finally find x by backward substitution solution of

    Ux=y,

    as illustrated in the Julia sketch below.
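These steps map directly onto Julia's built-in lu factorization, which also applies row pivoting recorded in the permutation F.p; a sketch for the example system:

using LinearAlgebra
A = [1. 2 -1; 2 -1 1; 3 -1 -1]; b = [2., 2, 1]
F = lu(A)                           # step 1: PA = LU
y = LowerTriangular(F.L) \ b[F.p]   # step 2: forward substitution, Ly = Pb
x = UpperTriangular(F.U) \ y        # step 3: back substitution, Ux = y

Equivalently, F\b (or simply A\b) carries out both triangular solves in one call.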

The various procedures encountered so far to solve a linear system are described in the following table.

| Given 𝑨∈ℝ^{m×n} | Singular value decomposition | Gram-Schmidt | Lower-upper |
| 𝑨𝒙=𝒃 | 𝑼𝚺𝑽^T=𝑨 | 𝑸𝑹=𝑨 | 𝑳𝑼=𝑨 |
| Transformation of coordinates | (𝑼𝚺𝑽^T)𝒙=𝒃 ⇒ 𝑼𝒚=𝒃 ⇒ 𝒚=𝑼^T𝒃 | (𝑸𝑹)𝒙=𝒃 ⇒ 𝑸𝒚=𝒃, 𝒚=𝑸^T𝒃 | (𝑳𝑼)𝒙=𝒃 ⇒ 𝑳𝒚=𝒃 (forward sub to find 𝒚) |
| | 𝚺𝒛=𝒚 ⇒ 𝒛=𝚺^+𝒚 | 𝑹𝒙=𝒚 (back sub to find 𝒙) | 𝑼𝒙=𝒚 (back sub to find 𝒙) |
| | 𝑽^T𝒙=𝒛 ⇒ 𝒙=𝑽𝒛 | | |
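The three routes of the table can be compared in Julia on the example system; all agree to round-off:

using LinearAlgebra
A = [1. 2 -1; 2 -1 1; 3 -1 -1]; b = [2., 2, 1]
S = svd(A); x_svd = S.V*((S.U'*b) ./ S.S)             # z = Σ⁺y, then x = Vz
Q = qr(A);  x_qr  = UpperTriangular(Q.R) \ (Q.Q'*b)   # back sub in Rx = Qᵀb
F = lu(A);  x_lu  = UpperTriangular(F.U) \
                    (LowerTriangular(F.L) \ b[F.p])   # forward, then back sub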

4. Matrix inverse

For 𝑨∈ℝ^{m×n} the pseudo-inverse 𝑨^+ has been introduced based on the SVD 𝑨=𝑼𝚺𝑽^T as

\boldsymbol{A}^+=\boldsymbol{V}\boldsymbol{\Sigma}^+\boldsymbol{U}^T.

When 𝑨∈ℝ^{m×m} is square and of full rank, the system 𝑨𝒙=𝒃 has a solution that can be stated as 𝒙=𝑨^{-1}𝒃, where 𝑨^{-1} is the inverse of 𝑨. The matrix 𝑨 is said to be invertible if there exists 𝑿∈ℝ^{m×m} such that

𝑨𝑿=𝑿𝑨=𝑰,

and in this case 𝑿=𝑨-1 is the inverse of 𝑨.

The inverse can be computed by extending Gauss elimination

[ 𝑨 | 𝑰 ] ⇒ [ 𝑰 | 𝑿 ],

a procedure known as the Gauss-Jordan algorithm.
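A minimal Julia sketch of this reduction of the bordered matrix [𝑨 | 𝑰], without pivoting (so non-zero pivots are assumed to arise in natural order; inv(A) is the library route):

using LinearAlgebra
A = [1. 2 -1; 2 -1 1; 3 -1 -1]
B = [A Matrix{Float64}(I, 3, 3)]        # bordered matrix [A | I]
for s = 1:3
    B[s,:] = B[s,:]/B[s,s]              # normalize pivot row
    for i = 1:3
        i == s && continue
        B[i,:] = B[i,:] - B[i,s]*B[s,:] # zero out column s in all other rows
    end
end
X = B[:,4:6]                            # X ≈ inv(A)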

A square matrix has an inverse only when it is of full rank. The following are equivalent statements:

  1. 𝑨 invertible

  2. 𝑨𝒙=𝒃 has a unique solution for all 𝒃∈ℝ^m

  3. 𝑨𝒙=𝟎 has the unique solution 𝒙=𝟎

  4. The reduced row echelon form of 𝑨 is 𝑰

  5. 𝑨 can be written as product of elementary matrices