Fundamental Matrix Spaces

1. Vector Subspaces

A central interest in scientific computation is to seek simple descriptions of complex objects. A typical situation is specifying an instance of some object of interest through an m-tuple 𝒗∈ℝm with large m. Assuming that addition and scaling of such objects can cogently be defined, a vector space is obtained, say over the field of reals with a Euclidean distance, 𝔼m. Examples include recordings of medical data (electroencephalograms, electrocardiograms), sound recordings, or images, for which m can easily reach into the millions. A natural question to ask is whether all m real numbers are actually needed to describe the observed objects, or whether there is some intrinsic description that requires a much smaller number of descriptive parameters while still preserving the useful idea of linear combination. The mathematical transcription of this idea is a vector subspace.

Definition. (Vector Subspace) 𝒰=(U,S,+,⋅), U≠∅, is a vector subspace of vector space 𝒱=(V,S,+,⋅) over the same field of scalars S, denoted by 𝒰≤𝒱, if U⊆V and ∀a,b∈S, ∀𝒖,𝒗∈U, the linear combination a𝒖+b𝒗∈U.

The above states that a vector subspace must be closed under linear combination, and have the same vector addition and scaling operations as the enclosing vector space. The simplest vector subspace of a vector space is the null subspace that contains only the null element, U={𝟎}. In fact any subspace must contain the null element 𝟎, since otherwise closure would fail for the particular linear combination 𝒖+(-𝒖)=𝟎. If U⊂V, then 𝒰 is said to be a proper subspace of 𝒱, denoted by 𝒰<𝒱.

Setting m-n components equal to zero in the real space ℝm defines a proper subspace whose elements can be placed into a one-to-one correspondence with the vectors of ℝn. For example, setting component m of 𝒙∈ℝm equal to zero gives 𝒙=[ x1 x2 … xm-1 0 ]T that, while not a member of ℝm-1, is in a one-to-one relation with 𝒙′=[ x1 x2 … xm-1 ]T∈ℝm-1. Dropping the last component of 𝒚∈ℝm, 𝒚=[ y1 y2 … ym-1 ym ]T, gives the vector 𝒚′=[ y1 y2 … ym-1 ]T∈ℝm-1, but this is no longer a one-to-one correspondence since, for a given 𝒚′, the last component ym could take any value.

m=3; x=[1; 2; 0]; xp=x[1:2]

[ 1; 2 ] (1)

y=[1; 2; 3]; yp=y[1:2]

[ 1; 2 ] (2)

Vector subspaces arise in decomposition or partitioning of a vector space. The converse, composition of vector spaces 𝒰=(U,S,+,⋅), 𝒱=(V,S,+,⋅), is defined in terms of linear combination. A vector 𝒙∈ℝ3 can be obtained as the linear combination

𝒙=[ x1 x2 x3 ]T=[ x1 0 0 ]T+[ 0 x2 x3 ]T,

but also as

𝒙=[ x1 x2 x3 ]T=[ x1 x2-a 0 ]T+[ 0 a x3 ]T,

for some arbitrary a∈ℝ. In the first case, 𝒙 is obtained as a unique linear combination of a vector from the set U={[ x1 0 0 ]T | x1∈ℝ} with a vector from V={[ 0 x2 x3 ]T | x2,x3∈ℝ}. In the second case, there is an infinity of linear combinations of a vector from V with another from W={[ x1 x2 0 ]T | x1,x2∈ℝ} that equal the vector 𝒙. This distinction is captured by a pair of definitions to describe vector space composition.

Definition. Given two vector subspaces 𝒰=(U,S,+,⋅), 𝒱=(V,S,+,⋅) of the space 𝒲=(W,S,+,⋅), the sum is the vector space 𝒰+𝒱=(U+V,S,+,⋅), where the sum of the two sets of vectors U,V is U+V={𝒖+𝒗 | 𝒖∈U, 𝒗∈V}.

Definition. Given two vector subspaces 𝒰=(U,S,+,⋅), 𝒱=(V,S,+,⋅) of the space 𝒲=(W,S,+,⋅), the direct sum is the vector space 𝒰⊕𝒱=(U⊕V,S,+,⋅), where the direct sum of the two sets of vectors U,V is U⊕V={𝒖+𝒗 | ∃!𝒖∈U, ∃!𝒗∈V} (unique decomposition).

Since the same scalar field, vector addition, and scaling are used, it is more convenient to refer to vector space sums simply by the sum of the vector sets, U+V or U⊕V, instead of specifying the full tuple for each space. This convention shall be adopted henceforth to simplify the notation.

u=[1; 0; 0]; v=[0; 2; 3]; vp=[0; 1; 3]; w=[1; 1; 0]; [u+v vp+w]

[ 1 1; 2 2; 3 3 ] (3)

In the previous example, the essential difference between the two ways to express 𝒙∈ℝ3 is that U∩V={𝟎}, but V∩W={[ 0 a 0 ]T | a∈ℝ}≠{𝟎}; in general, if the zero vector is the only common element of two vector subspaces, then the sum of the vector spaces becomes a direct sum. In practice, the most important procedure to construct direct sums, or to check when an intersection of two vector subspaces reduces to the zero vector, is through an inner product.

Definition. Two vector subspaces U,V of the real vector space ℝm are orthogonal, denoted as U⊥V, if 𝒖T𝒗=0 for any 𝒖∈U, 𝒗∈V.

Definition. Two vector subspaces U,V of U+V are orthogonal complements, denoted U=V⊥, V=U⊥, if they are orthogonal subspaces, U⊥V, and U∩V={𝟎}, i.e., the null vector is the only common element of both subspaces.
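Orthogonality of two subspaces can be checked numerically from spanning sets: by bilinearity of the inner product, it suffices that every pair of spanning vectors be orthogonal. A minimal sketch in Python with NumPy (the helper name orthogonal_subspaces and the tolerance are our choices, not standard library features):

```python
import numpy as np

def orthogonal_subspaces(U, V, tol=1e-12):
    """Check U ⊥ V, where the columns of U and V span the two subspaces.

    By bilinearity, u^T v = 0 for all spanning pairs implies orthogonality
    of the full subspaces."""
    return bool(np.all(np.abs(U.T @ V) < tol))

# U = span{e1} and V = span{e2, e3} are orthogonal complements in R^3.
U = np.array([[1.0], [0.0], [0.0]])
V = np.array([[0.0, 0.0],
              [1.0, 0.0],
              [0.0, 1.0]])
print(orthogonal_subspaces(U, V))  # True
```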

The above concept of orthogonality can be extended to other vector spaces, such as spaces of functions. It can also be extended to other choices of an inner product, in which case the term conjugate vector spaces is sometimes used. The concepts of sum and direct sum of vector spaces used linear combinations of the form 𝒖+𝒗. This notion can be extended to arbitrary linear combinations.

Definition. In vector space 𝒱=(V,S,+,⋅), the span of vectors 𝒂1,𝒂2,…,𝒂n∈V is the set of vectors reachable by linear combination

span{𝒂1,𝒂2,…,𝒂n}={𝒃∈V | ∃x1,…,xn∈S such that 𝒃=x1𝒂1+⋯+xn𝒂n}.

Note that for real vector spaces a member of the span of the vectors {𝒂1,𝒂2,…,𝒂n} is the vector 𝒃 obtained from the matrix-vector multiplication

𝒃=𝑨𝒙=[ 𝒂1 𝒂2 … 𝒂n ][ x1 x2 … xn ]T.

From the above, the span is a subset of the co-domain of the linear mapping 𝒇(𝒙)=𝑨𝒙.
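As a concrete sketch of this observation (Python with NumPy, with an illustrative matrix of our choosing), a member of span{𝒂1,𝒂2} is produced by a matrix-vector product:

```python
import numpy as np

# Columns a1, a2 span the y1y2-plane in R^3; any member of the span
# is reached as b = A x for some choice of coefficients x.
A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.0, 0.0]])
x = np.array([2.0, -3.0])
b = A @ x  # b = 2*a1 - 3*a2 = [2, -3, 0]
```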

2. Vector subspaces of a linear mapping

The wide-ranging utility of linear algebra results from a complete characterization of the behavior of a linear mapping between vector spaces 𝒇:U→V, 𝒇(a𝒖+b𝒗)=a𝒇(𝒖)+b𝒇(𝒗). For some given linear mapping the questions that arise are:

  1. Can any vector within V be obtained by evaluation of 𝒇?

  2. Is there a single way that a vector within V can be obtained by evaluation of 𝒇?

Linear mappings between real vector spaces, 𝒇:ℝn→ℝm, have been seen to be completely specified by a matrix 𝑨∈ℝm×n. It is common to frame the above questions about the behavior of the linear mapping 𝒇(𝒙)=𝑨𝒙 through sets associated with the matrix 𝑨. To frame an answer to the first question, a set of reachable vectors is first defined.

Definition. The column space (or range) of matrix 𝑨∈ℝm×n is the set of vectors reachable by linear combination of the matrix column vectors

C(𝑨)=range(𝑨)={𝒃∈ℝm | ∃𝒙∈ℝn such that 𝒃=𝑨𝒙}.

By definition, the column space is included in the co-domain of the function 𝒇(𝒙)=𝑨𝒙, C(𝑨)⊆ℝm, and is readily seen to be a vector subspace of ℝm. The question that arises is whether the column space is the entire co-domain, C(𝑨)=ℝm, which would signify that any vector can be reached by linear combination. If this is not the case then the column space is a proper subset, C(𝑨)⊂ℝm, and the question is to determine what part of the co-domain cannot be reached by linear combination of columns of 𝑨. Consider the orthogonal complement of C(𝑨), defined as the set of vectors orthogonal to all of the column vectors of 𝑨, expressed through inner products as

𝒂1T𝒚=0, 𝒂2T𝒚=0, …, 𝒂nT𝒚=0.

This can be expressed more concisely through the transpose operation

𝑨=[ 𝒂1 𝒂2 … 𝒂n ], 𝑨T𝒚=[ 𝒂1T; 𝒂2T; …; 𝒂nT ]𝒚=[ 𝒂1T𝒚; 𝒂2T𝒚; …; 𝒂nT𝒚 ],

and leads to the definition of a set of vectors for which 𝑨T𝒚=𝟎

Definition. The left null space (or cokernel) of a matrix 𝑨∈ℝm×n is the set

N(𝑨T)=null(𝑨T)={𝒚∈ℝm | 𝑨T𝒚=𝟎}.

Note that the left null space is also a vector subspace of the co-domain of 𝒇(𝒙)=𝑨𝒙, N(𝑨T)≤ℝm. The above definitions suggest that both the matrix and its transpose play a role in characterizing the behavior of the linear mapping 𝒇(𝒙)=𝑨𝒙, so analogous sets are defined for the transpose 𝑨T.

Definition. The row space (or corange) of a matrix 𝑨∈ℝm×n is the set

R(𝑨)=C(𝑨T)=range(𝑨T)={𝒄∈ℝn | ∃𝒚∈ℝm such that 𝒄=𝑨T𝒚}⊆ℝn.

Definition. The null space of a matrix 𝑨∈ℝm×n is the set

N(𝑨)=null(𝑨)={𝒙∈ℝn | 𝑨𝒙=𝟎}⊆ℝn.
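These definitions can be sketched numerically (Python with NumPy; the small matrix below, with colinear columns, is an illustrative choice). Note how a column-space vector 𝒃=𝑨𝒙 is orthogonal to a left-null-space vector 𝒚, since 𝒃T𝒚=𝒙T(𝑨T𝒚)=0:

```python
import numpy as np

A = np.array([[1.0, -1.0],
              [0.0,  0.0],
              [0.0,  0.0]])

y = np.array([0.0, 1.0, 2.0])   # y in N(A^T): A^T y = 0
assert np.allclose(A.T @ y, 0)

x = np.array([1.0, 1.0])        # x in N(A): A x = 0 (columns are colinear)
assert np.allclose(A @ x, 0)

b = A @ np.array([3.0, 1.0])    # b in C(A)
assert b @ y == 0.0             # C(A) is orthogonal to N(A^T)
```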

Examples. Consider a linear mapping 𝒇:ℝn→ℝm, defined by 𝒚=𝒇(𝒙)=𝑨𝒙=[ y1 … ym ]T, with 𝑨∈ℝm×n.

  1. For n=1, m=3,

𝑨=[ 1; 0; 0 ], 𝑨T=[ 1 0 0 ],

    the column space C(𝑨) is the y1-axis, and the left null space N(𝑨T) is the y2y3-plane.

  2. For n=2, m=3,

𝑨=[ 1 -1; 0 0; 0 0 ]=[ 𝒂1 𝒂2 ], 𝑨T=[ 1 0 0; -1 0 0 ],

the columns of 𝑨 are colinear, 𝒂2=-𝒂1, the column space C(𝑨) is the y1-axis, and the left null space N(𝑨T) is the y2y3-plane, as before.

  3. For n=2, m=3,

𝑨=[ 1 0; 0 1; 0 0 ], 𝑨T=[ 1 0 0; 0 1 0 ],

    the column space C(𝑨) is the y1y2-plane, and the left null space N(𝑨T) is the y3-axis.

  4. For n=2, m=3,

𝑨=[ 1 1; 1 -1; 0 0 ], 𝑨T=[ 1 1 0; 1 -1 0 ],

    the same C(𝑨), N(𝑨T) are obtained.

  5. For n=3, m=3,

𝑨=[ 1 1 3; 1 -1 -1; 1 1 3 ]=[ 𝒂1 𝒂2 𝒂3 ],
𝑨T=[ 1 1 1; 1 -1 1; 3 -1 3 ]=[ 𝒂1T; 𝒂2T; 𝒂3T ], 𝑨T𝒚=[ 𝒂1T𝒚; 𝒂2T𝒚; 𝒂3T𝒚 ],

since 𝒂3=𝒂1+2𝒂2, the orthogonality condition 𝑨T𝒚=𝟎 is satisfied by vectors of the form 𝒚=[ a 0 -a ]T, a∈ℝ.
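The last example can be checked numerically; a sketch in Python with NumPy:

```python
import numpy as np

# Matrix from Example 5; its columns satisfy a3 = a1 + 2 a2.
A = np.array([[1.0,  1.0,  3.0],
              [1.0, -1.0, -1.0],
              [1.0,  1.0,  3.0]])
a1, a2, a3 = A.T  # rows of A^T are the columns of A
assert np.allclose(a3, a1 + 2 * a2)

# Any y = [a, 0, -a] satisfies the orthogonality condition A^T y = 0.
a = 2.5
y = np.array([a, 0.0, -a])
print(A.T @ y)  # [0. 0. 0.]
```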

The above low-dimensional examples are useful to gain initial insight into the significance of the spaces C(𝑨), N(𝑨T). Further appreciation can be gained by applying the same concepts to processing of images. A gray-scale image of size px by py pixels can be represented as a vector with m=px·py components, 𝒃∈[0,1]m⊂ℝm. Even for a small image with px=py=128=2^7 pixels along each direction, the vector 𝒃 would have m=2^14 components. An image can be specified as a linear combination of the columns of the identity matrix

𝒃=𝑰𝒃=[ 𝒆1 𝒆2 … 𝒆m ][ b1 b2 … bm ]T,

with bi the gray-level intensity in pixel i. Similar to the inclined plane example from §1, an alternative description as a linear combination of another set of vectors 𝒂1,…,𝒂m might be more relevant. One choice of greater utility for image processing mimics the behavior of the trigonometric set {1, cos t, cos 2t, …, sin t, sin 2t, …} that extends the second example in §1; for m=4 such a choice would be

𝑨=[ 𝒂1 𝒂2 𝒂3 𝒂4 ]=[ 1 1 1 0; 1 1 0 1; 1 0 1 1; 1 0 0 0 ].
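Taking the 4×4 matrix 𝑨 displayed above (read row-wise), a quick check shows it is nonsingular, so every 𝒃∈ℝ4 has a unique set of coefficients in the new basis, found by solving 𝑨𝒙=𝒃. A sketch in Python with NumPy (the intensity values in b are hypothetical):

```python
import numpy as np

# Alternative basis for m = 4, one row-wise reading of the matrix above.
A = np.array([[1.0, 1.0, 1.0, 0.0],
              [1.0, 1.0, 0.0, 1.0],
              [1.0, 0.0, 1.0, 1.0],
              [1.0, 0.0, 0.0, 0.0]])
b = np.array([0.2, 0.4, 0.6, 0.8])  # hypothetical gray-level intensities
x = np.linalg.solve(A, b)           # coefficients of b in the new basis
assert np.allclose(A @ x, b)        # b is recovered by linear combination
```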

3. Linear dependence

For the simple scalar mapping f:ℝ→ℝ, f(x)=ax, the condition f(x)=0 implies either that a=0 or x=0. Note that a=0 can be understood as defining a zero mapping, f(x)=0. Linear mappings between vector spaces, 𝒇:U→V, can exhibit different behavior: the condition 𝒇(𝒙)=𝑨𝒙=𝟎 might be satisfied with both 𝒙≠𝟎 and 𝑨≠𝟎. Analogous to the scalar case, 𝑨=𝟎 can be understood as defining a zero mapping, 𝒇(𝒙)=𝟎.

In vector space 𝒱=(V,S,+,⋅), vectors 𝒖,𝒗∈V related by a scaling operation, 𝒗=a𝒖, a∈S, are said to be colinear, and are considered to contain redundant data. This can be restated as 𝒗∈span{𝒖}, from which it results that span{𝒖}=span{𝒖,𝒗}. Colinearity can be expressed in terms of vector scaling alone, but other types of redundancy arise when also considering vector addition, as expressed by the span of a vector set. Assuming that 𝒗∉span{𝒖}, the strict inclusion relation span{𝒖}⊂span{𝒖,𝒗} holds. This strict inclusion expressed in terms of set concepts can be transcribed into an algebraic condition.

Definition. The vectors 𝒂1,𝒂2,…,𝒂n∈V are linearly dependent if there exist n scalars, x1,…,xn∈S, at least one of which is different from zero, such that

x1𝒂1+⋯+xn𝒂n=𝟎.

Introducing a matrix representation of the vectors

𝑨=[ 𝒂1 𝒂2 … 𝒂n ], 𝒙=[ x1 x2 … xn ]T,

allows restating linear dependence as the existence of a non-zero vector, 𝒙≠𝟎, such that 𝑨𝒙=𝟎. Linear dependence can also be written as 𝑨𝒙=𝟎 ⇏ 𝒙=𝟎: one cannot deduce from the fact that the linear mapping 𝒇(𝒙)=𝑨𝒙 attains a zero value that the argument itself is zero. The converse of this statement is that the only way to ensure 𝑨𝒙=𝟎 is for 𝒙=𝟎, or 𝑨𝒙=𝟎 ⇒ 𝒙=𝟎, leading to the concept of linear independence.
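A sketch of linear dependence in Python with NumPy, using colinear columns (the simplest instance of redundancy, with an illustrative matrix of our choosing):

```python
import numpy as np

# a2 = -a1, so the columns are linearly dependent:
# the nonzero vector x = [1, 1] satisfies A x = 0.
A = np.array([[1.0, -1.0],
              [2.0, -2.0],
              [3.0, -3.0]])
x = np.array([1.0, 1.0])
print(A @ x)  # [0. 0. 0.]
```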

Definition. The vectors 𝒂1,𝒂2,…,𝒂n∈V are linearly independent if the only n scalars, x1,…,xn∈S, that satisfy

x1𝒂1+⋯+xn𝒂n=𝟎, (4)

are x1=0, x2=0,…,xn=0.

4. Basis and dimension

Vector spaces are closed under linear combination, and the span of a vector set ℬ={𝒂1,𝒂2,…} defines a vector subspace. If the entire set of vectors can be obtained from a spanning set, V=span ℬ, extending ℬ by an additional element, 𝒞=ℬ∪{𝒃}, would be redundant since span ℬ=span 𝒞. This is recognized by the concept of a basis, and also leads to a characterization of the size of a vector space by the cardinality of a basis set.

Definition. A set of vectors 𝒖1,…,𝒖n∈V is a basis for vector space 𝒱=(V,S,+,⋅) if

  1. 𝒖1,,𝒖n are linearly independent;

  2. span{𝒖1,,𝒖n}=V.

Definition. The number of vectors 𝒖1,…,𝒖n∈V within a basis is the dimension of the vector space 𝒱=(V,S,+,⋅).
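Numerically, the dimension of span{𝒂1,…,𝒂n} can be obtained as the rank of the matrix with these columns; a sketch in Python with NumPy, reusing the vectors of Example 5 where 𝒂3=𝒂1+2𝒂2:

```python
import numpy as np

# Columns with a3 = a1 + 2 a2: only two independent directions,
# so span{a1, a2, a3} has dimension 2.
A = np.array([[1.0,  1.0,  3.0],
              [1.0, -1.0, -1.0],
              [1.0,  1.0,  3.0]])
print(np.linalg.matrix_rank(A))  # 2
```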

5. Dimension of matrix spaces

The domain and co-domain of the linear mapping 𝒇:U→V, 𝒇(𝒙)=𝑨𝒙, are decomposed by the spaces associated with the matrix 𝑨. When U=ℝn, V=ℝm, the following vector subspaces associated with the matrix 𝑨∈ℝm×n have been defined: the column space C(𝑨)≤ℝm, the left null space N(𝑨T)≤ℝm, the row space R(𝑨)=C(𝑨T)≤ℝn, and the null space N(𝑨)≤ℝn.

Definition. The rank of a matrix 𝑨∈ℝm×n is the dimension of its column space and is equal to the dimension of its row space.

Definition. The nullity of a matrix 𝑨∈ℝm×n is the dimension of its null space.
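A numerical sketch (Python with NumPy, reusing the matrix of Example 5) of these two definitions, which also illustrates the well-known rank-nullity relation rank(𝑨)+nullity(𝑨)=n:

```python
import numpy as np

A = np.array([[1.0,  1.0,  3.0],
              [1.0, -1.0, -1.0],
              [1.0,  1.0,  3.0]])
rank = np.linalg.matrix_rank(A)  # dim C(A) = dim R(A)
nullity = A.shape[1] - rank      # dim N(A), by the rank-nullity relation

# A one-dimensional null space: x = [1, 2, -1] since a3 = a1 + 2 a2.
x = np.array([1.0, 2.0, -1.0])
assert np.allclose(A @ x, 0)
print(rank, nullity)  # 2 1
```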

Summary.