Data Redundancy

1.Linear dependence

For the simple scalar mapping f:, f(x)=ax, the condition f(x)=0 implies either that a=0 or x=0. Note that a=0 can be understood as defining a zero mapping f(x)=0. Linear mappings between vector spaces, 𝒇:UV, can exhibit different behavior, and the condtion 𝒇(𝒙)=𝑨𝒙=𝟎, might be satisfied for both 𝒙𝟎, and 𝑨𝟎. Analogous to the scalar case, 𝑨=𝟎 can be understood as defining a zero mapping, 𝒇(𝒙)=𝟎.

In vector space 𝒱=(V,S,+,), vectors 𝒖,𝒗V related by a scaling operation, 𝒗=a𝒖, aS, are said to be colinear, and are considered to contain redundant data. This can be restated as 𝒗span{𝒖}, from which it results that span{𝒖}=span{𝒖,𝒗}. Colinearity can be expressed only in terms of vector scaling, but other types of redundancy arise when also considering vector addition as expressed by the span of a vector set. Assuming that 𝒗span{𝒖}, then the strict inclusion relation span{𝒖}span{𝒖,𝒗} holds. This strict inclusion expressed in terms of set concepts can be transcribed into an algebraic condition.

Definition. The vectors 𝒂1,𝒂2,,𝒂nV,are linearly dependent if there exist n scalars, x1,,xnS, at least one of which is different from zero such that

x1𝒂1++xn𝒂n=𝟎.

Introducing a matrix representation of the vectors

𝑨=[ 𝒂1 𝒂2 𝒂n ];𝒙=[ x1 x2 xn ]

allows restating linear dependence as the existence of a non-zero vector, 𝒙𝟎, such that 𝑨𝒙=𝟎. Linear dependence can also be written as 𝑨𝒙=𝟎𝒙=𝟎, or that one cannot deduce from the fact that the linear mapping 𝒇(𝒙)=𝑨𝒙 attains a zero value that the argument itself is zero. The converse of this statement would be that the only way to ensure 𝑨𝒙=𝟎 is for 𝒙=𝟎, or 𝑨𝒙=𝟎𝒙=𝟎, leading to the concept of linear independence.

Definition. The vectors 𝒂1,𝒂2,,𝒂nV,are linearly independent if the only n scalars, x1,,xnS, that satisfy

x1𝒂1++xn𝒂n=𝟎, (1)

are x1=0, x2=0,…,xn=0.

2.Basis and dimension

Vector spaces are closed under linear combination, and the span of a vector set ={𝒂1,𝒂2,} defines a vector subspace. If the entire set of vectors can be obtained by a spanning set, V=span, extending by an additional element 𝒞={𝒃} would be redundant since span=span𝒞. This is recognized by the concept of a basis, and also allows leads to a characterization of the size of a vector space by the cardinality of a basis set.

Definition. A set of vectors 𝒖1,,𝒖nV is a basis for vector space 𝒱=(V,S,+,) if

  1. 𝒖1,,𝒖n are linearly independent;

  2. span{𝒖1,,𝒖n}=V.

Definition. The number of vectors 𝒖1,,𝒖nV within a basis is the dimension of the vector space 𝒱=(V,S,+,).

3.Dimension of matrix spaces

The domain and co-domain of the linear mapping 𝒇:UV, 𝒇(𝒙)=𝑨𝒙, are decomposed by the spaces associated with the matrix 𝑨. When U=n, V=m, the following vector subspaces associated with the matrix 𝑨m×n have been defined:

Definition. The rank of a matrix 𝑨m×n is the dimension of its column space and is equal to the dimension of its row space.

Definition. The nullity of a matrix 𝑨m×n is the dimension of its null space.