MATH661 Homework 3 - Least squares problems

1 Track 1

Consider data

𝒟 = \{(t_{i}, y_{i}) | i = 1, 2, \dots, m\}

obtained by sampling a function $f : ℝ \to ℝ$, with $y_{i} = f (t_{i})$. An approximation is sought by linear combination

f (t) ≅ x_{1} a_{1} (t) + x_{2} a_{2} (t) + \dots + x_{n} a_{n} (t) .

Introduce the vector-valued function $A : ℝ \to ℝ^{n}$ (organized as a row vector)

A (t) = [\begin{array}{llll} a_{1} (t) & a_{2} (t) & \dots & a_{n} (t) \end{array}],

such that

f (t) ≅ A (t) 𝒙, 𝒙 = {[\begin{array}{llll} x_{1} & x_{2} & \dots & x_{n} \end{array}]}^{T} .

With $𝒕 = {[\begin{array}{llll} t_{1} & t_{2} & \dots & t_{m} \end{array}]}^{T}$ a sampling of the function domain, a matrix is defined by

𝑨 = A (𝒕) = [\begin{array}{llll} a_{1} (𝒕) & a_{2} (𝒕) & \dots & a_{n} (𝒕) \end{array}] \in ℝ^{m \times n} .

Tasks. In each exercise below, construct the least-squares approximant for the stated range of $n \in 𝒩$, sample points $𝒕$, and choice of $A (t)$. Plot in a single figure all components of $A (t)$. Plot the approximants, as well as $f$, in a single figure. Construct a convergence plot of the approximations by representation of point data

ℰ = \{(\log n, \log || 𝒚 - 𝑨 𝒙 ||) | 𝑨 \in ℝ^{m \times n}, n \in 𝒩\} .

For the largest value of $n$ within $𝒩$, construct a figure superimposing the approximants obtained for an increasing number of sampling points, $m \in ℳ$. Comment on what you observe in each individual exercise. Also compare results from the different exercises.

Problem 1. Start with the classical example due to Runge (1901)

\begin{array}{l} f : [- 1, 1] \to ℝ, f (t) = \frac{1}{1 + 25 t^{2}}, t_{i} = \frac{2 (i - 1)}{m - 1} - 1, \\ ℳ = \{16, 32, 64, 128, 256\}, 𝒩 = \{4, 8, 16, 32\}, \\ A (t) = [\begin{array}{lllll} 1 & t & t^{2} & \dots & t^{n - 1} \end{array}] . \end{array}

Solution. With $𝒕 \in ℝ^{m}$ denoting the sampling point vector and $𝒚 \in ℝ^{m}$ the function values at the sample points, the least squares problem is

{min}_{𝒙 \in ℝ^{n}} || 𝒚 - 𝑨 𝒙 ||,

where

𝑨 = [\begin{array}{llll} 𝟏 & 𝒕 & \dots & 𝒕^{n - 1} \end{array}] .

The solution to the least squares problem

𝒛 = {argmin}_{𝒙 \in ℝ^{n}} || 𝒚 - 𝑨 𝒙 ||,

furnishes the approximation

f (t) ≅ \tilde{f} (t) = A (t) 𝒛,

that can be sampled at $M ⩾ m$ points to assess approximation error.
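The steps above can be sketched in a few lines of Julia. This is a minimal illustration, not the course's own code: `MonomialBasis` is written here as a compact comprehension, the sizes `m`, `n`, `M` are illustrative choices, and the backslash operator is used as one convenient way to solve the least squares problem.

```julia
using LinearAlgebra

# Runge function and monomial basis from the problem statement
Runge(t) = 1 / (1 + 25t^2)
MonomialBasis(t, n) = [ti^(j - 1) for ti in t, j in 1:n]   # m-by-n matrix

# Illustrative sizes (assumed): m data points, n basis functions, M evaluation points
m, n, M = 16, 4, 64
t = collect(range(-1, 1, length=m))     # equidistant sample points t_i
y = Runge.(t)                           # data y_i = f(t_i)

A = MonomialBasis(t, n)
z = A \ y                               # least squares solution of min ||y - A x||

T = collect(range(-1, 1, length=M))     # finer grid used to assess the error
ftilde = MonomialBasis(T, n) * z        # approximant sampled at M points
maxerr = maximum(abs.(Runge.(T) .- ftilde))
```

At the minimizer the residual `y - A*z` is orthogonal to the columns of `A`, which gives a quick sanity check on the computed coefficients.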

When carrying out convergence studies such as these, it is convenient to define functions for common tasks:

$\circ$ sample. Returns a sample of $f : [a, b] \to ℝ$ at $m$ equidistant points.

$\circ$ plotLSQ. Constructs a figure with plots of:

The $m$ sample points (i.e., data) represented as black dots;
The function sampled at more points, i.e., $M ⩾ m$;
The approximation sampled at $M$ points.
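The source code of these helpers is not reproduced in the text, so the following is only a guess at what they might look like. The signature of `sample` and the helper `lsqcurves` are hypothetical. `lsqcurves` returns the three data sets that plotLSQ is described as drawing, without doing any plotting, and its argument order is modeled on the plotLSQ call used in the solution, which is itself an assumption.

```julia
# Hypothetical helper: sample f at m equidistant points of [a, b]
function sample(a, b, f, m)
    t = collect(range(a, b, length=m))
    return t, f.(t)
end

# Hypothetical, plot-free companion to plotLSQ.
# Assumed argument order: interval [a, b], function f, basis constructor,
# m data points, n basis functions, M evaluation points.
function lsqcurves(a, b, f, Basis, m, n, M)
    t, y = sample(a, b, f, m)        # data (the black dots)
    T, Y = sample(a, b, f, M)        # function on the finer grid
    z = Basis(t, n) \ y              # least squares coefficients
    return (t=t, y=y, T=T, Y=Y, Yfit=Basis(T, n) * z)
end

Runge(t) = 1 / (1 + 25t^2)
MonomialBasis(t, n) = [ti^(j - 1) for ti in t, j in 1:n]
curves = lsqcurves(-1, 1, Runge, MonomialBasis, 16, 4, 64)
```

Separating the computation from the drawing makes it possible to reuse the same data for the convergence plot.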

The solution to this problem is obtained by:

defining the function $f$

∴	Runge(t)=1/(1+25*t^2);

defining the basis

∴	function MonomialBasis(t,n)
	  # columns 1, t, t.^2, ..., t.^(n-1) sampled at the points in t
	  m=size(t)[1]; A=ones(m,1);
	  for j=1:n-1
	    A = [A t.^j]
	  end
	  return A
	end;

invoking plotLSQ with appropriate parameters

∴	clf(); plotLSQ(-1,1,Runge,MonomialBasis,16,4,64);

∴	FigPrefix=homedir()*"/courses/MATH661/images/H03";

∴	savefig(FigPrefix*"Fig01.eps")

Figure 2. Least squares approximant (red) of the Runge function (blue) sampled at the data points (black dots).

Once the above are defined, cycling through the parameter ranges is straightforward.


Figure 3. Effect of increasing number of monomial basis functions in least squares approximation of Runge function.
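The point data ℰ for the convergence plot can be gathered by looping over $𝒩$. The sketch below is illustrative only: the tasks do not fix $m$ for the convergence study, so $m = 64$ is an assumption, and `MonomialBasis` is written here as a compact comprehension.

```julia
using LinearAlgebra

Runge(t) = 1 / (1 + 25t^2)
MonomialBasis(t, n) = [ti^(j - 1) for ti in t, j in 1:n]

m = 64                                  # assumed number of sample points
Ns = [4, 8, 16, 32]                     # the set 𝒩 from the problem statement
t = collect(range(-1, 1, length=m))
y = Runge.(t)

# One (log n, log ||y - A x||) pair per basis size n
E = map(Ns) do n
    A = MonomialBasis(t, n)
    z = A \ y
    (log(n), log(norm(y - A * z)))
end
```

Note that the monomial matrix becomes severely ill-conditioned as $n$ grows, which is worth keeping in mind when interpreting the residuals for the larger values of $n$.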

Problem 2. Instead of the equidistant point samples of the Runge example above, use the Chebyshev nodes

t_{i} = \cos (\frac{2 i - 1}{2 m} π),

keeping other parameters as in Problem 1.
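A direct transcription of the node formula into Julia might read as follows. The function name `ChebyshevNodes` is a hypothetical choice made for illustration.

```julia
# Chebyshev nodes t_i = cos((2i - 1) π / (2m)), i = 1, ..., m
ChebyshevNodes(m) = [cos((2i - 1) * π / (2m)) for i in 1:m]

t = ChebyshevNodes(16)
```

With this indexing the nodes are returned in decreasing order, from just below 1 to just above -1, and they cluster toward the interval endpoints. Apply `sort` if an increasing ordering is preferred for plotting.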

Problem 3. Instead of the monomial family of the Runge example, use the Fourier basis

A (t) = [\begin{array}{llllll} 1 & \cos π t & \sin π t & \dots & \cos π n t & \sin π n t \end{array}]

keeping other parameters as in Problem 1. In this case $𝑨 \in ℝ^{m \times (2 n + 1)}$.
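One possible way to assemble the sampled Fourier basis is sketched below. The name `FourierBasis` and its `(t, n)` signature are assumptions, chosen to mirror the `MonomialBasis(t, n)` convention of the Runge example.

```julia
# Sampled Fourier basis: columns 1, cos(πt), sin(πt), ..., cos(nπt), sin(nπt)
function FourierBasis(t, n)
    A = ones(length(t), 2n + 1)
    for k in 1:n
        A[:, 2k]     .= cos.(k * π * t)   # cosine mode k
        A[:, 2k + 1] .= sin.(k * π * t)   # sine mode k
    end
    return A
end

t = collect(range(-1, 1, length=16))
A = FourierBasis(t, 4)                    # 16-by-9 matrix
```

Since the matrix has $2 n + 1$ columns, the larger values of $n$ combined with small $m$ give more columns than rows.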

Problem 4. Instead of the monomial family of the Runge example, use the piecewise linear B-spline basis

A (t) = [\begin{array}{llll} N_{1} (t) & N_{2} (t) & \dots & N_{n} (t) \end{array}],

N_{i} (t) = \begin{cases} 0, & t < t_{i - 1} \\ \frac{t - t_{i - 1}}{h}, & t_{i - 1} ⩽ t < t_{i} \\ \frac{t_{i + 1} - t}{h}, & t_{i} ⩽ t < t_{i + 1} \\ 0, & t_{i + 1} ⩽ t \end{cases}, \quad h = \frac{2}{m - 1},

keeping other parameters as in Problem 1.
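For equidistant knots with $t_{i \pm 1} = t_{i} \pm h$, the piecewise definition collapses to $N_{i} (t) = \max (0, 1 - | t - t_{i} | / h)$, which is convenient to code. The sketch below uses the hypothetical names `hat` and `HatBasis` and takes the knots and spacing as arguments. The knot placement in the usage lines ($n$ equidistant knots spanning $[- 1, 1]$) is an assumption made for illustration; substitute the knots and spacing of the problem statement as needed.

```julia
# Hat function centered at knot c with half-width h
hat(t, c, h) = max(0.0, 1 - abs(t - c) / h)

# Sampled basis: one column per knot, one row per sample point
HatBasis(t, knots, h) = [hat(ti, c, h) for ti in t, c in knots]

# Usage with assumed knot placement: n equidistant knots on [-1, 1]
n = 5
knots = collect(range(-1, 1, length=n))
h = knots[2] - knots[1]
t = collect(range(-1, 1, length=16))
A = HatBasis(t, knots, h)
```

When the knots span the sampling interval, each row of `A` sums to one, and at most two entries per row are nonzero.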

2 Track 2

Problem 1. If $𝑸 \in ℂ^{m \times n}$ has orthonormal columns, prove that $𝑷_{𝑸} = 𝑸 𝑸^{*}$ is an orthogonal projector onto $C (𝑸)$. Determine the expression of $𝑷_{𝑨}$, the projector onto $C (𝑨)$, with $𝑨 \in ℂ^{m \times n}$. Compare the number of arithmetic operations required to compute $𝒚 = 𝑷_{𝑨} 𝒙$ directly with the number required by first determining the $Q R$ factorization, $𝑨 = 𝑸 𝑹$, and then computing $𝒚 = 𝑸 𝑸^{*} 𝒙$.
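Before writing the proof, it can help to check the claimed properties numerically on a small example. The sketch below is a sanity check, not a proof. The test matrix is an arbitrary choice, and `Matrix(qr(A).Q)` is used to obtain the thin factor with orthonormal columns.

```julia
using LinearAlgebra

# Arbitrary small complex matrix with linearly independent columns
A = ComplexF64[1 im; 1 2; 0 1+im; 2 0]
Q = Matrix(qr(A).Q)        # thin factor: 4-by-2, orthonormal columns
P = Q * Q'                 # candidate orthogonal projector onto C(Q)

idempotent = norm(P * P - P)    # P^2 = P
hermitian  = norm(P' - P)       # P^* = P
fixesA     = norm(P * A - A)    # vectors in C(A) = C(Q) are left unchanged
```

All three quantities should be at the level of rounding error.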

Problem 2. Continuing Problem 1, determine ${|| 𝑷_{𝑸} ||}_{2}$, and express ${|| 𝑷_{𝑨} ||}_{2}$ in terms of the singular value decomposition of $𝑨$. Comment on the result, considering, say, the length of shadows at various times of day.

Problem 3. A matrix $𝑨 = [a_{i j}] \in ℂ^{m \times n}$ is said to be banded with bandwidth $B$ if $a_{i j} = 0$ for $| i - j | > B$. Implement the modified Gram-Schmidt algorithm for $𝑨 \in ℂ^{m \times n}$ a banded matrix with bandwidth $B$ using as few arithmetic operations as possible.
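As a reference point, a dense modified Gram-Schmidt sketch is given below. It deliberately does not exploit the band structure, so it is a baseline against which a banded implementation can be tested, not a solution of the exercise. The function name `mgs` and the tridiagonal-like test matrix are illustrative assumptions.

```julia
using LinearAlgebra

# Dense modified Gram-Schmidt: returns Q (m-by-n, orthonormal columns)
# and R (n-by-n, upper triangular) with A = Q R. No band structure is used.
function mgs(A)
    m, n = size(A)
    Q = float.(A)                          # working copy in floating point
    R = zeros(eltype(Q), n, n)
    for j in 1:n
        R[j, j] = norm(Q[:, j])
        Q[:, j] /= R[j, j]                 # normalize column j
        for k in j+1:n
            R[j, k] = dot(Q[:, j], Q[:, k])    # q_j^* a_k
            Q[:, k] -= R[j, k] * Q[:, j]       # remove the q_j component
        end
    end
    return Q, R
end

# Test matrix with bandwidth B = 1
A = [2.0 1 0 0; 1 2 1 0; 0 1 2 1; 0 0 1 2; 0 0 0 1]
Q, R = mgs(A)
```

Comparing operation counts of a banded variant against this baseline is one way to quantify the savings.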

Problem 4. Solve Problem 1, Track 1.

Problem 5. Solve Problem 4, Track 1.

Problem 6. In Problem 1, Track 1, replace the monomial basis with the Legendre polynomials, whose samples are determined by $Q R$ decomposition $𝑸 𝑹 = 𝑨$. The resulting least squares problem is now

{min}_{𝒙 \in ℝ^{n}} {|| 𝒚 - 𝑸 𝒙 ||}_{2} .
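Because $𝑸$ has orthonormal columns, the minimizer is simply $𝒙 = 𝑸^{T} 𝒚$, with no linear system to solve. A minimal sketch follows, assuming the sampled monomial matrix of Problem 1, Track 1 with illustrative sizes $m = 16$, $n = 4$, and using `Matrix(qr(A).Q)` for the thin factor.

```julia
using LinearAlgebra

Runge(t) = 1 / (1 + 25t^2)

m, n = 16, 4                                   # illustrative sizes (assumed)
t = collect(range(-1, 1, length=m))
y = Runge.(t)

A = [ti^(j - 1) for ti in t, j in 1:n]         # sampled monomial basis
Q = Matrix(qr(A).Q)                            # orthonormalized basis samples

x = Q' * y                                     # minimizer of ||y - Q x||_2
z = A \ y                                      # monomial coefficients, for comparison
gap = norm(Q * x - A * z)                      # both fits are the projection of y onto C(A)
```

The coefficient vectors `x` and `z` differ, but the fitted values agree to rounding error since $C (𝑸) = C (𝑨)$.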