MATH 661.FA23 Midterm Examination 1

MATH 661.FA23 Midterm Examination 1 - Solution

Solve the problems for your appropriate course track. Problems probe understanding of the definitions and results from the module on floating point arithmetic and linear algebra. Formulate your answers clearly, cogently, and include a concise description of your approach. Each question is meant to be completely answered within ten minutes. Allowed test time is 75 minutes.

1Common problems

Matrix $𝑨 \in ℝ^{m \times m}$ , $rank (𝑨) = m$ , has the singular value decomposition (SVD) $𝑨 = 𝑼 𝚺 𝑽^{T}$ ( $𝑼 𝑼^{T} = 𝑼^{T} 𝑼 = 𝑰$ , $𝑽 𝑽^{T} = 𝑽^{T} 𝑽 = 𝑰$ , $𝚺 = diag (σ_{1}, . ., σ_{m})$ , $σ_{1} ⩾ σ_{2} ⩾ \dots ⩾ σ_{m} > 0$ ) and the pseudoinverse $𝑨^{+} = 𝑽 𝚺^{+} 𝑼^{T}$ , $𝚺^{+} = diag (1 / σ_{1}, . ., 1 / σ_{m})$ . Find the SVDs of:
1. $𝑨^{T} 𝑨$ ;
2. ${(𝑨^{T} 𝑨)}^{+}$ ;
3. ${(𝑨^{T} 𝑨)}^{+} 𝑨^{T}$ ;
4. $𝑨 {(𝑨^{T} 𝑨)}^{+}$ ;
5. $𝑨 {(𝑨^{T} 𝑨)}^{+} 𝑨^{T}$ .
Solution.
1. $𝑨^{T} 𝑨 = {(𝑼 𝚺 𝑽^{T})}^{T} 𝑼 𝚺 𝑽^{T} = 𝑽 𝚺^{T} 𝑼^{T} 𝑼 𝚺 𝑽^{T} = 𝑽 𝚺^{T} 𝚺 𝑽^{T} = 𝑽 𝚺^{2} 𝑽^{T}$ , since $𝚺^{T} = 𝚺$ . The SVD is
  $𝑨^{T} 𝑨 = 𝑽 𝚺^{2} 𝑽^{T}$
  and the singular values of $𝑨^{T} 𝑨$ are the squares of those of $𝑨$ .
2. ${(𝑨^{T} 𝑨)}^{+} = {(𝑽 𝚺^{2} 𝑽^{T})}^{+} = 𝑽 {(𝚺^{2})}^{+} 𝑽^{T}$ . This is not an SVD since
  ${(𝚺^{2})}^{+} = diag (1 / σ_{1}^{2}, . ., 1 / σ_{m}^{2}), 1 / σ_{1}^{2} ⩽ 1 / σ_{2}^{2} ⩽ \dots ⩽ 1 / σ_{m}^{2} .$
  Introduce permutation matrix $𝑷 = [\begin{array}{llll} 𝒆_{m} & 𝒆_{m - 1} & . . . & 𝒆_{1} \end{array}]$ which is orthogonal, $𝑷 𝑷^{T} = 𝑰$ and symmetric $𝑷 = 𝑷^{T}$ to obtain
  ${(𝚺^{2})}^{+} 𝑷 = diag (1 / σ_{m}^{2}, . ., 1 / σ_{1}^{2}) = 𝚲,$
  the correct ordering and the SVD
  ${(𝑨^{T} 𝑨)}^{+} = 𝑽 {(𝚺^{2})}^{+} 𝑽^{T} = 𝑽 {(𝚺^{2})}^{+} 𝑷 𝑷^{T} 𝑽^{T} = 𝑽 𝚲 {(𝑽 𝑷)}^{T},$
  since $𝑽 𝑷$ , the product of two orthogonal matrices, is itself orthogonal.
3. Calculate
  ${(𝑨^{T} 𝑨)}^{+} 𝑨^{T} = 𝑽 𝚲 {(𝑽 𝑷)}^{T} 𝑽 𝚺^{T} 𝑼^{T} = 𝑽 𝚲 𝑷^{T} 𝑽^{T} 𝑽 𝚺 𝑼^{T} = 𝑽 𝚲 𝑷^{T} 𝚺 𝑼^{T} .$
  Since
  $𝑷^{T} 𝚺 = 𝑷 𝚺 = diag (σ_{m}, . ., σ_{1}),$
  obtain
  $𝚲 𝑷^{T} 𝚺 = diag (1 / σ_{m}, . ., 1 / σ_{1}) = 𝚪,$
  and the SVD
  ${(𝑨^{T} 𝑨)}^{+} 𝑨^{T} = 𝑽 𝚪 𝑼^{T} .$
4. Calculate
  $𝑨 {(𝑨^{T} 𝑨)}^{+} = 𝑼 𝚺 𝑽^{T} 𝑽 𝚲 {(𝑽 𝑷)}^{T} = 𝑼 𝚺 𝚲 {(𝑽 𝑷)}^{T} = 𝑼 𝚪 {(𝑽 𝑷)}^{T},$
  which is an SVD.
5. Use above and $𝚺 𝚪 = 𝑰$ to write
  $𝑨 {(𝑨^{T} 𝑨)}^{+} 𝑨^{T} = 𝑼 𝚺 𝑽^{T} 𝑽 𝚪 𝑼^{T} = 𝑼 𝚺 𝚪 𝑼^{T} = 𝑰 .$

2Track 1

Write pseudo-code to accurately evaluate the sum
$S_{2 n} = \sum_{k = 1}^{2 n} \frac{{(- 1)}^{k + 1}}{k} x^{k}$
in floating point arithmetic when $x = 1 + ε$ , $1 ≫ ε > 0$ . ( ${lim}_{n \to \infty} S_{2 n} = \ln (1 + x)$ ).

Solution. There is possible loss of precision from successive terms of alternating signs, the effect of which can be attenuated by adding two terms at a time to the sum accumulator
$S_{2 n} = \sum_{k = 1}^{n} x^{2 k - 1} (\frac{1}{2 k - 1} - \frac{x}{2 k})$

$S = 0$ ; $y = x$ ; $x 2 = x^{2}$

for $k$ =1 to $n$

$l = 2 k; d = 1 / (l - 1) - x / l$

$t = y \cdot l$ ; $S = S + t$ ; $y = y \cdot x 2$
Use the SVD of $𝑨 \in ℝ^{m \times n}$ to express the Moore-Penrose pseudoinverse as a sum of rank-one matrices.

Solution. The SVD of $𝑨$ is $𝑨 = 𝑼 𝚺 𝑽^{T}$ and the pseudoinverse is written as
$𝑨^{+} = 𝑽 𝚺^{+} 𝑼^{T} = [\begin{array}{lll} 𝒗_{1} & . . & 𝒗_{n} \end{array}] [\begin{array}{lllll} 1 / σ_{1} \\ ⋱ \\ 1 / σ_{r} \\ 0 \end{array}] [\begin{array}{l} 𝒖_{1}^{T} \\ ⋮ \\ 𝒖_{m}^{T} \end{array}] = \sum_{j = 1}^{r} \frac{1}{σ_{j}} 𝒗_{j} 𝒖_{j}^{T},$
a sum of the rank-1 updates $𝒗_{j} 𝒖_{j}^{T} / σ_{j}$ .

3Track 2

Let $𝑨 \in ℝ^{m \times n}$ . Show that the Moore-Penrose pseudoinverse $𝑿 = 𝑨^{+}$ minimizes ${|| 𝑨 𝑿 - 𝑰 ||}_{F}$ over all $n$ by $m$ matrices.

Solution. The squared Frobenius norm of $𝑨 𝑿 - 𝑰 = [\begin{array}{lll} 𝑨 𝒙_{1} - 𝒆_{1} & 𝑨 𝒙_{n} - 𝒆_{n} \end{array}]$ is the sum of its squared column vector 2-norms
${|| 𝑨 𝑿 - 𝑰 ||}_{F}^{2} = \sum_{j = 1}^{n} {|| 𝑨 𝒙_{j} - 𝒆_{j} ||}_{2}^{2},$
and the minimum is attained by the solution of the $n$ least squares problems

${min}_{𝒙_{j}} {|| 𝑨 𝒙_{j} - 𝒆_{j} ||}_{2} \Rightarrow 𝒙_{j} = 𝑽 𝚺^{+} 𝑼^{T} 𝒆_{j} \Rightarrow 𝑿 = 𝑨^{+} 𝑰 = 𝑨^{+} .$
Let $𝑨 \in ℂ^{m \times m}$ be skew-Hermitian, i.e., $𝑨^{*} = - 𝑨$ . Prove that:
1. $𝑰 - 𝑨$ is nonsingular;
2. $𝑪 = {(𝑰 - 𝑨)}^{- 1} (𝑰 + 𝑨)$ is unitary.
Solution. a) For $m = 1$ , $a + a^{*} = 0$ , and $b = 1 - a$ nonsingular implies ${|| b ||}_{2}^{2} = b^{*} b > 0$ , readily verified
$b^{*} b = (1 - a^{*}) (1 - a) = 1 + a^{*} a > 0 .$
This also holds for $m > 1$ , $𝑨 + 𝑨^{*} = 0$ , and $𝑩 = 𝑰 - 𝑨$ nonsingular implies ${|| 𝑩 𝒙 ||}_{2}^{2} = {|| 𝒚 ||}_{2}^{2} > 0$ for any $𝒙$ of unit norm. Compute
$𝒚^{*} 𝒚 = 𝒙^{*} 𝑩^{*} 𝑩 𝒙 = 𝒙^{*} (𝑰 + 𝑨^{*} 𝑨) 𝒙 = 1 + {|| 𝒚 ||}_{2}^{2} > 0 .$
b) Again, use $m = 1$ to gain insight in which case $c = {(1 - a)}^{- 1} (1 + a)$ and compute
$c^{*} c = (1 + a^{*}) {(1 - a^{*})}^{- 1} {(1 - a)}^{- 1} (1 + a) = (1 - a) {[(1 - a) (1 + a)]}^{- 1} (1 + a) = (1 - a) {(1 - a^{2})}^{- 1} (1 + a)$ $c^{*} c = (1 - a) {[(1 + a) (1 - a)]}^{- 1} (1 + a) = (1 - a) {(1 - a)}^{- 1} {(1 + a)}^{- 1} (1 + a) = 1$
Similarily, for $m > 1$
$𝑪^{*} 𝑪 = (𝑰 + 𝑨^{*}) {(𝑰 - 𝑨^{*})}^{- 1} {(𝑰 - 𝑨)}^{- 1} (𝑰 + 𝑨) = (𝑰 - 𝑨) {[(𝑰 - 𝑨) (𝑰 - 𝑨^{*})]}^{- 1} (𝑰 + 𝑨) \Rightarrow$

$𝑪^{*} 𝑪 = (𝑰 - 𝑨) {[(𝑰 + 𝑨) (𝑰 - 𝑨)]}^{- 1} (𝑰 + 𝑨) = (𝑰 - 𝑨) {(𝑰 - 𝑨)}^{- 1} {(𝑰 + 𝑨)}^{- 1} (𝑰 + 𝑨) = 𝑰 .$