MATH661 HW04 - Midterm review

Posted: 09/20/23

Due: 09/27/23, 11:59PM

At this point in the course homework has addressed:

HW0

Tools needed for scientific computation (number representation, number approximation techniques, basic coding constructs, an environment for method documentation and reproducible computational experiments).

HW2

Discretization of continuous functions leads to finite-dimensional vectors that can often be approximated by linear combination of just a few of the basis vectors required for the entire space.

HW3

Large data sets, readily acquired from observations, can guide selection of vectors within a basis to obtain data compression or efficient data representation through linear combination.

Homework 4 reinforces analytical skills within the mathematical framework of finite-dimensional vector spaces used for the above topics. Such technical proficiency is just as important as efficient coding. The midterm examination verifies proficiency in such analytical skills

Note: The exercises below contain well-known results, but should be attempted individually and independently, without recourse to references. Simply looking up a proof and transcribing it will not aid understanding nor ensure good results on the midterm examination. If you do not obtain an exercise proof within 10 minutes reread the relevant theoretical material from the lecture notes and then try again for another 15 minutes.

1Tracks 1 and 2

  1. Prove the parallelogram identity

    ||𝒙+𝒚||2+||𝒙-𝒚||2=2(||𝒙||2+||𝒚||2),

    for 𝒙,𝒚m, with |||| denoting the 2-norm.

    Solution. ||𝒙+𝒚||2+||𝒙-𝒚||2=(𝒙+𝒚)T(𝒙+𝒚)+(𝒙-𝒚)T(𝒙-𝒚)=(𝒙T+𝒚T)(𝒙+𝒚)+(𝒙T-𝒚T)(𝒙-𝒚)=𝒙T𝒙+𝒚T𝒙+𝒙T𝒚+𝒚T𝒚+𝒙T𝒙-𝒚T𝒙-𝒙T𝒚+𝒚T𝒚=2(𝒙T𝒙+𝒚T𝒚)=2(||𝒙||2+||𝒚||2).

  2. Consider 𝒖,𝒗V, 𝒱=(V,,+,) a vector space with norm induced by a scalar product ||𝒖||2=(𝒖,𝒖). Prove that ||𝒖||=||𝒗|| (𝒖+𝒗)(𝒖-𝒗). Is the converse true?

    Solution. ||𝒖||=||𝒗||||𝒖||2=||𝒗||2𝒖T𝒖=𝒗T𝒗𝒖T𝒖-𝒗T𝒗=0𝒖T𝒖+𝒗T𝒖-𝒖T𝒗-𝒗T𝒗=0(𝒖T+𝒗T)(𝒖-𝒗)=0(𝒖+𝒗)T(𝒖-𝒗)=0 (𝒖+𝒗)(𝒖-𝒗). Converse is also true:

    (𝒖+𝒗)(𝒖-𝒗)(𝒖+𝒗)T(𝒖-𝒗)=0𝒖T𝒖+𝒗T𝒖-𝒖T𝒗-𝒗T𝒗=0𝒖T𝒖=𝒗T𝒗||𝒖||=||𝒗||.

  3. Consider 𝑨m×m, C(𝑨)=m. Prove that

    (𝒙T𝑨T𝑨𝒙)1/2

    is a norm. (Track 2: generalize above to m)

    Solution. Verify norm properties:

    1. ||𝒙||=0𝒙=𝟎. ||𝒙||2=𝒙T𝑨T𝑨𝒙=𝒚T𝒚=0𝒚=𝟎. Consider now 𝑨𝒙=𝟎. Since 𝑨 is of full rank, 𝒙=𝟎 is the only solution. Note that if rank(𝑨)<m, the above is not a norm.

    2. ||c𝒙||=|c|||𝒙||. Compute: ||c𝒙||=(c𝒙T𝑨T𝑨c𝒙)1/2=|c|(𝒙T𝑨T𝑨𝒙)1/2=|c|||𝒙||.

    3. ||𝒙+𝒚||||𝒙||+||𝒚||. When 𝑨=𝑰 the standard Euclidean 2-norm ||||2 is obtained. For 𝑨𝑰 of full rank for any 𝒖,𝒗 there exist 𝒙,𝒚 such that 𝒖=𝑨𝒙, 𝒗=𝑨𝒚 and

      ||𝒙||=(𝒙T𝑨T𝑨𝒙)1/2=(𝒖T𝒖)1/2=||𝒖||2.

      Similarily ||𝒚||=||𝒗||2, ||𝒙+𝒚||=||𝒖+𝒗||2, hence it is sufficient to establish the triangle inequality for the 2-norm ||𝒖+𝒗||2||𝒖||2+||𝒗||2. Taking squares

      ||𝒖+𝒗||22=(𝒖+𝒗)T(𝒖+𝒗)=𝒖T𝒖+𝒗T𝒗+2𝒖T𝒗=||𝒖||22+||𝒗||22+2𝒖T𝒗||𝒖||22+||𝒗||22+2||𝒖||2||𝒗||2𝒖T𝒗||𝒖||2||𝒗||2,

      so it remains to establish the last inequality (known as the Schwarz inequality). The Schwarz inequality can be established by asking: when is equality obtained in the triangle inequality? This occurs if 𝒖,𝒗 are colinear, and suggest building the vector 𝒘=||𝒗||2𝒖-||𝒖||2𝒗 that becomes zero when 𝒖,𝒗 are colinear. Calculate

      0𝒘T𝒘=(||𝒗||2𝒖-||𝒖||2𝒗)T(||𝒗||2𝒖-||𝒖||2𝒗)=2||𝒖||22||𝒗||22-2||𝒖||2||𝒗||2𝒖T𝒗,

      from which 𝒖T𝒗||𝒖||2||𝒗||2, as desired.

  4. Construct the matrix 𝑨 that represents the mapping 𝒇:33, 𝒇 reflects a vector across the x1x2 plane. Construct the matrix 𝑩 that represents the mapping 𝒈:33, 𝒈 reflects a vector across the x2x3 plane. Determine the mapping represented by 𝑪=𝑩𝑨.

    Solution. Reflecting the point 𝒙=[ x1 x2 x3 ]T across the x1x2 plane gives 𝑨𝒙=[ x1 x2 -x3 ]T, hence

    𝑨=[ 𝑰2 𝟎 𝟎 -1 ]=[ 1 0 0 0 1 0 0 0 -1 ].

    Note the block structure of 𝑨. Since the first two components of 𝑨𝒙 are unchanged from those of 𝒙, an identity matrix on these two components 𝑰2 appears. Similarily

    𝑩=[ -1 0 0 0 1 0 0 0 1 ],𝑪=𝑩𝑨=[ -1 0 0 0 1 0 0 0 1 ][ 1 0 0 0 1 0 0 0 -1 ]=[ -1 0 0 0 1 0 0 0 -1 ],

    where 𝑪 is the reflection across the x2 axis.

  5. Prove that the inverse of a rank-1 perturbation of 𝑰 is itself a rank-1 perturbation of 𝑰, namely

    (𝑰+𝒖𝒗)-1=𝑰+θ𝒖𝒗.

    Determine the scalar θ.

    Solution. Assume 𝒖,𝒗𝟎. By definition of an inverse

    𝑰=(𝑰+𝒖𝒗)(𝑰+𝒖𝒗)-1=(𝑰+𝒖𝒗)(𝑰+θ𝒖𝒗)=𝑰+(θ+1)𝒖𝒗+θ𝒖𝒗𝒖𝒗

    (θ+1)𝒖𝒗+θ𝒖𝒗𝒖𝒗=𝟎.

    Note that in 𝒖𝒗𝒖𝒗 the product 𝒗𝒖 is a scalar, hence 𝒖𝒗𝒖𝒗=(𝒗𝒖)𝒖𝒗, and the above matrix equality becomes

    [θ+1+θ𝒗𝒖]𝒖𝒗=𝟎.

    For the above to be true for any 𝒖,𝒗 choose θ such that

    θ+1+θ𝒗𝒖=0θ=-11+𝒗𝒖,

    if 𝒗𝒖-1. When would 𝒗𝒖=-1? An example is -𝒆kT𝒆k in which case

    𝑰-𝒆kT𝒆k

    has a kth column of zeros and is therefore singular.

  6. Determine the rank of 𝑩=𝑨-1𝒖𝒗.

    Solution. The inverse 𝑨-1 exists only if 𝑨 is square, 𝑨m×m and of full rank, hence rank(𝑨-1)=m. What is 𝑨-1𝒖𝒗? Recall that a matrix-vector product 𝑪𝒘 is a linear combination of the columns of 𝑪, and the matrix-matrix product 𝑪𝑫 is simply a collection of matrix vector products

    𝑪𝑫=𝑪[ 𝒅1 𝒅2 𝒅n ]=[ 𝑪𝒅1 𝑪𝒅2 𝑪𝒅n ].

    Now 𝑫=𝒖𝒗 is of rank one, rank(𝒖𝒗)=1 with colinear columns

    𝑫=[ v1𝒖 v2𝒖 vn𝒖 ].

    Multiplying with 𝑪 gives

    𝑪𝑫=[ v1𝑪𝒖 v2𝑪𝒖 vn𝑪𝒖 ],

    again with colinear columns such that rank(𝑪𝑫)=1. Deduce that rank(𝑩)=rank(𝑨-1𝒖𝒗)=1.

  7. Write the inverse (𝑰+𝑨-1𝒖𝒗)-1 as a rank-1 perturbation of 𝑰.

    Solution. Since 𝑨-1𝒖𝒗=𝒘𝒗 is of rank one with 𝒘=𝑨-1𝒖, use

    (𝑰+𝒘𝒗)-1=𝑰-𝒘𝒗1+𝒗𝒘,

    to obtain

    (𝑰+𝑨-1𝒖𝒗)-1=𝑰-𝑨-1𝒖𝒗1+𝒗𝑨-1𝒖.

  8. Consider 𝑪=𝑨+𝒖𝒗=𝑨(𝑰+𝑨-1𝒖𝒗). Write 𝑪-1 as a rank-1 perturbation of 𝑨-1.

    Solution. Apply above results

    𝑪-1=[𝑨(𝑰+𝑨-1𝒖𝒗)]-1=(𝑰+𝑨-1𝒖𝒗)-1𝑨-1=(𝑰-𝑨-1𝒖𝒗1+𝒗𝑨-1𝒖)𝑨-1=𝑨-1-𝑨-1𝒖𝒗𝑨-11+𝒗𝑨-1𝒖

2Track 2

  1. For 𝒙m, prove ||𝒙||||𝒙||2.

    Solution. By definition

    ||𝒙||=maxi|xi|=|xk|,

    with k the index of the (not necessarily unique) maximal element. Then

    ||𝒙||2=(xk2+j=1,jkmxj2)1/2=(xk2+a)1/2xk,

    since a0.

  2. For 𝒙m, prove ||𝒙||2m||𝒙||.

    Solution. With above notations, |xj||xk|, hence

    ||𝒙||2=(j=1mxj2)1/2(mxk2)1/2=m||𝒙||.

  3. For 𝑨m×n, prove ||𝑨||n||𝑨||2.

    Solution. By definition

    ||𝑨||=sup||𝒙||=1||𝑨𝒙||.

    For 𝒚=𝑨𝒙m apply above results (1. and 2.) and ||𝑨𝑩||||𝑨||||𝑩|| to obtain

    ||𝑨𝒙||=||𝒚||||𝒚||2=||𝑨𝒙||2||𝑨||2||𝒙||2n||𝑨||2||𝒙||,

    and establish the bound

    ||𝑨𝒙||||𝒙||n||𝑨||2

    for any 𝒙, with the upper bound of the left hand side being ||𝑨||, hence

    ||𝑨||n||𝑨||2.

  4. For 𝑨m×n, prove ||𝑨||2m||𝑨||.

    Solution. For 𝒚=𝑨𝒙m apply above results (1. and 2.)

    ||𝑨𝒙||2||𝒙||2m||𝑨𝒙||||𝒙||2m||𝑨𝒙||||𝒙||m||𝑨||,

    for all 𝒙 including the one for which ||𝑨||2 is attained, hence ||𝑨||2m||𝑨||.

  5. Prove the Minkowski inequality: for p1, 𝒙,𝒚m, ||𝒙+𝒚||p||𝒙||p+||𝒚||p.

    Solution. The Minkowski inequality results from the Hölder inequality, for 1/p+1/q=1,

    i=1m|xiyi|(i=1m|xi|p)1/p(i=1m|yi|q)1/q

    for q=p/p-1.

  6. Construct the matrix 𝑫 that represents the mapping 𝒇:33, 𝒇 rotates a vector around the x3 axis by angle θ. Construct the matrix 𝑬 that represents the mapping 𝒇:33, 𝒇 rotates a vector around the x2 axis by angle φ.

    Solution.

    𝑫=[ cosθ sinθ 0 -sinθ cosθ 0 0 0 1 ],𝑬=[ cosφ 0 -sinφ 0 1 0 sinφ 0 cosφ ]

  7. What do 𝑫𝑬 and 𝑬𝑫 represent?

    Solution. Composite rotation in different order, 𝑫𝑬 first around x2 then around x3, 𝑬𝑫 first around x3 then around x2.

  8. Is 𝑫𝑬=𝑬𝑫 true? Explain.

    Solution. No, this is an example of non-commutative matrix multiplication. Counter-example θ=φ=π/2, and

    𝑫𝑬𝒆2=[ -1 0 0 ][ 0 0 -1 ]=𝑬𝑫𝒆2.