Lecture 17: Spectral approximations

1.Trigonometric basis

The monomial basis {1,t,t2,} for the vector space of all polynomials P(), and its derivatives (Lagrange, Newton, B-spline) allow the definition of an approximant pP() for real functions f:, e.g., for smooth functions fC(). A different approach to approximation in infinite-dimensional vector spaces such as P() or C() is to endow the vector space with a scalar product (f,g) and associated norm ||f||=(f,f)1/2. The availability of a norm allows definition of convergence of sequences and series.

Definition. A sequence {fn}n of elements of the normed vector space =(F,,+,) converges to f, fnf if ε>0, N(ε) such that ||fn-f||<ε for all n>N(ε).

Definition. The vector space =(F,,+,) with a scalar product (,):F×F is a Hilbert space if the limit of all Cauchy sequences is an element of F.

All Hilbert spaces have orthonormal bases, and of special interest are bases that arise Sturm-Liouville problems of relevance to the approximation task.

1.1.Fourier series - Fast Fourier transform

The L2([0,2π]) space of periodic, square-integrable functions is a Hilbert space (L2 is the only Hilbert space among the Lp function spaces), and has a basis

{12,cost,sint,,coskt,sinkt,}

that is orthonormal with respect to the scalar product

(f,g)=1π02πf(t)g(t)dt.

An element fL2([0,2π]) can be expressed as the linear combination

f(t)=a02+k=1[akcoskt+bksinkt].

An alternative orthonormal basis is formed by the exponentials

{e±int},n,

with respect to the scalar product

(f,g)=12π02πf(t)g(t)dt.

The partial sum

SNf(t)=k=-NNckeikt

has coefficients ck determined by projection

ck=(f,eikt)=12π02πf(t)e-iktdt,

that can be approximated by the Darboux sum on the partition tj=2πj/N

ck1Nj=1Nfje-iktj=1Nj=1NfjωN-jk

with

ω=exp[2πiN],

denoting the Nth root of unity. The Fourier coefficients are obtained through a linear mapping

𝒄=𝑾𝒇,

with 𝒄,𝒇N, and 𝑾N×N with elements

𝑾=[ω-jk]1j,kN.

The above discrete Fourier transform can be seen as a change of basis from the basis 𝑰 in which the coefficients of f are 𝒄 to the basis 𝑾 in which the coefficients are 𝒇.

1.2.Fast Fourier transform

Carrying out the matrix vector product 𝑾𝒇 directly would require 𝒪(N2) operations, but the cyclic structure of the 𝑾 matrix arising from the exponentiation of ω can be exploited to reduce the computational effort. Assume N=2P and separate even and odd indexed components of 𝒇

ck=j=1NfjωN-jk=j=1P[f2j-1ωN-(2j-1)k+f2jωN-2jk]=j=1Pf2jωP-jk+ωkj=1Pf2j-1ωP-jk.

Through the above, the 𝒪(N2) matrix-vector product is reduced to two smaller matrix-vector products, each requiring 𝒪(N2/4) operations. For N=2q, recursion of the above procedure reduces the overall operation count to 𝒪(qN), or in general for N composed of a small numer of prime factors, 𝒪(NlogN). The overall algorithm is known as the fast Fourier transform or FFT.

1.3.Data-sparse matrices from Sturm-Liouville problems

One step of the FFT can be understood as a special matrix factorization

𝑾N=[ 𝑰 𝑫N 𝑰 -𝑫N ][ 𝑾P 𝟎 𝟎 𝑾P ]𝑷N

where 𝑫N is diagonal and 𝑷N is the even-odd permutation matrix. Though the matrix 𝑾N is full (all elements are non-zero), its factors are sparse, with many zero elements. The matrix 𝑾N is said to be data sparse, in the sense that its specification requires many fewer than N2 numbers. Other examples of data sparse matrices include:

Toeplitz matrices

𝑨m×m has constant diagonal terms, e.g., for m=4

𝑨=[ a b c d e a b c f e a b g f e a ],

or in general the elements of 𝑨=[aij]1i,jm can be specified in terms of 2m-1 numbers a1-n,,an-1 through aij=ai-j.

Exterior products

Rank-1 updates arising in the singular value or eigenvalue decompositions have the form

𝑨=𝒖𝒗T=[ v1𝒖 v2𝒖 vm𝒖 ],

and the 2m components of 𝒖,𝒗 are suficient to specify the matrix 𝑨 with m2 components. This can be generalized to any exterior product of matrices 𝑩n×n, 𝑪p×p through

𝑨=𝑩𝑪=[ 𝒃1𝑪 𝒃2𝑪 bn𝑪 ]=[ b11𝑪 b12𝑪 b1n𝑪 b21𝑪 b22𝑪 b2n𝑪 bn1𝑪 bn2𝑪 bnn𝑪 ].

The m2=(np)2 components of 𝑨 are specified through only n2+p2 components of 𝑩,𝑪.

The relevance to approximation of functions typically arises due basis sets that are solutions to Sturm-Liouville problems. In the case of the Fourier transform e±ikt are eigenfunctions of the Sturm-Liouville problem

w''+λw=0,w=u+iv,u'(0)=u'(π)=0,v(0)=v(π)=0,

with eigenvalues λn=k2. The solution set {φ1,φ2,} to a general Sturm-Liouville problem to find f:[a,b]

ddt[p(t)dfdt]+q(t)f=-λw(t)f,

form an orthonormal basis under the scalar product

(f,g)=abf(t)g(t)w(t)dt,

and approximations of the form

ΦNf(t)=k=1Nckφk(t),

and Parseval's theorem states that

||𝒄||22=k=1ckck=||f||22=(f,f)=abf(t)f(t)w(t)dt,

read as an equality between the energy of f and that of 𝒄. By analogy to the finite-dimensional case, the Fourier transform is unitary in that it preserves lengths in the ||f||+(f,f)1/2 norm with weight function w(t)=1.

2.Wavelet approximations

The bases {φ1,φ2,} arising from Sturm-Liouville problems are single-indexed, giving functions of increasing resolution over the entire definition domain. For example sinkx resolves ever finer features over [0,2π]. When applied to a function with localized features, k must be increased with increased resolution in the entire [0,2π] domain. This leads to uneconomical approximation series SNf(t) with many terms, as exemplified by the Gibbs phenomenon in approximation of a step function, f(t)=H(t-π/2)-H(t-3π/2) for t[0,2π], and f(t+2π)=f(t). The approach can be represented as the decomposition of a space of functions by the direct sum

F=Φ1Φ2,

with Φk=span(φk), for example

L2=E0E1E-1E2E-2,

with Ek=span{eikt} for the Fourier series.

Approximation of functions with localized features is more efficiently accomplished by choosing some generating function ψ(t) and then defining a set of functions through translation and scaling, say

ψjk(t)=2-j/2ψ(2-jt-k).

Such systems are known as wavelets, and the simplest example is the step function

ψ(t)={ 1 0t<1/2 -1 1/2t<1 0 otherwise .,

with ψjk having support on the half-open interval hjk=[k2-j,(k+1)2-j). The set {ψ00,ψ01,} is known as an Haar orthonormal basis for L2() since

(ψjk,ψlm)=-ψjk(t)ψlm(t)dt=δjlδkm.

Approximations based upon a wavelet basis

f(t)=jk(f,ψjk)ψjk(t),

allow identification of localized features in f.

The costly evaluation of scalar products (f,ψjk) in the double summation can be avoided by a reformulation of the expansion as

f(t)=kcl,kφl(t)+jlkdj,kψjk(t), (1)

with . In addition to the ψ (“mother” wavelet), an auxilliary φ scaling function (“father” wavelet) is defined, for example

φ(t)={ 1 0t<1 0 otherwise .,

for the Haar wavelet system.

The above approach is known as a multiresolution representation and is based upon a hierarchical decomposition of the space of functions, e.g.,

L2=VlWlWl-1Wl-2

with

Vj=span{φjk|k.},Wj=span{ψjk|k.}.

The hierarchical decomposition is based upon the vector subspace inclusions

{0}<<V1<V0<V-1<V-2<<L2(),

and the relations

VmWm=Vm-1,

that state that the orthogonal complement of Vm within Vm-1 is Wm. Analogous to the FFT, a fast wavelet transformation can be defined to compute coefficients of (1).