Lecture 27: Differential Conservation Laws

1.The relevance of physics for scientific computation

Efficient algorithms often arise from the specifities of an underlying application domain, perhaps none more so than those inspired from physics. Classical physics can be derived from a remarkably small set of experimentally verified postulates.

Other constraints often refer to allowed spatial positions and are known generically as geometric constraints. Note that this is an idealization: in reality some other physical system M is interacting with the one being considered P and it is assumed that the system M is so much larger that its position does not change. Such idealization or modeling assumptions are often encountered. As another important example, the system P may exhibit energy dissipation, such as the decrease of an object's momentum due to friction. Energy is indeed lost from system P to the surrounding medium M, but the overall energy of the combined system M+P is conserved.

Scientific computation uses many concepts and terms from physics, such as the characterization of a numerical scheme for differential equations as being “conservative”, in the sense of maintaining the conserved physical quantities. Also, a remarkably large number of efficient algorithms arise from the desire to mimic physical properties. Conservation laws can be stated for a small enough spatial domain that it can be considered to be infinitesimal in the sense of calculus. In this case differential conservation laws are obtained. Alternatively, consideration of a finite-sized spatial domain leads to integral formulation of the conservation laws. In a large class of physical systems of current research interest models are constructed in which the evolution of the system P depends on the history of interactions with the surrounding medium M. Such systems are described by integro-differential laws, elegantly expressed through fractional derivatives.

2.Conservation laws

Banking example.
Conservation of some physical quantity is stated for a hypothesized isolated system. In reality no system is truly isolated and the most interesting applications come about from the study of interaction between two or more systems. This leads to the question of how one can follow the changes in physical quantities of the separate systems. An extremely useful procedure is to set up an accounting procedure. A mundane but illuminating example is quantity of Euros E in a building B. If the building is a commonplace one, it is to be expected that when completely isolated, the amount of currency in the building is fixed

E=E0. (3)

E0 is some constant. Equation (3) is self-evident but not particularly illuminating – of course the amount of money is constant if nothing goes in or out! Similar physics statements such as “the total mass-energy of the universe is constant” are again not terribly useful, though one should note this particular statement is not obviously true. Things get more interesting when we consider a more realistic scenario in which the system is not isolated. People might be coming and going from building B and some might actually have money in their pockets. In more leisurely economic times, one might be interested just in the amount of money in the building at the end of the day. Just a bit of thought leads to

En=En-1+ΔEn-1,n

where En is the amount of money at the end of day n, En-1 that from the previous day and ΔEn-1,n the difference between money received and that payed in the building during day n

ΔEn-1,n=Rn-1,n-Pn-1,n.

As economic activity picks up and we take building B to mean “bank” it becomes important to keep track of the money at all times, not just at the end of the day. It then makes sense to think of the rate at which money is moving in or out of the building so we can not only track the amount of currency at any given time, but also be able to make future predictions. Instead of separate receipts R and payments P, use a single quantity F to denote the amount of money leaving or entering building B during time interval Δt with the understanding that positive values of F represent incomes and negative ones expenditures. Such understandings go by the name of sign conventions. They're not especially meaningful but it aids communication if a single convention is adopted. The amount of euros in the building then changes in accordance to

E(t+Δt)=E(t)+FΔt, (4)

and F is known as a flux, the Latin term for flow.

While (4) is a good approximation for small intervals, errors arise when Δt is large since economic acitvity might change from hour to hour. Better accounting is obtained by considering F as defined at any given time t, such that F(t) is the instantaneous flux of euros at time t. The fundamental theorem of calculus then states

E(t+Δt)=E(t)+tt+ΔtF(τ)dτ, (5)

with the same significance as (4).

In a large bank one keeps track of the amount of money in individual rooms and the inflows and outflows through individual doors. A room or door can be identified by its spatial position 𝒙=(x1,x2,x3), but 𝒙 refers to a single point and physical currency occupies some space. The conceptual difficulty is overcome by introducing a fictitious density of currency at time t denoted by e(x,t). The only real meaning associated with this density is that the sum of all values of e(x,t) in some volume ω is the amount of currency in that volume

E(ω,t)=ωe(𝒙,t)d𝒙. (6)

On afterthought, the same sort of question should have arisen when E(t) was defined at one instant in time. Ingrained psychological perspectives make E(t) more plausible, but were we to live our lives such that quantum fluctuations are observable, E(t) would be much more questionable.

By an analogous procedure, define 𝒇(𝒙,τ) as the instantaneous flux density of euros in a small region around (𝒙,τ). This flux is a vector quantity to distinguish fluxes along different spatial directions. The flux density along direction 𝒏(𝒙) is given by 𝒇(𝒙,τ)𝒏(x). Consider 𝒏(𝒙) as the inward pointing unit vector normal to the surface B that bounds the bank. The total flux is again obtained by integrating flux densities

F(τ)=B𝒇(𝒙,τ)𝒏(𝒙)d𝒙. (7)

Gathering the above leads to re-expressing (4) or (5) gives

E(B,t+Δt)=E(B,t)+tt+ΔtB𝒇(𝒙,τ)𝒏(𝒙)d𝒙dτ (8)

Using (6) leads to the statement,

Be(𝒙,t+Δt)d𝒙=Be(𝒙,t)d𝒙+tt+ΔtB𝒇(𝒙,τ)𝒏(𝒙)d𝒙dτ.

There are special cases in which additional events affecting the balance of E can occur. When B is a reserve bank money might be (legally) printed and destroyed in the building. Again by analogy with fluid dynamics, such events are said to be sources of E within B, much like a underground spring is a source of surface water. Let Σ(t) be the total sources at time t. As before, Σ(t) might actually be obtained by summing over several sources placed in a number of positions, for instance the separate printing presses and furnaces that exist in B. It is useful to introduce a spatial density of sources σ(x,t). The conservation statement now becomes

Be(𝒙,t+Δt)d𝒙-Be(𝒙,t)d𝒙=tt+ΔtB𝒇(𝒙,τ)𝒏(𝒙)d𝒙dτ+tt+ΔtBσ(𝒙,τ)d𝒙dτ. (9)

The above encompasses all physical conservation laws, and is quite straightforward in interpretation:

change in Euros in B = net Euros coming in or going out of B + net Euros produced or destroyed in B.

It should be emphasized that the above statement has true physical meaning and is referred to as an integral formulation of a conservation law. The key term is “integral” and refers to the integration over some spatial domain.

Local formulations.
Equation (9) is useful and often applied directly in the analysis of physical systems. From an operational point of view it does have some inconveniences though. These have mainly to do with the integration domains B, sometimes difficult to describe and to perform integrations over. Avoid this by considering 𝒇(𝒙,t) defined everywhere, not only on B (the doors and windows of B). These internal fluxes can be shown to have a proper physical interpretation. Assuming that 𝒇(𝒙,t) is smooth allows use of the Gauss theorem to transform the surface integral over B into a volume integral over B

B𝒇(𝒙,τ)𝒏(𝒙)d𝒙=-B𝒇(𝒙,τ)d𝒙 (10)

The minus assign arises from the convention of an inward pointing normal. Applying (10) to (9) leads to

B[e(x,t+Δt)-e(x,t)+tt+Δt𝒇(𝒙,τ)dτ]dx=tt+ΔtBσ(𝒙,τ)d𝒙dτ. (11)

There was nothing special about the shape of the building B or the length of the time interval Δt we used in deriving (11), hence the equality should hold for infinitesimal domains

et+𝒇=σ, (12)

where, as is customary, the dependence of e,𝒇,σ on space and time is understood but not written out explicitly. Equation (12) is known as the local or differential form of the conservation law for E.

3.Special forms of conservation laws

Second law of dynamics.
The full general form (12) often arises in applications, but simplifications can arise from specific system properties. As a simple example, the dynamics of a point mass m which has no internal structure is described by the conservation of momentum statement

ddt(m𝒗)=𝑭. (13)

The correspondence with (12) is given by e(mv), σ𝑭, hence the statement: “external forces are sources of momentum”. Instead of a PDE, the lack of internal structure has led to an ODE.

Advection equation.
Other special forms of (12) are not quite so trivial. Often 𝒇,σ depend on e, with the specific form of this dependence is given by physical analysis. Accounting for all physical effects is so difficult that simple approximations are often used. For instance if 𝒇(e) is sufficiently smooth Taylor series expansion gives

𝒇(e)=𝒇0+𝒇'(e0)(e-e0)+. (14)

Choosing the origin such that 𝒇0=0 and e0=0, the simplest truncation is

𝒇(e)𝒇'(0)e=𝒖e, (15)

and the σ=0 form of (12) is

et+(𝒖e)=0. (16)

In this approximation 𝒖 is a constant giving

et+𝒖e=0 (17)

known as the constant velocity advection equation. Its one-dimensional form is the basis of much development in numerical methods for PDE's

et+uex=0 (18)

Diffusion equation.
Another widely encountered dependence of 𝒇 on e is of the form

𝒇(e)=-αe (19)

and this leads to

et-(αe)=σ(e). (20)

If there are no sources and α is a constant we have

et=α2e (21)

the heat or diffusion equation.

Combined effects.
Both above flux types can appear in which case the associated conservation law is

et+(𝒖e)=α2e, (22)

known as the advection-diffusion equation, and is a linear PDE. If sources σ exist the above becomes

et+(𝒖e)=α2e+σ, (23)

or for constant advection velocity 𝒖

et+𝒖e=α2e+σ. (24)

It is often the case that the flux depends on the conserved quantity itself, 𝒇(e)=𝒖(e)e, in which case (22) becomes a non-linear PDE.

Steady-state transport.
Various effects can balance leading to no observable time dependence, e/t=0. If there is no overall diffusive flux 𝒇(e)=-αe within an infinitesimal volume, then 𝒇=-α2e=0 leads to the Laplace equation

2e=0.

If the infinitesimal volume contains sources the Poisson equation

2e=σ,

is obtained.

Separation of variables.
Often, the time dependence can be isolated from the spatial dependence, e(𝒙,t)=X(𝒙)T(t), in which case the diffusion equation for constant α leads to

T˙T=α2XX=-λ,

with λ a positive constant to avoid unphysical exponential growth. The spatial part of the solution statisfies the Helmholtz equation

2X=-κ2X,

with κ2=λ/α. The above is interpreted as an eigenproblem for the Laplacian operator Δ=2.

The above special forms of differential conservation laws play an important role in scientific computation. Numerical techniques have been developed to capture the underlying physical behavior expressed in say the diffusion equation or the Helmholtz equation. These equations were first studied within physics, but they reflect universal behavior. Consider the Black-Scholes financial model for the price of an option V(S,t) on an asset S(t)

Vt+rSVS=rV-12σ2S22VS2,

with σ the standard deviation of stock market returns and r the annualized risk-free interest rate. The terminology might be totally different, but the same patterns emerge and the Black Scholes model can be interpreted as an advection diffusion equation with non-constant advection velocity rS, negative diffusion coefficient -σ2S2/2 and source term rV. The similarity to the physics advection and diffusion equations arises from the same type of modeling assumptions relating fluxes to state variables.