Really Optional — More Interpretation of Div and Curl

Section 4.6 Really Optional — More Interpretation of Div and Curl

We are now going to determine, in much more detail than before¹, what the divergence and curl of a vector field tells us about the flow of that vector field.

Consider a (possibly compressible) fluid with velocity field \(\vv(\vx,t)\text{.}\) Pick any time \(t_0\) and a really tiny piece of the fluid; assume that, at time \(t_0\text{,}\) it is a cube with corners at

\begin{equation*} \Set{\vx_0+n_1\veps\ha{1}+n_2\veps\ha{2}+n_3\veps\ha{3}}{n_1,n_2,n_3\in\{0,1\}} \end{equation*}

Here \(\veps \gt 0\) is the length of each edge of the cube and is assumed to be really small. The vectors \(\ha{1},\ \ha{2}\) and \(\ha{3}\) are three mutually perpendicular unit vectors that give the orientation of the edges of the cube. The vectors from the corner \(\vx_0\) to its three nearest neighbour corners are \(\veps\ha{1},\ \veps\ha{2}\) and \(\veps\ha{3}\text{.}\)

As time progresses, the chunk of fluid moves. In particular, the corners move. Let us denote by \(\veps\vb^{(1)}(t)\) the vector, at time \(t\text{,}\) joining the \(n_1=n_2=n_3=0\) corner to the \(n_1=1,\ n_2=n_3=0\) corner. Define \(\veps\vb^{(2)}(t)\) and \(\veps\vb^{(3)}(t)\) similarly. For times very close to \(t_0\) we can think of our chunk of fluid as being essentially a parallelepiped with edges \(\veps\vb^{(k)}(t)\text{.}\)

By concentrating on the edges \(\veps\vb^{(k)}(t)\) of the chunk of fluid, rather than the corners, we are ignoring any translations that the chunk of fluid might have undergone. We want, instead, to determine how the size and orientation of the parallelepiped changes as \(t\) increases.

At time \(t_0\text{,}\) \(\vb^{(k)}=\ha{k}\text{.}\) The velocities of the corners of the chunk of fluid at time \(t_0\) are

\begin{equation*} \vv\big(\vx_0+n_1\veps\ha{1}+n_2\veps\ha{2}+n_3\veps\ha{3},t_0\big) \end{equation*}

In particular, at time \(t_0\text{,}\) the tail of \(\veps\vb^{(k)}\) has velocity \(\vv(\vx_0,t_0)\) and the head of \(\veps\vb^{(k)}\) has velocity \(\vv(\vx_0+\veps\ha{k},t_0)\text{.}\) Consequently (using a Taylor approximation),

\begin{equation*} \veps\diff{\vb^{(k)}}{t}(t_0)=\vv\big(\vx_0+\veps\ha{k},t_0\big) -\vv\big(\vx_0,t_0\big) =\smsum_{j=1}^3\veps\frac{\partial\vv}{\partial x_j}\big(\vx_0,t_0\big)\ha{k}_j +O(\veps^2) \end{equation*}

and so

\begin{equation*} \diff{\vb^{(k)}}{t}(t_0) =\smsum_{j=1}^3\frac{\partial\vv}{\partial x_j}\big(\vx_0,t_0\big)\ha{k}_j +O(\veps) \end{equation*}

The notation \(O(\veps^n)\) represents a function that is bounded by a constant times \(\veps^n\) for all sufficiently small \(\veps\text{.}\) That is, we are saying that \(\diff{\vb^{(k)}}{t}(t_0)\) is \(\smsum_{j=1}^3\frac{\partial\vv}{\partial x_j}\big(\vx_0,t_0\big)\ha{k}_j\) plus a small error that is bounded by a constant time \(\veps\text{.}\) The notation \(\ha{k}_j\) just refers to the \(j^{\rm th}\) component of the vector \(\ha{k}\text{.}\)

Denote by \(\cV\) the \(3\times 3\) matrix whose \((i,j)\) matrix element is

\begin{equation*} \cV_{i,j}=\frac{\partial \vv_i}{\partial x_j}\big(\vx_0,t_0\big) \qquad 1\le i,j\le 3 \tag{M} \end{equation*}

Then we can write the above more compactly:

\begin{equation*} \diff{\vb^{(k)}}{t}(t_0)=\cV\vb^{(k)}(t_0)+O(\veps) \end{equation*}

Here \(\cV\vb^{(k)}(t_0)\) is the product of the \(3\times 3\) matrix \(\cV\) and the \(3\times 1\) column vector \(\vb^{(k)}(t_0)\text{.}\) We study the behaviour of \(\vb^{(k)}(t)\) for small \(\veps\) and \(t\) close to \(t_0\text{,}\) by studying the behaviour of the solutions to the initial value problems

\begin{equation*} \diff{\vb^{(k)}}{t}(t)=\cV\vb^{(k)}(t)\qquad \vb^{(k)}(t_0)=\ha{k} \tag{IVP} \end{equation*}

To warm up, we first look at two two-dimensional examples. In both examples, the velocity field \(\vv(x,y)\) is linear in \((x,y)\text{.}\) Consequently, in these examples, \(\vv\big(\vx_0+\veps\ha{k},t_0\big)-\vv\big(\vx_0,t_0\big)\) is exactly \(\smsum_{j=1}^3\veps\frac{\partial\vv} {\partial x_j}\big(\vx_0,t_0\big)\ha{k}_j\) and the solution to (IVP) coincides with the exact \(\vb^{(k)}(t)\text{.}\) Following each example, we discuss a broad class of \(\cV\)'s that generate behaviour similar to that example.

Example 4.6.1. \(\vv(x,y)= 2x\hi+3y\hj\).

In this example

\begin{equation*} \cV=\left[\begin{matrix}2&0\\ 0&3\end{matrix}\right] \end{equation*}

The solution to the initial value problem

\begin{equation*} \vb'(t)=\cV\vb(t)\quad \vb(0)=\left[\begin{matrix}\be_1\\\be_2\end{matrix}\right] \quad\text{or equivalently}\quad \begin{matrix}\vb_1'(t)=2\vb_1(t) & \vb_1(0)=\be_1\\ \vb_2'(t)=3\vb_2(t) & \vb_2(0)=\be_2\end{matrix} \end{equation*}

\begin{equation*} \begin{matrix}\vb_1(t)=e^{2t}\be_1\\ \vb_2(t)=e^{3t}\be_2\end{matrix} \quad\text{or equivalently}\quad \vb(t)=\left[\begin{matrix}e^{2t}&0\\ 0&e^{3t}\end{matrix}\right]\vb(0) \end{equation*}

If one chooses \(\ha{1}=\hi\) and \(\ha{2}=\hj\text{,}\) the edges, \(\vb^{(1)}(t)=e^{2t}\ha{1}\) and \(\vb^{(2)}(t)=e^{3t}\ha{2}\text{,}\) of the chunk of fluid never change direction. But their lengths do change. The relative rate of change of length per unit time, \(|\diff{\vb^{(k)}}{t}(t)|/|\vb^{(k)}(t)|\text{,}\) is \(2\) for \(\vb^{(1)}\) and 3 for \(\vb^{(2)}\text{.}\) In the figure below, the darker rectangle is the initial square. That is, the square with edges \(\vb^{(k)}(t_0)=\ha{k}\text{.}\) The lighter rectangle is that with edges \(\vb^{(k)}(t)\) for some \(t\) a bit bigger than \(t_0\text{.}\)

As time increases the initial cube becomes a larger and larger rectangle.

Example 4.6.2. Example 4.6.1, generalized.

The behaviour of Example 4.6.1 is typical of \(\cV\)'s that are symmetric matrices, i.e. that obey² \(\cV_{i,j}=\cV_{j,i}\) for all \(i,j\text{.}\) Any \(d\times d\) symmetric matrix³ (with real entries)

has \(d\) real eigenvalues
has \(d\) mutually orthogonal real unit eigenvectors.

Denote by \(\la_k,\ 1\le k\le d\text{,}\) the eigenvalues of \(\cV\) and choose \(d\) mutually perpendicular real unit vectors, \(\ha{k}\text{,}\) that obey \(\cV\ha{k}=\la_k\ha{k}\) for all \(1\le k\le d\text{.}\) Then

\begin{equation*} \vb^{(k)}(t)=e^{\la_k (t-t_0)}\ \ha{k} \end{equation*}

obeys

\begin{equation*} \diff{\vb^{(k)}}{t}(t)=\la_ke^{\la_k (t-t_0)}\ \ha{k}=e^{\la_k (t-t_0)}\ \cV\ha{k} =\cV\vb^{(k)}(t)\quad\text{and}\quad \vb^{(k)}(t_0)= \ha{k} \end{equation*}

So \(\vb^{(k)}(t)=e^{\la_k (t-t_0)}\ \ha{k}\) satisfies (IVP) for all \(t\) and \(1\le k\le d\text{.}\)

If we start, at time \(t_0\text{,}\) with a cube whose edges, \(\ha{k}\text{,}\) are eigenvectors of \(\cV\text{,}\) then as time progresses the edges, \(\vb^{(k)}(t)\text{,}\) of the chunk of fluid never change direction. But their lengths change with the relative rate of change of length per unit time being \(\la_k\) for edge number \(k\text{.}\) This rate of change may be positive (the edge grows with time) or negative (the edge shrinks in time) depending on the sign of \(\la_k\text{.}\)

The volume of the chunk of fluid at time \(t\) is \(V(t)=e^{\la_1 (t-t_0)}\cdots e^{\la_d (t-t_0)}\text{.}\) The relative rate of change of volume per unit time is \(V'(t)/V(t)=\la_1\cdots+\la_d\text{,}\) the sum of the \(d\) eigenvalues. The sum of the eigenvalues of any \(d\times d\) matrix \(\cV\) is given by its trace \(\sum_{i=1}^d \cV_{i,i}\text{.}\) For the matrix (M)

\begin{equation*} \frac{V'(t_0)}{V(t_0)} =\sum_{i=1}^d \frac{\partial v_i}{\partial x_i}\big(\vx_0,t_0\big) =\vnabla\cdot\vv\big(\vx_0,t_0\big) \end{equation*}

So, at least when the matrix \(\cV\) defined in (M) is symmetric, the divergence \(\vnabla\cdot\vv\big(\vx_0,t_0\big)\) gives the relative rate of change of volume per unit time for our tiny chunk of fluid at time \(t_0\) and position \(\vx_0\text{.}\) Thus when \(\vnabla\cdot\vv=0\) the volume is fixed. In particular, this is the case when the fluid is incompressible.

In fact we can relax the symmetry condition.

Example 4.6.3. Example 4.6.1, generalized yet again.

For any \(d\times d\) matrix \(\cV\text{,}\) the solution of

\begin{equation*} \vb'(t)=\cV\vb(t)\quad \vb(t_0)=\ve \end{equation*}

\begin{equation*} \vb(t)=e^{\cV (t-t_0)}\ve \end{equation*}

where the exponential of a \(d\times d\) matrix \(B\) is defined by the power series

\begin{equation*} e^B=\bbbone+B+\half B^2+\frac{1}{3!}B^3+\cdots=\smsum_{n=0}^\infty \frac{1}{n!}B^n \end{equation*}

with \(\bbbone\) denoting the \(d\times d\) identity matrix. This sum converges⁴

for all \(d\times d\) matrices \(B\text{.}\) Furthermore it easy to check, using the power series, that \(e^{\cV (t-t_0)}\) obeys \(\diff{}{t}e^{\cV (t-t_0)}=\cV e^{\cV (t-t_0)}\) and is the identity matrix when \(t=t_0\text{.}\) So \(\vb(t)=e^{\cV (t-t_0)}\ve\) really does obey \(\vb'(t)=\cV\vb(t)\) and \(\vb(t_0)=\ve\text{.}\)

Pick any \(d\) vectors \(\ve^{(k)},\ 1\le k\le d\text{,}\) and define \(\vb^{(k)}(t)=e^{\cV (t-t_0)}\ve^{(k)}\text{.}\) Also let \(E\) be the \(d\times d\) matrix whose \(k^{\rm th}\) column is \(\ve^{(k)}\) and \(E(t)\) be the \(d\times d\) matrix whose \(k^{\rm th}\) column is \(\vb^{(k)}(t)\text{.}\) Then the volume of the parallelepiped with edges \(\ve^{(k)},\ 1\le k\le d\text{,}\) is \(V(t_0)=\det E\) and the volume of the parallelepiped with edges \(\vb^{(k)}(t),\ 1\le k\le d\text{,}\) is

\begin{equation*} V(t)=\det E(t)=\det\big(e^{\cV (t-t_0)}E\big) =\det\big(e^{\cV (t-t_0)}\big)\det E =\det\big(e^{\cV (t-t_0)}\big)V(t_0) \end{equation*}

Of course now we have to compute the determinant of the exponential of a matrix. Luckily, there is an easy way to do this. For any \(d\times d\) matrix \(B\text{,}\) we have⁵ \(\det e^B=e^{\tr B}\text{,}\) where \(\tr B\text{,}\) called the trace of the matrix \(B\text{,}\) is the sum of the diagonal matrix elements of \(B\text{.}\) So

\begin{equation*} V(t)=e^{(t-t_0)\tr \cV }V(t_0)\qquad\Rightarrow\qquad \frac{V'(t_0)}{V(t_0)}=\tr \cV=\sum_{i=1}^d \cV_{i,i} \end{equation*}

So, for any matrix \(\cV\) defined as in (M) and any choice of \(\ha{k},\ 1\le k\le d\text{,}\) the divergence \(\vnabla\cdot\vv\big(\vx_0,t_0\big)\) gives the relative rate of change of volume per unit time for our tiny chunk of fluid at time \(t_0\) and position \(\vx_0\text{.}\)

Example 4.6.4. \(\vv(x,y)= -y\hi+x\hj\).

In this example

\begin{equation*} \cV=\left[\begin{matrix}0&-1\\ 1&0\end{matrix}\right] \end{equation*}

The solution⁶ to

\begin{equation*} \vb'(t)=\cV\vb(t)\quad \vb(0)=\left[\begin{matrix}\be_1 \\ \be_2\end{matrix}\right] \quad\text{or equivalently}\quad \begin{matrix}b_1'(t)=-b_2(t) & b_1(0)=\be_1\\ b_2'(t)=b_1(t) & b_2(0)=\be_2\end{matrix} \end{equation*}

\begin{equation*} \begin{matrix}b_1(t)=\be_1\cos t-\be_2\sin t\\ b_2(t)=\be_1\sin t+\be_2\cos t\end{matrix} \quad\text{or equivalently}\quad \vb(t)=\left[\begin{matrix}\cos t&-\sin t\\ \sin t& \cos t\end{matrix}\right]\vb(0) \end{equation*}

Consequently the vector \(\vb(t)\) has the same length as \(\vb(0)\text{.}\) The angle between \(\vb(t)\) and \(\vb(0)\) is just \(t\) radians. So, in this example, no matter what direction vectors \(\ha{k}\) we pick, the chunk of fluid just rotates at one radian per unit time. In the figure below, the outlined rectangle is the initial square. That is, the square with edges \(\vb^{(k)}(t_0)=\ha{k}\text{.}\) The shaded rectangle is that with edges \(\vb^{(k)}(t)\) for some \(t\) a bit bigger than \(t_0\text{.}\)

Example 4.6.5. Example 4.6.4, generalized.

The behaviour of Example 4.6.4 is typical of \(\cV\)'s that are antisymmetric matrices, i.e. that obey \(\cV_{i,j}=-\cV_{j,i}\) for all \(i,j\text{.}\) As we have already observed, for any \(d\times d\) matrix \(\cV\text{,}\) the solution of \(\ \vb'(t)=\cV\vb(t),\ \vb(0)=\ve \ \) is \(\ \vb(t)=e^{\cV t}\ve \text{.}\) We now show that if \(\cV\) is a \(3\times 3\) antisymmetric matrix, then \(e^{\cV t}\) is a rotation.

Assuming that \(\cV\) is not the zero matrix (in which case \(e^{\cV t}\) is the identity matrix for all \(t\)), we can find a number \(\Om \gt 0\) and a unit vector \(\hk=(k_1,k_2,k_3)\) (not necessarily the standard unit vector parallel to the \(z\)-axis) such that

\begin{equation*} \cV=\left[\begin{matrix}0 & -\Om k_3 & \Om k_2\\ \Om k_3& 0 & -\Om k_1\\ -\Om k_2& \Om k_1&0\end{matrix}\right] \tag{R} \end{equation*}

This is easy. Because \(\cV\) is antisymmetric, all of the entries on its diagonal must be zero. Define \(\Om\) to be \(\sqrt{\cV_{1,2}^2+\cV_{1,3}^2+\cV_{2,3}^2}\) and \(k_1=-\cV_{2,3}/\Om\text{,}\) \(k_2=\cV_{1,3}/\Om\text{,}\) \(k_3=-\cV_{1,2}/\Om\text{.}\) Also, let \(\hi\) be any unit vector orthogonal to \(\hk\) (again, not necessarily the standard one) and \(\hj=\hk\times\hi\text{.}\) So \(\hi,\ \hj,\ \hk\) is a right-handed system of three mutually perpendicular unit vectors.

Observe that, for any vector \(\ve=(e_1,e_2,e_3)\)

\begin{equation*} \cV\ve=\left[\begin{matrix}0 & -\Om k_3 & \Om k_2\\ \Om k_3& 0 & -\Om k_1\\ -\Om k_2& \Om k_1&0\end{matrix}\right] \left[\begin{matrix}e_1\\ e_2\\ e_3\end{matrix}\right] =\Om\left[\begin{matrix}k_2e_3-k_3e_2\\ k_3e_1-k_1e_3\\ k_1e_2-k_2e_1\end{matrix}\right] =\Om\hk\times \ve \end{equation*}

In particular,

\begin{align*} \cV\hi&=\Om\hk\times \hi=\Om\hj & \cV\hj&=\Om\hk\times \hj=-\Om\hi & \cV\hk&=\Om\hk\times \hk=\vZero\\ \cV^2\hi&=\Om \cV\hj=-\Om^2\hi & \cV^2\hj&=-\Om \cV\hi=-\Om^2\hj & \cV^2\hk&=\cV\vZero=\vZero\\ \cV^3\hi&=\Om \cV^2\hj=-\Om^3\hj & \cV^3\hj&=-\Om \cV^2\hi=\Om^3\hi & \cV^3\hk&=\cV^2\vZero=\vZero\\ \cV^4\hi&=\Om \cV^3\hj=\Om^4\hi & \cV^4\hj&=-\Om \cV^3\hi=\Om^4\hj & \cV^4\hk&=\cV^3\vZero=\vZero \end{align*}

and so on. For all odd \(n\ge 1\text{,}\)

\begin{equation*} \cV^n\hi=(-1)^{(n-1)/2}\Om^n\hj \qquad \cV^n\hj=-(-1)^{(n-1)/2}\Om^n\hi \qquad \cV^n\hk=\vZero \end{equation*}

and all even \(n\ge 2\text{,}\)

\begin{equation*} \cV^n\hi=(-1)^{n/2}\Om^n\hi \qquad \cV^n\hj=(-1)^{n/2}\Om^n\hj \qquad \cV^n\hk=\vZero \end{equation*}

Hence we can write

\begin{align*} e^{\cV t}\hi &=\smsum_{n=0}^\infty \frac{1}{n!}(\cV t)^n\hi =\smsum_{n\ {\rm even}}\!\! \frac{(-1)^{n/2}}{n!}(\Om t)^n\hi \ +\smsum_{n\ {\rm odd}}\! \frac{(-1)^{(n-1)/2}}{n!}(\Om t)^n\hj\\ &=\phantom{-}\cos(\Om t)\,\hi+\sin(\Om t)\,\hj\\ e^{\cV t}\hj &=\smsum_{n=0}^\infty \frac{1}{n!}(\cV t)^n\hj =\smsum_{n\ {\rm even}}\!\! \frac{(-1)^{n/2}}{n!}(\Om t)^n\hj \ -\smsum_{n\ {\rm odd}}\! \frac{(-1)^{(n-1)/2}}{n!}(\Om t)^n\hi\\ &=-\sin(\Om t)\,\hi+\cos(\Om t)\,\hj\\ e^{\cV t}\hk &=\smsum_{n=0}^\infty \frac{1}{n!}(\cV t)^n\hk\\ &=\hk \end{align*}

So \(e^{\cV t}\) is rotation by an angle \(\Om t\) about the axis \(\hk\text{.}\)

Example 4.6.6. Example 4.6.5, continued.

Whether or not the matrix \(\cV\) defined in (M) is antisymmetric, the related matrix with entries

\begin{equation*} A_{i,j}=\half\big(\cV_{i,j}-\cV_{j,i}\big) \end{equation*}

is. When \(\cV\) is antisymmetric, \(A\) and \(\cV\) coincide. The matrix \(A\) is (to write it out explicitly)

\begin{equation*} \frac{1}{2}\!\!\left[\begin{matrix}0 & \!\!\!\frac{\partial \vv_1}{\partial x_2}\big(\vx_0,t_0\big) \!-\!\frac{\partial \vv_2}{\partial x_1}\big(\vx_0,t_0\big)& \!\!\!\frac{\partial v_1}{\partial x_3}\big(\vx_0,t_0\big) \!-\!\frac{\partial \vv_3}{\partial x_1}\big(\vx_0,t_0\big)\\ \!-\frac{\partial \vv_1}{\partial x_2}\big(\vx_0,t_0\big) \!+\!\frac{\partial \vv_2}{\partial x_1}\big(\vx_0,t_0\big)& 0 & \! \!\!\frac{\partial \vv_2}{\partial x_3}\big(\vx_0,t_0\big) \!-\!\frac{\partial \vv_3}{\partial x_2}\big(\vx_0,t_0\big)\\ \!-\!\frac{\partial \vv_1}{\partial x_3}\big(\vx_0,t_0\big) \!+\!\frac{\partial \vv_3}{\partial x_1}\big(\vx_0,t_0\big)& \!\!-\!\frac{\partial \vv_2}{\partial x_3}\big(\vx_0,t_0\big) \!+\!\frac{\partial \vv_3}{\partial x_2}\big(\vx_0,t_0\big)& 0\end{matrix}\right] \end{equation*}

Comparing this with (R), we see that

\begin{equation*} \Om \hk=\half\nabla\times \vv\big(\vx_0,t_0\big) \end{equation*}

So, at least when the matrix \(\cV\) defined in (M) is antisymmetric, our tiny cube rotates about the axis with \(\nabla\times \vv\big(\vx_0,t_0\big)\) at rate \(\half\big|\nabla\times \vv\big(\vx_0,t_0\big)\big|\text{.}\)

Remark 4.6.7.

In the generalization, Example 4.6.5, of Example 4.6.4, we only considered dimension 3. It is a nice exercise in eigenvalues and eigenvectors to handle general dimension. Here are the main facts about antisymmetric matrices with real entries that are used.

All eigenvalues of antisymmetric matrices are either zero or pure imaginary.
For antisymmetric matrices with real entries, the nonzero eigenvalues come in complex conjugate pairs. The corresponding eigenvectors may also be chosen to be complex conjugates.

Choose as basis vectors (like \(\hi,\ \hj,\ \hk\) above)

the eigenvectors of eigenvalue 0 (they act like \(\hk\) above)
the real and imaginary parts of each complex conjugate pair of eigenvectors (they act like \(\hi,\ \hj\) above)

Resumé so far:

We have now seen that

when the matrix \(\cV\) defined in (M) is symmetric and the direction vectors \(\ha{k}\) of the cube are eigenvectors of \(\cV\text{,}\) then, at time \(t_0\text{,}\) the chunk of fluid is not changing orientation but is changing volume at instantaneous relative rate \(\nabla\cdot\vv\big(\vx_0,t_0\big)\) and
when the matrix \(\cV\) defined in (M) is antisymmetric, then, at time \(t_0\text{,}\) the chunk of fluid is not changing shape or size but is rotating about the axis \(\nabla\times \vv\big(\vx_0,t_0\big)\) at rate \(\half\big|\nabla\times \vv\big(\vx_0,t_0\big)\big|\text{.}\) For this reason, \(\nabla\times \vv\) is often referred to as a “vorticity” meter.

These agree with our earlier interpretations of divergence and curl.

The general case:

Now consider a general matrix \(\cV\text{.}\) It can always be written as the sum

\begin{equation*} \cV=S+A \end{equation*}

of a symmetric matrix \(S\) and an antisymmetric matrix \(A\text{.}\) Just define

\begin{equation*} S_{i,j}=\half\big(\cV_{i,j}+\cV_{j,i}\big)\qquad A_{i,j}=\half\big(\cV_{i,j}-\cV_{j,i}\big) \end{equation*}

As we have already observed, the solution of

\begin{equation*} \vb'(t)=\cV\vb(t)\quad \vb(0)=\ve \end{equation*}

\begin{equation*} \vb(t)=e^{\cV t}\ve=e^{(A+S) t}\ve \end{equation*}

If \(S\) and \(A\) were ordinary numbers, we would have \(e^{(A+S) t}=e^{At}e^{St}\text{.}\) But for matrices this need not be the case, unless \(S\) and \(A\) happen to commute⁷. For arbitrary matrices, it is still true that

\begin{equation*} e^{(A+S) t}=\lim_{n\rightarrow\infty }\Big[e^{At/n}e^{St/n}\Big]^n \end{equation*}

This is called the Lie⁸ product formula. It shows that our tiny chunk of fluid mixes together the behaviours of \(A\) and \(S\text{,}\) scaling a bit, then rotating a bit, then scaling a bit and so on.

Example 4.6.8. \(\ \vv(x,y)= 2y\hi\).

In this example

\begin{equation*} \cV=\left[\begin{matrix}0&2\\ 0&0\end{matrix}\right]=S+A\qquad{\rm with}\qquad S=\left[\begin{matrix}0&1\\ 1&0\end{matrix}\right]\qquad A=\left[\begin{matrix}0&1\\ -1&0\end{matrix}\right] \end{equation*}

The solution to the full flow

\begin{equation*} \vb'(t)=\cV\vb(t)\quad \vb(0)= \left[\begin{matrix}\be_1 \\ \be_2\end{matrix}\right] \quad\text{or equivalently}\quad \begin{matrix}b_1'(t)=2b_2(t) & b_1(0)=\be_1\\ b_2'(t)=0 & b_2(0)=\be_2\end{matrix} \end{equation*}

\begin{equation*} \begin{matrix}b_1(t)=\be_1+2\be_2 t\\ b_2(t)=\be_2\end{matrix} \quad\text{or equivalently}\quad \vb(t)=\left[\begin{matrix}1& 2t\\ 0&1\end{matrix}\right]\vb(0) \end{equation*}

The solution to the \(S\) part of the flow

\begin{equation*} \vb'(t)=S\vb(t)\quad \vb(0)=\left[\begin{matrix}\be_1 \\ \be_2\end{matrix}\right] \quad\text{or equivalently}\quad \begin{matrix}b_1'(t)=b_2(t) & b_1(0)=\be_1\\ b_2'(t)=b_1(t) & b_2(0)=\be_2\end{matrix} \end{equation*}

is⁹

\begin{equation*} \begin{matrix}b_1(t)=\be_1\cosh t+\be_2\sinh t\\ b_2(t)=\be_1\sinh t+\be_2\cosh t\end{matrix} \quad\text{or equivalently}\quad \vb(t)=\left[ \begin{matrix}\cosh t& \sinh t\\ \sinh t&\cosh t\end{matrix} \right]\vb(0) \end{equation*}

The eigenvectors of \(S\) are

\begin{equation*} \ha{1}=\frac{1}{\sqrt{2}}\left[\begin{matrix}1\\ 1\end{matrix}\right]\qquad \ha{2}=\frac{1}{\sqrt{2}}\left[\begin{matrix}1\\ -1\end{matrix}\right] \end{equation*}

The corresponding eigenvalues are \(+1\) and \(-1\text{.}\) The eigenvectors obey

\begin{align*} e^{St}\ha{1}&=\left[\begin{matrix}\cosh t& \sinh t\\ \sinh t&\cosh t\end{matrix}\right]\ha{1} =e^t\ha{1}\\ e^{St}\ha{2}&=\left[\begin{matrix}\cosh t& \sinh t\\ \sinh t&\cosh t\end{matrix}\right]\ha{2} =e^{-t}\ha{2} \end{align*}

Under the \(S\) part of the flow \(\ha{1}\) scales by a factor of \(e^t\text{,}\) which is bigger than one for \(t \gt 0\) and \(\ha{2}\) scales by a factor of \(e^{-t}\text{,}\) which is smaller than one for \(t \gt 0\text{.}\)

The solution to the \(A\) part of the flow

\begin{equation*} \vb'(t)=A\vb(t)\quad \vb(0)=\left[\begin{matrix}\be_1 \\ \be_2\end{matrix}\right] \quad\text{or equivalently}\quad \begin{matrix}b_1'(t)=b_2(t) & b_1(0)=\be_1\\ b_2'(t)=-b_1(t) & b_2(0)=\be_2\end{matrix} \end{equation*}

\begin{equation*} \begin{matrix}b_1(t)=\be_1\cos t+\be_2\sin t\\ b_2(t)=-\be_1\sin t+\be_2\cos t\end{matrix} \quad\text{or equivalently}\quad \vb(t)=\left[ \begin{matrix}\cos t& \sin t\\ -\sin t&\cos t\end{matrix}\right]\vb(0) \end{equation*}

The \(A\) part of the flow rotates clockwise about the origin at one radian per unit time.

Here are some figures to help us visualize this.

The first shows a square with edges \(\ha{1},\ \ha{2}\) and its image under the full flow \(t=0.4\) later. Under this full flow the vector \(\ha{k}\rightarrow e^{0.4\cV}\ha{k}\text{.}\) The darkly shaded parallelogram has edges \(e^{0.4\cV}\ha{k}\text{.}\)
The second shows its image under \(0.4\) time units of the \(S\)-flow (that is, \(\ha{k}\rightarrow e^{0.4S}\ha{k}\)). The lightly shaded rectangle has edges \(e^{0.4S}\ha{k}\text{.}\)
The third applies \(0.4\) time units of the \(A\)-flow to the shaded rectangle of the second figure. So the lightly shaded rectangle of the third figure has edges \(e^{0.4S}\ha{k}\) and the darkly shaded rectangle has edges \(e^{0.4A}e^{0.4S}\ha{k}\text{.}\)

Of course \(e^{0.4A}e^{0.4S}\ha{k}\) (as in the darkly shaded rectangle of the third figure) is not a very good approximation for \(e^{0.4(A+S)}\ha{k}\) (as in the darkly shaded parallelogram of the first figure). It is much better to take \(\big[e^{0.4A/n}e^{0.4S/n}\big]^{n}\ha{k}\) with \(n\) large. Each of the following figures shows two parallelograms. In each, the shaded region has edges \(e^{0.4\cV}\ha{k}=e^{0.4(A+S)}\ha{k}\) and the outlined region has edges \(\big[e^{0.4A/n}e^{0.4S/n}\big]^{n}\ha{k}\text{.}\)

So we can see that, as \(n\) increases, \(\big[e^{0.4A/n}e^{0.4S/n}\big]^{n}\ha{k}\) becomes a better and better approximation to \(e^{0.4(A+S)}\ha{k}\text{.}\)

We'll also use some more mathematics than before. In this section, we'll use matrix eigenvalues and eigenvectors and solve some simple systems of ordinary differential equations. We'll also need to use a lot of subscripts and superscripts. It only looks intimidating.

In terms of our original vector field, this condition is that \(\frac{\partial \vv_i}{\partial x_j}\big(\vx_0,t_0\big) =\frac{\partial \vv_j}{\partial x_i}\big(\vx_0,t_0\big)\text{.}\) So, in three dimensions, it comes down to the requirement that \(\vnabla\times\vv\) be zero at the point \(\big(\vx_0,t_0\big)\text{.}\)

This was proven by the French mathematician and physicist Augustin-Louis Cauchy (1789–1857) in 1829.

The proof is not so hard, though we'll only outline it. Just denote by \(\beta\) the magnitude of the largest matrix element of \(B\text{.}\) Then use the definition of the matrix product to prove that the largest matrix element of \(B^n\) has magnitude at most \((d\be)^n\text{.}\)

Again, we won't prove this. But for a diagonal matrix, it is easy — just compute both sides. So for a diagonalizable matrix it is also easy — diagonalize.

You can find the solution either by guessing, or by using eigenvalues and eigenvectors.

By definition, the matrices \(S\) and \(A\) commute when \(AS=SA\text{.}\)

This formula is named after the Norwegian mathematician Marius Sophus Lie (1842–1899). In 1870, he was arrested and held in prison in France for a month, because he was suspected of being a German spy. His mathematics notes were thought to be top secret coded messages.

Recall that \(\sinh t = \frac{1}{2}\big(e^t-e^{-t}\big)\) and \(\cosh t = \frac{1}{2}\big(e^t+e^{-t}\big)\text{.}\)

CLP-4 Vector Calculus

Search Results:

Section 4.6 Really Optional — More Interpretation of Div and Curl

Example 4.6.1. \(\vv(x,y)= 2x\hi+3y\hj\).

Example 4.6.2. Example 4.6.1, generalized.

Example 4.6.3. Example 4.6.1, generalized yet again.

Example 4.6.4. \(\vv(x,y)= -y\hi+x\hj\).

Example 4.6.5. Example 4.6.4, generalized.

Example 4.6.6. Example 4.6.5, continued.

Remark 4.6.7.

Example 4.6.8. \(\ \vv(x,y)= 2y\hi\).