Page updated: March 13, 2021
Author: Curtis Mobley
View PDF

# The General Vector Radiative Transfer Equation

The goal of this page and the next two is to obtain the vector and scalar radiative transfer equations in the forms that are commonly used in optical oceanography. The steps to these equations are outlined starting from the most fundamental physics. This sequence makes explicit what assumptions must be made along the way to get from an exact but highly abstract and mathematical theory to equations that are only approximate but which can be solved to get results that are accurate enough for many applications. In particular, the presentation shows what is given up at each step in exchange for increased simplicity in the resulting physics and mathematics. The ﬁnal result will be the scalar radiative transfer equation (SRTE), which is seen in many textbooks and which is solved by the widely used HydroLight radiative transfer software.

The phenomenological nature radiative transfer theory as developed over the decades made its connections to fundamental physics unclear because of its heuristic derivations and (sometimes physically indefensible) assumptions (e.g., Preisendorfer (1965), Mishchenko (2013), and Mishchenko (2014)). Theoreticians worked for decades to construct ”an analytical bridge between the mainland of physics and the island of radiative transfer theory”, as R. W. Preisendorfer worded it (Preisendorfer (1965), page 389). Most recently, M. I. Mishchenko has expended enormous eﬀort to develop a physically and mathematically rigorous connection between Maxwell’s equations and the vector radiative transfer equation (VRTE), from which the SRTE is obtained after further simpliﬁcations. Thus Preisendorfer’s bridge has now been completed with a level of rigor worthy of any branch of physics (e.g., Mishchenko (2008a), Mishchenko et al. (2016)). This page outlines how that bridge is constructed.

The path from fundamental physics to radiative transfer theory works it way through the following stages:

1.
Quantum Electrodynamics
2.
Maxwell’s Equations
3.
The general vector radiative transfer equation (VRTE)
4.
The VRTE for particles with mirror symmetry
5.
The SRTE for the ﬁrst component of the Stokes vector

Notation: This and the following pages use bold face $E$ and $B$ for electric and magnetic ﬁeld vectors, and $J$ for the current vector, in 3-D space. $4×1$ Stokes vectors and $4×4$ matrices are denoted by “blackboard” fonts: $𝕊,𝕂,𝕄$, etc.

### Quantum Electrodynamics

Quantum electrodynamics (QED) is the fundamental physical theory that explains with (as far as we know) total accuracy the interactions of light and matter, or of electrically charged particles. In this theory, the electromagnetic ﬁeld itself is quantized, and the photon is the quantum of the ﬁeld. The electromagnetic force between charged particles is described by the exchange of so-called virtual photons (called ”virtual” because they are not detectable in the roll of transferring forces between charged particles). The photon is said to mediate or carry the electromagnetic force.

As an example of the accuracy of QED, the most recently measured value of a quantity called the electron spin g-factor is (Hanneke et al., 2011. Phys. Rev. A DOI: 10.1103/PhysRevA.83.052122)

 $g∕2=1.001\phantom{\rule{0.3em}{0ex}}159\phantom{\rule{0.3em}{0ex}}652\phantom{\rule{0.3em}{0ex}}180\phantom{\rule{0.3em}{0ex}}73\left(28\right)$

where the last two digits in parentheses give the uncertainty in the preceding two digits. The value predicted by the most recent QED calculations is (Aoyama et al., 2015. Phys. Rev. D DOI: 10.1103/PhysRevD.91.033006)

 $g∕2=1.001\phantom{\rule{0.3em}{0ex}}159\phantom{\rule{0.3em}{0ex}}652\phantom{\rule{0.3em}{0ex}}181\phantom{\rule{0.3em}{0ex}}643\left(763\right)$

This is an agreement between theory and experiment of around 1 part in $1{0}^{12}$, which is an accuracy equivalent to measuring the circumference of the Earth to within 0.04 millimeters. Similar results are obtained for other QED predictions. Because of its exceptional success in explaining nature at its most fundamental level of elementary particle interactions, QED is often regarded as the most successful theory in physics.

Unfortunately, QED is exceptionally abstract and mathematical, and the calculations can be carried out only for simple interactions, such as the scattering of one elementary charged particle by another (although the calculations for even the simplest of interactions can be exceedingly complex according to the rules of QED). QED is the conceptual starting point for much of modern physics, but it isn’t even remotely practical for everyday calculations. If you want to start learning about how the theory is formulated, there is no better place to begin than Feynman’s classic popular exposition QED: The Strange Story of Light and Matter. After that, if you want to get a bit more serious and see how the calculations are actually done, the best text I’ve found is Introduction to Elementary Particles, Second Edition by Griﬃths. Learning about QED a fascinating journey, but this is not a road that oceanographers are required to travel.

### Maxwell’s Equations

QED is a quantum ﬁeld theory that describes light and electromagnetism at the level of individual photons, which in QED are viewed as the quantized vibrational modes of the electromagnetic ﬁeld. It is possible to take a “classical physics limit” of QED to get a classical ﬁeld theory, in which the electromagnetic ﬁeld is not quantized. The result is Maxwell’s equations, which describe electric and magnetic ﬁelds as continuous functions of space and time. One way to think of this limiting process was stated by Sakurai (1967) as “The classical limit of the quantum theory of radiation is achieved when the number of photons becomes so large that the occupation number may as well be regarded as a continuous variable. The space-time development of the classical electromagnetic wave approximates the dynamical behavior of trillions of photons.” Not surprisingly, getting from QED to Maxwell requires a high level of mathematical and physical sophistication.

Maxwell’s equations for the electric ﬁeld $E\left(x,t\right)$ (units of $N\phantom{\rule{2.6108pt}{0ex}}cou{l}^{-1}$) and magnetic ﬁeld $B\left(x,t\right)$ ($N\phantom{\rule{2.6108pt}{0ex}}{A}^{-1}\phantom{\rule{2.6108pt}{0ex}}{m}^{-1}$) at location $x=\left(x,y,z\right)$ and time $t$ are (in diﬀerential form, in SI units)

$\begin{array}{llll}\hfill \nabla \cdot E=& \frac{1}{{𝜖}_{o}}\rho \phantom{\rule{2em}{0ex}}& \hfill & \phantom{\rule{2em}{0ex}}\\ \hfill \nabla \cdot B=& 0\phantom{\rule{2em}{0ex}}& \hfill & \phantom{\rule{2em}{0ex}}\\ \hfill \nabla ×E=& -\frac{\partial B}{\partial t}\phantom{\rule{2em}{0ex}}& \hfill & \phantom{\rule{2em}{0ex}}\\ \hfill \nabla ×B=& {\mu }_{o}J+{\mu }_{o}{𝜖}_{o}\frac{\partial E}{\partial t}\phantom{\rule{2em}{0ex}}& \hfill & \phantom{\rule{2em}{0ex}}\end{array}$

Here $\rho \left(x,t\right)$ is the electric charge density ($coul\phantom{\rule{2.6108pt}{0ex}}{m}^{-3}$) and $J\left(x,t\right)$ is the electrical current density ($A\phantom{\rule{2.6108pt}{0ex}}{m}^{-2}$) in the region of interest. The quantity ${𝜖}_{o}=8.85×1{0}^{-12}\phantom{\rule{2.6108pt}{0ex}}{A}^{2}\phantom{\rule{2.6108pt}{0ex}}{s}^{4}\phantom{\rule{2.6108pt}{0ex}}k{g}^{-1}\phantom{\rule{2.6108pt}{0ex}}{m}^{-3}\left(or\phantom{\rule{2.6108pt}{0ex}}Farad\phantom{\rule{2.6108pt}{0ex}}{m}^{-2}\phantom{\rule{2.6108pt}{0ex}}or\phantom{\rule{2.6108pt}{0ex}}cou{l}^{2}\phantom{\rule{2.6108pt}{0ex}}{N}^{-1}\phantom{\rule{2.6108pt}{0ex}}{m}^{-2}\right)$ is the electric permittivity of free space, and ${\mu }_{o}=4\pi ×1{0}^{-7}\phantom{\rule{2.6108pt}{0ex}}kg\phantom{\rule{2.6108pt}{0ex}}m\phantom{\rule{2.6108pt}{0ex}}{s}^{-2}\phantom{\rule{2.6108pt}{0ex}}{A}^{-2}\left(or\phantom{\rule{2.6108pt}{0ex}}Henry\phantom{\rule{2.6108pt}{0ex}}{m}^{-1}\phantom{\rule{2.6108pt}{0ex}}or\phantom{\rule{2.6108pt}{0ex}}N\phantom{\rule{2.6108pt}{0ex}}{A}^{-2}\right)$ is the magnetic permeability of free space; these are fundamental physical constants.

These equations are applicable to the every-day needs of science and engineering. They govern matters as seemingly diverse as the generation, propagation, and detection of radio waves; the generation of electrical power for your house; the generation of Earth’s magnetic ﬁeld, the refraction of light at an air-water surface, and the scattering of light by phytoplankton. However, the greatest achievement of Maxwell and his equations is this: In “free space,” i.e. when $\rho =0$ and $J=0$, a bit of vector calculus shows that each of the $x$, $y$, and $z$ components of the $E$ and $B$ vectors satisﬁes an equation of the form

 ${\nabla }^{2}f={\mu }_{o}{𝜖}_{o}\frac{{\partial }^{2}f}{{\partial }^{2}t}\phantom{\rule{0.3em}{0ex}}.$

This equation describes a wave propagating with speed $c=1∕\sqrt{{\mu }_{o}{𝜖}_{o}}$. Plugging in the numbers for ${\mu }_{o}$ and ${𝜖}_{o}$ gives $c=3×1{0}^{8}\phantom{\rule{2.6108pt}{0ex}}m\phantom{\rule{2.6108pt}{0ex}}{s}^{-1}$. As Maxwell observed in 1862, “This velocity is so nearly that of light that it seems we have strong reason to conclude that light itself (including radiant heat and other radiations) is an electromagnetic disturbance in the form of waves propagated through the electromagnetic ﬁeld according to electromagnetic laws.” This result is one of the greatest intellectual achievements of all time: not only were electric and magnetic ﬁelds tied together in the ﬁrst uniﬁed ﬁeld theory in physics, but light was shown to be an electromagnetic phenomenon.

The $E$ and $B$ wave solutions coming out of Maxwell’s equations contain full information about the phases of the ﬁelds. Maxwell’s equations can thus describe interference phenomena, which depend on the instantaneous phases of the ﬁelds.

Maxwell’s equations are discussed in more detail beginning on page Maxwell’s Equations in Vacuo.

Although Maxwell’s equations are very accurate for predicting the phenomena of classical physics, they do fail when applied to atomic-scale processes, extremely strong ﬁelds, and the realm of “quantum optics,” which deals with individual photons. In particular, classical physics failed to describe blackbody radiation and the photoelectric eﬀect, which is why Planck and Einstein had to make radical new assumptions in order to explain the observations. Those assumptions were the start of quantum mechanics. (To be precise, beneﬁtting from the insights of a century of quantum mechanics, the failure of classical physics to describe the photoelectric eﬀect and blackbody radiation was not due to the radiation ﬁeld not being quantized (Maxwell’s equations), but due to the discrete nature of atomic absorption and emission processes, which in turn lead to discrete harmonics of the associated radiation. But that’s another story....)

For solution in material media, Maxwell’s equations must be augmented with information about the electrical and magnetic properties of the material and with boundary conditions describing the speciﬁc problem of interest, such as sunlight passing through the sea surface and propagating through an ocean full of absorbing and scattering particles. (The equations do become somewhat more complicated in material media, although the general form remains the same.) These equations do ﬁnd occasional use in oceanography. For example, they have been for measurement of ocean currents via the electric ﬁelds set up by a conductor (salt water) moving through the earth’s magnetic ﬁeld (e.g., Larsen (1973), Larsen (1992)). They are also the foundation for computations of sea ice emissivity and reﬂection at microwave frequencies (e.g., Tan et al. (2016)). However, those applications are either non-optical or at wavelengths far longer than the visible spectrum. Direct numerical solutions of Maxwell’s equations are reviewed in Section 7 of Mishchenko et al. (2016).

For geometry as complex as a wind-blown sea surface and a water column ﬁlled with spatially varying absorbing and scattering matter, Maxwell’s equations and their boundary conditions are just too complex to solve for typical optical oceanographic problems. So, elegant and useful as they are for some problems, optical oceanographers must seek still further simpliﬁcations.

### The General Vector Radiative Transfer Equation

The next simplifying step is to go from the world of electric and magnetic ﬁelds to the world of radiance. At optical wavelengths, the frequency of electromagnetic waves (light) is of order $1{0}^{15}$ Hz. This is far higher than can be directly measured for a time-dependent propagating $E$ ﬁeld. In practice, time-averaged irradiances are measured, in conjunction with various combinations of polarizing ﬁlters, over times long compared to the wave period. This time-averaging destroys the phase information contained in the instantaneous vector ﬁelds, but preserves the directional information about the plane of oscillation of the $E$ (and $B$) ﬁeld, i.e., it preserves information about the state of polarization of the light.

As described on the Polarization: Stokes Vectors page, the state of polarization of a light ﬁeld is speciﬁed by the four-component Stokes vector, whose elements are related to the complex amplitudes of the electric ﬁeld vector $E$ resolved into directions that are parallel (${E}_{\parallel }$) and perpendicular (${E}_{\perp }$) to a conveniently chosen reference plane. In the oceanographic setting, this reference plane is usually the meridian plane, which is deﬁned by the normal to the mean sea surface, $ẑ$, and the azimuthal direction of the propagating light; the meridian plane is thus perpendicular to the mean sea surface. In a polar coordinate system $\left(r,𝜃,\varphi \right)$, ${E}_{\parallel }={E}_{𝜃}$ is the polar angle component of the electric ﬁeld and ${E}_{\perp }={E}_{\varphi }$ is the azimuthal angle component. Figure 1 shows the meridian plane as commonly used for oceanography radiative transfer calculations.

There are two versions of the Stokes vector seen in the literature, and these two versions have diﬀerent units and refer to diﬀerent physical quantities. The coherent Stokes vector describes a quasi-monochromatic plane wave propagating in one exact direction, and the vector components have units of power per unit area (i.e., irradiance) on a surface perpendicular to the direction of propagation. For a propagating electric ﬁeld of the form $E\left(r,t\right)={E}_{o}exp\left(ik\cdot r-i\omega t\right)$, where the wavenumber vector $k$ is in the direction of propagation, and with the choice of the meridian plane as the reference, the coherent Stokes vector is then deﬁned as (Mishchenko et al. (2002))

 $𝕊=\left[\begin{array}{c}\hfill I\hfill \\ \hfill Q\hfill \\ \hfill U\hfill \\ \hfill V\hfill \end{array}\right]=\frac{1}{2}\sqrt{\frac{𝜖}{\mu }}\left[\begin{array}{c}\hfill {E}_{o𝜃}{E}_{o𝜃}^{\ast }+{E}_{o\varphi }{E}_{o\varphi }^{\ast }\hfill \\ \hfill {E}_{o𝜃}{E}_{o𝜃}^{\ast }-{E}_{o\varphi }{E}_{o\varphi }^{\ast }\hfill \\ \hfill -{E}_{o𝜃}{E}_{o\varphi }^{\ast }-{E}_{o\varphi }{E}_{o𝜃}^{\ast }\hfill \\ \hfill i\left({E}_{o\varphi }{E}_{o𝜃}^{\ast }-{E}_{o𝜃}{E}_{o\varphi }^{\ast }\right)\hfill \end{array}\right]=\frac{1}{2}\sqrt{\frac{𝜖}{\mu }}\left[\begin{array}{c}\hfill |{E}_{o𝜃}{|}^{2}+|{E}_{o\varphi }{|}^{2}\hfill \\ \hfill |{E}_{o𝜃}{|}^{2}-|{E}_{o\varphi }{|}^{2}\hfill \\ \hfill -2Re\left\{{E}_{o𝜃}{E}_{o\varphi }^{\ast }\right\}\hfill \\ \hfill 2Im\left\{{E}_{o𝜃}{E}_{o\varphi }^{\ast }\right\}\hfill \end{array}\right]\phantom{\rule{0.3em}{0ex}}.$ (1)

Here $𝜖$ is the electric permittivity of the medium (SI units of ${A}^{2}\phantom{\rule{2.6108pt}{0ex}}{s}^{4}\phantom{\rule{0.3em}{0ex}}k{g}^{-1}\phantom{\rule{2.6108pt}{0ex}}{m}^{-3}$), and $\mu$ is the magnetic permeability of the medium ($kg\phantom{\rule{2.6108pt}{0ex}}m\phantom{\rule{2.6108pt}{0ex}}{s}^{-2}\phantom{\rule{2.6108pt}{0ex}}{A}^{-2}$). Thus the elements of the coherent Stokes vector have units of $kg\phantom{\rule{2.6108pt}{0ex}}{s}^{-3}$ or $Watt∕{m}^{2}$, i.e. of irradiance. ${E}^{\ast }$ denotes complex conjugate, hence the components of the Stokes vector are real numbers. Note that the values of the $Q$ and $U$ elements of the Stokes vector depend on the choice of reference plane, whereas $I$ and $V$ do not.

The diﬀuse Stokes vector is deﬁned as in Eq. (1) but describes light propagating in a small set of directions surrounding a particular direction and has units of power per unit area per unit solid angle (i.e., radiance). It is the diﬀuse Stokes vector that appears in the radiative transfer equations as developed here. The diﬀerences in coherent and diﬀuse Stokes vectors are rigorously presented in Mishchenko et al. (2002).

Authors often omit the $\frac{1}{2}\sqrt{𝜖∕\mu }$ factor in Eq. (1) because they are interested only in relative values such as the degree of polarization, not absolute magnitudes, but this omission is both confusing and physically incorrect. Units and magnitudes matter! In particular, the diﬀerent units of coherent and diﬀuse Stokes vectors, and whether or not the $\frac{1}{2}\sqrt{𝜖∕\mu }$ factor is included in the deﬁnition of the Stokes vector, have subtle but very important consequences regarding how light propagation across a dielectric interface such as the air-water surface is formulated (e.g., Garcia (2012), Zhai et al. (2012)).

The transition from electric and magnetic ﬁelds to Stokes vectors yields a general 3D vector radiative transfer equation. Let $𝕊\left(x,\xi \right)$ denote the Stokes vector at spatial location $x=\left(x,y,z\right)=\left(r,𝜃,\varphi \right)$ for light propagating in the $\xi =\left(𝜃,\varphi \right)$ direction: $uS\left(x,\xi \right)=\underline{S}\left(x\right)\xi$. Then the VRTE has the form (Mishchenko (2008a), with slightly diﬀerent notation and with the addition of an internal source term)

 $\xi \cdot \nabla 𝕊\left(x,\xi \right)=-𝕂\left(x,\xi \right)\phantom{\rule{0.3em}{0ex}}𝕊\left(x,\xi \right)+{\iint }_{4\pi }ℤ\left(x,{\xi }^{\prime }\to \xi \right)\phantom{\rule{0.3em}{0ex}}𝕊\left(x,{\xi }^{\prime }\right)\phantom{\rule{0.3em}{0ex}}d\Omega \left({\xi }^{\prime }\right)+ℚ\left(x,\xi \right)\phantom{\rule{0.3em}{0ex}}.$ (2)

In this equation,

• $𝕂\left(x,\xi \right)$ is a $4×4$ extinction matrix, which describes the attenuation (by the background medium and any particles imbedded in the medium) of the light propagating in direction $\xi$.
• $ℤ\left(x,{\xi }^{\prime }\to \xi \right)$ is a $4×4$ phase matrix, which describes how light in an initial state of polarization and direction ${\xi }^{\prime }$ in the incident meridian plane is scattered to a diﬀerent state of polarization and direction $\xi$ in the ﬁnal meridian plane.
• $ℚ\left(x,\xi \right)$ is a $4×1$ internal source term, which speciﬁes the Stokes vector of any emitted light such as bioluminescence or light at the wavelength of interest that comes from other wavelengths via inelastic scattering.

In general, all 16 elements of $𝕂$ and $ℤ$ are nonzero, and they depend on both location and direction (and wavelength; not shown). Equation (2) can describe polarized light propagation in matter that is directionally non-isotropic (e.g., in a crystal), that can absorb light diﬀerently for diﬀerent states of polarization (linear or circular dichroism), and that contains scattering particles of any shape and random or non-random orientation. (See Mishchenko et al. (2002) or Mishchenko et al. (2016) for a rigorous discussion of how these matrix elements are deﬁned and computed. The $𝕂\left(x,\xi \right)$ of Eq. 2 is a bulk extinction matrix. In Mishchenko’s papers this is written as ${n}_{o}{⟨𝕂⟩}_{\xi }$, where ${n}_{o}=N∕V$ is the number of particles $N$ in the volume of interest $V$, and ${⟨\underline{K}⟩}_{\xi }$ is a single-particle extinction matrix averaged over all $N$ particles. A similar comment holds for $ℤ$.)

By deﬁnition the phase phase matrix $ℤ\left({\xi }^{\prime }\to \xi \right)$ scatters light from one meridian plane (deﬁned by $ẑ$ and ${\xi }^{\prime }$) to another (deﬁned by $ẑ$ and $\xi$). It is customary to write $ℤ\left({\xi }^{\prime }\to \xi \right)$ as a product of three matrices that

1.
Transform the initial (unscattered) Stokes vector from the incident meridian plane to the scattering plane, which is the plane containing the incident (${\xi }^{\prime }$) and scattered ($\xi$) directions,
2.
Scatter the Stokes vector from direction ${\xi }^{\prime }$ to $\xi$, with calculations performed in the scattering plane, and
3.
Transform the ﬁnal Stokes vector from the scattering plane to the ﬁnal meridian plane.

When this is done, the phase matrix is written as

 $ℤ\left({\xi }^{\prime }\to \xi \right)=ℝ\left(\alpha \right)𝕄\left({\xi }^{\prime }\to \xi \right)ℝ\left({\alpha }^{\prime }\right)\phantom{\rule{0.3em}{0ex}}.$ (3)

Here $ℝ\left({\alpha }^{\prime }\right)$ is a $4×4$ matrix that transforms (“rotates” through an angle ${\alpha }^{\prime }$) the incident Stokes vector into the scattering plane; $𝕄\left({\xi }^{\prime }\to \xi \right)$ is a $4×4$ matrix, the scattering matrix, which by deﬁnition scatters the incident Stokes vector to the ﬁnal Stokes vector, with both expressed in the scattering plane; and $ℝ\left(\alpha \right)$ is a $4×4$ matrix that rotates the ﬁnal Stokes vector from the scattering plane to the ﬁnal meridian plane. (In general, $𝕄$ is called the scattering matrix. In the laboratory setting, $𝕄$ is usually called the Mueller matrix.) This factoring of $ℤ$ separates the physics of the scattering process ($𝕄$) from the geometrical bookkeeping related to the coordinate systems (the rotation matrices). For the choice of a positive rotation being counterclockwise when looking into the beam, the Stokes vector rotation matrix is (e.g., page 25 of Mishchenko et al. (2002))

 $ℝ\left(\gamma \right)=\left[\begin{array}{cccc}\hfill 1\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \\ \hfill 0\hfill & \hfill cos2\gamma \hfill & \hfill -sin2\gamma \hfill & \hfill 0\hfill \\ \hfill 0\hfill & \hfill sin2\gamma \hfill & \hfill cos2\gamma \hfill & \hfill 0\hfill \\ \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 1\hfill \end{array}\right]\phantom{\rule{0.3em}{0ex}}.$ (4)

The rotation angles ${\alpha }^{\prime }$ and $\alpha$ are shown in Fig. 2 These angles determined by spherical trigonometry as described on the Polarization: Scattering Geometry page.

Equation (2) is almost never applied to oceanic problems. The reason is less for mathematical reasons than because there are no comprehensive data or models for the needed inputs $𝕂$ and $𝕄$. To be useful, all elements of these matrices must be available for a wide range of oceanic waters. Further simpliﬁcation is required.