Two Concepts in Optics of Anisotropic Dispersive Media and Polariton Case in Coordinate-Invariant Way

Alfred Wünsche

doi:10.4236/jmp.2022.134035

Journal of Modern Physics > Vol.13 No.4, April 2022

Two Concepts in Optics of Anisotropic Dispersive Media and Polariton Case in Coordinate-Invariant Way

Alfred Wünsche
Institut für Physik, Humboldt-Universität, Berlin, Germany.
DOI: 10.4236/jmp.2022.134035 PDF HTML XML 92 Downloads 446 Views

Abstract

Two concepts of phenomenological optics of homogeneous, anisotropic and dispersive media are compared, the younger and more general concept of media with spatial dispersion and the older concept of (bi)-anisotropic media with material tensors for electric and magnetic induction which only depend on the frequency. The general algebraic form of the polarization vectors for the electric field and their one-dimensional projection operators is discussed without the degenerate cases of optic axis for which they become two-dimensional projection operators. Group velocity and diffraction coefficients in an approximate equation for the slowly varying amplitudes of beam solutions are calculated. As special case a polariton permittivity for isotropic media with frequency dispersion but without losses is discussed for the usual passive case and for the active case (occupation inversion of two energy levels that goes in direction of laser theory) and the group velocity is calculated. For this active case, regions of frequency and wave vector with group velocities greater than that of light in vacuum were found. This is not fully understood and due to large diffraction is likely only to realize in guided resonator form. The notion of “negative refraction” is shortly discussed but we did not find agreement with its assessment in the original paper.

Keywords

Spatial and Frequency Dispersion, Bi-Anisotropic Media, Uniaxial Media, Passive and Active Media, Negative Refraction, Operator Invariants, Complementary Operator, Group Velocity

Share and Cite:

Wünsche, A. (2022) Two Concepts in Optics of Anisotropic Dispersive Media and Polariton Case in Coordinate-Invariant Way. Journal of Modern Physics, 13, 574-619. doi: 10.4236/jmp.2022.134035.

To the Notations

Three-dimensional vectors: bold letters, e.g., $a, b, c, \dots$ ,

$a b$ scalar products, $[a, b]$ vector products, $[a, b, c]$ volume products, $a \cdot b$ dyadic products,

$A B$ operator products, $A a, \tilde{a} A$ products of operators with vectors, $\tilde{a} A a$ bilinear (and quadratic) forms.

In Euclidean spaces with a symmetric metric tensor $g_{i j}$ , the dual tensor $b_{i k} \equiv ϵ_{i j k} b_{j}$ to a vector $b_{j}$ ( $ϵ_{i j k}$ Levi-Cicita symbol) can be also seen as antisymmetric operator and in coordinate-invariant form we can write this antisymmetric operator as $[b]$ with the advantage that vector and also volume products can be written only by displacement of the squared brackets, e.g., $a [b] \equiv [a, b]$ , $[b] c \equiv [b, c]$ , $a [b] c \equiv [a, b] c \equiv a [b, c] \equiv [a, b, c]$ .

In mathematical texts I write three-dimensional operators by serif-less Capital letters. In physical texts this makes sometimes difficulties because one cannot reasonably write all operators with physical meaning by Capital letters and all vectors with physical meaning by small letters. Furthermore, in case of Greek letters, “Latex” (and also printing) does not provide serif-less letters. In these cases I write operators as a compromise by bold letters, e.g., $ε, μ$ such as vectors to distinguish them, in particular, from scalars. This means that in present physical text one must know which kind of quantities one has: scalars, vectors or operators.

1. Introduction

There are two concepts of phenomenological macroscopic optics of the linear constitutive equations for anisotropic dispersive media. The first and younger concept is spatial dispersion in the first time mainly developed by Russian physicist in the fiftieths, in particular, Ginzburg and Agranovich [1], Ginzburg [2], Silin and Rukhadze [3] and in shorter form considered in a new chapter in the new edition of vol. 8 of the course of Landau and Lifshits [4].

The second and much older concept is to use two material equations for the electric and magnetic induction in dependence on the electric and magnetic field using tensors of second and sometimes in addition of third rank (optic activity) depending on frequency only (dispersion) and now often called “bi-anisotropic media” (including also electrical anisotropy only). Three representative monographs of the many possible possible ones are that of Tamm [5], that from Sommerfeld [6] and that of Born and Wolf [7] and in addition the comprehensive encyclopedic article from Szivessy [8]. We cite here also the most basic works of Fyodorov, the initiator of coordinate-invariant methods, and his followers from Minsk [9] [10] [11] who in addition to this concept use last methods and where the monograph [11] contains beside theory also experimental material to different media and crystals and by impression was mainly written by Filippov. Coordinate-invariant methods do not only write the starting equations in vector or tensor form but work from beginning up to the results only with vectors, operators and tensors which have a relation to the problem but not with arbitrary coordinate representations and which are mostly of advantage compared with often voluminous coordinate representations but with more sophisticated algebra. We also apply in this article widely coordinate-invariant methods where it is possible and used them also long ago in the past, e.g., [12] [13] [14] [15].

As a special case we discuss in detail a permittivity which we call polariton permittivity and which is related to phenomenological theory of excitons, e.g., in addition to [1] [2] [3] [4] by Knox, Agranovich, Davydov, Galanin and Pekar [16] [17] [18] [19] [20]. It admits to consider two essentially different special cases called the passive and the active case. The active case is connected with occupation inversion of at least two energy levels in the medium and is described in certain parts of frequency or connected wave vectors by amplification and leads into the neighborhood to laser theory. It contains also a very interesting phenomenon of propagation of excitations with velocities faster than light that is not fully understood.

In connection with my article [21] the notion of “negative refraction” of Pendry [22] came into the focus of my considerations¹. I never have used it and likely never would use the notion “negative refraction” in connection with my own results in this field. In Section 11 and in Appendix D I try to represent my imaginations to the content of this notion which seems to me as incorrect ones.

Sections 2-8 are devoted to general characterization and comparison of both concepts including calculation of group velocities with and without taking into consideration the dispersion and Section 9 and Section 10 to the most simple model of a polariton permittivity. Section 11 was made to prepare short remarks to the notion of “negative refraction” in Appendix D.

2. The Concept of Spatial Dispersion

In this more general concept compared with the bi-anisotropic concept considered in next Section we write the equations of macroscopic electrodynamics in the form

$\begin{array}{l} [\nabla, E (r, t)] + \frac{1}{c} \frac{\partial}{\partial t} B (r, t) = 0, \nabla B (r, t) = 0, \\ [\nabla, B (r, t)] - \frac{1}{c} \frac{\partial}{\partial t} D (r, t) = 0, \nabla D (r, t) = 0, \end{array}$ (2.1)

where $E (r, t)$ is the macroscopic electric and $B (r, t)$ the macroscopic magnetic field² The linear constitutive equation for spatially and temporally homogeneous but, in general, anisotropic dispersive media are written in the form

$D_{i} (r, t) = \int d^{3} r^{'} \land d t^{'} {\hat{ε}}_{i j} (r - r^{'}, t - t^{'}) E_{j} (r^{'}, t^{'}) .$ (2.2)

We now make a Fourier transformation for $E, B$ and $D$ according to the scheme

$\begin{array}{l} E (r, t) = \frac{1}{{(2 π)}^{4}} \int d^{3} k \land d ω E (k, ω) e^{i (k r - ω t)}, \\ E (k, ω) = \int d^{3} r \land d t E (r, t) e^{- i (k r - ω t)}, \end{array}$ (2.3)

we find the transformed constitutive relation in the form

$D_{i} (k, ω) = ε_{i j} (k, ω) E_{j} (k, ω),$ (2.4)

with the definition of the general permittivity tensor $ε_{i j} (k, ω)$

$ε_{i j} (k, ω) \equiv \int d^{3} ρ \land d τ {\hat{ε}}_{i j} (ρ, τ) e^{- i (k ρ - ω τ)} .$ (2.5)

In general, this tensor is non-symmetric.

After Fourier transformation of (2.1) these equations take on the form

$\begin{array}{l} [k, E (k, ω)] - \frac{ω}{c} B (k, ω) = 0, k B (k, ω) = 0, \\ [k, B (k, ω)] + \frac{ω}{c} D (k, ω) = 0, k D (k, ω) = 0, \end{array}$ (2.6)

By elimination of $B$ from these equations and using the constitutive Equation (2.4) we find the following operator equation for the Fourier components of the electric field (in case of $ω \neq 0$ )

$0 = {\frac{c^{2}}{ω^{2}} (k \cdot k - k^{2} I) + ε (k, ω)} E (k, ω) \equiv L (k, ω) E (k, ω) .$ (2.7)

From this equation follows equivalently to the vanishing of the divergence of $D (r, t)$

$0 = k L (k, ω) E (k, ω) = k ε (k, ω) E (k, ω) = k D (k, ω) .$ (2.8)

Equation (2.7) is an operator equation for the Fourier transforms of the electric field to the eigenvalue “zero” which in the original form can be written

$0 = {\frac{c^{2}}{\frac{\partial^{2}}{\partial t^{2}}} (\nabla \cdot \nabla - \nabla^{2} I) + ε (- i \nabla, i \frac{\partial}{\partial t})} E (r, t) \equiv L (- i \nabla, i \frac{\partial}{\partial t}) E (r, t),$ (2.9)

with the differential operator (or integral operator in case of $ε$ )

$L (- i \nabla, i \frac{\partial}{\partial t}) \equiv \frac{c^{2}}{\frac{\partial^{2}}{\partial t^{2}}} (\nabla \cdot \nabla - \nabla^{2} I) + ε (- i \nabla, i \frac{\partial}{\partial t}) .$ (2.10)

For the solution of these operator equations it is favorable to consider some algebra of the operators before this.

The operator $L (k, ω)$ in the wave Equation (2.7) is defined by

$L (k, ω) \equiv \frac{c^{2}}{ω^{2}} (k \cdot k - k^{2} I) + ε (k, ω) .$ (2.11)

The invariants of the operator $L (k, ω)$ are ( $ε \equiv ε (k, ω)$ )

$\begin{array}{l} | L (k, ω) | = \frac{c^{4}}{ω^{4}} k^{2} (k ε k) - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k ε k - k ε^{2} k) + | ε |, \\ [L (k, ω)] = \frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε], \\ 〈 L (k, ω) 〉 = - 2 \frac{c^{2}}{ω^{2}} k^{2} + 〈 ε 〉, \end{array}$ (2.12)

which are involved in the Cayley-Hamilton identity $L^{3} - 〈 L 〉 L^{2} + [L] L - | L | I = 0$ for the operator $L \equiv L (k, ω)$ (see (A.1) in Appendix A).

The vanishing of the determinant of $L (k, ω)$

$| L (k, ω) | = 0,$ (2.13)

is the dispersion equation and describes a three-dimensional (hyper)-surface in the four-dimensional space of variables $(k, ω)$ . In the specialization to only frequency dispersion ( $ε (k, ω) = ε (ω)$ ) it is identical in content but not in form with the Fresnel Equation (e.g., [6] [7] ).

For the complementary operator $\bar{L} (k, ω)$ to $L (k, ω)$ we find (see Appendix A)

$\begin{matrix} \bar{L} (k, ω) \equiv L^{2} - 〈 L 〉 L + [L] I \\ = \frac{c^{4}}{ω^{4}} (k^{2}) k \cdot k - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k \cdot k - ε k \cdot k - k \cdot k ε + (k ε k) I) + \bar{ε}, \end{matrix}$

$〈 \bar{L} (k, ω) 〉 = [L (k, ω)], 〈 \bar{ε} 〉 = [ε], ε^{- 1} = \frac{\bar{ε}}{| ε |}, ε = | ε | {(\bar{ε})}^{- 1}, \bar{\bar{ε}} = | ε | ε .$ (2.14)

The complementary operator $\bar{L} (k, ω)$ to $L (k, ω)$ plays an important role in optics of anisotropic media. If the determinant of $L$ is vanishing, i.e., $[L] = 0$ then the squared complementary operator ${\bar{L}}^{2}$ is proportional to $\bar{L}$ , more precisely³

$| L | = 0 : \Rightarrow {(\bar{L})}^{2} = 〈 \bar{L} 〉 \bar{L}, 〈 \bar{L} 〉 = [L],$ (2.15)

and $Π$ according to the following definition

$Π \equiv \frac{\bar{L}}{〈 \bar{L} 〉} = \frac{\bar{L}}{[L]}, \Rightarrow Π^{2} = Π, 〈 Π 〉 = 1,$ (2.16)

is projection operator to the eigenvalue $λ = 0$ of $L$ . If $a$ and $\tilde{a}$ are arbitrary vectors then non-vanishing vectors $\bar{L} a$ are right-hand eigenvector and non-vanishing vectors $\tilde{a} \bar{L}$ left-hand eigenvector of $L$ to the eigenvalue $λ = 0$ , i.e.

$| L | = 0, \bar{L} a \neq 0, \tilde{a} \bar{L} \neq 0 : \Rightarrow L \bar{L} a = | L | a = 0, \tilde{a} \bar{L} L = \tilde{a} | L | = 0.$ (2.17)

This follows from the Cayley-Hamilton identity. Arbitrary right-hand eigenvalues $\bar{L} a$ are proportional to possible solutions for the Fourier transform of the electric field according to the Equation (2.7). One may introduce mutually normalized “polarization” vectors $e$ and $\tilde{e}$ to the electric field by the condition

$Π = e \cdot \tilde{e}, 〈 Π 〉 = \tilde{e} e = 1, Π^{2} = e \cdot \tilde{e} = Π,$

$k ε Π = (k ε e) \tilde{e}, Π ε k = e (\tilde{e} ε k), \Rightarrow k ε e = 0, \tilde{e} ε k = 0,$ (2.18)

where in dependence on the symmetries of the operator $L (k, ω)$ the “co-vectors” $\tilde{e}$ can be often specialized, for example, to $\tilde{e} = e^{*}$ for operators $L (k, ω) = {(L (k^{*}, ω^{*}))}^{*} \equiv L^{*} (k, ω)$ .

Thus the explicit for of the projection operators for the determination of polarization vectors of the electric field are ( $ε \equiv ε (k, ω)$ )

$\begin{array}{l} Π (k, ω) = \frac{\bar{L} (k, ω)}{[L (k, ω)]} \\ = \frac{\frac{c^{4}}{ω^{4}} (k^{2}) k \cdot k - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k \cdot k - ε k \cdot k - k \cdot k ε + (k ε k) I) + \bar{ε}}{\frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε]}, \end{array}$

$〈 Π (k, ω) 〉 = 1, {(Π (k, ω))}^{2} = Π (k, ω) .$ (2.19)

The degenerate case $\bar{L} (k, ω) = 0 \Rightarrow 〈 \bar{L} (k, ω) 〉 = [L (k, ω)] = 0$ (but not true in inverse order) is the case of optic axes which we do not consider in present article in detail. However, isotropic media where all axes are “optic” axes also belong to this case.

In coordinate-invariant calculations of polarization vectors by means of the projection operator (2.19) as vectors $a$ should be taken only vectors which possess a physical meaning of the considered system. According to (2.17) we have a great selection of possible choice of vectors $a$ and $\tilde{a}$ for determination of such polarization vectors but not all are advantageous. According to $k ε e = 0$ the right-hand polarization vectors $e$ are perpendicularly to the vector $k ε$ and one should not choose vectors which form a very small angle with this vector $k ε$ as, for example, the vector $k$ since then in limiting cases $k \to k ε$ it becomes undetermined. It seems to be favorable to choose for this purpose vector products of the vectors $k ε$ or of $k$ with other vectors where the last choice is favorable since in this case the most terms in the numerator of the projection operator (2.19) are canceled. We choose first the vector product $[k, k ε]$ for which we find as (non-normalized) right-hand eigenvectors of the operator $L (k, ω)$ to the eigenvalue $λ = 0$

$\begin{matrix} Π (k, ω) [k, k ε] = \frac{(\bar{ε} - \frac{c^{2}}{ω^{2}} (k ε k) I) [k, k ε]}{\frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε]} \\ = \frac{[k ε, k ε^{2}] + \frac{c^{2}}{ω^{2}} (k ε k) [k ε, k]}{\frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε]}, \end{matrix}$ (2.20)

where the identity (B.6) was applied. This choice becomes inappropriate in limiting or other cases when $[k, k ε] = 0$ that means if they become parallel.

If we directly choose as vector $a$ one of the vectors vectors $k, ε k, ε^{2} k$ then we find as (non-normalized) polarization vectors of the electric field

$\begin{array}{l} Π (k, ω) k = \frac{\frac{c^{2}}{ω^{2}} k^{2} ((\frac{c^{2}}{ω^{2}} k^{2} - 〈 ε 〉) k + ε k) + \bar{ε} k}{\frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε]}, \\ Π (k, ω) ε k = \frac{(\frac{c^{4}}{ω^{4}} k^{2} (k ε k) - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k ε k - k ε^{2} k) + | ε |) k}{\frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε]} = 0, \\ Π (k, ω) ε^{2} k = \frac{\frac{c^{2}}{ω^{2}} (\frac{c^{2}}{ω^{2}} k^{2} (k ε^{2} k) - [ε] k ε k + | ε | k^{2}) k + (\frac{c^{2}}{ω^{2}} k ε^{2} k + | ε |) ε k - \frac{c^{2}}{ω^{2}} (k ε k) ε^{2} k}{\frac{c^{4}}{ω^{4}} {(k^{2})}^{2} - \frac{c^{2}}{ω^{2}} (〈 ε 〉 k^{2} + k ε k) + [ε]} . \end{array}$ (2.21)

where $ε k$ is inappropriate since it provides the zero vector and expresses that polarization vectors of the electric field are perpendicularly to $k ε$ . One may check that $k ε Π a = 0$ (2.18) in all cases.

Favorable representations of polarization vectors one may often find if we use in addition the vectors to optic axes in the representation of the permittivity tensor in principal axes form, most generally

$ε = ε_{1} c_{1} \cdot {\tilde{c}}_{1} + ε_{2} c_{2} \cdot {\tilde{c}}_{2} + ε_{3} c_{3} \cdot {\tilde{c}}_{3}, {\tilde{c}}_{i} c_{j} = δ_{i j},$ (2.22)

where all involved vectors and scalars may or may not depend on wave vector and frequency depending on the symmetry of the medium. In lossless case we have the simplification ${\tilde{c}}_{i} = c_{i}^{*}$ and under additional symmetry of the permittivity tensor $ε$ for homogeneous waves (real wave vector and frequency) $c_{i}^{*} = c_{i}$ . One may choose as vectors $a$ for the determination of polarization vectors the vectors of the optic axes $c_{i}, (i = 1,2,3)$ themselves or the vector products $[k, {\tilde{c}}_{i}]$ .

Another possible approach is via the vector field of the electric induction. From (2.7) using the representation (A.3) for the inverse operator $ε$ one may derive the following wave equation for the electric induction $D (k, ω)$

$0 = {\frac{c^{2}}{ω^{2} | ε (k, ω) |} (k \cdot k - (k^{2}) I) \bar{ε} (k, ω) + I} D (k, ω) \equiv L^{D} (k, ω) D (k, ω) .$ (2.23)

where we define the operator $L^{D} (k, ω)$ by

$L^{D} (k, ω) \equiv \frac{c^{2}}{ω^{2} | ε (k, ω) |} (k \cdot k - (k^{2}) I) \bar{ε} (k, ω) + I - \frac{ε (k, ω) \cdot k}{k ε (k, ω) k}, k L^{D} (k, ω) = 0.$ (2.24)

We have substituted taking into account $k D = 0$ the three-dimensional unit operator $I$ by a two-dimensional unit operator (or projection operator) $I^{'}$

$I^{'} \equiv I - \frac{ε k \cdot k}{k ε k}, {I^{'}}^{2} = I^{'}, 〈 I^{'} 〉 = 2,$ (2.25)

in such a way the operator $L^{D}$ possesses the properties

$k L^{D} (k, ω) = 0, L^{D} (k, ω) ε (k, ω) k = 0 .$ (2.26)

It is now a two-dimensional operator by multiplication from the left in the plane perpendicular to $k$ and from right in the plane perpendicular to $ε k$ . Landau and Lifshits [4] prefer for the treatment of some problems more directly the electric induction $D$ but, clearly, without formalizing this with introduction of an operator $L^{D}$ . The use of $D$ instead of $E$ possesses advantages (orthogonality to $k$ ) but also disadvantages and we do not consider this.

3. The Concept of Bi-Anisotropic Constitutive Equations

The concept of bi-anisotropic media with the special case of bi-isotropic media is more specially than the concept of spatial dispersion discussed in last Section. The basic equations of macroscopic optics are written in this concept in the following way for the Fourier transforms

$\begin{array}{l} [k, E (k, ω)] - \frac{ω}{c} B (k, ω) = 0, k B (k, ω) = 0, \\ [k, H (k, ω)] + \frac{ω}{c} D^{'} (k, ω) = 0, k D^{'} (k, ω) = 0, \end{array}$ (3.1)

where by definition

$D^{'} (k, ω) \equiv E (k, ω) + 4 π P^{'} (k, ω), H (k, ω) \equiv B (k, ω) - 4 π M (k, ω),$ (3.2)

and where $P^{'}$ is the polarization in a narrow sense and $M$ the magnetization and with constitutive equations of the following form for the Fourier transforms

$D^{'} (k, ω) = ε (ω) E (k, ω), B (k, ω) = μ (ω) H (k, ω),$ (3.3)

where we do not assume that $ε (ω)$ and $μ (ω)$ are symmetric tensors (e.g., magneto-optic effects). Usually, $H$ is called the magnetic field and $B$ the magnetic induction also $B$ is the averaged microscopic magnetic field [4]. These notions are made for the symmetries between $E ⇌ H$ and $D ⇌ B$ in the field equations but this may be confusing.

In considered case it is favorable to introduce the notion of refraction vectors $n$ by the definition ( $ω \neq 0$ )

$n \equiv \frac{c}{ω} k .$ (3.4)

The basic equations for the Fourier transforms of fields (3.1) simplify the slightly and are

$\begin{array}{l} [n, E] - B = 0, n B = 0, \\ [n, H] + D^{'} = 0, n D^{'} = 0, \end{array}$ (3.5)

together with the constitutive equations

$D^{'} = ε E, B = μ H,$ (3.6)

where we omitted to write the arguments in the fields and in the material tensors, e.g., $E (k, ω) \equiv E$ , $ε (ω) \equiv ε$ .

From (3.5) follow for a given refraction vectors $n$ the orthogonalities

$B E = [n, E] E = 0, D^{'} H = - [n, H] H = 0.$ (3.7)

It is also important to mention here that the Equations (3.5) together with (3.6) remain unchanged under the simultaneous permutations

$E \leftrightarrow H, D^{'} \leftrightarrow B, ε \leftrightarrow μ, n \leftrightarrow - n,$ (3.8)

but, clearly, all this is well known.

First, we derive a wave equation for the Fourier components $E$ of the electric field. For this purpose we use the formula (A.3) for the inverse operator and the mathematical identity (B.6) and, furthermore, the transposition of the application of an operator to a vector $A x = x A^{T}$ where $A^{T}$ is the transposed operator to $A$ and find

$\begin{matrix} 0 = [n, H] + D^{'} = [n, μ^{- 1} B] + D^{'} = \frac{[n, \bar{μ} [n, E]]}{| μ |} + D^{'} = \frac{[n, [n μ, E μ]]}{| μ |} + D^{'} \\ = \frac{[n, [μ^{T} n, μ^{T} E]]}{| μ |} + D^{'} = {\frac{μ^{T} n \cdot n μ^{T} - (n μ^{T} n) μ^{T}}{| μ |} + ε} E . \end{matrix}$ (3.9)

We write this Equation ( $E \equiv E (\frac{ω}{c} n, ω)$ )

$L^{E} (n) E = 0, \Rightarrow n L^{E} (n) E = n ε E = 0,$ (3.10)

with an operator $L^{E} (n)$ defined by

$L^{E} (n) \equiv \frac{μ^{T} n \cdot n μ^{T} - (n μ^{T} n) μ^{T}}{| μ |} + ε .$ (3.11)

Due to symmetry (3.8) in the starting Equations (3.5) and (3.6) one may immediately write down an analogous equation for $H$

$L^{H} (n) H = 0, \Rightarrow n L^{H} (n) H = n μ H = 0,$ (3.12)

with an operator $L^{H} (n)$ defined by

$L^{H} (n) \equiv \frac{ε^{T} n \cdot n ε^{T} - (n ε^{T} n) ε^{T}}{| ε |} + μ .$ (3.13)

Both operators $L^{E} (n)$ and $L^{H} (n)$ possess the form of the operator $L$ discussed in Appendix C with the following substitutions in case of $L^{E} (n)$ which we consider now

$A \to μ^{T}, B \to ε, x = \tilde{x} \to n .$ (3.14)

According to (C.7) and (C.10) its invariants are $μ \equiv μ (ω), ε \equiv ε ( ω )$

$\begin{array}{l} 〈 L^{E} (n) 〉 = \frac{n μ^{2} n - 〈 μ 〉 n μ n + | μ | 〈 ε 〉}{| μ |}, \\ [L^{E} (n)] = \frac{(n μ n) n^{2} - ((〈 μ 〉 〈 ε 〉 - 〈 μ^{T} ε 〉) n μ n - 〈 ε 〉 n μ^{2} n + n μ^{T} ε μ^{T} n) + | μ | [ε]}{| μ |}, \\ | L^{E} (n) | = \frac{(n μ n) (n ε n) - (〈 μ^{T} \bar{ε} 〉 n μ n + n μ^{T} \bar{ε} μ^{T} n) + | μ | | ε |}{| μ |} . \end{array}$ (3.15)

We wrote here the general case $μ \neq μ^{T}$ (and $ε \neq ε^{T}$ ) but it was not necessary to write the sign “T” at $μ$ for transposition in all cases because, e.g., $n μ^{T} n = n μ n$ and for all invariants holds, e.g., $〈 μ^{T} 〉 = 〈 μ 〉$ (however, e.g., $μ^{T} ε = {(ε^{T} μ)}^{T}$ )⁴. The complementary operator $\bar{L^{E}} (n)$ obtainable from (C.11) is fairly complicated and we do not write it down.

The dispersion equation for a bi-anisotropic medium is

$| L^{E} (n) | = 0,$ (3.16)

or equivalently the analogous equation for the operator $| L^{H} (n) |$ . Polarization vectors $e$ together with left-hand eigenvectors $\tilde{e}$ to the operator $L^{E} (n)$ for the electric field can be obtained by the projection operators

$Π^{E} (n) = \frac{\bar{L^{E}} (n)}{〈 \bar{L^{E}} (n) 〉} = \frac{\bar{L^{E}} (n)}{[L^{E} (n)]} = e \cdot \tilde{e}, 〈 Π^{E} (n) 〉 = \tilde{e} e = 1,$ (3.17)

but this is complicated and we will it only do for the special case of bi-anisotropic uniaxial media in next Section.

4. Bi-Anisotropic Media as Special Case of Spatial Dispersion

The approach to the linear optics of media by spatial dispersion is much more general than the approach by bi-anisotropic media which is mainly interesting for its symmetries between electric and magnetic quantities and I am not a fan of the last for reason which will become clear in the following. Spatial dispersion is often discussed with expansion of the tensor $ε_{i j} (k, ω)$ into powers of the wave vector as follows [1] [2] [3] [4]

$ε_{i j} (k, ω) = ε_{i j} (ω) + i \frac{c}{ω} γ_{i j k} (ω) k_{k} + \frac{c^{2}}{ω^{2}} α_{i j k l} (ω) k_{k} k_{l} + \dots,$

$\begin{matrix} ε_{i j}^{- 1} (k, ω) = ε_{i j}^{- 1} (ω) + i \frac{c}{ω} δ_{i j k} (ω) k_{k} + \frac{c^{2}}{ω^{2}} β_{i j k l} (ω) k_{k} k_{l} + \dots \\ = ε_{i j}^{- 1} (ω) - i \frac{c}{ω} ε_{i m}^{- 1} (ω) γ_{m n k} (ω) ε_{n j}^{- 1} (ω) k_{k} \\ - \frac{c^{2}}{ω^{2}} ε_{i m}^{- 1} (ω) (α_{m n k l} (ω) + γ_{m p k} (ω) ε_{p q}^{- 1} (ω) γ_{q n l} (ω)) ε_{n j}^{- 1} (ω) k_{k} k_{l} - \dots . \end{matrix}$ (4.1)

The concept of bi-anisotropic media with the constitutive Equations (3.3) can be expressed as a special case of the concept of spatial dispersion with the following dependence of tensor $ε_{i j} (k, ω)$ on the wave vector $k$

$\begin{matrix} ε_{i j} (k, ω) = ε_{i j} (ω) + \frac{c^{2}}{ω^{2}} α_{i j k l} (ω) k_{k} k_{l} \\ = ε_{i j} (ω) + \frac{c^{2}}{ω^{2}} ϵ_{i k m} ϵ_{j l n} (δ_{m n} - μ_{m n}^{- 1} (ω)) k_{k} k_{l} . \end{matrix}$ (4.2)

This can be expressed in a representation without indices in the form of Equation (2.7) with the general permittivity $ε (k, ω)$ by ( $A^{T}$ means transposition of $A$ ; definition $k \equiv \frac{ω}{c} n$ would shorten the following representation)

$ε (k, ω) = ε (ω) + \frac{c^{2}}{ω^{2}} (\frac{μ^{T} k \cdot k μ^{T} - (k μ k) μ^{T}}{| μ |} - (k \cdot k - k^{2} I)) .$ (4.3)

In the general concept of spatial dispersion a bi-anisotropic medium appears as second-order effect in the expansion of the general tensor $ε_{i j} (k, ω)$ in powers of the wave vector $k$ . It does, however, not possess the general form of a tensor of forth rank $α_{i j k l} (ω)$ with only symmetry in the last both indices $(k, l)$ and, furthermore, sum terms which are linear in the wave vector $k$ are completely absent (e.g., optic gyrotropy). The reason that the tensor $α_{i j k l} (ω)$ does not possess in the concept of bi-anisotropy the general form of such a tensor comes from the neglect of electric quadrupole terms and also of higher electric and magnetic multipole terms in the expansion of the general polarization $P (k, ω)$ in powers of the wave vector $k$ . Apart from the first term $P (ω)$ which mostly provides the greatest contribution and is uniquely defined the higher contributions are difficult to separate from each other since in multipole expansions only the first multipole moment which is non-vanishing is uniquely defined whereas the others depend on the chosen origin of the multipole expansion. The magnitude of the different terms from the multipole effects is difficult to assess but one has to assume that within terms of the same order they should be comparable.

The concept of bi-isotropy with $ε (ω)$ and $μ (ω)$ as scalars in the constitutive equations is old and goes back under other names to the development of macroscopic electrodynamics by the Maxwell equations and its generalization to bi-anisotropy by transition to second-rank tensors $ε_{i j} (ω)$ and $μ_{i j} (ω)$ is natural. However, the last leads usually to very complicated formulae if one calculates propagation and reflection and refraction problems (amplitudes included), moreover, if this is made by coordinate methods. The most comprehensive and unrivaled representation was given by Szivessy [8] and long ago I thought that it remains the last which works mainly with coordinate methods. However, more than in most other sources in this respect is made in the book of Fyodorov [9] with coordinate-invariant methods which he initiated and developed. In the book [10] of the same author the concept of bi-anisotropy (Fyodorov calls it “crystals with electric and magnetic anisotropy”) is extended to inclusion of optic gyrotropy that even by coordinate-invariant treatment leads as a rule to very complicated formulae. The last chapter in this book contains linear algebra in three-dimensional Euclidean space in a form which is very useful for the application of coordinate-invariant calculations in three-dimensional spaces (chap. IV, pp. 362-450)⁵.

An extended concept of bi-anisotropy in the basic equations is maintained, in particular, in the very versatile monograph of de Groot [23] and in nonlinear optics by Bloembergen [24] (called the “Netherland school” in [14] with inclusion of some other authors). Furthermore, there are articles to the calculation of the dyadic Green functions to the Huygens principle for bi-anisotropic media ( [15] [25] and, e.g., Weiglhofer [26] with many citations published in: [27] ).

It is necessary to report here also about an unprecedented scientific plagiarism in form of a book from 1983 by Hollis C. Chen, a professor of the Ohio University in U.S.A, about which I was informed by Fyodor Ivanovich Fyodorov in the middle of the eighties. This book and papers of Chen cannot be cited in normal way under references and I make some remarks to this case in the following footnote⁶.

5. Optic Uniaxial Bi-Anisotropic Media

We consider now the special case of optic uniaxial bi-anisotropic media which is determined by the following tensors $ε$ and $μ$

$\begin{array}{l} ε (ω) = ε^{e} (ω) c \cdot c + ε^{o} (ω) (I - c \cdot c), \\ μ (ω) = μ^{e} (ω) c \cdot c + μ^{o} (ω) (I - c \cdot c), (c^{2} = 1) . \end{array}$ (5.1)

where the tensors $ε$ and $μ$ considered as operators are symmetric and commute (as consequence of axial symmetry with the same axes for the electric and magnetic properties)

$μ (ω) = μ^{T} (ω), ε (ω) = ε^{T} (ω), μ (ω) ε (ω) = ε (ω) μ (ω) .$ (5.2)

With $c$ we have denoted a unit vector in direction of the common optic axis of the permittivity tensor $ε$ and the permeability tensor $μ$ of the uniaxial bi-anisotropic medium (notations of Fyodorov [9] ) and $ε^{e}, ε^{o}$ and $μ^{e}, μ^{o}$ (upper indices “e” and “o” stand for “extraordinary” and “ordinary”) are frequency depend material scalars. The complementary operators and the invariants, for example, for $ε \equiv ε (ω)$ are (similarly, $μ ( ω ))$

$\begin{array}{l} \bar{ε} = ε^{o} (ε^{o} c \cdot c + ε^{e} (I - c \cdot c)), \\ 〈 ε 〉 = ε^{e} + 2 ε^{o}, [ε] = ε^{o} (2 ε^{e} + ε^{o}) = 〈 \bar{ε} 〉, | ε | = ε^{e} {(ε^{o})}^{2} . \end{array}$ (5.3)

Using this together with $| L^{E} (n) |$ in (3.15) and $L^{H} (n)$ in (3.13) one may specialize this determinant to

$μ^{e} {(μ^{o})}^{2} | L^{E} (n) | = (n μ n - ε^{o} μ^{e} μ^{o}) (n ε n - μ^{o} ε^{e} ε^{o}) = ε^{e} {(ε^{o})}^{2} | L^{H} (n) | .$ (5.4)

The dispersion equation that means the vanishing of the determinants $| L^{E} (n) |$ or $| L^{H} (n) |$ decomposes into a product of two separate equations as follows

$\begin{array}{l} 0 = n μ n - ε^{o} μ^{e} μ^{o} \leftrightarrow \frac{{(n c)}^{2}}{ε^{o} μ^{o}} + \frac{{[n, c]}^{2}}{ε^{o} μ^{e}} = 1, \\ 0 = n ε n - μ^{o} ε^{e} ε^{o} \leftrightarrow \frac{{(n c)}^{2}}{μ^{o} ε^{o}} + \frac{{[n, c]}^{2}}{μ^{o} ε^{e}} = 1, \end{array}$ (5.5)

which for real positive parameters $ε^{o}, ε^{e}, μ^{o}, μ^{e}$ represent two rotation ellipsoids with axes lengths which are the square roots of the denominators in (5.5) and with equal axis length in direction of the optic axis. This means that the two rotation ellipsoids touches in axis direction. The determination of polarization vectors via projection operators as described in Section 2 seems to be too tedious and we choose a more special approach. Due to $n ε Ε = 0$ polarization vectors of the electric field have to be perpendicular to the vector $n ε$ and are therefore representable by the vector product of $n ε$ with a vector which possesses a component in the plane perpendicular to $n ε$ .

We consider this for the first dispersion equation $n μ n = ε^{o} μ^{e} μ^{o}$ in (5.5). Using this equation a (non-normalized) polarization vector $e^{'}$ for the electric field with the proposition $e^{'} = [n μ, c]$ according to (3.5) and (3.6) has to satisfy the equation

$\begin{matrix} 0 = {\frac{μ n \cdot n μ - (n μ n) μ}{| μ |} + ε} e^{'} = \frac{(μ n \cdot n μ - (n μ n) μ) [n μ, c]}{| μ |} + ε e^{'} \\ = - \frac{ε^{o} μ^{e} μ^{o}}{μ^{e} {(μ^{o})}^{2}} μ [n μ, c] + ε e^{'} = - \frac{ε^{o}}{μ^{o}} (μ^{e} c \cdot c + μ^{o} (I - c \cdot c)) μ^{o} [n, c] + ε e \\ = - ε^{o} μ^{o} [n, c] + ε e^{'} \end{matrix}$ (5.6)

from which follows

$\begin{matrix} e^{'} = ε^{o} μ^{o} ε^{- 1} [n, c] = ε^{o} μ^{o} (\frac{1}{ε^{e}} c \cdot c + \frac{1}{ε^{o}} (I - c \cdot c)) [n, c] \\ = μ^{o} [n, c] = \frac{μ^{o}}{ε^{o}} [n ε, c] . \end{matrix}$ (5.7)

For a (non-normalized) polarization vector $h^{'}$ of the magnetic field follows then from (3.5) using (5.7) and in addition the first of the Equations (5.5)

$\begin{matrix} h^{'} = μ^{- 1} b^{'} = μ^{- 1} [n, e^{'}] = μ^{o} (\frac{1}{μ^{e}} c \cdot c + \frac{1}{μ^{o}} (I - c \cdot c)) [n, [n, c]] \\ = - \frac{μ^{o}}{μ^{e}} {[n, c]}^{2} c + (n c) [c, [n, c]] = (n c) n - ε^{o} μ^{o} c = \frac{1}{μ^{e}} [n μ, [n, c]] . \end{matrix}$ (5.8)

To get the analogous relations for the second dispersion equation $n ε n = μ^{o} ε^{e} ε^{o}$ in (5.5) one has only to apply the symmetry relations (3.8). Thus we find in this case a (non-normalized) polarization vector $h$

$h^{'} = ε^{o} [n, c] = \frac{ε^{o}}{μ^{o}} [n μ, c],$ (5.9)

and a (non-normalized) polarization vector $e$

$e^{'} = (n c) n - μ^{o} ε^{o} c = \frac{1}{μ^{e}} [n ε, [n, c]] .$ (5.10)

For non-normalized polarization vectors scalar factors are unimportant and can be omitted.

We now give preference to (non-normalized) polarization vectors of the electric field and omit there the unfavorable factors. Then we have for the first dispersion equations

$\begin{array}{l} n μ n = ε^{o} μ^{e} μ^{o} : \\ e = [n ε, c] = ε^{o} [n, c], h = \frac{ε^{o}}{μ^{e} μ^{o}} [n μ, [n, c]] = \frac{ε^{o}}{μ^{o}} ((n c) n - ε^{o} μ^{o} c), \end{array}$ (5.11)

and for the second dispersion equation

$\begin{array}{l} n ε n = μ^{o} ε^{e} ε^{o} : \\ e = [n ε, [n, c]] = ε^{e} ((n c) n - μ^{o} ε^{o} c), h = \frac{ε^{e} ε^{o}}{μ^{o}} [n μ, c] = ε^{e} ε^{o} [n, c] . \end{array}$ (5.12)

It is not difficult to make normalization of the polarization vectors. We left the factors in such a way that to each dispersion Equations (5.11) and (5.12) separately the vector $h$ follows from $e$ without changing a factor. It is also easy to make a transition to normalized polarization vectors that we do not write down.

The transition to the special case of “only electrically” uniaxial media can be made by definition in (5.13) by the substitution

$ε (ω) = ε^{e} (ω) c \cdot c + ε^{o} (ω) (I - c \cdot c), μ (ω) = I, \to μ^{e} (ω) = μ^{o} (ω) = 1.$ (5.13)

The two dispersion equations become asymmetric to each other and are the first for ordinary and the second for extraordinary waves

$\begin{array}{l} 0 = n^{2} - ε^{o} \leftrightarrow \frac{{(n c)}^{2} + {[n, c]}^{2}}{ε^{o}} = \frac{n^{2}}{ε^{o}} = 1, \\ 0 = n ε n - ε^{e} ε^{o} \leftrightarrow \frac{{(n c)}^{2}}{ε^{o}} + \frac{{[n, c]}^{2}}{ε^{e}} = 1. \end{array}$ (5.14)

The (non-normalized) polarization vectors for the electric and magnetic field (5.11) and (5.12) become for ordinary waves

$\begin{array}{l} n^{2} = ε^{o} : \\ e = [n ε, c] = ε^{o} [n, c], h = ε^{o} [n, [n, c]] = ε^{o} ((n c) n - ε^{o} c), \end{array}$ (5.15)

and for extraordinary waves

$\begin{array}{l} n ε n = ε^{e} ε^{o} : \\ e = [n ε, [n, c]] = ε^{e} ((n c) n - ε^{o} c), h = ε^{e} ε^{o} [n, c] . \end{array}$ (5.16)

Thus (electrically) ordinary waves are polarized perpendicular to the axis plane spanned by the axis vector $c$ and the refraction vector $n$ and extraordinary waves within this plane. Amplitude relations for reflection and refraction at the boundary between an isotropic and a uniaxial medium can be found, e.g., in [9], in [11] (a little too complicated) and in [13].

6. Group Velocity and Diffraction Coefficients

In a preliminary summary about the two discussed concepts one can say that spatial dispersion is the more general concept but the concept of bi-anisotropy leads to interesting symmetries between the electric and magnetic properties of media and is up to now often the only concept represented in excellent monographs, e.g., [7]. However, in the last concept it is difficult to include some phenomena such as, for example, natural optical activity although this is tried to make in the book [10] of Fyodorov. The concept of bi-anisotropy is used in the whole work of the Minsk Group [9] [10] [11]. Practically, in all older works about classical optics the concept of bi-anisotropy is used but not under this name and this development is comprehensively represented in the encyclopedic article of Szivessy [8]. One cannot be sure that all terms of a same level of spatial dispersion in a certain order in the wave vectors are included in this more symmetric bi-anisotropic concept or are included in doubled way. In every case one has to calculate anew such quantities as the group velocity in comparison to the classical optics with only one frequency-dependent permittivity tensor $ε (ω)$ with and without taking into account frequency dispersion and this is mostly not easy.

We now consider the concept of spatial dispersion with the permittivity tensor $ε (k, ω)$ in the wave Equation (2.7) with the operator $L (k, ω)$ given in (2.11) together with their invariants in (2.12). The dispersion equation that is the vanishing of the determinant of $L (k, ω)$ can be resolved in a function $ω = ω (k)$ with different possible branches and if we insert this function into the dispersion equation one obtains identities of the form

$0 = | L (k, ω) |, \Rightarrow ω = ω (k),$ (6.1)

where $| L (k, ω (k)) |$ depends only on the wave vector $k$ . We introduce two important notions and prepare its calculation for specialized cases. If we differentiate the identity (6.1) one and two times with respect to $k$ according to (we abbreviate $L \equiv L (k, ω (k))$ )

$\begin{array}{l} 0 = \frac{\partial | L |}{\partial k_{i}} + \frac{\partial | L |}{\partial ω} \frac{\partial ω}{\partial k_{i}}, \\ 0 = \frac{\partial^{2} | L |}{\partial k_{i} \partial k_{j}} + \frac{\partial^{2} | L |}{\partial k_{i} \partial ω} \frac{\partial ω}{\partial k_{j}} + \frac{\partial^{2} | L |}{\partial k_{j} \partial ω} \frac{\partial ω}{\partial k_{i}} + \frac{\partial^{2} | L |}{\partial ω^{2}} \frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}} . \end{array}$ (6.2)

We define the group velocity

$v_{i} \equiv \frac{\partial ω}{\partial k_{i}} = - \frac{\frac{\partial | L |}{\partial k_{i}}}{\frac{\partial | L |}{\partial ω}},$ (6.3)

and in addition the symmetric diffraction coefficients

$\begin{matrix} W_{i j} \equiv \frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}} = - \frac{\frac{\partial^{2} | L |}{\partial k_{i} \partial k_{j}} + \frac{\partial^{2} | L |}{\partial k_{i} \partial ω} \frac{\partial ω}{\partial k_{j}} + \frac{\partial^{2} | L |}{\partial k_{j} \partial ω} \frac{\partial ω}{\partial k_{i}}}{\frac{\partial^{2} | L |}{\partial ω^{2}}} \\ = - \frac{1}{\frac{\partial^{2} | L |}{\partial ω^{2}}} {\frac{\partial^{2} | L |}{\partial k_{i} \partial k_{j}} - \frac{\frac{\partial^{2} | L |}{\partial k_{i} \partial ω} \frac{\partial | L |}{\partial k_{j}} + \frac{\partial^{2} | L |}{\partial k_{j} \partial ω} \frac{\partial | L |}{\partial k_{i}}}{\frac{\partial | L |}{\partial ω}}} = W_{j i}, \end{matrix}$ (6.4)

where $W_{i j} = W_{j i}$ is a symmetric bilinear form. Both become important for the beam propagation in second-order approximation.

The formula for the group velocity in (10.1) using (2.18) can be written

$v_{i} = - \frac{〈 \bar{L} \frac{\partial L}{\partial k_{i}} 〉}{〈 \bar{L} \frac{\partial L}{\partial ω} 〉} = - \frac{e^{*} \frac{\partial L}{\partial k_{i}} e}{e^{*} \frac{\partial L}{\partial ω} e}, \bar{L} \propto e \cdot e^{*},$ (6.5)

where is taken into account that $\bar{L}$ is proportional to the dyadic product of (in general, non-normalized) polarization vectors $e \cdot e^{*}$ of the electric field (see (2.18); $\tilde{e} = e^{*}$ since we consider the lossless case). We find

$\begin{array}{l} \frac{\partial L_{k l}}{\partial k_{i}} = \frac{c^{2}}{ω^{2}} (k_{k} δ_{i l} + δ_{i k} k_{l} - 2 k_{i} δ_{k l}) + \frac{\partial ε_{k l}}{\partial k_{i}}, \\ \frac{\partial L_{k l}}{\partial ω} = - 2 \frac{c^{2}}{ω^{3}} (k_{k} k_{l} - k^{2} δ_{k l}) + \frac{\partial ε_{k l}}{\partial ω}, \end{array}$ (6.6)

from which follows

$\begin{array}{l} 〈 \bar{L} \frac{\partial L}{\partial k_{i}} 〉 = \frac{c^{2}}{ω^{2}} ({(\bar{L} k)}_{i} + {(k \bar{L})}_{i} - 2 〈 \bar{L} 〉 k_{i}) + 〈 \bar{L} \frac{\partial ε}{\partial k_{i}} 〉, \\ 〈 \bar{L} \frac{\partial L}{\partial ω} 〉 = - 2 \frac{c^{2}}{ω^{3}} (k \bar{L} k - 〈 \bar{L} 〉 k^{2}) + 〈 \bar{L} \frac{\partial ε}{\partial ω} 〉, 〈 \bar{L} 〉 = [L] \end{array}$ (6.7)

Neglect of spatial dispersion means that we do not take into account the term $〈 \bar{L} \frac{\partial ε}{\partial k_{i}} 〉$ in the numerator and neglect of frequency dispersion the term $〈 \bar{L} \frac{\partial ε}{\partial ω} 〉$ in the denominator of the formula for the group velocity (6.5).

Under neglect of dispersion one finds from $| L (k, ω) |$ explicitly given in (2.12)

$v_{i} = ω \frac{\frac{c^{2}}{ω^{2}} (k^{2} ({(ε k)}_{i} + {(k ε)}_{i}) + 2 (k ε k) k_{i}) - 〈 ε 〉 ({(ε k)}_{i} + {(k ε)}_{i}) + ({(ε^{2} k)}_{i} + {(k ε^{2})}_{i})}{2 (2 \frac{c^{2}}{ω^{2}} (k^{2} (k ε k) - 〈 ε 〉 k ε k + k ε^{2} k))} .$ (6.8)

From this follows after scalar multiplication with $k_{i}$ follows, e.g., [4] [9]

$k v = ω, \Rightarrow \frac{k}{ω} \frac{\partial ω}{\partial k} \equiv n s = 1,$ (6.9)

with definition of the refraction vector $n$ and of the ray vector $s$ by (notations as in [4] )

$n \equiv \frac{c k}{ω}, s \equiv \frac{v}{c} .$ (6.10)

One should not forget that the relation (6.9) is derived under neglect of the dispersion of the permittivity tensor $ε (k, ω)$ and the differences between ray vector and group velocity in regions near to resonance frequencies or zeros of the permittivity tensor can become very important and even the direction of the group velocity can be changed by this additional terms.

7. Electrically and Magnetically Isotropic Media and Group Velocity

The constitutive equations for bi-isotropic or electrically and magnetically isotropic media are

$D (k, ω) = ε (ω) E (k, ω), B (k, ω) = μ (ω) H (k, ω),$ (7.1)

with scalar functions $ε (ω)$ and $μ (ω)$ . The equation for the electric field (3.10) in the concept of bi-anisotropy with the specialized operator (3.11) after multiplication with $μ (ω)$ becomes

$0 = μ (ω) L^{E} (n) E (\frac{ω}{c} n, ω) = {n \cdot n - n^{2} I + ε (ω) μ (ω) I} E (\frac{ω}{c} n, ω)$ (7.2)

or equivalently by transition to the more general concept of spatial dispersion with $ε (k, ω) = ε (ω) μ (ω) I$

$0 = L (k, ω) E (k, ω) = {\frac{c^{2}}{ω^{2}} (k \cdot k - k^{2} I) + ε (ω) μ (ω) I} E (k, ω) .$ (7.3)

The dispersion equation for transversal waves polarized for both $E$ and $B$ in the plane perpendicular to wave vector $k$

$\frac{c^{2}}{ω^{2}} k^{2} \equiv n^{2} = ε (ω) μ (ω), k E (k, ω) = 0,$ (7.4)

and for longitudinal waves

$ε (ω) μ (ω) = 0, E (ω) \neq 0, B (ω) = 0.$ (7.5)

The longitudinal waves correspond in present approximation to pure temporal oscillations of the electric field with arbitrary possible direction of polarization (since $k = 0$ ). We are interested here merely in the transversal waves.

The dispersion Equation (7.4) for transversal waves can be resolved in the form $ω = ω (k)$ (6.1) with different branches for $ω (k)$ . In Sections 9 and 10 we will consider in detail an example where the dispersion Equation (7.4) can be explicitly resolved in the form (6.1) with different branches, the permittivity for polaritons. By differentiation of the dispersion equation in the form (6.1) with respect to the wave vector $k$ one may derive a general formula for the group velocity $v \equiv \frac{\partial ω}{\partial k}$ for bi-isotropic media and also for higher coefficients $\frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}}$ and so on which play a role in higher approximations of propagation of beam-like waves in such media (diffraction). For the group velocity $v$ one finds the general formula ( $k^{2} = {| k |}^{2}$ )

$v \equiv \frac{\partial ω}{\partial k} = \frac{2 c^{2} k}{\frac{\partial}{\partial ω} (ω^{2} ε (ω) μ (ω))} = c \frac{c | k |}{ω (ε (ω) μ (ω) + \frac{ω}{2} \frac{\partial}{\partial ω} (ε (ω) μ (ω)))} \frac{k}{| k |} .$ (7.6)

Without taking into account the frequency dispersion of the permittivities we find

$v^{'} \equiv \frac{\partial ω}{\partial k} = \frac{2 c^{2} k}{ε (ω) μ (ω) \frac{\partial}{\partial ω} (ω^{2})} = c \frac{c | k |}{ω ε (ω) μ (ω)} \frac{k}{| k |} = \frac{c}{\sqrt{ε (ω) μ (ω)}} \frac{k}{| k |} .$ (7.7)

One may introduce a dispersion factor $α_{disp} (ω)$ by

$\begin{array}{l} α_{disp} (ω) \equiv \frac{ε (ω) μ (ω) \frac{\partial}{\partial ω} (ω^{2})}{\frac{\partial}{\partial ω} (ω^{2} ε (ω) μ (ω))} \\ = \frac{ε (ω) μ (ω)}{ε (ω) μ (ω) + \frac{ω}{2} \frac{\partial}{\partial ω} (ε (ω) μ (ω))} . \end{array}$ (7.8)

From (7.7) follows using the dispersion Equation (7.4)

$\begin{matrix} \frac{k}{ω} \frac{\partial ω}{\partial k} = \frac{1}{1 + \frac{ω}{2 ε (ω) μ (ω)} \frac{\partial}{\partial ω} (ε (ω) μ (ω))} \\ = \frac{1}{\frac{ω}{2} \frac{\partial}{\partial ω} (\log (ω^{2} ε (ω) μ (ω)))} \equiv α_{disp} (ω), \end{matrix}$ (7.9)

where we have given also a representation of the dispersion coefficient $α_{disp} (ω)$ by a logarithmic derivative (useful or not?). Under neglect of dispersion that means if we do not take into account the first derivative of $ε (ω) μ (ω)$ with respect to frequency $ω$ we have ( $s$ is called ray vector, e.g., [4], §97, or [9] $(n, s) \to (m, p)$ )

$\frac{k}{ω} \frac{\partial ω}{\partial k} \equiv n \underset{\equiv s}{\underset{︸}{\frac{v}{c}}} = 1, \Rightarrow α_{disp} (ω) = 1.$ (7.10)

Therefore, the dispersion coefficient $α_{disp} (ω)$ says by which factor one has to modify the group velocity in comparison to neglect of dispersion if one take it into account. It goes also into some other formulae as, for example, the energy of the wave solution. On this very general level of treatment we cannot say whether or not $α_{disp} (ω)$ is in every case positive for possible real-valued functions $ε (ω) μ (ω)$ . In last case of negative $α_{disp} (ω)$ the genuine group velocity and the ray vector would have even opposite directions. The introduction of ray vectors $s$ in addition to the refraction vectors $n$ is appropriate to formulate duality (or symmetry) relations between electric and magnetic quantities which leave invariant the basic equations of macroscopic optics [4], (§ 97) such as (for $μ (ω) = I$ )

$E \leftrightarrow D, ε_{i j} \leftrightarrow ε_{i j}^{- 1}, n \leftrightarrow s,$ (7.11)

but one should not forget that these are only approximate relations and are only true under neglect of dispersion of the permittivities.

A second coefficient which we will consider is the relation of the modulus $| v |$ of the group velocity $v$ to the light velocity c for which one derives from (7.7) the relation

$\frac{v}{c} = \frac{\sqrt{ε (ω) μ (ω)}}{ε (ω) μ (ω) + \frac{ω}{2} \frac{\partial}{\partial ω} (ε (ω) μ (ω))} \frac{k}{| k |} = β (ω) \frac{k}{| k |}, (ω > 0),$ (7.12)

with definition

$β (ω) \equiv \frac{k v}{| k | c} = \frac{\sqrt{ε (ω) μ (ω)}}{ε (ω) μ (ω) + \frac{ω}{2} \frac{\partial}{\partial ω} (ε (ω) μ (ω))} = \frac{α_{disp} (ω)}{\sqrt{ε (ω) μ (ω)}} .$ (7.13)

For negative or complex $ε (ω) μ (ω)$ it becomes imaginary or complex and is then not to interpret in easy way.

Let us write down at this opportunity the general form of the second-order coefficients $\frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}}$ for bi-isotropic media which are

$\begin{matrix} \frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}} = \frac{2 c^{2} δ_{i j} - \frac{\partial^{2}}{\partial ω^{2}} (ω^{2} ε (ω) μ (ω)) v_{i} v_{j}}{\frac{\partial}{\partial ω} (ω^{2} ε (ω) μ (ω))} \\ = \frac{2 c^{2}}{\frac{\partial}{\partial ω} (ω^{2} ε (ω) μ (ω))} \\ \cdot {δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}} + (1 - \frac{2 ω^{2} ε (ω) μ (ω) \frac{\partial^{2}}{\partial ω^{2}} (ω^{2} ε (ω) μ (ω))}{{(\frac{\partial}{\partial ω} (ω^{2} ε (ω) μ (ω)))}^{ 2}}) \frac{k_{i} k_{j}}{{| k |}^{2}}} . \end{matrix}$ (7.14)

As was to expect they are a linear combination of $δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}}$ and $\frac{k_{i} k_{j}}{{| k |}^{2}}$ the only second-rank symmetric tensors which can be built from vectors $k$ alone and which are covariant under transformations of the rotation group $S O (3)$ . The group velocity and the diffraction coefficients are involved in the expansion of the equation for the slowly varying amplitudes of beams with respect to spatial and temporal derivatives. We will shortly consider the corresponding equations in next Section.

8. Approximate Beam Equations for Homogeneous Isotropic Media

We consider an isotropic medium with permittivity $ε (ω)$ and for simplicity with $μ (ω) = 1$ . The wave equation for the electric field with the involved operator $L (k, ω)$ in such a medium is

$0 = L (- i \nabla, i \frac{\partial}{\partial t}) E (r, t), L (k, ω) \equiv \frac{c^{2}}{ω^{2}} (k \cdot k - k^{2} I) + ε (ω) I .$ (8.1)

The necessary equation for solutions is the vanishing of the determinant

$0 = | L (- i \nabla, i \frac{\partial}{\partial t}) | E (r, t) .$ (8.2)

We make now the proposition of slowly varying amplitudes $E_{0} (r, t)$

$E (r, t) = E_{0} (r, t) e^{i (k_{0} r - ω_{0} t)} + E_{0}^{*} (r, t) e^{- i (k_{0} r - ω_{0} t)} .$ (8.3)

Inserting this into (8.2) we find first

$\begin{array}{l} 0 = e^{i (k_{0} r - ω_{0} t)} L (k_{0} - i \nabla, ω_{0} + i \frac{\partial}{\partial t}) E_{0} (r, t) \\ + e^{- i (k_{0} r - ω_{0} t)} L (- k_{0} - i \nabla, ω_{0} - i \frac{\partial}{\partial t}) E_{0}^{*} (r, t), \end{array}$ (8.4)

We suppose that both sum parts are separated in a way that we can set them equal to zero independently. For example, we may think that we include into one part all frequencies $0 < ω < + \infty$ and into the other part all frequencies $- \infty < ω < 0$ . With this assumption we have the equation

$\begin{matrix} 0 = L (k_{0} - i \nabla, ω_{0} + i \frac{\partial}{\partial t}) E_{0} (r, t) \\ = {L_{0} - i ({(\frac{\partial L}{\partial k})}_{0} \nabla - {(\frac{\partial L}{\partial ω})}_{0} \frac{\partial}{\partial t}) - \frac{1}{2} ({(\frac{\partial^{2} L}{\partial k_{i} \partial k_{j}})}_{0} \nabla_{i} \nabla_{j} \\ - 2 {(\frac{\partial^{2} L}{\partial k_{i} \partial ω})}_{0} \nabla_{i} \frac{\partial}{\partial t} + {(\frac{\partial^{2} L}{\partial ω^{2}})}_{0} \frac{\partial^{2}}{\partial t^{2}}) + \dots} E_{0} (r, t), \end{matrix}$ (8.5)

where we wrote the first terms in an expansion in powers of the differential operators. From this equation follows as necessary condition for all components of the solutions $E_{0} (r, t)$ the vanishing of the determinant and with the analogous expansion as in (8.5)

$\begin{matrix} 0 = | L (k_{0} - i \nabla, ω_{0} + i \frac{\partial}{\partial t}) | E_{0} (r, t) \\ = {| L_{0} | - i ({(\frac{\partial | L |}{\partial k_{i}})}_{0} \nabla_{i} - {(\frac{\partial | L |}{\partial ω})}_{0} \frac{\partial}{\partial t}) - \frac{1}{2} ({(\frac{\partial^{2} | L |}{\partial k_{i} \partial k_{j}})}_{0} \nabla_{i} \nabla_{j} \\ - 2 {(\frac{\partial^{2} | L |}{\partial k_{i} \partial ω})}_{0} \nabla_{i} \frac{\partial}{\partial t} + {(\frac{\partial^{2} | L |}{\partial ω^{2}})}_{0} \frac{\partial^{2}}{\partial t^{2}}) + \dots} E_{0} (r, t), \end{matrix}$ (8.6)

plus the corresponding complex conjugate equation. With the general formula for the differentiation of the determinant $\frac{\partial}{\partial λ} | A | = 〈 \bar{A} \frac{\partial}{\partial λ} A 〉$ of an operator $A$ with respect to a parameter $λ$ this equation may be also written (we do not insert a more complicated formula for the second derivative of a determinant with respect to two parameters which also exists)

$\begin{matrix} 0 = {- i (〈 {\bar{L}}_{0} {(\frac{\partial L}{\partial k_{i}})}_{0} 〉 \nabla_{i} - 〈 {\bar{L}}_{0} {(\frac{\partial L}{\partial ω})}_{0} 〉 \frac{\partial}{\partial t}) - \frac{1}{2} ({(\frac{\partial^{2} | L |}{\partial k_{i} \partial k_{j}})}_{0} \nabla_{i} \nabla_{j} \\ - 2 {(\frac{\partial^{2} | L |}{\partial k_{i} \partial ω})}_{0} \nabla_{i} \frac{\partial}{\partial t} + {(\frac{\partial^{2} | L |}{\partial ω^{2}})}_{0} \frac{\partial^{2}}{\partial t^{2}}) + \dots} E_{0} (r, t) . \end{matrix}$ (8.7)

with

$| L_{0} | \equiv | L (k_{0}, ω_{0}) | = 0,$ (8.8)

and where index 0 means that the derivatives are to take at $(k, ω) = (k_{0}, ω_{0})$ . By division of this equation with $〈 {({\bar{L}}_{0} \frac{\partial L}{\partial ω})}_{0} 〉$ one finds

$\begin{array}{l} 0 = {i (\frac{\partial}{\partial t} - \frac{〈 {\bar{L}}_{0} {(\frac{\partial L}{\partial k_{i}})}_{0} 〉}{〈 {\bar{L}}_{0} {(\frac{\partial L}{\partial ω})}_{0} 〉} \nabla_{i}) \\ - \frac{1}{2} \frac{{(\frac{\partial^{2} | L |}{\partial k_{i} \partial k_{j}})}_{0} \nabla_{i} \nabla_{j} - 2 {(\frac{\partial^{2} | L |}{\partial k_{i} \partial ω})}_{0} \nabla_{i} \frac{\partial}{\partial t} + {(\frac{\partial^{2} | L |}{\partial ω^{2}})}_{0} \frac{\partial^{2}}{\partial t^{2}}}{〈 {({\bar{L}}_{0} \frac{\partial L}{\partial ω})}_{0} 〉} + \dots} E_{0} (r, t) . \end{array}$ (8.9)

The dispersion equation $| L (k, ω) | = 0$ can be resolved in the form $ω = ω (k)$ for the different branches of the solution. In application to the slowly varying amplitudes with average wave vector $k_{0}$ and frequency $ω = ω_{0}$ this means the resolution

$\begin{matrix} 0 = {ω_{0} + i \frac{\partial}{\partial t} - ω (k_{0} - i \nabla)} E_{0} (r, t) \\ = {i (\frac{\partial}{\partial t} + v_{0} \nabla) + \frac{1}{2} \nabla W_{0} \nabla + \dots} E_{0} (r, t), (ω_{0} = ω_{0} (k_{0})), \end{matrix}$ (8.10)

with the group velocity $v_{0}$ and the quadratic form $W_{0}$ defined by

$v_{0} \equiv {(\frac{\partial ω}{\partial k})}_{0}, W_{0} \equiv {(\frac{\partial^{2} ω}{\partial k \cdot \partial k})}_{0}, (or W_{0, i j} \equiv {(\frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}})}_{0}) .$ (8.11)

Obviously (8.9) and (8.10) are identical and one may find the correspondences.

We define the polarization vectors $e_{0}$ and $e_{0}^{*}$ which are right-hand and left-hand eigen-vectors to the operator $L_{0}$ to eigenvalue 0 according to

$0 = L_{0} e_{0}, 0 = e_{0}^{*} L_{0}, k_{0} e_{0} = 0, e_{0}^{*} k_{0} = 0.$ (8.12)

By comparison of (8.9) with (8.10) we find for the group velocity $v_{0}$

$v_{0, i} \equiv {(\frac{\partial ω}{\partial k_{i}})}_{0} = - \frac{{(\frac{\partial | L |}{\partial k_{i}})}_{0}}{{(\frac{\partial | L |}{\partial ω})}_{0}} = - \frac{〈 {\bar{L}}_{0} {(\frac{\partial L}{\partial k_{i}})}_{0} 〉}{〈 {\bar{L}}_{0} {(\frac{\partial L}{\partial ω})}_{0} 〉} = - \frac{e_{0}^{*} {(\frac{\partial L}{\partial k_{i}})}_{0} e_{0}}{e_{0}^{*} {(\frac{\partial L}{\partial ω})}_{0} e_{0}},$ (8.13)

where we applied $L_{0} \propto e_{0} \cdot e_{0}^{*}$ meaning that $L_{0}$ is proportional to the dyadic product of the polarization vectors $e_{0}$ and $e_{0}^{*}$ (see also formulae (2.18) and (2.19)).

The beam solutions in their Fourier decomposition contain components to wave vectors and frequencies around the average wave vectors and frequency $(k_{0}, ω_{0})$ and therefore the solution cannot possess solution which are exactly proportional to polarization vector $e_{0}$ . Therefore we make now the following proposition for solutions of the beam Equation (8.5)

$E_{0} (r, t) = e_{0} A_{0} (r, t) + [e_{0}, A^{'} (r, t)],$ (8.14)

where $e_{0} A_{0} (r, t)$ is the main part with polarization $e_{0}$ and $[e_{0}, A^{'} (r, t)]$ a small additional part with polarization perpendicular to $e_{0}$ . Both parts have to satisfy Equation (8.5) that means for the main part the following approximate scalar equation up to second-order derivatives of the slowly varying amplitude $A_{0} (r, t)$

$0 = {i (\frac{\partial}{\partial t} + v_{0} \nabla) + \frac{1}{2} \nabla W_{0} \nabla} A_{0} (r, t), (\nabla W_{0} \nabla \equiv \nabla_{i} W_{0, i j} \nabla_{j}) .$ (8.15)

The additional part $[e_{0}, A^{'} (r, t)]$ of the beam solution is not independent of the main part $e_{0} A_{0} (r, t)$ . Inserting both parts into Equation (8.5) we get approximately using $L_{0} e_{0} = 0$

$\begin{matrix} 0 = {L_{0} - i ({(\frac{\partial L}{\partial k})}_{0} \nabla - {(\frac{\partial L}{\partial ω})}_{0} \frac{\partial}{\partial t}) + \dots} (e_{0} A_{0} (r, t) + [e_{0}, {A^{'}}_{0} (r, t)]) \\ = {L_{0} [e_{0}, {A^{'}}_{0} (r, t)] - i ({(\frac{\partial L}{\partial k})}_{0} \nabla - {(\frac{\partial L}{\partial ω})}_{0} \frac{\partial}{\partial t}) e_{0} A_{0} (r, t) + \dots}, \end{matrix}$ (8.16)

that has to be resolved to $[e_{0}, {A^{'}}_{0} (r, t)]$ . As approximation we use only the two explicitly written sum terms in the second line. First we find using the dispersion equation $\frac{c^{2}}{ω_{0}^{2}} k_{0}^{2} = ε_{0}$

$\begin{matrix} L_{0} [e_{0}, {A^{'}}_{0} (r, t)] = (\frac{c^{2}}{ω_{0}^{2}} (k_{0} \cdot k_{0} - k_{0}^{2} I) + ε_{0} I) [e_{0}, {A^{'}}_{0} (r, t)] \\ = \frac{c^{2}}{ω_{0}^{2}} [k_{0}, e_{0}, {A^{'}}_{0} (r, t)] \cdot k_{0} = ε_{0} \frac{[k_{0}, e_{0}, {A^{'}}_{0} (r, t)] \cdot k_{0}}{k_{0}^{2}}, \end{matrix}$ (8.17)

that is proportional to the average wave vector $k_{0}$ . Furthermore follows for the operator part of the second sum term in (8.16) which acts onto $A_{0} (r, t)$ and here written with indices

$\begin{array}{l} {{(\frac{\partial L_{i j}}{\partial k_{k}})}_{0} \nabla_{k} - {(\frac{\partial L_{i j}}{\partial ω})}_{0} \frac{\partial}{\partial t}} e_{0, j} \\ = {\frac{c^{2}}{ω_{0}^{2}} (δ_{i k} k_{0, j} + k_{0, i} δ_{j k} - 2 k_{0, k} δ_{i j}) \nabla_{k} + 2 \frac{c^{3}}{ω_{0}^{3}} (k_{0, i} k_{0, j} - k_{0}^{2} δ_{i j}) \frac{\partial}{\partial t}} e_{0, j} \\ = \frac{c^{2}}{ω_{0}^{2}} {k_{0, i} e_{0} \nabla - 2 e_{0, i} (k_{0} \nabla + \frac{c}{ω_{0}} \frac{\partial}{\partial t})} . \end{array}$ (8.18)

This possesses two sum terms proportional to the vector $k_{0}$ and to the main polarization $e_{0}$ and shows a typical difficulty consisting in the correct neglect of terms deriving equation for additional components with no contradictions. The second sum term is proportional to the polarization $e_{0}$ of the main component and has to be neglected. If we do so we find from (8.17) and (8.18) the following formula for the additional component in direction of $k_{0}$

$[k_{0}, e_{0}, {A^{'}}_{0} (r, t)] = i (e_{0} \nabla) A_{0} (r, t) .$ (8.19)

In special case of vacuum we have $v_{0} = c \frac{k_{0}}{| k_{0} |}$ and $W_{0} = \frac{c}{| k_{0} |} (I - \frac{k_{0} \cdot k_{0}}{{| k_{0} |}^{2}})$ and Equation (8.15) for the main component becomes

$0 = {i (\frac{\partial}{\partial t} + c \frac{k_{0}}{| k_{0} |} \nabla) + \frac{c}{2 | k_{0} |} \nabla (I - \frac{k_{0} \cdot k_{0}}{{| k_{0} |}^{2}}) \nabla} e_{0} A_{0} (r, t) .$ (8.20)

In Section 9 we derive a more complicated case for media with the polariton permittivity.

Thus the approximate equations for beam solutions taking into account diffraction in first order consists of the Equation (8.15) for the main part of the slowly varying amplitude plus the Equation (8.19) for a “small” additional part in direction of $k_{0}$ which can be determined alone from the main part by differentiations. We wanted to show how the group velocity and the diffraction coefficients are involved in approximate beam equations but a detailed consideration of these equations and of the solution of (8.15) requires much place and is here not intended.

9. Permittivity to Polariton Dispersion in Isotropic Media

We consider in this Section the following special permittivity $ε (ω)$ and permeability $μ (ω)$ of an isotropic medium with two real parameters $ω_{l}$ and $ω_{t}$ (or $λ$ and $ω_{t}$ ) called polariton permittivity, e.g., [6] (§17, 18)

$\begin{matrix} ε (ω) = 1 - \frac{λ}{ω^{2} - ω_{t}^{2}} = 1 - \frac{λ}{2 ω_{t}} (\frac{1}{ω - ω_{t}} - \frac{1}{ω + ω_{t}}) = 1 - \frac{ω_{l}^{2} - ω_{t}^{2}}{ω^{2} - ω_{t}^{2}} \\ = \frac{ω^{2} - ω_{l}^{2}}{ω^{2} - ω_{t}^{2}}, (μ (ω) = 1), λ \equiv ω_{l}^{2} - ω_{t}^{2}, \end{matrix}$ (9.1)

where for $ω \approx (\geq) ω_{t}$ the second sum term in round brackets can be neglected. In Figure 1 this permittivity is illustrated for the two principal cases with different properties which we call the passive case $ω_{l} \geq ω_{t}$ and the active case $ω_{t} > ω_{l}$ (occupation inversion) and which are also characterized by (for $ω > 0$ )

$\begin{array}{l} λ > 0, ω_{l} > ω_{t} : \frac{\partial ε}{\partial ω} (ω) > 0, (passive case), \\ λ < 0, ω_{l} < ω_{t} : \frac{\partial ε}{\partial ω} (ω) < 0, (active case) . \end{array}$ (9.2)

The indices “ $l$ ” and “t” in (9.1) mean “longitudinal” and “transversal”. Polaritons (or real excitons) are a mixing of excitons in a medium and of photons in the vacuum (e.g., [1] [2] [17] ) and correspond to the possible real light excitations in a medium. In this simple model $ω_{t}$ is the frequency to a transition between two energy levels in the medium or a lattice oscillation and $λ = ω_{l}^{2} - ω_{t}^{2}$ is beside other parameters proportional to the difference of the occupation of the two involved levels. For models of the medium in thermal equilibrium with temperature $T \neq 0$ the permittivity $ε (ω)$ has to be generalized (see general form of the permittivity of an isotropic medium, e.g., in [4] chap XII, [3] ). With

Figure 1. Polariton permittivity $ε (ω)$ for $ω_{l} > ω_{t}$ (passive case) and for $ω_{l} < ω_{t}$ (active case). Apart from the jump from plus infinity to minus infinity (or vice versa) all derivatives of $ε (ω)$ with respect to frequency are positive $\frac{\partial ε}{\partial ω} (ω) \geq 0$ in the passive case and negative in the active case $\frac{\partial ε}{\partial ω} (ω) \leq 0$ (for $ω > 0$ right half-plane.

respect to the propagation of light beams in such a medium it is equivalent to a medium which possesses the right-hand form for the product $ε (ω) μ (ω)$ instead for $ε (ω)$ alone and can be included into the last case.

One may think that the distinction in passive and active case in (9.2) corresponds in certain way to the usual distinction in normal and anomalous dispersion but beside analogies there are also essential differences. Normal dispersion is usually discussed for the passive case alone and appears for real frequencies if one adds in the denominators for $ε (ω)$ in (9.1) an imaginary part to take into account losses in the medium and if we consider then the real part of arising permittivity and is present in (small) parts between the (main) regions of normal dispersion. In the model (9.1) these regions are reduced to the points $ω^{2} = ω_{t}^{2}$ and normal dispersion is present in the whole region with exclusion of these points. In contrast, in the active case we have in the whole region “anomalous” dispersion also with exclusion of the points $ω^{2} = ω_{t}^{2}$ only and imaginary parts in the denominator do not play a role. This picture can change in some way for thermal equilibrium but the distinction in (9.2) is meant without losses. The occupation inversion in active case corresponds in some sense to a negative absolute temperature (notion occasionally used in second half of last century) but a thermal equilibrium in this case is only possible for a finite number of energy levels.

For $ω_{l} \equiv ω_{p}$ and $ω_{t} = 0$ we have the permittivity of a cold isotropic plasma

$ω_{l} \equiv ω_{p}, ω_{t} = 0 : \Rightarrow ε (ω) = 1 - \frac{ω_{p}^{2}}{ω^{2}}, ω_{p} \equiv \sqrt{\frac{4 π n_{e} e^{2}}{m_{e}}},$ (9.3)

with $ω_{p}$ the plasma frequency given here for an electron plasma (indices “e”; $e, n_{e}, m_{e}$ mean electron charge, electron density, electron mass and electron charge). This is again idealized for temperature $T = 0$ .

Longitudinal waves are in the idealized form (9.1) of the permittivity only possible for the frequency $ω_{l}$

$ω = ω_{l},$ (9.4)

and are pure oscillations with no dispersion. Our main interest concern transversal waves which we now consider. The dispersion relation for transversal waves (7.4) specialized for the polariton case (9.1) becomes

$k^{2} = \frac{ω^{2}}{c^{2}} ε (ω) μ (ω) = \frac{ω^{2}}{c^{2}} \frac{ω^{2} - ω_{l}^{2}}{ω^{2} - ω_{t}^{2}},$ (9.5)

and depends only on the product $ε (ω) μ (ω)$ . For such waves the relation between the energy flow density $S$ and the energy density w in the lossless case for quasi-plane and quasi-monochromatic waves and in the transition to the limiting case of plane monochromatic waves with real wave vector $k$ and real frequency $ω$ (homogeneous waves)

$S = v w,$ (9.6)

remains in every case the same and depends only from the dispersion relation ( $v$ is group velocity; see next Section). However, the splitting of the energy density w in (9.6) into a part from the electric field and into a part from the magnetic field depends on $ε (ω)$ and $μ (ω)$ separately and therefore also the calculation of the corresponding energy flow density $S$ which can be made from the energy density w by (9.6). As illustrated in Figure 1 it is interesting to extend the permittivity (9.1) to $λ = ω_{l}^{2} - ω_{t}^{2} < 0$ which corresponds to a model medium with inverse occupation density of the two involved levels. The condition $ω_{l} > ω_{t}$ in the form of the permittivity (9.1) which belongs to the passive case is satisfied for taking into account only one transition with frequency $ω_{t}$ between two energy levels and for sufficiently low temperatures. It may be converted into $ω_{l} < ω_{t}$ for pumping to a higher energy level of a laser medium to get inversion of occupation but to keep their difference $ω_{t}^{2} - ω_{l}^{2}$ constant can be only a very rough approximation for the laser action near the threshold. Such a permittivity falls under the active case (not to be confused with notion (natural) optical activity!). By far, not all consequences for the active case are clear and are well understood.

Let us begin with a general consideration to dispersion equations in a homogeneous isotropic and infinitely extended medium. The general case is that both wave vector $k$ and frequency $ω$ are complex quantities. For the vacuum with the dispersion equation $c^{2} k^{2} = ω^{2}$ with the splitting of wave vector and frequency in real and imaginary parts

$k = k^{'} + i k^{″}, ω = ω^{'} + i ω^{″},$ (9.7)

this leads to a complex equation with the following separation into a real and imaginary part

$\begin{array}{l} c^{2} {(k^{'} + i k^{″})}^{2} = {(ω^{'} + i ω^{″})}^{2}, \Rightarrow \\ c^{2} ({k^{'}}^{2} - {k^{″}}^{2}) = {ω^{'}}^{2} - {ω^{″}}^{2}, c^{2} (k^{'} k^{″}) = ω^{'} ω^{″} . \end{array}$ (9.8)

These are $2$ scalar equations for 5 real variables (for example,

$| k^{'} |, | k^{″} |, \cos (φ) \equiv \frac{k^{'} k^{″}}{| k^{'} | | k^{″} |}, ω^{'}, ω^{″}$ ) that restricts the number of free variables to 3

real variables. It is impossible to represent this in a single graphical representation and one has to make compromises. For example, for real frequency ( $ω^{'} = ω, ω^{″} = 0$ ) we find from (9.8) the orthogonality $k^{'} k^{″} = 0$ of real to imaginary part of the wave vector $k$ . It is known that such waves are generated in the vacuum under total reflection within an isotropic medium with $k^{'}$ parallel to the boundary plane and with $k^{″}$ in direction of the normal vector to the boundary plane corresponding to exponential decrease. Such waves are called inhomogeneous waves (not to confuse with inhomogeneous media!). All this is well known and understood. Which of the components $(k^{'}, k^{″}, ω^{'}, ω^{″})$ are involved into a process can be only determined if one knows the boundary together with the boundary conditions. If we apply this to the polariton permittivity (9.1) with the complex dispersion Equation (9.5) we find the following equation

$c^{2} ({k^{'}}^{2} - {k^{″}}^{2} + i 2 k^{'} k^{″}) = ({ω^{'}}^{2} - {ω^{″}}^{2} + i 2 ω^{'} ω^{″}) \frac{{ω^{'}}^{2} - {ω^{″}}^{2} - ω_{l}^{2} + i 2 ω^{'} ω^{″}}{{ω^{'}}^{2} - {ω^{″}}^{2} - ω_{t}^{2} + i 2 ω^{'} ω^{″}} .$ (9.9)

Separated into real and imaginary part this leads to the two equations

$\begin{array}{l} 0 = ({ω^{'}}^{2} - {ω^{″}}^{2} - ω_{t}^{2}) ({ω^{'}}^{2} - {ω^{″}}^{2} - c^{2} ({k^{'}}^{2} - {k^{″}}^{2})) \\ - 4 ω^{'} ω^{″} (ω^{'} ω^{″} - c^{2} (k^{'} k^{″})) - (ω_{l}^{2} - ω_{t}^{2}) ({ω^{'}}^{2} - {ω^{'}}^{'}^{2}), \\ 0 = ({ω^{'}}^{2} - {ω^{″}}^{2} - ω_{t}^{2}) (ω^{'} ω^{″} - c^{2} (k^{'} k^{″})) \\ + ω^{'} ω^{″} ({ω^{'}}^{2} - {ω^{″}}^{2} - c^{2} ({k^{'}}^{2} - {k^{″}}^{2})) - (ω_{l}^{2} - ω_{t}^{2}) ω^{'} ω^{″}, \end{array}$ (9.10)

which are of forth degree with respect to the real components of wave vector and frequency. Both equations have to be satisfied at the same time. This means that we have different possibilities of two-dimensional graphical representations for inhomogeneous waves and this is relatively complicated in such generality.

If we choose real wave vectors as free variable then the dispersion Equation (9.5) leads to a bi-quadratic equation for the frequency $ω = ω (k)$ in dependence on the modulus $| k |$ of the wave vector as follows ( $k^{2} \equiv {| k |}^{2}$ )

$0 = ω^{4} - (ω_{l}^{2} + c^{2} k^{2}) ω^{2} + ω_{t}^{2} c^{2} k^{2},$ (9.11)

which resolved provides two branches of squared solutions

$\begin{matrix} ω_{\pm}^{2} (k) = \frac{1}{2} {ω_{l}^{2} + c^{2} k^{2} \pm \sqrt{{(ω_{l}^{2} + c^{2} k^{2})}^{2} - 4 ω_{t}^{2} c^{2} k^{2}}} \\ = \frac{1}{2} {ω_{l}^{2} + c^{2} k^{2} \pm \sqrt{ω_{l}^{4} + 2 (ω_{l}^{2} - 2 ω_{t}^{2}) c^{2} k^{2} + {(c^{2} k^{2})}^{2}}}, \end{matrix}$ (9.12)

or of frequency solutions⁷

$\begin{matrix} ω_{\pm}^{(\pm)} (k) = (\pm) \frac{1}{2} {\sqrt{ω_{l}^{2} + 2 ω_{t} c | k | + c^{2} {| k |}^{2}} \pm \sqrt{ω_{l}^{2} - 2 ω_{t} c | k | + c^{2} {| k |}^{2}}} \\ = (\pm) \frac{1}{2} {\sqrt{{(ω_{t} + c | k |)}^{2} + ω_{l}^{2} - ω_{t}^{2}} \pm \sqrt{{(ω_{t} - c | k |)}^{2} + ω_{l}^{2} - ω_{t}^{2}}} . \end{matrix}$ (9.13)

This means that we have to given $| k |$ two different solutions for $ω_{(\pm)}^{2}$ signified by the upper indices “(±)” where one has to pay attention mainly to the two different lower signs “±” to the two sum terms with square roots. In the hatched region the solution (9.13) for the frequency in dependence on the modulus of the wave vector becomes complex and can be better represented in the form

$ω_{\pm}^{(\pm)} (k) = (\pm) \frac{1}{2} {\sqrt{{(ω_{t} + c | k |)}^{2} - (ω_{t}^{2} - ω_{l}^{2})} \pm i \sqrt{ω_{t}^{2} - ω_{l}^{2} - {(ω_{t} - c | k |)}^{2}}} .$ (9.14)

Figure 2 represents the different branches of solutions $ω = ω (k)$ in dependence on the modulus $| k |$ of the wave vector for the both principal cases $ω_{l} > ω_{t}$ and $ω_{l} < ω_{t}$ . The left-hand picture is well known mostly in form of the right upper quadrant (see, e.g., [17], chap. III, Figure 8 or [2], chap. 11, Figure 11.4).

Figure 2. Frequencies in dependence on real wave vector for $ε (ω) = \frac{ω^{2} - ω_{l}^{2}}{ω^{2} - ω_{t}^{2}}$ , $μ (ω) = 1$ for $ω_{l} > ω_{t}$ and $ω_{l} < ω_{t}$ . The contours of the hatched parts in the right-hand picture are imaginary and describe amplification. For the figures we have chosen the values $ω_{l} = 1.05, ω_{t} = 0.95$ in passive case (to the left) and $ω_{l} = 0.95, ω_{t} = 1.05$ in active case (to the right).

Asymptotically, for large $| k |$ we have in both cases a branch $ω = \pm c | k |$ the same as for light beams in vacuum and a branch with constant $ω = \pm ω_{t}$ where $ω_{t}$ is the resonance frequency to the energy difference of the considered two levels. The case $ω_{l} - ω_{t} > 0$ corresponds to lower occupation of higher level in comparison the lower level and the case $ω_{t} - ω_{l} > 0$ to inverse occupation. The last case can be achieved by pumping this level and as was to expect it possesses properties of amplification in a certain region of wave vectors. This is exactly separated by the two sum terms with square roots in the solutions in the form (9.13) or (9.14). We have hatched the imaginary parts inside their contours. Their real parts are in the figure to find over, respectively, under the hatched contours and look like straight lines but are such only in the limiting case $ω_{t} - ω_{l} \to 0$ as follows from the first sum term in (9.14).

In Figure 3 the case $ω_{t} > ω_{l}$ is presented enlarged for the right upper quadrant of Figure 2 in two numerical cases which show the dependence on the parameters $ω_{t}$ and $ω_{l}$ and, in particular, on the difference $\sqrt{ω_{t}^{2} - ω_{l}^{2}}$ . Microscopic models for the permittivity (9.1) show that the difference $ω_{t}^{2} - ω_{l}^{2}$ is among other parameters as factors of the active transition levels proportional to the density occupation inversion $σ \propto ω_{t}^{2} - ω_{l}^{2} \equiv - λ$ of these levels. This means that in our idealized model the product of height h with width w of the amplifier contour (see Figure 2) is proportional to the square root of the density of inverse occupation of the considered active levels whereas h and w themselves are proportional to its square root. In laser theory to our knowledge the height h is

Figure 3. Function $ω = ω (k)$ for $ε (ω) = \frac{ω^{2} - ω_{l}^{2}}{ω^{2} - ω_{t}^{2}}$ , $μ (ω) = 1$ with $ω_{l} < ω_{t}$ (amplification) in two numerical cases. The left-hand figure is practically the right upper quadrant of Figure 2 but amplified for better visibility and for making a comparison with a similar figure with other parameters. For the figures we have chosen the values $ω_{t} = 1.05, ω_{l} = 0.95$ and $ω_{t} = 1.25, ω_{l} = 0.75$ .

usually assumed or calculated as direct proportional to the density of inverse occupation whereas for its width are made complicated considerations about natural line width and its enlargement.

For application to laser theory it is necessary to add feedback by a resonator. The unspecific losses of the involved resonator modes in the concerning region cut off an upper part of the amplification contour and thus lower it whereas their frequencies are determined mainly from the real part (first sum term in (9.14))

$\begin{array}{l} ω_{Re} (| k |) = \frac{1}{2} \sqrt{{(ω_{t} + c | k |)}^{2} - (ω_{t}^{2} - ω_{l}^{2})} \\ = \frac{\sqrt{3 ω_{t}^{2} + ω_{l}^{2}}}{2} + \frac{ω_{t}}{\sqrt{3 ω_{t}^{2} + ω_{l}^{2}}} (c | k | - ω_{t}) (1 - \frac{ω_{t}^{2} - ω_{l}^{2}}{4 (3 ω_{t}^{2} + ω_{l}^{2})} (c | k | - ω_{t}) + \dots), \\ (ω_{t} - \sqrt{ω_{t}^{2} - ω_{l}^{2}} \leq c | k | \leq ω_{t} + \sqrt{ω_{t}^{2} - ω_{l}^{2}}) . \end{array}$ (9.15)

For a long resonator in comparison to the transverse dimensions only the longitudinal modes without reflection at the side wands play a role. For such a resonator with ideal mirrors at the end the field at the mirrors has to be vanishing and the possible resonator modes have to possess a multiple m of the half the wavelengths $λ$ , ( $| k | \equiv \frac{2 π}{λ}$ ), which fit into the resonator length L. Thus for the possible wave vectors and frequencies we find in this idealized case ( $m_{0} \equiv m_{\min}$ )

$\begin{array}{l} | k | = m \frac{π}{L}, ω_{Re} (| k |) = \frac{1}{2} \sqrt{{(ω_{t} + m π \frac{c}{L})}^{2} - (ω_{t}^{2} - ω_{l}^{2})}, \\ ω_{t} - \sqrt{ω_{t}^{2} - ω_{l}^{2}} \leq m π \frac{c}{L} \leq ω_{t} + \sqrt{ω_{t}^{2} - ω_{l}^{2}}, (m = m_{0}, m_{0} + 1, \dots, m_{\max}) . \end{array}$ (9.16)

The resonator losses as said lower the amplification contour and therefore narrow the possible values for m and change also a little the relation (9.15) for the possible wave vectors and frequencies. With losses both the wave vectors and frequencies may become slightly complex. The imaginary part of the frequency determines also a line width by only classical considerations. For $\frac{ω_{t}^{2} - ω_{l}^{2}}{ω_{t}^{2}} \leq 1$ one finds in approximation from (9.15)

$\frac{ω_{t}^{2} - ω_{l}^{2}}{ω_{t}^{2}} \leq 1 : ω_{Re} (k) \approx ω_{t} + \frac{1}{2} (c | k | - ω_{t}),$ (9.17)

and the density of possible frequencies in the corresponding frequency interval is doubled in comparison to the density within a resonator with vacuum. If we assume that there is a process (pumping) which keeps constant the density of the inverse occupation then it may be considered as a very simple and idealized classical model of laser action, at least, near the threshold. However, this model cannot provide information at which level of occupation inversion the equilibrium between pumping and radiation is reached. In this sense it is similar to thermal equilibrium where without additional information it cannot be said how it was reached. Clearly, quantum-mechanical generalization makes further modifications also to the line widths.⁸

Generalizations of the permittivity (9.1) in different directions are possible and interesting, for example, by an additional constant sum term on the right-hand side taking into account summarily the contribution of all other resonances of two levels or taking into account losses by imaginary terms in the denominator. We consider now shortly the case of taking into account two resonance frequencies $ω_{t,1}$ and $ω_{t,2}$ leading to a permittivity of the form [6] (§18, Equation (6))

$ε (ω) = 1 - \frac{λ_{1}}{ω^{2} - ω_{t,1}^{2}} - \frac{λ_{2}}{ω^{2} - ω_{t,2}^{2}},$ (9.18)

with two further parameters $λ_{2}$ and $ω_{t,2}$ which determine the strength of a second resonance. This can be also written in the following form

$ε (ω) = \frac{(ω^{2} - ω_{l, -}^{2}) (ω^{2} - ω_{l, +}^{2})}{(ω^{2} - ω_{t,1}^{2}) (ω^{2} - ω_{t,2}^{2})},$ (9.19)

with the definitions

$ω_{l, \pm}^{2} \equiv \frac{1}{2} {ω_{t, 1}^{2} + λ_{1} + ω_{t, 2}^{2} + λ_{2} \pm \sqrt{{(ω_{t, 1}^{2} + λ_{1} - ω_{t, 2}^{2} - λ_{2})}^{2} + 4 λ_{1} λ_{2}}},$ (9.20)

where $ω_{l, \mp}^{2}$ is real-valued or may become even complex-to complex-conjugate-valued and where $ω_{l, -}$ and $ω_{l, +}$ cannot be properly assigned to $ω_{t,1}$ and $ω_{t,2}$ . The dispersion Equation (7.4) resolved to an equation for $ω^{2}$ in dependence on the squared wave vector ${| k |}^{2}$ becomes a bi-cubic equation in $| k |$ which is already difficult to solve for $ω$ and to discuss. In dependence on the 4 parameters $ω_{t,1}, ω_{t,2}$ and $λ_{1}$ and $λ_{2}$ one would have to distinguish many principal cases.

10. Group Velocity to the Polariton Permittivity in Passive Case and Group Velocities Faster Than Light Velocity in Active Case

We now consider the group velocity of transversal waves for the polariton permittivity (9.1). By differentiation of the dispersion Equation (9.5) with respect to $k$ we calculate for the group velocity in direction $\frac{k}{| k |}$ of the wave vector and in dependence of its modulus on the frequency $ω$ only

$\begin{matrix} v \equiv \frac{\partial ω}{\partial k} = \frac{2 c^{2} k}{\frac{\partial}{\partial ω} (ω^{2} ε (ω))} = \frac{c^{2}}{ω} \frac{{(ω^{2} - ω_{t}^{2})}^{2}}{ω^{4} - 2 ω_{t}^{2} ω^{2} + ω_{l}^{2} ω_{t}^{2}} k \\ = c \underset{\frac{c}{ω} | k | = \sqrt{ε (ω)}}{\underset{︸}{\sqrt{\frac{ω^{2} - ω_{l}^{2}}{ω^{2} - ω_{t}^{2}}}}} \frac{{(ω^{2} - ω_{t}^{2})}^{2}}{{(ω^{2} - ω_{t}^{2})}^{2} + (ω_{l}^{2} - ω_{t}^{2}) ω_{t}^{2}} \frac{k}{| k |} \equiv φ (ω) \frac{k}{| k |} . \end{matrix}$ (10.1)

It possesses in this model in every case the direction of the wave vector also if it is complex-valued with different directions of real and imaginary part but the frequency-dependent coefficients $φ (ω)$ can also become complex-valued (for real values of $ω$ ) due to presence of the square root. Without taking into account the dispersion of $ε (ω)$ at the considered frequency $ω$ (setting

$\frac{\partial ε (ω)}{\partial ω} \to 0$ ) it is

$v^{'} = \frac{2 c^{2} k}{ε (ω) \frac{\partial}{\partial ω} (ω^{2})} = c \frac{c k}{ω ε (ω)} = c \frac{c | k |}{ω ε (ω)} \frac{k}{| k |} = \frac{c}{\sqrt{ε (ω)}} \frac{k}{| k |} = c \sqrt{\frac{ω^{2} - ω_{t}^{2}}{ω^{2} - ω_{l}^{2}}} \frac{k}{| k |},$ (10.2)

but the dispersion cannot be switched off. For such points where $ε (ω)$ possesses a minimum (or maximum) the derivative vanishes (i.e., $\frac{\partial ε (ω)}{\partial ω} (ω) = 0$ ) and the group velocity with and without taking into account the dispersion are equal. The dispersion factor $α_{disp} (ω) \equiv α (ω)$ is

$\begin{array}{l} α (ω) = \frac{ε (ω) \frac{\partial}{\partial ω} (ω^{2})}{\frac{\partial}{\partial ω} (ω^{2} ε (ω))} = \frac{(ω^{2} - ω_{l}^{2}) (ω^{2} - ω_{t}^{2})}{{(ω^{2} - ω_{t}^{2})}^{2} + (ω_{l}^{2} - ω_{t}^{2}) ω_{t}^{2}} \equiv \frac{(ω^{2} - ω_{l}^{2}) (ω^{2} - ω_{t}^{2})}{(ω^{2} - ω_{-}^{2}) (ω^{2} - ω_{+}^{2})}, \\ ω_{\mp}^{2} \equiv ω_{t} (ω_{t} \mp \sqrt{ω_{t}^{2} - ω_{l}^{2}}), ω_{\mp} \equiv \frac{\sqrt{ω_{t} (ω_{t} + ω_{l})} \mp \sqrt{ω_{t} (ω_{t} - ω_{l})}}{\sqrt{2}} . \end{array}$ (10.3)

Due to an extremum of $ε (ω)$ for $ω = 0$ its derivative with respect to frequency vanishes there and the dispersion factor becomes $α (ω = 0) = 1$ .

One may distinguish 2 different cases and a limiting case between them represented by

$| v | = φ (ω) = c \frac{\sqrt{1 - \frac{ω_{l}^{2} - ω_{t}^{2}}{ω^{2} - ω_{t}^{2}}}}{1 + \frac{ω_{l}^{2} - ω_{t}^{2}}{ω^{2} - ω_{t}^{2}} \frac{ω_{t}^{2}}{ω^{2} - ω_{t}^{2}}} \to {\begin{array}{l} < c, & (ω_{l} > ω_{t}) \\ = c, & (ω_{l} = ω_{t}) \\ > c, & (ω_{l} < ω_{t}) \end{array},$ (10.4)

which in relation to the light velocity obviously are determined by different properties. In last case of $ω_{l} < ω_{t}$ the group velocity can even be opposite to the direction of the wave vector in certain regions of the frequency.

If one wants to have the full dependence $v \equiv v (k)$ of the group velocity on the wave vector one has to distinguish the 4 branches in (9.13) or in (9.14) and finds by differentiation with respect to $k$

$\begin{matrix} v_{\pm}^{(\pm)} (k) = (\pm) \frac{c}{2} {\frac{ω_{t} + c | k |}{\sqrt{{(ω_{t} + c | k |)}^{2} + ω_{l}^{2} - ω_{t}^{2}}} \mp \frac{ω_{t} - c | k |}{\sqrt{{(ω_{t} - c | k |)}^{2} + ω_{l}^{2} - ω_{t}^{2}}}} \frac{k}{| k |} \\ = (\pm) \frac{c}{2} {\frac{ω_{t} + c | k |}{\sqrt{{(ω_{t} + c | k |)}^{2} - (ω_{t}^{2} - ω_{l}^{2})}} \pm i \frac{ω_{t} - c | k |}{\sqrt{(ω_{t}^{2} - ω_{l}^{2}) - {(ω_{t} - c | k |)}^{2}}}} \frac{k}{| k |} \\ \equiv c β (ω) \frac{k}{| k |} . \end{matrix}$ (10.5)

which we have written in two favorable representations for the passive case $ω_{l} > ω_{t}$ and the active case $ω_{t} > ω_{l}$ .

The calculation of the second-order coefficients $W_{i j}$ in the Equation (8.15) for beam propagation in isotropic media can be calculated, for example, from the principal structure of the group velocity

$v_{i} \equiv \frac{\partial ω}{\partial k_{i}} = φ (ω) \frac{k_{i}}{| k |}, φ (ω) \equiv c \frac{{(ω^{2} - ω_{l}^{2})}^{\frac{1}{2}} {(ω^{2} - ω_{t}^{2})}^{\frac{3}{2}}}{{(ω^{2} - ω_{t}^{2})}^{2} + (ω_{l}^{2} - ω_{t}^{2}) ω_{t}^{2}},$ (10.6)

by further differentiation with respect to variables $k_{j}$ . Using

$\frac{\partial | k |}{\partial k_{i}} = \frac{k_{i}}{| k |}, \Rightarrow \frac{\partial^{2} | k |}{\partial k_{i} \partial k_{j}} = \frac{\partial}{\partial k_{j}} (\frac{k_{i}}{| k |}) = \frac{1}{| k |} (δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}}),$ (10.7)

and due to $ω \equiv ω (k)$ and (10.7)

$\frac{\partial φ}{\partial k_{j}} (ω) = \frac{\partial φ}{\partial ω} (ω) \frac{\partial ω}{\partial k_{j}} = \frac{\partial φ}{\partial ω} (ω) φ (ω) \frac{k_{j}}{| k |} = \frac{1}{2} \frac{\partial}{\partial ω} ({(φ (ω))}^{2}) \frac{k_{j}}{| k |},$ (10.8)

we find the general structure

$W_{i j} \equiv \frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}} = φ (ω) \frac{1}{| k |} (δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}}) + \frac{1}{2} \frac{\partial}{\partial ω} ({(φ (ω))}^{2}) \frac{k_{i} k_{j}}{{| k |}^{2}}$ (10.9)

and, finally, with the special function $φ (ω)$ in (10.6)

$\begin{array}{l} W_{i j} \equiv \frac{\partial^{2} ω}{\partial k_{i} \partial k_{j}} = \frac{c^{2}}{ω} \frac{{(ω^{2} - ω_{t}^{2})}^{2}}{{(ω^{2} - ω_{t}^{2})}^{2} + (ω_{l}^{2} - ω_{t}^{2}) ω_{t}^{2}} \\ \cdot {δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}} + (ω_{l}^{2} - ω_{t}^{2}) \frac{ω^{2} (ω^{4} + 2 ω_{t}^{2} ω^{2} - 3 ω_{l}^{2} ω_{t}^{2})}{{({(ω^{2} - ω_{t}^{2})}^{2} + (ω_{l}^{2} - ω_{t}^{2}) ω_{t}^{2})}^{2}} \frac{k_{i} k_{j}}{{| k |}^{2}}} . \end{array}$ (10.10)

This result was checked by a slightly modified calculation which I do not present here. It possesses two sum terms proportional to the tensors $δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}}$ and $\frac{k_{i} k_{j}}{{| k |}^{2}}$ describing transversal and longitudinal diffraction (or diffusion) of the light beam during propagation.

In the special case of an isotropic cold plasma with the permittivity $ε (ω) = 1 - \frac{ω_{p}^{2}}{ω^{2}}$ setting $ω_{l} = ω_{p}, ω_{t} = 0$ the formulae (10.1) for the group velocity and (10.10) for the diffraction coefficients simplify to

$v_{i} = c \frac{c | k |}{ω} \frac{k_{i}}{| k |} = c \sqrt{1 - \frac{ω_{p}^{2}}{ω^{2}}} \frac{k_{i}}{| k |}, \Rightarrow k v = ω (1 - \frac{ω_{p}^{2}}{ω^{2}}),$ (10.11)

that due to taking into account the frequency dispersion does not agree with (6.9). For the diffraction coefficients one finds

$W_{i j} = \frac{c^{2}}{ω} {δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}} + \frac{ω_{p}^{2}}{ω^{2}} \frac{k_{i} k_{j}}{{| k |}^{2}}},$ (10.12)

and it contains also a term proportional to $\frac{k_{i} k_{j}}{{| k |}^{2}}$ leading to a diffusion in longitudinal direction of the beam.

Without taking into account the dispersion we would obtain from (10.2)

${W^{'}}_{i j} = \frac{c^{2}}{\sqrt{ε (ω)}} (δ_{i j} - \frac{k_{i} k_{j}}{{| k |}^{2}}),$ (10.13)

that is without a “longitudinal” contribution proportional to $\frac{k_{i} k_{j}}{{| k |}^{2}}$ .

In Figure 4 we represent the permittivity $ε (ω)$ (blue curves) together with the group velocity in relation to the light velocity without taking into account dispersion of the permittivity $α (ω) (\equiv α_{disp} (ω))$ (yellow curves) and with taking it into account as $β (ω) \equiv \frac{| v |}{c}$ (red curves) for the passive ( $ω_{l} > ω_{t}$ , to the left) and the active case ( $ω_{t} > ω_{l}$ to the right). In the passive case we see that the group velocity remains smaller than the light velocity in every case and that it is real-valued in regions where $ε (ω)$ is positive and that taking into account the dispersion the deviations in comparison to neglect of dispersion are important, in particular, in the neighborhood of the resonance frequency $ω_{t}$ and in the neighborhood of the “(longitudinal)” frequency $ω_{l}$ . In addition, the group velocity possesses in every case the same direction as the wave vector. In the region $ω_{t} \leq ω \leq ω_{l}$ no relation of group velocity to light velocity (red) is drawn because the group velocity is there imaginary. All is so as expected. However, the same picture for the active case (to the right) contains an unexpected surprise.

In the active case $ω_{l} \leq ω_{t}$ (right-hand picture in Figure 4) the group velocity is here in certain regions of frequency larger than the light velocity or it is in opposite direction to the wave vector. The regions of $ω$ where it is larger than the light velocity but in direction of the wave vector $k$ are

$0 < ω < ω_{-}, ω_{+} < ω < \infty .$ (10.14)

Figure 4. Polariton permittivity $ε (ω) = \frac{ω^{2} - ω_{l}^{2}}{ω^{2} - ω_{t}^{2}}, (μ (ω) = 1)$ , $α (ω)$ and $β (ω) = \frac{| v |}{c}$ for $ω_{l} > ω_{t}$ and for $ω_{l} < ω_{t}$ . In regions $| ω_{t} | < | ω | < | ω_{l} |$ for $ω_{l} > ω_{t}$ and $| ω_{l} | < | ω | < | ω_{t} |$ for $ω_{l} < ω_{t}$ the relation of the group velocity to the light velocity (red curves with and purple without dispersion) becomes imaginary that is not drawn. In passive case $ω_{l} > ω_{t}$ wave vector and group velocity possess in every case the same direction and the last is smaller than the light velocity. In active case $ω_{l} < ω_{t}$ the group velocity is greater than the light velocity in the regions $0 < ω < ω_{-}$ and $ω_{+} < ω$ but is in the same direction as the wave vector. In the region $ω_{t} < ω < ω_{+}$ it is even in the opposite direction to the corresponding light velocity but only with taking into account the dispersion. The difference between $β (ω)$ (red) and $β^{'} (ω)$ (purple is that last is calculated under neglect of the frequency dispersion of the permittivity. We have chosen for the pictures $ω_{l} = 1.25, ω_{t} = 0.75$ in passive case and $ω_{l} = 0.75, ω_{t} = 1.25$ in active case, the same as in Figure 1.

We will not come here with a quick physical explanation of this phenomenon although we considered the polariton permittivity with frequency $ω_{l} < ω < ω_{t}$ already much earlier mainly with respect to its description of amplification in a certain region of wave vectors and frequencies. This phenomenon of group velocity greater than light velocity may likely come from a correlation within all parts of the medium by preparation of the occupation inversion of a level which is made already before it gets this property. The diffraction coefficients $W_{i j}$ according to (10.10) possess the same zeros ( $ω_{\pm}$ in (10.3) and in Figure 4)) in the denominators as the group velocity $v_{i}$ in (10.1) and can become very large. It is possible that the expansion (8.15) in the equation for the slowly varying amplitude (8.15) does not converge and that this equation is not applicable but a more general treatment should not change this basically. A practical use is likely very difficult to make and far from now. The most chances has its use for guided waves in long resonators and, in principle, such use due to the amplification properties of these similar media is already realized by lasers but not due to group velocity larger than light velocity as subsidiary effect. On the other side we cannot fully exclude that this phenomenon is already known and somewhere discussed in literature. Hypothetical particles in free space which move with a velocity greater than that of light are called tachyons and were occasionally considered from the sixtieths on, in detail, e.g., by Terletski [28] (Chapters V. and VI.) and by others but in our case they may be only quasi-particles within an active medium with occupation inversion if we associate them with the group velocity greater than that of light. It is known that the existence of tachyons with imaginary mass is not excluded by the Relativity theory but experimentally such are not found. However, in our case they cannot exist as such which usually form beams due to very large diffraction and in this sense they are not the same as were discussed as tachyons.

There is a second fully unexpected phenomenon. In the regions $ω_{-} < ω < ω_{l}$ and $ω_{t} < ω_{+}$ the group velocity $v$ is in opposite direction to the direction of the wave vector $k$ but only if we take into account the dispersion of the permittivity that let us believe at first in an error of signs but all was calculated with the same reliable formulae as in the passive case and the formula (10.3) for the dispersion factor does not involve square roots where the sign is unclear. It seems that this opposite group velocity to refraction vector violets the causality but we cannot exclude an explanation by the established correlation between different parts of the medium. So the described phenomenon requires further attention.

In search for references to the unexpected phenomena I came via the interesting book of Vaas [29] (part II, Section 7, e.g., p. 145, 146) to the reference of authors Nimtz and Haibel [30]. In [30] it is claimed that G. Nimtz (together with H. Aichmann, p. 111) transmitted in 1994 a symphony using the tunneling effect through a sub-dimensioned wave guide (full length about 12 cm) with approximately the five-fold velocity of the light velocity in vacuum⁹. In the transmission of information (e.g. music) the frequency modulation of a signal is used and has to be detected from it. Despite the large spatial distortion of a beam via the propagation the frequency is more stable and not very distorted over longer length of a beam that may explain some results. All this is a challenging theme and one has to wait for further clarification.

11. Remark to Reflection and Refraction of Beams at Isotropic and Bi-Isotropic Media

The following considerations concern only passive cases which are not very problematic with respect to basic discussions. In reflection and refraction problems of beams at a boundary between isotropic and (or) bi-isotropic media maximum 4 different beams with the same average frequency $ω$ can be related to each other since they must possess the same tangential component of the

average wave vectors $k_{ν}^{i, r}$ or refraction vector $n_{ν}^{i, r} \equiv \frac{c}{ω} k_{ν}^{i, r}$ (lower indices “1” and “2” stand for the both media and upper indices “i” for incident and “r” for reflected or refracted wave. The group velocities of the corresponding beams are calculated in (7.6). This is represented in Figure 5 on the left-hand picture and the corresponding beam propagation on the right-hand picture. If $N$ is a normal unit vector to the boundary at the considered point of beam reflection and refraction then the tangential component of all refraction vectors is

$\bar{n} \equiv [N, [n, N]] = n - n N \cdot N, (N^{2} = 1),$ (11.1)

where for $n$ an arbitrary of the involved refraction vectors can be inserted. Nothing changes in this picture if the refraction vectors in one or both media satisfy a dispersion equation $n^{2} = ε (ω) μ (ω)$ in comparison to $μ (ω) = 1$ . Also in the active case of isotropic media the group velocities $v$ are in every case parallel to the corresponding refraction vectors $n$ although we found that in this case the possibility exists that they are in opposite direction to the refraction vectors and may possess super-luminal velocities that is not yet fully understood and affirmed. Mostly one has only an incident beam from one side of the two media and only three waves (incident, reflected and refracted) are coupled at the boundary. All this is necessary to take into account if one discusses the work of Pendry [22] (see Appendix D).

Figure 5. Refraction vectors $n$ with equal tangential components and group velocities $v$ at a boundary. The two media possess the lower indices 1 and 2 where medium 2 may possess an electric permittivity and a magnetic permeability. The upper indices “i” and “r” mean “incident” and “reflected or refracted” waves seen from the different sides of the boundary. It does not play a role whether a positive product $ε_{2} μ_{2}$ is obtained from both positive or both negative values of $ε_{2}$ and $μ_{2}$ . All refraction vectors $n$ possess the same tangential component $\bar{n}$ to the boundary $N r = 0$ and only their normal components are different in general case. For isotropic media the directions of the refraction vectors $n$ and the corresponding group velocities $v$ are the same. With respect to polarization of the 4 waves one may distinguish the two cases of polarization perpendicular and within the incidence plane spanned by the normal unit vector $N$ to the boundary plane and an arbitrary of the refraction vectors $n$ .

The normal components $(n N) \cdot N$ of all involved refraction vectors $n$ can be obtained from the dispersion equations $n_{1}^{2} = ε_{1} (ω)$ and $n_{2}^{2} = ε_{2} (ω) μ_{2} (ω)$ , respectively

$n_{1}^{i, r} N = \pm \sqrt{ε_{1} (ω) - {\bar{n}}^{2}}, n_{2}^{i, r} N = \pm \sqrt{ε_{2} (ω) μ_{2} (ω) - {\bar{n}}^{2}} .$ (11.2)

Furthermore, in Figure 5 two case of polarizations are possible, with the electric field polarized in the incidence plane spanned by vectors $N$ and $\bar{n}$ and perpendicular to it. The discussion of the amplitude relations for the involved wave is not necessary here for the intended purpose.

12. Conclusion

We compared two concepts of representing the constitutive equations in classical macroscopic optics of homogeneous anisotropic media, first the more general concept of spatial dispersion and then the more special concept of bi-anisotropic media with two constitutive equations for the electric and the magnetic induction and made this in coordinate-invariant way. Then this was specialized to uniaxial and, finally, to isotropic media where a possible equation for quasi-plane and quasi-monochromatic beam propagation in approximation with the first two terms of an expansion of the slowly varying beam amplitudes with respect to derivatives in space and time was derived taking into account diffraction. This was then applied for the isotropic case to the polariton permittivity (9.1) with a detailed discussion of the passive and the active case. The active case is obtained from the passive case by changing a sign in the susceptibility and describes amplification in certain regions of the frequency that somehow goes connected with feedback in resonators in direction of (a prestep of) laser action in the stationary regime. Losses in form of imaginary parts in the permittivity, we did not include for more simplicity and calculability of the formulae.

When we calculated the group velocity for beams in the active case we found as a surprise for some regions of frequency the possibility of velocities faster than the light velocity in vacuum and as a yet greater surprise the possibility of an opposite direction to the direction of the wave vectors (but different signs with and without taking into account dispersion). A certain physical explanation is possibly by the preparation of occupation inversion in establishing the active case and therefore a correlation between all parts of the medium which is not contained in the equations. We found after this that it is not fully unknown from literature. We have to think furthermore about these phenomena and have to find access to more literature about this.

In the process of working with the topics into our viewpoint came also the notion of “negative refraction” but we could not find some positive aspects and the possibility of its realization in isotropic media and make a few remarks to this in the text and in Appendix D.

Primarily, I intended to include also such peculiar cases as inhomogeneous waves (for example, total reflection) and, in particular, the cases of optic axes with calculation of the two-dimensional projection operators for the electric field and the conical approximation of the dispersion surface in the neighborhood of optic axes in coordinate-invariant treatment. However, for extent and necessary time hoping to realize it we move this to a possible later time.

Appendix A. Some Important Relations for Three-Dimensional Operators

The most important relation for general three-dimensional operators $A$ is the Cayley-Hamilton identity

$A^{3} - 〈 A 〉 A^{2} + [A] A - | A | I = 0,$ (A.1)

with the invariants with respect to similarity transformations which are the trace $〈 A 〉 \equiv A_{j}^{j}$ , the second invariant $[A]$ and the determinant $| A |$ according to (for spaces with metric tensor $g_{i j} = g_{j i}$ we may set $A_{i k} \equiv g_{i j} A_{k}^{j}$ )

$〈 A 〉 \equiv A_{i i}, [A] \equiv \frac{1}{2} ({〈 A 〉}^{2} - 〈 A^{2} 〉), | A | \equiv \frac{1}{6} ({〈 A 〉}^{3} - 3 〈 A 〉 〈 A^{2} 〉 + 2 〈 A^{3} 〉) .$ (A.2)

As consequence of the Cayley-Hamilton identity the inverse operator to an arbitrary operator $A$ can be represented by

$A^{- 1} \equiv \frac{\bar{A}}{| A |}, \bar{A} \equiv A^{2} - 〈 A 〉 A + [A] I, \Rightarrow \bar{A} A = A \bar{A} = | A | I,$ (A.3)

where $\bar{A}$ is the so-called complementary (or associated) operator to $A$ . The invariants of this operator are

$〈 \bar{A} 〉 = [A], [\bar{A}] = | A | 〈 A 〉, | \bar{A} | = {| A |}^{2},$ (A.4)

and it possesses the properties

${\bar{A}}^{2} = [A] \bar{A} + | A | (A - 〈 A 〉 I), \bar{\bar{A}} \equiv \bar{(\bar{A})} = \frac{A}{| A |} .$ (A.5)

All relations in (A.3), (A.4) and (A.5) are general relations for arbitrary three-dimensional operators $A$ .

For the invariants of the sum of two operators $A$ and $B$ one derives the following identities (the complementary operators $\bar{A}$ to operator $A$ we define later)

$\begin{array}{l} 〈 A + B 〉 = 〈 A 〉 + 〈 B 〉, \\ [A + B] = [A] + 〈 A 〉 〈 B 〉 - 〈 A B 〉 + [B], \\ | A + B | = | A | + 〈 \bar{A} B 〉 + 〈 A \bar{B} 〉 + | B | . \end{array}$ (A.6)

Special cases are $B = β I$

$\begin{array}{l} 〈 A + β I 〉 = 〈 A 〉 + 3 β, \\ [A + β I] = [A] + 2 β 〈 A 〉 + 3 β^{2}, \\ | A + β I | = | A | + β [A] + β^{2} 〈 A 〉 + β^{3} . \end{array}$ (A.7)

The complementary operator of the sum of two operators is

$\bar{A + B} \equiv \bar{A} + A B + B A + \bar{B} - (〈 B 〉 A + 〈 A 〉 B) + (〈 A 〉 〈 B 〉 - 〈 A B 〉) I .$ (A.8)

The projection operator $Π$ for determination of eigenvectors of an operator $A$ to non-degenerate eigenvalue $α$ is

$Π = \frac{\bar{A - α I}}{〈 \bar{A - α I} 〉} = \frac{A^{2} - (〈 A 〉 - α) A + ([A] - α 〈 A 〉 + α^{2}) I}{[A] - 2 α 〈 A 〉 + 3 α^{2}}, Π^{2} = Π .$ (A.9)

For the differentiation of a determinant $| A |$ of an operator $A$ with respect to a parameter $λ$ we find from (A.2) and (A.3) the identity

$\frac{\partial | A |}{\partial λ} = 〈 \bar{A} \frac{\partial A}{\partial λ} 〉 = \frac{1}{2} 〈 \frac{\partial \bar{A}}{\partial λ} A 〉,$ (A.10)

where the complementary operator $\bar{A}$ to operator $A$ is defined in (A.3).

Appendix B. Identities for Vector and Volume Products in Connection with Operators

We derive here mathematical identities for volume and vector products in connection with operators which are almost unknown or less known in case that they are already somewhere published. A part of them is used in the main text of our considerations.

We consider the volume product $[x, y, z]$ of three vectors $x, y, z$ and apply now to each vector the same operator $A$ that means we consider the volume product $[A x, A y, A z]$ . Due to complete antisymmetry of the volume product with respect to permutations of neighbored vectors the volume product $[A x, A y, A z]$ is proportional to the volume product $[x, y, z]$ with a proportionality factor which we denote by $| A |$ and call the determinant of $A$ that means

$[A x, A y, A z] = | A | [x, y, z] .$ (B.1)

One may convince oneself that this is really a good possibility to define the determinant of a three-dimensional operator. Now we write the chain of identities for a general volume product

$[A x, A y] A z = [A x, A y, A z] = [x, y, z] | A | = [x, y] | A | z = [x, y] \bar{A} A z,$ (B.2)

where we substituted the determinant $| A |$ according to $| A | \to | A | I = \bar{A} A$ with $\bar{A}$ the complementary operator to $A$ (see also (A.3)). Since $z$ is an arbitrary vector we may omit $A z$ in the identity (B.2) and obtain the identity for vector products

$[A x, A y] = [x, y] \bar{A}, \Leftrightarrow [A x, A y] A = | A | [x, y] .$ (B.3)

From this also follows almost immediately

$[\bar{A} x, \bar{A} y] = [x, y] \bar{(\bar{A})} = | A | [x, y] A, (\bar{(\bar{A})} \equiv \bar{\bar{A}} = | A | A) .$ (B.4)

If we let act the operator $A$ onto vectors $\tilde{x}, \tilde{y}, \tilde{z}$ to the left then from

$[\tilde{x} A, \tilde{y} A, \tilde{z} A] = | A | [\tilde{x}, \tilde{y}, \tilde{z}],$ (B.5)

follows in analogous way

$[\tilde{x} A, \tilde{y} A] = \bar{A} [\tilde{x}, \tilde{y}], \Leftrightarrow A [\tilde{x} A, \tilde{y} A] = | A | [\tilde{x}, \tilde{y}] .$ (B.6)

If we substitute in (B.1) according to $A \to A - α I$ with arbitrary scalar $α$ and use for determinants

$| A - α I | = | A | - [A] α + 〈 A 〉 α^{2} - α^{3},$ (B.7)

then by collecting on both sides of the obtained identity the terms to equal powers of $α$ we find in addition to (B.1) two further identities of the form

$\begin{array}{l} [A x, A y, z] + [A x, y, A z] + [x, A y, A z] = [A] [x, y, z], \\ [A x, y, z] + [x, A y, z] + [x, A y, A z] = 〈 A 〉 [x, y, z] . \end{array}$ (B.8)

If we remove from (B.8) the vector $z$ or make the substitution $A \to A - α I$ in the identity (B.3) using

$\bar{A - α I} = \bar{A} - (〈 A 〉 I - A) α + I α^{2},$ (B.9)

and collect all terms on both sides to equal powers of $α$ we find in addition to (B.3) the identities

$\begin{array}{l} [A x, A y] + ([A x, y] + [x, A y]) A = [A] [x, y], \\ [A x, y] + [x, A y] + [x, y] A = 〈 A 〉 [x, y], \end{array}$ (B.10)

or equivalently to the last

$[A x, y] + [x, A y] = [x, y] (〈 A 〉 I - A) .$ (B.11)

All these mathematical identities may be also derived by means of the Levi-Civita pseudo-tensors. It is also clear that analogous identities can be derived for the action of operators onto the vectors to the left similar to (B.4) and (B.5) which we do not write down here.

We make a short remark to vector and volume products. It is very convenient to define $[y]$ as an antisymmetric covariant second-rank pseudo-tensor to the contravariant vector $y$ . Then one can use identities for vector and volume products by displacement of the squared brackets such as¹⁰

$\begin{array}{l} [y] z = [y, z], x [y] = [x, y], \\ x [y] z = x [y, z] = [x, y] z = [x, y, z] . \end{array}$ (B.12)

The second line written with contra- and covariant indices and with the Levi-Civity pseudo-tensor $ε_{j k l}$ written is for example

$[x, y, z] = ε_{j k l} x^{j} y^{k} z^{l} = x^{j} (ε_{j k l} y^{k}) z^{l} = x^{j} {[y]}_{j l} z^{l} = x^{j} {[y, z]}_{j} = {[x, y]}_{l} z^{l},$ (B.13)

and ${[y]}_{j l} \equiv ε_{j k l} y^{k}$ is the antisymmetric pseudo-tensor to vector $y^{k}$ . An “antisymmetric” operator ${[y]}_{l}^{i}$ which transforms vectors (or pseudo-vectors, depending on the kind of $y$ ) can be made from ${[y]}_{j k}$ only for Euclidean or pseudo-Euclidean spaces which possess a symmetrical metric tensor $g_{j k} = g_{k j}, (g^{i j} g_{j k} = δ_{k}^{i})$ by ${[y]}_{l}^{i} \equiv g^{i j} {[y]}_{j l}$ . With all this which may be presented in more precise form I made good experience for a long period of scientific work.

Appendix C. Algebra to a Special Operator for Bi-Anisotropic Media

We consider the algebra to the following three-dimensional operator $L$ which we need to obtain, in particular, to the calculation of the determinant of the operator for a general bi-anisotropic medium

$L \equiv \frac{A x \cdot \tilde{x} A - (\tilde{x} A x) A}{| A |} + B \equiv M + B,$ (C.1)

where $A$ and $B$ are general three-dimensional operators. It is a little more general than we need it since, in principal, we use in the main text only the special case of equality $\tilde{x} = x$ of the vectors $x$ and $\tilde{x}$ . With $M$ we abbreviate the operator

$M \equiv \frac{A x \cdot \tilde{x} A - (\tilde{x} A x) A}{| A |} .$ (C.2)

The determinant $| L |$ of the operator $L$ can be calculated by

$| L | = | M | + 〈 \bar{M} B 〉 + 〈 M \bar{B} 〉 + | B |,$ (C.3)

where overlining an operator means the transition to the complementary operator (see Section 2). This formula is known [9] [10] and surely some others and is easily to obtain by coordinate-invariant calculation. Thus we have first to consider some algebra of the more complicate part $M$ in (C.1).

First we have to calculate the powers $M^{2}$ and $M^{3}$ and their traces that is straightforward to make and that we do not write down. In particular, we find for the invariants of $M$

$\begin{array}{l} 〈 M 〉 = \frac{\tilde{x} A^{2} x - 〈 A 〉 \tilde{x} A x}{| A |}, \\ [M] = \frac{(\tilde{x} A x) \tilde{x} x}{| A |}, \\ | M | = 0, \end{array}$ (C.4)

that means the determinant of $M$ is vanishing which simplifies the application of the formula (C.3). That the determinant $| M |$ is vanishing follows also because the operator $M$ possesses the eigenvalue $μ = 0$ to right-hand eigenvector $x$ and to left-hand eigenvector $\tilde{x}$ . For the complementary operator $\bar{M}$ to $M$ using (A.3) we find then

$\bar{M} = \frac{(\tilde{x} A x) x \cdot \tilde{x}}{| A |}, 〈 \bar{M} 〉 = [M] = \frac{(\tilde{x} A x) \tilde{x} x}{| A |},$ (C.5)

and, furthermore, it is easy to check

$\bar{M} M = M \bar{M} = | M | I = 0.$ (C.6)

Now, according to the formula (C.3) with the explicit form (C.5) of $M$ and $| M | = 0$ we find

$\begin{matrix} | L | = \frac{(\tilde{x} A x) (\tilde{x} B x) - 〈 A \bar{B} 〉 (\tilde{x} A x) + \tilde{x} A \bar{B} A x + | A | | B |}{| A |} \\ = \frac{(\tilde{x} A x) (\tilde{x} B x) - \tilde{x} (〈 A \bar{B} 〉 A - A \bar{B} A) x + | A B |}{| A |} . \end{matrix}$ (C.7)

This agrees with the unessentially more special results of Fyodorov [9] (Equations (36.9), (36.10)) which, however, were calculated for the operator $B^{- 1} L$ (in our notation) from multiple vector products which brings an additional factor $| B |$ into the denominator of (C.7) but since he applies this immediately to the wave equation for which the determinant has to vanish he can omit this factor $| A B |$ in the denominator in Equation (36.10). The numerator in (C.7) is symmetric with respect to permutation of the operators $A ⇌ B$ . This can be achieved using a general operator identity from which results the following equivalent representation

$| L | = \frac{(\tilde{x} A x) (\tilde{x} B x) - \tilde{x} (\bar{A B} + \bar{B A} - [A B] I - ([A] \bar{B} + [B] \bar{A} - [A] [B] I)) x + | A B |}{| A |},$ (C.8)

which though a little longer in the numerator but shows this property. The mentioned operator identity is

$\begin{matrix} [A] \bar{B} = A B A B + A B^{2} A + B A B A - 〈 B 〉 A B A - 〈 A B 〉 (A B + B A) \\ + (〈 A B 〉 〈 B 〉 - 〈 A B^{2} 〉) A + [A B] I, \end{matrix}$ (C.9)

and can be derived from a more general operator identity with 3 operators $A$ , 1 operator $B$ and 1 operator $C$ in each sum term and in symmetric way which generalizes the Cayley-Hamilton identity by substitutions and specialization $C = B$ . A derivation by means of the Levi-Civita pseudo-tensors is also possible. Since it is long we do not derive it here but hope to find opportunity to do this in future. Finally, we give here the other two invariants of $L$ which are

$\begin{array}{l} 〈 L 〉 = \frac{(\tilde{x} A^{2} x - 〈 A 〉 \tilde{x} A x) + | A | 〈 B 〉}{| A |}, \\ [L] = \frac{(\tilde{x} A x) (\tilde{x} x) - (\tilde{x} A B A x - 〈 B 〉 \tilde{x} A^{2} x + (〈 A 〉 〈 B 〉 - 〈 A B 〉) \tilde{x} A x) + | A | [B]}{| A |}, \end{array}$ (C.10)

which are not symmetric in $A$ and $B$ and the complementary operator to $L$ is

$\begin{array}{l} \bar{L} = \frac{1}{| A |} {(\tilde{x} A x) x \cdot \tilde{x} + A x \cdot \tilde{x} A B + B A x \cdot \tilde{x} A - \tilde{x} A x (A B + B A) \\ - 〈 B 〉 A x \cdot \tilde{x} A + \tilde{x} A x (〈 B 〉 A + 〈 A 〉 B) - (\tilde{x} A^{2} x) B \\ - (\tilde{x} A B A x - 〈 B 〉 \tilde{x} A^{2} x + (〈 A 〉 〈 B 〉 - 〈 A B 〉) \tilde{x} A x) I} + \bar{B}, 〈 \bar{L} 〉 = [L] . \end{array}$ (C.11)

Appendix D. Is the Notion “Negative Refraction” Useful in Geometric and Wave Optics?

In connection with the paper of Pendry [22] and the mass of reactions to it I will express here also my thoughts though I did not follow this development from the beginning and cannot exclude that similar thoughts are already discussed in literature.

In wave optics in the treatment of beams we have a mean value $k_{0}$ of the wave vector and a mean value $ω_{0}$ of the frequency and both together with the complex conjugate part but well separated are involved in the main factors $e^{\pm i (k_{0} r - ω_{0} t)}$ . From the wave vector one may form the refraction vector $n$ by the definition

$k \equiv \frac{ω}{c} n, {| n |}^{2} = n^{2}, \Rightarrow k = \frac{ω}{c} | n | \frac{n}{| n |} = | k | \frac{n}{| n |},$ (D.1)

where both signs of $| n |$ (refraction index; in general it is complex) can be chosen without something changing at the wave. Only the direction described by the

unit vector $\frac{n}{| n |}$ in connection with the product $k = | k | \frac{n}{| n |}$ is invariant but not

one of the two factors alone. If one change the sign of only one factor then one describes a wave propagating in the opposite direction and for isotropic media in no other than of these directions.

The second important fact is that under reflection and refraction of a beam at a surface $N r = a$ with the normal unit vector $N$ the tangential components $\bar{k} \equiv [N [k, N]]$ of all coupled wave vectors $k$

$k = [N, [k, N]] + k N \cdot N \equiv \bar{k} + k N \cdot N, (N^{2} = 1),$ (D.2)

in both media have to be the same and only the normal components $(k N) N$ can be different with both possible signs and the same is true for the refraction vectors $n$ . Even both signs of the normal component of the wave vector in the second medium are possible one as the refracted beam from the incident beam in the first medium and the second as an incident beam in the second medium which generates the refracted beam in the first medium which is identical in its direction with the reflected beam from the incident beam in the first medium. Both these beams possess a group velocity for isotropic media in the same direction as the wave vector independent on positive or negative signs of the electric permittivity and the magnetic permeability and whether or not the wave vectors are real or complex quantities. This excludes the possibility of refraction as drawn in Figure 1 in [22]. Only in case that the group velocity is in opposite direction to the wave or refraction vector in second medium we have in our Figure 5 a right-hand picture which is somehow similar to mentioned of Pendry. This, however, is not connected with negative $ε (ω)$ and $μ (ω)$ but merely with an active medium with all its problems as discussed (e.g., super-luminal velocities, super-diffraction of beams) and is hardly realizable. The situation changes but not basically if the medium is anisotropic and, obviously, this is not meant.

Wave vectors and equally refraction vectors possess only a modulus and a certain direction in space but not a certain sign of each separately and the notion in the heading seems to me not being really useful.

NOTES

¹Long ago I visited a lecture of U. Leonhardt about this which I did not fully understand but also did not trace the original paper to this time. However, it seemed to me that U.L. assessed this paper very positively and as correct.

²Sometimes called magnetic induction but in this concept $B$ and $H$ are identical (see next Section) and $D (r, t)$ is the electric induction.

³In the general case of three-dimensional operators if $| L | \neq 0$ we have according to (A.5) the relation ${\bar{L}}^{2} = [L] \bar{L} + | L | (L - 〈 L 〉 I)$ as can be straightforwardly calculated from the definitions using the Cayley-Hamilton identity.

⁴All these problems could be removed if one defines $B = H μ$ instead of (3.6) but this possesses the danger to cause some confusion.

⁵For inclusion of higher than second-rank tensors the concept of coordinate-invariant treatment without using tensor indices fails but in three-dimensional case also third-rank tensors which are anti-symmetric in two indices can be included since they may be mapped onto second-rank pseudo-tensors.

⁶In the beginning of the eighties I sent my papers with application of coordinate-invariant methods (about 10) to Fyodorov who as it proved did not know them. Since this time we were in loose correspondence up to the end of the eighties and I used a big Optics Conference in Minsk in the eighties especially to meet him there personally and this took place in the main building of Academy of Sciences of Belarus on the main boulevard in Minsk. Once in the eighties I received a letter from Fyodorov about an unprecedented plagiarism in a book of H. C. Chen “Theory of Electromagnetic Waves, A Coordinate-Free Approach” from McGraw-Hill (1983). Fyodorov is not cited there and all is made to camouflage the real authorship of these methods. When I tried to see this book I could not find it and, as usual in such cases, a search programme in the libraries of GDR was started. It was not found in GDR and then in such cases it could be searched in West-Germany. In West-Berlin (about 15 km airline from me but unreachable that time) it was found in the Library of the Technical University and I got it for one month. Altogether, it lasted almost three quarters of a year to get it. All what Fyodorov said turned out to be true. The main part of Chen’s book is almost a free translation of main chapters of the book of Fyodorov [9]. However, what Fyodorov obviously did not know was that two chapters of Chen’s book are a plagiarism of my paper [15] to Huygens principle. Obviously also that Chen did not know my paper [25]. This journal was hardly known in the world and ceased to exist after the turn in GDR and he also did not use my papers to amplitude relations for reflection and refraction at anisotropic media in “Ann. d. Physik”, likely because my notations were too different from that of Fyodorov and were more adapted to Landau and Lifshits [4]. When I tried to inform a branch office of McGraw-Hill in Hamburg (FRG) by a letter about this and to get an exemplar without payment (I could not pay to this time West-German currency) my chief at this time Witlof Brunner demanded to write not about the “plagiarism” that was not acceptable for me and I renounced to send this letter. Later, after the turn in GDR, I found this same exemplar of the book of Chen which I earlier have had for a month in the reading room of the Library of the Technical University in West-Berlin and made a copy. Obviously, to this time it was already taken from market (I tried earlier to get it through a colleague, H. Haake, from Essen in West Germany whom I met at a Workshop in Poland but he wrote me then that it was impossible to order). Fyodorov and his scientific colleagues reached that in scientific media in the West and in Russian newspapers was written about the plagiarism. Weiglhofer [26] (see [27] ) and also some others did not know all this and cite the fraud of Chen instead of the genuine authors (but in [26] a book of Chen from 1992 is cited which obviously later could appear in U.S.A.). My chief to this time, W. Brunner, was indifferent and uninterested in all this and was not ready to support me.

⁷A bi-quadratic equation of the form $z^{4} - 2 p z^{2} + q^{2} = 0, (p = p^{*}, q = q^{*}, \Rightarrow q^{2} \geq 0),$ with real p and q and therefore non-negative q² possesses 4 solutions which may be written in the following form $z_{\pm}^{(\pm)} = (\pm) \frac{1}{\sqrt{2}} (\sqrt{p + q} \pm \sqrt{p - q}) = (\pm) \frac{\sqrt{2} q}{\sqrt{p + q} \mp \sqrt{p - q}},$ with only real or imaginary sum terms $\sqrt{p \pm q}$ . Clearly, these solutions are also true in general case but usually then do not provide separation into real and imaginary parts.

⁸Sometime in the eighties I asked H. Paul from our Institute to discuss with me the polariton model and he agreed. My first aim was to see did he know whether or not this model or something similar was already discussed in literature since I did not find neither theoretical considerations nor experimental hints for some results. He also did not know something and likely nothing existed. Thank! My second aim was to show him that the height as well as the width of the amplification contours in form of ellipses are proportional to the square root of the inverse occupation density and I made a drawing of this contour but nothing of the kind of the more general and complicated figures made by computer and presented here. I could not expect that we may discuss the formulae in detail (more than the permittivity he did not want to see) and soon H. Paul finished the discussion saying approximately: I believe you only if you have also the nonlinear terms in the equations. This, however, was not my intention, my possibility and my official task at that time.

⁹Long ago I heard or saw in popular sources something about experiments of G. Nimtz but did not take the super-luminal velocities seriously and did not try to find the original papers. Now, when I find as it seems to me by reliable mathematics such possibilities of group velocities greater than light velocity in vacuum I changed my opinion and look for physical explanations. However, I have to emphasize that I cannot support by own calculation the effects concerning tunneling and the conditions for them since I did not make such.

¹⁰This is likely also the intention when Fyodorov writes instead of my $[y]$ the form $y^{\times}$ with the consequence for the vector product “ $[y z] = y^{\times} z$ ” (similar to “ $y \times z$ ” (Gibbs); besides, F. writes vector products also with squared brackets but without a comma between the vectors) however, with the disadvantage that it works only to the right onto vectors.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1]	Agranovich, V.M. and Ginzburg, V.L. (1979) Spatial Dispersion in Crystal Optics and the Theory of Excitons. Second Edition, Nauka, Moskva.
[2]	Ginzburg, V.L. (1981) Theoretical Physics and Astrophysics. Nauka, Moskva.
[3]	Silin, V.P. and Rukhadze, A.A. (1961) Elektromagnitnye svoistva plasma i plasmapodobnykh sred, (Electromagnetic Properties of Plasma and Plasma-Like Media). Gosatomisdat, Moskva.
[4]	Landau, L.D. and Lifshitz, E.M. (1982) Electrodynamics of Continuous Media. 2nd Edition, Revised and Complemented by E.M. Lifshits and L.P. Pitayevski, Nauka, Moskva.
[5]	I.E. Tamm, (1976) Osnovy teorii elektrichestva (Foundations of the Theory of Electricity). 9th Edition, Nauka, Moskva.
[6]	Sommerfeld, A. (1959) Optik. 2nd Edition, Akademische Verlagsgesellschaft, Geest and Portig, Leipzig.
[7]	Born, M. and Wolf, E. (1999) Principles of Optics. Seventh Edition, Cambridge University Press, Cambridge.
[8]	Szivessy, G. (1928) Kristalloptik. In: Geiger, H. und Scheel, K., Eds., Handbuch der Physik, Band XX, Springer, Berlin, 635-904. https://doi.org/10.1007/978-3-642-90780-7_11
[9]	Fyodorov, F.I. (1958) Optika anisotropnykh sred (Optics of Anisotropic Media). Izdatelstvo Akademya nauk BSSR, Minsk. (In Russian)
[10]	Fyodorov, F.I. (1976) Teoriya girotropii (Theroy of Gyrotropy). Nauka i tekhnika, Minsk.
[11]	Fyodorov, F.I. and Filippov, V.V. (1976) Otrazhenye i prelomlyenye sveta prosrachnymi fristallami (Reflection and Refraction of Light by Transparent Crystals). Nauka i Tekhnika, Minsk.
[12]	Wünsche, A. (1970) Annals of Physics (Leipzig), 25, 201-214. https://doi.org/10.1002/andp.19704800207
[13]	Wünsche, A. (1978) Annals of Physics (Leipzig), 35, 303-320. https://doi.org/10.1002/andp.19784900407
[14]	Wünsche, A. (2021) Journal of Modern Physics, 12, 1866-1921. https://doi.org/10.4236/jmp.2021.1213108
[15]	Wünsche, A. (1970) Annals of Physics (Leipzig), 25, 179-200. https://doi.org/10.1002/andp.19704800206
[16]	Knox, R.S. (1963) Theory of Excitons. Academic Press, New York.
[17]	Agranovich, V.M. (1968) Teoriya Eksitonov (Theory of Excitons). Nauka, Moskva.
[18]	Davydov, A.S. (1968) Teoria molekularnykh eksitonov (Theory of Molecular Excitons). Nauka, Moskva.
[19]	Agranovich, V.M. and Galanin, M.D. (1978) Perenos energii elektronnovo vozbuzhdyeniya v kondensirovannykh sredakh, (Transmission of Energy of Electronic Excitations in Condensed Media). Nauka, Moskva.
[20]	Pekar, S.I. (1982) Kristallooptika i dobavochnyje volny (Crystal Optics and Additional Waves). Naukova Dumka, Kiev.
[21]	Wünsche, A. (2021) Relativistic-Covariant Energy-Momentum Tensor for Homogeneous Anisotropic Dispersive Media. https://doi.org/10.4236/jmp.2021.1213108
[22]	Pendry, J.B. (2000) Physical Review Letters, 89, 3966-3969. https://doi.org/10.1103/PhysRevLett.85.3966
[23]	de Groot, S.R. and Suttorp, L.G. (1972) Foundations of Electrodynamics. North Holland Publ. Comp., Amsterdam.
[24]	Bloembergen, N. (1965) Nonlinear Optics. W.A. Benjamin, Inc., New York.
[25]	Wünsche, A. (1971) der Wissenschaften zu Berlin, 13, 754-769. https://doi.org/10.1515/9783112531341-007
[26]	Weiglhofer, W.S. (1995) Frequency-Dependent Dyadic Green Functions for Bianisotropic Media. In: Barrett, T.W. and Grimes, D.M., Eds., Advanced Electromagnetism: Foundations, Theory and Applications, World Scientific Publishing, Singapore, 376-389. https://doi.org/10.1142/9789812831323_0013
[27]	Barrett, T.W. and Grimes, D.M. (1995) Advanced Electromagnetism, Foundations, Theory and Applications. World Scientific, Singapore. https://doi.org/10.1142/2599
[28]	Terletsky, Ya.P. (1966) Paradoksy teoriya otnositel”nosti, (Paradoxes of Relativity Theory). Nauka, Moskva.
[29]	Vaas, R. (2013) Tunnel durch Raum und Zeit. 5th Edition, Kosmos, Stuttgart.
[30]	Nimtz, G. and Haibel, H. (2004) Tunneleffekt—Räume ohne Zeit. Wiley VCH, Weinheim.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies