1. Introduction
The simplest eigenvalue problem in Algebra is that of finding the value of λ which makes the square matrix A diagonal, and this is solved by writing the characteristic equation which is
(1)
And solving for its roots, I is the unit matrix. However, the problem becomes more interesting when dealing with symmetric matrices since those are of importance in applied fields such as in sound in determining natural modes of vibrations or in elasticity. Such problems will stimulate using different computer software such as subroutine JACOBI [1].
Another kind of eigenvalue problems is that encountered in integral equations, e.g. to solve the eigenvalue problem [2]
(2)
Most important are the eigenvalue problems, faced with, in quantum mechanics, i.e. when the time-dependent Schrodinger equation or the time-independent one are under consideration and the energy values are required for a certain system (e.g. the energy levels of the Hydrogen atom). The time-independent Schrodinger equation, as an eigenvalue problem, is written as
(3)
where
is the Hamiltonian, E is the energy eigenvalue, and φ is the eigenfunction. Note that
is given by
(4)
is the Laplacian.
It is to be noted that dealing with eigenvalue problems, the operators encountered in the calculation are of the Hermitian type and that the Hamiltonian is Hermitian [3].
Keeping in mind the above small piece of information about the eigenvalue problems, we now turn to answer the important question: What is perturbation theory?
Perturbation theory is a theory built on certain mathematical methods used in order to get an approximate solution to some given problem with the condition that we start with an exact known solution for the related simpler problem. This means that the problem is partitioned into two parts: one is solvable and the other part represents the perturbation. Hence, perturbation theory is only used when the original problem cannot be solved exactly, but it can be formulated through the addition of a small perturbation term to the problem which can be solved exactly.
The ultimate solution to the original problem is obtained as a series called perturbation series. The first term of which is the exact solution of the related problem (without perturbation) and the other terms give the deviations in the exact solution; in fact, this gives us the approximate solution to the problem. It is clear that the more terms we take from the series, the better result we get.
In principle, the preliminary steps of performing perturbation techniques are as follows:
Let
be a small parameter and S is a certain solution to be approximated in the problem, then we arrive to a series of the type:
(5)
where
is the solution to the unperturbed problem and the approximation in the original problem starts with the term
. Hence, we get the approximate solution for the problem by truncating the above series; e.g. the approximate solution of the first order is
(6)
In a way, perturbation theory is similar to the methods used in numerical analysis. The first early time, the theory was used was in space mechanics to evaluate the moon’s orbit; however, perturbation theory is now widely used in many fields, such as wave mechanics, statistical mechanics, quantum mechanics, and chemistry [4].
In quantum mechanics, perturbation theory is used to get approximate solutions for complex quantum systems in terms of simpler ones. The idea is to start with a simple system which is solvable then to add a weak perturbation Hamiltonian, and if the perturbation is small then the different physical phenomena (observables) which are connected with the perturbed system, such as energy levels and eingenfunctions, can be expressed as corrections to the corresponding simple system. Due to the smallness of these corrections they can be calculated via approximate methods in the form of converging series.
Perturbation theory is a good and important tool to describe real quantum systems noting that the exact solutions of Schrodinger equation for some Hamiltonians are difficult to obtain even if these Hamiltonians are not that complex [5].
In the next section, we give some examples on the use of perturbation theory in solving algebraic equations and in solving differential and integral equations. In Section 3, we direct our attention to its uses in solving eigenvalue problems in many applied fields. Finally we conclude shedding light on some recent uses of perturbation theory.
2. Algebraic Perturbation Theory
As mentioned, earlier, one of the important eigenvalue problems are of the type of Equation (2), i.e. it is an integral equation and where
is the kernel; if this kernel is separable, which means that:
(7)
Then, one can get non-trivial solutions (eigenvalues and eigenfunctions).
Taking into account Equation (2) and Equation (7), we see that
(8)
Substituting, the eigenvalue will be given by
(9)
Now if
, then
and the eigenvalue is given by
[2].
2.1. Small Perturbation Parameter
If we consider the eigenvalue problem:
(11)
And if we are looking for a value of λgiving a solution
for this equation fulfilling the condition that
; then
is an eigenfunction corresponding to the eigenvalue λ. In this case the spectrum of the eigenvalues is discrete.
For the eigenvalue λ, it is clear, with a straight forward manipulation that
and the corresponding eigenfunction is
[6].
Now, consider the following eigenvalue problem
(11)
where
is a given real number. If
is assumed to be of small value, then the problem in this case is to find the eigenvalue
and the corresponding eigenfunction
. To solve the problem, we write the two following series
(12)
And
(13)
Moreover we put
(14)
So as to get
(15)
And if we assume that the series
is convergent in order to be able to perform multiplication term by term, then on comparing coefficients of
we get
(16)
In general for the nth terms, one gets
(17)
From Equation (16) we see that
and
, as expected.
The last equation is written in the form
, (18)
and where
(19)
Taking the inner product of Equation (18) with
and noting that the kernel is symmetric in x and y, one gets
(20)
is the first order correction to the eigenvalue. Equation (20) can be stated as “The approximation of first order in the eigenvalue is equal to the mean value of the perturbation operator B with respect to the normalized unperturbed eigenfunction
” [6] [7].
In this problem
. Moreover, the correction to the eigenfunction
is given by
.
Calculating
and
in the same manner, one obtains
and U as
(21)
And
(22)
2.2. Algebraic Equations
To illustrate the use of perturbation theory in solving certain algebraic equations, we give the following examples:
Example 1
Consider the cubic equation
, where
is a small perturbation number and assume that the solution is of the form
where
is a term containing all terms with
and higher. Substituting in the cubic equation and equating coefficients of
, we get
From which, we obtain
and
. Therefore, for
we get
and
; while for
we get
.
Continuing this process we can have the solution as convergent expansion in
. The truncation in the series is determined by the degree of precision we like to reach. Moreover, the same result can be obtained using the usual Taylor expansion [7].
Example 2
Again, we consider the cubic equation
; using this with the familiar expansion in
and on substitution, we get
,
,
; for
, we have
, this gives the linear part of the solution. As to the two other solutions, we note that as
,
. This is actually a standard singular perturbation problem. To overcome this problem, some sort of scaling is needed; this is done by writing x as
so as
as
, and from the cubic equation one gets
.
needs to be determined in order to fulfill our
requirement, namely to get a definite answer. For that, the principle of dominant balance is to be used; this requires that the two leading terms in the equation, at
least, should have the same order of magnitude. Hence
giving
. Our cubic equation, now, becomes, in terms of the variable y,
. Following the steps of the last example, we get
or
; which are the other two solutions of the problem [7].
Example 3
If the quadratic equation
is to be solved, then making the usual expansion in
and comparing coefficients of
, we get
,
; the first equation leads to the solution
which is a repeated root, substituting in the second equation we get a contradiction (0 = 1?) which means that the series techniques do not work here. Hence, we calculate the roots of the equation using the simple general square root method to get
; that gives us a hint for writing the expansion as
instead of the expansion
. Now
following the same techniques used before we get
,
and hence the solutions are
as expected [7].
Example 4
Given that
, then we see that there are two possible solutions for it for the case
, namely
1) When
and
.
2) When
as
, for this case x has to behave like
; and in order to get a convergent solution, we write the problem as
which implies that
, note that
is the initial starting point for any recurrence techniques we choose.
Using the formula
and defining
,
, we see that
,
, etc.
Note that the successive terms of the series converge slowly as
[7].
2.3. Examples from Different Mathematics Fields
In this subsection, we give applications of perturbation theory in other different fields of mathematics such as differential equations and geometry. These are illustrated by few examples.
Example 5
Given the ordinary differential equation
,
,
,
which represents a projectile vertical motion in air with air resistance force taken into account.
, where v is the initial velocity, m is the projectile mass, k is the friction constant, and g is the gravitational constant. If we want now to
solve for y, we get the exact solution as
; and if
which corresponds to the case when the air resistance is weak, then we use Taylor’s series to obtain the first few terms of the expansion as
. The expansion is valid for
.
Now, assuming that we are not aware of the exact solution, then we use perturbation theory to write
; and where
are to be determined; as before, we substitute this in the original equation, following the same old techniques, keeping in mind that the initial conditions become
,
; then we get the system of differential equations
;
;
, on equating the coefficients of
. Solving these
equations, we get
,
,
, in conformation with the results obtained using Taylor’s series expansion [8].
Example 6
Consider a right angle triangle with sides 1, 0.1, and
, it is difficult to differentiate between this right angle triangle and its being an isosceles one; since there is not that difference between the two sides 1 and 1.0049. Let us make a simulation of the situation using perturbation theory by considering the side as 1,
and
. Using binomial theorem we have
. Hence, the difference between the side 1 and the hypotenuse is of order
which is a small number. Note that the angle between
them is given by
which is a sensible angle and can be
observed better than the difference between the side and the hypotenuse. Actually, if we make the value of
smaller, we will be misled to the conclusion that the triangle is segment of a straight line [9].
2.4. The Smallness of the Perturbation Parameter and Its Relation with the Unperturbed Operator
If we consider the following eigenvalue problem
(23)
where a is a given positive constant, then the discrete eigenvalues are given by
(24)
And the corresponding eigenfunctions are
(25)
is a Hermite polynomial.
This problem describes the eigenvalue problem for the simple harmonic oscillator [10].
Now, adding a perturbation term of the form
, we get
(26)
We may look at the term
as not small. This is because the variable value x takes values from −∞ to ∞; however if we compare this perturbation parameter with the term
which is the non-perturbed one, the perturbation parameter (for small
) is then considered as a small perturbation. One concludes then that: In perturbation problems, the perturbing parameter has to be small in comparison with the unperturbed operator [6].
Now the eigenvalues and eigenfunctions for the perturbed problem are given by
(27a)
and
(27b)
Note that the functions
are regular functions in
.
At the end of this subsection, we note that the main goal of perturbation theory is the precise definition of smallness of the perturbing parameter with respect to the unperturbed operator [6].
Before we move to the applications of perturbation theory in different applied fields, we make a little digression, in the next section, on spectral perturbation theory.
2.5. Spectral Perturbation Theory
The theory deals with the way the perturbation affects the spectrum of a certain operator when it is perturbed. This compels us to reformulate the problem in a more precise and different manner [6].
Definition 1
If
is a Hilbert space with inner product
and that
is a linear operator in
such that
and
is the domain of
which depends on a small parameter
; assume that
1)
Is a self-adjoint operator, i.e.
(28)
2)
has a smooth branch of simple eigenvalues
with eigenvectors
; namely
(29)
Then, the job of perturbation theory is the calculation of the eigenvalues with respect to their values at
when
is small but not equal to zero [6].
To clarify this idea, we take as an example the perturbation in the eigenvalues for a symmetric matrix; in this case
, with the Euclidean inner product
(30)
And
is a linear transformation for a symmetric matrix of the order
(
). [note that we are concentrating on symmetric matrices because these are what we face in physics]
Moreover, the perturbation in the eigenvalues for a Hermitian matrix is of our concern, in this case
with the inner product
(31)
Problems of this sort are found in many applications especially in quantum mechanics and where Hilbert spaces of infinite dimensions are dealt with. In this case expansions of the form
, (32)
, (33)
(34)
Are used.
Now following the same techniques we have used in solving previous examples and in particular equating coefficients of
, we get the following equations
(35)
(36)
(37)
[I is the
unit matrix].
Now, suppose that
, then from Equation (35)
is the eigenvalue of the unperturbed operator
; and the corresponding eigenvector is
[7].
As to the second correction
and
, we note that Equation (36) is a singular equation and in order to be able to solve it, we need a certain hypothesis and the so-called Fredholm alternative [7].
3. Applications
In this section, we give some representative well-known problems in different branches of physics; we start first with the ones from classical physics, then we move to the ones from modern physics (or more specific in quantum mechanics).
3.1. Problems from Classical Physics
1) String Problem in the Theory of Sound
The vibrations of a string, with a small stiffness can be described the following eigenvalue problem
(38)
where
is a small positive number and is proportional to the stiffness, i.e.
represents the perturbation. To solve this problem boundary conditions are needed. For instance to impose the conditions at the end points;
and
[6].
If the two ends of the string are to be supported, then the boundary conditions will be
,
; with these conditions in mind, one can get the eigenfunctions and eigenvalues as
(39)
The set
is complete in the interval
; this means that
represents the full spectrum of the problem. For
, the spectrum is bounded below and the smallest value of the eigenvalues is
. However, for
the spectrum is not bounded below and this is not allowed because
measure the stiffness which is a physical quantity and is not negative.
2) The Simple Pendulum
The equation of motion for the simple pendulum is
(40)
where
is the angle between the light string (of length L) and the vertical equilibrium position, m is the mass, and g is the gravitational acceleration. The equation can be written as
(41)
where
.
Now, taking into account the initial conditions, the problem, for a small angle, then becomes
(42)
And
,
.
Putting
and dividing by
, the last equation can be written as
(43)
If the solution is required to be of second order, then the higher order terms can be neglected. Making the transformation
, where t is the time, Equation (43) is then written as
(44)
And where the differentiation is with respect to
. Now, expanding in terms of powers of
, we get
(45)
And
(46)
Substituting with Equations (45)-(46) in Equation (44) and equating coefficients of
, one obtains
(47)
And
(48)
Choosing
, the solution
is given by
, (50)
while
satisfies the equation
(51)
If we choose
the solution
will then be
(52)
and the final solution is
(53)
Moreover,
[11].
3) Steady State Diffusion Through a Corrugated Slab
In this application, the simplest type of heat transfer is dealt with, which is the steady state type, i.e. when
. The flow of heat is through a corrugated
slab of thickness H, with the assumption that the lower side (
) is at temperature
; and the upper side (
) is at temperature
. Then, for steady state flow we have
(54)
(this is because the temperature does not depend on time explicitly
).
From the boundary conditions, the solution is given by
(55)
And the heat flux through the slab is determined by
(56)
Now, assuming that the slab occupies the region
,
, and suppose that the slab is not homogenous with a boundary condition on the lower part at
given by
(57)
where
; in this case Laplace equation for the steady state case is given by
(58)
with Dirichlet boundary conditions given as
(59)
Introducing the dimensionless variables
(60)
The problem becomes dimensionless and the boundary conditions will be as
(61)
where
. It is clear that
and hence
.
For small amplitudes,
, and this is fulfilled when
is made to be very small (
with
kept as a constant) [12].
Now, the boundary condition
can be written as a Taylor series as
(62)
Also, using perturbation theory, T is written as
(63)
Using the last two equations, following the same old procedure, and equating coefficients of
, we get
(64)
This is for the first boundary condition; as for the second one
, it tends to
and
, with all
satisfying Laplace equation
. putting, now
; we get the solution as
(65)
4) Incompressible Flow Past a Circle
Here, the stream function is given by
(66)
is the velocity; and if the motion is inviscid, the function satisfies the equation
(67)
where
is the vorticity, which is equal to zero upstream if the flow is uniform; hence Equation (66) becomes, in cylindrical coordinates, as
(68)
Also, the boundary conditions are given by
as
(upstream) (69)
and
(on the surface) (70)
where a is the radius of the circle. Moreover, it is assumed that there is no rotational motion, this means that the flow should be symmetrical about the line
and this means that
; hence the solution, in this case, will be
(71)
Now, in the presence of the circle and if the vorticity number
is small, then we expect that the solution will not depart from the one we got in Equation (70). So, we write the solution in the presence of vorticity as
(72)
is the unperturbed solution given by Equation (70).
On substituting Equation (71) in Equation (67), and equating coefficients of
, we get
(73)
Also, we get the boundary conditions as
(74)
Therefore, the solution is written as
(75)
And the equation in
is given by
(76)
With the conditions
(77)
Using separation of variables, the solution in final form is
(78)
Equation (77) gives the solution in first order in
[12].
3.2. Problems in Modern Physics
In some physics subjects, one can solve for the eigenvalues and eigenfunctions of the Hamiltonian of the problem exactly and precisely such as the simple harmonic oscillator and the Hydrogen atom in atomic physics. However, many physical systems in nature cannot be solved exactly and that requires the need for the development of appropriate procedures and tools to deal with such situations. As we stated before, perturbation theory was successful in treating these cases in which the distortion in the system, that can be solved exactly, is small.
Mathematically, the Hamiltonian of the system
is written as
(79)
is the unperturbed part of the Hamiltonian,
is a small parameter, and
is the potential, related to the perturbation, and supposed to be independent of time.
We assume that we can solve the time-independent Schrodinger equation for the unperturbed Hamiltonian i.e. we can get the eigenvalues
and the eigenfunctions
, such that
(80)
Such solutions may give unperturbed eigenvalues
, nondegenerate or degenerate.
3.3. Nondegenerate Solutions
Let the time-independent Schrodinger equation for the total Hamiltonian be
(81)
Since
is a small parameter, we can use Taylor series to expand the wave function
in terms of
as
(82)
The same thing can be done with the state energy to get
(83)
Substituting with these two equations in Equation (81) one gets
(84)
[
are the corrections in E and
].
Comparing terms of
, one gets
For the order
(85)
This equation actually shows that
is the unperturbed eigenfunction corresponding to the eigenvalue
.
For the order
(86)
Taking the scalar product of this equation with
, one gets
(87)
Using the hermiticity of
and that
is the eigenvalue of
, one gets
(88)
represents the shift in E to first order; which means that
(89)
This is a very important result which can be stated as
{The perturbed energy levels can be calculated from those which are not perturbed} [13].
Now, taking the scalar product of Equation (86) with
and using the orthogonality of the eigenfunctions, and the hermiticty of the unperturbed Hamiltonian, one obtains
(90)
Hence and in first order, the perturbed eigenfunction is given by
(91)
This result is for the ground state; a similar result can be obtained for the other energy levels
and
(92)
Note that
is normalized to first order in
, namely
; where Dirac notation is used here.
Example 7. Infinite Square Well
Consider a particle moving under the potential described as
, and assume that it is required to compute the
ground state energy to first order using perturbation theory. Then, to do that we need to seek solutions of the corresponding infinite square well problem. The
potential in this case is given by
; this means that the particle
(m) is moving inside a box (
) freely, while there is a very strong repulsion force at the walls (
). Hence, the time-independent Schrodinger equation
is given by
, its solution is then
; moreover, the energy eigenvalues are given by
; and the least value energy ( ground state) is
[13].
Now, going back to the original problem, we see that the ground state energy needs to be calculated to first order where the perturbation is given by
; this is small compared with the term
.
Or
. Using perturbation theory techniques, one obtains the shift in ground state energy as
(it is to be noted that n = 1 for the ground state energy) [13] [14].
Example 8. Simple Harmonic Oscillator under a Linear Perturbation
The unperturbed Hamiltonian for the simple harmonic oscillator in one dimension is given by
,
where
. Putting
, one gets for the unperturbed states energy
,
; the superscript 0 means that the energy is for the unperturbed case. Also, the unperturbed normalized eigenfunctions are given by
;
;
are Hermite polynomials.
Now, in the presence of the linear perturbation
, we need to compute the matrix elements of
; which is
; again Dirac notation is used here for the eigenfunctions. Using the recurrence relation for Hermite polynomials [10],
, and the form
of
, one obtains that
. Taking the product of this equation with
and using the orthogonality of these functions, we get
; and hence, for a linear perturbation of the form
, the total Hamiltonian is
; where
. Therefore, the total energy, including the perturbation term, is given by
[14].
3.4. Degenerate States
The correction in the eigenfunctions, in first order of perturbation, is small only when
(93)
If the difference between the unperturbed energy eigenvalues is small compared with the matrix element in the numerator
, then the perturbation will be large and the approximation fails. In particular, if there exist levels of the degenerate type (levels with the same energy), the denominator will be equal to zero which is a singular situation and the solution obtained for the nondegenerate states is not acceptable.
Now, if unperturbed state is l-fold degenerate, then we use the projection
on the state and the projection
perpendicular to it. Acting with
on the space of these degenerate states, one gets
(94)
Choosing an appropriate basis for the space of the degenerate states
;
can be written as
(95)
So, one gets
(96)
Hence,
is an eigenvalue of the matrix
. this will lead to l roots. Equation (96) is a generalization of Equation (89) for the nondegenerate case. Moreover, if the states are degenerate, one can, partially, remove the degeneracy as will be shown in the following example.
Example 9. Stark Effect
The Schrodinger equation for the Hydrogen atom is given by [10]
(97)
are the mass and charge of the electron which is orbiting around the proton.
Equation (96) can be written as
, where
is the Hamiltonian and u is the eigenfunction which describes the electron motion under the attractive
force
; while E is the eigenvalue (the energy). Using separation of variables in spherical coordinates, the eigenvalues are found to be [10]
(98)
n is the principal quantum number and for each n there exist an orbital angular quantum number l such that
and
; also, there exist a magnetic quantum number m such that
. This means that for each orbital quantum number l the degeneracy in energy states equals
.
The state with
is labelled as s-state, while the one with
is called the p-state, … and so on. Moreover, the ground eigenstate for Hydrogen atom is
given
[10]. Note that, in this case,
. In terms of the fine structure formula, the wave function in this situation is expressed as
. If
, then
and there is a 4-fold degeneracy with the same energy and the eigenstates are
,
,
,
,
Now, if the Hydrogen atom is acted upon by a weak electric field
in the z-direction, and
;
is the magnitude of the electric field. The potential, in this case, is
.
To compute the matrix elements of
, for the unperturbed states for the Hydrogen atom with energy
, we note that we are computing a 4 × 4 Hermitian matrix keeping in mind that the perturbing potential is odd under parity operation; and hence the only non-zero elements are those with different parity (e.g.
, and
, this because
is of even parity, while
is of odd parity). Note, also that the Hydrogen eigenstates are eigenstates of the total angular momentum
and the z-component of the angular momentum
. Therefore,
between states p and s may not be zeros; moreover V commutes with
. Taking previous information into account, the only non-zero elements are
and its Hermitian conjugate. i.e.
. Note that to reach these results, the forms of
the Hydrogen eigenstates were used;
is Bohr radius. From these results we conclude that the electric field was able to remove the degeneracy between the two states
and
(it made what was hidden apparent); while the other two states
are still without any change in their status. What happened for the states
and
was a discretization in the energy, so that the energy of one of them becomes
and the other is
. This is the so-called Stark effect [13].
Example 10. The Approximation in Calculating the Ground State Energy for the Helium Atom
In the He atom, the nucleus is composed of two protons and two neutrons with two electrons moving around the nucleus. The unperturbed total Hamiltonian is given by
where
are the Hamiltonians for the first and second electrons; the total, ground state, eigenfunction of the total unperturbed Hamiltonian is given by
;
are the position vectors for electron 1 and electron 2 respectively. While,
has to do with the spin of the ground state.
For the total Hamiltonian
, when the repulsion between the two electrons is taken into account (
) and which is considered as the perturbation in the
problem, is given by
and
,
.
Now, to compute the first order correction in the energy, we compute the expectation value for
, and where
; hence
; (
is the atomic number for Helium and
is Rydberg constant) [13].
Therefore,
, and this represents the corrected value of the energy, in first order, for the Helium ground state [13] [15].
Example 11. Zeeman Effect
In this problem, an electron (−e) moves under the effect of a field resulting from a nucleus with positive charge Ze positioned at the center of the coordinates system
; moreover, there is a uniform magnetic field in the z-direction
so that the wave equation is given by
,
is a
small parameter representing the perturbation and
is related to the energy (it is the eigenvalue of the equation).
Now, if
(no perturbation), the eigenvalues,
, are then given by
,
; and the corresponding eigentates are
,
;
. Bu,
. The effect of this last operator is clear, since
. Hence, the eigenvalue, in the presence of perturbation, is given by
. While the eigenfunctions do not change with the perturbation [6].
Thence, the magnetic field removed the degeneracy in eigenvalues, e.g. if
then
; and the energy levels become
, respectively.
4. Concluding Discussion
In this brief review article, we have shown the importance of perturbation theory in evaluating the eigenvalues and eigenfunctions for physical systems in many applied fields, in classical mechanics, fluid mechanics, heat flow in certain bodies, and most prominent in quantum mechanics where we gave sample examples of well-known problems in the literature and cited in the references given below.
Perturbation theory will keep to be a vital subject leading to many applications in many fields; as an example, recently a PhD dissertation is written on perturbation theory and its role in the evaluation of Feyman Integrals using integration by parts identities [16].