Conceptual Problems in Bell’s Inequality and Quantum Entanglement ()
1. A Brief Historical Review
The description of the microscopic particles in quantum mechanics is very different from that in classical physics. The coordinates of an electron in atom are uncertain, and the electron stays at a place according to probability. A physical quantity is uncertain before it is measured. The wave function of the measured particle collapses to an eigenstate of the corresponding operator according to a certain probability. The most distinctive feature of the micro-world is uncertainty, observation and measurement results only obey statistical law. Conjugate physical quantities satisfy the Heisenberg uncertainty relation and cannot be accurately measured at the same time, such as position and momentum of a particle cannot be determined simultaneously.
There are many famous remarks about the loss of certainty and reality. For example, Einstein oppugned Bohr, “Does the moon exist only when someone is looking at it?” The Schrödinger’s cat in the black box is in a superposed state of both life and death. Feynman said, “I think I can safely say, nobody understands quantum mechanics”. Niels Bohr, one of the founders of quantum theory, pointed out, “If quantum mechanics hasn’t profoundly shocked you, you haven’t understood it yet.”
As for the philosophical interpretation of quantum theory, two opposing camps were formed from the beginning, represented by Einstein and Bohr. The first encounter between the two factions was at the fifth Solvay Conference in 1927. It was an unprecedented summit in physics, with 17 of the 29 participants winning the Nobel Prize in physics. Einstein’s main points of questioning quantum theory are three aspects: determinacy, reality and locality. Einstein always adheres to that, a complete physical theory should be of realism and causality. With regard to certainty, he said, God does not play dice. Reality holds that the existence in the world is independent of human observation. Locality means that there is no interaction at a distance between two points far away from each other.
In 1935, Einstein and his assistants Boris Podolsky and Nathan Rosen (EPR) at the Princeton Institute of Advanced Studies examined the completeness of quantum mechanics in depth [1]. EPR’s paper puts forward a viewpoint that a satisfactory physical theory should not only be consistent with the experimental results, but also that every element of the physical system must have a corresponding concept in the theory, that is, the theoretical description should be complete. Quantum mechanics holds that the physical system is described by a state function, each physical quantity corresponds to a Hermitian operator, and the corresponding physical quantity is determined only when the state function is the eigenstate of an operator. If the two operators are not commutable, the state function cannot be the common eigen state of the two operators at the same time, so at least one physical quantity is uncertain.
In EPR paper, the correlation between two particles after decoupling is discussed. The wave function of the correlated particles is at a mixed state before it is measured, If the physical quantity of the first particle is measured as
, the wave function of the second particle will also be at the corresponding eigenstate, so that the corresponding
also has a definite value. If the conjugate variable
of the first particle is measured, the corresponding variable
of the second particle also has a definite value.
EPR believes that there is no interaction between the two particles after they are separated, but by measuring the physical quantity of the first particle, we can accurately know the physical quantity of the other particle, and both conjugate physical quantities have definite values. This contradicts the basic principle of quantum mechanics, so the description of reality in quantum mechanics is incomplete. The correlation phenomenon between two particles leaving away was called quantum entanglement, but Einstein himself never believed in the existence of this correlation. Later, David Bohm changed the momentum correlation of particle pairs in EPR paper to spin correlation which is easier to test [2] [3].
In order to coordinate the contradiction between classical mechanics and quantum mechanics, David Bohm proposed an explanation of hidden variables theory about quantum mechanics in 1952, which linked the classical orbit of particles to the probability distribution of wave function [4] [5]. The main points of his explanation are as follows (for simplicity, we take
as unit). Let
be the solution of the time-dependent Schrödinger equation of a particle, that is,
(1)
where
is the momentum operator of the particle.
is expressed in the form of absolute value and corresponding phase as follows
(2)
Calculation shows that the phase function
satisfies the generalized Hamilton-Jacobi equation in classical mechanics
(3)
In which Q is the quantum potential introduced by Bohm. Let
, then S satisfies the classical Hamilton-Jacobi equation. If the relation
in Hamilton-Jacobi theory is introduced, by solving the momentum equation of the particle
(4)
we get a classical trajectory of the particle
. Because we don’t know the initial position
of particles exactly, one can only use the sets of these trajectories to explain the probability distribution of particle motion.
(5)
This is the classical explanation for quantum mechanics made by Bohm, which links quantum mechanics to the classical mechanics by (4) and Hamilton-Jacobi equations.
At the technical level, the book “Mathematical Foundations of Quantum Mechanics” published by J. von Neumann is a milestone [6]. According to the operator theory on Hilbert space, this book provides a strict mathematical framework for non-relativistic quantum mechanics, and proves the equivalence between matrix mechanics and wave mechanics. By introducing the concept of density matrix, the statistical theory of quantum mechanics is established. According to the uncertain characteristics of quantum theory, J. von Neumann proved that a realistic hidden variable theory is impossible.
John Stewart Bell carefully checked von Neumann’s proof of the impossibility of hidden variables and found that there is a loophole in the correspondence between mathematics and physics. von Neumann, in his proof, used an assumption that “the average of the sum of the two observables is equal to the sum of the average of the observables” [3] [7]. Bell points out that if the two observables are conjugate variables, that is, when they obey the uncertainty principle in quantum mechanics, this conclusion is incorrect. However, in Bell’s rebuttal, whether the conjugate variables can be summable and whether the sum has physical meaning were not explained. After intensive study of the theory of hidden variables and the EPR paradox, Bell derived a famous inequality, whose purpose is to establish an experimental criterion for locality theory and quantum correlation [8].
Bell’s discovery thus shifted Einstein and Bohr’s debate from epistemology to the realm of experimental physics. Within a few years, Bell’s inequalities were adopted to a practical scheme [9]. The first experiments were carried out in 1972 at the University of California, Berkeley [10], and at Harvard [11] [12], then in 1976 at Texas A&M [13]. Later, it was found that there were some loopholes in these experiments, and some improved schemes were put forward. Some more reliable experiments were carried out, but the experimental results violate Bell’s inequality [14] [15] [16] [17].
In the following sections, we directly move into the theme of this article, where we examine concretely the physical meaning of the Bell inequality and derive the correlation functions of quantum entanglement. The interpretation of quantum theory involves more philosophical viewpoints and fundamental theories, we put it in the last section to discuss in detail.
2. The Physical Meaning of Bell’s Inequality?
If we do not consider the specific physical meaning of the inequality at first, Bell and CHSH actually proved the following mathematical theorem [9].
Theorem Assume
1˚ For
, the function
satisfies
(6)
2˚ For any two unit direction vectors
and
, the functions
satisfy
(7)
Define the correlation function between the two directions as
(8)
Then we have the following inequality
(9)
Now we examine the above theorem by corresponding to physical processes. I. What is the physical meaning of the function
, and does it correspond to specific particles and physical processes? Each term in the left hand of (9) corresponds to different particle and different process, but in proof of this inequality, several calculations similar to
are also used. The proof is valid only if
used in all calculations are the same function. This is a universal function that contains everything, whose requirement is equivalent to the theory of everything, and in which the so-called hidden variable
is not clear whether it is related to specific particles and processes. If the existence of the universal function and its physical meaning cannot be explained clearly, the physical meaning of the Bell’s inequality is also unclear.
1) Let’s examine a specific example. Suppose there is a coupling positron-electron pair with a total spin of 0. For
, they are moving away relatively after decoupling, and their spins are opposite. Assuming the electron has coordinate
and moves to
, and the positron has coordinate
and moves to
. Their wave function is given by
. Record the spin of the
electron measured at time
, place
and direction
as
,
and measured the spin of the positron at
, place
and direction
as
.
Because we only care about spin of particle now, the coordinates
can be regarded as part of the hidden variable according to the Bohm’s point of view, and the probability density function is given by
. These are the usual concepts of quantum mechanics, and they obviously satisfy the conditions 1˚ and 2˚ of the theorem. The time parameter t is introduced here because all measurements must have time to complete, but Bell’s proof uses a density function of
without time, so the proof implies the requirement of simultaneity [18]. However, the four measurements in the Bell’s inequality cannot be synchronized at all, and the functions
for different particle pairs are also different. Considering the physical process, the physical meaning of (9) is not clear, so the Bell’s inequality cannot explain the spin correlation of entangled particle pairs.
2) We clearly assume that the spin of the entangled pair is opposite before the measurement, but Bell, when defining (8), says that the two measurements are independent. We should replace
by
to meet the actual situation, and it is logical to calculate the spin correlation function by calculating the conditional probability. Therefore, the physical meaning of corresponding concepts in (8) is not clear.
3) Spin of a particle is a very sensitive physical quantity. Only an infinitesimal energy is needed to drive the spin to follow the magnetic field. The above measured results are not the actual spin values of the particles before the measurement, but the spin values polarized by the magnetic field of instrument. These values already have little to do with the spin values before the measurement.
To sum up, we find that the Bell’s inequality is only a strange mathematical theorem, which has not clear corresponding relationship with the physical process that Bell wants to explain. The physical content it represents is actually unclear. There are also a lot of studies that question Bell’s inequality from other aspects [18] - [25].
3. Spin Correlation Function of Entangled Particle Pair
The above analysis shows that, the physical meaning represented by Bell’s inequality is actually unclear. Now let’s check the calculation of spin correlation function of entangled particles in quantum mechanics [3] [26]. As in the previous section, denoting the particle has coordinate
and moves to
, and the other has coordinate
and moves to
. Their wave function is given by
. Record the spin of the particle measured at
, place
and
dirction
as
, and measured at
, place
and dirction
as
.
The following is the expression of quantum theory, and the wave function matrix of this entangled particle pair is represented as “Bell state”
(10)
The expected value of quantum mechanics for the correlation function
is given by
(11)
However, there are the following problems in the calculation of the above formula:
1) From the point of view of quantum theory, the spin direction of the entangled particles is uncertain until it is measured, and the statistical expectations should be calculated averagely in all directions. However, in (10), the state functions of entangled pairs are polarized, and the direction can only be along the direction of
, only up or down cannot be determined and the probability is 50% each.
2) In (10) and (11), the direct product is Kronecker product
. Obviously, two entangled particles
are completely symmetrical, which particle is the first one or the second one is completely designated. Similarly, the two measuring instruments
are also completely symmetrical. But the calculating result of the direct product is neither symmetrical nor antisymmetrical to
or
, which is clearly illogical in theory. How about a few more entangled particles?
3) It is clear that there is also a time-dependent problem in calculating statistical expectations in (11). Although the final result of quantum theory
(12)
is correct, the result was only obtained by guess, because the concept of direct product for multi-particle systems is not reasonably defined. The original intention of EPR’s paper is to question the unclear concepts of quantum theory, but the problem is still not solved.
Now we deal with this problem according to normal logic, and calculate the expected value of the correlation function reasonably. For the description of particle spin, to use Clifford algebra is more convenient, and the calculation is more standard and simple. The element in three-dimensional Euclidean space is given by
(13)
in which the generators
meet the
Clifford algebra
The simplest representation of these generators is the Pauli matrices.
They form a natural representation of the vector basis of the 3-d Euclidean space. The basis of the complete Clifford algebra is given by
They correspond to scalar, vector, pseudo vector and pseudo scalar respectively. Clifford algebra naturally converts geometric calculation into algebra operation, according to which all 3-d Euclidean geometry and field theory can be derived, and the conclusion can be simply extended to the case of curved space [27] [28] [29].
Any unit vector
in the spherical coordinate system can be represented as
(14)
The spin operator
along the direction of
can be represented by matrix form
(15)
Originally,
corresponds to a pseudo vector, for the sake of simplicity,
we ignore the factor
. It is easy to check the eigenvalues of
are
and the corresponding characteristic vectors are given by
(16)
Therefore, before the spin of the particle is polarized, the correct form of the state vector represented by (10) should be
(17)
The spin direction of a particle is just the direction of the symmetry axis of the wave function. Because the electromagnetic interaction has dissipation effect for a particle at the non-energy eigenstate, the steady state wave function of each charged particle cannot be a combination of several energy eigenstates. Therefore, representations such as (10) are contrary to logical and physical laws and cannot be realistic. The expected value of the correlation function
should be
(18)
where
represents the unit sphere, and the integral is the expected value of quantum mechanics, which is the pure geometric projection average. The above formula uses the conditions
. g is a measuring factor, which reflects the dynamic effect of the measuring instrument on the particle spin, that is, the polarization of the magnetic field on the particle spin.
In accordance with the logical rules, the calculation of correlation function should be under the conditional probability, which is equivalent to the calculation of the direct product in the above formula is just ordinary multiplication, and the spin projection calculation of two particles is separated as follows,
(19)
If
, (19) is the same as (12).
has a profound physical meaning and reflects the polarization of particle spin by the instrument. An extremely weak magnetic field can turn the original spin in the other two vertical directions into the direction of the magnetic field. The projection calculation of quantum theory obviously does not include this dynamic effect of measuring instruments on particles, and such effect cannot also be simply explained by the uncertainty relation. (19) shows that the correlation of spins of the entangled particle pair does not have any mysterious content that goes beyond the classical concept and the normal logic, so the previous dispute may be caused by incorrect correspondence between concept and physical reality.
4. Polarization Correlation of Entangled Photons
Let the angles between the two direction vectors
and the x axis be a and b,
respectively, then
,
are the angles of two vertical directions.
Let
and
represent the states of polarization of photons in
and
directions respectively, in the form of a matrix we have
The calculation of state function for tangled photon pair in the published literature is given by [3] [26]
(20)
in which
is the line polarization state functions in the direction of x and y respectively. (20) obviously does not represent the state of all associated photon pairs. For example, if the angle between the polarization direction of a photon and the x-axis is
, the polarization state functions of this photon and the photon with perpendicular polarization direction are respectively given by
(21)
Therefore, before the measurement, the state function of the entangled polarized photon pair should be
(22)
Denote the operators of projection onto the direction of
and
respectively by
The projection expectation function is given by
(23)
Then the expected value of quantum mechanics for the correlation functions of the two measurement directions is as follows:
(24)
where k is the measuring factor, reflecting the dynamic effect of the polarizer on the measurement results. If
, (24) is the same as the experimental result [26].
5. Discussion and Suggestion
The mathematical framework of modern quantum mechanics is mainly the following four basic assumptions [6] [30]:
A1. Any isolated physical system has a Hilbert space as the state vector space, and the state of the system is completely described by a unit state vector.
A2. The evolution of a closed quantum system can be characterized by a unitary transformation of a state vector, that is, from t0 to t we have
.
A3. An observable quantity corresponds to an Hermitian operator M, the eigenvalue m of operator M is a possible result of the experimental measurement of observable quantity, and the modular square of the projection from the state vector
to the eigenvector
is the probability of measuring the value of m, that is,
(25)
A4. The state space of a composite physical system is the tensor product of the state space of each subsystem. If the state function of the k-th subsystem is
, the state function of the composite system is given by
.
As mathematical axioms of linear operators on a Hilbert space, the above framework may be elegant and effective. However, as the physical theory that emphasizes the correspondence between theory and reality, the above framework and concept are broad, vague and fallible. The misunderstanding and debate about quantum mechanics are related to the abstraction and ambiguity of the framework. In what follows, we analyze the defects of the above framework on the specific issues already involved in this paper. From these analyses, we can also see the similarities and differences between the academic styles and requirements of mathematics and physics.
1) In order to be clear, the evolution law of a physical system must be described by specific dynamics, as is the case with all classical theories such as Newton mechanics, electrodynamics and general relativity. But the above framework of quantum theory has lost this scientific tradition, which is the main cause of the confusion. The hypothesis A2 is used to characterize the evolution process of the system, but it is only said that there is a unitary transformation of a state vector. However, the specific information about this unitary operator is not given clearly, the concept is similar to the “Tao” in Chinese philosophy, so this hypothesis is too vague.
Then, can all unitary transformations describe physical systems? Of course not, so this assumption is too broad. But is it certain that all microsystems can be characterized by a linear unitary transformation? Nor is it, because the evolution operator is nonlinear for an elementary particle with nonlinear potential, this assumption is too grudging in this case. Mathematical axiom system pursues abstraction, generality, purity and compatibility, but physical principles pursue the accuracy of description, the simplicity of expression, the clarity corresponding to reality, and of course, logical compatibility. Then, what assumptions should we choose to describe the evolution of a system? In the author’s opinion, in the non-relativistic case, it should be the Schrödinger equation, and if the potential contains spin-magnetic coupling term, the dynamic equation should be the following Pauli’s equation [31],
(26)
in which
is the total Hamiltonian operator of the system,
is the energy operator of the k-th particle in external fields,
is the interaction potential between k-th and l-th particles. In the case of N electrons we have
The total state function is defined by
Because the Hamiltonian operator is Hermitian, the state function
, as the solution of Schrödinger equation, satisfies the normalization condition. So, the evolution of the state function satisfies
, which is indeed equivalent to the unitary transformation of a unit vector. However, at this time, the transformation operator
is an operator determined by
, not a general linear operator in mathematical sense. Since
is equivalent to the total energy of the system, it has the clearest physical meaning and correspondence to reality. It is natural choice to adopt (26) as the basic principle of quantum mechanics in the case of non-relativistic theory, which is the same as the classical mechanics using Newton’s second law as the basic principle.
2) In the case of non relativity, if (26) is used as the basic dynamic equation, then the assumption A1 is superfluous, because the solution of Schrödinger equation or Pauli equation is equivalent to a unit vector in Hilbert space, the assumption A1 is actually a theorem. In addition, the assumption A4 is also superfluous, because the solution of Schrödinger equation of composite system is just like this. The detailed derivation of this conclusion were given in [31] [32] [33], and this result is also used in the calculation of (18) and (19). In the above framework, only the assumption A3 is reasonable to some extent, which provides the physical interpretation of the solution to Schrödinger equation and the corresponding rules between theory and measurement. This point has long been discussed in the literature [31]: “The only essentially new concept of the quantum theory imposed on the classical field theory is the hypothesis that an initial state
evolving into a final state
is undetermined in some cases and the probability is in proportion to
. However
is the projection of
on
, and
is a measurement of distance between the two vectors, so this interpretation is really reasonable in logic.” Thus it can be seen that the above framework A1-A4 may be not very suitable as a physical theory.
Besides, the Schrödinger Equation (26) has an important advantage, if an operator
can commutate with the Hamiltonian
, then they have common eigenstates, which is the condition to solve the equation by separating variables. If the Hamiltonian
can be decomposed into two commutating Hermitian operators
, and satisfy
, then the Equation (26) can be decomposed into two independent Schrödinger equations,
This decomposition is the condition that quantum mechanics can analyze some properties of the subsystems separately, such as the coupling effect of spin and magnetic field.
3) The above framework is misleading to researchers. For example, in EPR’s paper [1], let the eigenvalues of the physical quantity A acting on system I be
, and the corresponding eigenfunctions
. Then the wave function of the composite system of two particles is given by
(27)
where
is the coordinate of the particle II,
is just coefficients of the series expansion of wave function
. If we measure the particle I of the correlated system obtain a physical quantity
, then the state function of the composite system will reduce to a pure state
.
However, at first, the coefficient of
is not necessarily the eigenstate of A acting on particle II, and
is also not necessarily the solution to Schrödinger equation. For two particles moving away relatively, the solution of the Schrödinger equation should be time-dependent, that is, the coefficients in the series (27) should be time-dependent
. Therefore, according to Schrödinger equation, when the particle I is measured, the particle II does not necessarily collapse to eigenstate. Secondly, if the particles are charged, the electromagnetic interaction has dissipative effect on the non-energy eigenstate, so the non-energy eigenstate is unstable, which has energy exchange with the environment. In this case the system is in the quantum process. The wave function can only steadily stay at the energy eigenstate, i.e., the particle state, but cannot stay at the mixed state depicted by (27). Similarly, (10) is a mixed state rather than an energy eigenstate, so it is not a solution to Schrödinger equation. If we do not refer to the Schrödinger equation, but only refer to the framework A1-A4, these problems will be overlooked and concealed.
There is nothing mysterious about the explanation of probability. For particles in metastable state, the wave function takes an energy eigenstate or “particle state”. It is the critical point of the energy functional. At this state, the Schrödinger equation is very sensitive to external interference, and at what time the wave function will evolve into which possible eigenstate is indeed uncertain. This situation is similar to a small ball at the top of a smooth sphere, what time and which direction the small ball will roll off are uncertain. However, the “quantum process” is a deterministic process, and the quantum transition is transient.
4) Newton mechanics for N particles is only a 6N dimensional dynamic system, while Schrödinger equation (26) is infinite dimensional, so Schrödinger equation contains far more information than momentum, coordinates and so on. With regard to the relationship between quantum theory and classical mechanics, the correct procedure should be to explain classical mechanics by Schrödinger equation. This procedure requires a logical conceptual system and rigorous reasoning. It is not difficult to solve these problems, see the literature [31] [32] [33] for details.
However, the current interpretation of quantum mechanics is not feasible. For example, quantum mechanics holds that particle momentum is the eigenvalue of momentum operator, but in general, a solution to Schrödinger equation is impossible to be the eigenfunction of
. On the other hand, according to Noether’s theorem, the energy and momentum of particles correspond to the translation invariance of spacetime, and the momentum of any system must exist accurately. This leads to contradiction. In addition, the traditional view that classical mechanics is the limit of quantum mechanics as
is also improper, how can a basic physical constant take a limit? The real reason should be that when the moving scale of a particle is much larger than its own scale, then its motion can be described by classical mechanics. If a particle goes inside an atom, it has to be described by wave function. In fact, the macro body is also the case. In the solar system, the treatment of the earth as a point particle is accurate, but to study the earth’s movement near the earth, it must be treated as continuous medium. Of course, Schrödinger Equation (26) itself has limitations, and it can only describe the motion and properties of electrons in atoms and molecules. For more complex cases, it is not enough. For example, in the case of high-speed motion, the relativity effect must be taken into account; the nonlinear potentials are obviously contained in the elementary particles, and so on. In these cases, nonlinear Dirac equation must be used. Although this is another problem, all the researchers should keep these boundaries in mind.
5) For Schrödinger Equation (26), any reversible and differentiable 1-1 corresponding transformation of the wave function Ψ, such as (2), is mathematically equivalent to the original representation. This is the same as the legal coordinate transformation in the general theory of relativity, and an improper representation of the wave function will only make the problem more complicated. So the Bohm’s explanation of hidden variables for quantum mechanics is superfluous. Wave function is the basic property of an elementary particle. Like the electromagnetic field of a charge, there is nothing more important for elementary particles than these fields. The classical concepts such as coordinates, momentum and energy of a particle can be defined strictly by wave function, and the electric charge, the strong charge and so on are only some interaction coefficients in the dynamic equation [31] [32] [33]. The microscopic particle is firstly a 4 dimensional field, then a particle in macroscopic view.
6) With respect to the relationship between quantum theory and classical mechanics, some experiments can be designed to illustrate. These experiments should be related to the wave properties and particle properties simultaneously. The experiment to test the Einstein’s mass-energy relation is this type [33]. Here we propose again a simpler experiment. In the Stern-Gerlach experiment, assume the particles move along y-axis. The magnetic field to measure the magnetic moment of particles is substantially uniform along this direction, but it is symmetrical with respect to z-axis and non-uniform in the z direction, so that the particle beam is split into two beams in the z direction. This confirms that
the spin of an electron is
.
If we set up a rotating magnetic field distribution along the y-axis, that is, we convert the magnets in multi-stage Stern-Gerlach experiment into a continuous and spiral one, and set a movable display screen perpendicular to the y-axis. When the particle beam passes through the magnetic field along the y-axis, it also leaves two bright spots on the screen. If the screen moves slowly along the y-axis, we should see two bright spots rotating around the y-axis. This proves that the spin of the particles follows continuously the force line of magnetic field, rather than jumping as described by quantum theory.