Logical Difficulty from Combining Counterfactuals in the GHZ-Bell Theorems ()
1. Introduction
The Greenberger, Horn, Zeilinger (GHZ) theorem [1] has achieved a status similar to that of Bell’s theorem in its acceptance as a proof that local hidden variables are impossible in quantum mechanics. It has a similarity to Bell’s theorem in that it considers a mathematical relation among predicted results of alternative measurements that are not all performed. The alternatives consist of procedures that if performed together are noncommuting, and whose results then depend on their order of execution. Thus, if all the measurements were actually performed, their explicit noncommutation would have to be taken into account in predicting measurement outcomes.
The use of counterfactuals in no local hidden variables theorems relies on the assumption that counterfactual reasoning is intrinsically sound classically, but not quantum mechanically. However, examples given in Section 2.2 reveal that counterfactual reasoning commonly fails in the classical domain if it neglects the noncommutation of component procedures. It is then shown that parallel reasoning leads to the paradoxical results of the GHZ and Bell theorems in its neglect of noncommutation. The conclusion is that the discrepancy between quantum mechanical eigenvalues and calculations using counterfactuals of noncommuting procedures can no longer be taken as proof that local hidden variables are inconsistent with quantum mechanical observations.
A definition of counterfactuals and examples showing inconsistencies in their classical use are given in Sections 2.1 and 2.2. In Section 3, the accepted interpretation of the GHZ theorem is described following the treatment by Mermin [2], Home [3], Afriat and Selleri [4], and Greenberger [5], but in a manner showing the roll of counterfactual reasoning. In Section 4, a similar inconsistency due to use of counterfactuals in the Bell theorem is outlined. In this case, the logical inconsistency is manifested by violation of the Bell inequality, an algebraic expression that must be universally satisfied by cross-correlations of any data sets whatsoever.
2. Counterfactual Reasoning
2.1. Definition
In general, the term counterfactual refers to the predicted result of an unperformed act or consequence of a condition that is not true. In the present paper, the definition is further narrowed to distinguish it from various alternatives [6]. If one considers two procedures A and B that do not commute, the result of carrying out a sequence of the two depends on whether A or B is performed first. However, one may consider each procedure in isolation from the other in an exclusive-OR sense. The predicted measurement results of such procedures, that if performed together require noncommutation to be taken into account, are herein termed counterfactuals. (Since measurement outcomes for commuting procedures have simultaneous existence in quantum mechanics, they are of little concern here.)
2.2. Flaws in Classical Counterfactual Reasoning
It has been stated in the context of “no-go” theorems for hidden variables that counterfactual reasoning is used frequently in the classical world without any problem: it is logically trustworthy. The author proposes that this belief is in error as will now be shown by classical counter-examples.
We first take note of characteristics of classical noncommuting operations using a semi-facetious example given several years ago by Leon Cohen in a lecture at the Naval Research Laboratory: putting on shoes and socks. Consider this example from the point of view of counterfactuals. One may put on shoes alone, or socks alone in an exclusive-OR sense, and these acts have perfectly well defined meanings. However, one cannot consider these acts in a logical-AND sense unless noncommutation is taken into account. In that case, putting on socks and then shoes gives a different result from putting on shoes and then socks. Thus, converting the logical-OR case to an AND case in the sense of simultaneous existence, or conversion to commutation without conditionality, makes no physical or logical sense.
Classical operations are commonly noncommutative. Consider [7] the rotation of rigid bodies in three dimensional space. A rotation of +90˚ about the x-axis followed by +90˚ about the y-axis produces a different final orientation than if these rotations are carried out in reverse order.
Navigation on the surface of the earth is noncommutative: traveling 100 miles north followed by 100 miles west produces a different final position than the same actions carried out in reverse order due to the definitions of north and west on the spherical surface of the earth. However, in the special case where one begins from a position 50 miles south of the equator, the results are the same—but then the operations commute.
Consider a beam of polarized light into which are inserted polarizer filters, one at 45˚, one at 90˚ to the initial beam polarization direction. If both polarizers are simultaneously inserted into the beam, the overall transmitted energy depends on their order of placement. If any meaning can be attached to a combination of the two individual states separately, it must be different from that which follows from their actual joint noncommutative placement in the beam. (The measurement is noncommutative because the projection of a vector along a pass axis is transmitted while the perpendicular component is absorbed/discarded.)
Examination of these examples shows that in the classical world, one cannot combine counterfactuals of separate noncommuting operations and expect the result to equal the outcome of the operations performed together. The situation in quantum mechanics is the same on the basis of analysis to be given in Sections 3 and 4. Interestingly, Griffiths has reached a similar conclusion [8] stating that (counterfactual) results of noncommutative operations “cannot be combined to form a meaningful quantum description” in the consistent histories interpretation of quantum mechanics, and that their joint use is meaningless.
3. The GHZ Theorem
The Pauli spin operators, , and are used to define three-particle operators, ,:
, , , (1)
where the superscripts indicate the particle to which the operator is applied. In the theorem, , , and are ultimately applied to an entangled state of three spin particles and each corresponds to a measurement of the product of their spins. Using the anti-commutation properties of the spin operators,
(2)
and the fact that operators on different particles commute, it is found that, , and commute. For example, to show that and commute, multiply
. (3)
Using the anti-commutation property of Equation (2),
(4)
The other commutations may be demonstrated similarly.
One may now consider the product operator:
. (5)
Since operations on different particles commute, this may be written
, (6)
as long as the right to left sequence of operators on each particle is unchanged from Equation (5). Relation (6) may be simplified by using Equation (2), particularly the anti-commutation relation, to obtain
(7a)
If one defines then and
. (7b)
It is emphasized that the minus sign in Equation (7a) that results from multiplying is due to the noncommutation of operations on particle 2. Finally from Equation (2) it follows similarly to the examples just given, that, , , and commute for any.
The GHZ theorem depends on the above state-independent properties of the and further consequences that follow from the fact that they have a common entangled eigenstate
. (8)
In Equation (8), and for particles designate the eigenkets of with eigenvalues +1 and, respectively. From the well known relations [9]:
(9)
and the definitions of, , and, one obtains
. (10)
Then from Equation (7), which follows from the spin anti-commutation relations, action of on yields
. (11)
Since the’s commute and have a common eigenstate, they are simultaneously measurable on three particles in state. Thus, for example, the same value of occurs at each of its occurrences in the sequence. However, a measurement of must be made in such a way that only the product of the spin values and not their individual values are revealed [5]. Otherwise, a state produced by measuring an individual would collapse to one yielding a specific spin eigenvalue for each particle, and this state would not be an eigenstate of any other. Thus, Equations (10) and (11) would no longer hold.
To see this in the case of, expand in terms of x and y spin basis states using:
(12)
This produces a sum of terms, each with spin-product equal to 1:
(13)
A measurement of that reveals individual spins would collapse Equation (13) to one of these terms. Suppose it is the first for which all spins are positive. Now consider measurement of: the resulting measured spin product is no longer necessarily +1. On the other hand, if is measured first, its spin product will be the predicted +1, but a following measurement of may now produce a spin product other than +1. The situation is the same for any pair of’s if actual spins are revealed. The spin products corresponding to measurement of any one satisfy Equation (10) or (11), but measurement of a second, , no longer necessarily satisfies these relations. Thus, spinrevealing measurements of different do not commute.
The usual argument of the GHZ theorem that states that Equations (10) and (11) are inconsistent with local realism, i.e., the existence of hidden variables or preexisting values for measurement outcomes, is as follows: if local hidden variables supplementing the information in are assumed to exist, their values would determine the components of spin found in individual measurements performed on the three particles. If the particles were separated after the formation of under the assumption of locality, the measured value obtained for any selected particle spin component would be independent of the choice of component measured on any other distant particle. Thus, the value of obtained for particle 3 would be independent of whether one chose to measure or on particle 2. In view of the analysis above, one could measure one of, , or to obtain
(14a)
(14b)
(14c)
respectively, where the m’s, each equal to 1, denote numerical values of the measurements. But values of the same symbol occurring in different relations (14a-c) must be the same based on the condition that the particles do not interact after separation, and on the assumption that the values result from initial conditions determining the measurements of, and, even though only one of the’s can be measured. On the assumption that noncommuting counterfactuals may be combined, the product of (14a-c) is
(15)
where values for each of the two occurrences of, are equal as deduced above.
However, quantum mechanics gives a different result for the combined operation of, and on due to noncommutation of operations on particle 2:
(16a)
so that
. (16b)
On the assumption that the use of combined counterfactuals in the proof of “no-go” theorems leads to an unavoidable result of classical reasoning, the confirmed measurements of quantum mechanics are concluded to be inconsistent with preexisting or predetermined values for local variables. However, as seen from examples, combined counterfactuals of noncommuting operations do not yield results consistent with those obtained by taking noncommutation into account in the classical realm. Thus, such reasoning does not distinguish between quantum and classical logic. Finally, even if cases exist where a counterfactual construction makes logical sense in and of itself, a different numerical value will be produced than when real experimental operations are performed that are noncommutative.
4. Bell’s Theorem
The present analysis would not be complete without a review of Bell’s theorem, and the contribution of the use of counterfactuals of noncommuting operations to its flaws. Previously identified logical problems in the theorem will only be outlined here since their analysis has been given in [10-14].
It is easy to show that the inequality that Bell derived is universally satisfied by cross-correlations of any three or four (as appropriate) data sets consisting of [10,11]. This fact depends only on the assumed existence of the data sets, and is independent of any other property such as their origin in random, deterministic, local, or nonlocal processes. Bell did not realize that his inequality resulted from the use of cross-correlation alone in its development. He attributed the result to consciously chosen assumptions: all measurements are represented by a function of random variables (counterfactually) defined at all instrument settings, and locality. He then assumed that the random function he postulated resulted in a second order stationary process [15] in that all correlations could be represented by the same co-sinusoidal function of coordinate differences. Spin measurement operations on a given particle at different instrument settings were mathematically treated as commutative [14].
However, for more than two measurements on a side in Bell experiments, actual performable measurements are noncommutative according to quantum mechanics. Bell mistakenly indicated in his book that this consideration could be ignored by using noncommuting counterfactuals interchangeably with measurements [16] to produce counterfactual experimental data. Of course, since the Bell inequality results from the mere fact of crosscorrelation, it is satisfied by the cross-correlations of data sets of commutative second order stationary processes as assumed by Bell, and it is thus derivable upon the assumption of such processes, even though it holds generally.
In the quantum mechanical two particle experiments to which this inequality has been applied, consideration of more than one measurement per particle implies that noncommutation must be taken into account. The author has previously identified two experiments yielding more than one measurement per side that could produce data for cross-correlation under this condition. One would use an additional apparatus in tandem on either side of the usual Bell experiment operating in a retrodictive mode [10]. The second would use separate experiments from which correlations conditional on the usual outcomes could be computed [12,13]. Either of these would yield a third correlation that is functionally different from those obtained in standard Bell experiments such that the three would satisfy the Bell inequality as required by basic mathematics.
In practice, the data from Bell experiments have not been cross-correlated, and each pair of correlations is derived from an independent experimental run. If the underlying process were second-order stationary as assumed by Bell, there would only be one correlation functional form to determine, and ensemble averaged crosscorrelations would yield the same function as that measured in independent runs. The Bell inequality would be satisfied by this correlation function as measured in independent runs up to small statistical fluctuations. The violation of the Bell inequality by experimentally confirmed cosine correlation functions proves that the underlying process is not statistically stationary, contrary to what is widely assumed. This is consistent with the noncommutative process described by quantum mechanics that predicts different correlation functional forms between some variables. The logical inconsistencies in the usual interpretation of the Bell theorem begin with combining counterfactuals of noncommutative processes with real data, and are manifested in the violation of the Bell inequality, an inequality that must be universally satisfied by the cross-correlations of any data sets whatsoever consisting of.
5. Conclusions
If a theory calculates results of actually performable experiments, the results must logically depend on taking noncommuting operations properly into account. The nohidden-variables theorems historically contrast quantum results based on noncommutation with classical results based on its neglect. The narrative accompanying these theorems is that neglect of noncommutation of counterfactuals is logically sound in the classical domain, so it is appropriate to attribute the peculiarly inconsistent results that follow to nonlocality, or the nonexistence of hidden variables or pre-existing values for measurements. But if, as has been shown, the use of counterfactual reasoning in no-hidden-variables theorems is flawed both quantum mechanically and classically, the usual paradoxical choices emerging from these theorems no longer have logical motivation. That said, lack of validity of no-hidden-variables theorems does not, in and of itself, imply that local hidden variables exist.
The content of this paper was presented in [17]. The present paper discusses the central idea in greater detail than does the longer [18], while the latter includes additional variations not dealt with here.
6. Acknowledgements
I would like to thank Mike Steiner of Inspire Institute for useful critical comments on the manuscript, and Armen Gulian and Joe Foreman of the quantum group at Chapman University, Burtonsville, MD for many useful discussions relating to the presentation of the material.