1. Introduction
Kantorovich-Type inequalities for positive matrices on finite dimensional Hilbert Spaces are extensions of the original Kantorovich inequality. Let T be a positive matrix with the smallest eigenvalue m and the largest eigenvalue M, then the original Kantorovich inequality states that
(1)
In fact
(2)
Please see [1] . In 1968 Gustafson independently proved that for such a positive matrix we have
(3)
Please see [2] [3] . It turned out that (2) and (3) are equivalent. Thus Gustafson gave a totally different proof for the original Kantorovich inequality. Furthermore, he called the quantity
(4)
the antieigenvalue of T. Gustafson’s proof of (3) was based on his min-max theorem which states that for such a positive operator we have
(5)
At the same time, Gustafson defined the antieigenvalue of an arbitrary operator T to be
(6)
Thus computing (6) for an arbitrary operator T is equivalent to extend the original Kantorovich inequality to an arbitrary operators T. The first attempt to compute a value for (6) was made by Davis in [4] , using the shell of a matrix. He found some partial and implicit results for (6) when T is an accretive normal matrix on a finite dimensional Hilbert space. In [5] and [6] Gustafson and Seddighin found more explicit results for (6), assuming that T is a normal matrix on a finite dimensional space. They proved that in this case (6) is always expressed by at most two eigenvalues of T. This property that later was generalized by Seddighin as The Two Nonzero Component Lemma (or TNCL for short) was implicitly proved in [5] .
Lemma 1 (The Two Nonzero Component Lemma) Let
be the set of all sequences with nonnegative terms in the Banach Space
, i.e.,
(7)
Let
be a function from Rm to R. Assume
(8)
Then the minimizing vectors for the functional
(9)
on the convex set
(10)
have at most two nonzero components.
A geometric proof for this lemma in the finite dimensional case is implicit in the proof of Theorem 5.1 in [5] . Using the notations in the Lemma 1 above, in Theorem 5.1 of [5] we had the specific functions
(11)
(12)
and
(13)
Here we have replaced
,
,
in Theorem 5.1 of [5] with
,
,
respectively to compare the situation with Lemma 1 above. Also, an analytical proof for Lemma 1 is implicit in the proof of Theorem 2.2 in [7] for the specific functions
(14)
with
(15)
and
(16)
Please note there is a harmless error in expression 2.18 in [7] . In that expression we must have
, instead of
). What make the geometrical and analytical proofs of the Lemma in these special cases possible are the following two facts: First, the convexity of the set
(17)
Second, a special property that the functions
(18)
involved possess. If we set
(19)
then all restrictions of the form
(20)
of
(21)
have the same algebraic form as
(22)
itself. For example, if
(23)
(this is the function appearing in the proof of Theorem 2.2 in [7] ), then we have
(24)
which has the same algebraic form as
(25)
Indeed, for any j,
; all restrictions of the function
(26)
obtained by setting an arbitrary set of j components of
(27)
equal to zeros have the same algebraic form as
(28)
Obviously, not all functions have this property. For instance, for the function
(29)
we have
(30)
which does not have the same algebraic form as
(31)
To avoid repetitions in our papers, we will not present a separate proof for Lemma 1 here. Instead, we note that the proof of this Lemma is embedded in the proof of Theorem 2.2 of [7] . There, one can redefine the function
or
(depending on the dimension) to be
, where F,
are as outlined in the statement of Lemma 1. With this change the proof of Lemma 1 is obtained.
The Two Nonzero Component Lemma was formulated as above by Seddighin in [8] and has since been applied in a number of his papers (see [6] [9] - [15] ). It is important to mention that TNCL does not identify which one or two components of the minimizing vectors are nonzero. Almost all Kantorovich-Type inequalities and corresponding antieigenvalue quantities are reduced to functions which meet the conditions of TNCL. Nevertheless, TNCL is a dimension reducing optimization lemma and as such can be used in instances where matrices or operators are not involved (see, for example, [11] ). Furthermore, in [12] Seddighin proved that Gustafson’s min-max Theorem can be obtained as a corollary from The Two Nonzero Component Lemma. Please see [16] for the min-max Theorem, which was the foundation for Gustafson’s Antieigenvalue Theory that he also calls it Operator Trigonometry.
2. (q,F) Kantorovich-Type Inequality
Let T be a positive matrix on a finite dimensional space satisfying
. Also let
be a real valued convex function on
and q be a real number, then the inequality
(32)
holds for every unit vector x under one of the following conditions
(33)
or
(34)
The Inequality (32) is a nontrivial Kantorovich-Type inequality which is a generalization of the original Kantorovich inequality. In this paper we call (32) the (q,F) Kantorovich-Type inequality. Please see [17] . The Inequality (32) is equivalent to
(35)
under the conditions stated above. Therefore, the inequality is established if we show
(36)
The quantity
resembles
and in accordance to
Antieigenvalue Theory we call it
antieigenvalue of T and denote it by
. In general in accordance to Antieigenvalue Theory, if T is a normal
operator, we call
the
Antieigenvalue of T and denote it by
.
The following is a generalization of
Kantorovich-Type inequality to normal Hilbert space operators.
Theorem 2 Let T be a normal operator on a separable Hilbert space. Suppose
,
, are the eigenvalues of T. Let
be the eigenspace corresponding to
and let
be the orthogonal projection on
. Assume F is an analytic function defined on
. For each vector x let
. If x is a minimizing vector with
for
, then
we have one of the following cases: 1) Only one of the vectors
is nonzero, i.e.,
, for some i, and
for
. In this case we have
(37)
2) Only two of the vectors
and
are nonzero and the rest of the components of f are zero. i.e.,
,
and
if
and
. In this case we have
(38)
and
(39)
Furthermore,
(40)
Proof. Direct computations show that
(41)
Let
. Then the problem is reduced to finding
(42)
on the convex set
(43)
Now by the Two Nonzero Component Lemma, a minimizing vector t for
(44)
has either one or two nonzero components. First, if for a minimizing vector t we have
and
,
then
(45)
Second, if a minimizing vector t for
(46)
has two nonzero components
and
then the problem is reduced to finding the minimum of the function
(47)
on the line segment
(48)
An application of Lagrange Multipliers shows that we must have
(49)
and
(50)
If we substitute (49) and (50) in (47) and simplify, we obtain
(51)
The following corollary states the (q,F) Kantorovich-Type inequality for normal operators on a separable Hilbert space, without mentioning the
minimizing vectors for
(which as we saw make the inequality an
equality). Traditionally, some inequalities are written without stating when the inequality becomes equality. The reason is that, as we explain later in this paper, they were driven by other methods without computing the vectors that make the inequality an equality. However, as we remark at the end of this paper, vectors which make an inequality equality have applications of their own.
Corollary 3 Let T be a normal operator on a separable Hilbert space. Suppose
,
, are the eigenvalues of T, F is an analytic function defined on
, and q is a real number. Then one of the following inequalities is satisfied: 1) There exist an eigenvalue
such that
(52)
for all unit vectors x. 2) There exist a pair of eigenvalues
and
such that
(53)
for all unit vectors x.
3. Weighted (q,F) Kantorovich-Type Inequality
In [18] Gustafson and Seddighin generalized the definition of antieigenvalue given by (6) to weighted antieigenvalue defined by
(54)
where a and b are a pair of real numbers, at least one of them nonzero. In particular when
we have
(55)
which is called the symmetric antieigenvalue of T and is denoted by
. The symmetric antieigenvalue is a balanced definition of antieigenvalue because it depends on both
and
by the same factor. Note that, as we proved in [18] , it turned out that the weighted antieigenvalue of an operator T is the same as the antieigenvalue of another operator, namely
(56)
where
. (57)
If we define weighted
antieigenvalue of T by
(58)
then we have,
Theorem 4 For any normal operator T we have
(59)
where
(60)
Proof. Using spectral mapping theorem we have
(61)
and
(62)
Theorem 5 Let T be a normal operator on a separable Hilbert space. Suppose
,
, are the eigenvalues of T. Let
be the eigenspace corresponding to
and let
be the orthogonal projection on
. Assume F is an analytic function defined on
and q is a real number. Furthermore Let a and b be real numbers, at least one of them nonzero. For each vector x let
. If x is a minimizing vector with
for
then we have one of the following cases: 1) Only one of the vectors
is nonzero. i.e.,
, for some i, and
for
. In this case we have:
(63)
2) Only two of the vectors
and
are nonzero and the rest of the components of f are zero. i.e.,
,
and
if
and
. In this case we have
(64)
and
(65)
Furthermore
(66)
Proof. Let
(67)
and let
,
, be the set of antieigenvalues of A. By the spectral mapping theorem, we have
(68)
By Theorem 2 we have one of the following two cases. 1) Only one of the vectors
is nonzero. i.e.,
, for some i, and
for
. In this case we have:
(69)
2) Only two of the vectors
and
are nonzero and the rest of the components of f are zero. i.e.,
,
and
if
and
. In this case we have
(70)
and
(71)
Furthermore,
(72)
By Theorem 4 we have
(73)
The proof is completed by substituting A in terms of T and eigenvalues of A in terms of eigenvalues of T in (69), (70), (71), and (72).
Corollary 6 Let T be a normal operator on a separable Hilbert space. Suppose
,
, are the eigenvalues of T, F is an analytic function defined on
, and q is a real number. Also assume a and b are real numbers, at least one of them nonzero. Then one of the following inequalities is satisfied, 1) There exist an eigenvalue
such that
(74)
for all unit vectors x. 2) There exist a pair of eigenvalues
and
such that for all unit vectors x,
(75)
where C and D are defined by
(76)
and
(77)
Remark 7 In this paper and in some of our other papers we have proved Kantorovich-Type inequalities by converting them to an Antieigenvalue-Type problem and then finding the minimizing vectors for the Antieigenvalue-Type problems. These vectors are the vectors that Kantorovich-Type inequalities become equalities. Traditionally authors have established Kantorovich-Type inequalities for a positive operator T by going through a two-step process which consists of computing upper bounds for suitable functions on intervals containing the spectrum of T and then applying the standard operational calculus to T (see [17] ). These methods have limitations as they do not shed light on vectors or matrices for which inequalities become equalities. This is one of the significant aspect aspects of our work here, as the minimizing vectors have applications of their own. They have particularly applications in numerical analysis (see [16] ). Some authors have taken a variational approach to find the minimizing vectors for Antieigenvalue-Type problems (vectors that make the corresponding Kantorovich-Type inequality an equality). In a variational approach one differentiates the quantities involved to arrive at an “Euler Equation” and then solve the Euler Equation to obtain the minimizing or maximizing vectors. A direct variational approach generally does not produce a Euler equation which can be solved easily (see [16] ). Thus, identifying the minimizing vectors is another significant aspect of our work here.
Remark 8 While we have used TNCL to show that one or two eigenvalues are involved in expressing the (q,F) Kantorovich Type inequality for normal operators acting on a separable Hilbert space, TNCL does not enable us to pinpoint one or two eigenvalues involved. However, if T is a normal matrix it is possible to pinpoint exactly which eigenvalues express Antieigenvalue-Type quantities. We have shown this in [6] and [8] , using detailed convexity arguments.
Remark 9 If T is an arbitrary operator on an infinite dimensional Hilbert space we can numerically approximate its Antieigenvalue-Type quantities for T and hence establish approximate Kantorovich-Type inequalities for T (please see [19] ).
Conclusion 10 The results in this paper are milestones in the evolution of Kantorovich-Type inequalities. The simplest Kantorovich-Type inequality is the Kantorovich inequality for real numbers which states
(78)
where
(79)
and
are non-negative numbers with
(80)
The Inequality (1) was the first generalization of (78) to positive matrices by Kantorovich himself. In 1980, C. Davis extended the Kantorovich inequality from positive matrices to normal matrices. However, Davis assumed that the numerical range of the normal matrix is contained in the right half plain (i.e., the matrix is accretive). Furthermore, Davis was not able to identify the vectors for which the inequality becomes equality. Our results here are major steps in generalizing Kantorovich-Type inequalities for the following reasons: 1) We generalized Kantorovich-Type inequalities from positive matrices to infinite dimensional normal operators. 2) There are no conditions on the numerical range of normal operators. 3) We shed light on the vectors for which Kantorovich-Type inequalities become equalities.