Least-Squares Solutions of Generalized Sylvester Equation with Xi Satisfies Different Linear Constraint ()
Received 12 March 2016; accepted 11 June 2016; published 14 June 2016

1. Introduction
A matrix
is said to be a Centro-symmetric matrix if
for all
. A matrix
is said to be a Bisymmetric matrix if
for all
. Let
and
denote the set of
real matrices,
real symmetric matrices,
real Centro-symmetric matrices and
real Bisymmetric matrices, respectively.
where
denotes ith column of
unit matrix. For a matrix
, we denote its transpose, traced by
respectively. In space
, we define inner product as:
for all
, then the norm of a matrix A generated by this inner product is, obviously, Frobenius norm and denoted by
.
Denote
![]()
Obviously, K, i.e.
, is a linear subspace of real number field.
In this paper, we mainly consider the following two problems:
Problem I. Given matrices
, ![]()
, find matrix group
such that
![]()
Problem II. Denote by
the solution set of Problem I. Find matrix group
, such that
![]()
In fact, Problem II is to find the least norm solution of Problem I.
There are many valuable efforts on formulating solutions of various linear matrix equations with or without linear constraint. For example, Baksalary and Kala [1] , Chu [2] [3] , Peng [4] , Liao, Bai and Lei [5] and Xu, Wei and Zheng [6] considered the nonsymmetric solution of the matrix equation
(1)
by using Moore-Penrose generalized inverse and the generalized singular value decomposition of matrices, while Chang and Wang [7] considered the symmetric conditions on the solution of the matrix equations
(2)
Zietak [8] [9] discussed the
-solution and Chebyshev-solution of the matrix equation
(3)
Peng [10] researched the general linear matrix equation
(4)
with the bisymmetric conditions on the solutions. Vec operator and Kronecker product are employed in this paper, so the size of the matrix is enlarged greatly and the computation is very expensive in the process of solving solutions. Iterative algorithms have been received much attention to solve linear matrix equations in recent years. For example, by extending the well-known Jacobi and Gauss-seidel iterations for
, Ding, Liu and Ding in [11] derived iterative solutions of matrix equations
and generalized Sylvester matrix equations
. By absorbing the thought of the conjugate gradient method, Peng [12] presented an iterative algorithm to solve Equation (1). Peng [13] , Peng, Hu and Zhang [14] put forward an iterative method for bisymmetric solution of Equation (4). These matrix-form CG methods are based on short recurrences, which keep work and storage requirement constant at each iteration. However, these iteration methods are only defined by the Galerkin condition, but lack of a minimization property, which means that the algorithm may exhibit a rather irregular convergence, and often results in a very slow convergence. Lei and Liao [15] presented that a minimal residual algorithm could remedy this problem, and this algorithm satisfies a minimization property, which ensures that this method possesses a smoothly convergence.
However, to our best knowledge, the unknown matrix with different linear constraint of linear matrix equations, such as Equations ((1)-(4)), has not been considered yet. No loss of generality, we research the following case
(5)
which has four unknown matrices and each is required to satisfy different linear constraint. We should point out that the matrices
are experimentally occurring in practices, so they may not satisfy solvability conditions. Hence, we should study the least squares solutions, i.e. Problem I. Noting that it is obvious difficulties to solve this problem by conventional methods, such as matrix decomposition and ver operator, hence iterative method is considered. Absorbing the thought of the minimal residual algorithm presented by Lei and Liao [15] , and combing the trait of problem, we conduct an iterative method for solving Problem I. This method can both maintain the short recurrence and satisfy a minimization property, i.e. the approximation solution minimizes the residual norm of Equation (5) over a special affine subspace, which ensures that this method converges smoothly.
The paper is organized as follows. In Section 2, we first conduct an iterative method for solving Problem I, and then describe the basic properties of this method; we also solve Problem II by using this iterative method. In Section 3, we show that the method possesses a minimization property. In Section 4, we present numerical experiments to show the efficiency of the proposed method, and use some conclusions in Section 5 to end our paper.
2. The Iterative Method for Solving Problem I and II
In this section, we firstly introduce some lemmas which are required for solving Problem I, we then conduct an iterative method to obtain the solution of Problem I. We show that, for any initial matrix group
, the matrix group sequences
generated by the iterative method converge to a solution of Problem I within finite iteration steps in the absence of roundoff errors. We also show that the unique least norm solution of Problem I can be obtained by choosing a special kind of initial matrix group.
Lemma 1. [16] [17] . A matrix
if and only if
.
A matrix
if and only if
.
Lemma 2. Suppose that a matrix
, then
.
Suppose that a matrix
, then
.
Proof: Its proof is easy to obtain from Lemma 1. W
Lemma 3. Suppose that
,
,
,
then
![]()
Proof: It is easy to verify from direct computation. W
Lemma 4. (Projection Theorem) [18] . Let X be a finite dimensional inner product space, M be a subspace of X, and
be the orthogonal complement subspace of M. For a given
, there always exists an
such that
![]()
where
is the norm associated with the inner product defined in X. Moreover,
is the unique minimization vector in M if and only if
![]()
Lemma 5. Suppose
is the residual of matrix group
corresponding to Equation (5), i.e.
, if the following conditions are satisfied simultaneously,
(6)
then the matrix group
is a solution of Problem I.
Proof: Let
![]()
obviously, Z is a linear subspace of
. For matrix group
, denote
![]()
then
. Applying to Lemma 4, we know that
is a solution of Problem I if and only if
![]()
i.e. for all
,
![]()
By Lemma 3, it is easy to verify that if the equations of (6) are satisfied simultaneously, the expression above holds, which means
is a solution of Problem I. W
Lemma 6. Suppose that matrix group
is a solution of Problem I, then arbitrary matrix group
can be express as
where matrix group
satisfies
(7)
Proof: Assume that matrix group
is a solution of Problem I. If
, then by Lemma 5 and its proof process, we have
![]()
which implies matrix group
satisfies (7).
Conversely, if matrix group
![]()
where matrix group
satisfies (7), then
![]()
which means matrix group
W
Next, we develop iterative algorithm for the least-squares solutions with
satisfies different linear constraint of matrix equation
![]()
where
,
and C are given constant matrices, and
is the unknown matrices group to be solved.
Algorithm 1. For an arbitrary initial matrix group
, compute
Step 1. ![]()
![]()
Step 2. If
, then stop; else,
, and compute
Step 3. ![]()
![]()
Step 4. Go to step 2.
Remark 1. 1) Obviously, matrices sequence
generated by Algorithm 1 satisfies
![]()
2)
is the residual of Equation (5), when ![]()
3) Algorithm 1 implies that if
, then the corresponding matrix group
is the solution of Problem I.
In the next part, we will show the basic properties of iteration method by induction. First for convenience of discussion in the later context, we introduce the following conclusions from Algorithm 1. For all ![]()
![]()
Lemma 7. For matrices
,
(r = 1, 2, 3, 4) and
generated by Algorithm 1, if there exist a positive integer k such that
,
, and
for all
, then we have
1) ![]()
2) ![]()
3) ![]()
Proof: For
, it follows that
![]()
![]()
![]()
Assume that the conclusions
![]()
hold for all
, then
![]()
![]()
![]()
By the assumption of Equation (3), we have
![]()
Then for j = s,
![]()
![]()
![]()
Then the conclusion
and the assumption
show that
for all
. By the principal of induction, we know that Eq.(3) holds for all
, and Equation (1) and Equation (2) hold for all
due to the fact that
holds for all matrices A and B in
. W
Lemma 7. shows that the matrix sequence
![]()
generated by Algorithm 1 are orthogonal each other in the finite dimension matrix space
. Hence the iterative method will be terminated at most
steps in the absence of roundoff errors.
It is worth to note that the conclusions of Lemma 7 may not be true without the assumptions
and
. Hence it is necessary to consider the case that
or
.
If
, which implies
, it follows that
.
If
, which implies
, then we have
, making inner product with
by both side, yields
![]()
So the discussions above show that if there exist a positive integer i such that the coefficient
or
, then the corresponding matrix group
is just the solution of Problem I.
Together with Lemma 7 and the discussion about the coefficient
, we can conclude the following theorem.
Theorem 1. For an arbitrary initial matrix group
, the matrix group sequence
generated by Algorithm 1 will converge to a solution of Problem I at infinite iteration steps in exact arithmetic.
By choosing a special kind of initial matrix group, we can obtain the unique least norm of Problem I. To this end, we first define a matrix set as follows
![]()
where
. Evidently, S is a linear subspace of K.
Theorem 2. If we choose the initial matrix group
, especially, let
, we can obtain the least norm solution of Problem I.
Proof: By the Algorithm 1 and Theorem 1, if we choosing initial matrix group
, we can obtain the solution
of Problem I with finite iteration steps and there exist a matrix
such that the solution
can be represented that
![]()
By Lemma 6 we know that arbitrary solution of Problem I can be express as
![]()
where matrix group
satisfies (7).
Then
![]()
So we have
![]()
which implies that matrix group
is the least norm solution of Problem I. W
Remark 2. Since the solution of Problem I is no empty, so the
is a closed convex linear subspace, hence it is certain that the least norm solution group
of Problem I is unique, and
. If matrix group
is a solution of Problem I, then it just be the unique least norm solution of Problem I, i.e.
.
3. The Minimization Property of Iterative Method
In this section, the minimization property of Algorithm 1 is characterized, which ensures the Algorithm 1 converges smoothly.
Theorem 3. For an arbitrary initial matrix group
, the matrix group
generated by Algorithm 1 at the kth iteration step satisfies the following minimization problem
![]()
where F denote a affine subspace which has the following form
![]()
Proof: For arbitrary matrix group
, there exist a set of real number
such that
![]()
Denote
![]()
by the conclusion Equation (2) in Lemma 7, we have
![]()
where
is the corresponding residual of initial matrix group
. Algorithm 1 show that the matrix
can be express as
![]()
Because
is a continuous and differentiable function with respect to the k variable
, we easily know that
![]()
if and only if
![]()
It follows from the conclusion in Lemma 7 that
![]()
By the fact that
![]()
We complete the proof. W
Theorem 3 shows that the approximation solution
minimizes the residual norm in the affine subspace F for all initial matrix group within K. Furthermore, by the fact
, then we have
![]()
which shows that the sequence
![]()
is monotonically decreasing. The descent property of the residual norm of Equation (5) ensures that the Algorithm 1 possesses fast and smoothly convergence.
4. Numerical Examples
In this section, we present numerical examples to illustrate the efficiency of the proposed iteration method. All the tests are performed using Matlab 7.0 which has a machine precision of around 10−16. Because of the error of calculation, the iteration will not stop within finite steps. Hence, we regard the approximation solution group
as the solution of Problem I if the corresponding
satisfies
.
Example 1. Given matrices
and C as follows:
![]()
![]()
![]()
![]()
![]()
![]()
Choose the initial matrices
where 0 denotes zero matrix in appropriate dimension. Using Algorithm 1 and iterating 74 steps, we have the unique least norm solution
as follows:
![]()
![]()
![]()
![]()
with
![]()
And
![]()
If we let the initial matrix
, noting that
within K but not within S, then we have
![]()
![]()
![]()
![]()
with
![]()
And
![]()
Example 2. Suppose that the matrices
are the same as Example 1, let
, where
,
,
,
, that is to say, Equation (5) is consistent over set K. Then similarly Algorithm 2.1 in Peng [14] we can conduct another iteration algorithm as follows:
Algorithm 2. For an arbitrary initial matrix group
, compute
Step 1. ![]()
![]()
Step 2. If
, then stop; else,
, and compute
Step 3. ![]()
![]()
Step 4. Go to step 2.
The main differences of Algorithm 1 and Algorithm 2 are: in Algorithm 1 the selection of coefficient
make
, and
such that
but in Algorithm 2, the choosing of
such that
, and
such that
. Noting that Algorithm 2 satisfies
the Galerkin condition, but lacks of minimization property. Choosing the initial matrix
where 0 denotes zero matrix in appropriate dimension, by making use of Algorithm 1 and Algorithm 2, we can
![]()
Figure 1. The comparison of residual norm between these two algorithm.
obtain the same least norm solution group, and we also obtain the convergence curves of residual norm shown in Figure 1. The results in this figure show clearly that the residual norm of Algorithm 1 is monotonically decreasing, which is in accordance with the theory established in this paper, and the convergence curve is more smooth than that in Algorithm 2.
Acknowledgements
We thank the Editor and the referee for their comments. Research supported by the National Natural Science Foundation of China (11301107, 11261014, 11561015, 51268006).
NOTES
![]()
*Corresponding author.