Explicit Iterative Methods of Second Order and Approximate Inverse Preconditioners for Solving Complex Computational Problems

Abstract

Explicit Exact and Approximate Inverse Preconditioners for solving complex linear systems are introduced. A class of general iterative methods of second order is presented and the selection of iterative parameters is discussed. The second order iterative methods behave quite similar to first order methods and the development of efficient preconditioners for solving the original linear system is a decisive factor for making the second order iterative methods superior to the first order iterative methods. Adaptive preconditioned Conjugate Gradient methods using explicit approximate preconditioners for solving efficiently large sparse systems of algebraic equations are also presented. The generalized Approximate Inverse Matrix techniques can be efficiently used in conjunction with explicit iterative schemes leading to effective composite semi-direct solution methods for solving large linear systems of algebraic equations.

Share and Cite:

Lipitakis, A. (2020) Explicit Iterative Methods of Second Order and Approximate Inverse Preconditioners for Solving Complex Computational Problems. Applied Mathematics, 11, 307-327. doi: 10.4236/am.2020.114023.

1. Introduction

During the last decades, considerable research effort has been directed to the solution of complex linear and nonlinear systems of algebraic equation by using a class of iterative methods. This class includes the conjugate gradient method and its hybrid multi-variants. The conjugate gradient method originally introduced by Hestenes and Stiefel , was a direct solution method but later on has been extensively used as an iterative method for solving efficiently large sparse linear and nonlinear systems of complex computational problems   . The incomplete factorization methods and various hybrid preconditioning techniques      are well known and have been efficiently applied for solving iteratively large sparse linear systems of algebraic equations      .

Certain theoretical issues on this subject, such as 1) the stability and conditions of correctness of approximate system matrix in the form of product of triangular matrix factors, and 2) the convergence analysis of the iterative methods and the quantitative evaluation of the convergence rate of the iterative schemes 4, are of particular interest for the choice of the most efficient methods for systems with predetermined properties.

In the framework of this research work and in order to substantiate the main motivation for the efficient usage of the second order iterative method for solving nonlinear systems, the following basic related questions will be considered: 1) are the second order iterative methods comparable (or even superior) to first order iterative methods for solving nonlinear systems? A survey of related research work will be given; 2) are second order iterative schemes preferable than the first order iterative schemes for solving very large complex computational problems? The computational complexity per iterative step will be also examined.

The structure of this research paper is as follows: in Section 2, several advantages of second order iterative methods in comparison with the first order iterative methods for solving nonlinear systems are presented. In Section 3, a class of general iterative methods of second order is described, while in Section 4, certain explicit iterative schemes and approximate inverse preconditioners are introduced. In Section 5, exact and approximate inverse matrix algorithmic techniques are introduced, while in Section 6, some aspects of stability and correctness of incomplete factorization methods are presented. Finally, in Section 7, the convergence analysis and quantitative evaluation of convergence rate of incomplete factorization methods are discussed.

2. General Iterative Methods of Second Order: Part I

In recent years, considerable research effort has been focused in the topic of second order iterative methods. In order to substantiate our motivation in our research study, we present a synoptic survey of related computational methods developed on the subject. Several early iterative methods of second order with fixed parameters or variable parameters have been extensively studied    .

A class of (optimal) second degree iterative methods for accelerating basic linear stationary methods of first degree with real eigenvalues has been presented   and has been extended as application of conformal mapping and summation techniques for the case when eigenvalues are contained in elliptical regions in the complex plane   . Another similar contribution on optimal parameters for linear second degree stationary iterative methods applied to unsymmetric linear systems have been computed by solving the minimax problem used to compute optimal parameters for Chebyshev iteration that is asymptotically equivalent to linear second-degree stationary method . A nonstationary iterative method of second order for solving nonlinear equations without requiring the use of any derivative has been presented. This method for algebraic equations coincides with Newton’s method and is more efficiently . Note that Newton’s methods and high order iterative schemes (Householder’s iterative methods), under some conditions of regularity of the given function and its derivatives, have been used for the numerical treatment of single nonlinear equations .

A three-step iterative method for solving nonlinear equations by using Steffensen’s method in the third step having eight order convergence has been recently presented. This method requires a small number of calculations and does not require calculation of derivative in the third iterative step. A two-step iterative method for solving nonlinear equations, a modified Noor’s method without computing the second derivatives and with fourth order convergence has been presented . A second order iterative method for solving quasi-variational inequalities has been introduced and sufficient conditions for convergence rate have been given . Two iterative methods of order four and five respectively by using modified homotopy perturbation techniques for solving nonlinear equations and the convergence analysis have been presented . An efficient second order iterative method for IR drop analysis in power grid has been presented . In this research study, they consider a first order iterative method

${x}_{n+1}^{*}=G{x}_{n}+{K}_{1}$ (1)

and the resulting second order iterative method

${x}_{n+1}={x}_{n}+a\left({x}_{n}-{x}_{n-1}\right)+b\left({x}_{n+1}^{*}-{x}_{n}\right)$, (2)

where ${x}_{n+1}^{*}$ denotes the first-order iteration, and a and b are real accelerating parameters effecting the convergence rate. For the consistency and convergence of the second order iterative methods the following statements hold:

Preposition 1: If the 1st-order iterative method converges to the exact solution, then the 2nd-order method will converge to the same solution for any values of $a\ne 0$ and $b\ne 0$.

Preposition 2: The iterative matrix of the 2nd-order method is known. A necessary and sufficient condition that the iterative method converges for all initial conditions is that, if the spectral radius of matrix G is minimized then the convergence rate is maximized .

2.1. Some Problems in Solving Very Large Complex Computational Problems

It is known that due to the extremely large sizes of power grids, IR drop analysis has become a computationally challenging problem in terms of memory usage and runtime, and second-order iterative algorithms that can significantly reduce the runtime have been proposed. Specifically, the main problems include the following:

1) Very large-scale simulations (millions of elements in power grids) and run time is slow.

2) Memory inefficiency (1 million nodes and trillion elements in matrix)

3) Trade-off between runtime and predetermined accuracy

4) Power delivery issues (increased complexity of VLSI circuits, increased power (current) consumption, decreasing supply voltage, reduced noise margin, increased gate delay)

5) Modelling and analysis of power grid network must be accurate and power grid networks tend to be very large .

Typical applications include also a large class of initial-boundary value problems of general form in 3 space dimensions with strong nonlinearities:

${u}_{t}+\underset{i=1}{\overset{N}{\sum }}{a}_{i}\left(x,t,u\right){u}_{xi}+b\left(x,t,u\right)=\epsilon \underset{i=1}{\overset{N}{\sum }}{u}_{xi}{u}_{xi}$ (2a)

where the positive perturbation parameter tends to zero .

The discrete analogues of Equations (1) (2) lead to the solution of the general linear system

$Au=s$ (2b)

where the coefficient matrix A is a large sparse unsymmetric real (n × n) matrix of irregular structure.

2.2. Computational Complexity per Iterative Step

Computational complexity of algorithms is an important subject in which considerable research has been focused in the last decades     .

An interesting topic in the framework of comparison of the number of flops per iteration to be performed for several classes of solution methods, i.e. 1) direct second order methods (based on direct matrix decomposition), 2) iterative first order methods and 3) iterative second order methods, for the cases of dense and sparse problems reveals the following:

1) In first order iterative methods, the number of flops per iteration is generally much smaller than that of second order methods, while the former generally need many iterations to converge (maybe few hundreds up to few thousands) and also have difficulties to obtain highly accurate optimal solutions.

2) The second order iterative methods usually converge quickly (few tens iterations) to highly accurate solutions.

3) The first order iterative methods overall are considered to be much superior to second order methods based on direct factorization in solving large scale SDP problems. However, the latter methods in solving linear systems may be prohibitively expensive (even impossible) due to high number of flops required and the high amount of memory space needed for storing the coefficient matrix.

4) The computational complexity of second order iterative methods has the following characteristics:

a) There is no need for computing and/or store the coefficient matrix

b) The computational work and storage requirements of inner iterations are very comparable to that of iterations of log-barrier method

c) The total number of inner iterations performed by iterative second order method depends on the choice of preconditioner used in the iterative algorithm of the original system.

Note that a logarithmic barrier term, which emphasizes the improvement of poor quality elements, solves the constrained optimization problem using the gradient of the objective function  .

Conclusively, the second order iterative methods, as far as the inner iterations are concerned, behave quite similar to first order methods. Furthermore, the development of efficient preconditioners for solving the original linear system is a decisive factor for making the second order iterative methods superior to the first order iterative methods.

3. General Iterative Methods of Second Order: Part II

In this section, a class of iterative methods of second order for solving large sparse linear systems of the form Au = b is presented and explicit preconditioned methods for approximating the solution of complex computational problems are given. Apart of the extensive research work that has been done for solving linear systems by using second order iterative methods as indicated in section 2, it is worthwhile to mention that the efficient usage of second order iterative schemes for solving nonlinear systems has been reported by Lipitakis (1990) and these iterative schemes are originated from the E-algorithm, a modified version of the well-known algorithm of Euclid, written in the form of a general second order iterative scheme . This general iterative scheme is synoptically described in the following:

Let us consider a class of non-stationary iterative methods of second order the form:

${u}_{i+1}={\pi }_{i}{u}_{i}+{\tau }_{i}{u}_{i-1}+{\delta }_{i},\text{\hspace{0.17em}}\text{ }\text{ }i\ge 0$ (3)

where ${\pi }_{i}$, ${\tau }_{i}$ are real parameters, the so-called E-parameters , and ${\delta }_{i}$ is a “correction” term. The iterative scheme is known as the E-iterative method .

These iterative schemes with appropriate selection of the parameters can be used in conjunction with various explicit preconditioned methods, such as explicit Richardson, Chebychev, fractional step iterative method, Grank-Nickolson, multiple explicit Jacobi, explicit preconditioned Conjugate Gradients, etc. for solving complex computational problems, such as 3D-initial boundary value problems. It is known that the corresponding approximate factorization algorithms and the approximate inverse algorithms are tediously complicated, especially in the three-dimension space with complicated boundary conditions in irregular domains.

Let us consider an unsymmetric large sparse linear system Au = b. Then, the choice of E-parameters:

${\pi }_{i}=\left[I-{H}^{-1}A\right],\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\tau }_{i}=0,\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\delta }_{i}={H}^{-1}b$ (4)

where the non-singular matrix H is chosen such that the computational work required for solving the linear system Hu = s (where s is given) is considerably small compared to that required to solve the system Au = b, leading to the iterative procedure,

${u}_{i+1}=\left[I-{H}^{-1}A\right]{u}_{i}+{H}^{-1}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 0$ (5)

Note that several choices of matrix H lead to well-known iterative methods, i.e. Richardson, Block Jacobi, Gauss-Seidel, SOR, SSOR, etc.

Consider the unsymmetric linear system Au = b, a family of ellipses of centred with foci d ± c and assuming that the E-parameters are selected as

${\pi }_{i}=\left[1-{\beta }_{\iota }-{a}_{i}A\right],\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\tau }_{\iota }=-{\beta }_{\iota },\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\delta }_{\iota }={a}_{i}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 0$ (6)

where the acceleration parameters ${a}_{i},{\beta }_{\iota }$ are defined as

${a}_{i}={\left[d-{\left(c/2\right)}^{2}{a}_{i-1}\right]}^{-1},\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\beta }_{i}=d{a}_{i}-1,\text{ }\text{\hspace{0.17em}}i\ge 1$ (7)

with

${a}_{1}=2d/\left(2{d}^{2}-{c}^{2}\right),\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\beta }_{1}=d{a}_{0}-1,\text{\hspace{0.17em}}\text{\hspace{0.17em}}{a}_{0}=1/d,\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\beta }_{0}=0$ (8)

The iterative method (3) then can be equivalently written as a second order non-stationary iterative scheme (known as Chebychev iterative method):

${u}_{i+1}=\left(1+{\beta }_{i}-{a}_{i}A\right){u}_{i}-{\beta }_{i}{u}_{i-1}+{a}_{i}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (9)

or equivalently

${u}_{i+1}={u}_{i}+{a}_{i}{r}_{\iota }+{\beta }_{i}\left({u}_{i}-{u}_{i-1}\right),\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (10)

where r is the residual factor $r=b-Au$, and ${u}_{0},{u}_{1}$ are arbitrary chosen initial values.

The simplified choice of E-parameters

${\pi }_{i}=\left[1-\beta -aA\right],\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\tau }_{\iota }=-\beta ,\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\delta }_{\iota }=ab,\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 0$ (11)

leads to second-order iterative scheme (known as the second-order Richardson’s method):

${u}_{i+1}=\left[{u}_{i}+a{r}_{i}+\beta \left({u}_{i}-{u}_{i-1}\right)\right],\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (12)

where the parameters α, β are the asymptotic values of ${a}_{i},{\beta }_{i}$ .

Let us consider the approximate factorization $A\approx {L}_{s}{U}_{s}$, where ${L}_{s}$ and ${U}_{s}$ are sparse decomposition factors   and let us assume that M a non-singular real (n × n) matrix, is the inverse of $\left({L}_{s}\text{\hspace{0.17em}}\text{\hspace{0.17em}}{U}_{s}\right)$, i.e. ${M}_{r}=\left({L}_{r}\text{\hspace{0.17em}}{U}_{r}\right)$, $r\in \left[1,m-1\right]$, where r is the “fill-in” parameter, i.e. the number of outermost-off diagonal entries which are retained in semi-bandwidth m. Then, the explicit iterative methods based on the generalized approximate inverse matrix techniques , can be obtained by determining the element of A-1 without inverting the decomposition factors ${L}_{r},{U}_{r}$.

Note that the effectiveness of explicit iterative methods is mainly related to the fact that the exact inverse of the sparse matrix A, although is full, exhibits a similar “fuzzy” structure as A, i.e. the largest elements are clustered around the principal diagonal and main diagonals .

The selection of the E-parameters

$\begin{array}{l}{\pi }_{i}=\left[1+b-a{M}_{r}A\right],\text{\hspace{0.17em}}\\ \text{\hspace{0.17em}}{\tau }_{i}=-\beta ,\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\delta }_{i}=a{M}_{r}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 0\end{array}$ (13)

and

$\begin{array}{l}{\pi }_{i}=\left[1+{\beta }_{i}-{a}_{i}{M}_{r}A\right],\text{\hspace{0.17em}}\\ \text{\hspace{0.17em}}{\tau }_{i}=-{\beta }_{i},\text{\hspace{0.17em}}{\delta }_{i}={a}_{i}{M}_{r}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 0\end{array}$ (14)

where ${M}_{r}$ is an approximate form of the inverse of A and α, β and ${a}_{i}$, ${\beta }_{i}$ are preconditioned parameters and sequences of parameters respectively, leads to the following explicit iterative schemes:

${u}_{i+1}=\left(1+\beta -\alpha {M}_{r}A\right){u}_{i}-\beta {u}_{i-1}+a{M}_{r}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (15)

${u}_{i+1}=\left(1+\beta -{\alpha }_{i}{M}_{r}A\right){u}_{i}-{\beta }_{i}{u}_{i-1}+{a}_{i}{M}_{r}b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (16)

which are known as the Explicit second order Richardson and Explicit Chebychev methods respectively .

In these explicit iterative schemes several approximate forms of the inverse can be effectively used by retaining only ${\delta }_{U}$ and ${\delta }_{L}$ $\left({\delta }_{U}\in \left[2,n-1\right],{\delta }_{L}\in \left[1,n\right]\right)$ diagonals in the upper and lower triangular parts of ${M}_{r}$, the remaining elements being just not computed at all .

4. Explicit Iterative Schemes and Approximate Inverse Preconditioners

4.1. A Class of Optimized Approximate Inverse Variants

A class of optimized approximate inverse variants can be obtained by considering a (near) optimized choice of the approximate inverse M depends on the selection of related parameters, i.e. fill-in parameters r1, r2, retention parameters δl1, δl2 and entropy-adaptivity-uncertainty (EAU) parameters  (Figure 1). Note that the selection of retention parameter values as multiples of the corresponding semi-bandwidths of the original matrix leads to improved numerical results . Then, the following sub-classes of approximate inverses, depending on the accuracy, storage and computational work requirements, can be derived

Figure 1. Certain subclasses of approximate inverse matrices.

where ${M}_{{r}_{1}=m-1,{r}_{2}=p-1}^{\delta {l}_{1},\delta {l}_{2}}$ of sub-class I is a banded form of the exact inverse retaining $\delta {l}_{1},\delta {l}_{2}$ elements along each row and column respectively, while its elements are equal to the corresponding elements of the exact inverse. The term ${M}^{S}{\text{\hspace{0.17em}}}_{{r}_{1}=m-1,{r}_{2}=p-1}^{\delta {l}_{1},\delta {l}_{2}}$ of sub-class II is a banded form of M, retaining only $\delta {l}_{1},\delta {l}_{2}$ elements along each row and column during the computational procedure of the approximate inverse and under certain hypotheses can be considered as a good approximation of the original inverse, while the entries of the approximate inverse in sub-class III have been retained after computing M* ( ${r}_{1} and ${r}_{2} ) and are less accurate than the corresponding entries of ${M}^{*}{\text{\hspace{0.17em}}}_{{r}_{1},{r}_{2}}^{\delta {l}_{1},\delta {l}_{2}}$.Finally, in sub-class IV the elements of the approximate inverse can be computed.

Note that the generalized Approximate Inverse Matrix (AIM) techniques can be efficiently used in conjunction with explicit iterative schemes leading to effective composite semi-direct solution methods solving large linear systems of algebraic equations on multiprocessor systems.

4.2. Explicit Iterative Schemes

Analogous relationships can be derived for various explicit CG-type methods as indicated in the following. The generalized alternating group explicit (AGE) iterative methods, based on the known ADI methods    , can be derived as follows: consider the splitting

$A=D+{G}_{1}+{G}_{2}$ (18)

where D is a non-negative diagonal matrix and D, ${G}_{1}$, ${G}_{2}$ satisfy the conditions

1) The matrices $\left({G}_{i}+\theta D+pI\right),i=1,2$ are non-singular for any $\theta \ge 0$, $\rho >0$,

2) The system

$x={G}_{1}^{-1}v$ and $y={G}_{2}^{-1}z$ (19)

can be easily solved explicitly for any vectors v and z. Note that this statement holds only to certain model problems. Then, the following generalized AGE scheme can be obtained

$\left({G}_{i}+{p}_{i+1}I\right){u}_{i+1/2}=\left({p}_{i+1}I-{G}_{2}\right){u}_{i}+b$ (20)

$\left({G}_{i}+{p}_{i+1}I\right){u}_{i+1}=\left({G}_{2}-\left(I-\omega \right){p}_{i+1}I\right){u}_{i}+\left(2-\omega \right){p}_{i+1}{u}_{i+1/2}$ (21)

where ω is a non-negative acceleration parameter .

In an analogous manner can be derived the explicit preconditioned methods 1) Richardson + AGE method and 2) Chebychev semi-iterative method + AGE method . Note that the AGE method which is based on the combination of certain elementary first order difference processes permitting the reduction of a given complicated problem into a sequence of simpler problems, can be considered as a fractional (splitting-up) method  .

The multiple explicit Jacobi (μ-EJ) method and its several parametric forms, provided that their spectral radius ρ satisfies the corresponding convergence condition, in combination with the Lanczos economized Chebychev polynomials of degree μ, has proved to be effective for solving elliptic boundary-value problems in parallel processors . The multiple explicit Jacobi method in conjunction to economized Chebychev polynomial and Neumann series of certain degree can be effectively applied for solving explicitly large sparse linear systems resulting from the discretization of initial boundary-value problems .

4.3. Explicit Preconditioned Conjugate Gradients Method of Second Order

Let us assume that the E-parameters are selected as follows:

${\pi }_{i}={p}_{i+2}\left(1+{\gamma }_{i+2}{M}_{\mu }A\right),\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\tau }_{\iota }=1-{p}_{i+1},\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\delta }_{i}={p}_{i+1}{\gamma }_{i+1}{M}_{\mu }b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 0$ (22)

where

${p}_{i+1}=1+\left({a}_{i}/{a}_{i-1}\right){\beta }_{\iota },\text{\hspace{0.17em}}\text{\hspace{0.17em}}{\gamma }_{i+1}={\alpha }_{i}/{p}_{i+1},\text{\hspace{0.17em}}\text{\hspace{0.17em}}i=0,1,2,\cdots ,k-1$ (23)

with k the smallest integer such that ${r}_{k}=b-A{u}_{k}=0$, ${p}_{i}=1$ and ${a}_{i},{\beta }_{i}$ are the scalar quantities as defined in the original CG paper .

The iterative scheme (21) then becomes

${u}_{i+1}={p}_{i+1}\left(1+{\gamma }_{i+1}{M}_{\mu }A\right){u}_{i}+\left(1-{p}_{i+1}\right){u}_{i-1}+{p}_{i+1}{\gamma }_{i+1}{M}_{\mu }b,\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (24)

or equivalently the second order explicit preconditioned CG scheme can be obtained as

${u}_{i+1}={u}_{i-1}+{p}_{i+1}\left({\gamma }_{i+1}{M}_{\mu }{r}_{i}+{u}_{i}-{u}_{i-1}\right),\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}i\ge 1$ (25)

In the following, the selection of E-parameters for various explicit preconditioned methods is presented.

The proposed explicit iterative methods can be used for solving large sparse linear systems and the E-iterative schemes can generate useful explicit iterative schemes of higher order with suitable selection of E-parameters for solving a wide class of very large sparse linear systems in multiprocessor systems.

Typical applications include a large class of initial-boundary value problems of general form in three space dimensions with strong nonlinearities

${u}_{t}+\underset{i=1}{\overset{N}{\sum }}{a}_{i}\left(x,t,u\right){u}_{xi}+b\left(x,t,u\right)=\epsilon \underset{i=1}{\overset{N}{\sum }}{u}_{xi}{u}_{xi}$ (25a)

where the positive perturbation parameter tends to zero .

The discrete analogues of Equation (25a) lead to the solution of the general linear system

$Au=s$ (25b)

where the coefficient matrix A is a large sparse unsymmetric real (n × n) matrix of irregular structure.

4.4. Selection of Explicit Iterative Solver and Algorithmic Implementation

In this section an iterative solver of second order for solving linear and nonlinear systems of irregular structure is presented in pseudo algorithmic form:

Algorithm SOIS-1 (EXSOM, SEIS, U-1, U0, NMAX, Π, R, Δ, U)

Purpose: This algorithm describes the general selection of an iterative solver of second order for solving linear and nonlinear nonsymmetric systems of irregular structure and the selection of corresponding parameters.

Input: the computational module EXSOM selecting the explicit iterative solver SEIS and corresponding parameters, the initial values ${u}_{-1}$, ${u}_{0}$, max number of iterations NMAX.

Output: final explicit iterative solver SEIS and approximate iterative solution U.

Computational Procedure:

step 1: Call the explicit iterative module EXSOM and determine the explicit iterative solver to be used

step 2: Select initial vectors ${u}_{-1}$ and ${u}_{0}$ and set the max number of iterations NMAX

step 3: Select the corresponding E-parameters π, r and correction term δ

step 4: For $i=0,1,2,\cdots$

step 5: Compute approximate vector ${u}_{i+1}$ as ${u}_{i+1}={\pi }_{i}{u}_{i}+{r}_{i}{u}_{i-1}+{\delta }_{i}$

// the E-parameters π, r and correction term δ are selected from the corresponding explicit iterative scheme of computational module EXSOM //

step 6: If the convergence criterion is satisfied and the max number of iteration NMAX is hold,

then go to step 7,

else go to step 4 and

continue

step 7: Print the explicit iterative solver SEIS

step 8: Form approximate solution u

The explicit computational module EXSOM can be used for solving an explicit iterative solver as follows:

module EXSOM (ERI, ECH, GAGE, RAGE, CHAGE, GFSI, MEJ, EPCGSO, SEIS)

Purpose: this module determines the explicit iterative solver that will be used for solving the given linear nonsymmetric system of irregular structure

Input: the explicit iterative solvers ERI, ECH, GAGE, RAGE, CHAGE, GFSI, MEJ, EPCGSO, IS integer (=1, 2, 3, 4, 5, 6, 7, 8) is the number of explicit iterative solver to be used

Output: The selected explicit iterative solver SEIS = IS

Computational Procedure:

step 1: Consider one of the following available explicit iterative solvers and set the corresponding value of IS

step 1.1: If IS==1,

then the explicit Richardson method ERI is to be used;

return

step 1.2: If else IS==2,

then the explicit Chebychev ECH is to be used

return

step 1.3: If else IS==3,

then the generalized AGE method GAGE is to be used;

return

step 1.4: If else IS==4,

then the Richardson + AGE method RAGE is to be used;

return

step 1.5: If else IS==5,

then the Chebychev + AGE method CHAGE is to be used;

return

step 1.6: If else IS==6,

then the general fractional step iteration GFSI is to be used;

return

step 1.7: If else IS==7,

then the multiple explicit Jacobi MEJ is to be used;

return

step 1.8: If else IS==8,

then the explicit preconditioned CG of second order is to be used

return

step 2: return the selected explicit iterative solver (SEIS)

Additional explicit iterative solvers can be used in module EXSOM for solving the given linear system. The algorithm SOIS-1 can be used as a general explicit iterative solver of second order for solving linear and nonlinear non-symmetric systems of irregular structure by selecting the corresponding parameters.

5. The Exact and Approximate Inverse Matrix Algorithmic Techniques

5.1. On the Selection of Approximate Inverse Preconditioners

The selection of an efficient approximate inverse preconditioner for solving explicitly complex computational problems is an interesting research topic of critical importance. Let us assume that a non-singular large sparse unsymmetric matrix of irregular structure can be factorized as A = LU (Figure 2), where L and U are triangular matrices and $A\approx {L}_{s}{U}_{s}$ is an approximate factorization with ${L}_{s}$ and ${U}_{s}$ the corresponding sparse decomposition factors (Figure 3 and Figure 4). The elements of the decomposition factors can be efficiently computed . Let us assume that ${M}_{S}^{*}$ is an approximate inverse of A, i.e. ${A}^{-1}\approx {M}_{s}^{*}$.

In the following, a class of adaptive exact and approximate inverse solvers based on exact/approximate LU decompositions and approximate inverse methods is described. Let us assume the general linear system

Figure 2. Structure of the unsymmetric coefficient matrix A.

Figure 3. Structure of the lower triangular factor Ls.

Figure 4. Structure of the upper triangular factor Us.

$Au=s$ (25b)

where the coefficient matrix A is a large sparse real (n × n) matrix of irregular structure.

The structure of A is shown in the following diagram:

Note that such regular matrix structures occur only for certain model type problems.

Let us consider, the factorization A = LU and an approximate factorization of the coefficient matrix A,

$A\approx {L}_{s}{U}_{s}$ (27)

where ${L}_{s}$ and ${U}_{s}$, are lower and upper sparse triangular matrices of irregular structures of semi-bandwidths m and p retaining ${r}_{1}$ and ${r}_{2}$ fill-in terms respectively. The decomposition factors ${L}_{s}$ and ${U}_{s}$ are banded matrices with l1 and l2 the numbers of diagonals retained in semi-bandwidths m and p respectively, of the following form:

The elements of the decomposition factors Ls and Us can be obtained from the algorithmic procedure FELUBOT .

A class of exact and approximate inverse matrix techniques can be considered containing several sub-classes of approximate inverses according to memory requirements, computational work, accuracy, as indicated in the following scheme:

Let us assume that ${M}_{{r}_{1},{r}_{2}}$, a non-singular (n × n) matrix, is an approximate inverse of A, i.e. ${M}_{{r}_{1},{r}_{2}}={\left\{{M}_{i,j}\right\}}^{{r}_{1},{r}_{2}},\text{\hspace{0.17em}}i,j\in \left[1,n\right]$. Note that if ${r}_{1}=m-1$ and ${r}_{2}=p-1$ non-zero elements have been retained in the corresponding decomposition factors, then ${M}_{{r}_{1},{r}_{2}}=M$, where M is the exact inverse of A. The elements of M can be determined by solving recursively the systems

$ML={U}^{-1}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{and}\text{\hspace{0.17em}}\text{\hspace{0.17em}}UM={L}^{-1}$ (28)

having main disadvantages, i.e. high storage requirements and computational work involved particularly in the case of solving very large unsymmetric linear systems.

A class of approximate inverses ${M}^{\delta {l}_{1},\delta {l}_{2}}$ can be obtained by retaining only δl1 and δl2 diagonals in the lower and upper triangular parts of inverse respectively, the remaining elements being just not computed at all. Optimized forms of this algorithm are particularly effective for solving banded sparse FE systems of very large order, i.e. $\left[\delta {l}_{1}+\delta {l}_{2}\right]>n/2$ or in the case of narrow-banded sparse FE systems of very large order, i.e. $\left[\delta {l}_{1}+\delta {l}_{2}\right]\ll n/2$. Then an explicit approximate inverse preconditioner can be described in the following adaptive algorithmic form.

5.2. The Exact and Approximate Inverse Preconditioners

The elements of the exact and approximate inverse of a given unsymmetric and irregular structure can be obtained as follows and the explicit banded approximate inverse algorithm can be described by the following algorithmic procedure in pseudocode form:

Algorithm EBAIM-1 (A, n, εEM, r1, r2, m, p, l1, l2, δl1, δl2, M)

Purpose: This algorithm computes the elements of the exact inverse of a given real unsymmetric (n × n) matrix of irregular structure arising in FE/FD discretization of elliptic and parabolic boundary value in three space dimensions. The algorithm can also compute approximate inverse matrices of the given coefficient matrix.

Input: given matrix A; n order of A; submatrices F, H, Γ, Z; parameter εEM (indicating the exact inverse or approximate inverse matrix), s m, p; l1 and l2 numbers of diagonals retained in semi-bandwidths m and p respectively, δl1 and δl2 widths of bands in A

Output: elements μi,j of the exact inverse M

Computational Procedure:

//for the appropriate value of adaptive parameter εEM the algorithm computes the exact inverse matrix or approximate inverse matrices//

Call module exactmode-1(εEM)

//If the module exactmode-1is activated then the algorithm EBAIM-1 computes the exact inverse of a given unsymmetric matrix of irregular structure using an exact LU factorization, otherwise the algorithm can be used for computing an approximate inverse matrix of the given coefficient matrix//

step 1: Let $rl1={r}_{1}+{l}_{1}$ ; $rl2={r}_{2}+{l}_{2}$ ; $rl11=rl1-1$ ; $rl21=rl2-1$ ; $mr1=m-{r}_{1}$ ; $ml1=m+{l}_{1}$ ; $pr2=p-{r}_{2}$ ; $pl2=p+{l}_{2}$ ; $nmr1=n-m+{r}_{1}$ ; $npr2=n-p+{r}_{2}$

step 2: For i = n:1

step 3: For $j=i:\mathrm{max}\left(1,i-\delta \text{l}+\text{1}\right)$

step 4: If $j>nmr1$ then

step 5: If i = j then

step 6: If i = n then

step 7: ${\mu }_{n,n}=1$

step 8: else

step 9: ${\mu }_{i,j}=1-{g}_{j}{\mu }_{i,j+1}$

step 10: ${\mu }_{i,j}={\omega }_{n}-{\beta }_{j}{\mu }_{i,j+1}$

step 11: else

step 12: ${\mu }_{i,j}=-{g}_{j}{\mu }_{i,j+1}$

step 13: ${\mu }_{i,j}=-{\beta }_{j}{\mu }_{i,j+1}$

step 14: else

step 15: If $j>npr2$ and $j\le nmr1$ then

step 16: If i = j then

step17: ${\mu }_{i,j}=1-{g}_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{h}_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}$

step18: ${\mu }_{i,j}={\omega }_{1}-{\beta }_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{\gamma }_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}$

step 19: else

step 20: ${\mu }_{i,j}=-{g}_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{h}_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}$

step 21: ${\mu }_{i,j}=-{\beta }_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{\gamma }_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}$

step 22: else

step 23: If $j\ge r{l}_{1}$ and $j\le npr2$ then

step 24: If i = j then

step 25: ${\mu }_{i,j}=1-{g}_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{h}_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}-\underset{k=0}{\overset{npr2-j}{\sum }}{f}_{rl12-k,j+k+1-r2}{\mu }_{i,j+pr2+k}$

Step 26: ${\mu }_{i,j}={\omega }_{1}-{\beta }_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{\gamma }_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}-\underset{k=0}{\overset{npr2-j}{\sum }}{z}_{rll2-k,j+k+l-r2}{\mu }_{i,j+pr2+k}$

Step 27: else

Step 28: ${\mu }_{i,j}=-{g}_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{h}_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}-\underset{k=0}{\overset{npr2-j}{\sum }}{f}_{rl12-k,j+k+1-r2}{\mu }_{i,j+pr2+k}$

Step 29: ${\mu }_{i,j}=-{\beta }_{j}{\mu }_{i,j+1}-\underset{k=0}{\overset{nmr1-j}{\sum }}{\gamma }_{rl11-k,j+k+1-r1}{\mu }_{i,j+mr1+k}-\underset{k=0}{\overset{npr2-j}{\sum }}{z}_{rll2-k,j+k+l-r2}{\mu }_{i,j+pr2+k}$

Step 30: else

Step 31: If i = j then

Step 32: If i = 1 then

Step 33: ${\mu }_{1,1}=1-{g}_{1}{\mu }_{1,2}-\underset{k=1}{\overset{l1}{\sum }}{h}_{1,k}{\mu }_{1,m+k-1}-\underset{k=1}{\overset{l2}{\sum }}{f}_{1,k}{\mu }_{1,p+k-1}$

Step 34: ${\mu }_{1,1}={\omega }_{1}-{\beta }_{1}{\mu }_{1,2}-\underset{k=1}{\overset{l1}{\sum }}{\gamma }_{1,k}{\mu }_{1,m+k-1}-\underset{k=1}{\overset{l2}{\sum }}{z}_{1,k}{\mu }_{1,p+k-1}$

Step 35: else

Step 36: ${\mu }_{i,j}=1-{g}_{j}{\mu }_{i,j+1}-\underset{k=j+1-r1}{\overset{l1}{\sum }}{h}_{j,k}{\mu }_{i,m+k-1}-\underset{k=1}{\overset{j-1}{\sum }}{h}_{j-k,l1+k}{\mu }_{i,ml1+k-1}$

Step 37: $\begin{array}{l}{\mu }_{i,j}={\omega }_{1}-{\beta }_{j}{\mu }_{i,j+1}-\underset{k=j+1-r1}{\overset{l1}{\sum }}{\gamma }_{j,k}{\mu }_{i,m+k-1}-\underset{k=1}{\overset{j-1}{\sum }}{\gamma }_{j-k,l1+k}{\mu }_{i,ml1+k-1}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}-\underset{k=1}{\overset{j-1}{\sum }}{z}_{j-k,l2+k}{\mu }_{i,pl2+k-1}-\underset{k=j+1-r2}{\overset{l2}{\sum }}{z}_{j,k}{\mu }_{i,p+k-1}\end{array}$

Step 38: else

Step 39: $\begin{array}{l}{\mu }_{i,j}=-{g}_{j}{\mu }_{i,j+1}-\underset{k=j+1-r1}{\overset{l1}{\sum }}{h}_{j,k}{\mu }_{i,m+k-1}-\underset{k=1}{\overset{j-1}{\sum }}{h}_{j-k,l1+k}{\mu }_{i,ml1+k-1}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}-\underset{k=1}{\overset{j-1}{\sum }}{z}_{j-k,l2+k}{\mu }_{i,pl2+k-1}-\underset{k=j+1-r2}{\overset{l2}{\sum }}{z}_{j,k}{\mu }_{i,p+k-1}\end{array}$

Step 40: $\begin{array}{l}{\mu }_{i,j}=-{\beta }_{j}{\mu }_{i,j+1}-\underset{k=i+1-r1}{\overset{l1}{\sum }}{\gamma }_{j,k}{\mu }_{i,m+k-1}-\underset{k=1}{\overset{j-1}{\sum }}{\gamma }_{j-k,l1+k}{\mu }_{i,ml1+k-1}\\ \text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}\text{\hspace{0.17em}}-\underset{k=1}{\overset{j-1}{\sum }}{z}_{j-k,l2+k}{\mu }_{i,pl2+k-1}-\underset{k=j+1-r2}{\overset{l2}{\sum }}{z}_{j,k}{\mu }_{i,p+k-1}\end{array}$

Step 41: For j = i − 1 to $\mathrm{max}\left[1,i-\delta l+1\right]$

Step 42: ${\mu }_{i,i}={\mu }_{i,j}$

Step 43: Print out the inverse elements ${\mu }_{i,i}$

Step 44: End

Note that if the computational module exactmode-1 is activated, as indicated below, then the algorithm EBAIM-1 computes the elements of the exact inverse of a given unsymmetric matrix of irregular structure using an exact LU factorization.

Module exactmode-1 (εEM)

/If this module is activated (εEM = 1), then the algorithm EBAIM-1 computes the exact inverse of a given unsymmetric matrix of irregular structure using an exact LU factorization, otherwise the algorithm can be used for computing an approximate inverse matrix of the given coefficient matrix/

If εEM = 1 then

Set ${r}_{1}=p-\text{1}$ ; ${r}_{2}=m-\text{1}$

Set ${l}_{1}=n-\text{2}$ ; ${l}_{2}=n-p$

Set $\delta {l}_{1}=n-\text{2}$ ; $\delta {l}_{2}=n-\text{2}$

else

exit

end module exactmode-1

The computational work of the EBAIM-1 algorithm is $\approx O\left[n\left(\delta {l}_{1}+\delta {l}_{2}\right)\left({r}_{1}+{r}_{2}+{l}_{1}+{l}_{2}+2\right)\right]$ multiplications, while the memory requirements are (n × n) words. In the case of very large systems, the memory requirements could be prohibitively high and the usage of efficient memory requirements of approximate inverse matrices is desirable.

5.3. An adaptive Preconditioned Conjugate Gradient Method Using the Explicit Approximate Preconditioner

The preconditioned PCG method can solve the problem $\mathrm{min}‖b-A{R}^{-1}x‖$, where R is the sparse, non-singular QR factor, while the preconditioned CGLS method can solve the equations:

$M={R}^{\text{T}}R$, ${R}^{-\text{T}}{A}^{\text{T}}A{R}^{-1}u={R}^{-\text{T}}{A}^{\text{T}}b$ and $u=Rx$. (29)

Note that the factor Q cannot be stored, while the only additional computational work is solving the two equations ${R}^{\text{T}}w=v$ and $Rz=w$. All the factorization processes are numerically stable. It should be pointed out that the sparse preconditioner M* used in the modified PCG method is the approximate inverse of factor R, which using is closely related with the so-called mrTIGO method.

In order to compute efficiently the solution of the linear system Ax = b, a modified Preconditioned Conjugate Gradient (mPCG) method in conjunction to the modified rTIGO method  is applied in the following algorithmic form.

Algorithm mPCG (A, b, tol, x0, M*, x)

Purpose: a modified PCG method is used for solving a given system of linear equations

Input: A is a symmetric and positive definite coefficient matrix, b is the right hand side vector, tol is the predetermined tolerance, ${x}_{0}$ is the initial guess, M* is the required preconditioner

Output: x the solution vector

Computational Procedure:

Step 1: Given ${x}_{0}$ and preconditioner M*

Step 2: Set ${r}_{0}=A{x}_{0}-b$

Step 3: Solve $M{y}_{0}={r}_{0}$

Step 4: Set ${p}_{0}=-{y}_{0},k=0$

Step 5: While ${r}_{k}\ne 0$

Step 6: Compute a step length ${a}_{k}=\left({r}_{k}^{\text{T}}{y}_{k}\right)/\left({p}_{k}^{\text{T}}A{p}_{k}\right)$

Step 7: Update the approximate solution ${x}_{k+1}={x}_{k}+{a}_{k}{p}_{k}$

Step 8: Update the residual ${r}_{k+1}={r}_{k}+{a}_{k}A{p}_{k}$

Step 9: Solve ${M}^{*}{y}_{k+1}={r}_{k+1}$

Step 10: Compute a gradient correction factor ${\beta }_{k+1}=\left({r}_{k+1}^{\text{T}}{y}_{k+1}\right)\left({r}_{k}^{\text{T}}{y}_{k}\right)$

Step 11: Set the new search direction ${p}_{k+1}=-{y}_{k+1}+{p}_{k+1}{p}_{k}$

Step 12: $\kappa =\kappa +1$

Step 13: End (while)

This algorithm requires the additional work that is needed to solve the linear system

${M}^{*}{\stackrel{˜}{y}}_{n}={r}_{n}$ (29)

once per iteration. Therefore, the preconditioner M* should be chosen such that can be done easily and efficiently.

The preconditioner M* = G that results in a minimal memory use. The storage requirement was the vectors r, x, y, p and the upper triangular matrix G, in the data implementation. The convergence rate of preconditioned CG is independent of the order of equations and the matrix vector products are orthogonal and independent. The preconditioned CG method in not self-correcting and the numerical errors accumulate every round. Therefore, to minimize the numerical errors in the pCG, it was used double precision variables at the cost of memory use. The explicit pCG method of second order can be alternatively used in conjunction with the explicit approximate inverse Mμ* for solving complex computational problems with the appropriate selection of the E-parameters of Table 1. Note that the topics of stability and correctness of incomplete factorization methods have been discussed in a recent research work .

The presented second order iterative schemes can be efficiently used for solving 2d and 3d initial and boundary value problems.

6. Conclusion

A class of general iterative methods of second order is described. Explicit adaptive iterative schemes and exact/approximate inverse preconditioners have been introduced and the selection of iterative parameters has been discussed. Exact and Approximate Inverse Matrix Algorithmic Techniques have been presented. Exact and Approximate Inverse Preconditioners have been described in adaptive algorithmic form. Explicit preconditioned Conjugate Gradients methods of

Table 1. Certain cases of the E-iterative scheme which are no more than various explicit preconditioned methods.

second order are given. An Adaptive Preconditioned Conjugate Gradient Method using explicit approximate preconditioners has been also presented. Future research work will be focused on the implementation of the presented methods to parallel computer environments and related applications.

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.     customer@scirp.org +86 18163351462(WhatsApp) 1655362766  Paper Publishing WeChat 