1. Introduction
Computational efficiency is one of the most concerned problems towards industrial applications of high-order methods. While various high-order methods have been gaining popularity, their computational efficiency can still be enhanced by employing a fast time marching method. To this end, a class of exponential time integration methods has been developed for fast time stepping of high-order discretized compressible Navier-Stokes equations, demonstrated in an arbitrarily high-order discontinuous Galerkin framework HA3D [1] - [6] on hybrid curved elements, although it is generally applicable to any spatial discretization. The developed predictor-corrector exponential (PCEXP) time scheme [3] is shown to be able to eliminate the Courant-Friedrichs-Lewy (CFL) restriction with low absolutely temporal errors, showing a special capability of achieving fast time stepping for steady and unsteady, inviscid and viscous flows.
In this work, we exploit the possibility of reducing the computational cost associated with the use of uniform-order exponential DG methods in a variable-order solution adaptation frame, and to our best knowledge, it has not been reported yet. In the current adaptation framework, cell order of accuracy is dynamically determined with a local cell error indicator in a modal discontinuous Galerkin method, resulting in a cell-wise adaptive distribution of cell orders and variable-order elemental Jacobian matrices.
The remaining parts of this abstract are organized as follows. Section 2 introduces the spatial discretization method used. Section 3 gives the overview of the exponential time-marching methods including the accuracy (p) adaptation methods. Finally, the last section presents some numerical tests and the Caradonna-Tung rotor in hover [7] is simulated with the adaptive method for demonstrating its applicability to real-world problems.
2. Spatial Discretization
2.1. Governing Equations
Governing equations consider the three-dimensional compressible Navier-Stokes equations with a source term in the rotating reference frame
(1)
where
are the conventional conservative vector and viscous flux. For rotational reference frame, the source term
and inviscid flux
are defined as
(2)
where
is the absolute velocity,
is the angular velocity of the rotating frame of reference,
the relative velocity;
,
the viscous stress tensor and the heat flux vector.
, p, and e denote the
flow density, pressure, and the specific internal energy;
and
denote the total energy and total enthalpy, respectively;
denotes the
unit matrix; and the pressure p is given by the equation of state for a perfect gas
(3)
where
is the ratio of specific heats for perfect gas.
2.2. Modal Discontinuous Galerkin Method
The discontinuous Galerkin discretization of Equation (1) is defined on a computational domain
divided into curved elements of arbitrary shape. The present adaptive discontinuous Galerkin method seeks an variable-order approximation
in each element
with a p-order polynomial
, namely
(4)
For 3-D problems
. The cell order p is a local variable for each cell, thus an adaptive approximation is simply obtained by adding or removing terms of (18), accordingly. An orthonormal basis set
expressed in the Cartesian coordinates is used to facilitate the implementation of the adaptation framework [1] [3]. By multiplying Equation (1) with the adaptive-order basis functions, the weak form is obtained
(5)
where the Einstein summation convention is used. Here
denotes the outward unit normal of the surface element
of the element E. The flux terms are defined as
(6)
where
is computed by a Riemann solver, and the viscous flux
is computed with the second approach of Bassi and Rebay (BR2) which introduces the local and global lifting operators
and
[8]. The BR2 viscous flux discretization is compact where the global lifting operator is used for volume integration while the local lifting operator is used face-wised
(7)
and the local lifting operator is solved by Galerkin projection on each surface
,
where we introduce the average operator
(8)
The implementation of BR2 formulation to exponential schemes uses analytical global Jacobian [5] [6], which is composed of variable order cell Jacobians in the current framework.
3. Exponential Time Integration
The PCEXP scheme that originally developed for fast time stepping of semi-discretized equations is independent of the choice of spatial discretization. And this feature makes it applicable as a general time integration tool. The previous results show it permits the use of very large time steps [1] - [6], and are more efficient than the third-order explicit TVD Runge-Kutta method and the second-order implicit backward difference formula [3]. In this section, the PCEXP scheme is briefly overviewed and its variable-order implementation is discussed.
We start with the following semi-discrete system of ordinary differential equations which may be obtained from a spatial discretization with problem associated boundary conditions:
(9)
where
denotes the vector of the solution variables and
the right-hand-side term which may be the spatially discretized residual terms of the discontinuous Galerkin method used in this work. The dimension K is the degrees of freedom which can be very large for 3-D problems. Without loss of generality, we consider
in the interval of one time step, i.e.,
. Splitting the right hand side leads to a different exact expression
(10)
where the subscript n indicates the value evaluated at
,
denotes the Jacobian matrix
and
denotes the remainder, which in general is nonlinear. Equation (10) admits the following formal solution:
(11)
Here, the matrix exponential is defined similarly to its scalar version
(12)
is the integrating factor. The formal solution (11) is the starting point to derive the proposed exponential scheme in which the stiff part is computed analytically whereas the nonlinear term is approximated numerically. The PCEXP approximation of (11) is read as
(13)
(14)
where a new matrix function is defined as
(15)
and
denotes the
identity matrix.
The physical nature of such type of exponential schemes relies on the global coupling feature via the global Jacobian matrix
, so that flow transportation information can be broadcasted to the whole computational domain without a CFL restriction. That is why the exponential schemes behave like a fully implicit method but only depends on the current solution, i.e., in an explicit way of formulation (14). While the second-order PCEXP scheme is more appropriate for computing unsteady problems, its first-order version, EXP1 only takes the first stage of (14), thus is more efficient for steady flows as shown in reference [2] [3] [5]. In this paper, we used PCEXP for all the test cases for both steady and unsteady flows.
3.1. p-Adaptive Time Steps
The time step of the adaptive exponential DG is dynamically determined via a residual monitoring strategy [3]. The local time step
is chosen by the minimum of the convection time step
and the diffusion time step
, which are given below
(16)
Here, p denotes the cell order,
and c the velocity vector and sound speed evaluated at the cell center, d the spatial dimension, and
represents a characteristic size of a 3-D cell defined by the ratio of volume and surface area. 2-D computations are also supported by the HA3D solver developed by the author which uses a quasi-3D mesh obtained by extruding the 2-D mesh by one layer of cells. In the 2-D computations,
is used instead for eliminating the effect of z dimension. Given the cell size
in the z direction,
is determined by
(17)
A proven efficient CFL formula is employed in (16). For the current adaptive computations, we take the global minimal time step for time advancement. Using local time steps is possible, but might suffer from stability issues with very large time steps.
3.2. Adaptation Strategy for the p-Refinement
An adaptive, p-refinement strategy defines DG polynomial order p in a cell-wise way so that higher-order approximations can be placed locally in key flow regions such as those of near body and wake flows. To identify these regions, a so-called spectral delay error (SDE) indicator [9] is developed for locating shock waves originally. The SDE indicator, relating to the numerical resolution of the approximation, can also be used for p-adaptive refinement. Recalling the modal DG expansion of (18), we have
(18)
The truncated expansion
for a lower degree (
) is computed by cutting the summation
(19)
Applying the truncated expansion to the total energy variable
leads to
(20)
The variable
is finally used as the cell error indicator of the p-adaptation. The adaptation process starts with
globally, namely first-order approximation. And then the cell error indicator
is computed in each cell after a full convergence is obtained in each level of adaptation. The cell order p is updated in each cell with the following criterion
(21)
The above criterion is applied only when the variation of global maximal Mach number is within 10−5 to prevent occurring premature adaptive solutions that are still in the process of dynamic evaluations.
3.3. Variable Order Residual Jacobians
The global residual Jacobian
is required by the PCEXP scheme, which is composed of cell-wise, variable order Jacobians in the current adaptation framework. Unlike traditional nodal based high-order methods that require special treatments of interface order coupling [10] when different order approximations exist in two adjacent cells, the current modal based method is naturally compatible with variable accuracy without any treatment. The dimension of global residual Jacobian dynamically depends on the distribution of cell orders so that the dimension of elemental residual Jacobian depends on
.
4. Numerical Results
4.1. Rotating Flow between Two Concentric Cylinders
The implementation of high-order discontinuous Galerkin discretization for the Navier-Stokes equations is verified on the rotating flow between two concentric cylinders or Taylor Couette flow [6]. The fluid is driven by two concentric cylinders which are rotating with constant angular velocities of
and
. A low Reynolds number
is used for maintaining a laminar state, and the Reynolds number is defined by the tangential velocity and the radius of the inner cylinder. The analytical solution of this problem is given below
(22)
where
is the tangential velocity,
and
are the inner radius and the outer radius. The isothermal boundary condition is set on the inner cylinder with angular velocity
of Mach number
. The outer cylinder is stationary with
and uses the adiabatic wall boundary condition. The front and the back faces of z-direction use the symmetric boundary condition for conducting quasi-2D computations. The order of accuracy is computed on a sequence of three meshes, using first- to fifth-order DG schemes (
). Quadratic curved elements are used on the boundary surfaces. For this case, we focus on the error convergences. The
norms of velocity errors are detailed in
Table 1. They are computed as
integrating in the entire
computational domain
. The expected order of convergence is observed for all the p levels, thereby verifying the high-order implementation of DG viscous discretization and also the use of curved elements.
4.2. Lid-Driven Cavity Flows
The developed adaptive framework is evaluated on the lid-driven cavity flows. The accuracy-adaptive solutions with
are computed and compared with the baseline results of Ghia [11]. The top boundary is moving at a velocity at
, and the left, right, and bottom walls are set as the nonslip boundary condition. The front and the back faces of z-direction use the symmetric boundary condition for conducting quasi-2D computations. For the flow condition of
, secondary vortices show up in the corners of the cavity and high-resolution simulations are usually required. In this work, instead of using fine mesh, the adaptive frame is used so that a very coarse mesh (
) can be used. In this case, we start by studying the impacts of discrete accuracy on the flow solution. Figure 1 presents the streamline plots using the
,
solutions. It is observed that the
solution is over-damped, where corner vortices are nearly disappeared due to the presence of high numerical viscosity of low-order approximations. The situation is improved when using higher-order adaptations where the corner vortices exhibit increased resolution of flow structures. It is also verified that the
solution is essentially identical to the uniform
solution of Figure 2(a). This agreement confirms the effectiveness of
![]()
Table 1. Rotating flow between two concentric cylinders: uniform and adaptive order results of the
error expressed in the
scale.
denotes the percentage of the number of P4 cells of all the cells obtained with the adaptive method using cellwise polynomial refinement from P0 to P4 order.
![]()
Figure 1. Lid-driven cavity flow at
with polynomial adaptation:
,
,
,
( from left to right, from top to bottom).
the procedure of variable accuracy refinement. Interestingly, the polynomial refinement occurs primarily in the boundary layer and strong shear regions where energy cascade occurs that transfers energy from large-scale vortices to small-scale ones, as shown in Figure 2(b). A quantitative comparison is shown in Figure 3, where horizontal and vertical velocity profiles along the central lines of x and y axes are given. We notice that the uniformly and adaptively refined solutions are in good agreement with the reference solutions of Ghia [11] while the low-order solutions fail to match the results. In Table 2, we compare the
computational cost of all the runs, all the adaptation procedures offer cost reduction in terms of the total degree of freedoms (DOFs) and work unit. Especially, the
solution that starts up at
and ends up at
offers a twofold reduction in DOFs and three times speedup in work units compared with the uniform
solution. This case shows that the adaptive strategy is effective and accurate for viscous flows.
4.3. Caradonna-Tung Rotor in Hover
A real-world problem is considered in this case corresponding to the experimental model hover test conditions of Caradonna and Tung [7]. The experimental model consists of a two-bladed rigid rotor with rectangular planform blades with no twist or taper. The blades are made of NACA0012 airfoil sections with an aspect ratio of 6 as shown in Figure 4. The computational condition uses the case of tip Mach number
, collective pitch
, and the Reynolds number
based on the blade tip speed and chord. More computational parameters are listed in Table 3. For this case, adaptive
solutions are computed and compared with the experimental data [7]. The adaptive order distribution is shown in Figure 5(a) and the cell order is locally refined just around the trajectory of wake vortex shown in Figure 5(b). Comparisons of pressure coefficients at different span-wise locations defined by r/R are given in Figure 6, where r is the span-wise radius from the rotation axis and R is the tip radius defined in Table 3. The results demonstrate the feasibility of using p-adaptation framework of exponential time marching for 3-D problems.
![]()
Figure 4. The experimental configuration of Caradonna-Tung rotor in hover [7].
![]()
Table 3. Computational parameters of the Caradonna-Tung rotor in hover.
5. Conclusions
An adaptive high-order DG framework has been developed with the PCEXP exponential time marching scheme. By using the adaptive control strategy, the performance of PCEXP scheme is shown to be enhanced in terms of computational cost and memory usage. The correctness of the variable order implementation of PCEXP is firstly validated in the case of rotating flow between two concentric cylinders. While uniform order solver can deliver formal convergence rates, the adaptive
solution can even give a comparable error to the uniform
solution. In the cavity flow case, it is found that the use of spectral decay error indicator is effective in capturing key flow structures such that the secondary corner vortices can be captured. Performance statistic shows that a three-fold reduction in CPU time and a two-fold reduction in DOFs are gained. Finally, the adaptive solver is applied to the Caradonna-Tung rotor in hover, where three-dimensional wake flows are adaptively captured with the coarse mesh. The results are in good agreement to the experimental data [7] thus demonstrates its applicability to practical 3D complex flows.
Acknowledgements
This work is funded by the National Natural Science Foundation of China (NSFC) under the Grant U1930402. The computational resources are provided by Beijing Computational Science Research Center (CSRC).
Nomenclatures
: specific heat ratio
: Prandtl number
: k-order DG polynomial
: cell number of the k-order cells
: quasi-2D characteristic cell size
: 3D characteristic cell size
: mesh dimension of coordinate z