The First-Order Comprehensive Sensitivity Analysis Methodology (1st-CASAM) for Scalar-Valued Responses: I. Theory

Dan Gabriel Cacuci

doi:10.4236/ajcm.2020.102015

American Journal of Computational Mathematics > Vol.10 No.2, June 2020

The First-Order Comprehensive Sensitivity Analysis Methodology (1st-CASAM) for Scalar-Valued Responses: I. Theory

Dan Gabriel Cacuci
Department of Mechanical Engineering, University of South Carolina, Columbia, SC, USA.
DOI: 10.4236/ajcm.2020.102015 PDF HTML XML 343 Downloads 955 Views Citations

Abstract

This work presents the first-order comprehensive adjoint sensitivity analysis methodology (1st-CASAM) for computing efficiently, exactly, and exhaustively, the first-order sensitivities of scalar-valued responses (results of interest) of coupled nonlinear physical systems characterized by imprecisely known model parameters, boundaries and interfaces between the coupled systems. The 1st-CASAM highlights the conclusion that response sensitivities to the imprecisely known domain boundaries and interfaces can arise both from the definition of the system’s response as well as from the equations, interfaces and boundary conditions defining the model and its imprecisely known domain. By enabling, in premiere, the exact computations of sensitivities to interface and boundary parameters and conditions, the 1st-CASAM enables the quantification of the effects of manufacturing tolerances on the responses of physical and engineering systems. Ongoing research will generalize the methodology presented in this work, aiming at computing exactly and efficiently higher-order response sensitivities for coupled systems involving imprecisely known interfaces, parameters, and boundaries.

Keywords

Adjoint Sensitivity Analysis (1st-CASAM), Response Sensitivities for Coupled Nonlinear Systems, Imprecisely Known Interfaces, Imprecisely Known Parameters, Imprecisely Known Boundaries

Share and Cite:

Cacuci, D.G. (2020) The First-Order Comprehensive Sensitivity Analysis Methodology (1st-CASAM) for Scalar-Valued Responses: I. Theory. American Journal of Computational Mathematics, 10, 275-289. doi: 10.4236/ajcm.2020.102015.

1. Introduction

Many works have been published on using adjoint operators for computing first- and second-order sensitivities (i.e., functional derivatives) of model responses (i.e., results produced by models) to imprecisely known model parameters since the original work of Wigner [1] on the linear neutron transport equation and the introduction of the first-order adjoint sensitivity analysis methodology for nonlinear systems by Cacuci [2] [3]. Representative works in this regard are cited in the books by Cacuci [4] [5], along with the original presentations of the first- and the second-order adjoint sensitivities analysis methodologies. It is well known that the adjoint method of sensitivity analysis [2] [3] [4] [5] enables the most efficient computation of the exact (to machine or to a priori set precision) response sensitivities to model parameters. The efficiency of the second-order adjoint sensitivity analysis methodology developed by Cacuci [5] has been recently demonstrated by its application to a OECD/NEA reactor physics benchmark [6] to compute [7] - [12] exactly the 21,976 first-order sensitivities and 482,944,576 second-order sensitivities of this benchmark’s response with respect to the benchmark’s model parameters, showing in particular that the effects of the 2^nd-order sensitivities on the uncertainty in the model’s response are even more important than the effects of the 1^st-order ones. Another step towards overcoming the curse of dimensionality in sensitivity analysis, uncertainty quantification and predictive modeling has been provided by the third-order adjoint sensitivity analysis methodology for linear systems provided recently by Cacuci [13].

However, none of the works cited above are capable of computing response sensitivities to imprecisely known domain internal and/or external boundaries. Very few works have attempted to develop mathematical/computational methodologies for computing exactly the first-order sensitivities of responses to imprecisely known boundaries. The representative works (Komata [14], Larsen and Pomraning [15], Rahnema and Pomraning [16], McKinley and Rahnema [17], Favorite and Gonzalez [18]) that have addressed this issue were limited to specific linear neutron transport or diffusion problems. Furthermore, none of the works published thus far have addressed, in a general theoretical/mathematical setting, the simultaneous computation of response sensitivities to imprecisely known model parameters, imprecisely known internal boundaries/interfaces between nonlinear systems that model coupled yet distinct physical processes, and/or imprecisely known external boundaries.

This work presents the mathematical foundations of a new method for computing efficiently, exactly and exhaustively, the first-order response sensitivities for coupled nonlinear physical systems characterized by imprecisely known parameters that describe not only processes within the system but also at the physical interfaces between systems, as well as at the systems’ imprecisely known domain boundaries. This new method will be called the first-order comprehensive adjoint sensitivity analysis methodology (1st-CASAM). Notably, the 1st-CASAM enables the quantification of the effects of manufacturing tolerances on the responses of physical and engineering systems.

This work is structured as follows: Section 2 presents the mathematical framework of two coupled generic nonlinear physical systems comprising imprecisely known parameters, internal interfaces, and external boundaries. Section 3 presents the mathematical framework of the 1st-CASAM, which enables the efficient computation of the exact sensitivities of a scalar-valued response with respect to the imprecisely known parameters, interfaces, and boundaries that characterize the generic coupled nonlinear physical systems. As is well known [19], the availability of response sensitivities to imprecisely known parameters, interfaces and boundaries is essential for a variety of subsequent uses, including uncertainty quantification, optimization, data assimilation, model calibration and validation, and reduction of uncertainties in predicted model results. Section 4 offers concluding remarks.

The sequel to this work [20] presents an illustrative application of the 1st-CASAM to a benchmark problem [21] [22] [23] that models coupled heat conduction and convection in a physical system comprising an electrically heated rod surrounded by a coolant which simulates the geometry of an advanced (“Generation-IV”) nuclear reactor [24]. This benchmark problem [21] [22] [23] admits exact closed-form solutions for the sensitivities of the temperature distribution in the coupled rod/coolant system which can be used to benchmark thermal-hydraulics production codes. In particular, this benchmark [21] [22] [23] was used to verify the numerical results produced by the FLUENT Adjoint Solver [25], showing that that the current “FLUENT Adjoint Solver” cannot compute any sensitivities for the temperature distribution within the solid rod. Although the “FLUENT Adjoint Solver” is capable of computing sensitivities of fluid temperatures to boundary parameters (e.g., boundary temperature, boundary velocity, boundary pressure), it yields accurate results only for the sensitivities of the fluid outlet temperature and the maximum rod surface temperature to the inlet temperature and inlet velocity, respectively.

2. Mathematical Modeling of Generic Coupled Nonlinear Physical Systems Comprising Imprecisely Known Parameters, Interfaces and Boundaries

The physical system considered in this work comprises two nonlinear physical systems which are coupled to one another across a common internal interface (boundary) in phase-space. Each system comprises imprecisely known model parameters, including imprecisely known parameters that characterize the interface between the systems and the systems’ outer boundaries. The first physical system is represented mathematically as follows:

$A [u (x); α] = Q^{(A)} (α; x), x \in Ω_{x}$ (1)

Bold letters will be used in this work to denote matrices and vectors. Unless explicitly stated otherwise, the vectors in this work are considered to be column vectors. The second system is represented mathematically as follows:

$B [v (y); α] = Q^{(B)} (α; y), y \in Ω_{y}$ (2)

If differential operators appear in Equations (1) and (2), a corresponding set of boundary and/or initial/final conditions must also be given; these conditions can be represented in operator form as follows:

$C [u (x), v (y); α; x, y] = 0, x \in δ Ω_{x}, y \in δ Ω_{y}$ (3)

The quantities appearing in Equations (1)-(3) are defined as follows:

1) $α ≜ {(α_{1}, \dots, α_{Z_{α}})}^{†} \in ℝ^{Z_{α}}$ denotes a $Z_{α}$ -dimensional column vector whose scalar-valued components are all of the imprecisely known internal and boundary parameters (both of) the physical systems, including imprecisely known parameters that characterize the interface and boundary conditions. Some of these parameters will be common to both physical systems, particularly those that characterize common interfaces. These scalar parameters are considered to be imperfectly known, subject to uncertainties. The minimum information needed for these parameters is their nominal or average values, which will be denoted as $α^{0} ≜ {(α_{1}^{0}, \dots, α_{Z_{α}}^{0})}^{†}$ . The superscript “zero” will be used in this work to denote known nominal or average values of various quantities. The symbol “ $≜$ ” will be used to denote “is defined as” or “is by definition equal to” and transposition will be indicated by a dagger ( $†$ ) superscript.

2) $x ≜ {(x_{1}, \dots, x_{Z_{x}})}^{†} \in ℝ^{Z_{x}}$ denotes the $Z_{x}$ -dimensional phase-space position vector of independent variables for the system defined in Equation (1). The vector of independent variables $x$ is defined on a phase-space domain denoted as $Ω_{x}$ which is defined as $Ω_{x} ≜ {- \infty \leq a_{i} (α) \leq x_{i} \leq b_{i} (α) \leq \infty; i = 1, \dots, Z_{x}}$ . The lower-valued imprecisely known boundary-point of the independent variable $x_{i}$ is denoted as $a_{i} (α)$ , while the upper-valued imprecisely known boundary-point of the independent variable $x_{i}$ is denoted as $b_{i} (α)$ . For physical systems modeled by diffusion theory, for example, the “vacuum boundary condition” requires that the particle flux vanish at the “extrapolated boundary” of the spatial domain facing the vacuum; the “extrapolated boundary” depends on the imprecisely known geometrical dimensions of the system’s domain in space and also on the system’s microscopic transport cross sections and atomic number densities. The boundary $\partial Ω_{x} ≜ {a (α) \cup b (α)}$ of the domain $Ω_{x}$ comprises all of the endpoints $a (α) ≜ {[a_{1} (α), \dots, a_{Z_{x}} (α)]}^{†}$ and $b (α) ≜ {[b_{1} (α), \dots, b_{Z_{x}} (α)]}^{†}$ of the intervals on which the respective components of $x$ are defined. It may happen that some components $a_{i} (α)$ and/or $b_{j} (α)$ are infinite, in which case they would not depend on any imprecisely known parameters.

3) $y ≜ {(y_{1}, \dots, y_{Z_{y}})}^{†} \in ℝ^{Z_{y}}$ denotes the $Z_{y}$ -dimensional phase-space position vector of independent variables for the physical system defined in Equation . The vector of independent variables $y$ is defined on a phase-space domain denoted as $Ω_{y}$ which is defined as follows:

$Ω_{y} ≜ {- \infty \leq c_{j} (α) \leq y_{j} \leq d_{j} (α) \leq \infty; j = 1, \dots, Z_{y}}$ . The lower-valued imprecisely known boundary-point of the independent variable $y_{j}$ is denoted as $c_{j} (α)$ , while the upper-valued imprecisely known boundary-point of the independent variable $y_{j}$ is denoted as $d_{j} (α)$ . Some or all of the points $c_{j} (α)$ may coincide with the points $b_{j} (α)$ . Also, some components of $y$ may coincide with some components of $x$ .

4) $u (x) ≜ {[u_{1} (x), \dots, u_{Z_{u}} (x)]}^{†}$ denotes a $Z_{u}$ -dimensional column vector whose components represent the system’s dependent variables (also called “state functions”). The vector-valued function $u (x)$ is considered the unique nontrivial solution of the physical problem described by Equations (1) and (2).

5) $v (y) ≜ {[v_{1} (y), \dots, v_{Z_{v}} (y)]}^{†}$ denotes a $Z_{v}$ -dimensional column vector whose components represent the system’s dependent variables (also called “state functions”); The vector-valued function $v (y)$ is considered the unique nontrivial solution of the physical problem described by Equations (2) and (3).

6) $A [u (x); α] ≜ {[A_{1} (u; α), \dots, A_{i} (u; α), \dots, A_{Z_{u}} (u; α)]}^{†}, i = 1, \dots, Z_{u}$ denotes a column vector of dimensions $Z_{u}$ whose components are operators (including differential, difference, integral, distributions, and/or infinite matrices) acting nonlinearly on $u (x)$ and $α$ .

7) $B [v (y); α] ≜ {[B_{1} (v; α), \dots, B_{i} (v; α), \dots, B_{Z_{v}} (v; α)]}^{†}, i = 1, \dots, Z_{v}$ denotes a column vector of dimensions $Z_{v}$ whose components are operators (including differential, difference, integral, distributions, and/or infinite matrices) acting nonlinearly on $v (y)$ and $α$ .

8) $Q^{(A)} (α; x) ≜ {[Q_{1}^{(A)} (α; x), \dots, Q_{Z_{u}}^{(A)} (α; x)]}^{†}$ denotes a $Z_{u}$ -dimensional column vector whose elements represent inhomogeneous source terms that depend either linearly or nonlinearly on $α$ . The components of $Q^{(A)} (α; x)$ may involve operators, rather than just finite-dimensional functions, and distributions acting on $α$ and $x$ .

9) $Q^{(B)} (α; y) ≜ {[Q_{1}^{(B)} (α; y), \dots, Q_{Z_{v}}^{(B)} (α; y)]}^{†}$ denotes a $Z_{v}$ -dimensional column vector whose elements represent inhomogeneous source terms that depend either linearly or nonlinearly on $α$ . The components of $Q^{(B)} (α; y)$ may involve operators, rather than just finite-dimensional functions, and distributions acting on $α$ and $y$ .

10) The vector-valued operator $C [u (x), v (y); α; x, y]$ comprises all of the boundary, interface, and initial/final conditions for the coupled physical systems. If the boundary, interface and/or initial/final conditions are inhomogeneous, which is most often the case, then $C [0, 0; α; x, y] \neq 0$ .

11) Since $Q^{(A)} (α; x)$ and may involve operators and distributions acting on $α$ and $y$ , all of the equalities in this work, including Equations (1)-(3), are considered to hold in the weak (“distributional”) sense, since the right-sides (“sources”) of and of other various equations to be derived in this work may contain distributions (“generalized functions/functionals”), particularly Dirac-distributions and derivatives and/or integrals thereof.

The nominal solutions of Equations (1)-(3) will be denoted as $u^{0} (x)$ and $v^{0} (y)$ ; they are obtained by solving these equations at the nominal parameter values $α^{0}$ . In other words, the vectors $u^{0} (x)$ and $v^{0} (y)$ satisfy the following equations:

$A [u^{0} (x); α^{0}] = Q^{(A)} (α^{0}; x), x \in Ω_{x}$ (4)

$B [v^{0} (y); α^{0}] = Q^{(B)} (α^{0}; y), y \in Ω_{y}$ (5)

$C [u^{0} (x), v^{0} (y); α^{0}; x, y] = 0, x \in δ Ω_{x}, y \in δ Ω_{y}$ (6)

Equations (4)-(6) represent the “base-case” or nominal state of the physical system. Throughout this work, the superscript “0” will be used to denote “nominal” or “expected” values.

The response considered in this work is a generic scalar-valued operator (i.e., a functional) of the state functions, denoted as follows:

$R [u (x), v (y); α; x, y]$ . (7)

The nominal value of the response, denoted as $R^{0} ≜ R [u^{0} (x), v^{0} (y); α^{0}; x, y]$ , is determined by computing the response at the nominal values $α^{0}$ , $u^{0} (x)$ and $v^{0} (y)$ .

3. Mathematical Framework of the 1st-CASAM for Operator-Valued Responses for Coupled Linear Physical Systems Comprising Imprecisely Known Parameters, Interfaces and Boundaries

As has been mentioned in the foregoing, the model and boundary parameters are considered to be imprecisely known quantities. Their true values may differ from their nominal (average, or “base-case”) values by variations denoted as $δ α ≜ (δ α_{1}, \dots, δ α_{N_{α}})$ , where $δ α_{i} ≜ α_{i} - α_{i}^{0}$ , $i = 1, \dots, N_{α}$ . In turn, the parameter variations $δ α$ will cause variations $δ u (x) ≜ {[δ u_{1} (x), \dots, δ u_{Z_{u}} (x)]}^{†}$ and $δ v (y) ≜ {[δ v_{1} (y), \dots, δ v_{Z_{v}} (y)]}^{†}$ in the state functions, through Equations (1)-(3). Furthermore, the variations $δ α$ , $δ u (x)$ and $δ v (y)$ will cause variations in the response $R [u (x), v (y); α; x, y]$ around the nominal response value $R^{0}$ . Sensitivity analysis aims at computing the functional derivatives (called “sensitivities”) of the response to the imprecisely known parameters $α$ . Subsequently, these sensitivities can be used for a variety of purposes, including quantifying the uncertainties induced in responses by the uncertainties in the model and boundary parameters, combining the uncertainties in computed responses with uncertainties in measured response (“data assimilation”) to obtain more accurate predictions of responses and/or parameters (“model calibration,” “predictive modeling”, etc.). As has been shown by Cacuci [2] [3], the most general definition of the 1^st-order total sensitivity of an operator-valued model response to parameter variations is provided by the first-order “Gateaux-variation” (G-variation) of the response under consideration. To determine the first G-variation of the response $R [u (x), v (y); α; x, y]$ , it is convenient to denote the functions appearing in the argument of the response as being the components of a vector $e ≜ {[u (x), v (y); α]}^{†}$ , which represents an arbitrary “point” in the combined phase-space of the state functions and (all) parameters. The point which corresponds to the nominal values of the state functions and parameters in this phase space is denoted as $e^{0} ≜ {[u^{0} (x), v^{0} (y); α^{0}]}^{†}$ . Analogously, it is convenient to consider the variations in the model’s state functions and parameters to be the components of a “vector of variations”, $δ e$ , defined as follows: $δ e ≜ {[δ u (x), δ v (y); δ α]}^{†}$ . The 1^st-order Gateaux- (G-) variation of the response $R (e)$ , which will be denoted as $δ R (e^{0}; δ e)$ , for arbitrary variations $δ e$ in the model parameters and state functions in a neighborhood ( $e^{0} + ε δ e$ ) around $e^{0}$ , is obtained, by definition, as follows:

$δ R (e^{0}; δ e) ≜ {\frac{d}{d ε} R [u^{0} (x) + ε δ u (x), v^{0} (y) + ε δ v (y); α^{0} + ε δ α; x, y]}_{ε = 0}$ (8)

The existence of the G-variation $δ R (e^{0}; δ e)$ does not guarantee its numerical computability. Numerical methods most often require that $δ R (e^{0}; δ e)$ be linear in the variations $δ e$ in a neighborhood ( $e^{0} + ε δ e$ ) around $e^{0}$ . The necessary and sufficient conditions for the G-differential $δ R (e^{0}; δ e)$ of a nonlinear operator $R (e)$ to be linear in $δ e$ in a neighborhood ( $e^{0} + ε δ e$ ) around $e^{0}$ , and thus admit partial and total G-derivatives, are as follows:

1) $R (e)$ satisfies a weak Lipschitz condition at $e^{0}$ ; (9)

2) for two arbitrary vectors of variations $δ e_{1}$ and $δ e_{2}$ , the operator $R (e)$ satisfies the relation

$R (e^{0} + ε δ e_{1} + ε δ e_{2}) - R (e^{0} + ε δ e_{1}) - R (e^{0} + ε δ e_{2}) + R (e^{0}) = o (ε)$ (10)

If the G-variation $δ R (e^{0}; δ e)$ is linear in $δ e$ , then the function $δ R (e^{0}; δ e)$ is called the G-differential of $R (e)$ and is usually denoted as $D R (e^{0}; δ e)$ . Furthermore, the result of the differentiations indicated on the right-side of the definition provided in Equation (8) can be written as follows:

$D R (e^{0}; δ e) = {D R (e^{0}; δ α)}^{d i r e c t} + {D R (e^{0}; δ u, δ v)}^{i n d i r e c t},$ (11)

where the so-called “direct-effect” term is defined as follows:

${D R (e^{0}; δ α)}^{d i r e c t} ≜ {\frac{\partial R}{\partial α}}_{(e^{0})} δ α,$ (12)

while the so-called “indirect-effect” term is defined as follows:

${D R (e^{0}; δ u, δ v)}^{i n d i r e c t} ≜ {\frac{\partial R}{\partial u}}_{(e^{0})} δ u (x) + {\frac{\partial R}{\partial v}}_{(e^{0})} δ v (y) .$ (13)

In Equations (12) and (13), the vectors $\partial R / \partial u$ , $\partial R / \partial v$ and $\partial R / \partial α$ comprise, as components, the first-order partial G-derivatives computed at the phase-space point $e^{0}$ . The G-differential $D R (e^{0}; δ e)$ is an operator defined on the same domain as $R (e)$ and has the same range as $R (e)$ . The G-differential $D R (e^{0}; δ e)$ satisfies the relation $R (e^{0} + ε δ e) - R (e^{0}) = D R (e^{0}; δ e) + Δ (δ e)$ , with $\lim_{ε \to 0} [Δ (ε δ e)] / ε = 0$ .

The “direct effect” term ${D R (e^{0}; δ α)}^{d i r e c t}$ depends only on the parameter variations $δ α$ and can therefore be computed immediately, since it does not depend on the variations $δ u$ and $δ v$ . On the other hand, the “indirect effect” term ${D R (e^{0}; δ u, δ v)}^{i n d i r e c t}$ depends indirectly on the parameter variations $δ α$ through the yet unknown variations $δ v$ and $δ v$ in the state functions, which are the solutions of the system of equations obtained by applying the definition of the G-differential to Equations (1)-(3), to obtain the following relations:

$\begin{array}{l} {\frac{d}{d ε} A [u^{0} (x) + ε δ u (x); α^{0} + ε δ α]}_{ε = 0} \\ = {\frac{d}{d ε} Q^{(A)} (α^{0} + ε δ α; x)}_{ε = 0}, x \in δ Ω_{x}, \end{array}$ (14)

$\begin{array}{l} {\frac{d}{d ε} B [v^{0} (y) + ε δ v (y); α^{0} + ε δ α]}_{ε = 0} \\ = {\frac{d}{d ε} Q^{(B)} (α^{0} + ε δ α; y)}_{ε = 0}, y \in δ Ω_{y}, \end{array}$ (15)

$\begin{array}{l} {\frac{d}{d ε} C [u^{0} (x) + ε δ u (x), v^{0} (y) + ε δ v (y); α^{0} + ε δ α; x, y]}_{ε = 0} \\ = 0, x \in δ Ω_{x}, y \in δ Ω_{y} . \end{array}$ (16)

Performing in Equations (14)-(16) the differentiations with respect to $ε$ and setting $ε = 0$ in the resulting expressions yields the following system of equations:

${\frac{\partial A (u; α)}{\partial u}}_{(e^{0})} δ u (x) = {Q_{1}^{(1)} (u; α; δ α)}_{(e^{0})},$ (17)

${\frac{\partial B (v; α)}{\partial v}}_{(e^{0})} δ v (y) = {Q_{2}^{(1)} (v; α; δ α)}_{(e^{0})},$ (18)

$\begin{array}{l} {\frac{\partial C [u (x), v (y); α; x, y]}{\partial u}}_{(e^{0})} δ u (x) + {\frac{\partial C [u (x), v (y); α; x, y]}{\partial v}}_{(e^{0})} δ v (y) \\ + {\frac{\partial C [u (x), v (y); α; x, y]}{\partial α}}_{(e^{0})} δ α = 0, x \in δ Ω_{x}, y \in δ Ω_{y} . \end{array}$ (19)

where

${Q_{1}^{(1)} (u, α; δ α)}_{(e^{0})} ≜ {\frac{\partial [Q^{(A)} (α; x) - A [u (x); α]]}{\partial α}}_{(e^{0})} δ α,$ (20)

${Q_{2}^{(1)} (v, α; δ α)}_{(e^{0})} ≜ {\frac{\partial [Q^{(B)} (α; y) - B [v (y); α]]}{\partial α}}_{(e^{0})} δ α,$ (21)

The system of equations comprising Equations (17)-(19) is called the “First-Level Forward Sensitivity System” (1^st-LFSS) and could be solved to obtain the variations $δ v$ and $δ v$ in the state functions in terms of the parameter variations $δ α$ which appear as sources in the 1^st-LFSS equations. Subsequently, the variations $δ v$ and $δ v$ thus obtained could be used to compute the indirect-effect term defined in Equation (13).

However, since there are at least $Z_{α}$ variations to consider, it becomes prohibitively expensive computationally to solve in practice the 1^st-LFSS, which may comprise differential and or integral operators, for all possible parameter variations $δ α_{i}, i = 1, \dots, Z_{α}$ . The need for solving repeatedly the 1^st-LFSS for every possible parameter variation $δ α_{i}, i = 1, \dots, Z_{α}$ can be circumvented by applying the concepts first outlined by Cacuci [2] [3] to construct a “First-Level Adjoint Sensitivity System” (1^st-LASS), the solution of which will be used to eliminate the appearance of the variations $δ v$ and $δ v$ in the expression of the indirect-effect term defined in Equation (13). The 1^st-LASS is constructed by implementing the following sequence of steps:

1) Introduce a Hilbert space pertaining to the domain $Ω_{x}$ , denoted as $H_{u}$ , comprising square-integrable vector-valued elements of the same form as the vectors $u (x)$ and $δ u (x)$ . The inner product underlying $H_{u}$ , between two

elements $g^{(α)} (x) ≜ {[g_{1}^{(α)} (x), \dots, g_{Z_{u}}^{(α)} (x)]}^{†} \in H_{u}$ and $g^{(β)} (x) ≜ {[g_{1}^{(β)} (x), \dots, g_{Z_{u}}^{(β)} (x)]}^{†} \in H_{u}$ is denoted as ${〈 g^{(α)} (x), g^{(β)} (x) 〉}_{u}$ and defined as follows:

${〈 g^{(α)} (x), g^{(β)} (x) 〉}_{u} ≜ \sum_{n = 1}^{Z_{u}} \int_{a_{1} (α)}^{b_{1} (α)} d x_{1} \dots \int_{a_{j} (α)}^{b_{j} (α)} d x_{j} \dots \int_{a_{Z_{x}} (α)}^{b_{Z_{x}} (α)} g_{n}^{(α)} (x) g_{n}^{(β)} (x) d x_{Z_{x}}$ (22)

2) Introduce a Hilbert space pertaining to the domain $Ω_{y}$ , denoted as $H_{v}$ , comprising square-integrable vector-valued elements of the same form as the vectors $v (y)$ and $δ v (y)$ , i.e., $h^{(α)} (y) ≜ {[h_{1}^{(α)} (y), \dots, h_{Z_{v}}^{(α)} (y)]}^{†} \in H_{v}$ and $h^{(β)} (y) ≜ {[h_{1}^{(β)} (y), \dots, h_{Z_{v}}^{(β)} (y)]}^{†} \in H_{v}$ The Hilbert space $H_{v}$ is endowed with an inner product denoted as ${〈 h^{(α)} (y), h^{(β)} (y) 〉}_{v}$ , which is defined as follows:

${〈 h^{(α)} (y), h^{(β)} (y) 〉}_{v} ≜ \sum_{n = 1}^{Z_{y}} \int_{c_{1} (α)}^{d_{1} (α)} d y_{1} \dots \int_{c_{j} (α)}^{d_{j} (α)} d y_{j} \dots \int_{c_{Z_{y}} (α)}^{d_{Z_{y}} (α)} h_{n}^{(α)} (y) h_{n}^{(β)} (y) d y_{Z_{y}}$ (23)

3) In the Hilbert $H_{u}$ , form the inner product of Equation (17) with a yet undefined vector-valued function $ψ_{1}^{(1)} (x) ≜ {[ψ_{11}^{(1)} (x), \dots, ψ_{1 Z_{u}}^{(1)} (x)]}^{†} \in H_{u}$ to obtain the following relation:

${〈 ψ_{1}^{(1)} (x), {\frac{\partial A (u; α)}{\partial u}}_{(e^{0})} δ u (x) 〉}_{u} = {〈 ψ_{1}^{(1)} (x), {Q_{1}^{(1)} (u; α; δ α)}_{(e^{0})} 〉}_{u} .$ (24)

4) Using the definition of the adjoint operator in the Hilbert space $H_{u}$ , recast the left-side of Equation (24) as follows:

$\begin{array}{l} {〈 ψ_{1}^{(1)} (x), {\frac{\partial A (u; α)}{\partial u}}_{(e^{0})} δ u (x) 〉}_{u} \\ = {〈 δ u (x), {A^{*} (u; α)}_{(e^{0})} ψ_{1}^{(1)} (x) 〉}_{u} + {P_{A}^{(1)} {[δ u (x); ψ_{1}^{(1)} (x); α]}_{δ Ω_{x}}}_{(e^{0})}, \end{array}$ (25)

where ${P_{A}^{(1)} {[δ u (x); ψ_{1}^{(1)} (x); α]}_{δ Ω_{x}}}_{(e^{0})}$ denotes the bilinear concomitant evaluated on the boundary $δ Ω_{x}$ . In Equation (25), the operator $A^{*} (u; α)$ is the formal adjoint of $\partial A (u; α) / \partial u$ .

5) Replace the left-side of Equation (24) by the right-side of Equation (25) to obtain the following relation:

$\begin{array}{l} {〈 δ u (x), {A^{*} (u; α)}_{(e^{0})} ψ_{1}^{(1)} (x) 〉}_{u} \\ = {〈 ψ_{1}^{(1)} (x), {Q_{1}^{(1)} (u; α; δ α)}_{(e^{0})} 〉}_{u} - {P_{A}^{(1)} {[δ u (x); ψ_{1}^{(1)} (x); α]}_{δ Ω_{x}}}_{(e^{0})} . \end{array}$ (26)

6) In the Hilbert $H_{v}$ , form the inner product of Equation (18) with a yet undefined vector-valued function $ψ_{2}^{(1)} (y) ≜ {[ψ_{21}^{(1)} (y), \dots, ψ_{2 Z_{v}}^{(1)} (y)]}^{†} \in H_{v}$ to obtain the following relation:

${〈 ψ_{2}^{(1)} (y), {\frac{\partial B (v; α)}{\partial v}}_{(e^{0})} δ v (y) 〉}_{v} = {〈 ψ_{2}^{(1)} (y), {Q_{2}^{(1)} (v; α; δ α)}_{(e^{0})} 〉}_{v}$ . (27)

7) Using the definition of the adjoint operator in the Hilbert space $H_{v}$ , recast the left-side of Equation (24) as follows:

$\begin{array}{l} {〈 ψ_{2}^{(1)} (y), {\frac{\partial B (v; α)}{\partial v}}_{(e^{0})} δ v (y) 〉}_{v} \\ = {〈 δ v (y), {B^{*} (v; α)}_{(e^{0})} ψ_{2}^{(1)} (y) 〉}_{v} + {P_{B}^{(1)} {[δ v (y); ψ_{2}^{(1)} (y); α]}_{δ Ω_{y}}}_{(e^{0})}, \end{array}$ (28)

where ${P_{B}^{(1)} {[δ v (y); ψ_{2}^{(1)} (y); α]}_{δ Ω_{y}}}_{(e^{0})}$ denotes the bilinear concomitant evaluated on the boundary $δ Ω_{y}$ . In Equation (28), the operator $B^{*} (v; α)$ is the formal adjoint of $\partial B (v; α) / \partial v$ .

8) Replace the left-side of Equation (27) by the right-side of Equation (28) to obtain the following relation:

$\begin{array}{l} {〈 δ v (y), {B^{*} (v; α)}_{(e^{0})} ψ_{2}^{(1)} (y) 〉}_{v} \\ = {〈 ψ_{2}^{(1)} (y), {Q_{2}^{(1)} (v; α; δ α)}_{(e^{0})} 〉}_{v} - {P_{B}^{(1)} {[δ v (y); ψ_{2}^{(1)} (y); α]}_{δ Ω_{y}}}_{(e^{0})} . \end{array}$ (29)

9) Add Equations (29) and (26) to obtain:

$\begin{array}{l} {〈 δ u (x), {A^{*} (u; α)}_{(e^{0})} ψ_{1}^{(1)} (x) 〉}_{u} + {〈 δ v (y), {B^{*} (v; α)}_{(e^{0})} ψ_{2}^{(1)} (y) 〉}_{v} \\ = {〈 ψ_{1}^{(1)} (x), {Q_{1}^{(1)} (u; α; δ α)}_{(e^{0})} 〉}_{u} + {〈 ψ_{2}^{(1)} (y), {Q_{2}^{(1)} (v; α; δ α)}_{(e^{0})} 〉}_{v} \\ - {P_{A}^{(1)} {[δ u (x); ψ_{1}^{(1)} (x); α]}_{δ Ω_{x}}}_{(e^{0})} - {P_{B}^{(1)} {[δ v (y); ψ_{2}^{(1)} (y); α]}_{δ Ω_{y}}}_{(e^{0})} . \end{array}$ (30)

10) The next step is to relate the right-side of Equation (30) with the indirect-effect term ${D R (e^{0}; δ u, δ v)}^{i n d i r e c t}$ defined in Equation (13). Since the response considered is a functional of $u$ and $v$ , the G-differential $R (e)$ is also a functional of $δ u (x)$ and $δ v (y)$ . Consequently, the well-known Riesz representation theorem (which states that every functional can be expressed uniquely in terms of the inner product pertaining to the respective Hilbert space) ensures that the indirect-effect term ${D R (e^{0}; δ u, δ v)}^{i n d i r e c t}$ can be expressed uniquely as follows:

${D R (e^{0}; δ u, δ v)}^{i n d i r e c t} ≜ {〈 δ u (x), {{(\frac{\partial R}{\partial u})}^{†}}_{(e^{0})} 〉}_{u} + {〈 δ v (y), {{(\frac{\partial R}{\partial v})}^{†}}_{(e^{0})} 〉}_{v} .$ (31)

11) Identifying the right-side of Equation (31) with the left-side of Equation (30) indicates that the indirect-effect term ${D R (e^{0}; δ u, δ v)}^{i n d i r e c t}$ would be equal to the right side of Equation (30) provided that the following relations are satisfied by the yet undetermined functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ :

${A^{*} (u; α)}_{(e^{0})} ψ_{1}^{(1)} (x) = {{(\frac{\partial R}{\partial u})}^{†}}_{(e^{0})}$ (32)

${B^{*} (v; α)}_{(e^{0})} ψ_{2}^{(1)} (y) = {{(\frac{\partial R}{\partial v})}^{†}}_{(e^{0})} .$ (33)

12) Using Equations (31)-(33) in Equation (30) transforms the latter into the following form:

$\begin{array}{l} {D R (e^{0}; δ u, δ v)}^{i n d i r e c t} \\ = {〈 ψ_{1}^{(1)} (x), {Q_{1}^{(1)} (u; α; δ α)}_{(e^{0})} 〉}_{u} + {〈 ψ_{2}^{(1)} (y), {Q_{2}^{(1)} (v; α; δ α)}_{(e^{0})} 〉}_{v} \\ - {P_{A}^{(1)} {[δ u (x); ψ_{1}^{(1)} (x); α]}_{δ Ω_{x}}}_{(e^{0})} - {P_{B}^{(1)} {[δ v (y); ψ_{2}^{(1)} (y); α]}_{δ Ω_{y}}}_{(e^{0})} . \end{array}$ (34)

13) The boundary, interface and initial/final conditions for the functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ are now determined by imposing the following requirements:

a) Implement the boundary, interface and initial/final conditions given in Equation (19) into the bilinear concomitants in Equation (34).

b) Eliminate the remaining unknown boundary, interface and initial/final conditions involving the functions $δ u (x)$ and $δ v (y)$ from the expression of the bilinear concomitants in Equation (34) by selecting boundary, interface and initial/final conditions for the functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ such that the selected conditions for $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ must be independent of unknown values of $δ u (x)$ , $δ v (y)$ and $δ α$ while ensuring that Equations (32) and (33) are well posed. The boundary conditions thus chosen for the adjoint functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ can be represented in operator form as follows:

${C_{A}^{(1)} [u (x); v (y); ψ_{1}^{(1)} (x), ψ_{2}^{(1)} (y); α; x, y]}_{(e^{0})} = 0, x \in \partial Ω_{x}, y \in δ Ω_{y}$ (35)

where the subscript “A” indicates “adjoint”.

14) The selection of the boundary conditions for the adjoint functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (x)$ represented by Equation (35) eliminates the appearance of any unknown values of the variations $δ u (x)$ and $δ v (y)$ in the bilinear concomitants in Equation (34) and reduces these concomitants to a residual quantity that contains boundary terms involving only known values of $δ α$ , $u (x)$ , $v (y)$ , $ψ_{1}^{(1)} (x)$ , $ψ_{2}^{(1)} (y)$ , $α$ . This residual quantity will be denoted as ${{\hat{P}}^{(1)} [u (x); v (y); ψ_{1}^{(1)} (x), ψ_{2}^{(1)} (y); α; x, y; δ α]}_{(e^{0})}$ . In general, this residual quantity does not automatically vanish, although it may do so in particular instances. In principle, ${{\hat{P}}^{(1)} [u (x); v (y); ψ_{1}^{(1)} (x), ψ_{2}^{(1)} (y); α; x, y; δ α]}_{(e^{0})}$ could be forced to vanish, if necessary, by considering extensions, in the operator sense, of the linear operators $A^{*} (u; α)$ and/or $B^{*} (v; α)$ , but such extensions seldom need to be used in practice.

15) Using the conditions represented by Equations (19) and (35) in Equation (34) yields the following (final) expression for the indirect-effect term ${D R (e^{0}; δ u, δ v)}_{i n d i r e c t}$ :

$\begin{array}{l} {D R (e^{0}; δ u, δ v)}^{i n d i r e c t} \\ = - {{\hat{P}}^{(1)} [u (x); v (y); ψ_{1}^{(1)} (x), ψ_{2}^{(1)} (y); α; x, y; δ α]}_{(e^{0})} \\ + {〈 ψ_{1}^{(1)} (x), {Q_{1}^{(1)} (u; α; δ α)}_{(e^{0})} 〉}_{u} + {〈 ψ_{2}^{(1)} (y), {Q_{2}^{(1)} (v; α; δ α)}_{(e^{0})} 〉}_{v} \\ \equiv {D R (e^{0}; ψ_{1}^{(1)}, ψ_{2}^{(1)})}^{i n d i r e c t} . \end{array}$ (36)

As the expression in Equation (36) indicates, the desired elimination of the unknown variations $δ u$ and $δ v$ from the original expression of ${D R (e^{0}; δ u, δ v)}^{i n d i r e c t}$ given in Equation

(13) has been accomplished by having replaced them by expressions involving the functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ , which do not depend on any parameter variations, a fact that has been underscored by having explicitly indicated that the indirect-effect term can now be written in the form

${D R (e^{0}; ψ_{1}^{(1)}, ψ_{2}^{(1)})}^{i n d i r e c t}$ .

The system of equations represented by Equations (32), (33), and (35) is called the First-Level Adjoint Sensitivity System (1^st-LASS) and the functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ are called the “first-level adjoint sensitivity functions.” The essential feature of the 1^st-LASS is that it is independent of parameter variations (in contradistinction to the 1^st-LFSS), so it needs to be solved only once per response to obtain the first-level adjoint sensitivity functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ . Once the adjoint functions $ψ_{1}^{(1)} (x)$ and $ψ_{2}^{(1)} (y)$ are available, they can be used

in Equation (36) to compute the indirect-effect term ${D R (e^{0}; δ u, δ v)}_{i n d i r e c t}$

exactly and efficiently, using quadrature formulas, which are many orders of magnitude faster to compute then solving the operator (differential, integral) equations that underlie the 1^st-LFSS. As is well known [2] [3] [4] [5], it is this property that makes the adjoint sensitivity analysis method “unbeatable” when needing to compute the sensitivities of functional-valued responses to many imprecisely known parameters.

4. Concluding Remarks

This work has presented the first-order comprehensive adjoint sensitivity analysis methodology (1^st-CASAM) for computing efficiently, exhaustively and exactly, the first-order response sensitivities for coupled nonlinear physical systems characterized by imprecisely known parameters characterizing the systems, the interfaces between systems and the systems’ domain boundaries. The 1^st-CASAM fundamentally generalizes and extends all previously published theoretical works on this topic, also enabling the quantification of the effects of manufacturing tolerances on the responses of physical and engineering systems. The 1^st-CASAM highlights the conclusion that response sensitivities to the imprecisely known domain boundaries and interfaces can arise both from the definition of the system’s response as well as from the equations, interfaces and boundary conditions defining the model and its imprecisely known domain. Ongoing research will generalize the methodology presented in this work, aiming at computing exactly and efficiently higher-order response sensitivities for coupled systems involving imprecisely known interfaces, parameters, and boundaries. The sequel [20] to this work illustrates the application of the 1^st-CASAM to a benchmark problem [21] [22] [23] that models heat conduction and convection in a physical system comprising an electrically heated rod surrounded by a coolant which simulates the geometry of an advanced (“Generation-IV”) nuclear reactor [24].

Conflicts of Interest

The author declares no conflicts of interest regarding the publication of this paper.

References

[1]	Wigner, E.P. (1945) Effect of Small Perturbations on Pile Period. Chicago Report CP-G-3048.
[2]	Cacuci, D.G. (1981) Sensitivity Theory for Nonlinear Systems: I. Nonlinear Functional Analysis Approach. Journal of Mathematical Physics, 22, 2794-2802. https://doi.org/10.1063/1.525186
[3]	Cacuci, D.G. (1981) Sensitivity Theory for Nonlinear Systems: II. Extensions to Additional Classes of Responses. Journal of Mathematical Physics, 22, 2803-2812. https://doi.org/10.1063/1.524870
[4]	Cacuci, D.G. (2003) Sensitivity and Uncertainty Analysis: Theory, Volume 1. Chapman & Hall/CRC, Boca Raton. https://doi.org/10.1201/9780203498798
[5]	Cacuci, D.G. (2018) The Second-Order Adjoint Sensitivity Analysis Methodology. CRC Press, Taylor & Francis Group, Boca Raton. https://doi.org/10.1201/9781315120270
[6]	Valentine, T.E. (2006) Polyethylene-Reflected Plutonium Metal Sphere Subcritical Noise Measurements, SUB-PU-METMIXED-001; International Handbook of Evaluated Criticality Safety Benchmark Experiments; NEA/NSC/DOC(95)03/I-IX; Organization for Economic Co-Operation and Development (OECD). Nuclear Energy Agency (NEA), Paris.
[7]	Cacuci, D.G., Fang, R. and Favorite, J.A. (2019) Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: I. Effects of Imprecisely Known Microscopic Total and Capture Cross Sections. Energies, 12, 4219. https://doi.org/10.3390/en12214219
[8]	Fang, R. and Cacuci, D.G. (2019) Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: II. Effects of Imprecisely Known Microscopic Scattering Cross Sections. Energies, 12, 4114. https://doi.org/10.3390/en12214114
[9]	Cacuci, D.G., Fang, R., Favorite, J.A., Badea, M.C. and Di Rocco, F. (2019) Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: III. Effects of Imprecisely Known Microscopic Fission Cross Sections and Average Number of Neutrons per Fission. Energies, 12, 4100. https://doi.org/10.3390/en12214100
[10]	Fang, R. and Cacuci, D.G. (2020) Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark. IV: Effects of Imprecisely Known Source Parameters. Energies, 13, 1431. https://doi.org/10.3390/en13061431
[11]	Fang, R. and Cacuci, D.G. (2020) Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: V. Computation of 2nd-Order Sensitivities Involving Isotopic Number Densities. Energies, 13, 2580. https://doi.org/10.3390/en13102580
[12]	Cacuci, D.G., Fang, R. and Favorite, J.A. (2020) Comprehensive Second-Order Adjoint Sensitivity Analysis Methodology (2nd-ASAM) Applied to a Subcritical Experimental Reactor Physics Benchmark: VI. Overall Impact of 1st- and 2nd-Order Sensitivities. Energies, 13, 1674. https://doi.org/10.3390/en13071674
[13]	Cacuci, D.G. (2019) Towards Overcoming the Curse of Dimensionality: The Third-Order Adjoint Method for Sensitivity Analysis of Response-Coupled Linear Forward/Adjoint Systems, with Applications to Uncertainty Quantification and Predictive Modeling, Energies, 12, 4216. https://doi.org/10.3390/en12214216
[14]	Komata, M. (1977) A Generalized Perturbation Theory Applicable to Reactor Boundary Changes. Nuclear Science and Engineering, 64, 811-822. https://doi.org/10.13182/NSE77-A14496
[15]	Larsen, E.W. and Pomraning, G.C. (1981) Boundary Perturbation Theory. Nuclear Science and Engineering, 77, 415-425. https://doi.org/10.13182/NSE81-A18954
[16]	Rahnema, F. and Pomraning, G.C. (1983) Boundary Perturbation Theory for Inhomogeneous Transport Equations. Nuclear Science and Engineering, 84, 313-319. https://doi.org/10.13182/NSE83-A15451
[17]	McKinley, M.S. and Rahnema, F. (2002) High-Order Boundary Condition Perturbation Theory for the Neutron Transport Equation. Nuclear Science and Engineering, 140, 285-294. https://doi.org/10.13182/NSE02-A2261
[18]	Favorite, J.A. and Gonzalez, E. (2017) Revisiting Boundary Perturbation Theory for Inhomogeneous Transport Problems. Nuclear Science and Engineering, 185, 445-459. https://doi.org/10.1080/00295639.2016.1277108
[19]	Cacuci, D.G. (2018) BERRU Predictive Modeling: Best Estimate Results with Reduced Uncertainties. Springer, Heidelberg/New York. https://doi.org/10.1007/978-3-662-58395-1
[20]	Cacuci, D.G. (2020) The First-Order Comprehensive Sensitivity Analysis Methodology (1st-CASAM) for Scalar-Valued Responses: II. Illustrative Application. American Journal of Computational Mathematics;, accepted for publication.
[21]	Cacuci, D.G., Fang, R., Ilic, M. and Badea, M.C. (2015) A Heat Conduction and Convection Analytical Benchmark for Adjoint Solution Verification of CFD Codes Used in Reactor Design. Nuclear Science and Engineering, 182, 452-480. https://doi.org/10.13182/NSE15-69
[22]	Cacuci, D.G. (2016) Second-Order Adjoint Sensitivity and Uncertainty Analysis of a Heat Transport Benchmark Problem—I: Analytical Results. Nuclear Science and Engineering, 183, 1-21. https://doi.org/10.13182/NSE15-81
[23]	Cacuci, D.G., Ilic, M., Badea, M.C. and Fang, R. (2016) Second-Order Adjoint Sensitivity and Uncertainty Analysis of a Heat Transport Benchmark Problem—II: Computational Results Using G4M Reactor Thermal-Hydraulic Parameters. Nuclear Science and Engineering, 183, 22-38. https://doi.org/10.13182/NSE15-80
[24]	GEN4 ENERGY, INC. (2012) Reactor Core Design. GEN4 ENERGY, INC., Denver.
[25]	(2015) ANSYS^® Academic Research, Release 16.0, FLUENT Adjoint Solver, ANSYS, Inc.

Journals Menu

Follow SCIRP

	+1 323-425-8868
	customer@scirp.org
	+86 18163351462(WhatsApp)
	1655362766

	Paper Publishing WeChat

Journals Menu

Home

About SCIRP

Service

Policies