Dose-Injury Relation as a Model for Uncertainty Propagation from Input Dose to Target Dose ()
1. Introduction
In many injury assessment situations, injury status of a subject is simply characterized in the form of binary outcome. For example, in a study of skull fracture injury related to highway traffic safety [1]and in a study of rib fracture injury caused by blunt-impact non-lethal weapons [2], in each situation subjects tested are classified as either fractured or not fractured. Mathematically, occurrences of binary injury outcomes are statistically described by injury probability (also called injury risk). Let
•
be a list of input factors that affect the injury outcome,
• I be the binary injury outcome (random variable), and
• p be the corresponding injury probability: p = Pr (I = “injured”)
Here the binary injury outcome I is a random variable even when all input factors
are given and fixed. One approach of building a simple and practical model for assessing the injury risk is to use a single metric x to capture the overall effects of all input variables
[3][4]. Quantity x is called the input dose, serving as the single metric best predictor of the injury probability. Input dose x may be one of the input variables
or a combination of these input variables. Depending on the application situations, input dose x is also called the determinant of injury, the risk factor, the exposure level, or the predictor variable [3][4].
When the input dose x is directly controllable and measurable, an experimental data set consists of m entries, each containing a measured value of input dose and the corresponding binary injury outcome in an independent trial:
(1)
Injury models are constructed in the general form of injury probability vs input dose.
In many application situations, however, the input dose is not directly measurable. For example, for bone fracture injuries, we may use the stress at the impact site as the input dose. But it is difficult to measure directly the stress at impact site. In a study of behind-armor blunt trauma (BABT) [5]and a study of human body response to blunt impacts using advanced total body model (ATBM) [6], an estimated value of stress caused by the impact is calculated via computer simulations. The estimation is based on the measured mass and velocity of projectile and using representative median material properties of the projectile, the subject body and the armor. When the true input dose is not directly measurable, an experimental data set contains pairs of estimated input dose and the corresponding binary injury outcome:
(2)
In these situations, practical injury models are constructed in the form of injury probability vs the estimated input dose
The estimated input dose, in general, is different from the true input dose, and the discrepancy between the two is population dependent since the actual material properties of individual subjects are different from the selected representative material properties and are population dependent. In addition, the relation of injury probability vs true input dose is also population dependent because the material properties of subjects significantly affect the injury outcome even when the true input dose is fixed. For example, at a fixed impact force, the injury probability varies considerably among groups of different ages, among groups of different body types, body sizes and body compositions. The experimentally established relation of injury probability vs estimated input dose is heavily influenced by the particular population tested. As a result, applying the injury model established for one population, straightforwardly without modification, to assess the injury risk of a different population will inevitably lead to large errors. In many applications, however, we face exactly this task: we are given an injury model established on a particular test population and we need to predict the injury risk of a different population. For example, a data set for human forearm fracture was assembled in [7]from drop test results conducted on PMHS forearms from cadaver donors of average age 55. The purpose of assembling the data set, however, is to build an injury model for assessing the risk of forearm fracture among a population of live human subjects with an age distribution significantly different from that of cadaver donors. In this study, we develop a simple mathematical framework for this task. The key idea is based on interpreting the probabilistic injury model as the consequence of dose propagation uncertainty from input dose to target dose at the active site for injury where the binary outcome is uniquely determined by the target dose. The framework of dose propagation uncertainty makes it mathematically convenient to accommodate different uncertainties associated with different populations. The formulation developed provides a mechanism of mapping injury function from one population to another by simply updating the model parameters.
2. Mathematical Formulation
We first review the logistic model for binary outcomes [8]. Note that the injury probability p is not directly observable in experiments unless we repeat the experiment a large number of times at each fixed value of input dose. The logistic regression model was designed by working with the hidden injury probability p and considering the logit function of p, which is defined as the
logarithm of the injury odds:
. In the logistic model,
is postulated to be a linear function of input dose x,
(3)
Writing probability p as a function of x, we obtain the logistic dose response relation
(4)
We write the linear function in (3) as
so that constant
has the meaning of the median injury dose, at which the injury probability is 50%:
[9]. In general, we introduce
to denote the dose value at which
. For example,
is the dose with
. Coefficient
controls the steepness of transition (i.e., the sensitivity of injury probability with respect to dose change). We define the width of injury function as
Conceptually, the width W is not the 10 - 90 percentile range of x since dose x is not a random output of an experiment; it is the controlled input. However, if we view the injury function as the cumulative distribution function (CDF) for x and draw random samples of x based on the CDF, then the width W is indeed the 10 - 90 percentile range of random samples drawn. For simplicity, we shall call W the 10 - 90 percentile width even though x is not a random variable. In the logistic model, the width W is inversely proportional to coefficient
.
(5)
We point out that the steepness coefficient
exists only in the logistic model. In contrast, the width of injury function is universally defined and meaningful for all injury models. To facilitate the comparison of various models, we shall use the width (W) instead of coefficient
whenever it is appropriate to do so. The logistic model in terms of shape parameters
has the expression.
(6)
Logistic model is widely used as a phenomenological model for binary outcomes [3][4][10]. In this study, we interpret it and approximate it in the framework of dose propagation uncertainty from input dose to target dose. The key assumption in our interpretation is that there is an active site at which target dose Z uniquely determines the binary outcome I. Mathematically, target dose Z at the active site has the features described below:
• Binary outcome I is the indicator function of
(7)
where
is the critical threshold for target dose in transition from non-injury to injury. The transition is a discontinuous jump with respect to target dose Z at the active site. However, with respect to the input dose x that is away from the active site, the injury probability vs x generally is a smooth and gradual transition.
• The target dose Z is caused by the input dose x. While in most experiments the input dose x can be controlled, at least to some extent, the target dose Z is neither directly observable nor directly controllable.
• For a given input dose x, the corresponding target dose Z is a random variable, reflecting the uncertainty in the propagation from input dose to target dose.
We use an example to illustrate the propagation from input dose to target dose.
Example: Passing exam vs amount of study time
In this example, the input dose x is the amount of study time. Note that although the target dose Z is caused by the input dose x, quantities Z and x may have different physical dimensions. For passing an exam, the target dose Z is the effective fraction of actual exam contents correctly completed in the exam by the student. We use a flow chart to show a possible propagation from input dose to target dose.
x = the nominal amount of study time invested
®Z1 = effective amount of study time
affected by the student’s attentiveness, effciency, and overall load
®Z2 = amount of course contents learned
affected by the student’s prior preparation and ability of memorizing key items
®Z3 = fraction of actual exam contents learned
affected by the exam scope and weighting of components in exam
®Z = effective fraction of actual exam contents correctly completed
affected by the student’s general health condition on exam day, and ability of working under time pressure and in presence of noise/disturbance (8)
Mathematically, we write the target dose explicitly as
, emphasizing that Z is a random variable depending on the input dose x and depending on the random factor
in the dose propagation. The probability that a given input dose x leads to injury is
(9)
We consider two models for uncertainty in dose propagation: 1) target dose
has a normal distribution; and 2) target dose
is expressed in terms of a normally distributed intermediate variable. For example, intermediate variable
has a normal distribution, and target dose
is a shifted log normal distribution, expressed in terms of intermediate variable
as
.
3. Logistic Dose-Injury Relation Interpreted as Normally Distributed Target Dose
We model the target dose as proportional to the sum of the input dose and an additive Gaussian noise.
where
, a standard normal random variable. We scale target dose Z and the associated critical threshold
to make
by changing the physical unit for measuring z-values, or equivalently by changing the physical unit for measuring x-values. Thus, we set
and proceed with
(10)
In this section, we first examine the dose-response relation for normally distributed dose uncertainty, which is the probit model [11]. Then we discuss how to accommodate different uncertainties corresponding to different populations, including how to incorporate additional uncertainties into the dose-response relation.
3.1. Dose-Response Relation
The binary injury outcome is governed by the sign of random variable
(11)
The injury probability (p) corresponding to input dose x is
Recall that the cumulative distribution function (CDF) of standard normal is given by the error function,
, which is defined as
The dose response relation for normally distributed target dose Z has the expression:
(12)
We approximate dose-response relation (12) using the logistic function form (4) with tunable parameters
and
. First, we match the two functions at
to obtain
. To simplify the search for optimal
, we apply the transformation
After the transformation, (4) and (12) as functions of
have standard forms:
(13)
(14)
where the scaled coefficient
is related to
by
. For conciseness, we denote
simply as x. The task of approximating (12) with (4) is reduced to finding an optimal value of
such that the distance between
and
is minimized. Using numerical optimization, we find that the best approximation is achieved at
.
Figure 1 compares functions (14) and (13) at
. It is clear that the two functions are very good approximations of each other. The maximum difference is bounded by 0.01 (i.e., difference in predicted injury probability is less than 1%). With that error tolerance, the logistic model and the normal distribution model can practically substitute each other. In other words, the widely used logistic model can be viewed as a very good approximation of the normal distribution model, which was derived based on normally distributed dose propagation uncertainty from input dose to target dose.
Models (13) and (14) are nevertheless mathematically different. When the data set of binary injury outcomes (I) is sufficiently large, eventually, the two models will be distinguishable. Let m be the number of samples in the data set. We look into the question of how large m needs to be in order to statistically distinguish the two models. We consider a collection of independent data sets, each of the form
Figure 1. Comparison of
and
at
. Left panel: plots of the two functions. Right panel: plot of the difference between the two functions. The results shown demonstrate that the two functions are very good approximations to each other.
where
is the input dose of the j-th experiment and
the corresponding binary injury outcome. To test if the two models are statistically distinguishable, we generate data sets according to the normal distribution model
in (14). In all data sets, values of input dose
are uniformly distributed in
, and for each input dose
the corresponding binary injury outcome
is sampled using injury probability
.
Given data set D, the log-likelihood for a general probability function
is
(15)
We use log-likelihood (15) to compare models
and
. Since
is the exact probability model for the data set while
is a slightly incorrect model, the difference in log-likelihood
is expected to be positive. However, due to randomness of data sets, the difference in log-likelihood between two models fluctuates from one date set to another. We examine the sample distribution of differences in log-likelihood based on
independent data sets. Figure 2 plots the histograms of
for various values of m, the size of each data set.
Figure 2. Histograms of
for various values of m, the size of individual data sets, each yielding a sample for the histogram. Top left panel: histogram based on
independent data sets, each containing
samples; top right panel:
; bottom left panel:
; and bottom right panel:
.
To clarify, here N is the number of data sets used in each histogram and m is the number of binary outcomes in each data set. In Figure 2, each sample of difference in log-likelihood requires one data set. That is why we use
independent data sets to plot each histogram.
Suppose we use the sign of
to classify data sets as the normal distribution model (positive sign) or as the logistic model (negative sign). All data sets examined in Figure 2 are generated based on the normal distribution model. Thus, data sets with
will be falsely identified as the logistic model (false negative). In Figure 2, all counts to the left of the dashed black line in each histogram correspond to false negative identification. For data sets of
samples each (top left panel), the false negative rate is 25.26%. For
(top right panel), the false negative rate decreases to 19.44%. When the sample size is increased to
(bottom left panel), the false negative rate falls to 12.49%. Finally, when the sample size is doubled again to
(bottom right panel), the false negative rate drops down to 5.63%. Based on the simulation results, we see that to reduce the false negative rate to less than 20%, for example, we need to work with data sets, each consisting of
samples. This is above the typical sample size of data sets for injury models. Thus, in real applications, the normal distribution model (14) and logistic model (13) are practically the same unless we work with injury data sets of very large sample size.
We go back to the pre-transformation logistic model, function (4) specified by steepness coefficient
, and function (6) specified by width W. The corresponding optimal values for
and for W are respectively
(16)
Since the 10 - 90 percentile width is well defined for all injury functions, we choose to specify the logistic model using width W instead of coefficient
. We conclude that normal distribution model (12) based on dose propagation uncertainty is practically equivalent to logistic model (6) with shape parameters
given by
(17)
(17) describes the best approximation to the normal distribution model (12) from the logistic model family (6). The best approximation is obtained numerically by minimizing the distance between the two functions (Figure 1). Alternatively, a straightforward approximation can be written out by simply matching the widths of two injury functions. The width of normal distribution model is given by the inverse error function
Notice that the two widths, the width of normal distribution model
and the width of its best logistic model approximation
, are indeed very close to each other. We will use these two interchangeably.
Similar to the situation of logistic model, the normal distribution model is also completely specified by the shape parameters
. It has the form
(18)
where shape parameters
are related to parameters of dose propagation uncertainty in (17). It should be pointed out that in general, the target dose Z is hidden, not observable or controllable; none of parameters
,
or
is directly observable. These are internal quantities in the mathematical model, explaining why the injury probability follows the normal distribution model (12). In an idealized situation, the input dose x should be a controllable/measurable variable, and shape parameters
may be determined from experimental measurements. In realistic applications, however, the true input dose x may not be directly measurable, which we will discuss in next subsection. At the end of this subsection, we summarize the normal distribution model for dose propagation uncertainty, and its connection to the widely used logistic model.
Summary of the injury model based on dose propagation uncertainty
• We select the physical unit for measuring the target dose Z such that in the absence of dose propagation uncertainty, target dose Z is the same as input dose x:
• In the normal distribution model, the difference between target dose and input dose is an additive Gaussian noise:
• The binary injury outcome is completely determined by the condition
where
is the critical threshold for target dose Z.
• The probability of injury caused by the input dose x is described by the CDF of normal distribution. Practically the injury probability is very well approximated by the widely used logistic dose-response relation.
• As given in (17), the median injury dose of injury function is the critical threshold for the target dose, shifted by the bias in the dose propagation:
and the width of injury function is proportional to the uncertainty in dose propagation (standard deviation of the Gaussian noise):
The larger the uncertainty, the more spread out the injury function is.
• In terms of shape parameters
, the logistic model is expressed in (6); the normal distribution model is given in (18).
Next, we study how to incorporate additional uncertainties in the framework of dose-response relation, and how to model a new population with different uncertainty.
3.2. Effects of Additional Uncertainties
In the previous subsection, we interpreted the dose-response relation as a consequence of dose propagation uncertainty. In this subsection we study how to incorporate additional uncertainties by changing the shape parameters
in logistic model (6) or in normal distribution model (18).
We start by considering a homogeneous population consisting of statistically identical subjects, which means quantities
,
and
are fixed and stay the same for all subjects in the population. In a homogeneous population, the dose propagation uncertainty is statistically the same for all subjects. Its effect is already reflected in the dose response relation specified by shape parameters
, which are related to internal parameters
in (17). In particular, the width W is proportional to the standard deviation of uncertainty. If there is no uncertainty present in the dose propagation, the dose-response relation would be a sharp transition (a step function).
Now we consider a more realistic situation: a heterogeneous population consisting of subjects with variable critical threshold
, denoted here in the new setting as
, following the convention of using uppercase letters for random variables. In addition to the uncertainty in
, the input dose x may not be directly measurable. In some situations, the input dose x is not directly measured; instead, input dose x is derived from a controllable/measurable variable y. In these situations, the value of input dose x is calculated via computer simulations from measurable quantities using idealized representative properties of subjects, such as the 50-percentile properties of the general population [5][6]. We use the example below to illustrate the situation of controllable variable y vs true input dose
vs estimated input dose
. Consider the experiment in which we test the shatter resistance of a product by dropping it from a specified height. In this example, the various quantities in the model are described as follows:
• The height y is the controllable/measurable variable.
• The estimated input dose
is the impact force calculated in a computer simulation from height y using the representative median properties, such as the weight of the product, the aerodynamic properties, the mechanical properties of the product and the ground surface, and the orientation angle of the product at impact.
• The true input dose
is the actual impact force, which in general is different from the estimated input dose
. The difference
depends on how much the true properties deviate from the selected representative properties. The distribution of difference varies from one population to another.
• The target dose
is the maximum stress at the most vulnerable part of the product.
The bottom line is that the true input dose
is a random variable when the controllable variable y is specified. We model the difference
, the dose propagation uncertainty
, and the critical threshold
as additive Gaussian noises. Mathematically, we formulate the problem as
(19)
(20)
(21)
where
are i.i.d. samples of
. The binary injury outcome is governed by the sign of random variable
(22)
At a given value of
, random variable
has the same mathematical form as random variable
in (11). As a result, the injury probability vs the estimated input dose has the expression
(23)
Injury function (23) has the same form as (12). Thus,
is described by the normal distribution model with shape parameters
given as follows.
(24)
In a well controlled lab setting, the true input dose
is measurable. For example, in experiments of male forearm fracture [12], a cylinder of specified mass is dropped from a specified height along a vertical track onto the PMHS forearm sample. Both the forearm sample and the cylinder are connected to accelerometers, allowing accurate measurements of the dynamic impactor load and the support loads. In addition, in situ strain gauges are used to record time series of strains at various locations during the loading. In this idealized setting, there is no measurement error in
. The injury probability as a function of the true input dose,
, can be determined from the observed binary injury outcomes vs measured values of
. Injury function
follows the normal distribution model with shape parameters
given below.
(25)
With this formulation, we can map back and forth between injury functions
and
. We can also revise the injury function
measured on one population to construct the injury function for a different population. We now discuss these two problems.
Problem 1:
Suppose we are given an injury model
, specified by shape parameters
. The given injury function is for an idealized setting where the true input dose is directly measured. Our goal is to extend the given injury function
to predict the injury probability,
, as a function of estimated input dose for the same population when the true input dose is not measurable.
Solution:
Injury function
is specified by shape parameters
given in (25) while injury function
is specified by shape parameters
given in (24). Combining (25) with (24), we write
as an update on
.
(26)
Problem 2:
Suppose we are given an injury model
, specified by shape parameters
. The given injury function is established based on measurements of a heterogeneous population, labeled population 1. Population 1 is characterized by uncertainties in the input dose estimation and in the critical threshold, as described in (20) and (21)
Now consider a different heterogeneous population, labeled population 2, with uncertainties described by
Here we assume that the propagation uncertainty from true input dose to target dose
is statistically the same for the two populations. Our goal is to predict the injury function
for population 2 based on the given injury function
for population 1.
Solution:
Injury function
for population 1 is specified by shape parameters
while injury function
for population 2 is specified by shape parameters
. We write
as an update on
to take into account the differences in uncertainties between the two populations.
(27)
4. Dose-Injury Function for Target Doze of Log-Normal Distribution
For the discussion below, we adopt the normal-distribution model as the base formulation, switching away from the logistic model. There are several reasons behind the switching.
• The normal-distribution model is based on 1) viewing the binary injury outcome as completely determined by the target dose at the active site, 2) explaining the randomness in injury outcome as the consequence of uncertainty in dose propagation from input dose to target dose, and 3) modeling the dose propagation uncertainty as an additive Gaussian noise. This interpretation is both theoretically and operationally appealing.
• Mathematically, the injury function form of normal-distribution model is exactly invariant when additional normally distributed noise/uncertainty is incorporated into the model.
• We will study dose-injury models based on normally distributed intermediate variable. Mathematically, such an injury model is conveniently treated as a transformation of the normal-distribution model since the target doze is expressed as a function of the normally distributed intermediate variable.
• As we demonstrated in the previous section, the logistic model is practically equivalent to the normal-distribution model with the same shape parameters
.
We first recall the function form of the normal-distribution model. In terms of internal variables
, it is given by (12). In terms of shape parameters
, it is expressed in (18). Geometric quantities
,
,
and W of the injury function are related to internal variables
as
(28)
Because of the symmetry of error function
, the normal distribution model (18) is symmetric around the median injury dose
:
(29)
We now study a skewed injury function that breaks this symmetry. Consider the situation where the target dose
has a log-normal distribution
Again
is a standard normal random variable. In this case,
and
are simply related by an additive Gaussian noise.
(30)
If we use
and
to measure, respectively, the input dose and the target dose, then the injury probability vs
follows the same function form as (12) with
replaced by
:
(31)
We examine the injury probability as a function of the original input dose x. The purpose is to investigate 1) under what condition the injury probability vs x can be approximated by the symmetric normal-distribution model, and 2) when the normal distribution approximation is invalid, what additional parameter we need to introduce to describe the injury function for the original input dose x.
Since the injury probability vs
follows the normal distribution model (12), we use results (28) for (12) to write out
,
and
for quantity
.
(32)
The corresponding
,
and
for quantity x are
(33)
In this case, it is clear that
. The injury probability vs quantity x is not exactly symmetric around
. We introduce a measure of skewness to represent the asymmetry of injury probability vs quantity x.
(34)
Specifically,
defined above measures the skewness of interval
around
.
• When
, interval
is symmetric around
.
• When
, we have
, which implies that the upper half (above
) of injury function is flatter than the lower half (below
).
• When
, we have
, and that the upper half of injury function is steeper than the lower half.
Skewness
is an indicator of how well the injury function for x can be approximated by the symmetric normal distribution model. For a target dose of log-normal distribution, the skewness is
. When
is small, the skewness
, and the injury function is nearly symmetric around
. When
, the skewness
is positive, and in (31) the injury probability as a function of x is not symmetric. In this case, the injury function is characterized by three shape parameters:
.
(35)
Notice that even though expressions of
in (35) contain three variables
, two variables
and
appear only as a combination
in
. Mathematically, the three shape parameters
are completely specified by
, and thus, have only two degrees of freedom. As a result, the three shape parameters
cannot be set independently of each other. For example, in (35) when
is small, the width W will be small unless the median dose
is large. Formulation (35), based on target dose of log-normal distribution (30), cannot accommodate any negative skewness (
). It cannot even accommodate the simple symmetric case of
with finite
and
. We like to revise the formulation and construct an injury model in which the three shape parameters
can be set independently of each other.
5. A Dose-Injury Model with Skewness Based on a Normally Distributed Intermediate Variable
We construct a model that accommodates the median injury dose (
), the width (W) and the skewness (
) as 3 independent parameters. In previous section, we studied the formulation based on target dose of log-normal distribution, in which the skewness is always positive and the 3 shape parameters
are not independent of each other. A log-normal random variable can be viewed as the exponential of normal random variable. To accommodate negative skewness and to make
independent of each other, we extend the formulation to the case of target dose being a more general function of normal random variable.
We consider the situation where the dose propagation uncertainty is an additive Gaussian noise in quantity
with
as a new tunable parameter. The target dose
and the input dose x are related by
In this setting,
has the same sign as
. The domain of x is divided by
into two regions:
and
. Only the region containing the critical threshold
will be relevant for the injury model. The other region of x produces target dose
always above or always below
. For example, when
, only the region
is relevant for the injury model; the region
leads to target doze
and thus, leads to an injury probability of 100%. We discuss separately the case of
and the case of
.
5.1. Case 1:
In this case, the region
yields target dose
and an injury probability of 0%. We focus on the region
, the relevant region for the injury model. The logarithm of shifted target dose
and logarithm of shifted input dose
are related by an additive Gaussian noise.
(36)
where
. We apply the shift
on all dose quantities (including
and
). After the shift, problem (36) above is exactly the same as problem (30) in the previous section. It follows that the injury probability has the same function form as (12) with
replaced by
(37)
Based on results (33) and (35), we write out
for injury function (37).
(38)
Note that both
and
are on the right side of
in the case of
. As we will see,
and
are always on the same side of
. With Formulas (38) for the case of
, we can accommodate shape parameters
with positive skewness
. Specifically, at any fixed
, for each given set of
there is a unique corresponding set of
.
(39)
This works for any positive skewness
, corresponding to the situation where the injury probability has a flatter rise above the median injury dose
than below it.
To accommodate negative skewness
, however, we need
.
5.2. Case 2:
In this case, we focus on the region
since the region
yields target dose
and an injury probability of 100%. The target dose and input dose are related by
(40)
where
. Here we consider quantity
with the negative sign because it is an increasing function of
. Injury occurs when the target dose is above the critical threshold:
, which translates to
The injury probability has the expression
(41)
Notice that (31) with quantities denoted by (' ) and (41) are connected by transformation
We use results (33) and (35) for injury function (31) to write out
for (41).
(42)
In the case of
, both
and
are on the left side of
. With Formulas (42) for the case of
, we can accommodate shape parameters
with negative skewness
. Specifically, at any fixed
, for each given set of
there is a unique corresponding set of
.
(43)
This works for
, which indicates that the injury probability has a steeper rise above the median injury dose
than below it.
Next we combine the results of
and
to derive a unified formulation for accommodating shape parameters
regardless of the sign of
.
5.3. A Unified Formulation for All Values of Skewness
In the previous sub-section, we studied models based on target dose of shifted log normal distribution with shift as a parameter. We now synthesize the results obtained to develop a unified formulation of injury function in which the 3 shape parameters
can be specified independently.
First, we show that at any fixed value of
, there is one-to-one correspondence between
and
. For any given set of shape parameters
regardless of the sign of
, we combine results (39) and (43) to write out the corresponding
.
(44)
Conversely, for any given set of
, we combine results (38) and (42) to write out the corresponding shape parameters
.
(45)
Again,
and
are always on the same side of
. Next we combine (37) for
and (41) for
to write out a unified injury probability vs x.
(46)
To specify the unified injury function in terms of shape parameters
, we express all quantities in (46) using only
and x.
With these expressions, we write the unified injury function as
(47)
In injury model (47), the 3 shape parameters
can be specified independently of each other. In particular, for small skewness
, expanding (47) in terms of
reduces it to the symmetric normal-distribution model (18)
Figure 3 illustrates several injury functions of the form (47), respectively, for positive, zero and negative skewness. All injury functions shown have the same width
. In the left panel of Figure 3, injury functions are aligned at
. This alignment demonstrates that for
the left half of injury function is steeper than the right half; for
the left half of injury function is flatter than the right half; and for
the injury function is symmetric. In the right panel, injury functions are shifted to be aligned at
and thus also aligned at
because they all have the same width
Figure 3. Injury functions with positive, zero and negative values of skewness. All injury functions have the same width
. Left panel: injury functions are aligned at the median injury dose
. Right panel: injury functions are shifted to have the same interval
.
. With
fixed, the median dose
varies with skewness
from
at
, to
at
, and to
at
. The alignment of interval
highlights that as
increases from negative to zero to positive, the injury function becomes more concave down.
6. Effect of Input Dose Uncertainty on the Injury Function with Skewness
We study the effect of input dose estimation uncertainty on the dose-injury function with skewness. We use the term “composite injury function” to denote the injury model after the input dose uncertainty has been incorporated into the model. In general, the composite injury function will be somewhat different from the 3-parameter function form (47) we derived in the previous section. We calculate the three shape parameters
of the composite injury function. Then we explore approximating the composite injury function using function form (47). We examine the difference between the composite injury function and model (47) with the same shape parameters
. If the approximation error is small, then the 3-parameter function form (47) is approximately invariant with respect to input dose uncertainty, and it serves as an adequate framework for accommodating uncertainty in estimating the input dose. Furthermore, framework (47) provides a mechanism of mapping the injury function for one particular dose propagation uncertainty to that for a different uncertainty. Using this mechanism, we can construct an injury model for a target population in application, based on measured injury data for a test population in experiments.
We start with a function of injury probability vs true input dose that is exactly of form (47) specified by 3 shape parameters
:
We consider the situation where the true input dose
is not measurable. Instead, an estimated input dose, x, is obtained as an approximation for
. We assume
• the difference
is a normal random variable, and
• the difference
is independent of x.
We assess the injury probability as a function of the estimated input dose x. For each fixed value of x, the corresponding
is a normal random variable:
where
. The composite injury function,
, representing the injury probability at estimated input dose x, is a Gaussian weighted average of
:
(48)
When injury function
has non-zero skewness, the Gaussian weighted average of
on the right hand side of (48) does not have a simple analytical expression. We use numerical integration to calculate the composite injury function
and calculate its shape parameters
. We examine numerically if
is still well described by function form (47) with
’s shape parameters
.
In our numerical study,
, the injury probability vs the true input dose before input dose uncertainty is incorporated, has function form (47) and is specified by shape parameters
,
, and
. We consider input dose uncertainty of normal distribution with
(mean) and various values of
(standard deviation). The composite injury function,
, contains the effect of input dose uncertainty, showing injury probability vs estimated input dose x. Figure 4 examines the composite injury function
for
between 0 and 3.
The left panel of Figure 4 shows the injury probability vs the estimated input dose x, respectively, for
. The most pronounced effect of input dose uncertainty is to spread out the injury function and increase the width. We examine the trend of shape parameters
when the input dose uncertainty
is added and increased. The right panel shows
vs
, of the composite injury function. As the input dose uncertainty
increases, both the median injury dose
and the width
increase monotonically, with W increasing more prominently than
. At the same time, when
increases, the asymmetry of injury function is smoothed out by the Gaussian noise and as a result, the skewness
decreases. The change in median injury dose
is attributed to the presence of skewness: the median injury dose increases (moves toward the right) when an injury function with positive skewness is smoothed out by a Gaussian noise. Conversely, the median injury dose decreases (moves toward the
Figure 4. Effect of the input dose uncertainty
on the injury function with skewness. Left panel: composite injury functions for several values of
. Right panel: shape parameters
vs
of the composite injury function.
left) when an injury function with negative skewness is smoothed out. The movement of the median injury dose is caused by smoothing an asymmetric function (see Figure 3 for the general shape of injury functions with positive, zero, and negative skewness). For an injury function of zero skewness,
is invariant with respect to
when the injury function is smoothed out by a Gaussian noise.
Next we examine whether or not the composite injury functions for
shown in Figure 4 are still approximately described by model (47). Figure 5 compares the composite injury function and the approximation using function form (47) with shape parameters
of the composite injury function. The left panel of Figure 5 compares the composite injury function for
and its approximation. The two functions are barely distinguishable from each other. To quantitatively examine the error of approximation, in the right panel we plot the difference between the composite injury function and its approximation. For all values of
examined, the maximum error in approximation is less than 0.01 (1%). The results demonstrate that function form (47) specified by 3 independent shape parameters
is an adequate model for quantitatively describing general injury functions with skewness.
With the framework of function form (47) and mapping transformation (48), we can filter out the effect of input dose uncertainty in measured injury data. Suppose we are given a measured injury function,
, of form (47) for a particular population with input dose uncertainty
. We use transformation (48) to map it back to
, the injury function for the case of zero input dose uncertainty (
). From there, we can apply the mapping transformation again to predict the injury model for another population with input dose uncertainty
. There is no simple analytical expression for the mapping
Figure 5. Approximation of the composite injury function
using function form (47) with shape parameters
. Left panel: comparison of
and its approximation for
. Right panel: error of the approximation for several values of input dose uncertainty
.
transformation. Both the forward and backward mappings need to be implemented numerically. The detailed numerical procedure will be discussed in a subsequent study.
7. Concluding Remarks
We considered injury models in the framework of dose propagation uncertainty. The mathematical formulation is based on that the binary injury outcome is completely determined by the target dose at the active site and the critical threshold. The randomness in the occurrence of injury at a given input dose is attributed to the dose propagation uncertainty from input dose to target dose. The normal distribution model describes the situation where the dose propagation uncertainty is normally distributed. We interpreted the widely used logistic model as a good approximation to the normal distribution model, and thus, interpreted it approximately as a consequence of normally distributed dose propagation uncertainty. In many applications, the input dose is not directly measurable. Instead, an estimated input dose is calculated via computer simulations from measured quantities using representative median parameter values of the general population. In many practical situations, injury models are constructed in the form of injury probability vs estimated input dose. The discrepancy between the estimated input dose and the true input dose can be viewed as an uncertainty in the input dose. With the interpretation of dose propagation uncertainty, the input dose uncertainty is conveniently incorporated into the injury model. The framework of dose propagation uncertainty provides a mechanism of extending an injury function established on a test population to predict the injury model for a different population in application. Both the logistic model and the normal distribution model are specified by two shape parameters: the median injury dose and the 10 - 90 percentile width. The mapping between the injury functions of two populations has a simple analytical form of updating the two shape parameters. Both the logistic model and the normal distribution model are symmetric around the median injury dose and have no skewness. To accommodate injury functions with skewness, we studied dose propagation uncertainties of shifted log normal distribution with shift as a parameter. Based on the shifted log normal model, we developed a function form for injury probability vs input dose that is specified by three shape parameters: median injury dose, the width, and the skewness. The proposed function form allows the three shape parameters to be set independent of each other. In particular, the proposed function form is capable of accommodating arbitrary skewness, positive or negative. In addition, we showed numerically that the proposed 3-parameter function form is approximately invariant with respect to additions or changes in input dose uncertainty. Therefore, the 3-parameter function form serves as a broad framework for modeling input dose uncertainty and modeling injury function skewness at the same time. This broad framework allows us to map injury function with skewness from a test population to a different population in applications.
Disclaimer and Acknowledgements
The authors thank C. Kramer and J. Swallow of Institute for Defense Analysis (IDA) for bringing the problem to their attention, and thank the Joint Non-Lethal Weapons Directorate of U.S. Department of Defense for supporting this work. The views expressed in this document are those of the authors and do not reflect the official policy or position of the Department of Defense or the U.S. Government.