Distribution of Miss Distance in Discrete-Time Controlled System with Noise-Corrupted State Information ()
1. Introduction
With the emergence of maneuverable tactical ballistic missiles (TBM) since the 1991 Gulf War, great challenge is put on the air-defense system. Successful interception of such a highly maneuvering target requires a very small miss distance or even direct hit [1] [2]. Due to the bounded missile controls, non-Gaussian noise and system’s nonlinearity, the certainty equivalence theorem (CEP) does not hold for the terminal guidance [3]. However, when the relative state between pursuer and evader is observable, the partial separation theorem still holds, when the estimator can design separately from guidance law [4].
The homing guidance of an interceptor missile is a stochastic optimal control problem with the objective to minimize the expected value of the miss distance. Under the assumption that perfect state information is available, a series of deterministic control strategies
(
means ZEM, zero-effort miss distance and its terminal value is actual miss distance itself) are proposed, among them are bang-bang strategies, as well as various linear, saturated linear strategies [5] [6] [7]. However, in practical application perfect state information could not hold, on the one hand, there is a certain error in the observed information of the sensor, and even if the above measurement is very accurate, the target acceleration (especially lateral acceleration) cannot be directly measured and only relays on the means of state estimation; On the other hand, the guidance law like OGL (Optimal Guidance Law) unconstrained the range of missile acceleration, which is bounded in reality. These facts impede significantly the implementation of theoretically robust transferring strategies, in order to use these noise-corrupted measurements as a basis of a feedback control, an estimator, restoring and filtering the state variables, becomes an indispensable component of control loop. Therefore, the control function
receives, instead of the accurate value of
, a random output
, where
is the estimation error of ZEM. As a consequence, the miss distance becomes a prior unknown variable.
To guarantee guidance performance and evaluate the extent of performance deterioration by using such a stochastic data, it is necessary to obtain the distribution of miss distance. In current practice, such a distribution is acquired by a large set of Monte Carlo simulations with system dynamic, estimator/control strategy combination, specific disturbance and noise model given. Unfortunately, this method is very suitable for validate purpose while not proper in the stage of system design. With the assumption that the distributions of
and initial state
are given, Glizer and Shinar [8] have obtained the recursive formula for the distribution of miss distance in discrete-time controlled system with disturbance and noise corrupted measurements under a linear saturated control strategy. In this paper, a recurrence formula for the distribution of
is obtained under a bang-bang control strategy. The use of bang-bang control strategy is motivated by its feature that the maximum capture zone of missile would reach. At the same time, the analytic distribution of
in this case is also acquired.
The rest of the paper is organized as follows: Section 2 formulates the estimation problem of terminal guidance for highly maneuvering target interception. Section 3 is devoted to the derivation of MD’s distribution. Section 4 validates the above derivation by comparing the results with Monte Carlo simulations. Conclusions are presented in the last section.
2. Problem Formulation
2.1. Continuous-Time Motion Model
Consider the planar interception scenario―a pursuer (interceptor) and an evader (target), as shown in Figure 1. In what follows, let
and
denote the pursuer
and evader respectively, and we make the following assumptions:
1) The control dynamics of
and
are expressed by a first-order transfer function with time constants
and
respectively;
2) The velocities of
and
are constant, denoted by
and
respectively;
3) The lateral accelerations of
and
are both bounded, with maximum values
and
respectively.
In Figure 1, the
axis of coordinate system is aligned with the initial line of sight (LOS); the
axis is normal to
axis;
,
are current coordinates of the
and
;
and
are respective aspect angles between the velocity vector of the players and
axis. Note that these aspect angles satisfy such conditions (
), the trajectories can linearize along the initial LOS. Assuming a constant closing velocity, let the initial time
then the final time of the interception can be easily computed for a given initial relative distance
along with LOS, i.e. the final time is as follows
(1)
Let denote the state vector
, and for brevity, the time
is omitted hereafter. Under above assumptions, the interception takes place between
and the dynamics of interception has a linear form:
(2)
where
is the relative distance between
and
normal to LOS;
is the relative lateral velocity;
and
are the lateral accelerations of
and
;
and
are the commanded accelerations of the
and
respectively , and satisfy
(3)
The dynamic model can be rewritten as a state vector form
(4)
where
(5)
By using the scalarizing transformation, the planar interception problem will reduce to a scalar one. In this case, the system variable in the interceptor is
, zero-effort miss distance, which means the miss distance acquired with no control effort until the final time
reached. It can be formulated as
(6)
where
,
is the state transition matrix and subjects to the original homogeneous system
,
(7)
Therefore, the zero-effort miss distance is
(8)
where
, with
(9)
The zero-effort miss distance varies in accordance with the following differential equation:
(10)
where
(11)
(12)
It is assumed that the control is given by a bang-bang strategy
(13)
where
(14)
2.2. Discrete-Time Estimation Problem
Define the division of interval
:
, where
,
. For the sake of simplicity, it is assumed that the system is disturbance free, that is
, then the equivalent discrete-time Equation of (8) is
(15)
where
and the control
(16)
In our previous work, we have acquired the recursive expression of the ZEM estimation error distribution. It can be reached that
is subject to Gaussian distribution at every time with mean
and variance
, i.e.,
. In this case,
and
satisfy
(17)
(18)
where
is the current Kalman gain;
is the measurement matrix;
and
are the covariance matrix of process noise and measurement noise respectively;
and
are the mean and covariance matrix of state estimation error, which satisfy
(19)
(20)
Therefore, the initial ZEM estimation error satisfies
with
and
where
. Once the initial state of estimator
is given, the distribution of
can be computed as
.
3. Probability Density Function of zn+1
From (15) and (16) it can be seen that
and
is dependent, hence the convolution formula cannot be used when calculating the distribution of
. The cumulative distribution of
satisfies
(21)
where
(22)
(23)
By using the formula for the probability of the product of dependent event, we can obtain
(24)
then
(25)
Note that the random variables
and
are independent, therefore
(26)
where the symbol “
” means convolution, then
(27)
Similarly, the following formula can be obtained
(28)
By substituting (27), (28) into (21) the following formula can be obtained
(29)
for
, by simplifying (29)
(30)
from the characteristic of probability density function
, then
(31)
Similarly, for
(32)
By differentiating (31) and (32) with respect to
, a same expression for the probability density function of
can be acquired
(33)
The probability density function
is computed by applying the recurrence Formula (33)
times.
4. Numerical Simulation
To validate the correctness of theoretical derivation in Section 3, this experiment compares the results of Monte Carlo simulation with the analytic results through a typical instance of TBM interception, where the simulation parameters are listed in Table 1. The number of Monte Carlo simulation is set to be 1000. In this interception scenario, the evader is assumed to move with maximum velocity
, and its command acceleration is subject to uniform distribution between −0.3 g and 0.3 g. The initial state estimation errors of
,
and
for the estimator are set to be 10 m, 5 m/s and 1 g respectively. Cumulative distribution of the miss distance
is calculated as
(34)
where
. Figure 2 shows the result, it is seen that two curves match very accurately.
Figure 2. Theoretical and simulative distribution functions of
.
5. Conclusions
In this paper, an analytic method for evaluating the probability distribution of miss distance in discrete-time controlled system is proposed. In this problem, the system is assumed without disturbance, and the control is realized by a bang-bang strategy. Results are proved by Monte Carlo simulations in the context of highly maneuvering target interception.
The problem is mathematically nontrivial, because the evaluation of the sum of two dependent random variables is required. Conclusions in this paper allow to evaluate the distribution of miss distance without carrying out a large amount of Monte Carlo simulations.