A Comparison of Global Nearest Neighbor and Linear Assignment Problem Solution Methods on a Model Tracking Problem ()
1. Introduction
Based on two disquieting experiences with the application of GNN for ballistic missile tracking data association it comes as a surprise to us that there is still active research on this method.
The first of these experiences occurred at an early 1980’s Denver conference on satellite infrared sensor ballistic missile tracking. One of the conference papers reported a GNN track-to-observation data association accuracy of 22 % in a high fidelity ballistic missile simulation.
In the late 1990’s, we had already written our Multiple Hypotheses Tracking paper [1] and were readying it for submission for publication. In it we introduced the notion of track purity as a basic data association performance metric; it is the maximum of the fractions of the number of observations in an assembled track of observations all of which come from the same sensor-detected object. In that paper we reported that track data association based on a Murty [2] K-best hypotheses algorithm with a Jonker-Volgenant (JV) [3] linear assignment problem solution algorithm as its core solution method was producing track purities in the high 90 percentiles for 200 closely spaced objects over 37 scans in a satellite infrared sensor high fidelity simulation.
In a Government funded Research and Development contract at our company we extended our satellite sensor 2d line-of-sight observation data association algorithm to 3d radar data association and were testing its performance in a radar system simulation. The systems engineer, working for another company hosting the radar simulator, had instrumented track purity performance evaluation with a GNN data association solution, scan-by-scan, as ground truth. Not unexpectedly, our extended algorithm’s track purity results came in at the low 20 percentiles.
So, it comes as a disturbing surprise to us that there is a still significant recent research devoted to GNN algorithms applied to tracking systems. Nowhere in this literature (e.g., references [4]-[9]) is track data association purity or an analogous metric ever mentioned, much less used to evaluate GNN data association performance.
The intent of this paper is to demonstrate, with the simplest of models, the striking, absolute performance dominance of the LAP algorithm over the GNN algorithm in terms of track data association purity, track state estimation accuracy, and timing.
Section 2 below covers definitions and notation. Section 2 provides derivations.
Section 3 supplies our key results.
Section 4 discusses the major conclusions that flow from our results.
Section 5 lists references cited in the paper.
2. Definitions
= number of scans.
= number of tracks.
= number of observations per scan.
= observation normal error standard deviation.
is the column vector of state expected values.
= ith track state on scan s at the beginning scan s prior to state update
.
= jth observation on the sth scan.,
.
.
=
track-to-observation data assignment matrix.
is a column vector of costs for either the GNN or LAP methods.
= the assignment solution permutation column vector for either GNN or LAP methods on the sth scan.
=
track-to-object assignment purity matrix for either GNN or LAP methods.
3. Derivations
3.1. Preliminaries
We first note that the core of our simulation model is a one dimensional Kalman filter with unit transition and measurement matrices, process noise-free, and independent identically distributed normal zero
measurement error for each observation in each scan. The track-to-observation assignment matrix is square to facilitate the exploitation of the MATLAB JV linear assignment problem solver.
3.2. Initialization
In our simulation we first create the common
matrix of observations,
, with
element given by
. (1)
For each of the GNN and LAP methods we set the first column of
to
.
For each
we establish a track with single-point initialization by setting
(2)
3.3. Recursive Computation
For each s such that
.
For each
.
For each
.
(3)
The likelihood matrix,
on scan s,
, is the matrix
.
The assignment solution vector is
and the associated cost is
.
The scan s states,
and variances are recursively updated according to
(4)
is updated by setting the sth column of
to
.
After
scans method Purity, P, is computed as:
(5)
Track state accuracy, E, is computed as:
(6)
4. Results
We ran three 100 Monte-Carlo 10 scan MATLAB simulations characterized by the number of tracks (objects) and decreasing values of
. The MATLAB code was run on an HP 2.2 GHZ computer. The figures in Tables 1-5 are the average over the 100 samples. The figures in Table 6 and Table 7 are respective GNN and LAP assignment solution costs and on the last of the 100 samples for Case 3,
.
Case 1:
Table 1. Comparative accuracy performance.
|
Purity |
GNN |
LAP |
1.0 |
0.416 |
0.600 |
0.5 |
0.437 |
0.871 |
0.25 |
0.424 |
0.997 |
|
Accuracy |
GNN |
LAP |
1.0 |
0.940 |
0.604 |
0.5 |
0.871 |
0.227 |
0.25 |
0.880 |
0.081 |
Case 2:
Table 2. Comparative purity performance
|
Purity |
GNN |
LAP |
1.0 |
0.305 |
0.485 |
0.5 |
0.322 |
0.771 |
0.25 |
0.346 |
0.997 |
Table 3. Comparative accuracy performance.
|
Accuracy |
GNN |
LAP |
1.0 |
1.560 |
0.802 |
0.5 |
1.542 |
0.332 |
0.25 |
1.525 |
0.076 |
Case 3:
Table 4. Comparative purity performance
|
Purity |
GNN |
LAP |
1.0 |
0.206 |
0.454 |
0.5 |
0.230 |
0.780 |
0.25 |
0.269 |
0.990 |
Table 5. Comparative accuracy performance.
|
Accuracy |
GNN |
LAP |
1.0 |
3.39 |
0.807 |
0.5 |
3.26 |
0.338 |
0.25 |
3.26 |
0.084 |
Table 6. Case 3 comparative cost.
Scan |
GNN |
LAP |
2 |
7.497 |
7.497 |
3 |
15.320 |
1.333 |
4 |
8.976 |
1.836 |
5 |
30.356 |
3.493 |
6 |
34.680 |
3.792 |
7 |
27.892 |
1.489 |
8 |
49.460 |
2.127 |
9 |
31.545 |
-3.089 |
10 |
32.554 |
3.335 |
Table 7. Case 3 Comparative
.
Scan |
GNN |
LAP |
2 |
5.488 |
5.488 |
3 |
17.738 |
13.723 |
4 |
12.850 |
3.958 |
5 |
133.650 |
3.022 |
6 |
24.214 |
4.745 |
7 |
31.298 |
17.841 |
8 |
180.884 |
14.128 |
9 |
222.726 |
13.406 |
10 |
107.723 |
6.597 |
As for comparative assignment problem solution timing, we note that GNN requires
floating point comparisons. This is the same complexity as the LAP JV algorithm but with, as seen from the tables above, far inferior purity and accuracy. We have timed the two methods on Case 3 samples. Typical GNN operating time/per problem/scan is 0.015 sec.; the LAP method solution times did not register above 0 sec.
5. Conclusions
We first observe that the LAP observation-to-track assignment solution is the maximally likely assignment solution among all the permutations on
letters. It is obvious from the results above that the GNN solution ranks low in minimizing assignment problem cost among the Case 3
possible assignments and is far from optimal. It is also interesting to note that GNN purity and accuracy performance essentially do not improve as
decreases while LAP performance increases sharply as
decreases.
Moreover, Table 4 confirms our two previous experiences with GNN performance, i.e., purity performances in the 20 percentiles. While we have relied on a simple idealized model, the results here and in [1], which exploited a more sophisticated Multi Hypothesis Tracking (MHT) data association algorithm, are mutually supportive.
Table 5 tells us that GNN error is more than an order of magnitude larger than LAP error. We can also infer from Table 4 and Table 5 that high track purity and accuracy are inextricably linked. High purity is necessary for state estimation accuracy; state estimation accuracy, based on first physical principles-based trajectory dynamics for the sensor-detected objects, is necessary for high purity to accurately predict where object observations will be in the space of measurements on the next scan.
Table 6 and Table 7 are taken from taking the last of the Case 3 100 Monte Carlo samples as a typical result.
Table 6 emphasizes the assignment cost superiority of LAP over the GNN solutions. Except for the first row entries, where the GNN and LAP algorithms both produced exact assignment solutions, we see that GNN costs are many times higher than LAP costs.
Given LAP perfect purity, as was the case with this particular sample, the statistics
,
, appearing in the third column of Table 7 is
distributed with
degrees-of-freedom. The expected value of such a random variable is
because, for each
and
, the residual
, appearing in Equation (4) is
distributed.
It is also clear that when it is not practical, in a real satellite infrared or radar sensor tracking system, to drive sensor detection measurement line-of-sight and/or range variances down beyond certain physical limits an MHT K-best hypotheses algorithm is required.
Finally, we come to the inescapable conclusion that the use of GNN for data association in tracking systems is highly problematic. It is reasonable to assume that GNN performance will degrade when dynamics and measurement equations, as functions of time evolving states of several dimensions and sensor-to-observed object geometry, are far more complicated than in our simple model. The far more likely inference is that GNN data association real world performance will be significantly worse than is shown above.
Conflicts of Interest
The author declares no conflicts of interest.