Modelling HIV/AIDS Cases in Zambia: A Comparative Study of the Impact of Mandatory HIV Testing

Abstract

In this study, a time series modeling approach is used to determine an ARIMA model and advance counterfactual forecasting at a point of policy intervention. We consider monthly data of HIV/AIDS cases from the Ministry of Health (Copperbelt province) of Zambia, for the period 2010 to 2019 and have a total of 120 observations. Results indicate that ARIMA (1, 0, 0) is an adequate model which best fits the HIV/AIDS time series data and is, therefore, suitable for forecasting cases. The model predicts a reduction from an average of 3500 to 3177 representing 14.29% in HIV/AIDS cases from 2017 (year of policy activation) to 2019, but the actual recorded cases dropped from 3500 to 1514 accounting for 57.4% in the same time frame.

Share and Cite:

Moyo, E. , Shakalima, J. , Chambashi, G. , Muchinga, J. and Matindih, L. (2021) Modelling HIV/AIDS Cases in Zambia: A Comparative Study of the Impact of Mandatory HIV Testing. Open Journal of Statistics, 11, 409-419. doi: 10.4236/ojs.2021.113025.

1. Introduction

Human Immunodeficiency Virus (HIV) infection remains the prominent cause of morbidity and mortality throughout the world. Since the start of the epidemic, around 76.1 million people have been infected and almost 35.4 million people have died from Acquired Immunodeficiency Syndrome (AIDs) related illnesses.

Globally, in 2019 there were an estimated 38 million people living with HIV. The vast majority of people living with HIV are located in low and middle-income countries, with an estimated 66% living in sub-Saharan Africa. Among this group 19.6 million are living in East and Southern Africa which saw increasing HIV infections due to HIV service disruptions during COVID-19 and the slowing public health response to HIV [1].

As one of the Sub-Saharan countries, the cases in Zambia are not different and HIV prevalence in Zambia continued to decline. The recent Zambia Population HIV Impact Assessment (ZAMPHIA) survey shows a reduction in the prevalence of about 1.7 percentage points from 13.3% in 2014 to 11.6% in 2016 [2]. The health sector has also recorded remarkable progress on antiretroviral treatment (ART) coverage, which stands at 72% of the eligible people against the United Nations AIDS global target of 90% according to Zambia National Health Strategic Plan (2017-2021) [2]. In 2018, around 48,000 adults and 5400 children became newly infected with HIV in Zambia. In the same year around 1.2 million people in Zambia were living with HIV and 17,000 people died from HIV-related illnesses. Also, the number of people newly diagnosed with HIV and with active tuberculosis (TB) entering care has fallen significantly from 66% in 2015 to 10% in 2017 with reference to [3].

Reference [3] articulates that in 2018, 78% of all people living with HIV were on treatment. As of 2019, 87% of people living with HIV were aware of their status, and 89% on treatment and 75% were virally suppressed. [3] also urges that in Zambia there has been progress in the number of AIDS-related deaths since 2010, with a 37% decrease, from 26,000 deaths to 17,000 deaths. The number of new HIV infections has also decreased, from 56,000 to 48,000 in the same period. The 90-90-90 targets envision that, by 2030, 90% of people living with HIV will know their HIV status, 90% of people who know their HIV-positive status will be accessing treatment and 90% of people on treatment will have suppressed viral loads [3].

The policy intervention on HIV/AIDS cases in Zambia presents a comparative (also known as counterfactual or causal impact analysis) time series problem. Reference [4] explains that this technique is used to study the causal effect resulting from the difference between the observed series and the series that would have been observed had the intervention not taken place. In this study, the Autoregressive Integrated Moving Average (ARIMA) model also known as the “Box-Jenkins” methodology following the work of Box and Jenkins was used to determine the ideal model of best fit to be used in forecasting the HIV/AIDS data. Box-Jenkins forecasting is of greatest use when the primary factors causing demand for products, services, revenue, and, in this instance, disease burden is believed to behave in the future as well as it did in the past [5].

The study by [6] evaluated the impact of cigarette exercise tax increases in Pennsylvania, from 2000 to 2009. They used segmented regression analysis of interrupted time series to evaluate the level and trend changes in Pennsylvania adult’s current smoking prevalence, age-adjusted acute myocardial infarction (AMI) and asthma hospitalization rates, age-specific asthma hospitalization, and age adjusted rates following the cigarette exercise tax increases. Results showed that after the first tax increase, no beneficial effects were noted on the outcome of interest. The second tax increase was associated with significant declines in smoking prevalence for people aged 18 to 39. A decrease was also noticed in asthma hospitalizations and men’s aged-adjusted AMI hospitalization rates.

[7] employed intervention (comparative study) time series analysis to evaluate the effectiveness of a collaborative intervention to improve quality in pre-hospital ambulance care for acute myocardial infarction (AMI) and stroke. Their findings were such that, based on the estimated change in intercept and slope from pre to post intervention using segmented regression, they found insufficient evidence of a statistically significant effect on quality of care for stroke, although potential clinically important effects for AMI cannot be ruled out.

[8] used controlled interrupted time series analysis to quantify the effect of the introduction of 20 mph (32 km an hour) traffic speed zones on road collisions, injuries, and fatalities in London. The results were, the introduction of 20 mph zones was associated with a 41.9% (95% confidence interval 36.0% to 47.8%) reduction in road casualties, after adjustment for underlying time trends. The percentage reduction was greatest in younger children and greater for the category of killed or seriously injured casualties than for minor injuries. There was no evidence of casualty migration to areas adjacent to 20 mph zones, where casualties also fell slightly by an average of 8.0% (4.4% to 11.5%). Thus, they concluded that 20 mph zones are effective measures for reducing road injuries and deaths.

The significance of this inquiry is to determine if the policy of mandatory HIV/AIDS testing for all, has had any impact on the disease trend/pattern. This was done through development of forecasting models for predicting number of expected cases under this policy intervention so as to guide policy intervention and control measures on time. The rest of the paper is organized as follows: we discuss the methodology considered in Section 2. The results and discussion are contained in Section 3 and lastly, we conclude the paper in Section 4.

2. Methods and Materials

HIV/AIDS data for the study is taken from the administrative data submitted to the Copperbelt Provincial Medical Office (CPMO) as cases by 311 health facilities. The time series for monthly HIV/AIDS aggregated data is from 2010 to 2019 accounting for 120 observations and the annual summary of recorded cases is presented in Table 1. The sample size is 2842212. X t = 2010 , , 2019 where X t represents monthly observations. MS EXCEL package was used as a primary data storage and EVIEWS version 9.0 software is used to implement stochastic models and graphical representations.

2.1. Stochastic Modelling

The Box-Jenkins methodology developed by [5] is a systematic process which is executed by using an iterative process until an adequate model is achieved. The

Table 1. Shows a summary of HIV/AIDS cases from the total samples tested annually.

procedure is achieved by a step-by-step process of model; identification, specification, estimation, diagnostics and forecast. The ARIMA model has three parameters and these are; autoregressive (p), differencing order (d) and the order of moving average (q). The generic Box-Jenkins models are denoted by ARIMA (p, d, q) given by

( B ) ( X t μ ) = θ ( B ) e t (1)

where, , θ and e t are autoregressive parameter, moving average parameter and residual respectively. The residuals are assumed to be independent and identically distributed (i.i.d) normal random variables [9] [10].

2.2. Measure of Forecast Accuracy

The statistics are used to compare how well models fit the time series. Reference [11] explains that the best fit model is one with a high number of smaller errors. These errors are; the Root Mean Square Error (RMSE), the Mean Absolute Percentage Error (MAPE), the Mean Percentage Error (MPE) and the Mean Absolute Error (MAE). Forecast error is given by,

F E t = O t F t (2)

where F E t is the forecast error, O t is the observed value and F t is the forecast value.

The AIC and SBC are given by

AIC = 2 ( log ( L ) k ) (3)

SBC = 2 log ( L ) + k log ( n ) (4)

Other measures of forecast accuracy are given by

MSE = t = 1 n ( O t F t ) 2 n (5)

MAE = t = 1 n | F E t | n (6)

MAPE = t = 1 n | F E t O t | n × 100 (7)

In these formulas, L is the value of the likelihood function evaluated at the parameter estimates, n is the number of observations, k is the number of estimated parameters, log ( . ) = log e ( . ) computes the natural logarithm and t = 1 , 2 , 3 , , n .

3. Results and Discussion

This is the first step in the Box Jenkins procedure of modelling time series where the sample autocorrelation function (SACF) and sample partial autocorrelation function (SPACF) are used as guides to choosing the appropriate Autoregressive Integrated Moving Average (ARIMA) model. The time plot for HIV/AIDS monthly cases is displayed in Figure 1.

Figure 2 displays the two graphical tools which are the autocorrelation function (ACF) and the partial autocorrelation function (PACF), and their plots shows that the ACF decays exponentially and the PACF has a single spike at lag 1 indicating that the series is generated by an ARIMA (1, 0, 0) process.

Figure 3 shows that the Dickey-Fuller test statistic (4.270636) which is greater than (2.886074 at 5% level of significance). Hence, we reject the NULL hypothesis and conclude that the series has no Unit Root (the series is stationary).

3.1. Parameter Estimation

Figure 4 shows the estimated model parameters at 5% level of significance. The autoregressive (AR) parameter is statistically significant implying that the parameter influenced the model and no other. The mathematical representation of

Figure 1. Time plot for monthly HIV/AIDS cases in Zambia from 2010 to 2019.

Figure 2. ACF and PACF for HIV/AIDS cases.

Figure 3. Augmented Dickey Fuller test (stationary test).

Figure 4. Estimated parameters and significant tests.

the AR (1) model is

( 1 B ) ( X t μ ) = e t ,

X t = μ + ( X t 1 μ ) + e t (8)

From Equation (8), we estimate the parameter ^ by the method of moments using Yule Walker estimator for ^ which is ^ = ρ ^ 1 . Thus, ^ = 0.3816 .

The moment estimator for μ is X ¯ t given by

t = 1 120 X t n = 371003 120 = 3091.68

Thus, the model is

X t = 3091.68 0.3816 ( X t 1 3091.68 ) + e t (9)

3.2. Diagnostic Checks

Figure 5 shows the Box-Djung statistics is significant at all lags and the values of ρ ^ k or ^ k are all lying within the range of ± 2 n , hence we have insufficient

evidence to reject Ho (the process is a white noise) and conclude that the model assumptions are indeed that of a white noise.

In Figure 6, the histogram shows that the average of residuals is approximately 0. And that the (Jarque-Bera) normality test of residues is statistically significant at 5% level of significance. Thus, we conclude that the residues are normally distributed.

3.3. Forecasting

Box-Jenkins approach to forecasting stationary time series is reasonably stress-free.

Figure 5. Serial correlation test (Box-Djung test) for the series.

The k step ahead forecast value X t + k given all observations up to n, is denoted by X ^ t ( k ) .

Figure 7 shows two trajectories of forecasted HIV/AIDS cases and actual cases of 18 months from 2017 to 2019 using ARIMA (1, 0, 0). Their behaviour is such that the predicted cases have a steady decline pattern compared to the actual cases which exhibits a sudden downward decline in the pattern after the intervention.

Figure 8 shows forecast error bounds, with a display of measures of forecast accuracy which were considered for AR (1) model goodness of fit. In the same setting the bias proportion tells us how far the mean of the prediction is from the mean of the actual series, the variance proportion tells us how far the variation of the forecast is from the variation of the actual one and finally the covariance proportion measures the remaining unsystematic forecasting errors. Note that the bias, variance, and covariance proportions add up to one.

3.4. Discussion

Our future ability to control HIV is dependent on the skills and proficiency of

Figure 6. Histogram and normality test of residuals

Figure 7. Trajectories of forecasted HIV/AIDS cases and actual ones.

Figure 8. Forecast error bounds, with a display of measure of forecast accuracy, and a decomposition of the mean squared forecast error.

scientists, health care providers and public health professionals working in all parts of Zambia. HIV/AIDS is a huge public health problem with respect to morbidity and its burden on health care facilities. Persons living with HIV/AIDS are vectors of the disease and potential dangers to the community. In addition, persons living with the virus face a lot of stigma and discrimination from the general populace. Which result in people shunning away from getting tested for the said virus. This norm poses a challenge towards the fight against its spread in as much as the carriers themselves are not accounted for. Thus, planning and budgeting for HIV/AIDS related activities is a challenge. However, the government of the republic of Zambia saw it fit to make it mandatory to test for HIV/AIDS virus whenever other illnesses are attended to.

The ARIMA (1, 0, 0) is a suitable model which best fits the HIV/AIDS time series cases and is therefore appropriate for forecasting HIV/AIDS cases per month in the future based on observed HIV/AIDS cases in past times. The results also review that in the last 10 years the HIV/AIDS pattern has been a fluctuating trend, with a highest peak being recorded in the late 2017 to early 2018 just after policy implementation. Then cases declined below average of around 14.29% as forecasted to a record breaking nose dive of 57.4%, towards the end of the year 2019. The potential implication of this study is that by developing forecasting models for predicting HIV/AIDS cases in advance on a regular basis is to support internal decisions and planning for programs such as test and treat.

4. Conclusion

In this paper, the Box-Jenkins modelling process is used to determine an ARIMA model and advance to forecasting. The HIV/AIDS data for the study were taken from Ministry of Health (Copperbelt province administrative office) for the period, 2010 to 2019 all-ages inclusive, with 120 observations. The results show that ARIMA (1, 0, 0) is a suitable model which best fits the HIV/AIDS time series cases and is therefore appropriate for forecasting. The model predicts a reduction from 3500 to 3177 representing 14.29% in HIV/AIDS cases from 2017 (year of policy activation) to 2019, but the actual recorded cases dropped from 3500 to 1514 accounting for 57.4% in the same time frame. Finally, the robust efforts being made by the Zambian government aimed at achieving the 90-90-90 target of ending AIDS by 2030 are indeed a workable solution. But more needs to be done such as behavioural change sensitizations and also implementing programmes aimed at reducing risk behaviours.

Acknowledgements

The authors are thankful to the Ministry of Health for providing the data. Department of Mathematics and Statistics, Mulungushi University for using their resources and to all the people who helped in making comments on this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

[1] World Health Organization (2020) World HIV/AIDS Report. Geneva, Switzerland.
https://www.who.int/news-room/fact-sheets/detail/hiv-aids
[2] Zambia Statistical Agency (2016) Zambia Population HIV Impact Assessment (Zamphia) Survey.
https://www.zamstats.gov.zm/
[3] Unaids and Joint United Nations Programme on HIV/AIDS (2010) Getting to Zero: 2011-2015 Strategy. World Health Organization.
[4] Brodersen, K.H., Gallusser, F., Koehler, J., Remy, N. and Scott, S.L. (2015) Inferring Causal Impact Using Bayesian Structural Time-Series Models. The Annals of Applied Statistics, 9, 247-274.
https://doi.org/10.1214/14-AOAS788
[5] Box, G.E., Jenkins, G.M., Reinsel, G.C. and Ljung, G.M. (2015) Time Series Analysis: Forecasting and Control. John Wiley & Sons, Hoboken.
[6] Ma, Z.Q., Kuller, L.H., Fisher, M.A. and Ostroff, S.M. (2013) Peer Reviewed: Use of Interrupted Time-Series Method to Evaluate the Impact of Cigarette Excise Tax Increases in Pennsylvania, 2000-2009. Preventing Chronic Disease, 10.
https://doi.org/10.5888/pcd10.120268
[7] Taljaard, M., McKenzie, J.E., Ramsay, C.R. and Grimshaw, J.M. (2014) The Use of Segmented Regression in Analysing Interrupted Time Series Studies: An Example in Pre-hospital Ambulance Care. Implementation Science, 9, Article No. 77.
https://doi.org/10.1186/1748-5908-9-77
[8] Grundy, C., Steinbach, R., Edwards, P., Green, J., Armstrong, B. and Wilkinson, P. (2009) Effect of 20 mph Traffic Speed Zones on Road Injuries in London, 1986-2006: Controlled Interrupted Time Series Analysis. BMJ, 339, b4469.
https://doi.org/10.1136/bmj.b4469
[9] William, W.S. and Wei, S. (2006) Time Series Analysis: Univariate and Multivariate Methods. USA, Pearson Addison Wesley, Segunda Edicion, 212-235.
[10] Jere, S. and Moyo, E. (2016) Modelling Epidemiological Data Using Box-Jenkins Procedure. Open Journal of Statistics, 6, 295-302.
https://doi.org/10.4236/ojs.2016.62025
[11] Siluyele, I. and Jere, S. (2016) Using Box-Jenkins Models to Forecast Mobile Cellular Subscription. Open Journal of Statistics, 6, 303-309.
https://doi.org/10.4236/ojs.2016.62026

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.