Comparison of Four Methods for Handing Missing Data in Longitudinal Data Analysis through a Simulation Study

HTML  XML Download Download as PDF (Size: 2570KB)  PP. 933-944  
DOI: 10.4236/ojs.2014.411088    9,562 Downloads   16,869 Views  Citations
Author(s)

ABSTRACT

Missing data can frequently occur in a longitudinal data analysis. In the literature, many methods have been proposed to handle such an issue. Complete case (CC), mean substitution (MS), last observation carried forward (LOCF), and multiple imputation (MI) are the four most frequently used methods in practice. In a real-world data analysis, the missing data can be MCAR, MAR, or MNAR depending on the reasons that lead to data missing. In this paper, simulations under various situations (including missing mechanisms, missing rates, and slope sizes) were conducted to evaluate the performance of the four methods considered using bias, RMSE, and 95% coverage probability as evaluation criteria. The results showed that LOCF has the largest bias and the poorest 95% coverage probability in most cases under both MAR and MCAR missing mechanisms. Hence, LOCF should not be used in a longitudinal data analysis. Under MCAR missing mechanism, CC and MI method are performed equally well. Under MAR missing mechanism, MI has the smallest bias, smallest RMSE, and best 95% coverage probability. Therefore, CC or MI method is the appropriate method to be used under MCAR while MI method is a more reliable and a better grounded statistical method to be used under MAR.

Share and Cite:

Zhu, X. (2014) Comparison of Four Methods for Handing Missing Data in Longitudinal Data Analysis through a Simulation Study. Open Journal of Statistics, 4, 933-944. doi: 10.4236/ojs.2014.411088.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.