Measurement Error for Age of Onset in Prevalent Cohort Studies


Prevalent cohort studies involve screening a sample of individuals from a population for disease, recruiting affected individuals, and prospectively following the cohort of individuals to record the occurrence of disease-related complications or death. This design features a response-biased sampling scheme since individuals living a long time with the disease are preferentially sampled, so naive analysis of the time from disease onset to death will over-estimate survival probabilities. Unconditional and conditional analyses of the resulting data can yield consistent estimates of the survival distribution subject to the validity of their respective model assumptions. The time of disease onset is retrospectively reported by sampled individuals, however, this is often associated with measurement error. In this article we present a framework for studying the effect of measurement error in disease onset times in prevalent cohort studies, report on empirical studies of the effect in each framework of analysis, and describe likelihood-based methods to address such a measurement error.

Share and Cite:

Zhong, Y. and Cook, R. (2014) Measurement Error for Age of Onset in Prevalent Cohort Studies. Applied Mathematics, 5, 1672-1683. doi: 10.4236/am.2014.511160.

Conflicts of Interest

The authors declare no conflicts of interest.


[1] Zelen, M. and Feinleib, M. (1969) On the Theory of Screening for Chronic Diseases. Biometrika, 56, 601-614.
[2] Zelen, M. (2004) Forward and Backward Recurrence Times and Length Biased Sampling: Age Specific Models. Lifetime Data Analysis, 10, 325-334.
[3] Lagakos, S.W., Barraj, L.M. and De Gruttola, V. (2006) Nonparametric Analysis of Truncated Survival Data, with Applications to AIDS. Biometrika, 75, 515-523.
[4] Wolfson, C., Wolfson, D.B., Asgharian, M., M’Lan, C.E., ?stbye, T., Rockwood, K. and Hogan, D.B. (2001) A Reevaluation of the Duration of Survival after the Onset of Dementia. New England Journal of Medicine, 344, 1111-1116.
[5] Asgharian, M., M’Lan, C.E. and Wolfson, D.B. (2002) Length-Biased Sampling with Right Censoring: An Unconditional Approach. Journal of the American Statistical Association, 97, 201-209.
[6] Rothman, K.J., Greenland, S. and Lash, T.L. (2008) Modern Epidemiology. Lippincott Williams & Wilkins, Philadelphia.
[7] Cox, D.R. and Miller, H.D. (1965) The Theory of Stochastic Processes. Chapman, London.
[8] Kalbfleisch, J.D. and Lawless, J.F. (1991) Regression Models for Right Truncated Data with Applications to AIDS incubation Times and Reporting Lags. Statistica Sinica, 1, 19-32.
[9] Turnbull, B.W. (1976) The Empirical Distribution Function with Arbitrarily Grouped, Censored and Truncated Data. Journal of the Royal Statistical Society, Series B (Methodological), 38, 290-295.
[10] Wang, M.-C. (1991) Nonparametric Estimation from Cross-Sectional Survival Data. Journal of the American Statistical Association, 86, 130-143.
[11] Keiding, N. and Moeschberger, M. (1992) Independent Delayed Entry, Survival Analysis: State of the Art. Springer, New York, 309-326.
[12] Wang, M.-C., Brookmeyer, R. and Jewell, N.P. (1993) Statistical Models for Prevalent Cohort Data. Biometrics, 49, 1-11.
[13] Vardi, Y. (1982) Nonparametric Estimation in the Presence of Length Bias. The Annals of Statistics, 10, 616-620.
[14] Vardi, Y. (1989) Multiplicative Censoring, Renewal Processes, Deconvolution and Decreasing Density: Nonparametric Estimation. Biometrika, 76, 751-761.
[15] Vardi, Y. and Zhang, C.-H. (1992) Large Sample Study of Empirical Distributions in a Random-Multiplicative Censoring Model. The Annals of Statistics, 20, 1022-1039.
[16] Asgharian, M. and Wolfson, D.B. (2005) Asymptotic Behavior of the Unconditional NPMLE of the Length-Biased Survivor Function from Right Censored Prevalent Cohort Data. The Annals of Statistics, 33, 2109-2131.
[17] Huang, C.-Y. and Qin, J. (2011) Nonparametric Estimation for Length-Biased and Right-Censored Data. Biometrika, 98, 177-186.
[18] Wang, M.-C. (1996) Hazards Regression Analysis for Length-Biased Data. Biometrika, 83, 343-354.
[19] Luo, X.D. and Tsai, W.Y. (2009) Nonparametric Estimation for Right-Censored Length-Biased Data: A Pseudo-Partial Likelihood Approach. Biometrika, 96, 873-886.
[20] Tsai, W.Y. (2009) Pseudo-Partial Likelihood for Proportional Hazards Models with Biased-Sampling Data. Biometrika, 96, 601-615.
[21] Shen, Y., Ning, J. and Qin, J. (2009) Analyzing Length-Biased Data with Semiparametric Transformation and Accelerated Failure Time Models. Journal of the American Statistical Association, 104, 1192-1202.
[22] Qin, J. and Shen, Y. (2010) Statistical Methods for Analyzing Right-Censored Length-Biased Data under Cox Model. Biometrics, 66, 382-392.
[23] Wang, M.-C., Jewell, N.P. and Tsai, W.-Y. (1986) Asymptotic Properties of the Product Limit Estimate under Random Truncation. The Annals of Statistics, 14, 1597-1605.
[24] Qin, J., Ning, J., Liu, H. and Shen, Y. (2011) Maximum Likelihood Estimations and EM Algorithms with Length-Biased Data. Journal of the American Statistical Association, 106, 1434-1449.
[25] Carroll, R.J., Ruppert, D., Stefanski, L.A. and Crainiceanu, C.M. (2006) Measurement Error in Nonlinear Models. Chapman & Hall, London.

Copyright © 2023 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.