Review of guidelines and literature for handling missing data in longitudinal clinical trials with a case study
Corresponding Author
M. Liu
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ, USA
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ 07065, USASearch for more papers by this authorL. Wei
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ, USA
Search for more papers by this authorJ. Zhang
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ, USA
Search for more papers by this authorCorresponding Author
M. Liu
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ, USA
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ 07065, USASearch for more papers by this authorL. Wei
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ, USA
Search for more papers by this authorJ. Zhang
Clinical Biostatistics, Merck & Co., Inc., Rahway, NJ, USA
Search for more papers by this authorAbstract
Missing data in clinical trials are inevitable. We highlight the ICH guidelines and CPMP points to consider on missing data. Specifically, we outline how we should consider missing data issues when designing, planning and conducting studies to minimize missing data impact. We also go beyond the coverage of the above two documents, provide a more detailed review of the basic concepts of missing data and frequently used terminologies, and examples of the typical missing data mechanism, and discuss technical details and literature for several frequently used statistical methods and associated software. Finally, we provide a case study where the principles outlined in this paper are applied to one clinical program at protocol design, data analysis plan and other stages of a clinical trial. Copyright © 2006 John Wiley & Sons, Ltd.
REFERENCES
- 1 ICH E9: Statistical Principles of Clinical Trials; Section 5.3 Missing Values and Outliers.
- 2 Committee for Proprietary Medicinal Products (CPMP). (November 2001), Points to consider on missing data. CPMP, November 2001.
- 3 Carpenter J, Pocock S, Lamm CJ. Coping with missing data in clinical trials: a model-based approach applied to asthma trials. Statistics in Medicine 2002; 21: 1043–1066.
- 4 Rubin DB. Inference and missing data. Biometrika 1976; 63: 581–590.
- 5 Little RJA, Rubin DB. Statistical analysis with missing data. Wiley: New York, 2002.
10.1002/9781119013563 Google Scholar
- 6 Diggle P, Kenward MG. Informative drop-out in longitudinal data analysis (Disc: p. 73–93). Applied Statistics 1994; 43: 49–73.
- 7 Carpenter J. L.O.C.F. – Time to stop carrying it forward. Talk given to PSI Scientific Meeting on Missing Data, London, 2004.
- 8 Barnard J, Rubin DB. Multiple Imputation for Missing Data. Continuing Education Course Handout, Joint Statistical Meeting, Atlanta, 2001.
- 9 Albert PS. Longitudinal data analysis (repeated measures) in clinical trials. Statistics in Medicine 1999; 18(13): 1707–1732.
10.1002/(SICI)1097-0258(19990715)18:13<1707::AID-SIM138>3.0.CO;2-H CASPubMedWeb of Science®Google Scholar
- 10 Cnaan A, Laird NM, Slasor P. Tutorial in Biostatistics: using the general linear mixed model to analyse unbalanced repeated measures and longitudinal data. Statistics in Medicine 1997; 16: 2349–2380.
10.1002/(SICI)1097-0258(19971030)16:20<2349::AID-SIM667>3.0.CO;2-E CASPubMedWeb of Science®Google Scholar
- 11 Jennrich RI, Schluchter MD. Unbalanced repeated-measures models with structured covariance matrices. Biometrics 1986; 42: 805–820.
- 12 Laird NM, Ware JH. Random-effects models for longitudinal data. Biometrics 1982; 38: 963–974.
- 13 Zhao LP, Prentice RL. Correlated binary regression using a quadratic exponential model. Biometrika 1990; 77: 642–648.
- 14 Longford NT. Random coefficient models. Oxford University Press: Oxford, 1993.
- 15 Stiratelli R, Laird N, Ware JH. Random effects models for serial observations with binary response. Biometrics 1984; 40: 961–971.
- 16 Little RJA, Schluchter MD. Maximum likelihood estimation for mixed continuous and categorical data with missing values. Biometrika 1985; 72: 497–512.
- 17 Olkin I, Tate RF. Multivariate correlation models with mixed discrete and continuous variables (Corr: V36 p. 343). The Annals of Mathematical Statistics 1961; 32: 448–465.
10.1214/aoms/1177705052 Google Scholar
- 18 Lawless JF. Statistical models and methods for lifetime data ( 2nd edn). Wiley: New Jersey, 2003.
- 19 Kalbfleisch JD, Prentice RL. The statistical analysis of failure time data ( 2nd edn). Wiley: New Jersey, 2002.
10.1002/9781118032985 Google Scholar
- 20 Lawless JF, Kalbfleisch JD, Wild CJ. Semiparametric methods for response-selective and missing data problems in regression. Journal of the Royal Statistical Society B 1999; 61: 413–438.
- 21 Murray GD. Missing data, Chapter: Biostatistics in clinical trials. In Wiley reference series in biostatistics, C Redmond, C Theodore (eds). Wiley: New York, 2001.
- 22 Schafer JL. Analysis of incomplete multivariate data. Chapman & Hall/CRC: London, 1997.
- 23 Follmann DA, Wu MC. An approximate generalized linear model with random effects for informative missing data. Biometrics 1995; 51: 151–168.
- 24 Little RJA. Modeling the drop-out mechanism in repeated-measures studies. Journal of the American Statistical Association 1995; 90: 1112–1121.
- 25 Mori M, Woolson RF, Woodworth GG. Slope estimation in the presence of informative right censoring: modeling the number of observations as a geometric random variable. Biometrics 1994; 50: 39–50.
- 26 Schluchter MD. Methods for the analysis of informatively censored longitudinal data (Disc: p. 1881–1885). Statistics in Medicine 1992; 11: 1861–1870.
- 27 Shih WJ, Quan H, Chang MN. Estimation of the mean when data contain non-ignorable missing values from a random effects model. Statistics & Probability Letters 1994; 19: 249–257.
- 28 Wu MC, Carroll RJ. Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process (Corr: V45 p. 1347; V47 p. 357). Biometrics 1988; 44: 175–188.
- 29 Little RJA. A class of pattern-mixture models for normal incomplete data. Biometrika 1994; 81: 471–483.
- 30 Little RJA, Wang YX. Pattern-mixture models for multivariate incomplete data with covariates. Biometrics 1996; 52: 98–111.
- 31 Michiels B, Molenberghs G, Lipsitz SR. Selection models and pattern-mixture models for incomplete data with covariates. Biometrics 1999; 55: 978–983.
- 32 Shih WJ, Quan H. Testing for treatment differences with dropouts present in clinical trials—a composite approach. Statistics in Medicine 1997; 16: 1225–1239.
10.1002/(SICI)1097-0258(19970615)16:11<1225::AID-SIM548>3.0.CO;2-Y CASPubMedWeb of Science®Google Scholar
- 33 Cook RJ, Zeng Ly, Yi GY. Marginal analysis of incomplete longitudinal binary data: a cautionary note on LOCF imputation. Biometrics 2004; 60: 820–828.
- 34 Mallinckrodt CH, Clark WS, David SR. Type I error rates from mixed effects model repeated measures versus fixed effects ANOVA with missing values imputed via last observation carried forward. Drug Information Journal 2001; 35: 1215–1225.
- 35 Mallinckrodt CH, Clark WS, David SR. Accounting for dropout bias using mixed-effects models. Journal of Biopharmaceutical Statistics 2001; 11: 9–21.
- 36 Molenberghs G, Thijs H, Jansen I, Beunkens C, Kenward MG, Mallinkrodt C, Carroll RJ. Analyzing incomplete longitudinal clinical trial data. Biostatistics 2004; 5: 445–464.
- 37 Little RJA, Yau L. Intent-to-treat analysis for longitudinal studies with drop-outs. Biometrics 1996; 52: 1324–1333.
- 38 Liu M, Lunceford J, Luo E, Wei L. Phase III trial simulation to select missing data methodology. Merck Technical Report # 106, 2004.
- 39 Rubin DB. Multiple imputation for nonresponse in surveys. Wiley: New York, 1987.
10.1002/9780470316696 Google Scholar
- 40 Rubin DB, Schenker N. Multiple imputation in health-care databases: an overview and some applications. Statistics in Medicine 1991; 10: 585–598.
- 41 Lavori PW, Dawson R, Shera D. A multiple imputation strategy for clinical trials with truncation of patient data. Statistics in Medicine 1995; 14: 1913–1925.
- 42 Allison PD. Multiple imputation for missing data: a cautionary tale. Sociological Methods and Research 2000; 28: 301–309.
- 43 van Buuren S, Boshuizen HC, Knook DL. Multiple imputation of missing blood pressure covariates in survival analysis. Statistics in Medicine 1999; 18: 681–694.
10.1002/(SICI)1097-0258(19990330)18:6<681::AID-SIM71>3.0.CO;2-R CASPubMedWeb of Science®Google Scholar
- 44 Dawson TD, Lagakos SW. Size and power of two sample tests of repeated measure. Biometrics 1993; 49: 1022–1032.
- 45 Little RJ, Raghunathan TE. On summary measures analysis of the linear mixed effects model for repeated measures when data are not missing completely at random. Statistics in Medicine 1999; 18: 2465–2478.
10.1002/(SICI)1097-0258(19990915/30)18:17/18<2465::AID-SIM269>3.0.CO;2-2 CASPubMedWeb of Science®Google Scholar
- 46 Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika 1986; 73: 13–22.
- 47 Carey V, Zeger SL, Diggle P. Modeling multivariate binary data with alternating logistic regressions. Biometrika 1993; 80: 517–526.
- 48 Lipsitz SR, Fitzmaurice GM. Estimating equations for measures of association between repeated binary responses. Biometrics 1996; 52: 903–912.
- 49 Lipsitz SR, Laird NM, Harrington DP. A three-stage estimator for studies with repeated and possibly missing binary outcomes. Applied Statistics 1992; 41: 203–213.
- 50 Robins JM, Rotnitzky A, Zhao LP. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association 1995; 90: 106–121.
- 51 Gould AL. A new approach to the analysis of clinical drug trials with withdrawals. Biometrics 1980; 36: 721–727.
- 52 Lachin JM. Worst-rank score analysis with informatively missing observations in clinical trials. Controlled Clinical Trials 1999; 20: 408–422.
- 53 Lachin JM. Statistical considerations in the intent-to-treat principle. Controlled Clinical Trials 2000; 21: 167–189.
- 54 Liu G, Gould L. Comparison of alternative strategies for analysis of longitudinal trials with dropouts. Journal of Biopharmaceutical Statistics 2002; 12: 207–226.
- 55 Toyoda I, Matsuyama Y, Ohashi Y. Estimation and comparison of rates of change in repeated-measures studies with planned dropout. Controlled Clinical Trials 2001; 22(6): 620–638.