Intended for healthcare professionals

Endgames Statistical Question

Randomised controlled trials: missing data

BMJ 2014; 349 doi: https://doi.org/10.1136/bmj.g4656 (Published 18 July 2014) Cite this as: BMJ 2014;349:g4656
  1. Louise Marston, senior research statistician1,
  2. Philip Sedgwick, reader in medical statistics and medical education2
  1. 1Department of Primary Care and Population Health, University College London, London NW3 2PF, UK
  2. 2Centre for Medical and Healthcare Education, St George’s, University of London, London, UK
  1. Correspondence to: L Marston l.marston{at}ucl.ac.uk

Researchers assessed the effectiveness of a range of weight management programmes for weight loss. A randomised controlled trial study design, incorporating eight treatment arms, was used. Each intervention—Weight Watchers, Slimming World, Rosemary Conley, a group based dietetics led programme, general practice one to one counselling, pharmacy led one to one counselling, and a choice of any of the six programmes—lasted for 12 weeks. The control treatment consisted of 12 vouchers enabling free entrance to a local leisure (fitness) centre. Participants were 740 obese or overweight men and women identified from general practice records.1

The primary outcome was weight loss at the end of the programme (12 weeks). Secondary outcomes included weight loss at one year. Baseline characteristics were available for all participants, whereas follow-up data were available for 658 (88.9%) participants at the end of the programme and 522 (70.5%) at one year.

Analyses were performed according to intention to treat, using “baseline observation carried forward” to account for missing data. All treatment programmes achieved significant weight loss from baseline to programme end. When compared with the control treatment at 12 weeks, the only programmes that resulted in significantly more weight loss were Weight Watchers (mean difference 2.53 kg, 95% confidence interval 1.30 to 3.76; P<0.001) and Rosemary Conley (2.18, 0.96 to 3.41; P=0.004). All programmes except general practice and pharmacy provision resulted in significant weight loss at one year. At one year, only the Weight Watchers programme resulted in significantly greater weight loss than the control treatment (2.5 kg, 0.8 to 4.2; P=0.022). It was concluded that commercially provided weight management services are more effective than primary care based services.

Which of the following statements, if any, are true for the method of baseline observation carried forward?

  • a) It maintained comparability between treatment groups at baseline

  • b) For participants who had their weight measured only at baseline, it assumed that they had no weight change during follow-up

  • c) It resulted in sample estimates with too narrow confidence intervals, thereby overestimating accuracy

Answers

Statements a, b, and c are all true for the method of baseline observation carried forward.

The aim of the trial was to assess the effectiveness of a range of weight management programmes for weight loss. A randomised controlled trial study design was used. Outcome measurements and baseline characteristics were available for all participants, whereas follow-up data were available for 658 (88.9%) participants at the end of the programme (12 weeks) and 522 (70.5%) at one year.

It is common for trial participants’ outcome measurements to be missing. Referred to generally as “missing data,” this can occur for a variety of reasons. Participants may not start the treatment that they are randomised to, they may be lost to follow-up, or they may not provide outcome measurements when required during follow-up. How missing data are dealt with in the analysis of a trial can have a major effect on the results and therefore the conclusions.

In the study above, the most straightforward method for dealing with missing data would have been to exclude those participants who did not provide all outcome measurements during follow-up. Such an approach is referred to as “complete case analysis.” However, if the occurrence of missing data is associated with a treatment or the outcomes, this approach may introduce confounding in the comparison of treatment groups. Participants were randomised to treatment groups. The aim of randomisation, if the sample size for a trial is large enough, is to produce groups of patients similar in baseline characteristics. Such factors include demographics, prognostic factors, and characteristics that influence participants to take part in a trial. Therefore, any differences between treatment groups in outcome would be due to differences in treatment received, not differences in baseline characteristics, and the trial would have internal validity. Described in a previous question,2 internal validity is the extent to which the observed treatment effects can be ascribed to differences in treatment and not confounding. However, the exclusion of those participants with missing data from the analysis would introduce confounding because the balance in baseline characteristics would no longer exist. As a result, internal validity would be threatened. Furthermore, the exclusion of participants would ultimately lead to reduced statistical power.

Imputation is an alternative approach to deleting participants with missing data. This involves replacing a missing observation with a plausible data value, which may be an existing value or one that is predicted from the participant’s available data. Missing data were imputed in the trial above using the method of “baseline observation carried forward.” Using this method, participants’ outcome measurements at baseline were used for all time points during follow-up where the data were missing. The method is appealing because it is straightforward. In particular, it ensures that all randomised patients are included in the analyses, thereby maintaining comparability between treatment groups at baseline (a is true) and minimising confounding.

It is difficult to predict the weight at 12 weeks or one year of participants with missing data. No doubt all participants would have experienced some weight change—whether a gain or decrease—during follow-up. Use of the method of baseline observation carried forward meant that participants’ outcome measurement at baseline was used at all times during follow-up when data were missing. The method assumed that the weight of participants who did not have their weight measured at 12 weeks and one year did not change during follow-up from baseline (b is true). This is unlikely, however, so the method may have provided biased outcome measurements and hence biased sample estimates.

In addition to using baseline observation carried forward to impute missing data, the researchers also undertook analyses with missing data imputed using the method of last observation carried forward. The two methods are similar approaches to the imputation of missing data. Use of the method of last observation carried forward meant that participants’ last outcome measurements were used for all subsequent missing data. As for the method of baseline observation carried forward, the method of last observation carried forward assumes that participants’ weights remain unchanged from when they last had their weight recorded. However, this does not seem plausible. The method of last observation carried forward was used as part of a sensitivity analysis. Because these methods of imputation of missing data may bias the sample estimates, the two methods were used to assess whether they produced similar results. However, both methods of imputation probably underestimate the standard deviation of the outcome of weight change for a treatment in a trial. This is likely to lead to confidence intervals that are too narrow and to provide estimates that are too accurate (c is true). Because both methods of imputation would probably lead to biased sample estimates for treatment groups, any comparisons between treatment groups would be inaccurate.

Although the methods of baseline observation carried forward and last observation carried forward have some advantages, they are generally discouraged as methods of imputation for the reasons described above. Much methodological work has recently been undertaken on the development of approaches to imputation. More sophisticated methods, including regression mean imputation and multiple imputation, have been developed that are thought to give more plausible and unbiased values. These methods for imputation will be explored in future endgames.

Notes

Cite this as: BMJ 2014;349:g4656

Footnotes

  • Competing interests: None declared.

References

View Abstract