Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data

Rachel C Jinks; Patrick Royston; Mahesh K B Parmar

doi:10.1186/s12874-015-0078-y

Discrimination-based sample size calculations for multivariable prognostic models for time-to-event data

BMC Med Res Methodol. 2015 Oct 12:15:82. doi: 10.1186/s12874-015-0078-y.

Authors

Rachel C Jinks¹, Patrick Royston², Mahesh K B Parmar³

Affiliations

¹ MRC Clinical Trials Unit at UCL, Aviation House, 125 Kingsway, London, WC2B 6NH, UK. rcjinks@gmail.com.
² MRC Clinical Trials Unit at UCL, Aviation House, 125 Kingsway, London, WC2B 6NH, UK. j.royston@ucl.ac.uk.
³ MRC Clinical Trials Unit at UCL, Aviation House, 125 Kingsway, London, WC2B 6NH, UK. m.parmar@ucl.ac.uk.

Abstract

Background: Prognostic studies of time-to-event data, where researchers aim to develop or validate multivariable prognostic models in order to predict survival, are commonly seen in the medical literature; however, most are performed retrospectively and few consider sample size prior to analysis. Events per variable rules are sometimes cited, but these are based on bias and coverage of confidence intervals for model terms, which are not of primary interest when developing a model to predict outcome. In this paper we aim to develop sample size recommendations for multivariable models of time-to-event data, based on their prognostic ability.

Methods: We derive formulae for determining the sample size required for multivariable prognostic models in time-to-event data, based on a measure of discrimination, D, developed by Royston and Sauerbrei. These formulae fall into two categories: either based on the significance of the value of D in a new study compared to a previous estimate, or based on the precision of the estimate of D in a new study in terms of confidence interval width. Using simulation we show that they give the desired power and type I error and are not affected by random censoring. Additionally, we conduct a literature review to collate published values of D in different disease areas.

Results: We illustrate our methods using parameters from a published prognostic study in liver cancer. The resulting sample sizes can be large, and we suggest controlling study size by expressing the desired accuracy in the new study as a relative value as well as an absolute value. To improve usability we use the values of D obtained from the literature review to develop an equation to approximately convert the commonly reported Harrell's c-index to D. A flow chart is provided to aid decision making when using these methods.

Conclusion: We have developed a suite of sample size calculations based on the prognostic ability of a survival model, rather than the magnitude or significance of model coefficients. We have taken care to develop the practical utility of the calculations and give recommendations for their use in contemporary clinical research.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Decision Making
Humans
Liver Neoplasms / mortality*
Models, Theoretical*
Prognosis
Research Design*
Sample Size*

Grants and funding

Medical Research Council/United Kingdom