The uncertainty with using risk prediction models for individual decision making: an exemplar cohort study examining the prediction of cardiovascular disease in English primary care

Alexander Pate; Richard Emsley; Darren M Ashcroft; Benjamin Brown; Tjeerd van Staa

doi:10.1186/s12916-019-1368-8

The uncertainty with using risk prediction models for individual decision making: an exemplar cohort study examining the prediction of cardiovascular disease in English primary care

BMC Med. 2019 Jul 17;17(1):134. doi: 10.1186/s12916-019-1368-8.

Authors

Alexander Pate¹, Richard Emsley², Darren M Ashcroft^{3

4}, Benjamin Brown^{4

5}, Tjeerd van Staa^{6

7}

Affiliations

¹ Centre of Health eResearch, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK. alexander.pate@manchester.ac.uk.
² Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology and Neuroscience, King's College London, De Crispigny Park, London, SE5 8AF, UK.
³ NIHR Greater Manchester Patient Safety Translational Research Centre, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK.
⁴ NIHR School for Primary Care Research, Centre for Primary Care, Division of Population of Health, Health Services Research and Primary Care, Manchester Academic Health Science Centre, University of Manchester, Manchester, M13 9PL, UK.
⁵ Public Health England North West, 3 Piccadilly Place, London Road, Manchester, M1 3BN, UK.
⁶ Centre of Health eResearch, School of Health Sciences, Faculty of Biology, Medicine and Health, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK.
⁷ Division of Pharmacoepidemiology and Clinical Pharmacology, Utrecht Institute of Pharmaceutical Sciences, Utrecht University, Utrecht, Netherlands.

Abstract

Background: Risk prediction models are commonly used in practice to inform decisions on patients' treatment. Uncertainty around risk scores beyond the confidence interval is rarely explored. We conducted an uncertainty analysis of the QRISK prediction tool to evaluate the robustness of individual risk predictions with varying modelling decisions.

Methods: We derived a cohort of patients eligible for cardiovascular risk prediction from the Clinical Practice Research Datalink (CPRD) with linked hospitalisation and mortality records (N = 3,792,474). Risk prediction models were developed using the methods reported for QRISK2 and 3, before adjusting for additional risk factors, a secular trend, geographical variation in risk and the method for imputing missing data when generating a risk score (model A-model F). Ten-year risk scores were compared across the different models alongside model performance metrics.

Results: We found substantial variation in risk on the individual level across the models. The 95 percentile range of risks in model F for patients with risks between 9 and 10% according to model A was 4.4-16.3% and 4.6-15.8% for females and males respectively. Despite this, the models were difficult to distinguish using common performance metrics (Harrell's C ranged from 0.86 to 0.87). The largest contributing factor to variation in risk was adjusting for a secular trend (HR per calendar year, 0.96 [0.95-0.96] and 0.96 [0.96-0.96]). When extrapolating to the UK population, we found that 3.8 million patients may be reclassified as eligible for statin prescription depending on the model used. A key limitation of this study was that we could not assess the variation in risk that may be caused by risk factors missing from the database (such as diet or physical activity).

Conclusions: Risk prediction models that use routinely collected data provide estimates strongly dependent on modelling decisions. Despite this large variability in patient risk, the models appear to perform similarly according to standard performance metrics. Decision-making should be supplemented with clinical judgement and evidence of additional risk factors. The largest source of variability, a secular trend in CVD incidence, can be accounted for and should be explored in more detail.

Keywords: Cardiovascular disease; Primary care; Risk prediction; Uncertainty analysis.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Adult
Aged
Aged, 80 and over
Cardiovascular Diseases / diagnosis*
Cardiovascular Diseases / epidemiology
Cardiovascular Diseases / mortality
Cohort Studies
Decision Making*
Decision Support Techniques*
England / epidemiology
Female
Humans
Hydroxymethylglutaryl-CoA Reductase Inhibitors / therapeutic use
Incidence
Male
Middle Aged
Precision Medicine* / methods
Precision Medicine* / statistics & numerical data
Primary Health Care / methods
Primary Health Care / standards
Primary Health Care / statistics & numerical data*
Risk Assessment / methods
Risk Factors
Uncertainty

Substances

Hydroxymethylglutaryl-CoA Reductase Inhibitors

Grants and funding

MR/N013751/1/Medical Research Council/United Kingdom