Abstract
Background Natriuretic peptide (NP) testing is recommended for patients presenting to primary care with symptoms of chronic heart failure (HF) to prioritise referral for diagnosis.
Aim To report NP test performance at European Society of Cardiology (ESC) and National Institute for Health and Care Excellence (NICE) guideline referral thresholds.
Design and setting Diagnostic accuracy study using linked primary and secondary care data (2004 to 2018).
Method The sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of NP testing for HF diagnosis was assessed.
Results In total, 229 580 patients had an NP test and 21 102 (9.2%) were diagnosed with HF within 6 months. The ESC NT-proBNP threshold ≥125 pg/mL had a sensitivity of 94.6% (95% confidence interval [CI] = 94.2 to 95.0) and specificity of 50.0% (95% CI = 49.7 to 50.3), compared with sensitivity of 81.7% (95% CI = 81.0 to 82.3) and specificity of 80.3% (95% CI = 80.0 to 80.5) for the NICE NT-proBNP ≥400 pg/mL threshold. PPVs for an NT-proBNP test were 16.4% (95% CI = 16.1 to 16.6) and 30.0% (95% CI = 29.6 to 30.5) for ESC and NICE thresholds, respectively. For both guidelines, nearly all patients with an NT-proBNP level below the threshold did not have HF (NPV: ESC 98.9%, 95% CI = 98.8 to 99.0 and NICE 97.7%, 95% CI = 97.6 to 97.8).
Conclusion At the higher NICE chronic HF guideline NP thresholds, one in five cases are initially missed in primary care but the lower ESC thresholds require more diagnostic assessments. NP is a reliable ‘rule-out’ test at both cut-points. The optimal NP threshold will depend on the priorities and capacity of the healthcare system.
INTRODUCTION
Heart failure (HF) affects around 1 million people in the UK and accounts for 3%–4% of NHS expenditure.1,2 Effective management improves quality and length of life for patients but establishing a diagnosis can be challenging because of the overlap of symptoms (such as breathlessness, exhaustion, or ankle swelling) with other conditions.3–5 Natriuretic peptides (NPs) are released by the myocardium in response to pressure or fluid overload and act on both the vasculature to relax smooth muscle and on the kidney to induce diuresis.6 NP levels are raised in people with HF and testing can aid diagnostic decision making. Guidelines recommend referral for cardiac imaging and specialist assessment depending on the NP level.7–9 Two types of NP test — B-type NP (BNP) and NT-proBNP — are currently available in clinical practice. BNP is biologically active and has a shorter half- life making it less stable over time.6
The European Society of Cardiology (ESC) and National Institute for Health and Care Excellence (NICE) recommend NP testing in both acute and chronic HF. The threshold values to rule out an acute HF diagnosis in an emergency department setting are consistent across guidelines (BNP <100 pg/mL and NT-proBNP <300 pg/mL) and supported by a large body of evidence.7,10 However, ESC and NICE guideline thresholds differ by more than threefold in chronic HF. These patients present to primary care with gradual onset of symptoms and NP testing is useful to inform the referral process. The ESC recommend referral for echocardiography and specialist assessment of BNP ≥35 pg/ mL or NT-proBNP ≥125 pg/ mL whereas NICE recommend a higher cut-off level of BNP ≥100 pg/mL or NT-proBNP ≥400 pg/mL.7,8 There is limited evidence on the optimal diagnostic NP threshold for primary care and current guidelines draw on small diagnostic accuracy studies, screening substudies, and consensus to make recommendations.11–18
The aim of this study was to report the real-world diagnostic performance of NP testing for chronic HF diagnosis at ESC and NICE referral thresholds.
METHOD
A diagnostic accuracy study in a population- based cohort was conducted using linked electronic healthcare records. Primary care data from the Clinical Practice Research Datalink (CPRD) GOLD and Aurum databases were linked to inpatient Hospital Episodes Statistics (HES) admitted patient data and Index of Multiple Deprivation (IMD) socioeconomic data in England. Trends in NP testing have been published previously for this cohort.9 The combined CPRD databases contain data from over 1400 general practices in the UK, or 15.7% of the whole general practice population, and have been shown to be representative of the general population.19
International guidelines recommend natriuretic peptide (NP) testing in primary care to prioritise referral for heart failure (HF) diagnostic assessment. European Society of Cardiology (ESC) and National Institute for Health and Care Excellence (NICE) guidelines differ significantly in their recommended NP referral threshold. The current study found at the lower ESC threshold fewer HF diagnoses were missed but more referrals from primary care would be required. Healthcare systems need to balance the risk of a missed or delayed diagnosis for individual patients with capacity in diagnostic services. An NP level below both the ESC and NICE thresholds was reliable in ruling out HF. |
How this fits in
Patients aged ≥45 years in the two CPRD databases with an NP test result in their primary care record between 1 January 2004 and 31 December 2018 were included. Patients entered the cohort on the date of their NP test and exited the cohort on the date of their HF diagnosis or 6 months after their NP test date if they were not diagnosed with HF. Patients were only included if their primary care records were deemed acceptable for research purposes (a CPRD quality measure), eligible for linkage, and had been registered at a practice for ≥12 months. NP tests and HF diagnosis codes were identified in CPRD using clinical coding lists (see Supplementary Tables S1 and S2) derived from the NHS terminology and classifications browser and the Quality and Outcomes Framework guidance. Patients with a previous HF diagnosis were excluded.
NP testing (index test)
NP level was analysed both as a continuous and a categorical variable using ESC (≥35 pg/mL and ≥125 pg/mL for BNP and NT-proBNP, respectively) and NICE (≥100 pg/mL and ≥400 pg/mL for BNP and NT-proBNP, respectively) referral thresholds for chronic HF diagnosis. NP test performance at the NICE thresholds for rapid referral (to be seen by a specialist within 2 weeks) of >400 pg/mL and >2000 pg/mL for BNP and NT-proBNP, respectively, were also explored.
HF diagnosis (reference standard)
The primary outcome of HF diagnosis within 6 months of the most recent NP test was obtained from either a diagnostic code entered in the CPRD database or from HES Admitted Patient Care data based on hospital admission because of HF or echocardiography findings consistent with HF. HF diagnoses from primary care were also validated through data linkage with HES using International Classification of Diseases, 10th revision codes.
Statistical analysis
Sociodemographic variables were summarised with median and interquartile range (IQR) for continuous variables, and frequencies and percentages for categorical variables. These were estimated overall, among participants with a NP test, and among those with and without HF.
Diagnostic accuracy for HF diagnosis was assessed by calculating sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV), likelihood ratio, and diagnostic odds ratio using the ‘epitools’ package.20 Exact confidence intervals (CIs) for proportions were calculated using the binomial distribution. CIs for ratios were calculated using the Wald’s normal approximation. Receiver operating characteristic (ROC) curves were plotted for both tests to allow comparison of overall test performance between test types. The area under the ROC curve (AUC) was estimated using the ‘pROC’ package.21 All analyses were done in R (version 4.0.0) and used a 0.05 threshold to define statistical significance.
RESULTS
A total of 229 580 patients had an NP test recorded in their primary care record with a median age of 61.2 years (IQR 52.0– 69.1), more females (57.5%, n = 132 002), and the majority were of White ethnicity (91.4%, n = 209 941) (Table 1). The median body mass index was 28.6 kg/m2 (IQR 25.1– 32.9), with a low-to-medium level of deprivation. Many participants were current or ex-smokers (66.7%, n = 117 521 and n = 35 676), had a diagnosis of hypertension (59.1%, n = 135 788), diabetes (24.7%, n = 56 782), or other cardiovascular disease (13.9%, n = 32 024).
Sociodemographic characteristics and medical history of primary care patients aged ≥45 years overall, and with a BNP or NT-pro BNP test
NP testing
Two-thirds of all study participants had an NT-proBNP test (n = 155 347) and the other third had a BNP test (n = 74 233). BNP was the most common test until 2007, but since 2008 NT-proBNP has been more widely used (see Supplementary Table S3). There were no major differences between patients who underwent BNP versus NT-proBNP testing (Table 1), except those with a BNP test appeared to be more likely to come from a less deprived background.
HF diagnosis
Overall, 21 102 (9.2%) participants were diagnosed with HF within 6 months (Figure 1). Participants with HF were older than those without HF (median age 67 versus 61 years) and were more likely to be of White ethnicity (94.9%), current/ex-smokers (70.0%), with a history of hypertension (68.0%), diabetes (28.1%), or other cardiovascular diseases (21.0%) (Table 2).
Flowchart of patient inclusion. BNP = B-type NP. CPRD = Clinical Practice Research Datalink. HF = heart failure. NP = natriuretic peptide.
Sociodemographic characteristics and medical history of primary care patients aged ≥45 years with and without HF
NP testing was within 6 weeks of a confirmed HF diagnosis in 13 005 participants (61.6%), with 7672 participants (36.4%) diagnosed within 2 weeks overall. Participants with NT-proBNP >2000 pg/mL were more likely to be diagnosed within 2 weeks (44.9% compared with 35.4%, 24.8%, and 17.0% among those with 400–2000 pg/mL, 125–400 pg/mL, and <125 pg/ mL, respectively). This was similar for BNP >400 pg/mL, compared with lower levels of BNP (see Supplementary Table S4).
Diagnostic test accuracy
The prevalence of HF in the populations tested with BNP and NT-proBNP was 8.8% and 9.4%, respectively. The median level of BNP was 336 pg/mL (IQR 163– 845) and NT-proBNP was 1358 pg/mL (IQR 534– 3230) among those diagnosed with HF (Table 2). The individual diagnostic test accuracy parameters for a HF diagnosis at both BNP and NT-proBNP referral thresholds are shown in Table 3.
Diagnostic test accuracy parameters for the diagnosis of HF using BNP and NT-proBNP level NICE and ESC referral thresholdsa
The lower ESC referral thresholds of ≥35 pg/mL for BNP and ≥125 pg/ mL for NT-proBNP had sensitivities of 96.9% (95% CI = 96.4 to 97.3) and 94.6% (95% CI = 94.2 to 95.0), and specificities of 35.3% (95% CI = 35.0 to 35.7) and 50.0% (95% CI = 49.7 to 50.3), respectively (Table 3). PPV for NP testing was 12.6% (95% CI = 12.3 to 12.9) for BNP and 16.4% (95% CI = 16.1 to 16.6) for NT-proBNP, and NPVs of 99.2% (95% CI = 99.0 to 99.3) and 98.9% (95% CI = 98.8 to 99.0), were found for BNP and NT-proBNP, respectively.
At the higher NICE referral threshold of ≥100 pg/ml for BNP and ≥400 pg/ml for NT-proBNP, sensitivity was 88.5% (95% CI = 87.7 to 89.3) and 81.7% (95% CI = 81.0 to 82.3), specificity was 69.1% (95% CI = 68.7 to 69.4) and 80.3% (95% CI = 80.0 to 80.5), PPV was 21.6% (95% CI = 21.1 to 22.1) and 30.0% (95% CI = 29.6 to 30.5), and NPVs were 98.4% (95% CI = 98.3 to 98.5) and 97.7% (95% CI = 97.6 to 97.8), respectively (Table 3).
In summary, there was a clinically meaningful difference in all the key test performance measures between the ESC and NICE guideline recommended cut- off levels. As expected at the higher NICE threshold, specificity was much better (80.3% versus 50.0%) but at the expense of a lower sensitivity (81.7% versus 94.6%) for NT-proBNP (Table 3). Figure 2 shows the sensitivity and specificity for possible values of BNP and NT-proBNP. There was a small but statistically significant improvement in overall performance using NT-proBNP compared with BNP (AUC 0.874, 95% CI = 0.871 to 0.877) for NT-proBNP versus 0.855 (95% CI = 0.851 to 0.860) for BNP.
ROC curve for HF diagnosis for BNP and NT-proBNP tests at ESC and NICE referral thresholds. AUC = area under the ROC curve. BNP = B-type NP. ESC = European Society of Cardiology. HF = heart failure. NICE = National Institute for Health and Care Excellence. NP = natriuretic peptide. ROC = receiver operating characteristic.
DISCUSSION
Summary
In this large, real-world diagnostic accuracy study in 229 580 primary care patients who underwent NP testing, the ESC chronic HF referral threshold of NT-proBNP ≥125 pg/mL had a high sensitivity (94.6%, 95% CI = 94.2 to 95.0), but low specificity (50.0%, 95% CI = 49.7 to 50.3) compared with a moderate sensitivity (81.7%, 95% CI = 81.0 to 82.3) and moderate specificity (80.3%, 95% CI = 80.0 to 80.5) for the NICE NT-proBNP ≥400 pg/mL referral threshold. At the current NICE threshold, one in five cases of HF are initially missed by NP testing; however, for each additional new HF diagnosis at the lower ESC cut-off, around 20 extra patients require diagnostic assessment. Both guidelines accurately ruled out HF (NPVs for ESC of 98.9%, 95% CI = 98.8 to 99.0 and NICE 97.7%, 95% CI = 97.6 to 97.8). Better diagnostic performance was shown for NT-proBNP than BNP.
Strengths and limitations
A strength of using real-world patient data to evaluate diagnostic test accuracy is a larger sample size than would routinely be used in cross-sectional studies, and this is especially important for diseases of lower prevalence.22 The largest prospective diagnostic accuracy studies of NP testing to identify people with HF in primary care had <750 participants.7,8,13 The current study included 229 580 patients with an NP test over a 14-year period.
Routinely collected data do have limitations as the reference standard is not assessed for research purposes or measured in everyone. Instead, a comprehensive clinical coding list was used to determine the presence or absence of a HF diagnosis.23 Assuming GPs in the UK follow NICE guidelines then those with a primary care NP test above the lower ESC cut-off but below NICE would not be referred for diagnostic assessment, but greater sensitivity was observed at the lower ESC level, suggesting HF diagnosis was made through a variety of routes. However, some people who did have HF may have been missed and, if so, false negatives may be underestimated and the sensitivities overestimated.
Baseline NP levels may have been affected by population characteristics such as age, HF symptoms, medications (for example, diuretics and renin–angiotensin system antagonists), and other medical conditions including chronic kidney disease and atrial fibrillation.24 These covariates were not stratified for as the aim was to explore test accuracy for guideline referral thresholds, and both ESC and NICE do not currently have differing thresholds based on other factors. The current study is also most relevant to healthcare systems with a strong primary care base where NP testing is required before referral for specialist diagnostic assessment. The way that HF is defined has changed over time, with a greater emphasis in the recent ESC 2021 guidelines on the importance of elevated NP levels when making a diagnosis.7 This may have resulted in a change to the classification of some people in the early years of the study if these contemporary HF definitions were used. There were also insufficient data available on left ventricular ejection fraction to report these results by HF classification (HF with reduced versus preserved ejection fraction).
Comparison with existing literature
The current results using primary care data are consistent with the findings of a large systematic review of NP testing for chronic HF across ambulatory settings, published in 2018.18 The review included 39 diagnostic accuracy studies of NP testing, five of which were conducted in primary care and reported a pooled sensitivity of 95% (95% CI = 90 to 98) for the BNP ≥100 pg/mL referral threshold, very similar to the current study, but with no difference between BNP and NT-proBNP test performance overall. For primary care studies that used NT-proBNP testing, sensitivity was 99% (95% CI = 57 to 100) and specificity was 60% (95% CI = 44 to 74) but these were limited to only two small studies. The current study included >20 000 patients with a new HF diagnosis and is therefore likely to be more representative of NP test performance in NHS primary care.
The current study showed significantly lower sensitivities for BNP and NT-proBNP at the NICE referral thresholds than those reported for patients with acute HF in a large systematic review and meta- analysis of diagnostic accuracy studies in the acute care setting.25 However, the NT-proBNP threshold was 300 pg/mL in this meta- analysis, so a higher sensitivity is expected, and the diagnostic accuracy of a test is also greater in settings where patients have more severe symptoms. Furthermore, the review only included original studies from patients admitted to hospital with acute HF and the prevalence of HF was much greater (ranging from 23% to 82%) than in the current primary care- based study (∼9%).
Using real-world patient data to evaluate diagnostic test accuracy is an established methodology that has been used in cancer diagnostics in primary care. CPRD data have been used to examine the diagnostic performance of the biomarker cancer antigen 125 that is frequently measured in females who present to their primary care physician with symptoms that could be caused by ovarian cancer and these data were used to estimate probability in females of different ages.26 Routinely collected data have also been used to evaluate the test accuracy of faecal calprotectin at different thresholds in the diagnosis of inflammatory bowel disease.27
Implications for research and practice
The current study suggests that at the current NICE threshold, one in five cases are initially missed following NP testing in primary care and diagnosed through an alternative route within 6 months. However, for every additional case of HF detected at the lower ESC threshold, around 20 extra patients would need to be referred for echocardiography and specialist assessment. The optimal NP threshold for referral for HF diagnosis will therefore depend partly on capacity within the healthcare setting. Patients with very high NP levels have worse outcomes and require timely referral for diagnosis and treatment initiation, and this could be hampered if diagnostic services are overwhelmed by a lower NP referral threshold.28 In the current study, an NP value below either the ESC or NICE thresholds effectively ruled out HF and this can be extremely valuable to aid diagnostic decision making in primary care.29 This trade-off between ensuring capacity in out-patient clinics is not breached versus missing a diagnosis of HF is challenging for both the referring primary care clinician and the HF specialist. Where the patient only has mild symptoms and an NP level below the threshold, monitoring in primary care may be necessary.
The current study also found that people with risk factors (ex-smoker, hypertension, high cholesterol) and established cardiovascular disease were more likely to receive an NP test, suggesting clinicians are targeting testing at those felt to be at increased risk of HF.30 In practice, NP levels are dynamic and will vary in relation to a range of factors, including weight status, renal function, age, and atrial fibrillation. Although current ESC and NICE guidelines have maintained the current NP thresholds used in this analysis, a recent position paper from the ESC on the use of NP testing suggested some of these variables, particularly obesity, should be considered when interpreting results.7,8,31 Further research could help refine referral thresholds for these key subgroups to help inform future guidance. In the meantime, clinicians in primary care should be aware that if higher NP thresholds for specialist assessment are recommended in national guidelines, patients with results that fall just below this level may still require referral for diagnostic assessment. Repeat testing of NP results over time in patients who are symptomatic may also help improve the sensitivity of testing and help minimise delays in HF diagnosis.
Acknowledgments
This work uses data provided by patients and collected by the NHS as part of their care and support and would not have been possible without access to this data. The NIHR recognises and values the role of patient data, securely accessed and stored, both in underpinning and leading to improvements in research and care.
Notes
Funding
The study was funded by the National Institute for Health and Care Research (NIHR) Collaboration for Leadership in Applied Health Research and Care (CLAHRC) Oxford. The funders did not have any role in the design of the study, analysis and interpretation of the data, or writing of the results for publication. Clare J Taylor is an NIHR Academic Clinical Lecturer. Andrea K Roalfe, Sarah L Lay-Flurrie, and José M Ordóñez-Mena are supported by the NIHR Oxford Biomedical Research Centre (BRC) and NIHR Oxford and Thames Valley Applied Research Collaboration (ARC OTV). FD Richard Hobbs acknowledges support from NIHR ARC OTV and the NIHR Oxford BRC. This work was also supported by the NIHR Community Healthcare Medtech and In Vitro Diagnostics Cooperative (MIC) at Oxford Health NHS Foundation Trust as part of the long-term condition theme led by Clare J Taylor and FD Richard Hobbs. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health and Social Care.
Ethical approval
The protocol was approved by the Independent Scientific Advisory Committee (ISAC) of the Medicines and Healthcare products Regulatory Agency (MHRA) (ISAC protocol number: 19_136; available from the authors on request). Ethics approval for observational research using the Clinical Practice Research Datalink (CPRD) with approval from ISAC was granted by a National Research Ethics Service committee (Trent MultiResearch Ethics Committee, reference number: 05/MRE04/87).
Data
Data for this study were obtained on licence from CPRD and cannot be shared. Equivalent data can be obtained directly from CPRD with relevant ISAC approval. Hospital Episode Statistics and Office for National Statistics data are subject to copyright © (2018) and can be re-used with the permission of The Health and Social Care Information Centre. All rights reserved.
Provenance
Freely submitted; externally peer reviewed.
Competing interests
Sarah L Lay-Flurrie reports grants from the NIHR Biomedical Research Centre and NIHR ARC OTV during the conduct of the study. Clare J Taylor reports personal fees and non-financial support from Roche outside the submitted work. The other authors have declared no competing interests.
Discuss this article
Contribute and read comments about this article: bjgp.org/letters
- Received May 30, 2022.
- Revision requested July 8, 2022.
- Accepted August 27, 2022.
- © The Authors
This article is Open Access: CC BY 4.0 licence (http://creativecommons.org/licences/by/4.0/).