Strengths and limitations
The key strength of this study is its unprecedented scale; >58 million people were included, 96% of the population in England. In contrast with many studies that use EHR data, in this study it was possible to compare long-COVID diagnostic codes between practices that use different software systems. A striking disparity was found: this has important implications for understanding whether clinicians are using the codes appropriately. A key weakness of this data for estimating true prevalence of long COVID in primary care, and factors associated with the condition, is that it relies on clinicians formally entering a diagnostic or referral code into the patient’s record: this is a limitation of all EHR research for all clinical conditions and activity, however, the emergence of a new diagnosis and the recent launch of a new set of diagnostic codes may present challenges in this regard. As a result of these current limitations, this study did not aim to estimate the prevalence of long COVID, or aim to make causal inferences about the observed variation.
Comparison with existing literature
To the authors’ knowledge, there are no other studies on prevalence of long COVID using clinicians’ diagnoses or EHRs data. There are numerous studies using self-reported data from patients on the prevalence of continued symptoms following COVID-19, with estimates varying between 4.5% and 89%, largely because of highly variable case definitions;5 individual symptoms characterising long COVID have been reported as fatigue, headache, dyspnoea, and anosmia.6 The Office for National Statistics COVID Infection Survey estimates prevalence of self-diagnosed long COVID at 13.7%.7 Separately, numerous cohort studies have reported an increased risk of serious cardiovascular and metabolic outcomes following hospital admission with COVID-19,8,9 and there are various prospective studies such as the Post-hospitalisation COVID-19 study following-up patients for the year following their hospital admission.10 Other studies have examined variation in clinical coding, with some finding that ‘poor’ coding can lead to altered incidence estimates,11 whereas others implicate the design of clinical software systems in influencing variation.12–14
Implications for research and practice
The prevalence of long-COVID codes in primary care that are reported in this study is extremely low when compared with current survey data on long-COVID prevalence.15,16 This conflict may be attributable to a range of different possible causes related to information bias including: patients not yet presenting to primary care with long COVID; different clinicians and patients holding different diagnostic thresholds or criteria for long COVID; and issues around coding activity including clinicians not yet knowing about the long-COVID diagnostic codes, the design and text of the long-COVID diagnostic codes, and the design of EHR systems in which the codes can be selected for entry onto a patient record.
The large variation in the apparent rate of long COVID between different geographic regions, practices, and EHR systems strongly suggests that clinicians’ coding practice is inconsistent at present. This suggests variation in awareness of the new diagnostic codes that were only launched in November 2020, and only available in EMIS at the end of January 2021. In addition, the codes for long COVID and associated synonyms do not currently contain the term ‘long COVID’: this was an active choice by NHS Digital who manage SNOMEDCT UK codes.1 The October 2020 NICE consultation on management of the long-term effects of COVID-19 does mention the term ‘long COVID’, although the term was not incorporated into the clinical definitions that were translated into diagnostic codes by NHS Digital.1 These decisions were carefully thought through at the time they were made; however, as a result of broader contextual shifts in language over time there is now a clear mismatch between formal clinical terminology and popular parlance among clinicians and patients. The view of the authors of this study is that those managing SNOMED-CT terminology for England should either update the long-COVID codes to include the phrase ‘long COVID’, ideally in advance of the upcoming new SNOMED-CT international release; or energetically disseminate their preferred new phrasing to all frontline clinicians, to ensure more appropriate use of these codes. Similarly NICE and other authoritative bodies giving guidance on long COVID should energetically communicate to clinicians the importance of correctly coding long COVID in patient records. It is a high national priority to estimate the prevalence of long COVID, identify its causes and consequences, and plan services appropriately.
The variation in the rate of diagnostic code usage between users of different EHR software is also striking. This difference could plausibly be responsible for some of the other variation described. For example, as noted in the results, some regions have a high percentage of coverage from one software provider. After speaking with clinicians and both software vendors, the reasons for the difference remain unclear, but are likely attributable to differences in user interface, which has previously been shown to influence clinicians’ treatment choices.13,14 This should be addressed by interviewing GPs about their experiences with diagnosing and treating people with long COVID in each system.
Despite these issues around correct recording of clinicians’ diagnoses, there also remains a strong possibility that clinicians are not currently diagnosing their patients as having long COVID. This may be because patients are not presenting with long COVID to services, for a range of reasons during a pandemic; or their clinicians are not diagnosing them with long COVID when they are seen. The view of the authors is that this can only be resolved by conducting prospective surveys with clinicians themselves, evaluating how many patients they have seen with a condition they would understand to be diagnosable as long COVID, alongside qualitative research on the topic.
The issues with recording of long COVID described here also have implications for future research. It is likely that recording will improve over time, as disease definitions are improved, guidelines are iterated on, and clinicians become more aware of the condition. It is likely also worth considering additional approaches to identifying long COVID in routine medical data. This might include identifying and measuring broad groups of symptoms that are associated with long COVID.17
If it is accepted that the different rates of long COVID usage in each subgroup reflects the true comparative risk for each demographic then there are two key findings. First, the lower rate in older patients, despite their higher prevalence of severe acute COVID-19 outcomes,18 which may be affected by the competing risk of death in patients with COVID-19. Second, the higher rate of long COVID in females, despite the higher prevalence of severe acute COVID outcomes in males,18 which may be explained in part by differences in routine consultation rates between males and females.19
In conclusion, current recording of long COVID in primary care is very low, and variable between practices. This may reflect patients not presenting; clinicians and patients holding different diagnostic thresholds; or challenges with the design and communication of diagnostic codes. This analysis will be updated regularly with extended follow-up time.