PT - JOURNAL ARTICLE AU - Nada F Khan AU - Sian E Harrison AU - Peter W Rose TI - Validity of diagnostic coding within the General Practice Research Database: a systematic review AID - 10.3399/bjgp10X483562 DP - 2010 Mar 01 TA - British Journal of General Practice PG - e128--e136 VI - 60 IP - 572 4099 - http://bjgp.org/content/60/572/e128.short 4100 - http://bjgp.org/content/60/572/e128.full SO - Br J Gen Pract2010 Mar 01; 60 AB - Background The UK-based General Practice Research Database (GPRD) is a valuable source of longitudinal primary care records and is increasingly used for epidemiological research.Aim To conduct a systematic review of the literature on accuracy and completeness of diagnostic coding in the GPRD.Design of study Systematic review.Method Six electronic databases were searched using search terms relating to the GPRD, in association with terms synonymous with validity, accuracy, concordance, and recording. A positive predictive value was calculated for each diagnosis that considered a comparison with a gold standard. Studies were also considered that compared the GPRD with other databases and national statistics.Results A total of 49 papers are included in this review. Forty papers conducted validation of a clinical diagnosis in the GPRD. When assessed against a gold standard (validation using GP questionnaire, primary care medical records, or hospital correspondence), most of the diagnoses were accurately recorded in the patient electronic record. Acute conditions were not as well recorded, with positive predictive values lower than 50%. Twelve papers compared prevalence or consultation rates in the GPRD against other primary care databases or national statistics. Generally, there was good agreement between disease prevalence and consultation rates between the GPRD and other datasets; however, rates of diabetes and musculoskeletal conditions were underestimated in the GPRD.Conclusion Most of the diagnoses coded in the GPRD are well recorded. Researchers using the GPRD may want to consider how well the disease of interest is recorded before planning research, and consider how to optimise the identification of clinical events.