Introduction

Over the past years, the proportion of the European population aged 65 years and over has been increasing [1]. Aging is often associated with a growing number of chronic diseases and hence polypharmacy, which increases the risk for adverse drug events (ADEs) [2, 3], drug-related hospitalizations, and related costs [2, 4]. A recent systematic review reported an ADE prevalence up to 23 % for older adults in ambulatory care with preventability rates up to 53 % [3].

Previous studies have identified potentially inappropriate prescribing (PIP) as one of the main risk factors for ADEs in older adults [59]. PIP is defined as the prescribing of medication that could introduce a significant risk of an ADE, in particular when there is an equally or more effective alternative with lower risk available [10, 11]. PIP encompasses three main categories: over-, under-, and misprescribing (e.g., inappropriate dose or duration) [12]. Both explicit (criteria-based) and implicit (judgment-based) screening methods were developed and used to detect PIP [1316]. Studies using these tools clearly demonstrated that PIP prevalence is high and that an early detection may indeed prevent hospitalizations and improve health outcomes [10, 1719]. Research to detect and reduce PIP was initially mainly situated in hospital and nursing home settings as one of the strategies to prevent and lower the prevalence of ADEs [2022]. Over the past decade, however, PIP screening in primary care received increasing attention among health-care workers because detecting and tackling PIP at that point in the health-care system could be more (cost-)effective [10, 19].

In 2012, Opondo et al. published the first systematic review on PIP in the primary care setting [10]. Eight of the 19 included studies were carried out in Europe, describing an overall median PIP rate of 19.1 % (range 2.9–38.5 %). However, this review only included studies that reported on ‘unconditionally inappropriate medication prescriptions.’ Studies investigating drug-drug interactions, drug-disease interactions, or other types of PIP were excluded. In addition, risk factors for PIP have not yet been reviewed. This information is however needed to enable the design of well-defined targeted interventions to improve the quality of prescribing for older adults. Furthermore, since the publication of the systematic review of Opondo et al. [10], several new studies in the European ambulatory setting have emerged, which have contributed to a better characterization of the contemporary field [2326].

In order to improve quality of care for community-dwelling older adults, it is necessary to determine the magnitude, nature, and relevance of PIP in Europe. Only then, straightforward and targeted management plans for patient groups at risk can be developed to tackle the most problematic PIPs in primary care. The aim of this systematic review is (1) to synthesize observational research on PIP prevalence in community-dwelling older adults in Europe, (2) to present an overview of the risk factors mostly described in association with PIP, and (3) to summarize the drugs or drug groups most often involved in PIP.

Methods

Searches

We searched the PubMed and Thomson Reuters Web of Science™ databases for relevant manuscripts from January 1, 2000 to December 31, 2014. The search strategy contained terms and combinations related to older patients, medications, (in)appropriateness, ambulatory care, outpatient care, or community-dwelling patients, including the MeSH terms “Aged” and “Inappropriate prescribing” as shown in the Online Supplement. The final literature search was performed on December 31, 2014. The database search was completed with a manual search of the reference lists of included articles (i.e., “snowballing”). The quality of the systematic review was supported by the use of the PRISMA guidelines [27].

Eligibility criteria

Manuscripts were eligible for inclusion if they met the following criteria: (1) study design was observational, (2) study participants were community-dwelling older patients (65 years and older) in Europe, and (3) a published screening method, either implicit or explicit for PIP was used. Manuscripts could be published in English, French, Dutch, German, or Spanish. Studies that focused on specific pathologies (e.g., patients with dementia) or that focused on merely one inappropriate prescribing issue (e.g., benzodiazepine use) were excluded in addition to manuscripts that not specifically mentioned a focus on the primary care setting or the older age group.

Study selection

Duplicate manuscripts were removed after exporting search results to endnote (Thomson Reuters, Times Square, New York, NY, USA). Subsequently, two reviewers (ET and EM) independently screened the title, abstract, and full text of the retrieved manuscripts for eligibility. Each manuscript showing uncertainty regarding inclusion criteria was discussed until consensus about inclusion in the following selection round was reached.

Data collection, synthesis, and analysis

Data concerning country, study period, inclusion criteria, used data collection method, and used screening method were collected from the selected manuscripts. Additional extracted data included sample size, mean (SD), age of the screened population, mean (SD), or median (IQR) number of drugs taken (as presented in the original manuscript and therefore sometimes including OTC-drugs), and PIP prevalence (reported as the percentage of patients or prescriptions with at least one PIP). When repeated measurements were reported, we included only the most recent rate of PIP. When data could not be retrieved from the published manuscript, we contacted the corresponding author to request for additional information. If the corresponding author did not reply, a reminder e-mail was sent 1 month later. The study quality (“risk of bias”) of the included studies was evaluated by using a slightly adapted quality assessment scale from the Cochrane Collaboration group, including the following domains: study participation, data collection, screening method used, outcome measurement, and statistical analysis. All studies were judged having a low or moderate risk of bias.

We estimated an overall prevalence with its 95 % confidence interval (CI), using a random effects model with random intercepts. Prevalence data were modeled on a logit scale using sample size as a weighting factor. Likewise, a conditional prevalence with its 95 % CI was estimated for the most used PIP screening methods (Beers, STOPP, START, and PRISCUS), each method of data collection ((1) face-to-face interview, (2) prescribing, dispensing, or primary health database, (3) insurance data, (4) questionnaire, (5) medical record, or (6) combinations), and their interactions, using a (saturated) random effects model with fixed effects. For this analysis, all Beers analyses and combinations of data collection methods were collapsed into single categories. Analyses were carried out using R® (R Foundation for Statistical Computing, Vienna, Austria).

Subsequently, we selected those manuscripts that assessed risk factors associated with PIP and extracted the results as mentioned in the original manuscript. For the factor ‘polypharmacy,’ different groups used other definitions (varying from the use of ≥4 up to ≥7 drugs), which were merged for reasons of comparability. Finally, we selected those manuscripts that provided detailed information about PIP prevalence for specific drugs or drug groups and summarized the information. The ten drugs or drug groups most frequently associated with PIP were extracted as originally mentioned in the included manuscript and classified according to the Anatomical Therapeutic Chemical (ATC) classification system (3rd level) (e.g., ‘M01AB05—diclofenac’ was classified as ‘M01A—anti-inflammatory and antirheumatic products, nonsteroids’ or ‘B01AC—platelet aggregation inhibitors’ as ‘B01A—antithrombotic agents’) [28]. Due to the heterogeneity of the methodologies applied in the included studies, the overview of risk factors and drugs or drug groups most frequently associated with PIP is merely reported in a descriptive way.

Results

We identified 1375 manuscripts and screened 1138 titles and 154 abstracts for eligibility after duplicate removal. We screened the full text of 62 manuscripts and excluded 23. Thirteen manuscripts were added via manual search of the references. Our final sample comprised 52 manuscripts reporting on 82 sample screenings of PIP in community-dwelling older adults in Europe (Fig. 1). These studies were performed in 23 different European countries. The eventual list of included manuscripts is presented in Table 1, arranged by country and year of publication.

Fig. 1
figure 1

Flowchart of the literature search

Table 1 Summary of the manuscripts included in the systematic review

PIP prevalence

The overall estimated weighted PIP prevalence was 22.6 % (CI 19.5–26.7 %). PIP prevalence ranged from 0.0 to 98.0 %, with sample sizes varying from 50 to 1,019,491 patients. A large heterogeneity and inconsistency in study design (data collection methods) and outcome measures (screening methods used) was however observed.

Eighteen of the 52 included manuscripts used more than one screening method to assess PIP prevalence, resulting in 82 different sample screenings. Nineteen of the sample screenings were based on Beers 2003 criteria [11], 10 on the Beers 1997 criteria [31], 14 on STOPP criteria (2008 version) [55], 8 on START criteria (2008 version) [55], and 7 on the PRISCUS list [16]. Twenty-two sample screenings were carried out using compilations of the previously mentioned lists or used country-specific lists such as the Laroche criteria [42], the improving prescribing in the elderly tool (IPET), [53], or NORGEP-criteria [64]. Only two sample screenings used the medication appropriateness index (MAI) but accounted however for the two highest mentioned prevalence rates (84 and 98 %) [34, 80]. In only 14 of the 82 sample screenings (17 %), the complete original screening method was maintained. In all other sample screenings (n = 68), the screening method was adapted.

Accordingly, large differences were seen in the method of data collection. Of the 52 included manuscripts, 16 collected data via patient interviews (30 sample screenings). Eleven studies used insurance data (16 sample screenings) while 11 manuscripts used another specific type of database (dispensing, prescribing, or primary care database—13 sample screenings). Four studies used medical records (9 sample screenings), 4 used a questionnaire, either filled in by a general practitioner or patient (5 sample screenings), and 6 used a combination of data sources (9 sample screenings).

To estimate an overall conditional prevalence for different methods of data collection, different screening methods, and possible interactions, 59 sample screenings were included in the random effects model (parameter estimates from the model are presented in Table OS1 of the Online Supplement). Only the data collection method proved to be a significant predictor in the model, and in addition, it appears that only PIP prevalence calculated from insurance data significantly differs from the other data collection method categories (p < .05).

Factors associated with PIP

Twenty-seven of the 52 included manuscripts evaluated factors associated with overuse and misuse without taking underuse into consideration. Seventeen used multivariate logistic regression analyses, 8 used a bivariate logistic regression, and 2 used a univariate logistic regression. The studies evaluated a total of 24 different risk factors. All risk factors, evaluated in at least three studies, are presented in Table 2. Polypharmacy, advanced age, and female gender were most often taken into account; however, only polypharmacy showed a consistent positive association with PIP. Factors that were less often taken into account but showed a repeated positive association in multiple studies with PIP were presence of depression, moderate self-rated health quality, a low functional status, or a poor economic status.

Table 2 Summary of factors associated with PIP (overuse and misuse) (n = 27)

Only 2 manuscripts reviewed factors associated with underuse of medication, taking into account polypharmacy, advanced age, and female gender in the analyses. One study found a positive association between underuse of medication and polypharmacy and the other between underuse and advanced age. One study found a negative association between underuse of medication and female gender [2, 3, 24].

Drugs most frequently involved in PIP

Forty of the 52 included manuscripts mentioned detailed drug information on 53 sample screenings (Table OS2 of the online supplement).

Forty-seven of those 53 sample screenings used a screening method to detect overuse or misuse. The most frequently overused or misused drugs were (1) anxiolytics (N05B), (2) antidepressants (N06A), and (3) nonsteroidal anti-inflammatory and anti-rheumatic products (M01A). PIP dependent on an underlying diagnosis is part of misuse of drug. However, only 15 of the 47 sample screenings used a screening method that—additional to PIPs independent of underlying diagnosis—also included PIP dependent on confirmed diagnosis (e.g., different STOPP criteria, the second part of the Beers list). Therefore, no PIPs dependent on underlying diagnoses are mentioned in Table OS2. We believe however that this information is of importance, and therefore, we performed a sub-analysis to detect these PIPs and reported them separately in a subsection of Table OS2. Eleven of the 15 sample screenings taking into account underlying diagnoses used the STOPP criteria and 4, the second part of the Beers list. Most frequent were (1) the long-term use of NSAIDs (>3 months) in mild osteoarthritis, (2) the use of calcium channel blockers in chronic constipation, and (3) the use of noncardioselective β-blockers in patients with COPD.

The six remaining sample screenings used a tool to detect underuse of medication, and all made use of START-criteria. When considering the ten most prevalent items of each screening, a total of 17 different START-criteria were detected. The START criterion most often mentioned by the included sample screenings was the omission of antiplatelet therapy in diabetes mellitus with co-existing major cardiovascular risk factors. This criterion was detected in all of the six sample screenings. An overview of the other most prevalent underused drugs detected by START-criteria is given in Table OS2 of the online supplement.

Discussion

In this systematic literature review, we evaluated the prevalence of PIP in community-dwelling older adults across Europe. Our review included studies evaluating all types of PIP (overuse, misuse, and underuse) and included all studies irrespective of the fact that they presented detailed information on the specific PIP items or used a specific type of data collection. Fifty-two manuscripts were selected, describing 82 different sample screenings with an estimated weighted overall PIP prevalence of 22.6 % (CI 19.5–26.7 %).

PIP prevalence

Consistent with the reviews performed by Aparasu et al. [21] and Opondo et al. [10] (both mainly USA-based), our review found that about one in five older patients in Europe is exposed to PIP. This suggests that the possible inappropriateness of prescribing in Europe and the USA is comparable. To obtain a contemporary image of the problem, we only included manuscripts published after 2000. It seems though that the overall PIP rate has not substantially decreased since the previous reviews despite considerable attention in the scientific literature and increasing research on this topic (22 manuscripts published between 2000 and 2009 compared to 30 between 2010 and 2014). Three other reviews, by Shade et al. [18], Hill-Taylor et al. [17], and Guaraldo et al. [87], did not calculate an overall PIP prevalence. Our estimated mean falls however within the ranges observed in these studies (22.7–74 % [17], 21.4–79 % [17], and 11.5–62.5 % [87]).

The wide range in observed PIP prevalence probably relates to the wide diversity of screening methods used and the sometimes extensive adaptations to these screening methods. In 83 % of the sample screenings, the original screening method was adapted; either because it was not compatible with the applied data collection method (e.g., STOPP/START on insurance data with no clinical data detected significantly lower prevalence of 6 % compared to 30 % when used on medical record—Table OS1) or because it didn’t fully match the European setting (e.g., Beers criteria). Using such adapted screening methods seems contra-intuitive, hampers interpretation of the data, and probably leads to an underestimation of the true PIP prevalence. If limited clinical data are available, the use of the GheOP [3], S-tool [88], or the Matanovic criteria [33] may offer a better approach since they both present a comprehensive protocol that screens for overuse, misuse, and underuse; are adapted to the European setting; and do not require clinical data or confirmed diagnoses.

Moreover, as underuse, overuse, and misuse are all types of PIP [12], it is surprising that in multiple research, PIP prevalence is separately given for START and STOPP criteria [23, 69, 72]. Presenting the prevalence as the proportion of patients with at least one START or STOPP criterion might represent a more accurate image of PIP. Additionally, different data collection methods, differences in quality of prescribing across geographical regions, or the status of medication review practices in the European countries could furthermore also have contributed to the reported wide range [89, 90].

Despite the impossibility of comparing the individual sample screenings, the information presented in the sample screenings did give an interesting insight in the way data collection methods or used screening methods influence PIP prevalence (see Table OS1). Although not significant, there is a trend that the combination of data collection methods leads to higher prevalence. As well, the two screenings that used an implicit screening method—the MAI— showed the highest PIP prevalence. It will be interesting to see whether the new version of STOPP/START [91], where some implicit criteria were added, will follow this trend and lead to higher prevalence rates.

Factors associated with PIP

Many research studies showed that advanced age and polypharmacy are important and independent risk factors for the presence of PIPs and ADEs [17, 87]. It was however unclear whether other factors are also of significance and could help to further target patient groups at risk for PIPs and ADEs. The present evaluation of risk factors showed that polypharmacy, a low functional status, depression, a moderate self-rated health quality, poor economic situation, a high comorbidity score, and reduced cognition are most often positively associated with a higher risk for PIP. It was however remarkable that from those studies that evaluated the association between PIP and advanced age, only in about half a positive association was found [17, 87].

Drugs most frequently involved in PIP

In concordance with previous findings [10], we observed that the use of anxiolytics, hypnotics, and sedatives are most often involved in PIP. In addition, antidepressants (such as amitriptyline and doxepin), NSAIDs, and antithrombotic agents (such as ticlopidine and dipyridamole) are also often involved in PIP. One drug group that was not mentioned in previous systematic research as substantially associated with PIP, but highly present in this review, is the “antihistamines for systemic use” group.

In our review, we additionally focused on PIPs depending on a confirmed diagnosis and underuse because until now, only little systematic research on these types of PIP has been performed. It appears that NSAIDs are most often interfering with underlying diseases such as peptic ulcer and moderate or severe hypertension. In contrast to other research [9294], interactions with chronic heart failure, dementia, and renal impairment are not often detected in our review. This discrepancy could be explained by the fact that many of the research included in the current review used data collection methods without confirmed diagnoses as three were solely dispensing databases, five were based on patient interviews, and seven (partially) on medical records.

Considering underuse, the results of our review are in line with a previous review performed by Hill-Taylor et al., reviewing studies that used START criteria [17]. Besides the underuse of calcium/vitamin D supplements and statin therapy, we observed that antiplatelet therapy and metformin are often omitted in diabetes mellitus as well as β-blockers in chronic stable angina.

Strengths and limitations

In order to obtain as much available research as possible, more than one electronic database was used and literature was supplemented by manually checking reference lists of all included manuscripts. Compared to previously published reviews [10, 87], this systematic review gives a much more global overview of PIP, including underuse, drug-disease interactions, and drug groups most often associated with PIP. In addition, the review specifically focuses on the European setting which was not the focus of any other previous research [10, 17, 18, 21, 87]. Furthermore, this is the first review summarizing all research regarding risk factors for PIP. By limiting the time period and including all European studies, we attempted to provide a contemporary and country-specific overview of PIP, offering a point of reference for countries in which PIP is still poorly characterized. Nevertheless, several limitations remain when interpreting the findings of this systematic review. Comparing results from the included manuscripts was difficult due to use of different inclusion criteria, different screening methods, and inconsistent adaptations of these tools. Additionally, differences in health-care settings and countries may also have impacted PIP prevalence.

Conclusion

This systematic review shows that PIP prevalence in community-dwelling older adults in Europe remains high and depends partially on the screening method used. Additionally, the review gives an insight in the risk factors most commonly associated with PIP and the drug groups most commonly involved in PIP. The results can contribute to outlining cross-border and country-specific action plans to reduce PIP in primary care as they represent an important opportunity to improve the prescribing quality in the primary care setting. There is a need for randomized controlled trials evaluating interventions that resolve PIP in the most cost-effective way to improve patient-related outcomes such as quality of life and to prevent drug-related problems leading to hospitalizations. A formal and straightforward screening method can be a great support in the evaluation of the patient’s pharmacotherapy, but should always be embedded in a global patient assessment by a multidisciplinary care team. Only then, positive effects on patients’ health outcomes can be shown [95]. Moreover, the complexity of the matter makes it unlikely that a single intervention at one point in the medication management process will be sufficient to tackle PIP. It had been demonstrated that multidisciplinary interventions including education and a systematic screening method are the most efficient for resolving and preventing all types of PIP [9698].