Abstract
Background Patients attend primary care for many reasons and to achieve a range of possible outcomes. There is currently no Patient Reported Outcome Measure (PROM) designed to capture these diverse outcomes, and trials of interventions in primary care may thus fail to detect beneficial effects.
Aim This study describes the psychometric testing of the Primary Care Outcomes Questionnaire (PCOQ), which was designed to capture a broad range of outcomes relevant to primary care.
Design and setting Questionnaires were administered in primary care in South West England.
Method Patients completed the PCOQ in GP waiting rooms before a consultation, and a second questionnaire, including the PCOQ and seven comparator PROMs, after 1 week. Psychometric testing included exploratory factor analysis on the PCOQ, internal consistency, correlation coefficients between domain scores and comparator measures, and repeated measures effect sizes indicating change across 1 week.
Results In total, 602 patients completed the PCOQ at baseline, and 264 (44%) returned the follow-up questionnaire. Exploratory factor analysis suggested four dimensions underlying the PCOQ items: health and wellbeing, health knowledge and self-care, confidence in health provision, and confidence in health plan. Each dimension was internally consistent and correlated as expected with comparator PROMs, providing evidence of construct validity. Patients reporting an improvement in their main problem exhibited small to moderate improvements in relevant domain scores on the PCOQ.
Conclusion The PCOQ was acceptable, feasible, showed strong psychometric properties, and was responsive to change. It is a promising new tool for assessment of outcomes of primary care interventions from a patient perspective.
INTRODUCTION
Primary care has been evolving in recent years to meet changing population needs and public expectations.1–6 As health services globally contend with ageing populations and increasing multimorbidity,7 there have been sustained endeavours to improve service quality, costs, and outcomes in primary care. Innovations include electronic consultations,8 health coaching, and behavioural change therapies,9 and interventions addressing frequent attenders.10
Assessing the effectiveness of such interventions from a patient perspective involves Patient Reported Outcome Measures (PROMs). Many PROMs are disease specific and tailored to the symptoms and impacts on function of a particular condition.11 These are of limited value in studies where patients have various conditions. As a first-contact, comprehensive, and coordinating service,12 primary care requires a generic PROM that can be administered regardless of condition. This should be suitable for large-scale trials, based on outcomes that matter to patients and are influenced by GPs. It should also be responsive, that is, able to detect changes over time.13
A problem with many generic PROMs is that they are limited to symptoms and function. Primary care patients frequently present with other problems14 and many have long-term conditions,7,15,16 whereby improvement in function may be unrealistic. Therefore, leading generic PROMs, such as the SF-36 (Short Form-36 Health Survey)17 and EQ-5D (European Quality of Life-5 Dimensions),18 often show no change following interventions in primary care.19–21 Because of this, alternative measures have been designed specifically for primary care.21–23 For example, the Measure Yourself Medical Outcome Profile (MYMOP) allows patients to specify their problems and shows change when other PROMs do not.21 However, MYMOP is administered through interviews, making it unfeasible for trials. It also remains limited to symptoms and function. In contrast, the Patient Enablement Instrument (PEI) encompasses broader outcomes related to coping, understanding, and confidence in health, but does not capture symptoms or function. Though it has been validated for primary care,22 the PEI measures outcomes following a single consultation. For many patients, outcomes will become apparent only after longer episodes of care.24 Such outcomes may be multi-layered, capturing aspects of enablement, resilience, symptoms and function, and health perceptions. Without a generic PROM that captures such domains it is impossible to properly assess the outcome of new primary care service configurations from a patient’s perspective.
The Primary Care Outcomes Questionnaire (PCOQ) was developed in this context, through a rigorous process25 underpinned by a conceptual model of outcomes that included patient health status and ability to impact health status (Figure 1). The authors first interviewed patients and clinicians to establish outcomes that both groups sought to achieve within this framework.26 They then consulted with patients, clinicians, and academics in a Delphi study to identify outcomes most relevant to health and able to be addressed in primary care.27 The authors then developed and tested, through cognitive interviews, a PROM that addressed these outcomes.28 This included health status outcomes, internal features of health empowerment (for example, understanding and ability to self-care), external features (including having access to support and availability of good health care), and outcomes about patients’ health perceptions (for example, health concerns and confidence that they are dealing with their health conditions).
Figure 1. Conceptual models of outcomes influenced by primary care. The darker shaded boxes indicate the constructs that the PCOQ was designed to capture. PCOQ = Primary Care Outcomes Questionnaire.
How this fits in
Patients attend primary care with many types of problems and to achieve a range of possible outcomes, but there is currently no Patient Reported Outcome Measure (PROM) designed to capture these diverse outcomes. As such, trials of interventions in primary care may fail to detect beneficial effects. The Primary Care Outcomes Questionnaire (PCOQ) was developed to measure a range of outcomes commonly influenced in primary care, including health and wellbeing, health knowledge and self-care, confidence in health provision, and confidence in health plan. Testing showed it was acceptable, feasible, and had strong psychometric properties, including responsiveness to change. It is a promising new tool for assessment of outcomes of primary care interventions from a patient perspective.
The current study reports on the final stage of this process, which aimed to establish the psychometric properties of the PCOQ in primary care patients.
METHOD
Sample and procedures
Waiting room recruitment was chosen as an appropriate method for recruiting patients seeking primary health care. Adult patients were approached in waiting rooms, prior to consultations, in five practices in South West England, and those consulting a GP for themselves were invited to participate. These included a mix of urban and rural, and affluent and deprived areas, with patients from different ethnic backgrounds. Participants self-completed the PCOQ and questions about patient characteristics as they waited for a consultation. Because the baseline questionnaire needed to be short enough to be completed in waiting rooms, the authors collected comparator questionnaires at follow-up only. Patients were asked to take home a second copy of the PCOQ and comparator PROMs for completion 1 week later and return via post. A follow-up email or text reminder was sent after 5 days.
Measures
Prior to testing and item reduction, the PCOQ contained 27 items scored on a 5-point unipolar adjectival scale (from ‘No problems’ to ‘Extreme problems’). The scale wording varies according to attribute, as determined by the qualitative study.26 For example, the item ‘How much are you currently affected by pain or discomfort’ is anchored at ‘Not at all’ and ‘Extremely’. In contrast, the item ‘How much do you understand your health problems’ is anchored at ‘I understand as much as I want’ and ‘I understand very much less than I want’. (The PCOQ, and associated scoring files and instructions, are available from the University of Bristol Centre for Academic Care website: http://www.bristol.ac.uk/primaryhealthcare/resources/pcoq/.) Patient characteristics were collected at baseline, and seven comparator measures at follow-up. These were: the EQ-5D-5L,29 the Patient Activation Measure (PAM),30 a single item on likelihood of recommending a GP,31 the last appointment score,31 a single item on support for long-term conditions,31 the Illness Perception Questionnaire (IPQ),32 and a single item on change in main problem.33
Analysis
Psychometric testing of a multi-item PROM includes evaluations of feasibility, structural validity, internal consistency, construct validity, and responsiveness.13,34 In this study, feasibility was assessed by the amount and pattern of missing data, readiness of patients to complete the measure, and by response rate between baseline and follow-up. Structural validity was tested using exploratory factor analysis with principal axis factoring on fully completed questionnaires.35 This is a statistical technique used to reduce a larger number of items into a smaller number of common factors that reflect shared variance.35 The number of factors extracted was decided by a combination of Kaiser’s rule (eigenvalues >1), the scree plot,35 and by interpretability of domains. Internal consistency reliability of each domain was assessed using Cronbach’s α.25 Construct validity was explored by testing pre-specified hypotheses about the relationships between PCOQ domain scores and comparator PROMs, using Spearman correlations.25 Finally, responsiveness was tested by comparing Glass’s Δ25 for patients expected to improve versus those expected to remain unchanged, based on patient responses to the item asking about change (‘Thinking about the main problem you consulted your GP with at your recent appointment, is this problem: completely better, much better, better, slightly better, same, slightly worse, worse, n/a?’). Glass’s Δ is a repeated measures effect size, calculated as the mean change in scores (baseline to follow-up) divided by the standard deviation of scores at baseline.
RESULTS
Feasibility
The PCOQ was accepted by 718 people in the waiting room, and finished by 602 (84%). Missing data at baseline varied between 1% and 7% per item, with 2.5% missing overall. In all, 512 questionnaires (85%) had no missing data. Of the 602 patients completing the PCOQ at baseline, 264 (44%) completed the follow-up questionnaire.
Table 1 shows patient characteristics at baseline and at follow-up. Older patients (apart from the ≥75-year-olds group) had higher response rates at follow-up.
Table 1. Patient characteristics at baseline and follow-up
Structural validity
Kaiser’s rule suggested a four- or five-factor solution, and the scree plot a two-factor solution. (Further details are available from the author.) Exploratory factor analyses were thus conducted using the complete baseline data (n = 512) for solutions ranging from one to five factors. The four-factor solution with oblique (promax) rotation35 provided the most interpretable simple structure (Table 2). Three items that did not load highly on any factor were removed. Two of these items were related to health concerns, a construct also reflected in other items. The third was on medication side effects. The obliquely rotated factors were moderately correlated (0.29–0.51). The factors were labelled and described (Box 1).
Table 2. Final four-factor solution after promax rotation on baseline data (n = 512)
Box 1. Four dimensions underlying Primary Care Outcomes Questionnaire items
Factor 1: | Health and wellbeing: Measures overall health status, including symptoms, effects of symptoms on life, and health concerns (8 items). |
Factor 2: | Health knowledge and self-care: Measures health knowledge and patients’ ability to self-care and manage symptoms (4 items). |
Factor 3: | Confidence in health provision: Measures patients’ confidence in their healthcare providers and ability to access good health care (6 items). |
Factor 4: | Confidence in health plan: Measures patients’ confidence in their health plan, their adherence to this plan, and the level of support they have in managing their health-related problems (6 items). |
A score was calculated for each domain using a simple average of item scores for each domain (scored 1–5). Alternative scores incorporating factor weights were also produced, but converged with the non-weighted scores (r = 0.99) and the simpler method was thus preferred. The PCOQ score distributions are shown in Table 3. There is evidence of a ceiling effect in some domains. For example, all patients scoring in the top quintile for both health knowledge and self-care, and confidence in health provision scored at the ceiling. However, the ceiling effect for confidence in health provision was lower than the ceiling for the main comparator, last appointment score (25% as opposed to 36%). Similarly, the ceiling effect for health and wellbeing was lower than the EQ-5D (12% at the ceiling as opposed to 20% for the EQ-5D).
Table 3. PCOQ factor scores (complete baseline data)
Internal consistency
Cronbach’s α was above the standard of 0.70 for each domain34 as follows: health and wellbeing (α = 0.88), confidence in health provision (α = 0.95), health knowledge and self-care (α = 0.85), and confidence in health plan (α = 0.77).
Construct validity
Spearman rho correlations of PCOQ domain scores with comparator PROMs (Table 4) indicate convergent and discriminant construct validity:25 domain scores correlate with PROMs as expected on conceptual grounds (for example, health and wellbeing with EQ-5D, P = 0.75), and less strongly with the other PROMs that reflect different constructs (for example, health and wellbeing with PAM-13, P = 0.28).
Table 4. Actual and hypothesised correlations of factors with comparator PROMs
Responsiveness
The authors hypothesised that patients responding positively to the change in main problem item (‘Slightly better’ to ‘Much better’) would have a positive Glass’s Δ (>0.2) for health and wellbeing, health knowledge and self-care, and confidence in health plan, if they indicated problems in these domains when they attended their GP (patients were excluded if they were at the ceiling at baseline, as this indicated that they had no problem in that domain when attending). The authors also expected patients who were perfectly satisfied on the last appointment score to have a positive Glass’s Δ for the domain confidence in health provision. They only used the top score because the last appointment score had a strong positive skew, which meant that using, for example, the top two categories would have included most patients.
The effect sizes for patients expected to improve are positive, and small to moderate in magnitude, with confidence intervals excluding zero (indicating statistical significance at the 0.05 level). Effect sizes for patients expected to stay the same all approach zero (Table 5). The sample of patients reporting a deterioration on the change item was very small and was therefore not analysed.
Table 5. Effect sizes (Glass’s Δ) between PCOQ baseline and follow-up scores
DISCUSSION
Summary
Following a rigorous process of development, the authors have tested the psychometric properties of a PROM designed to capture outcomes that patients want to obtain from primary care and which doctors seek to deliver. This meets a need for an instrument that can determine the effects of alternative forms of primary care, where patients have various problems and reasons for consultation.
Strengths and limitations
The PCOQ has advantages over existing PROMs, and meets recommended standards for psychometric testing in this sample of primary care patients.34 Strengths of the study include successful data collection, a simple factor structure with good construct validity, and a prospective design that enabled tests of responsiveness.
The study also had limitations. Some patients did not accept a questionnaire, and 16% were called to an appointment before completing the baseline PCOQ. Though efforts were made to obtain a mix of urban and rural practices, with different deprivation scores, data were collected in a relatively small number of practices. The sample was representative of general practice consultation in terms of sex36 and number of long-term conditions.16 However, patients >75 years were slightly under-represented,37 perhaps because housebound patients were excluded, or more declined to take part. Response rates from ethnic groups may have been affected by the lack of interpretation facilities. Only 44% of patients returned the follow-up form, and this differed systematically by age (though not by sex, long-term conditions, or ethnicity). This response rate is comparable to similar postal questionnaires38 and was anticipated by the authors’ protocol. Baseline and follow-up scores were compared only for the final responsiveness tests. Furthermore, this responsiveness analysis was based on comparing patients who were separated into two groups based on a change score, and there is no reason to assume different numbers of non-responders across groups. However, the authors recognise that selection bias may have been introduced. As with comparable questionnaires,39,40 most items and domain scores were positively skewed. The factor solution had high uniqueness for some items. This can indicate that the item is not strongly related to others,35 but, because of the important content of these variables (for example, pain and adherence), the authors chose to include them. The Cronbach’s α of 0.95 for the confidence in health provision factor may indicate some item redundancy,25 and future validation might further reduce the items. A final limitation relates to the lack of a gold standard for measuring change. The change in main problem and last appointment score were used as proxies, but these are imperfect measures. However, this is a necessary limitation of developing a PROM with a new and unique set of constructs, and the evidence for responsiveness is akin to the concept of ‘construct responsiveness’ as described by the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) group.41,42
Comparison with existing literature
Taken together, the four domains of the PCOQ have similarities with the concept of health capability, defined as combining health agency (an individual’s ability to achieve health goals and act as agents of their own health) and health functioning (the outcome of actions to maintain or improve health).43 The four domains are scored separately, and each domain has benefits over other generic PROMs.
The health and wellbeing domain, which includes physical and/or emotional symptoms, life effects, and health concerns, was the most responsive, and had a lower ceiling effect than the EQ-5D, another measure of health status. However, the authors did not carry out a head-to-head comparison of responsiveness, and this is a future research requirement. Similarly, the responsiveness of the health knowledge and self-care domain should be compared with similar measures such as the PAM-13.30 Although it showed a stronger ceiling effect than the PAM-13, this domain includes areas often receptive to intervention, such as patient understanding of health problems, while excluding areas, captured by PAM-13, that may be less responsive, such as figuring out solutions to new health problems. The confidence in health plan domain refers to patients trusting and following their health plan, and having support to enable this. This is a broad construct that subsumes different concepts, and no existing PROM is readily comparable. Lastly, the confidence in health provision domain includes concepts similar to those normally found in Patient-Reported Experience Measures (PREMs) as opposed to PROMs, such as whether the clinician listens.31 However, unlike a PREM, which refers to perceptions of a particular consultation, the PCOQ refers to patients’ current levels of confidence in these aspects of their health providers. These are best viewed as outcomes rather than experiences. That these levels are amenable to change following a consultation is a valuable result of this study.
Implications for research and practice
This study has demonstrated that the PCOQ is valid, internally consistent, and responsive among this sample of primary care patients. Because this study represents the first validation of the PCOQ, the authors recommend usage alongside other PROMs until properties are confirmed. The PCOQ was specifically developed to test the benefits of service-level interventions in primary care, and thus fills an important gap in the literature. The alternative to the PCOQ is multiple instruments measuring four different constructs. This would not only require a longer questionnaire, but also usage of instruments not designed to measure outcomes that primary care patients seek. No existing PROM covers the PCOQ unique construct. It therefore offers a timely opportunity to enhance research and policymaking in primary care during a period of high demand for new interventions in this area.
Notes
Funding
This study was funded by the National Institute for Health Research School for Primary Care Research (NIHR SPCR), which is a partnership between the universities of Bristol, Cambridge, Keele, Manchester, Newcastle, Nottingham, Oxford, Southampton, and University College London. The views expressed are those of the authors and not necessarily those of the NHS, the NIHR, or the Department of Health. Mairead Murphy was partly funded by the Avon Primary Care Research Collaborative.
Ethical approval
Ethical approval was granted through the National Health Service Research Ethics Committee process. Approval was granted by London Central Committee, reference 14/LO/2036.
Provenance
Freely submitted; externally peer reviewed.
Competing interests
The authors have declared no competing interests.
Discuss this article
Contribute and read comments about this article: bjgp.org/letters