Impact of physician empathy on patient outcomes: a gender analysis

Background Empathy in primary care settings has been linked to improved health outcomes. However, the operationalisation of empathy differs between studies, and, to date, no study has concurrently compared affective, cognitive, and behavioural components of empathy regarding patient outcomes. Moreover, it is unclear how gender interacts with the studied dimensions. Aim To examine the relationship between several empathy dimensions and patient-reported satisfaction, consultation’s quality, and patients’ trust in their physicians, and to determine whether this relationship is moderated by a physician’s gender. Design and setting Analysis of the empathy of 61 primary care physicians in relation to 244 patient experience questionnaires in the French-speaking region of Switzerland. Method Sixty-one physicians were video-recorded with two male and two female patients. Six different empathy measures were assessed: two self-reported measures, a facial recognition test, two external observational measures, and a Synchrony of Vocal Mean Fundamental Frequencies (SVMFF), measuring vocally coded emotional arousal. After the consultation, patients indicated their satisfaction with, trust in, and quality of the consultation. Results Female physicians self-rated their empathic concern higher than their male counterparts did, whereas male physicians were more vocally synchronised (in terms of frequencies of speech) to their patients. SVMFF was the only significant predictor of all patient outcomes. Verbal empathy statements were linked to higher satisfaction when the physician was male. Conclusion Gender differences were observed more often in self-reported measures of empathy than in external measures, indicating a probable social desirability bias. SVMFF significantly predicted all patient outcomes, and could be used as a cost-effective proxy for relational quality.


INTRODUCTION
Empathy in primary care settings has been linked to improved health outcomes, such as patient satisfaction, adherence to treatment, and, by trickle effect, fewer malpractice complaints. 1 However, there is as yet no consensus on the definition and operationalisation of empathy, making cross-study comparisons challenging. 2 A comprehensive definition of empathy has been proposed by Decety and Jackson: 'Feeling what another person is feeling, knowing what another person is feeling, and having the intention to respond compassionately to another person's distress.' 3 This distinguishes affective, cognitive, and behavioural components of empathy. When it comes to the operationalisation of empathy, instruments used to measure these components can be classified into three categories: selfreported questionnaires (level of agreement with various empathy-oriented statements describing oneself), tests (performance tasks in which there is a correct empathic answer), and observational ratings (behaviours coded by external evaluators). Many studies have reported on the beneficial impact of physicians' empathy; 4,5 nevertheless, no study has concurrently compared these different measures in regard to patient outcomes. Different outcomes are expected, because selfreported empathy, tests, and observed empathy do not measure precisely the same construct of empathy. 6 Moreover, self-reported measures are more prone to biases (for example, social desirability) [7][8][9] than other measures.
Literature shows that empathy is highly influenced by gender. Stereotypically, females are considered more prosocial than males, 10,11 and female physicians self-assess their empathy higher than male physicians do. 12 Though females are expected to show more empathy, 13 it is unclear whether gender differences can be observed across different types of empathy measures. If this difference is primarily driven by gender stereotypes, it is likely that more gender differences will be observed in self-reported questionnaires than in tests or external observations of empathy. 7,14 On the contrary, if empathy is indeed more enacted by female physicians as a result of natural predisposition and/or social construct, 15 gender differences will be observed in tests and external observations of empathy as well. Finally, patients may evaluate the display of empathy differently when standing in front of a male or female physician. Indeed, patients positively evaluate female physicians behaving in line with expected gender roles (softer voice, less dominance), whereas, for their male counterparts, a larger range of behaviour is related to patient satisfaction. 16

Abstract Background
Empathy in primary care settings has been linked to improved health outcomes. However, the operationalisation of empathy differs between studies, and, to date, no study has concurrently compared affective, cognitive, and behavioural components of empathy regarding patient outcomes. Moreover, it is unclear how gender interacts with the studied dimensions.

Aim
To examine the relationship between several empathy dimensions and patient-reported satisfaction, consultation's quality, and patients' trust in their physicians, and to determine whether this relationship is moderated by a physician's gender.

Design and setting
Analysis of the empathy of 61 primary care physicians in relation to 244 patient experience questionnaires in the French-speaking region of Switzerland.

Method
Sixty-one physicians were video-recorded with two male and two female patients. Six different empathy measures were assessed: two self-reported measures, a facial recognition test, two external observational measures, and a Synchrony of Vocal Mean Fundamental Frequencies (SVMFF), measuring vocally coded emotional arousal. After the consultation, patients indicated their satisfaction with, trust in, and quality of the consultation.

Results
Female physicians self-rated their empathic concern higher than their male counterparts did, whereas male physicians were more vocally synchronised (in terms of frequencies of speech) to their patients. SVMFF was the only significant predictor of all patient outcomes. Verbal empathy statements were linked to higher satisfaction when the physician was male.
The present project strives to fill in the literature gap regarding the concurrent analysis of different empathy dimensions with a gender perspective. The specific aims of this study are to investigate gender differences in six different empathy measures, compare these empathy measures regarding their relation to patient outcomes, and determine whether physicians' gender impacts this relationship.

METHOD Study design and participants
The present study is a secondary analysis of data collected for a physician-patient communication study that received ethical approval from the regional ethic committees. 17 More than 400 GPs in the French-speaking region of Switzerland were contacted to participate in a study on patient-physician communication. In total, 61 physicians (43% female) participated in the study. This represents a convenience sample. After being enrolled in the study, they filled in online questionnaires and took a test measuring their empathy and sociodemographic information.
Each participating physician was then video-recorded with the first two female and first two male patients agreeing to participate (recruited in the waiting room during a usual day of consultation), ending with 244 video-recorded consultations. Participating patients had to be aged >18 years, fluent in French, and present no documented psychiatric disorder. At the end of the consultation, patients indicated sociodemographic characteristics, as well as their satisfaction with the consultation, quality of the consultation, and their trust in the physician.

Measures
This study compared six different measures of empathy measured through self-reported questionnaires, an online test, and external observation (Table 1).
Self-reported questionnaires of empathy. Physicians' self-reported empathy was measured with two subscales of the Interpersonal Reactivity Index, 18 known for its internal consistency. 19 In the present study, the empathic concern subscale was used (which measured affective empathy), as was the perspective-taking subscale (which measured cognitive empathy).
Empathy test. Physicians filled in a validated emotion recognition test (the Diagnostic Analysis of Nonverbal Accuracy [DANVA]) 20 online. It consisted of 24 pictures of faces displaying one of four emotions (happiness, sadness, anger, or fear). Each picture was presented for 2 seconds, and the participant indicated which emotion was displayed. The final score was the number of emotions correctly recognised.
Observational empathy. Three external observational empathy assessments were included in the present study.
Verbal empathy statements (VES) were measured with the Roter interaction analysis system (RIAS), 21 a validated coding system specifically designed for medical interactions. Certified coders classified the physician's speech into 41 categories. To measure VES, a cluster used in previous studies in the field was applied. 22 The number of statements for the categories 'Empathy', 'Shows concern or worry', 'Reassures, encourages or shows optimism', and 'Legitimise' (see Table 1 for more details) were aggregated and divided by the total number of intelligible statements.
Overall rating of physicians' empathy was coded using the Therapist Empathy Scale (TES), a nine-item scale measuring behavioural display of empathy that showed internal consistency in past research. 23 The Synchrony of Vocal Mean Fundamental Frequencies (SVMFF) has been proposed as a cost-effective alternative to the very time-consuming behavioural coding. 24 This measure is based on the assumption that two individuals tend to synchronise their behaviour in highly empathic interactions, [24][25][26][27] and thus are expected to synchronise their mean

How this fits in
The operationalisation of empathy differs between studies, and it is not known whether different empathy dimensions impact patient experience differently. This study examined the relationship between six empathy measures and patient satisfaction with, trust in, and quality of the consultation. As empathy is stereotypically viewed as a feminine quality, the gender of physicians was taken into account. This study pointed out the influence of stereotypes on self-reported empathy (with male physicians self-reporting lower empathic concern) but no gender difference in most of the behaviourally based empathy measures, and a significant link between Synchrony of Vocal Mean Fundamental Frequencies and patient outcomes. fundamental frequency (MFF), which relates to emotional arousal. 28 Patients' and physicians' MFF was automatically measured every 0.25 seconds using Praat software version 5.3.82. The correlation between the patient's and physician's MFF was then computed across minutes while controlling for physician's and patient's gender (see Gaume et al 29 and Baldwin et al 30 for model details), ending with SVMFF scores ranging from -1 = total dyssynchrony (for example, patient displaying elevation of voice pitch while physician uses low pitch) to 1 = total synchrony.

Patient outcomes
Patient outcomes were measured with three commonly used measures in healthcare studies: satisfaction, quality of consultation, and trust. These measures have been shown to relate to positive clinical outcomes such as less work impediment, 31 better adherence to treatment, 32,33 or higher quality of life, 34 and were thus used as indicators of medical outcomes. Clinical outcomes were not measured as such. Satisfaction with the consultation was measured with the reversed single item: 'I am not completely satisfied with my consultation with this doctor'. Quality of the consultation was assessed with the reversed single item: 'Certain aspects of my consultation with this doctor could have been improved'. Both items originate from a validated scale 35,36 and have shown good reliability in previous research. [37][38][39] Finally, patients indicated their trust in the physician with the average (Cronbach's α = .73) of four items (for example, 'I completely trust my doctor's decisions about which treatments are best for me').
All outcome items were rated on a scale from 1 (do not at all agree) to 5 (completely agree). Because of the important ceiling effect (between 47% and 84% of the patients giving the maximum score), the outcome measures were dichotomised into two categories as follows: best score (5) versus any other score (1)(2)(3)(4).

Covariates
Four covariates were included: patient gender, frequency of consultations with this physician, years since first consultation with this physician, and physician clinical experience (aggregation of physician's age, years since graduation, years of practice, and years since start of private practice; Cronbach's α = .97).

Statistical analysis
To investigate gender differences in the six empathy measures, separate independent sample t-tests were run comparing female and male physicians' scores for each measure. Owing to skewness (indices between -0.94 and 0.94), nonparametric tests were also run, which showed similar results and are not presented in the result section.
To compare the different empathy measures regarding their relation to patient outcomes, and to determine whether the physician's gender impacted this Seven items: for example, 'I am often quite touched by things that I see happen. External coding of empathy VES with RIAS Aggregation of the statement frequencies of four categories (physician statements only): empathy (paraphrasing, interpreting, recognising, or naming other's emotional state), shows concern or worry (indicates that a condition/event is serious, worrisome, distressing, or deserving special attention), reassurance (indicates optimism, encouragement, relief of worry, or reassurance), and legitimise (indicates that the other's actions, emotions, or thoughts are understandable and normal) Scale: number of statements per category divided by the total number of statements Score: mean across the four categories n = 243 sessions; missing values: n = 1 (0.4%)

TES
Nine items assessing affective, cognitive, and attitudinal aspects of the physician's empathy such as concern for the patient, warmth, or understanding of the patient's feelings. relationship, 18 logistic regression models were run (six empathy measures times three outcomes). Finally, these logistic regression models were replicated with an interaction term between physician's gender and the empathy measure to test for gender effect on the relation between empathy and patient outcome. Each model controlled for the four covariates. Robust estimation was applied and the nested structure of the data (four patients nested in each physician) was accounted for with standard errors (SEs) adjusted for the clustering of the data. All analyses were performed using Stata (version 13.0).

RESULTS
Male and female physicians did not significantly differ in terms of age and experience. However, they differed in the number of years since their beginning of private practice (average of 2.9 years later for females, adjusting for age), and in their working hours, with more females working part-time (Table 2). When it came to patients, males and females were similar in terms of age, education, severity of reason for consultation, and frequency of visits with this physician ( Table 2). The patients participating in the present study had a slightly lower level of education on average, but similar age and health status compared with the general practice patients of other Swiss studies. [40][41][42] T-tests analysing physician gender differences in empathy measures showed that most empathy measures (4/6) did not significantly differ between female and male physicians (Table 3). Nevertheless, female physicians self-rated their empathic concern significantly higher than male physicians did, and male physicians were significantly more vocally synchronised with their patient compared with female physicians.
As shown in Table 4, the logistic regressions testing the relationship between the empathy measures and the patient outcomes showed that SVMFF was the only empathy measure related to patient outcomes. Additional logistic regression models with the interaction term between physician's gender and empathy showed that the physician's gender did not significantly impact the relation between empathy measures and patient outcomes, except for VES on patient satisfaction. In this model, a significant interaction was observed between VES and physician's gender (χ² = 18.28, P<0.05, odds ratio [OR] = 1.33, SE = 0.18, P<0.05). This result indicates that VES was linked to lower patient satisfaction when the physician was female, but to higher satisfaction when the physician was male.

DISCUSSION Summary
This study aimed to compare six different empathy measures in relation to patient outcomes and physician gender. The study points out the influence of gender  Each empathy measure was run in independent logistic regressions; ending with a total of six models for each outcome (that is, 18 models). Every model included the following covariates: frequency of consultations with this physician, time since the first consultation with this physician, an aggregate of highly correlated indicators of physician experience stereotype on self-reported empathy, with male physicians self-reporting lower empathic concern, but not differing from female physicians in most behaviourally based empathy measures. The divergent results between emotional concern and behavioural demonstration of empathy or emotion recognition tests could suggest that self-reported measures were influenced by gender stereotypes, that is, female physicians aligning their self-reported empathic concern with the stereotypical prosocial characteristics expected for their gender. 7 Nevertheless, it is also possible that the number of opportunities to demonstrate empathy during these general practice consultations were too few, impeding the detection of any difference between female and male general physicians.
Synchrony measured with SVMFF showed a significant gender difference, with male physicians showing higher synchrony than their female counterparts. However, unlike the other empathy measures, synchrony was computed while considering both patient's and physician's behaviour. It may be the case that it was actually the patients who synchronised their vocal frequencies more when facing a male physician, and not the other way around. This could indicate that patients reacted to the status of power usually attributed to males (especially male physicians) 43 by aligning their vocal frequency to them. More studies are needed to back up this hypothesis.
Counterintuitively, whereas numerous studies have underlined the beneficial impact of empathy on patients' outcomes, 1,4,10,[44][45][46][47][48][49] this study revealed very few significant relationships between the empathy measures and patient outcomes, SVMFF being the only measure positively related to all outcomes. The setting of this study in primary care, with patients consulting for varied reasons (such as hypertension control or laboratory test feedback) may not have been the ground for an extensive demonstration of empathy. Thus, empathic display might have not been expected or acknowledged by the patients, explaining why empathy measures failed to predict outcomes. Moreover, synchrony may show different results compared with the other empathy measures, because it encompasses a broader concept than strictly empathy and could be considered as a proxy for relationship quality.
A higher count of VES was related to lower likelihood of patient satisfaction within consultations led by female physicians. This indicates that male physicians might be better rewarded than females when expressing their empathy. On the other hand, it is more surprising to observe that female physicians' verbal empathy is related to less patient satisfaction. As other studies in the field suggest, 50 female physicians' verbal display of empathy might actually trigger more patient empowerment and enable them to feel more confident and dare to express more negative feedback, but more studies are needed to assess this.

Strengths and limitations
The main strength of this study was to compare six measures of empathy covering the affective, cognitive, and behavioural components of empathy with outcomes. A variety of empathy measures was used (self-reported assessments, emotion recognition test, as well as external coding and a novel cost-effective proxy measure of empathy). However, VES and SVMFF encompass broader aspects of patientphysician communication than strictly empathy. In any case, the patient outcomes measured in the present study showed a typical high-ceiling effect, which lowered the variance that could be explained by the statistical models. Furthermore, the context of general practice might carry fewer or subtler opportunities for empathic display as compared with other settings such as psychiatry or oncology. [51][52][53][54] Moreover, the sample of voluntary physicians, who tend to be interested in medical communication, have high interpersonal skills. This may have lowered the chances of revealing more important gender differences. Thus, the results of the present study may not be generalisable to the whole GP population.

Comparison with existing literature
This study's results showed that female physicians self-reported higher emotional concern than their male counterparts did, in line with existing literature regarding medical students 12,55,56 and physicians. 12 Similar results were reported in nonmedical settings in youth 57 and adults. 58 Synchrony measured with SVMFF showed a significant gender difference, with male physicians showing higher synchrony than their female counterparts. Unfortunately, research on synchrony of voice frequency in clinical settings is rare, and studies focusing on other types of synchrony (facial mimicry, position, gesture, or lexical field alignment) report genderaggregated data 24,59 or use same-gender dyads, 26,60,61 impeding any conclusions regarding gender-dyad differences.
SVMFF significantly predicted all patient outcomes. This result corroborates precedent studies showing that synchrony 'embodies the patients' self-reported quality of the relationship' 26 and is positively related to better medical outcomes, 62 therapeutic alliance, 63 and interpersonal trust. 64 VES was only related to higher satisfaction within male-conducted consultations, in line with other studies reporting that male physicians seem to be better rewarded than females for their use of a patient-centred communication style, 65,66 and that female physicians with better emotional recognition skills receive more ambivalent patient reactions than their male counterparts. 50

Implications for research
In the present study, self-reported empathy displayed more gender differences in comparison with other coded empathy.
This result challenges the common notion that female physicians are more empathic than their male counterparts, and asks questions about the influence of gender stereotypes and gender expectations on empathy. Nevertheless, opportunities to demonstrate empathy may have been too rare in the present study's setting, and more research should be conducted in fields where empathy is more central, such as in oncology, palliative care, or psychiatry. SVMFF significantly predicted patient outcomes, and could be used as a cost-effective proxy for relational quality in future studies. As SVMFF showed a significant gender difference, more gender studies of synchrony should be conducted in clinical settings to understand genderdyad dynamics of synchrony.

Funding
The study was funded by the 'medicine and gender' grant, an institutional university funding of the faculty of Biology and Medicine, University of Lausanne.

Ethical approval
The data collection protocol was approved by the Human Research Ethics Committees of Vaud (protocol number: 35/2013) and Geneva (protocol number: 13-064).

Provenance
Freely submitted; externally peer reviewed.