INTRODUCTION

The physical examination is an essential component of clinical diagnosis.1,2 Careful, unbiased visual observation or “inspection” can provide information critical to the accuracy of a diagnostic evaluation.24

Despite evidence of substantial inadequacy in physical examination skills among medical students, residents, and practicing physicians,57 the teaching and confidence level in such skills has diminished among trainees and faculty.2,8 The result is a decline in the use of fundamental bedside procedures, such as inspection, which are often replaced by expensive laboratory tests and radiological studies.9 These trends suggest broad opportunities to improve patient care with the implementation of better physical examination teaching methods.

One potential solution is the teaching of “visual literacy,” i.e., the ability to find meaning in imagery, which in medical parlance translates into the ability to reason physiology and pathophysiology from visual clues.10 Educators have confirmed that visual literacy can be developed,11, and limited efforts to introduce similar teaching methods into clinical training have been promising.1214 Clinical habits are formed early in training,15 and therefore we designed a novel pre-clinical course to enhance medical students’ diagnostic acumen by expanding their visual skills through the (1) close observation and guided discussion of visual art, (2) exploration of core artistic concepts, and (3) opportunity to apply these skills to the clinical assessment of patients with a broad range of disorders. We hypothesized that this experience would increase the frequency of visual observations made by participants compared to an otherwise similarly trained group of non-participants.

METHODS

Design

We used a prospective, partially randomized study design with pre- vs. post-course evaluations and mixed-methods data collection16 to assess the impact of our course on pre-clinical medical and dental students.

Sample and Randomization

In 2004–2005, all pre-clinical students, in their 1st or 2nd years, at Harvard Medical and Dental School (HMDS) were invited, via an e-mail announcement, to enroll in the course “Training the Eye: Improving the Art of Physical Diagnosis.” The study was originally designed to have 24 subjects in the intervention group and 34 students in the control group based on power analysis and sample size estimations. Fifty-six students expressed an interest in the course. Twenty-four students were randomized to participate in the course, and 32 were randomized to the control group. All 24 students in the intervention group and 19 of the 32 students in the control group agreed to participate. To establish adequate power based on pre-determined calculations, an extra 15 HMDS students were recruited to the control group via an additional e-mail solicitation, leading to a total of 34 controls. We found no statistically significant differences in demographic characteristics or pre-course visual skills examination scores between the randomized (n = 19) and recruited (n = 15) controls. (Data available from the authors.) These groups were combined for all further analyses. All control students received a $100 gift certificate, as compensation for the estimated 2.5 h required to complete the demographic survey and pre- and post-course evaluations. Institutional Review Board approval was obtained, and all study participants gave written informed consent.

Intervention

Training the Eye:Improving the Art of Physical Diagnosis is a 9-week pre-clinical course offered to students as a spring term elective. It consisted of 8 weekly 2.5-h didactic sessions with a curriculum design based on Schubert.17 The didactic sessions were divided into a 75-min observation exercise at the Boston Museum of Fine Arts (MFA) followed, after a short break, by a 1-h lecture linking visual arts concepts with physical diagnosis (Text Box 1).

Text Box 1 Examples of Observation Exercise Concepts, Corresponding Didactic Sessions and Viewed Art Works

Observation exercises were facilitated by art educators trained in Visual Thinking Strategies, a methodology that uses art discussion to develop critical thinking, communication skills, and visual literacy.10,11 In groups of up to 12, students practiced inspecting, verbally describing, interpreting, and actively building on others’ analyses of artworks that were pre-selected to strategically exercise fine arts concepts linked with medical didactics. In the associated classroom series, HMDS faculty explored the application of various artistic concepts to specific components of the physical examination. Examples of a broad array of clinical findings discussed during these sessions ranged from dermatology (palor, jaundice, erythema, vesicles vs. papules, grouped lesions) to pulmonology (stomach-breathing, use of accessory breathing muscles, pursed lips) and neurology (cranial nerve III, IV, VI, and VII palsies, and gait). At two sessions, students were guided by faculty through their observations of volunteer patients with unknown pulmonary and neuromuscular disorders. An optional ninth session was a 2.5-h opportunity to draw the human figure, from a live model, with professional art instruction. Students were asked to complete weekly assignments, including focused readings and brief visual training exercises (Text Box 2).

Text Box 2 Out of Class Assignments Completed Before Each Didactic Session

The major cost of the course was faculty time, which amounted to a total of 32 physician hours (3 h of preparation for each hour of presentation) and 40 h of museum supervision by art educators, including pre-selection of artworks. Costs were contained through the existing partnership between HMDS and MFA that covered admission fees and classroom rental.

Outcome Measures

Before and after the Training the Eye course, the intervention and control groups completed a demographic survey and 1-h written visual skills examination. Students were given 8 min to report their free-text observations and interpretations of five slides depicting three patients with a variety of clinical disorders and two artworks in different genres, none of which were shown or discussed during the course. The clinical images included physical findings associated with upper extremity DVT, Wallenberg syndrome, and relapsing polychondritis. The pre- and post-test images were different and were chosen from a larger group of images sent to a group of art experts and clinical faculty, in order to select images comparable in subject, genre, and complexity.

Analysis of students’ responses was blinded to participation in the course and pre- vs. post-course evaluation. Responses were graded for frequency of accurate and unique visual components of the image. A “point” was assigned for each “accurate” observation based on observations that were initially catalogued by a group of senior art educators (art imagery) and physicians (clinical imagery). A total mean score was created for the control and intervention groups by summing the frequency of observations for each image across all images then dividing by the number of students in that group. The pre-test was then subtracted from the post-test score to obtain a change in mean scores for each image. To obtain clinical and art imagery scores, the scores on the three clinical and two art imageries were divided by 3 and 2, respectively.

In addition, to gain insight into qualitative differences, responses to the visual skills examinations were evaluated using content analysis. An iterative process, according to Miles and Huberman, was used to identify the themes that emerged from the data.18 A group of educational researchers, faculty, and art educators: (1) read responses to the visual skills examination blinded to timing and intervention status, (2) developed codes for students’ responses to each image, and (3) came to consensus on codes and categories. Students’ responses were subsequently coded and categorized by three independent researchers (MAB, JPH, and SN) who analyzed the data independently for reliability.

STATISTICAL METHODS

Sample size estimations were conducted, assuming at least 80% completion of the visual skills examination in both groups. The main outcome was to compare the post-course minus pre-course change in scores between the intervention and control groups. With 20 students in the intervention group and 32 in the control group, a standard deviation as high as 2.0 and a 5% significance level, this study had a >94% power to detect a 2-point difference in the mean change score, using a non-parametric t-test. Non-parametric statistics were used since the data (frequency of accurate observations) were ordinal and would likely be skewed.

Demographic characteristics between the intervention and control groups were compared with Wilcoxon rank sum and Fisher’s exact tests. Statistical analyses of the change in the mean frequency of observations (post-course minus pre-course) in the control versus intervention group were performed using non-parametric t-tests and paired t-tests from generalized estimating equations for repeated measures data.19 This method is appropriate for non-normal outcome data (frequency of accurate observations) and also minimizes the bias that may result from subjects who drop out of the study. Statistical analyses were performed using SAS, Version 9.1. All tests were two-sided with an alpha of 0.05. We did not adjust alpha to account for multiple comparisons; thus, the p-values should be interpreted cautiously.

RESULTS

The intervention and control groups had similar demographic characteristics (Table 1). Their mean age was 24 years (SD, 21.4), 59% were women, and most of the students were in their first year of medical school. The mean frequency of observations on the pre-course visual skills examination did not differ among intervention and control students (Table 1 and Appendix 1). Study completion was high and consistent between groups, with 83% of intervention and 94% of controls completing the pre- and post-course visual skills examination (p = 0.22).

Table 1 Characteristics of Study Participants

After completing the course, intervention students made more overall observations on the visual skills examination compared to control students (mean 18.67, SD 0.79 vs. mean 13.29, SD 0.63, respectively, p < 0.0001) (Appendix). For intervention students, this was an increase of 5.41 (SD 0.63, p < 0.001) observations, on average, after the course compared to before the course (Table 2). The control group had no improvement in their total mean observation score (0.36, SD 0.53, p = 0.49). In other words, intervention students had a 38% increase in observations over the control students. This pattern was similar for all five pre- and post-course image pairs. Improvements in observation scores for the intervention group were seen in both the clinical and art imagery slides (Appendix).

Table 2 Change in Mean Frequency of Accurate Observations for Intervention and Control Groups

The average number of sessions attended was 8.15 out of 9 sessions (range 6–9). Students who attended 8–9 sessions did not differ on pre-course scores from those who attended fewer than 8 sessions (Fig. 1). However, students who attended 8 or 9 sessions (n = 15) had a greater increase in their mean change score, compared to those who attended 7 or fewer sessions (6.31, SD 0.81; 2.76 SD 1.2, p < 0.030).

Figure 1
figure 1

Graded impact of attendance on total mean observation score on pre- vs. post-test observations of all images.

Qualitative analysis of students’ responses by expert reviewers resulted in the emergence of five categories of responses and their clinical correlates: (1) observations, (2) interpretations supported by evidence, (3) speculative thinking to generate multiple interpretations (using words such as “suggest,” “seem,” “appears,” “judging from,” “as evidenced by”), (4) awareness of absence of observations (pertinent negatives), and (5) use of fine arts concepts (Text Box 3). Qualitative data demonstrated that students who completed the course compared to control students used more fine arts concepts linked to physical findings in their post-course descriptions of images (Table 3), including specific mention of color/shadow/light and symmetry/balance. There was no increase in use of any fine arts concepts by the control group.

Text Box 3 Themes That Emerged from the Qualitative Analysis of Visual Skills Examinations and Examples of Students’ Observations
Table 3 Mean Frequency of Fine Art Categories Used by Control and Intervention Students Pre- and Post-course*

DISCUSSION

The goal of Training the Eye: Improving the Art of Physical Diagnosis, a multidisciplinary course, was to improve the visual literacy skills of medical and dental students by developing the practice of “active looking”-unbiased inspection, and accurate reporting. The course was based on the Visual Thinking Strategies methodology10,11 and facilitated by professional art educators. It provided a focused structure for the use of fine art to build these skills. Our qualitative and quantitative findings suggest that observation skills, including those directly relevant to clinical medicine, can be successfully acquired through active, structured study of works of art and medical imagery. Participants demonstrated improved ability to make accurate observations and were more likely to incorporate fine arts concepts relevant to physical diagnosis (e.g., color and symmetry) into their written descriptions. These findings suggest a more sophisticated level of visual literacy achieved by course participants. In addition, there may be a graded effect of class attendance on post-test observation scores, suggesting that the more the students practiced the better their visual skills.

Educational activities designed with a similar intent have been shown to be beneficial, but have been more limited in their scope, duration, participation of professional art educators, and evaluation. Previous interventions have typically been brief, occurring in one to three sessions. Dolev et al. conducted a randomized controlled study to analyze the impact of the observation of art on 81 1st year medical students.12 During a one-afternoon session, students described pre-selected representational paintings, including landscapes and human figures, moderated by the museum’s curator of education. Compared to the controls, who attended a clinical tutorial session focused on history taking and physical examination skills, course participants showed improvement in accuracy of visual diagnostic skills based on higher scores in describing patient dermatology photographs. Bardes et al. found that pre-clinical and clinical students enrolled in a three-session course focused on examining and discussing painted portraits and patient faces demonstrated improved precision in observations and enhanced awareness of emotion in human facial expression in unblinded post-course assessment.20 A single structured discussion of artwork facilitated by a museum instructor at a family medicine residency retreat was enthusiastically received by residents and faculty, and stimulated dialogue, listening skills, and analytic thinking as judged by self-report. However, this program’s impact on learning was not formally studied.21

The current study expands on prior studies by introducing an extended nine-session multidisciplinary course, including didactics and coordinated observation exercises to increase the depth and duration of exposure, practice of active looking, and likelihood of content retention. This intervention promotes strategies for confronting and deciphering visual information. Students exercise visual problem-solving in art and medical imagery using the repeated practice of observing and describing. By exploring a wide range of artistic concepts, art genres, and medical conditions, course faculty challenged students to see beyond specific content areas. The attainment of visual skills that cross subject content, in the current study, is consistent with the findings of prior educational research demonstrating that core principles of visual literacy transfer to problem-solving in other fields, such as reading, writing, and mathematics.10,11,22

This study has several limitations. First, the decision to establish an adequately sized control group came at the cost of true randomization. Randomized and recruited controls did not differ demographically or on the pre-course visual skills examination; however, unmeasured variables may confound the relationship between course participation and improved scores on the visual skills examination. In addition, we did not adjust alpha for multiple comparisons, so some of the statistically significant differences between the student groups may have occurred by chance alone. Second, students were evaluated immediately after the course. Long-term follow-up was not done, precluding the assessment of sustained learning. Third, the combined effect of fine art observation and didactics was studied, limiting our ability to distinguish the relative contributions of each. Comparison of this course to an alternative physical examination course was not performed. Fourth, clinical outcomes were not measured, so there is no direct evidence that these skills can be transferred into improved patient care. Finally, the study was located at one medical school, and although other schools have successfully implemented shorter courses with similar design elements, the ability to generalize these findings to other schools is unknown. This course may be difficult to replicate at medical schools that do not have art museums or art educators available to them.

CONCLUSION

Improved inspection skills can play an instrumental role in an accurate medical evaluation. A novel course that combines systematic fine art observation exercises with lectures linking artistic concepts to physical examination skills led to increases in frequency of participants’ observations and sophistication of responses on a structured post-course evaluation. At a time when physical examination skills are waning, an interdisciplinary course developing visual literacy can expand medical students’ observational acumen and diagnostic capabilities.