Surgeons face multiple challenges while performing a laparoscopic procedure [13]. To achieve a designed surgical goal using long-shafted surgical instruments, surgeons need to coordinate visual input with hand motions in order to control these surgical tools. The eye–hand coordination in laparoscopic surgery is more complicated than in performing an open procedure. Consequently, surgeons reported higher levels of physical and mental stresses at the end of performing a laparoscopic than an open procedure [4, 5].

Various methods have been used for assessing the mental workload of surgeons to develop strategies for managing intraoperative stresses. Some used subjective survey forms, such as NASA-TLX, to collect feedback from surgeons at the end of an operation [69]. Surgeons can give a general description of the stresses they experienced during the procedure; however, it is difficult to recall the causes of the stresses. This poses difficulties when providing constructive feedback for future improvement. In addition, surgeons’ answers to the survey questions can be affected by their working memory and ability in judging the task difficulty [7].

Other investigators used an information-processing model to examine surgeons’ mental workload in laparoscopic surgery [10, 11]. Surgeons, as human beings, have limited capability to process multiple sources of information. When the primary surgical task requires a high level of attention, the surgeon will be left with a small amount of mental resources with which to manage a secondary task presented to his/her peripheral vision. As a result, the deterioration of the performance of the secondary task often correlates to the increasing difficulty of the primary task [10, 11]. This gives us a chance to label the stressful surgical steps and develop measures to deal with difficulties. However, the secondary task can also be recognized as a distraction to the primary task, and it is irrational for us to use it in the real operating room (OR) due to concerns about patient safety [7].

Given the drawbacks of the above two methods, several authors have turned toward using the physical response of surgeons for assessing their mental workload [5]. While surgeons are performing a surgical task, sensors placed on the chest, forehead, and ear can measure heart rate as well as respiratory and skin conductance. These physiological responses, controlled by the sympathetic nervous system, are associated with the change in the mental stress experienced by the operator [7].

To further explore the method of using physical signs for mental assessment of surgeons, we decided to include eye motions as a psychophysiological sign. Among the metrics of eye motions, blinks are an important indicator of mental activity [1214]. Involuntary eye blinks in humans are controlled by a central mechanism that can be affected by the mental state. Changes in blink frequency and duration are correlated with the moment of mental fatigue, lapsed attention, and stress overload. This has been documented in various studies outside the context of surgery, including checking the drowsiness of individuals [15] and evaluating the mental workload of drivers [16, 17] and pilots [18].

In this study we tested the feasibility of using blink frequency and duration as a way to assess surgeons’ mental workload and correlated the surgeons’ eye blinks with their score on the NASA Task Load Index (NASA TLX), a known assessment instrument, based on the surgeons’ comments on six domains of task requirements. We hypothesized that surgeons who display physical signs of stress (less frequent blinks and shorter blink duration) during a laparoscopic procedure will report a high level of mental workload, assessed at the end of the procedure with the NASA TLX.

Methods

Environment and apparatus

The study was conducted in a simulated OR located in the Department of Surgery at the University of British Columbia. In the centre of this simulated OR, we set up a PC-based virtual simulator (SurgicalSim VR, METI Inc., Sarasota, FL, USA) which can create a virtual environment for training several intracorporeal procedures. Surgeons in this project were required to remove the gallbladder from attached liver tissue (laparoscopic cholecystectomy). For this procedure, the SurgicalSim displayed a series of images of human gallbladder and adjacent organs, similar to settings in our previous paper [19]. Images of the surgical site refreshed at 36 Hz in response to the surgeon’s manipulation on the tissues using a pair of laparoscopic graspers and a monocautery. The SurgicalSim did not provide haptic feedback to the surgeon, but visual feedback was close to what the surgeon receives during a real procedure. The system can record a surgeon’s performance during each session and provides immediate feedback once the task is completed.

When surgeons were called to the lab, each was required to perform two simulated cases of laparoscopic cholecystectomy whilst wearing a pair of goggles which were attached to an eye-tracker (PT-Mini, Locarna System Inc., Victoria, BC, Canada). The goggles include two small cameras, one aimed toward the wearer’s eye to capture the eye motions and the other pointing forward to capture the scene observed by the wearer. The video recorded by the eye camera was used for processing blink data for the study.

Participants

General surgery residents, minimally invasive surgery (MIS) fellows, and attending surgeons with various levels of experience were recruited from the Department of Surgery at the University of British Columbia for this study.

Procedure and measures

Consent was obtained from the participants before entering the study. Before starting the procedure, the surgeons needed to watch nine black dots printed on a paper poster to complete the calibration process with the Locarna eye-tracker (Fig. 1).

Fig. 1
figure 1

Calibration process is performed on a subject before starting the trial after wearing a special goggle for tracking eye motions

During the task, the SurgicalSim VR recorded the surgical performance along with the following measures: (1) total procedure time (in seconds), (2) instrument tip trajectory (in centimeters), and (3) number of errors during the procedure, which includes misplacement of electrocautery onto the nontarget tissue, and the number of excessive traction movements on tissues. To achieve the best performance, surgeons needed to perform the procedure in a short amount of time and with caution.

While performing the simulated tasks, the surgeons’ eye-motions were recorded by the head-mounted eye-tracker (Locarna PT-Mini) at 30 Hz. In the postoperative phase, a custom-designed algorithm was applied to the eye-motion videos to compute the number of blinks, as we previously described [20]. The algorithm was developed on the basis of computer vision techniques. Briefly, the pupil and the eyelid were identified by the computer in each video frame. The positions of the eyelid and the pupil were compared between any two consecutive frames. The frame where the eyelid starts to drop down was annotated as the moment of a blink start. The moment that the pupil size returns to full size indicated the blink end. The blink duration was calculated as the number of video frames between blink start and end and then converting that number and reporting the duration in milliseconds. To avoid bias in eye measures, participants were not informed of blink measures during the data collection phase.

Once the simulated task was completed, the NASA TLX form was used to evaluate the workload experienced by the surgeon. NASA-TLX requires participants to rate their perceived levels of mental, physical, and time demands associated with a task on a scale of 20 points, as well as their effort, performance, and frustration during that task. NASA-TLX has been used extensively in a variety of projects for assessing the mental workload experienced while performing both open and laparoscopic operations [7, 10, 21].

Data analyses

Descriptive analysis was conducted on the blink data. Median of blink frequency was used to divide surgeons into infrequent and frequent blinking groups. Task performance (task times, distance of tool trajectories, and number of errors) and the NASA TLX measures were compared between these two groups. Surgeons’ blink frequency and duration were correlated to the outcomes of the NASA TLX reports using a Pearson test on correlation.

Results

A total of 46 surgical trials were recorded from 23 surgeons. Eye motions were not recorded in 4 trials due to the equipment failure. The descriptive analysis of blinks based on 42 valid trials is reported in Table 1. The median blink frequency (6 blinks/min) was used to divide data into two groups—those with low blink frequency (n = 21) and those with high blink frequency (n = 21). The correlation coefficients (Pearson test) between NASA TLX and the blink frequency and duration were –0.17 (P = 0.282) and 0.446 (P = 0.776), respectively. Surgeons who blinked less frequently (≤6 blinks/min) made a higher level of effort on performance (60 vs. 42, P = 0.005) and reported more frustration (46 vs. 34, P = 0.047) and overall level of workload (57 vs. 47, P = 0.045) compared with those who blinked more frequently (>6 blinks/min; see Table 2). Interestingly, surgical task performance was not associated with the blink frequency of the surgeons. Although the low-blink surgeons were more satisfied with their performance (60 vs. 43, P = 0.005), task time (P = 0.439), length of tool tip trajectory (P = 0.925), and number of surgical errors made in the procedure (P = 0.671) were not significantly different between the two blink groups.

Table 1 Descriptive statistics
Table 2 Group comparison

Discussion

Our research hypothesis was supported by the results of our study: surgeons who made fewer and shorter blinks reported higher mental workloads through a self-reported instrument.

A surgeon’s workload in the operating room can be affected by multiple factors, including hours of work and hours of sleep prior to the operation, as well as the complexity of surgical procedure [7, 11, 22]. Previous studies have found that sleep deprivation of surgeons leads to a higher level of mental workload [23], and more complex surgical procedures such as endoscopic procedures will increase the mental workload compared to routine laparoscopic procedures [24]. Our long-term goal in this line of research is to further our knowledge on what elevates surgeons’ workloads and how they should cope with this factor for a safe performance. To achieve this goal, we have taken the first step, i.e., to develop a reliable tool for assessing surgeons’ mental workload.

The novelty of this study is that we employed computer vision techniques for capturing eye blinks with the eye camera of a head-mounted eye-tracker. The high-quality image of the eye enabled us to compute the number of blinks and report blink durations. As we reported elsewhere, the accuracy of this algorithm can be as high as 98 % when compared to the detection by human operators using video [20].

We noticed that the correlation between NASA-TLX and blink duration and frequency is not significant (both P > 0.05) due to some outliers. This is quite disappointing for us. Our interpretation is based on findings from two blinks groups. We understand that separating data into two groups and comparing the means between them may limit the generalizability of our findings. Nevertheless, the tendency was still presented for all data and was confirmed by the group difference.

There is a strong advantage in using the blink to assess surgeons’ mental workload. This technology provides the opportunity to continuously monitor the change in the mental status of the surgeon during a surgical procedure. Explicitly, we can plot the change in blinks over the course of an operation, label the stressful moments, and subsequently identify which surgical tasks were associated with the stress. In this study, we did not plot the data in this way because the procedure was too short, only about 3 min on average. In addition, the task in this virtual training simulation was relatively simple, requiring only the dissecting movement from beginning to end. There were no steps in the operation that increased task difficulty for the surgeons. Although we have not had the chance to compare blinks over different surgical tasks, we will include this analysis in our next project where complex surgical procedures with multiple surgical tasks will be performed.

The second limitation of this study was that we did not record surgeons’ blinks before they performed the task. Baseline blink data would help us understand individual differences and would provide the basis for making adjustments on blink changes with differing experimental conditions. We will include baseline blink data in our future study to improve the reliability of the data.

Conclusion

Eye-tracking technology can be a valuable tool for assessing the mental workload of surgeons in the operating room. As we can record the eye motion of surgeons in addition to intraoperative video, we expect to develop a way to annotate difficult steps in the procedure and visualize the surgeon’s mental state during the surgical procedure.