Introduction

Adverse events occur not uncommonly among hospitalized patients [1,2,3] and can lead to significant morbidity [4, 5], mortality [1, 6], and increased healthcare expenditure [7, 8]. Several studies demonstrated that a large number of adverse events take place in the operating room (OR) [5, 9]. Further, about half of these events were deemed preventable [4, 6]. Thus, significant improvement in patient safety may be achieved by better understanding the patterns, nature, risks, and effects of intraoperative adverse events (iAE).

Frameworks to measure postoperative outcomes, such as the one developed by Clavien and colleagues [10], have been validated and adopted by several surgical communities. They provide the uniform language for clinicians and healthcare researchers to discuss the prevalence and characteristics of postoperative outcomes. Such exchange of structured information provides insights into how to reduce rates of postoperative complications and allow developments of educational and quality initiatives. On the other hand, although the need to systematically identify and characterize iAE has been advocated in the past, it has rarely been done [11]. This knowledge gap on iAE may be due to several factors. Several terms were used in place of iAE in the literature, and various methods were utilized to detect them, including patient charts, self-reported incident reports, and direct field observations in the OR. The heterogeneous nature of reporting in the literature has hindered any attempt to systematically synthesize evidence on the characteristics and incidence rates of iAE.

This systematic review summarizes published data on iAE in adult and pediatric patients undergoing elective or emergent surgery. The objectives of the present study are to (1) present the terms used in the literature to describe iAE, as well as their definitions, (2) identify types of reported iAE and summarize them into discrete categories, and (3) describe reported frequencies and severity assessments of iAE.

Materials and methods

We performed a systematic review according to the guidelines outlined in the Cochrane Collaboration handbook [12] and reported the findings following the Preferred Reporting Items for Systematic reviews and Meta-Analyses (PRISMA) statements [13]. Research ethics approval and informed consent were not required.

Data sources and search strategy

Ovid Medline, Embase, and Cochrane Central Register of Controlled Trials were searched from inception to June 2016. We used database-specific combinations of key index terms and text words related to medical errors, iatrogenic disease, patient harm, never event, and near miss, and combined these with terms related to surgical procedures and operating rooms, and terms related to risk management and reporting. We limited our search to studies reported in English. Detailed search strategies for each database are presented in Supplemental text. In addition, references of the included studies were manually searched for eligibility. EndNote X7 (Reuters, New York) was used to organize references.

Inclusion and exclusion criteria

We included original research studies that reported iAE from elective or non-elective operations on adult or pediatric populations in academic or non-academic and urban or rural hospitals around the world. Specifically, we included studies that reported types and incidence rates of iAE. We included randomized/quasi-randomized trials and cohort, case–control, cross-sectional, and case-series studies. We excluded abstracts, dissertation/thesis work, unpublished reports or data, reviews, commentaries, protocols, and letters to editors. Additionally, we excluded studies that took place in endoscopy or procedure rooms.

Study selection and data extraction

Two reviewers evaluated the titles and abstracts of all eligible articles and created a subset for full-text review. The same two reviewers independently applied the inclusion and exclusion criteria in the full-text review to select studies for data abstraction. A data abstraction form was created based on the protocol using Excel sheet (Microsoft, WA). It was pilot-tested on five studies for feasibility and acceptability by both reviewers. Then, the two reviewers independently abstracted data. When disagreement in selection or data abstraction occurred, it was resolved through discussions or by a third reviewer if no consensus was reached. We did not contact study authors to obtain additional data.

Assessment of bias

We used the National Institute of Health (NIH) Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies to assess the quality of studies included in the review [14]. We used four items in the tool that specifically assessed internal validity of the included studies (Table S1, supplemental material). Two items assessed risk of sampling bias, and the other two items assessed risk of measurement bias. All items were rated as yes, no, cannot determine, not reported, or not applicable. The overall quality rating per included study was provided. A rating of “good” was given if all four items were rated as yes. A rating of “poor” was assigned for studies that contained more than 3 items rated as no or not reported. The rest of the combinations of item ratings were given “fair” overall assessment. Two authors independently performed assessments of bias. Differences in the assessments were discussed, and consensus was reached.

Data synthesis

The primary objectives of the present review study were to characterize iAE and to assess frequencies of their occurrences. First, several terms were used in place of iAE in the literature. We summarized these terms and their respective definitions. Second, we presented the types of iAE that were reported in the included studies. Further, we summarized the iAE types into seven discrete categories. Third, we attempted to estimate frequency of iAE occurrences; however, a statistical meta-analysis was not possible due to the wide heterogeneity in patient samples, procedure types, data sources, and outcome definitions and measurement techniques. Therefore, we performed descriptive synthesis of the reported frequencies from individual studies that were deemed to have fair to good methodological quality. We presented these findings by the methods used to detect iAE. Specifically, the detection methods of iAE included direct observation (i.e., human observers in the OR), patient charts, administrative data, incident reports, and surveys. Additionally, we summarized the studies that measured severity of iAE and reported corrective processes taken to rectify the events.

Results

We identified 3346 articles through database search and additional 16 titles by hand for a total of 3362 articles. After 340 duplicates were removed, 3022 titles and abstracts were screened and 2691 articles were excluded as they did not fit our inclusion criteria. We reviewed the full texts of the remaining 331 articles and excluded 284 of them. Thus, 47 articles were included in the final cohort. A flowchart of study selection process is illustrated in Fig. 1.

Fig. 1
figure 1

PRISMA flow diagram

Study characteristics

Table 1 demonstrates characteristics of the 47 included studies. The studies were published in years 1999 to 2016 with a steady increase in the number of publications per year with time (Figure S1, supplemental material). Nine surgical specialties were represented, and neurosurgery had the most number of included studies (17%). All included articles were observational studies. Thirty articles were cross-sectional (64%), 3 were cohort (6%), and 14 were case-series studies (30%). Various data sources were used to identify iAE. Thirteen studies (28%) examined incident reports, 8 studies (17%) administered surveys to healthcare providers, and 5 studies (11%) used patient charts. Eight studies (17%) deployed human observers in the clinical settings to identify iAE. The methodological quality assessment of the included studies is presented in Table S1 (supplemental material). Eight of 47 studies [15,16,17,18,19,20,21,22] were deemed to be of poor methodological quality. Ten studies [6, 23,24,25,26,27,28,29,30,31] were deemed to have good methodological quality, and the rest were rated as fair.

Table 1 Characteristics of the included studies

Definitions and types of intraoperative adverse events

Several terms were used to describe iAE as demonstrated in Table 1. Nine of 47 studies (19%) [21, 32,33,34,35,36,37,38,39] identified wrong-site or wrong-patient operations specifically. The rest of the studies (81%) reported various types of iAE and adopted sixteen different terms to describe them. Adverse events (30%) was the most frequently used term, followed by error (11%) [22, 40,41,42,43] and incident (6%) [25, 44, 45]. Twenty-four studies explicitly stated definitions of the terms used to describe intraoperative events. Two [42, 46] of them used the same definitions as other studies, leaving 22 unique definitions. While several definitions were found, there was a pattern of recurrent concepts. Nine definitions [31, 43, 45, 47,48,49,50,51,52] included the notion that an iAE was unintended or unanticipated. Eight definitions [6, 27, 31, 48, 50,51,52,53] stated that an iAE was caused by medical teams and was not due to patients’ underlying conditions. Further, an iAE resulted in increased potential harm in 7 definitions [20, 45, 47, 49, 54,55,56] and in actual physical harm in 13 definitions [6, 16, 27, 45, 48,49,50,51,52,53, 55,56,57]. The included studies characterized a total of 74 unique iAE types as demonstrated in Table S2 (supplemental material). Equipment (identified in 13 studies), communication (12), wrong site/wrong patient surgery (12), diagnosis (11), tissue injury (11), technical (10), and medication (9) were the iAE types most frequently identified in the studies.

Frequency of intraoperative adverse events by detection method

Five studies [23, 29, 47, 58, 59] used direct observation in the OR by human observers during surgical procedures to identify iAE (Table 2). Two studies [47, 58] reported major and minor events in pediatric cardiac surgery. Catchpole et al. [58] reported 7 major events and 366 minor events in a sample of 24 patients and Barach et al. [47] found 90 major events and 991 minor events in a sample of 102 patients. Another study [59] reported a mean of 10 minor problems and a mean of 6 operating problems per case in 24 pediatric cardiac operations and means of 13 minor and 5 operating problems per case in 18 adult orthopedic procedures. Two studies [23, 29] observed consecutive adult complex arterial surgical cases. Albayati et al. [23] observed 66 cases during a 9-month period and identified 1145 failures. Mason et al. [29] developed a structured tool based on the work by Albayati et al. to identify 256 intraoperative error events in 21 consecutive cases.

Table 2 Frequency of iAE by data source types

Three studies reported frequency of iAE by reviewing patient charts [28, 50] or administrative data [36] (Table 2). Kaafarani et al. [28] identified 181 patients with at least one iAE out of 9292 patients who underwent general surgery procedures by reviewing the validated and risk-adjusted database based on patient charts. Proctor et al. [50] performed a chart review of 64 pediatric general surgery patients and found 18 intraoperative errors, five of which led to adverse outcomes. The Veteran Health Administration (VHA) database of more than 2 million surgical procedures performed in the USA during 54 months was assessed to identify 108 intraoperative wrong site or wrong patient operations [36].

Seven studies [24, 26, 30, 32, 39, 45, 51] identified iAE through reviews of incident reports (Table 2). Of these, two studies [26, 30] utilized a tool dedicated to prospectively report adverse events in spine surgery called the Spine AdVerse Events Severity system, version 2 (SAVES V2). Street et al. [30] found 113 intraoperative events in 942 patients admitted to a referral spine center during 12 months. In the same center, Dea et al. [26] also used SAVES V2 to identify 32 patients with at least one intraoperative event out of 101 patients who underwent emergency operations with a diagnosis of metastatic oncologic spine disease during 48 months. Another incident reporting tool called the Northwestern Online Surgical Quality Improvement (NOSQI) system was used to allow healthcare providers to disclose intraoperative events [24]. The authors reported 957 adverse events from 15,524 operations, and 43 events were detected in the intraoperative phase out of the 340 events in which timing of the occurrence was known. Further, four studies reported low frequencies of wrong site surgery [32, 39], intraoperative death [45], and never events [51].

Seven studies [33,34,35, 37, 38, 42, 43] administered surveys to healthcare providers to identify iAE. Of these, five studies reported proportions of the surveyed surgeons who had any wrong-site or wrong-patient operations in neurologic [33, 35], hand [34], head and neck [37], and eye [38] surgery. Nine to fifty percent of the respondents admitted to having performed at least one wrong-site or wrong-patient operation in their careers. In the two other studies, surveys to identify iAE that occurred within the last 6 months of practice were administered to Head and Neck physicians in years 2003 [43] and 2012 [42]. These studies reported 91 and 101 iAE from 466 to 681 respondents, respectively.

Severity of intraoperative adverse events

Twenty studies rated iAE by their severity (Table 3). In 18 studies, the severity was classified based on patients’ postoperative outcomes. It ranged from resulting in no harm to transient harm to permanent harm or death. In six studies [24, 37, 42, 43, 48, 58], the severity was categorized based on whether the patients required additional postoperative interventions including noninvasive and invasive treatments, as well as initial or prolonged hospitalization. In four studies [23, 29, 58, 59], the severity was classified by the degree of disruption in flow of operations caused by the iAE. Kaafarani et al. [28] determined the severity based on the level of intraoperative interventions required to rectify the iAE ranging from no intervention to intervention not requiring organ removal to intervention requiring tissue or organ removal and to missed injury.

Table 3 Severity of intraoperative adverse event classifications

Characteristics of intraoperative rectification of adverse events

Four studies [18, 28, 47, 54] characterized how iAE were rectified during the same procedure. In two studies [47, 54] where human observers were deployed in the OR, rectification processes were classified as cognitive, system, monitoring, collaboration/compromise, adaptation/innovation, leadership/authority, verification, luck, and surgical technical types. Kaafarani et al. [28] categorized rectification processes by the invasiveness of surgical interventions needed in response to tissue injuries. Pollock and Hayward [18] characterized rectifications of events by providing brief descriptions, such as revision for blocked ventriculoperitoneal shunt and repair for dural laceration during lumbosacral discectomy.

Discussion

In this systematic review, we found that intraoperative adverse events were reported under various terms and definitions and by using different detection methods, including direct observation in the OR, patient charts, administrative data, incident reports, and surveys. Frequencies of iAE appeared to vary depending on the detection methods, with higher numbers reported when direct observations took place in the OR. Collectively, the included studies identified 74 types of iAE. When severity of iAE was measured, most studies stratified it based on the seriousness of postoperative outcomes or the invasiveness of additional interventions required. Corrective processes to rectify intraoperative events were rarely assessed.

Various terms and definitions have been used in the literature to describe iAE. The World Health Organization (WHO) adopted the definition of an adverse event as “an injury related to medical management, in contrast to a complication of disease,” originally published in the Harvard medical practice study [2, 60]. The WHO also stated that other terms such as incidents, mishaps, unanticipated events, or accidents were sometimes used in place of adverse events. In our review, the included studies used various terms including problems, surgical confusion, patient safety issues, and errors to describe iAE. This variability in the chosen terms can hinder anyone wanting to synthesize evidence on and discuss with others about iAE. Errors, in particular, carry a distinct definition from adverse events. Errors are defined as “the failure of a planned action to be completed as intended or the use of wrong plan to achieve an aim” and may occur in the absence of adverse events [61]. However, in four of the five included studies that used the term errors, in fact, described iAE, such as tissue injuries and wrong site surgery [22, 41,42,43]. Based on our summary of the definitions used by several studies, we propose that an iAE to be defined as any unintended event caused by medical teams during surgical procedure and unrelated to patients’ underlying conditions that led to potential or actual physical harm. Our review identified a total of 74 iAE types, several of which shared similarity and could be classified in the same broader category. Therefore, we summarized the iAE types to 7 categories based on similarity while achieving maximum mutual exclusivity possible between the categories. In our proposed framework, our iAE types included cognitive, behavioral, expertise/technical skill, tissue injury, equipment/medication, environmental/organizational, and patient-related events (Table 4).

Table 4 Proposed framework of intraoperative adverse event types and their examples

Frequency of reported iAE varied widely depending on the detection method chosen. Due to the heterogeneous nature of included data, a statistical meta-analysis was not possible. However, some observations could readily be made. Direct observation method yielded a higher number of iAE than reviews of patient charts, incident reports, and surveys. Retrospective analysis using patient charts to recreate chain of adverse events is often done without first-hand knowledge of the situation and results in suboptimal data collection. Incident reports require consistent and complete inputs from healthcare providers, who often fail to comply due to time constraints and fear of punishment, litigation, and lessened reputation [62]. Survey studies are often limited by respondents’ recall bias and lack of detail in their input. In a highly complex system like the OR, a prospective direct observation is a more sensitive technique to detect iAE that either lead to or have potential to result in harm [63].

Assessment of iAE severity allows identification of the most pertinent events from a patient safety perspective. Most of the studies in our review measured severity based on patient outcomes or requirement for additional therapeutic interventions in the postoperative phase and this method has limitations. It is often unclear whether the postoperative outcomes can be attributed to the identified iAE. Instead, frameworks that measure severity based on the characteristics of iAE may be more elucidating. Kaafarani et al. [28] classified the iAE severity based on the level of intraoperative therapeutic treatments performed and demonstrated construct validity evidence by predicting 30-day postoperative morbidity. Similarly, a Delphi study from Rosenthal et al. [64] constructed an iAE severity classification tool based on the level of intraoperative therapeutic interventions required and postoperative outcomes. Both tools were designed to be applied when assessing iAE using patient charts. Using direct observation methods, more frequent identification and detailed characterization of iAE are possible. Therefore, a severity measurement tool that can be implemented in direct observation methods may help better understand the relationship between the severity of iAE and postoperative outcomes.

Our study has limitations. Our review relied mainly on search of articles in a wide range of academic journals indexed in major databases and may have missed evidence published in non-indexed journals and in the gray literature. Individual studies adopted various definitions of iAE using different detection methods, and thus, our review was restricted to a synthesis of the results as reported in these studies. Assessment of bias in the included studies was performed using a modified version of NIH Quality Assessment Tool for Observational Cohort and Cross-Sectional Studies [14], a relatively new tool still awaiting full validation. However, the four items we chose from this tool to assess the quality of methodology represented key components of internal validity.

This study systematically reviewed the current evidence in prevalence and characteristics of iAE that were detected by direct observation, surveys and interviews of healthcare providers, and reviews of patient charts, incident reports, and administrative data. There is a lack of general agreement on how iAE should be measured and reported. Our findings suggest that direct observation method has the most potential to identify and characterize iAE in detail. Further, a framework to measure severity and corrective processes to rectify iAE needs to be developed and validated. Structured framework (Table S3) for reporting may facilitate discussions among clinicians and healthcare researchers to discover ways to prevent iAE from recurring and help foster more responsible attitudes and heightened awareness toward surgical safety.