Introduction

Pancreatoduodenectomy (PD) was first reported in 1898 by an Italian surgeon [1]. The classic PD was popularized and named the Whipple operation, for the American surgeon [2]. Initial outcomes were limited due to high mortality rates. After the technique was modified to be pylorus-preserving, the overall morbidity and mortality outcomes were improved over time by experienced surgeons at high-volume centers [3,4,5,6,7]. The mortality rate has been decreasing and is accepted at 5%, whereas the morbidity rate ranges between 41 and 52% [8, 9]. Surgeon and patient factors affect the surgical outcomes [10,11,12,13].

Patient safety and individual competency of the surgeon are significant concerns for global health care and are continuously investigated [14]. Surgical experience is a critical factor for improving surgical outcomes [15]. A number of factors contribute to surgical experience, such as training programs, technical skills, leaning curve, hospital facility, supporting system, and health care policy. The main objectives of residency training are to improve doctors’ ability to provide quality patient care, conduct research, and improve global health care. Inadequate expertise in hospitals involving patient care and coaching of young surgical attendants is a global health care problem. In the past, young surgical attendants were required to practice independently and without mentoring. These conditions created work-related stress, which affected decision-making and patient safety. We hypothesized that the competency of a young surgical attendant in performing complex abdominal surgery would be comparable with standard outcomes. We observed that no benchmark outcomes, for independently performed PDs, were available for young surgical attendants who become members of the general surgical staff after completing resident training.

Thus, this study aimed to develop and validate a novel benchmark to qualitatively assess the outcomes of PDs performed by a young surgical attendant.

Materials and methods

Database and surgeon selection

A retrospective study was performed to review data of 150 patients who underwent successful open PDs between July 13, 2006, and July 13, 2020, by a young surgical attendant at the academic university hospital.

Morbidity and mortality were reviewed 90 days after surgery. The final date for following up the data for statistical analysis was August 31, 2022. This study evaluated the work of a young surgical attendant, who was defined as a member of the general surgical staff following certification by the General Surgical Board, who performed their first PD within 3 months of certification and subsequently performed consecutive PDs for over 10 years. During the study period, our institution had 11 new young general surgical attendants. Four of them were interested in performing hepatobiliary and pancreatic surgeries; of these young surgical attendants, one had an experience of more than 10 years performing PDs and the remaining three had < 10 years’ experience. The young surgeons’ experience was divided into three phases: the initial 5 years, between 5 and 10 years, and after 10 years. A year of experience comprised all 365 working days, and the first year of experience was considered as the period from July 13, 2006, to July 12, 2007. Patient survival and cause of death were confirmed by the Bureau of Registration Administration, Thailand.

This study was approved by the Faculty of Medicine medical ethical committees (REC.63–338-10–1). The study was registered at Thai Clinical Trials Registry (registration identification TCTR20220714002). The requirement for informed consent was waived owing to the retrospective nature of the study.

Variables and definition

Heart disease included ischemic heart disease (history of or currently on treatment). Coagulopathy was defined as a prothrombin time value greater than the normal upper limit (4 s), an activated partial thromboplastin time value greater than the normal upper limit (10 s), conditions associated with bleeding disorders due to underlying disease, and current medication use (continuous use of antiplatelets or anticoagulants). Surgical waiting time was the duration from the first patient visit to the operation date. Blood loss (mL) was estimated through visualization by the anesthesiologist. The operative time (min) was recorded from the first incision until the patient was transported to the recovery room. Postoperative complications included postoperative pancreatic fistula (POPF) [16, 17], delayed gastric emptying (DGE) [18], chyle leak [19], and surgical site infections (SSI) [20]. The primary tumor was identified, and its size was recorded by a pathologist. Complex PD was defined as radical en bloc PD with adjacent organ resection or PD plus other intra-abdominal procedures. A high-risk patient was assessed as an American Society of Anesthesiologists (ASA) class ≥ III.

Benchmark outcomes

We identified and used 20 benchmark outcomes (15 metrics from previous publications and five new metrics from the literature review) to evaluate the quality of a young surgical attendant who performed PDs. Due to heterogeneity, metrics were developed from previous publications [21,22,23,24,25,26]; we selected and merged metric cut-off points based on the outcomes of our own metrics. The aim of the adjusted metrics was to reduce variations in their development and enable easy use for high- and low-volume centers. Metrics developed from previous publications were time from surgeon visit to surgery < 21 days, operation duration ≤ 450 min, hospital morbidity (patient had at least one complication, graded as per the Clavien–Dindo Classification) ≤ 73% [27], POPF grade B ≤ 15% and grade C ≤ 5%, biliary fistula < 14%, severe postoperative bleeding < 7%, reoperation < 20%, postoperative hospital stay (POHS) ≤ 15 days, readmission rate ≤ 21%, positive margin resection < 10%, in-hospital mortality ≤ 1.6%, 90-day mortality ≤ 5%, and 1- and 3-year disease-free survival (DFS) ≥ 53% and ≥ 9%, respectively. The five new metrics we used included blood loss ≤ 500 mL, SSI ≤ 25%, DGE ≤ 30%, chyle leakage ≤ 15%, and 5-year overall survival ≥ 14%.

Statistical analysis

Continuous variables with a normal distribution are expressed as mean ± standard deviation (SD) and were analyzed using the ANOVA F-test (or Student’s t-test). Data with a non-normal distribution are expressed as median and interquartile range (IQR) and were analyzed using the Kruskal–Wallis test. Categorical variables are described using frequency distributions (%) and were compared using a Chi-squared test (or Fisher’s exact test). Logistic regression analyses were performed to determine the association of the potential risk factors with the primary outcome variables and to estimate adjusted odds ratios and their 95% confidence intervals. P-values of < 0.05 were considered statistically significant. The learning curve was calculated based on the cumulative sum (CUSUM) analysis. The CUSUM chart plots the cumulative sums of deviations in each direction (positive and negative) of the sample values from a target value. The upper and lower CUSUMs called the reference values (normally indicated by K). K is often set to half the shift to be detected, in sigma units. All statistical analyses were performed using R software version 4.1.1 (R Foundation for Statistical Computing, Vienna, Austria).

Results

General database

A total of 150 PDs were performed by the young surgical attendant. Most patients were classified as ASA class II (68.7%; 103/150) or class III (30%; 45/150); the sex ratio was 1:1. The proportion of patients with at least one comorbidity was 53.3% (80/150). The median body mass index (BMI) was 21.5 (IQR, 19.1–24.6; range, 14.2–41.7) kg/m2. Among the patients, 19.3% (29/150), 19.3% (29/150), and 4% (6/150) were underweight, overweight, and obese, respectively. High-risk patients were significantly older and had more comorbidities than low-risk patients (Table 1).

Table 1 Pre- and postoperative data comparing high- and low-risk patients

Preoperative management

Endoscopic retrograde cholangiopancreatography was performed in 55.3% (83/150) of patients, and its use has decreased over the last 10 years from 68.3% to 39.5% (P = 0.002) (Table 2). The rate of preoperative biliary drainage with internal stent placement was 41.3% (62/150), and its use has decreased over the last 10 years from 53.7% to 26.6% (P = 0.003). Preoperative percutaneous transhepatic biliary drainage was performed in only 2% of patients (3/150). The median surgical waiting time was 19 (IQR, 10–32) days.

Table 2 Pre- and post-operative data comparing the level of experience

Operative data

Pylorus-preserving PD (PPPD) was the routine operation for PD. Conventional PD (CPD) was performed in 10% of patients (15/150). Complex PD, which has significantly increased in the last 10 years (P = 0.040) (Table 2), was performed in 15.3% of patients (23/150). Additional surgical procedures included wedge liver resections (10/150), colectomies (6/150), abdominal wall resections (3/150), wedge resection gastric tumors (2/150), gynecologic procedures (2/150), and jejunal resection with extended lymph node dissection (1/150). Complex PD was performed with PPPD in 76% (19/25) of patients. Major venous resection was performed in 4.7% (7/150) of patients.

The median operative time was 302.5 (IQR, 265–370) min. The mean PPPD operative time was 324.5 ± 90.5 min, which significantly differed (P = 0.007) from that of CPD (397.7 ± 156 min). The prolonged operative time was significant when associated with BMI ≥ 30 kg/m2 (P = 0.018), CPD (P = 0.007), and blood loss (P < 0.001). The median blood loss was 425 (IQR, 300–700) mL.

Pathological reports

The pathology of PD specimens included adenocarcinoma of the ampulla (44.0%; 66/150), pancreas (16.0%; 24/150), bile duct (6.0%; 9/150), and duodenum (4.7%; 7/150); other malignancies (13.3%; 20/150); benign tumors (11.3%; 17/150); pancreatic neuroendocrine tumors (6.0%; 9/150); and chronic pancreatitis (2.0%; 3/150). There were 9.3% (14/150) specimens with positive resection margins.

Hospital course and morbidity

Postoperative intensive care unit (ICU) admission was requested in 43.3% (65/150) of cases and significantly reduced after the beginner stage (P < 0.001). Morbidity occurred in 34.7% (52/150) of cases; the overall morbidity rate did not differ significantly among the three levels of experience (Table 3). Common morbidities included SSI (18%; 27/150), DGE (10%; 15/150), POPF grade B (5.35%; 8/150), chyle leakage (4.0%; 6/150), bleeding (2.7%; 4/150), reoperation (2.7%; 4/150), bile leakage (2.0%; 3/150), and POPF grade C (0%; 0/150). The causes of reoperation included complicated SSI in two patients, bleeding, and bile leakage. The median postoperative hospital stay was 14 (IQR, 10–20.8) days and has significantly decreased over the 10 years of experience (P = 0.041). The readmission rate was 2.7% (4/150).

Table 3 Surgical outcomes of pancreatoduodenectomies by the number of years of experience of a young surgeon

The 90-day mortality and survival

The 90-day mortality was 2.7% (4/150), and one patient (0.7%) expired in the hospital from acute myocardial infarction. The 1- and 3-year DFS were 71.3% (107/150) and 51.4% (73/150), respectively. The overall 5-year overall survival rate was 47.1% (65/138). Pancreatic cancer was the worst prognostic outcome (Table 4).

Table 4 Distribution survival by pathology report

Benchmark outcomes

Overall benchmark outcomes of 150 PDs were achieved by a young surgical attendant (Table 5). No significant differences were observed in the metrics between high- and low-risk patients. In 46% of the cases, the waiting time for surgery was > 21 days; most of the patients had a waiting time of < 60 days. Operative time more than 450 min was encountered in 14.7% cases, and blood loss more than 500 mL was encountered in 38.7% cases. The morbidity rate was consistently stable at less than 30% after 60 cases of PD (Fig. 1). The operative time and blood loss decreased after the second year of experience. The CUSUM charts showing the waiting time for surgery, operative time, blood loss, and POHS are illustrated in Fig. 2. Only the operative time showed a decreasing trend over the study period. A total of 47 PDs were required to overcome the lower limit of the learning curve in terms of operative time. The POHS and blood loss exhibited an initial increasing trend and resumed the target values after 34 and 53 procedures, respectively.

Table 5 Pancreatoduodenectomy benchmark outcomes of a young surgical attendant
Fig. 1
figure 1

Surgical outcomes of pancreatoduodenectomies according to the number of years of experience of the surgeon. Operative time and blood loss illustrate a decreasing trend after the 2nd year of experience. Morbidity rate revealed a decreasing trend that stabilized after the 4th year of experience

Fig. 2
figure 2

CUSUM (cumulative sum) analysis of pancreatoduodenectomy outcomes. Only the operative time depicted a decreasing trend after 47 procedures. The waiting time for surgery was longer than the upper limit in seven cases. The postoperative hospital stay (POHS) and blood loss exhibited an initial increasing trend followed by a normal trend (at the target line) after 34 and 53 procedures, respectively

Discussion

Our study showed that all benchmark outcomes for PD were achieved by a young surgical attendant. No significant differences were observed between high- and low-risk patient groups for all included metrics. The pancreatic cancer group had the worst prognostic outcome. Therefore, this group had a significantly poor 5-year survival. The overall morbidity and mortality were not significantly different among the year of experience of the surgeon. The operative time was directly related to the blood loss; both decreased over time as the surgeon’s competency, experience, and familiarity with surgical anatomy increased. Additionally, overnight postoperative observation in ICU was not necessary after 5th year of experience, and postoperative care was no longer routinely requested. The overall morbidity rate stabilized and remained unchanged, even after the surgeon had attained adequate skills in performing PD. Contrastingly, POHS decreased over time to less than 7 days due to minor morbidity in PD. Although 90% of the cases were PPPD, the most common morbidity was SSI due to low incidence of POPF and intra-abdominal collection.

PD is one of the most challenging abdominal procedures for a general surgeon. For over 50 years, the PD mortality rate has decreased to < 5%; however, the morbidity rate has remained stable at 40–45% [13, 28]. Benchmark outcome is an effective tool for assessing quality of treatment and can improve practices. To our knowledge, there are no benchmark outcomes for evaluating the competency of a young surgeon to perform PD. We identified and used 20 metrics to assess the quality outcomes of our young surgical attendant in performing PD. The first 15 were selected from previous publications, and we added five new metrics from the literature review. Previous publication benchmarks for pancreatic surgery were developed using observational methods such as a survey of the experts, Delphi consensus process, and best in class analysis [21,22,23,24,25,26]. There is no international consensus for previous metrics, which require further validation [29]. Based on the literature review, we used the term “time from surgeon visit to surgery” instead of “time from diagnosis to surgery” [30, 31]. This was because some centers had limitations for multidisciplinary team meetings and could not schedule regular weekly meetings. Our outcomes were not focused on hospital or surgeon volumes (pancreatic resections/year). We used CUSUM control charts to analyze the operative time for the learning curve of young surgeons because this method shows cumulative deviations over time and flags significant deviations from the respective group means. This method of analysis exhibits greater sensitivity for detecting shifts or trends as compared with the traditional control charts (Xbar charts) as a continuous process and improves the ability to detect small shifts. Similar to our result, a recent systematic review reported that the number of procedures to overcome the learning curve for open PDs is 30 [32]. Our CUSUM for blood loss did not show a decreasing trend over time. This result may be due to many factors such as the complexity of the PDs and inaccurate visual estimation of blood loss. The limitation of this study was that only one surgeon met the inclusion criteria for evaluation. This individual outcome cannot represent all young surgical attendants in achieving the benchmark. Further studies from multiple centers or large number of young surgeons’ population are required to validate the metrics. Our metrics did not include patient report outcomes such as quality of life and postoperative satisfaction. We did not emphasize metrics for hospital structure, process, and supporting team quality. However, hospital outcomes were based on surgeon outcomes. Further investigation is required to assess hospital quality. Benchmark parameters and cut-off values should be revised regularly based on the new evidence, treatment trends (laparoscopic or robotic surgery), and world international consensus.

In conclusion, PD is associated with a high morbidity and mortality, and requires good standard of care. Young surgeons overcome the learning curve in terms of operative time after 47 procedures. This benchmark outcome is based on only one surgeon’s learning curve; therefore, it must be compared with the benchmark outcomes of other institutions.