Hepatocellular adenomas (HCAs) are rare benign liver tumors with the potential for malignant transformation [1] and a prevalence that ranges between 0.007 and 0.012% in the adult population [2]. The risk of developing HCA increases with the protracted use of oral contraception (OC) and metabolic syndrome [35]. The estimated annual incidence is 3/1,000,000, but this value might be 100-fold higher in long-term and high-dose users of OCs [3, 6]. Due to the widespread diffusion of OC use and the increase in obesity in western populations in addition to the frequency of incidental diagnosis upon imaging, the prevalence of HCA is expected to increase in the future [7].

The surgical indications are related to the prevention or treatment of complications that primarily consist of bleeding and cancer transformation, which are estimated to occur in 17 and 4% of cases, respectively [8, 9]. The following four subtypes of HCA have been identified: steatotic, inflammatory, β-catenin-activated, and unclassified [1, 10]. The patient’s background and the size and subtype of the lesion, which may be correlated with radiologic features [11], define the risk of complications and guide surgeons in the decision-making process [2, 1215].

The increasing diffusion of and expertise in laparoscopic liver resections (LLRs) in recent years has aided the defining of benign liver lesions, such as HCAs, as an excellent indication for minimally invasive surgery [1618]. Nevertheless, even if minor LLRs are confirmed to be a standard practice in surgery, major LLRs involve an innovative procedure that is still being explored and is associated with incompletely defined risks compared to open liver resections (OLRs) [19, 20]. The few studies in the literature that have focused on the laparoscopic management of HCAs have described very small and heterogeneous populations of patients with low proportions of major liver resections, and this information does not allow for the making of any clear recommendation [2125].

Thus, the primary objective of the present study was to compare the short-term outcomes and surgical morbidities of elective LLRs and OLRs in a large population of patients with indications for liver resection for HCA.

Materials and methods

Patient selection

Based on our previous retrospective cross-sectional study of 117 HPB European surgical centers [15], we performed a nested and dedicated study of LLRs.

To this end, we excluded the following patients from the previous population study: those who underwent emergency resections for a ruptured tumor, and non-exclusive LLRs (i.e., hand-assisted or hybrid techniques) [19]. The choice of the surgical approach was based on imaging and the patient’s characteristics. All participating centers shared similar criteria regarding the selection of the patients who were eligible for laparoscopy for major liver resection. These criteria included the size of the lesion (<10 cm), a safe distance between the major vessels and the transection lines, and the lack of a need for vascular or biliary reconstruction. The analyses were performed in an intention-to-treat manner; therefore, the patients who underwent LLR and were converted to OLR were considered to be in the laparoscopic group. Liver resections were classified according to the Brisbane classification [26]. Central hepatectomy was defined as resection of segments four, five, and eight [27]. The ethics committees of all hospitals approved the study. The study was conducted in accordance with the ethical principles of the Declaration of Helsinki.

Surgical technique

Regardless of the surgical approach, the central venous pressure was maintained at a low level (<5 mmHg) during the liver transections via the control of intravenous fluid administration. Abdominal drainage was selectively used in cases with concerns about intraoperative bile control regardless of the approach.

All LLRs were performed using 4–6 ports as previously described [18, 23, 2831]. In cases of major procedures, the operative technique consisted of the isolation and division of the hepatic inflow, the mobilization and transection of the liver, and the division of the hepatic outflow. Liver parenchyma transections were performed in each center using both an ultrasonic dissection device with bipolar cautery and the LigaSure® (Valleylab, Covidien Inc., USA) or the Harmonic® Scalpel (Ethicon EndoSurgery Inc., Cincinnati, OH, USA). The resected specimen was retrieved in a plastic bag through a Pfannestiel incision. The incision was closed, and the absence of bleeding and biliary leakage was then verified by laparoscopy. Conversion was defined as the need for laparotomy at any time during the procedure.

For OLR, either a right subcostal J-shape or midline incision was created. The crush and clamp technique, bipolar cautery, ultrasonic surgical aspirators, and vascular staplers were variably used in the different centers for parenchyma transection. In all patients, prior to transection, intraoperative liver ultrasounds were performed to guide the resection margins. Inflow clamping was performed in cases of bleeding.

Data collection and perioperative outcomes

Epidemiologic data [age, gender, body mass index (BMI), circumstances of the diagnosis, symptoms, and the use of OCs or anabolic steroids], comorbidities, American Society of Anesthesiologists (ASA) score, years of surgery, blood tests (prothrombin time, total bilirubin, hemoglobin, transaminases, and α-fetoprotein serum levels), serological tests (hepatitis B and C), operative variables (operative time, blood loss, transfusion, duration of inflow clamping, and conversion rate), postoperative variables (complications and hospital stay), and pathological variables (fibrosis, steatosis, hepatocellular injury, HCA subtype, and malignant transformation) were recorded.

Postoperative complications were stratified according to the Clavien-Dindo classification (CD) [32]. Liver function tests were measured on postoperative days (PODs) 1, 3, 5, 7, and 10 in cases of major hepatectomy. Liver failure was defined according to the “50–50 criteria” on POD 5 [33]. Ultrasound and/or CT scans were performed in cases of suspicion of complications. Complications were considered as those that occurred within 90 days after surgery. The mid- and long-term follow-ups included clinical, biological, and radiological evaluations 90 days after surgery and every six months thereafter in all centers.

Statistical analysis

The continuous variables are summarized with the median and the first (Q1) and third (Q3) quartiles. The categorical variables are presented as the frequencies and percentages. To improve the validity of the comparisons between groups, two stages were followed.

First, some data were missing from the dataset. The variables with appreciable amounts of missing data were the following: BMI (19.7% missing), prothrombin time (40.7%), bilirubin, hemoglobin, and ALAT and ASAT values, ranging from 22 to 27%. Therefore, multiple imputation (MI) via a chained equation was used to create the 20 multiply imputed datasets: the ALAT, ASAT, and bilirubin serum values were log-transformed before MI due to skewed distributions. Convergence was examined by graphical diagnosis. The incomplete variables were imputed under fully conditional specification because of its flexibility in the specification of the method and the set of predictors to be used for each incomplete variable [34].

Second, a propensity score (PS) method was used to reduce the bias caused by the differences between the two groups. The variables included in the PS estimation were age, gender, BMI, ASA grade, diabetes, hemoglobin, quick time, bilirubin, ALAT, ASAT, F0/F1-F2 fibrosis, steatosis, the number and maximum size of lesions, the type of hepatectomy, the year of surgery, and the surgical team. Because the number of patients included was relatively small, to balance the covariates across treatment groups, PS was implemented by assigning a weight of 1 to each patient in the LLR cohort and weighting each patient in the OLR cohort by the propensity odds [PS/(1 − PS)]. This PS weighting resulted in a pseudo-population of OLR that had the same distribution of measured covariates as that observed in the patients in the LLR group (standardized mortality ratio weighting, SMRW) [35]. With this approach, no LLR individuals were excluded from the analysis. PSs and SMRWs were generated for each imputed dataset.

Comparisons between groups were performed with Chi-square tests for categorical variables and Wilcoxon score tests for continuous variables for each imputed dataset, and the statistics were combined to obtain the final results. All of the analyses were performed with SAS® 9.3 (SAS Institute Inc., Cay, NC, USA). A p value < 0.05 was considered statistically significant.

Results

Demographic and operative characteristics

Among the source population of 573 patients [15], 40 patients were excluded as follows: emergency resections for ruptured HCA (n = 13), hybrid LLR (n = 14), and hand-assisted LLR (n = 13).

The baseline characteristics of the LLR and OLR groups before MI are provided in Table 1. Before MI, there were no differences between the LLR and OLR groups in terms of OC use (51 vs. 57.5%; p = 0.153), bleeding HCAs before surgery (20.2 vs. 16.9%; p = 0.359), patients presenting with abdominal pain (50.5 vs. 54.4%; p = 0.375), prothrombin activity (98 vs. 96.1%; p = 0.205), AFP values > 10 ng/ml (1.4 vs. 3.7%; p = 0.180), or hemoglobin (13 vs. 12.7 g/l; p = 0.185). In contrast, the LLR patients had lower total bilirubin (8 vs. 9 mmol/l; p = 0.005), AST (23 vs. 30UI/l; p = 0.002), and ALT serum level values (26.5 vs. 30UI/l; p < 0.001) than the OLR patients.

Table 1 Preoperative data and surgical characteristics before Multiple Imputation

The baseline characteristics after MI are provided in the supplementary material (Suppl. Table 1B). After MI, there were higher prevalence of women (93.3 vs. 84.6%; p = 0.008) and patients with diabetes (9.1 vs. 4.3%; p = 0.026) in the LLR than the OLR group. Moreover, LLR patients presented with higher hemoglobin (13.1 vs. 12.7 g/l; p = 0.007) and prothrombin activity (96.9 vs. 90.7%; p < 0.001) values and lower AST (26.3 vs. 30 UI/l; p = 0.021) values than the OLR patients. There were no group differences in the proportions of symptomatic patients, BMI, ASA score, complicated HCA, number of lesions, or the incidence of mild fibrosis or steatosis. The median tumor size was significantly smaller in the LLR group than the OLR group (60 vs. 70 mm, respectively; p = 0.009). Major liver resections were more frequent in the OLR group than the LLR group (34.8 vs. 9.6%; p < 0.001), although the overall bisegmentectomy proportions did not significantly differ between the two groups. Left lateral sectionectomy (LLS) was more prevalent in the LLR than the OLR group (79.7 vs. 49.3%; p < 0.001). Wedge resections were more frequent in the LLR group than the OLR group (50.5 vs. 30.2%; p < 0.001; Supplementary Table).

Perioperative outcomes before adjustment

The perioperative outcomes before propensity score adjustment (PSA) revealed that the patients in the OLR group presented longer operative times (180 vs. 150 min, respectively, p < 0.001), greater blood loss (300 vs. 100 ml, respectively, p < 0.001), a greater transfusion rate (45 vs. 8 RBC units, respectively, p < 0.001), and a greater need for inflow clamping (56.6 vs. 21.2%, respectively, p < 0.001) compared with the LLR group. The conversion rate for the LLR group was 6.3%.

No mortality was registered across the entire study population. The OLR group exhibited a higher rate of postoperative morbidity than the LLR group before adjustment (41.2 vs. 24.0%, respectively, p < 0.001), and there were no group differences in the proportion of complications. Postoperative bleeding was the most frequently observed complication in both groups (5.8 vs. 2.9%; p = 0.143). Overall, the severity of postoperative complications based on CD classification revealed greater proportions of severe complications (≥grade III) and mild complications (CD I/II) in the OLR group than the LLR group (10.5 vs. 4.8%; p = 0.033 and 31.7 vs. 19.7%; p = 0.003, respectively). The median duration of hospital stay was significantly shorter for the LLR group than the OLR group (5 vs. 8 days; p < 0.001).

Propensity score adjusted model

After the construction of the SMRW-PSA model, 208 LLR patients were compared with 216 OLR patients. The balance of covariates was assessed for the accuracy of the model. The baseline characteristics of the matched study population are summarized in Table 2. There were no significant differences in the demographic or surgical characteristics between the groups with the exception of preoperative arterial embolization, which occurred in one patient in the OLR group and nine patients in the LLR groups (0.5 vs. 4.3%; p < 0.001).

Table 2 Preoperative data and surgical characteristics after Propensity Score adjustment

After PSA analysis, the patients in the OLR group presented with greater blood loss (196 vs. 93 ml, respectively; p < 0.001), transfusion rates (24 vs. 8 RBC units, respectively, p < 0.001), and the need for inflow clamping (40.3 vs. 21.2%, respectively, p = 0.002) compared with the LLR group (Table 3). In contrast, no difference was observed between the LLR and OLR groups in the overall postoperative morbidity rate (24 vs. 22.7%, respectively, p = 0.778). The proportion of specific complications, which was non-significant in the pre-weighted analysis, exhibited the same behavior in the post-PSA analysis between the LLR and OLR groups. Overall, the post-PSA severity of postoperative complications based on CD classification did not differ between the groups. After adjustment, the difference in favor of the laparoscopic approach compared with the OLR group in the duration of hospital stay was maintained (5 vs. 7 days, respectively, p < 0.001).

Table 3 Operative and postoperative outcomes after Propensity Score adjustment

Discussion

The present study compared the short-term outcomes of pure LLR and OLR in patients with HCA. The relevance of this study is derived from the large sample of patients, which reflects the synthesis of the international experiences of a rare pathology of twenty-seven tertiary hepatobiliary centers in Europe. To date, this study represents the largest series in this setting. Overall, the present study revealed a benefit of the laparoscopic approach over the open approach for liver resection in patients with HCA.

The present study has strengths related to its relatively large sample of patients who were operated on via laparoscopy for the same indication. To make it possible, a dedicated statistical approach using MI, PSs, and SMRWs was performed. This method, via the inclusion of the year of the surgery and the center, was able to adequately handle time effects and center effects.

During the Second International Consensus Conference in Morioka about LLR, evidence was evaluated using GRADE and recommendations were made according to the Zurich-Danish consensus conference model [20]. To do so, the jury, based on their relative importance, retained 14 comparators. Among these comparators, the following five were relevant for benign tumors: mortality, complications, blood loss, length of stay, and recovery. The present study was able to report the relevant results and their comparisons for all but one comparator, i.e., mortality, which was nil. LLRs were associated with an equivalent rate of complications relative to OLRs, but the LLRs exhibited less blood loss and a shorter length of stay. Recovery, which was considered as a major comparator by the jury in Morioka, was not evaluated because of the retrospective design of the study. In addition to those major comparators, LLR group was associated with reduced blood loss and reduced need for clamping, with a related effect of reduced red blood cell transfusion rate.

After PSA analysis, there are still differences regarding preoperative arterial embolization rate that was more frequent in the LLR group in a significant way (4.3% vs. 0.5%, p < 0.001; Table 2). Some authors can argue that percutaneous embolization prior to resection could influence blood loss, especially for bigger lesions that may receive important arterial blood supply. However, arterial embolization procedure appears to have a little impact on blood loss in both groups. Indeed, although the median arterial embolization rate was significantly higher in the LLR group than in the OLR group, an embolization proportion inferior to 5% in the LLR group did not influence the blood loss amount in a relevant way, as represented by a median value that was more than double in OLR group compared with the LLR group (196 vs. 93 ml, respectively).

The inflow clamping was used less often in the laparoscopic group than open group, but there were no differences in clamping time when it was applied. The hemostatic effect of the pneumoperitoneum as well as the use of ultrasonic dissection devices were likely responsible for the reduced blood loss in the LLR group. As a consequence of this, a reduced rate of Pringle maneuver was necessary in the laparoscopic group. A similar outcome was reported in previous studies [3638], which observed a reduced use of pedicle clamping in LLR associated with a reduced blood loss compared with OLR. Nevertheless, these outcomes must be interpreted with caution because of the inherent risk of bias related to retrospective nature of the studies. A prospective randomized study of LLR vs. OLR could help clarifying this issue.

Aside from these additional two comparators, the conversion rate was 6.3%, which is a proportion that is comparable to series that have combined minor and major liver resections [16]. These results are in agreement with previous studies on LLRs [36, 3941]. These excellent results may reflect the fact that all of the resections were performed in tertiary HPB referral centers, some of which were pioneers in this particular field. However, because we included all cases regardless of the year of the surgery and we matched patients based on the year of surgery, these results may be interpreted to indicate that there is almost no learning curve for minor pure LLR once the surgeons already have major expertise in both the liver and minimally invasive surgeries.

In the present series, after 2008, LLR became the approach of choice in AFC-HCA-2013 Study Group centers [15]. This result is not surprising because all AFC-HCA-2013 Study Group centers are tertiary HBP surgery reference centers and minor resections are now performed laparoscopically by many teams worldwide [16]. Some detractors might argue that the surgical need was influenced by the introduction of laparoscopy [7, 42]. This argument was rejected for the first time by Bryant and coworkers in 2009 [18] and more recently in a nationwide US study that included 2633 patients who underwent surgery for benign hepatic tumors in which only 5.5% of patients underwent laparoscopic surgeries. Notably, an increase in the rate from 1.9% in 2000 to 7.4% in 2011 was observed [7].

However, the present study has some limitations that should be considered. First, regarding the external validity of the study findings, it is important to highlight that experienced surgeons performed all of the operations. Thus, the present results must not be generalized to centers with limited experience in hepatobiliary and minimally invasive surgeries. Second, despite the use of the MI method, to control bias due to missing data, there is the possibility of statistical confounding effects in the generation of the PS analysis. Third, the proportions of major resections were less than 10% in each study group after PSA; therefore, this study cannot be generalized to this subset of challenging hepatectomies.

In conclusion, based on the comparators that were retained by the jury during the Second International Consensus Conference in Morioka and the results of comparisons of the short-term outcomes of LLR and OLR with adjustments for the effects of the year of resection, the center, and the background factors, LLR is confirmed as the standard approach for minor liver resections.