Introduction

The human microbiome contributes to many systemic diseases, including gastrointestinal diseases. The importance of a diverse colonic microbiome is highlighted with the success of Clostridium difficile infections treated by fecal transplants.1 In inflammatory bowel disease, different fecal bacterial populations have been found in Crohn’s disease, ulcerative colitis, and healthy patients.2 The intestinal microbiota may even influence behavior and psychiatric disease.3 The distal bowel microbiome is more diverse than the proximal bowel, as the gastrointestinal microbiome is impacted by acidic gastric secretions and transit times.4 However, the association of the microbiome with biliary diseases—particularly gallstone formation—is not well studied.

Gallstone disease, the most common inpatient gastrointestinal diagnosis in the USA,5 is multifactorial. Cholesterol stones are the most common type of gallstones,6 while bacterial infections are known to cause brown pigment stones.79 Previously, normal bile in a healthy individual was considered sterile, but this assumption was based on data using less sensitive culture methods to detect the presence of bacteria,10  as well as known inhospitable culture aspects unique to bile.11 Recently, sequencing of bacterial ribosomal 16S RNA in the bile of healthy controls for liver transplant donations revealed the presence of bacteria.12 As such, bacteria may play a role in the pathogenesis of biliary disease in healthy individuals, as well as in patients with biliary stones.

Given the limited data of small cohorts using mostly 16S ribosomal sequencing, we aimed to characterize the microbiome of bile and biliary stones in patients referred for cholecystectomy using metagenomic sequencing. We hypothesized that the microbiome of the gallbladder will differ in patients depending on the type of surgical pathology. Secondarily, we looked for microbiome correlates with patient demographics, especially those associated with higher prevalence of cholelithiasis, such as age, gender, body mass index (BMI), and Hispanic ethnicity.13

Materials and Methods

Patient and Sample Characteristics

Human bile samples and biliary stones were obtained from the gallbladders of patients undergoing a cholecystectomy between 2017 and 2019 at two New York City hospitals—Jamaica Hospital Medical Center, an inner-city hospital that cares primarily for an indigent population, and New York-Presbyterian Hospital/Weill Cornell Medical Center, a tertiary referral center. Patients with the following post-operative diagnoses were included: acute cholecystitis, biliary dyskinesia, biliary colic, cholangitis, choledocholithiasis, cholelithiasis, and gallstone pancreatitis. Patients undergoing cholecystectomy for other indications, including malignancy, were excluded. Bile (1–10 mL) and gallstones (1–3 stones) were collected at the time of surgery from intact gallbladder specimens. Samples were then immediately stored at − 80 °C.

Patient demographic information, including age, sex, race, ethnicity, and nationality, was also collected. In addition, comorbidities [diabetes mellitus (DM), hypertension (HTN)], other clinical factors [BMI, smoking status, alcohol use, perioperative antibiotic use oral contraceptive (OCP) use], and any additional biliary procedures performed around the time of the cholecystectomy, such as endoscopic retrograde cholangiopancreatography (ERCP), sphincterotomy, stent placement, or cholecystostomy tube placement, were recorded from chart review. This protocol was approved by the institutional review boards at both institutions and preoperative informed consent was obtained from all patients.

DNA Extraction and Quantification

The research team collected bile and biliary stones aseptically to minimize cross-contamination, and specimen were stored in sterile Falcon 15-mL conical tubes (Thermo Fisher Scientific, Waltham, MA) in − 80° freezers until DNA extraction. Total DNA was extracted from 30 mg of homogenized biliary stones or 30μL of bile utilizing the QIAamp DNA Micro Kit (Qiagen, Hilden, Germany), according to the manufacturer’s instructions with minor modifications. Briefly, 30 mg of biliary stone or 30μL of bile samples was lysed in 1.5 mL microcentrifuge tubes using 180μL of Buffer ATL and proteinase K. To ensure complete lysis, samples were placed in a thermomixer set to 56 °C for 12 h. DNA was then bound to the QIAamp MinElute column membranes by transferring the lysate to the columns and adding 200μL of Buffer AL followed by 200μL of 100% ethanol. Samples were then sequentially washed using wash buffers AW1 and AW2 as per the protocol. Elution of DNA was performed using 30μL of Buffer AE, centrifuging the samples at full speed (20,000 g) for 5 min. After the extraction was complete, DNA concentration was determined using a NanoDrop Microvolume Spectrophotometer (Thermo Fisher Scientific, Waltham, MA). The research team performed DNA extraction aseptically and changed gloves and pipette tips with each patient sample to minimize cross-contamination.

Metagenomic Sequencing

DNA samples were subsequently processed by the Weill Cornell Medicine Microbiome Core for metagenomic sequencing using the Illumina Nextera XT Preparation Kit (Illumina, San Diego, CA) according to the manufacturer’s instructions.

Shotgun Metagenomics Analysis

Raw sequence reads were filtered based on read quality and host reads were removed using kneadData software,14 which makes use of Trimmomatic15 and Bowtie2.16 Taxonomic classification of the resulting reads was performed using Kraken2.17 Species and genus relative abundance were then estimated using Bracken.18 The bacterial fraction of classified reads was determined by adding relative abundances of all bacterial species in the Bracken output, using Taxonkit19 to list bacterial species.

16S Library Generation, Library Verification, Quality Check, and Pooling

Library generation follows the protocol from Earth Microbiome Project.20 Amplicon libraries are washed using Beckman Coulter AMPure XP magnetic beads (Beckman Coulter, Pasadena, CA). Library quality and size verification was performed using PerkinElmer LabChip GXII instrument with DNA 1 K Reagent Kit (PerkinElmer, Waltham, MA). Library concentrations are quantified using Quant-iT dsDNA High Sensitivity Assay Kit using Promega GloMax plate reader on a microplate (Promega, Madison, WI). Library molarity is calculated based on library peak size and concentration. Libraries are normalized to 2 nM using the PerkinElmer Zephyr G3 NGS Workstation (PerkinElmer, Waltham, MA) and pooled together using the same volume across all normalized libraries into a 1.5-mL Eppendorf DNA tube (Eppendorf, Hamburg, Germany).

Sequencing

Pooled libraries are sequenced on the Illumina MiSeq instrument at loading concentration of 7 pM with 10% PhiX, paired-end 250 using MiSeq Reagent Kit v2, 500-cycles (Illumina, San Diego, CA).

Data Processing

Demultiplexed raw reads were processed to generate an operational taxonomic unit (OTU) table using USEARCH version 11.0.667.21 Specifically, forward and reverse reads were merged using a maximum of 5 mismatches in the overlap region, a minimum sequence identity in the overlap region of 90 percent, a minimum overlap length of 16 base pairs, and a minimum merged sequence length of 300 base pairs. PhiX contamination was then removed, followed by quality filtering based on FASTQ quality scores, with a maximum expected error number of 1.0. OTU clustering was performed using usearch -cluster_otus with default settings. Merged (pre-filter) reads were mapped to the OTU sequences to generate the OTU table. Taxonomic classification of OTU representative sequences was performed using usearch -sintax, an implementation of the SINTAX algorithm,22 using version 16 of the Ribosomal Database Project (RDP) Training Set.23 Alpha diversity estimation and principal coordinate analysis (PCoA) were performed using the phyloseq R package.24

Statistical Analysis

Statistical analysis of the microbiome data was performed using R 3.6.25 Differential prevalence was tested using Fisher’s exact tests. Shannon index of diversity and bacterial fractions were compared using the Wilcoxon rank test. Correlations of bacterial fraction in bile and stones were made using Pearson’s product-moment correlation. A multivariate linear regression for the bacterial relative abundance, including variables historically considered to contribute to gallstone disease, was performed in the non-infectious cohort. Variables included in the multivariate analysis were the type of sample (bile or stone), batch number, age, BMI, Hispanic ethnicity, and gender. BMI and age were included as dichotomous variables. BMI was characterized as above or below 30 kg/m2, and age was characterized as above or below 40 years of age. Additional demographic and clinicopathologic variables were compared via Fisher’s exact test for categorical variables or Student’s t-test for parametric continuous variables using Stata software, version 15.1 (Stata Corp. College Station, TX).

Results

Cohort Characteristics

In our surgical cohort of 49 cholecystectomy patients, 29% (14/49) had final pathology of acute cholecystitis while 71% (35/49) patients had non-infectious pathology—chronic cholecystitis, cholelithiasis, and normal gallbladder (patients with biliary dyskinesia) (Table 1). The two cohorts were similar in age, gender, and comorbidities. There were a variety of preoperative diagnoses in both cohorts, and as expected, more acute cholecystitis patients received preoperative antibiotics in addition to the standard perioperative single dose. Additionally, we had a subset (n = 25) of patients with matched bile and stone samples with characteristics representative of the entire cohort (Supplemental Table 1).

Table 1 Patient demographics and perioperative details in the entire study cohort and stratified by final pathologic diagnosis

Genera and Species Were Similar in Paired Bile and Stone Samples

To further characterize the microbiome of our patients, we assessed genera represented in all patient samples. A total of 40 different genera were categorized in our cohort (Fig. 1), with variations dependent on pathologic diagnosis, ethnicity, and gender. Paired samples of bile and stone from the same patients (n = 25) had comparable genera. The most common genera included Streptococcus, Klebsiella, Burkholderia, Staphylococcus, and Escherichia. In addition to genera, we characterized our cohort by bacterial species (Fig. 2). The relative abundance of different bacteria species varied while paired bile and stone samples had analogous species (Fig. 2). The most common species in our cohort were Klebsiella pneumoniae, Staphylococcus aureus, and Streptococcus pasteurianus.

Fig. 1
figure 1

Genus heatmap showing all genera. Forty different genera are represented and varied based on pathologic diagnosis, ethnicity, and gender while paired bile and stone had similar genera. The most common genera included Streptococcus, Klebsiella, Burkholderia, Staphylococcus, and Escherichia. The 25 paired bile and stone samples (1–25) are denoted

Fig. 2
figure 2

Relative abundance of species depicted A dichotomously and B continuously. Presence of species was determined by a minimum of 0.02% relative abundance. The most common species in our cohort were Klebsiella pneumoniae, Staphylococcus aureus, and Streptococcus pasteurianus

Bile and Stones Phyla Bacterial Fractions and Shannon Index of Diversity

When looking more broadly at the composition of phyla, Proteobacteria had the highest relative abundance in both bile and stones followed by Firmicutes (Fig. 3A). In addition to similar phyla composition, the bacterial fraction in bile and paired stones was positively correlated (Fig. 3B). The relative abundance of bacterial fraction was also similar but tended to be higher in stones, although not statistically significant (Fig. 3C). Lastly, the median Shannon index of diversity between bile and paired stones were similar (Fig. 3D).

Fig. 3
figure 3

Representations of phyla from bile and stone. A The relative abundance of phyla was similar in bile and stone, with Proteobacteria as the most common for both bile and stone, making up 49% and 65% respectively. B The bacterial fraction in paired bile and stone samples correlated (p = 0.008). C Relative abundance analysis of bacterial fraction in paired samples of bile and stone showed similar relative abundance, and the fractions tended to be higher in stones although not statistically significant. D Shannon index comparing bile and stone phyla was similar. The central rectangle of each boxplot represents the interquartile range (IQR) while the inner line represents the median. The whisker lines represent the minimum and maximum values while the dot represents a suspected outlier, outside of 1.5 times the IQR

Differential Prevalence of Bacterial Species in Bile and Stones Across Pathologies

We next compared bacterial species in bile and stones based on final surgical pathology. There were 11 species found in bile and 12 species in stones (Fig. 4A and B). When looking at the prevalence of bacterial species in bile, there were similar rates in acute cholecystitis and the non-infectious pathologies, including chronic cholecystitis and non-inflammatory pathologies such as biliary dyskinesia (Fig. 4A). The prevalence was similar for most species, with an increased prevalence of Streptococcus pasteurianus in the non-infectious pathologies (chronic cholecystitis and non-inflammatory pathologies) compared to acute cholecystitis (0.46 vs 0.08, p = 0.027). However, the differential prevalence in stones varied across the pathologies (Fig. 4B). For example, Staphylococcus aureus had the highest differential prevalence in bile samples for all surgical pathologies, but stone samples had variable prevalence depending on pathology. The highest prevalence of Staphylococcus aureus in stones was in the non-inflammatory patients. Four different species had different prevalence in the stone cohort, with a statistically higher prevalence of Veillonella parvula, Streptococcus sanguinis, Kosakonia radicincitans, and Salmonella enterica in acute cholecystitis patients compared to the non-infectious pathologies.

Fig. 4
figure 4

Comparison of acute cholecystitis and non-infectious patholigies, including chronic cholecystitis and non-inflammatory pathologies. The differential prevalence varied in bile (A) and stone (B) depending on gallbladder pathology with non-infectious; however, there was a wide range of species across all pathologies. C The bacterial fraction was higher in acute cholecystitis bile (p = 0.024) and stones (p = 0.012) compared to non-infectious pathology D while the Shannon index of diversity was similar in all pathologies

The bacterial fraction was statistically higher in bile and stones in samples taken from patients with acute cholecystitis compared to non-infectious pathologies (p = 0.024, p = 0.012) (4C) while the Shannon index of diversity was similar (4D). Overall, there was a variety of bacterial species found in infectious and non-infectious bile and stones. On further sub-analysis of the non-infectious cohort, there was a similar variety of species in bile samples from patients with non-inflammatory pathology compared to those with acute and chronic cholecystitis. Three species (Citrobacter freundii, Streptococcis sanguinis, and Enterobacter roggenkampii) were found only in bile from patients with acute or chronic cholecystitis.

A Sub-analysis of the Non-infectious Cohort Based on Demographics

Finally, a multivariate linear regression analysis of the bacterial fraction of the non-infectious group revealed that the bacterial composition may be relatedS to age, BMI, and ethnicity. The overall regression was statistically significant (R2 = 0.476, 6 = 6.65, p < 0.001). Age over 40 significantly predicted the relative bacterial abundance (β = 0.582, p = 0.049) while BMI over 40 (β =  − 0.570, p = 0.064) and Hispanic ethnicity (β = 0.602, p = 0.074) approached significance. Based on our multivariate findings, we completed an additional analysis within the non-infectious cohort to look for differences between Hispanic and non-Hispanic patients. The relative abundance bacterial fraction (Fig. 5A) and the median Shannon index of diversity (Fig. 5B) were not significantly different in bile or stones of Hispanic patients compared to non-Hispanic patients. Similarly, the differential prevalence of genera varied by ethnicity for bile (Fig. 5C) and stones (Fig. 5D), but these differences were not statistically significant. Staphylococcus was the most prevalent genus in bile and stones regardless of ethnicity.

Fig. 5
figure 5

A sub-analysis of the non-infectious cohort showed differences in patients’ Hispanic ethnicity. A The relative abundance bacterial fraction was not significantly higher in stones (p = 0.06) or bile in Hispanic patients compared to non-Hispanic patients. B The Shannon index of diversity was also similar across ethnicities.. The differential prevalence of genera varied between Hispanic and non-Hispanic patients in bile (C) and stones (D), although these differences were not statistically significant

Discussion

In this analysis of bile and gallstones from patients undergoing cholecystectomy, we characterized the biliary microbiome in the largest cohort to date via metagenomic sequencing. We found a broad range of bacteria, with Proteobacteria the most common phyla in bile and gallstones, and the most abundant species were Klebsiella pneumoniae, Staphylococcus aureus, and Streptococcus pasteurianus. The biliary bacterial compositions were similar to previous small bowel microbiome analyses which found Proteobacteria the most common phyla26 and Streptococcus the most abundant genus.27 In paired samples of bile and stones, we found similar bacterial compositions while the species varied based on final surgical pathology. We found a higher prevalence of bacteria in acute cholecystitis compared to non-infectious pathologies while diversity was similar across all pathologies. Surprisingly, non-infectious pathologies often had a similar variety of bacteria to those found in patients with acute cholecystitis, including high prevalence of Staphylococcus aureus in patients with biliary dyskinesia.

Our study used the more robust shotgun metagenomic sequencing, in which all microorganisms in a sample are sequenced,28,29 to characterize the biliary microbiome. Most of the previous literature used 16S ribosomal RNA amplification, including one study that found “no significant bacterial signal” identified in the normal bile samples,30 suggesting 16S sequencing analyses of the biliary microbiome are not comprehensive. Moreover, shotgun metagenomic sequencing has been shown to yield more genera of bacteria compared to 16S ribosomal RNA amplification.29

Although all patients in our study had an indication for cholecystectomy, the reasons varied from acute cholecystitis to biliary dyskinesia. Four patients in our cohort (n = 2 acute cholecystitis, n = 2 non-infectious pathology) underwent endoscopic ERCP with sphincterotomy prior to collection of specimen. Although sphincterotomy is believed to lead to increased rates of ascending cholangitis due to reflux of bacteria from the duodenum into the biliary tree, recent studies suggest that the impact of sphincterotomy on bacterial contamination may be minimal.31,32 Unfortunately, our cohort was limited for further analysis due to sample size. Previous studies included the bile of healthy individuals, while our study only included patients referred for surgical resection in our analysis. Given the similar bacterial profiles identified in the paired bile and stone samples—also noted in a previous study using the 16S sequencing method33—similar inferences may be drawn from the samples in which we only have bile or stone. Finding a variety of diverse bacteria across surgical pathologies, including diseases not classically considered to be contaminated, highlights that bile and gallbladder stones are not sterile, even in the absence of acute cholecystitis, as has recently been suggested.12 When compared to healthy liver donors, the biliary microbiome of patients with cholelithiasis was comprised of different families of bacteria.12 Moreover, bacterial dysbiosis has also been associated with the development of cholangiocarcinoma.30,34 As dysbiosis of the microbiome contributes to diseases of the small bowel and colon, and bile has been shown to harbor bacteria, it is possible that biliary dysbiosis contributes to gallstone formation.

Age, gender, and obesity are considered risk factors for gallstone formation,3537 as well as certain ethnicities.13,38 Previous data from the Hispanic Health and Nutrition Examination Survey have shown increased rates of gallstones among Hispanic patients, particularly Mexican American women, with rates of 44.1% in women aged 60 to 74 years old.13 This increased rate of gallstone diseases in Mexican American patients remained, even when other risk factors were controlled for in a logistic regression modeling.39 While the cohort in our study was small, there appears to be differences in the microbiomes of Hispanic patients, including higher bacterial fractions and increased Shannon index of diversity. These differences approached statistical significance and should be further explored on a larger scale.

Limitations to our study include a relatively small sample size of surgical patients. Nonetheless, this is the largest cohort of patients sequenced with shotgun metagenomics to our knowledge. Additionally, the bacterial diversity of our cohort may be limited by the threshold used for screening relevant bacteria, which was set at 0.02%. It is possible that we missed additional bacterial diversity because of this threshold; however, this was mitigated by our use of metagenomic sequencing instead of ribosomal 16S RNA sequencing. Our analysis may be impacted by the batch effect on our samples, as the samples were assessed in two batches and can be impacted by the other samples in the cohort. However, our statistical model accounts for the fact that samples were sequenced in two separate batches. Most of the limitations of our study are typical and inherent in microbiome analyses.

In conclusion, this study depicts the biliary microbiome in a variety of pathologies in patients undergoing cholecystectomy. This characterization highlights the large variety of bacterial species present in the biliary microbiome—including in patients with acute cholecystitis and non-infected pathologies—with similarities to previously characterized small bowel microbiomes. The diversity of the gallbladder’s microbiome and its potential impact on gallstone formation should be further studied on a larger scale.