Abstract
Human milk microbiome studies are currently hindered by low milk bacterial/human cell ratios and often rely on 16S rRNA gene sequencing, which limits downstream analyses. Here, we aimed to find a method to study milk bacteria and assess bacterial sharing between maternal and infant microbiota. We tested four DNA isolation methods, two bacterial enrichment methods and three sequencing methods on mock communities, milk samples and negative controls. Of the four DNA isolation kits, the DNeasy PowerSoil Pro (PS) and MagMAX Total Nucleic Acid Isolation (MX) kits provided consistent 16S rRNA gene sequencing results with low contamination. Neither enrichment method substantially decreased the human metagenomic sequencing read-depth. Long-read 16S-ITS-23S rRNA gene sequencing biased the mock community composition but provided consistent results for milk samples, with little contamination. In contrast to 16S rRNA gene sequencing, 16S-ITS-23S rRNA gene sequencing of milk, infant oral, infant faecal and maternal faecal DNA from 14 mother-infant pairs provided sufficient resolution to detect significantly more frequent sharing of bacteria between related pairs compared to unrelated pairs. In conclusion, PS or MX kit-DNA isolation followed by 16S rRNA gene sequencing reliably characterises human milk microbiota, and 16S-ITS-23S rRNA gene sequencing enables studies of bacterial transmission in low-biomass samples.
Similar content being viewed by others
Introduction
Breastfeeding provides infants with nutrients and is associated with infant health benefits [1, 2]. One way that breastfeeding can affect infant health is by modulating the gut microbiome [3] as human milk harbours bacteria and viruses that potentially originate from the maternal gut, skin and infant oral cavity [4,5,6,7,8,9,10]. Once ingested by the human milk-fed infant, milk microbes can colonise the developing infant gut [11,12,13,14,15,16], affecting the infant’s immune system maturation and health [3, 17, 18]. However, study of the origin of human milk microbiota and milk as a source of microbes to the early infant gut has been hindered by methodological challenges.
Human milk has a low microbial biomass, containing ~105–106 bacterial cells per 1 ml milk [9, 11]. It also contains many nutrients, which complicate isolation of milk DNA [12, 19,20,21]. Frequently, over 90% of DNA isolated from milk is of human rather than microbial origin [12, 19,20,21]. Most milk microbiota studies to date have therefore relied on 16S rRNA gene amplicon sequencing, which detects bacteria and archaea but limits taxonomic classification and the ability to study microbial transfer [7]. Metagenomic sequencing would resolve these limitations but remains methodologically challenging for milk samples due to their low-biomass and high host contamination [12, 19,20,21].
In this study, we aimed to identify a sample preparation and sequencing strategy that increases taxonomic resolution and allows the study of bacterial transmission in low-biomass milk samples. Of the four DNA isolation kits we tested, the DNeasy PowerSoil Pro (PS) kit and the MagMAX Total Nucleic Acid Isolation (MX) kit provided consistent results, with low contamination. We also found only limited enrichment when testing two bacterial enrichment methods prior to metagenomic sequencing. Finally, we show that long-read 16S-ITS-23S rRNA gene amplicon sequencing did not improve taxonomic classification compared to 16S rRNA gene sequencing. Still, when analysing samples of 14 mother-infant pairs, 16S-ITS-23S rRNA gene amplicon sequencing provided sufficient resolution to detect significantly more frequent sharing of bacteria between human milk and infant oral cavity and faeces, but not with maternal faeces, in related mother-infant pairs compared to unrelated pairs.
Methods
Samples
Method test samples
Human milk test samples were collected from three mothers participating in the Dutch Lifelines NEXT birth cohort [22] at 1.5 (Milk-1), 2 (Milk-2) and 6 (Milk-3) months postpartum. Milk-1 and Milk-3 were collected in three pumping sessions, with samples pooled and homogenised by pipetting up and down for 5 min. Aliquoted (pooled) milk was stored at −20 °C until use. As positive controls, we used two mock communities, the ZymoBiomics Microbial Community Standard (“bacterial mock community”, D6300, Zymo Research, Irvine, California, USA) and the ZymoBiomics Microbial Community DNA Standard (“DNA mock community”, D6305, Zymo Research), according to the manufacturer’s instructions (version 1.1.5 for D6300, version 1.1.6 for D6305). The expected composition of both mock communities (“theoretical mock”) is shown in Table S1. DEPC-treated water (46-2224, Invitrogen, Waltham, Massachusetts, USA) filtered through sterile 0.45 µm cellulose acetate filters (514-0063, VWR, Radnor, Pennsylvania, USA) was used for negative controls.
Pilot samples
Milk (n = 14), maternal faeces (n = 14), infant faeces (n = 14) and infant oral swabs (n = 14) were collected from 14 Lifelines NEXT mother-infant pairs at 3 months postpartum. Mothers were asked to collect milk, without specific cleaning of the breast tissue and using their usual breast pump, from one, preferably the right, breast from the second feeding after midnight and at least 2 h after the last feed from that breast. Mothers expressed a mean (range) of 90 (20–200) ml milk. They homogenised the milk by gentle shaking and aliquoted milk into cryotubes (122279, Greiner Bio-One, Kremsmünster, Austria) using plastic dropper pipettes (H10041, MLS, Menen, Belgium). Faecal samples were collected and aliquoted into cryotubes using plastic dropper pipettes. Milk and faecal samples were stored at −20 °C in home freezers. Infant oral samples were collected during a home visit by a research nurse who gently moved a cotton swab (479165, Paul Hartmann, Heidenheim, Germany) along the inside of the infant’s cheeks and under the infant’s tongue until the swab was fully wetted by saliva. The swabs were placed in cryotubes containing 500 µl PowerBead Solution (12955-4-BS, Qiagen, Venlo, Netherlands). Research nurses then transported all samples to the laboratory in transportable freezers and samples were stored at −20 °C (short-term) or −80 °C (long-term) until analysis. Mean (range) total storage time for milk was 17 (14–22) months and for other sample types 27 (24–30) months.
DNA isolation of method test samples
DNA was isolated from method test samples using four DNA isolation kits: the PS kit (47014, Qiagen), the MX kit (AM1840, Thermo Fisher, Waltham, Massachusetts, USA), the Milk Bacterial DNA Isolation (MD) kit (21550, Norgen Biotek, Thorold, Canada) and the QIAamp Fast DNA Stool Mini (FS) kit (51604, Qiagen). DNA was isolated from milk and negative controls in triplicates (PS kit) or duplicates (MX, MD and FS kits) using 3.5 ml sample input. DNA from the bacterial mock community was isolated once per kit using 75 µl of the bacterial mock community sample mixed with 3.425 ml DEPC-treated water. Detailed isolation procedures are described below. All DNA eluates were stored at −20 °C until further processing.
DNeasy PowerSoil Pro kit
Samples were centrifuged at 13,000 × g at 4 °C for 15 min. Fat (for milk samples) and supernatants were removed. Pellets were resuspended in 800 µl Solution CD1, moved to PowerBead Pro tubes and vortexed briefly. To lyse cells, samples were incubated at 65 °C for 10 min and bead-beat at 5000 rpm at 4 °C for 45 s using a Precellys Evolution Homogenizer (Bertin Instruments, Rockville, Maryland, USA). Afterwards, samples were centrifuged at 15,000 × g at 4 °C for 1 min. Finally, DNA was automatically extracted from 600 µl supernatant using the ‘DNeasy PowerSoil Pro Kit with Inhibitor Removal Technology Protocol’ (version May 2018) with an elution volume of 50 µl on a Qiacube (Qiagen).
MagMAX Total Nucleic Acid Isolation kit
Bead Tubes were briefly centrifuged, and 235 µl Lysis/Binding Solution was added. Samples were centrifuged at 13 000 × g at 4 °C for 15 min. Fat (for milk samples) and supernatants were removed. Pellets were resuspended in 175 µl phosphate-buffered saline, and samples were transferred to Bead Tubes. Samples were bead-beat twice for 90 s at 6800 rpm at 4 °C using a Precellys Evolution Homogenizer, with a 5 min break between. DNA was then isolated manually according to the manufacturer’s instructions (version P/N 4385118 Revision C from July 2008), starting from protocol step B.II.3, and eluted in 30 µl Elution buffer.
Milk Bacterial DNA Isolation kit
Samples were centrifuged at 20,000 × g at 4 °C for 2 min. Fat (for milk samples) and supernatants were removed. Pellets were resuspended in 100 µl Resuspension Solution A with lysozyme and moved to fresh tubes. DNA was then isolated according to the manufacturer’s instructions (version PI21550-4 from 2015), starting from protocol step 1B.d, and DNA was eluted in 50 µl Elution Buffer B.
QIAamp Fast DNA Stool Mini kit
Samples were centrifuged at 13,000 × g at 4 °C for 15 min. Fat (for milk samples) and supernatants were removed. The InhibitEX buffer was heated to 42 °C in a water bath for 10–15 min until crystals were dissolved. Pellets were then resuspended in 500 µl InhibitEX buffer and vortexed at maximum speed for 1 min. Samples were moved to fresh tubes, incubated at 95 °C for 10 min on a heating block (5355, Thermomixer Comfort, Eppendorf, Hamburg, Germany) shaking at 1000 rpm and then vortexed at maximum speed for 15 s. Samples were then centrifuged at 16,300 × g for 1 min, and 200 µl supernatant was used for automatic DNA extraction on a Qiacube using the ‘QIAamp Fast DNA Stool Mini Protocol for Isolation of DNA from Stool for Pathogen Detection’ (version March 2014) with an elution volume of 100 µl.
DNA isolation of pilot samples
DNA was isolated once from each pilot sample using the PS kit. For milk samples, DNA was isolated from 3.5 ml milk, as described above. For faeces and oral swabs, we adapted the isolation procedure prior to the heat incubation and bacterial lysis steps described above. Approximately 0.2–0.5 g faeces and 800 µl Solution CD1 were added to PowerBead Pro tubes and vortexed briefly. Oral swabs in PowerBead solution were vigorously vortexed for 3 min, swabs were removed and ~400 µl solution per sample was transferred to PowerBead tubes. Solution CD1 was added to obtain a final volume of 800 µl, and samples were briefly vortexed before continuing the protocol from the heat incubation step. DNA was stored at −20 °C until further processing.
DNA quantification
To measure DNA concentrations, the Qubit 1X dsDNA HS Assay Kit (Q33230, Invitrogen) was used according to the manufacturer’s instructions (version MAN0017455 Rev. C.0).
Quantitative PCR
To quantify bacteria, we used quantitative PCR (qPCR) of the 16S rRNA gene as described in the Supplementary Methods.
16S rRNA gene sequencing and data processing
DNA was used for library preparation and 16S rRNA gene sequencing at the Institute of Clinical Molecular Biology at the Christian-Albrecht University of Kiel, Germany. The V3–V4 region of the bacterial 16S rRNA gene was amplified using 3 µl DNA and the 357F (5′-CCTACGGGAGGCAGCAG-3′) and 806R (5′-GGACTACHVGGGTWTCTAAT-3′) primer pair according to the dual-barcoding approach from Kozich et al. [23]. PCR products were checked using gel electrophoresis, normalised using the SequalPrep Normalisation Plate Kit (A1051001, Thermo Fisher) and pooled using 1.85 ng PCR product per sample. Pooled libraries were sequenced on Illumina MiSeq v3 systems (2 × 300 bp paired-end sequencing, Illumina, San Diego, California, USA).
After sequencing, samples were demultiplexed based on zero mismatches in barcode sequences. Adapters and primers were removed from sequencing reads using Cutadapt (version 2.6). Reads were quality-trimmed with Trimmomatic (version 0.39, parameters: paired-end mode, SLIDINGWINDOW:4:25 and MINLEN:50). Read quality was investigated with FastQC (version 0.11.5) and MultiQC (version 1.7). A modified version of the DADA2 pipeline (version 1.16, https://benjjneb.github.io/dada2/tutorial.html) was used to construct amplicon sequence variant (ASV) tables. Briefly, trimmed reads with at least 160 bp length were used to learn error rates, denoise and generate ASV tables. Chimaeras were removed, and only ASVs with 400–431 bp length were kept. Taxonomies were assigned to ASVs using the SILVA (version 138.1) and FANGORN databases (GTDB Full 207 database, https://melbourne.figshare.com/articles/dataset/Fangorn_rrn_Database/20086916 [24]). ASVs assigned as chloroplasts and mitochondria were removed from the SILVA dataset. Relative abundances were calculated for both the SILVA and FANGORN datasets.
16S-ITS-23S rRNA gene sequencing and data processing
DNA was prepared for 16S-ITS-23S rRNA gene sequencing using the Shoreline WaveTM StrainIDTM kit (WAVESID-A, Shoreline Biome, Farmington, Connecticut, USA) according to the manufacturer’s instructions (version USM-050-0). Library preparation and sequencing were performed by BaseClear (Leiden, Netherlands). Briefly, samples were marked with unique barcodes, and the ~2500 bp-long 16S-ITS-23S rRNA gene fragment was amplified in a single-step PCR. Amplicons were pooled (5 µl PCR product per sample), used for SMRTbell® library preparation (100-938-900, PacBio, Menlo Park, California, USA) and sequenced on one Sequel II SMRT Cell on a PacBio Sequel System (PacBio), generating circular consensus reads.
Samples were demultiplexed and primers were removed from sequencing reads using SBanalyzer software (version 3.1-2, Shoreline Biome). The DADA2 pipeline was adapted for 16S-ITS-23S rRNA gene sequencing data. Reads were filtered and trimmed with modified settings (minQ=2, maxEE=5, trimLeft=20, trimRight=0, minLen=1900, maxLen=3000), dereplicated and used to learn error rates, denoise and construct the ASV table. Chimaeras were removed. Taxonomies were assigned using the FANGORN database [24], and relative abundances were calculated.
Bacterial enrichment and shotgun metagenomic sequencing
We used two bacterial enrichment methods, hypotonic lysis and benzonase treatment (HL-Benz) [25] and hypotonic lysis and propidium monoazide treatment (HL-PMA) [26] on the method test samples, followed by DNA isolation and shotgun metagenomic sequencing. A detailed description for this can be found in the Supplementary Methods.
Statistical analyses
We compared categorical and continuous parameters using Fisher’s exact tests and Mann–Whitney U tests, respectively. To compare continuous parameters between more than two categories, we used Kruskal–Wallis tests with Dunn’s post hoc tests. Spearman’s rank correlation was used to investigate correlation between continuous variables.
Relative bacterial abundances were centred log-ratio transformed and compared between methods using linear regression models with correction for milk sample as a fixed effect. We used the vegdist function from R package vegan to calculate Aitchison distances between samples based on the relative abundances of classified genera or (un)classified species. We used these distances to cluster mock communities or milk and negative control samples and to compare bacterial composition between samples using ‘Permutational Multivariate Analyses of Variances Using Distance Matrices’ with the adonis function.
To explore sharing of bacterial ASVs between human milk and maternal and infant samples, we selected ASVs belonging to genera present in at least two related sample pairs and aligned them with multiple sequence alignment (MSA) using R package msa with the ClustalW method. Based on the MSA, we generated a Hamming distance matrix for the ASVs of each genus using snp-dists (version 0.7.0, https://github.com/tseemann/snp-dists). This matrix was used to detect potential ASV-sharing events (presence of identical ASVs in a sample pair), and their frequency was compared between related and unrelated sample pairs using Fisher’s exact tests. In addition, we generated phylogenetic trees for Streptococcus ASVs using RAxML-NG (version 1.3, https://github.com/amkozlov/raxml-ng) with the GTR+G model and 1000 bootstrap replicates.
All statistical analyses were performed and all plots were created in RStudio (version 1.1.463) or R (version 4.2.2) using the R packages ape [27], cowplot [28], dplyr [29], FSA [30], ggplot2 [31], ggdendro [32], ggpubr [33], msa [34], plyr [35], reshape [36], phangorn [37, 38], seqinr [39], stringr [40], tidyr [41] and vegan [42]. P values were adjusted using the Benjamini–Hochberg method. A false discovery rate (FDR) < 0.05 was considered statistically significant.
Results
Study design
We tested four DNA isolation methods, two bacterial enrichment methods and three sequencing methods on test samples: bacterial and DNA mock communities, a negative control and three human milk samples. We then applied one of the best-performing DNA isolation methods (the PS kit) and two sequencing methods (16S and 16S-ITS-23S rRNA gene sequencing) to samples from 14 mother-infant pairs to investigate their ability to detect bacterial sharing between human milk, maternal and infant faeces and infant oral swabs.
Comparison of DNA isolation methods
To find an efficient, reliable method for milk DNA isolation, we tested the FS, MD, PS and MX kits on method test samples (Fig. 1A). DNA isolation was followed by 16S rRNA gene sequencing and taxonomic classification with the SILVA database.
Mock communities and negative controls
The DNA yield from the bacterial mock community was highest with the MD kit, followed by the PS, MX and FS kits (Table 1A). The highest sequencing read count was obtained with the FS kit, followed by the MD, MX and PS kits (Table 1A). We compared the bacterial composition of isolated bacterial mock communities to both the theoretical mock composition (Table S1) and to results for a DNA mock community. All four DNA isolation methods detected all eight bacterial genera expected in the mock community, but their relative abundances varied between methods (Fig. 1B). The composition of the bacterial mock communities isolated with the MX and PS kits were closest to the theoretical composition (Fig. 1B), whereas the mock communities isolated with the MD and FS kits clustered further apart and showed increases in Bacillus and Limosilactobacillus (MD kit) or Escherichia-Shigella (FS kit) and decreases in Staphylococcus (both kits). Species-level characterisation of bacteria was only possible for 0.5–3.4% of the mock sequencing reads (Table 1A, Fig. S1). In the bacterial mock community isolated with the FS kit, we unexpectedly detected traces of Bifidobacterium (<0.02% relative abundance).
All negative controls had DNA concentrations below the detection limit of the method and low read counts (median (range): 742 (0–4 890) reads) (Table 1B). Negative controls isolated with the MX kit had the lowest median number of reads, followed by the PS, FS and MD kits (Table 1B). Cutibacterium, Enhydrobacter, Escherichia-Shigella and Pelomonas were commonly detected in isolation-negative controls (Fig. S2, Table S2).
Milk samples
The median DNA yield from milk samples was significantly higher with the PS and MX kits compared to the MD and FS kits (FDR=0.001, Table 1C). The MD kit resulted in significantly higher read counts than the FS and PS kits, while the MX kit led to significantly higher read counts than the PS kit (FDR < 0.05, Table 1C). The median number of bacterial genera detected in milk was similar between the isolation methods (Table 1C).
Nine bacterial genera were present in at least 70% of milk samples: Acinetobacter, Corynebacterium, Cutibacterium, Enhydrobacter, Gemella, Rothia, Staphylococcus, Streptococcus and Veillonella (Fig. 1C, Table S3). The most abundant bacterial genera detected in milk samples (≥15% relative abundance in ≥1 milk sample) were Staphylococcus (all kits), Streptococcus (all kits), Cutibacterium (FS and MX kits), Corynebacterium (FS and PS kits), Gemella (MD kit) and Exiguobacterium (FS kit) (Fig. 1C).
The overall milk bacterial composition (beta diversity) differed significantly between mothers (R2 = 0.37, FDR = 0.002, Fig. 1D). The DNA isolation method also significantly affected milk beta diversity (R2 = 0.19, FDR = 0.01, Fig. 1D), and the relative abundances of five of the nine most-prevalent genera in milk differed significantly between the DNA isolation methods (Corynebacterium, Cutibacterium, Rothia, Staphylococcus and Streptococcus, all FDR < 0.05, Table S3). Importantly, the bacterial composition of milk differed significantly from that of negative controls (R2 = 0.17, FDR = 0.002, Fig. 1D). In addition, the median number of sequencing reads for all negative controls (742 reads) was significantly lower than for all milk samples (37,093 reads, p < 0.001). Bacteria detected in milk and negative controls are shown in Table S4. Cutibacterium was detected in milk and corresponding negative controls by all four kits and Enhydrobacter by three kits (Table S4).
Next, we quantified the extent of potential contamination in milk sample results. An ASV detected in both a milk sample and a corresponding negative control was considered a potential contaminant. When comparing DNA isolation methods, we saw no significant difference in the percentage of sequencing reads that contained a potentially contaminating ASV (Table 1C). However, the percentage of reads corresponding to ASVs in negative controls varied the most for milk samples isolated with the FS and MX kits and the least for milk samples isolated with the MD and PS kits (Table 1C). By comparing the negative controls from isolation and library preparation, we concluded that contamination was frequently introduced during DNA isolation (range 0.1–39.1% across DNA isolation methods) but very rare during library preparation (range 0–0.7%) (Table S4). There was also a significant negative correlation between DNA yield and the percentage of contamination (Spearman’s rho = −0.56, FDR = 0.005) but not between the number of milk sequencing reads and the percentage of contamination (FDR = 0.98).
Based on these analyses, we concluded that the PS and MX kits were the most suitable for milk DNA isolation because they represented the mock community well, led to little contamination and produced similar 16S rRNA gene sequencing results for milk sample replicates. We decided to continue our following tests with the PS kit for three reasons: (1) the maximum percentage of sequencing reads shared with negative controls was smaller for the PS than for the MX kit, (2) we had DNA isolation robots for the PS kit available in our lab and (3) we experienced difficulties with separating magnetic beads from milk DNA eluates in the elution step of the MX kit. Before continuing with the next tests, we further showed that the PS kit detected a median of 105 16S rRNA gene copy numbers per 1 ml human milk (Table 1C), which was similar to other studies [9, 11].
Bacterial enrichment and metagenomic sequencing
We next aimed to enrich milk bacterial DNA prior to metagenomic sequencing. Here we tried two bacterial enrichment methods, HL-Benz and HL-PMA, combined with metagenomic sequencing on the test samples (Table S5). Library preparation or sequencing was not successful for any samples prepared with the HL-Benz method and was only successful for four of nine milk samples and the bacterial mock community prepared with the HL-PMA method. For the nine non-enriched and five HL-PMA-enriched milk samples that could be sequenced, at least 98.8% of the sequencing reads were of human origin and only a median of <1% was of microbial origin (Table S5). We therefore concluded that these bacterial enrichment methods did not substantially decrease the proportion of human reads and thus they were not efficient for use with human milk samples in combination with metagenomic sequencing.
Comparison of 16S and 16S-ITS-23S rRNA gene sequencing
We next aimed to improve taxonomic resolution of milk microbiota profiling. For this, we isolated test sample DNA with the PS kit and subjected it to both 16S-ITS-23S and 16S rRNA gene sequencing (Fig. 2A). To compare results between the sequencing methods, all bacteria were classified using the FANGORN database, which contains full-length 16S-ITS-23S rRNA operon sequences [24].
Mock communities and negative controls
16S rRNA gene sequencing identified all eight genera in the mock community, and the bacterial composition closely resembled the theoretical composition (Table 2A, Fig. 2B). When using 16S-ITS-23S rRNA gene sequencing, only five of the eight genera of the mock community were detected (Bacillus, Enterococcus, Listeria, Salmonella and Staphylococcus), while Escherichia, Limosilactobacillus and Pseudomonas were missing (Table 2A, Fig. 2B). Species-level characterisation of bacteria was improved with the FANGORN database, though Bacillus subtilis and Escherichia coli (if detected) were misclassified, making species-level classification unreliable (Fig. S3). Negative controls sequenced with 16S-ITS-23S rRNA gene sequencing had a maximum of seven reads (Table 2B, Table S6).
Milk samples
The number of sequencing reads obtained for milk samples did not differ between 16S-ITS-23S and 16S rRNA gene sequencing (Table 2C). On average, four more genera were detected in milk with 16S than with 16S-ITS-23S rRNA gene sequencing (FDR=0.046, Table 2C). Seven bacterial genera (Corynebacterium, Cutibacterium, Gemella, Rothia, Staphylococcus, Streptococcus and Veillonella) were present in at least 70% of the milk samples profiled by both 16S and 16S-ITS-23S rRNA gene sequencing (Table S7). The most abundant genera (≥15% relative abundance in ≥1 milk sample) were Staphylococcus and Streptococcus (both methods), Cutibacterium and Gemella (16S-ITS-23S rRNA gene sequencing) and Corynebacterium (16S rRNA gene sequencing) (Fig. 2C). The overall bacterial composition of milk samples differed significantly between mothers (R2 = 0.63, FDR=0.002) and from that of negative controls (R2=0.30, FDR=0.002, Fig. 2D). The sequencing method had no significant effect on milk beta diversity (FDR=0.21, Fig. 2D). Higher relative abundances of Corynebacterium and Staphylococcus and lower relative abundances of Streptococcus and Veillonella were detected with 16S-ITS-23S compared to 16S rRNA gene sequencing (all FDR=0.01, Table S7).
Bacterial sharing in mother-infant pairs
Finally, we investigated the ability of 16S and 16S-ITS-23S rRNA gene sequencing to detect sharing of bacterial ASVs between human milk and infant oral cavity, infant faeces and maternal faeces (Fig. 3A). The total bacterial load (16S rRNA gene copy numbers per 1 ng DNA) differed significantly between sample types, with human milk having the lowest and maternal faeces having the highest number of 16S rRNA gene copies (Fig. S4). Also, the overall bacterial composition differed significantly between sample types (R2 = 0.27, FDR=0.002) and sequencing methods (R2 = 0.03, FDR=0.002, Fig. 3B, Fig. S5). When investigating the effect of the sequencing methods separately for each sample type, the method used significantly affected the infant oral (R2 = 0.15, FDR=0.002) and maternal faecal bacterial composition (R2 = 0.28, FDR=0.002) but not the milk (FDR=0.16) and infant faecal bacterial composition (FDR=0.59). 16S rRNA gene sequencing identified significantly more genera in each sample type than 16S-ITS-23S rRNA gene sequencing (Fig. S6). Interestingly, the bacterial genera missing in the mock community sequenced with 16S-ITS-23S rRNA gene sequencing (Fig. 2B), Escherichia, Limosilactobacillus and Pseudomonas, were detected by 16S-ITS-23S rRNA gene sequencing in some milk and faecal samples (Fig. S5, Table S8).
The number of genera detected in both human milk and another sample type from related mother-infant pairs (potentially shared genera) was consistently higher when using 16S than when using 16S-ITS-23S rRNA gene sequencing (21 vs. 10 genera for human milk-infant oral cavity, 14 vs. 5 genera for human milk-infant faeces, 5 genera vs. 1 genus for human milk-maternal faeces, Fig. S7). We focused on investigating bacterial sharing based on exact ASV-matching for potentially shared genera detected in at least two sample pairs with both sequencing methods (Fig. 3C, Fig. S7). For these potentially shared genera, the percentage of unrelated sample pairs that shared at least one ASV of the investigated genus was significantly higher with 16S (range 10–93%) than with 16S-ITS-23S rRNA gene sequencing (range 0–3%, all FDR<0.05, Fig. 3C, Table S9). With 16S-ITS-23S rRNA gene sequencing, significantly more related human milk-infant oral swab sample pairs shared ASVs belonging to Gemella, Pauljensenia, Rothia, Streptococcus and Veillonella genera when compared to unrelated pairs (Fig. 3C). 16S rRNA gene sequencing also detected significantly more sharing of Pauljensenia and Rothia, but not of Gemella, Streptococcus and Veillonella, in related human milk-infant oral swab pairs compared to unrelated pairs (Fig. 3C).
When comparing human milk and infant faeces, 16S-ITS-23S rRNA gene sequencing detected significantly more sharing of Staphylococcus and Streptococcus between related pairs compared to unrelated pairs, whereas 16S rRNA gene sequencing did not (Fig. 3C). 16S-ITS-23S rRNA gene sequencing detected no bacterial sharing between maternal faeces and human milk (Fig. 3C). By contrast, 16S rRNA gene sequencing found sharing of Streptococcus, although there was no significant difference between related and unrelated pairs (Fig. 3C). As Streptococcus was identified as potentially shared across all four sample types, we constructed phylogenetic trees for Streptococcus based on both 16S and 16S-ITS-23S rRNA gene sequencing data (Fig. 3D, Fig. S8). While the resolution of 16S rRNA gene sequencing data was too low to separate bacterial strains, hindering the study of within-sample and within-family clustering (Fig. S8), 16S-ITS-23S rRNA gene sequencing provided higher phylogenetic resolution and richer ramification, which revealed higher ASV similarity within individuals and families compared to unrelated individuals (Fig. 3D).
In summary, 16S-ITS-23S rRNA gene sequencing did not provide a complete representation of bacterial communities or reliable species-level classifications, but it was much less affected by potential contamination. 16S-ITS-23S rRNA gene sequencing was able to detect sharing of human milk bacteria with the infant’s oral cavity and faeces, but not with the maternal faeces, in more related pairs than unrelated mother-infant pairs.
Discussion
DNA isolation methods affect microbiota results
The DNA isolation methods we tested, or previous versions of them, have been used to isolate DNA for milk microbiota studies by other groups (FS kit [43,44,45,46,47], MD kit [48], PS kit [15, 21, 49]) or in our lab (MX kit). A good DNA isolation method should lyse all the bacteria in a sample, introduce minimal bias to the relative community composition, provide sufficient DNA for downstream analyses and not introduce contaminants. Previous studies describe that DNA isolation methods affect microbiota results and that mechanical lysis is important for efficient DNA isolation and accurate profiling of gram-positive bacteria [50,51,52,53,54]. In concordance with this, our 16S rRNA gene sequencing results differed between DNA isolation methods. Differences in isolation procedures, such as different sample centrifugation settings, lysis methods, and DNA capturing techniques, likely influenced DNA yield and microbiota results. The DNA isolation kits with bead-beating, the PS and MX kits, gave the closest representation of the expected mock community. These kits also produced the highest milk DNA yield, suggesting that they lysed bacteria more effectively. Importantly, we re-confirmed that a higher milk DNA yield is associated with less contamination [52].
Assessing and minimising contamination is particularly important for low-biomass samples [55]. Our results emphasise this as three of the four high-biomass bacterial mock communities we isolated had no 16S rRNA gene sequencing reads belonging to unexpected bacteria, whereas the low-biomass negative control and milk samples were potentially contaminated. Contaminants likely originated from DNA isolation kits as (1) isolation-negative controls had more contamination than library preparation negative controls, (2) all samples were processed in the same environment and (3) the extent and nature of contamination differed between kits. Indeed, the potentially contaminating bacteria we identified, e.g. Cutibacterium (formerly Propionibacterium), have previously been suggested as kit contaminants [55]. While there are various approaches for identifying and removing potential contaminants from sequencing data [55], it remains difficult to balance removal of potential contaminant signals against retaining true signals. It is therefore important to choose a DNA isolation method that introduces as little contamination as possible. Milk samples isolated with the PS kit shared only a median of 3% and a maximum of 10% of sequencing reads with corresponding negative controls, suggesting they were less affected by potential contamination than samples isolated with other kits.
Bacterial enrichment methods did not improve milk metagenomic sequencing data
The HL-PMA and HL-Benz enrichment methods have previously been shown to decrease human metagenomic sequencing read-depth and increase the microbial metagenomic sequencing read-depth in low-biomass saliva (HL-PMA) [26] and sputum samples (HL-Benz) [25]. Both methods rely on hypotonic lysis of human cells and depletion of liberated DNA to enrich bacterial DNA. Unfortunately, when applied to human milk, our enriched samples either failed sequencing or still had ≥98.8% human reads. One previous study that investigated the performance of bacterial enrichment methods in milk, including HL-PMA, described it as yielding insufficient DNA for further analysis [56]. Human milk is rich in nutrients, which potentially interfere with DNA isolation [12], and other human milk components also likely hinder bacterial enrichment. An enrichment method that effectively reduces human host contamination and enables the use of metagenomic sequencing on human milk thus remains to be found.
Sequencing methods affect microbiota results
As sequencing of 16S-ITS-23S rRNA gene amplicons has enabled tracking of bacterial strains in infant guts [57], we tested whether 16S-ITS-23S rRNA gene sequencing could improve bacterial classification and detect bacterial sharing events in low-biomass human milk samples. Our results from the bacterial mock communities show that, although bacteria were often assigned on the species level, this classification was not always correct. As 16S-ITS-23S rRNA gene sequencing detected more ASV-sharing in related mother-infant pairs compared to unrelated pairs, we suggest that the ~2500 bp-long 16S-ITS-23S rRNA gene amplicons contain sufficient information to provide taxonomic information below genus level. It may be that high variation in the ITS region complicated alignment to reference databases and the databases with 16S-ITS-23S rRNA gene sequences require further curation.
16S-ITS-23S rRNA gene sequencing missed the Escherichia, Limosilactobacillus and Pseudomonas in the bacterial mock community. However, importantly, they were detected in the bacterial mock community with 16S rRNA gene sequencing and in milk and faeces with 16S-ITS-23S rRNA gene sequencing, suggesting that their DNA was isolated from the bacterial mock community but not successfully amplified during 16S-ITS-23S library preparation or that sequencing reads were not assigned their respective taxonomy during bioinformatic processing. The primers chosen, sequencing method and database can all affect microbiota results [58, 59], and they also likely affect 16S-ITS-23S rRNA gene sequencing results. Indeed, compared to 16S rRNA gene sequencing, 16S-ITS-23S rRNA gene sequencing changed the microbiota results of infant oral and maternal faecal samples, although not those of milk and infant faeces. It is possible that the sample size was too small to detect differences between sequencing methods for the latter. On a positive note, 16S-ITS-23S rRNA gene sequencing showed remarkably little contamination. Ultimately, it remains important to assess and report a technique’s limitations and the bias it might introduce.
16S-ITS-23S rRNA gene sequencing can detect bacterial transmission
Even though studies based on 16S rRNA gene sequencing report sharing of milk bacteria with other maternal and infant body sites [5, 6, 9, 13, 16, 60,61,62,63], 16S rRNA gene sequencing is limited in its ability to study bacterial transmission. Both full-length 16S rRNA gene amplicon sequencing and targeted sequencing of the ITS region between the 16S and 23S rRNA genes improved resolution in previous studies [64, 65]. As expected, the combined 16S-ITS-23S rRNA gene sequencing approach could improve resolution for bacterial transmission studies by detecting more variation in the ~2500 bp-long 16S-ITS-23S rRNA gene amplicons compared to the ~450 bp-long 16S rRNA gene amplicons.
The origin of human milk bacteria is still an area of study. The maternal areolar skin and the infant oral cavity have been suggested to host bacteria that could reach the mammary gland via retrograde transfer [5, 7, 8], while the entero-mammary pathway hypothesis suggests that maternal gut bacteria can travel to the mammary gland and enter human milk [4, 7, 8, 60, 66]. However, studies on human milk bacterial transfer have been complicated by difficulties with obtaining high-resolution milk bacterial profiles. In our study, using 16S-ITS-23S rRNA gene sequencing data, human milk shared Rothia, Streptococcus, Gemella, Pauljensenia and Veillonella with the infant’s oral cavity, supporting 16S rRNA sequencing-based studies that report sharing of oral bacteria between a mother’s milk and her infant’s oral cavity [16, 61]. Unfortunately, our approach did not allow to study the direction of a potential bacterial transfer. In contrast to small earlier studies supporting the entero-mammary pathway hypothesis [8, 60, 66], for example, by identifying shared Bifidobacterium strains in maternal faeces and milk using cultures and (reduced) metagenomic sequencing, we found no bacterial sharing between maternal faeces and milk. Even though 16S-ITS-23S rRNA gene sequencing detected Bifidobacterium genera in infant faeces, no Bifidobacterium was detected as potentially shared in our maternal faeces-milk pairs. Larger studies with high-resolution data are needed to better define the roles of different maternal and infant body sites in shaping the human milk bacterial community.
Next to human milk sharing bacteria with the infant oral cavity, we also detected sharing of human milk Streptococcus and Staphylococcus with infant faeces. This finding is in line with other publications suggesting transfer of bacteria between a mother’s milk and her infant’s gut [5, 6, 8,9,10, 13, 16, 60, 61, 63]. The 16S-ITS-23S rRNA gene sequencing technique reported here could help identify bacterial sharing between low-biomass human milk samples and infant body sites. Still, future studies are needed to further explore the role of human milk-derived bacteria in infant oral and gut bacterial colonisation. For example, the functions of shared bacteria remain unclear and, while a recent study showed that human milk harbours viable and non-viable bacteria [67], their effect on infants remains to be elucidated.
Limitations and strengths
This study had several limitations. The small sample size limited statistical testing and the ability to draw conclusions. We did not investigate the effect of sample storage on the results, however, we expect only limited effects of (long-term) cold storage on the non-viable microbiota based on previous studies [68,69,70,71,72,73]. In addition, fat was removed from milk prior to DNA isolation, which likely led to a loss of bacteria within the fat layer [53, 74]. The DNA isolation methods we tested did not distinguish between viable and non-viable bacteria. The 16S-ITS-23S rRNA gene sequencing method detected fewer bacteria compared to the 16S rRNA gene sequencing method, potentially missing (shared) bacteria in the different sample types. A major strength of this study is that we tested various methods on the same samples and applied the same methodology (DNA isolation, sequencing, bioinformatic processing) to the steps not under study. Mock and negative controls added during DNA isolation and sequencing allowed us to identify biases and potential contaminants. We also report how to adapt the PS kit for sample types other than milk. Finally, using long-read 16S-ITS-23S rather than short-read 16S rRNA gene sequencing enabled higher-resolution study of milk bacterial transmission.
Conclusion
In conclusion, DNA isolation with the PS or MX kit and 16S rRNA gene sequencing reliably profiled the milk microbiota and, though our analysis was limited to 14 mother-infant pairs, long-read 16S-ITS-23S rRNA gene sequencing provided evidence for bacterial transfer from breastfeeding mothers to infants, but not from the maternal gut to breast milk. Our findings pave the way for large milk microbiota studies and 16S-ITS-23S rRNA gene sequencing can help to study bacterial transmission in low-biomass (human milk) samples.
Data availability
The sequencing data has been uploaded to the European Genome-phenome Archive (EGAS00001007592). Data files are available as part of the supplementary material. Scripts to process and analyse data can be found on GitHub (https://github.com/GRONINGEN-MICROBIOME-CENTRE/MilkMethods2023).
References
Victora CG, Bahl R, Barros AJD, França GVA, Horton S, Krasevec J, et al. Breastfeeding in the 21st century: Epidemiology, mechanisms, and lifelong effect. Lancet. 2016;387:475–90.
Bardanzellu F, Fanos V, Strigini FAL, Artini PG, Peroni DG. Human breast milk: Exploring the linking ring among emerging components. Front Pediatr. 2018;6:215.
Hoffman DJ, Reynolds RM, Hardy DB. Developmental origins of health and disease: Current knowledge and potential mechanisms. Nutr Rev. 2017;75:951–70.
Rodríguez JM. The origin of human milk bacteria: Is there a bacterial entero-mammary pathway during late pregnancy and lactation? Adv Nutr. 2014;5:779–84.
Biagi E, Quercia S, Aceti A, Beghetti I, Rampelli S, Turroni S, et al. The bacterial ecosystem of mother’s milk and infant’s mouth and gut. Front Microbiol. 2017;8:1214.
Fehr K, Moossavi S, Sbihi H, Boutin RCT, Bode L, Robertson B, et al. Breastmilk feeding practices are associated with the co-occurrence of bacteria in mothers’ milk and the infant gut: The CHILD cohort study. Cell Host Microbe. 2020;28:1–13.
Fernández L, Pannaraj PS, Rautava S, Rodríguez JM. The microbiota of the human mammary ecosystem. Front Cell Infect Microbiol. 2020;10:586667.
Kordy K, Gaufin T, Mwangi M, Li F, Cerini C, Lee DJ, et al. Contributions to human breast milk microbiome and enteromammary transfer of Bifidobacterium breve. PLoS ONE. 2020;15:e0219633.
Laursen MF, Pekmez CT, Larsson MW, Lind MV, Yonemitsu C, Larnkjær A, et al. Maternal milk microbiota and oligosaccharides contribute to the infant gut microbiota assembly. ISME Commun. 2021;1:21.
Qi C, Zhou J, Tu H, Tu R, Chang H, Chen J, et al. Lactation-dependent vertical transmission of natural probiotics from the mother to the infant gut through breast milk. Food Funct. 2022;13:304.
Boix-Amorós A, Collado MC, Mira A. Relationship between milk microbiota, bacterial load, macronutrients, and human cells during lactation. Front Microbiol. 2016;7:492.
Asnicar F, Manara S, Zolfo M, Truong DT, Scholz M, Armanini F, et al. Studying vertical microbiome transmission from mothers to infants by strain-level metagenomic profiling. mSystems. 2017;2:e00164–16.
Pannaraj PS, Li F, Cerini C, Bender JM, Yang S, Rollie A, et al. Association between breast milk bacterial communities and establishment and development of the infant gut microbiome. JAMA Pediatr. 2017;171:647–54.
Pannaraj PS, Ly M, Cerini C, Saavedra M, Aldrovandi GM, Saboory AA, et al. Shared and distinct features of human milk and infant stool viromes. Front Microbiol. 2018;9:1162.
Maqsood R, Reus JB, Wu LI, Holland LA, Nduati R, Mbori-Ngacha D, et al. Breast milk virome and bacterial microbiome resilience in Kenyan women living with HIV. mSystems. 2021;6:e01079–20.
Bogaert D, van Beveren GJ, de Koff EM, Lusarreta Parga P, Balcazar Lopez CE, Koppensteiner L, et al. Mother-to-infant microbiota transmission and infant microbiota development across multiple body sites. Cell Host Microbe. 2023;31:447–60.
Olin A, Henckel E, Chen Y, Lakshmikanth T, Pou C, Mikes J, et al. Stereotypic immune system development in newborn children. Cell. 2018;174:1277–92.
Henrick BM, Rodriguez L, Lakshmikanth T, Pou C, Henckel E, Arzoomand A, et al. Bifidobacteria-mediated immune system imprinting early in life. Cell. 2021;184:1–15.
Jiménez E, de Andrés J, Manrique M, Pareja-Tobes P, Tobes R, Martínez-Blanch JF, et al. Metagenomic analysis of milk of healthy and mastitis-suffering women. J Hum Lact. 2015;31:406–15.
Pärnänen K, Karkman A, Hultman J, Lyra C, Bengtsson-Palme J, Larsson DGJ, et al. Maternal gut and breast milk microbiota affect infant gut antibiotic resistome and mobile genetic elements. Nat Commun. 2018;9:3891.
Ferretti P, Pasolli E, Tett A, Asnicar F, Gorfer V, Fedi S, et al. Mother-to-infant microbial transmission from different body sites shapes the developing infant gut microbiome. Cell Host Microbe. 2018;24:133–45.
Warmink-Perdijk WDB, Peters LL, Tigchelaar EF, Dekens JAM, Jankipersadsing SA, Zhernakova A, et al. Lifelines NEXT: A prospective birth cohort adding the next generation to the three‑generation Lifelines cohort study. Eur J Epidemiol. 2020;35:157–68.
Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl Environ Microbiol. 2013;79:5112–20.
Walsh CJ, Srinivas M, van Sinderen D, Cotter PD, Kenny JG. FANGORN: A quality-checked and publicly available database of full-length 16S-ITS-23S rRNA operon sequences. bioRxiv (2022). https://doi.org/10.1101/2022.10.04.509801.
Nelson MT, Pope CE, Marsh RL, Wolter DJ, Weiss EJ, Hager KR, et al. Human and extracellular DNA depletion for metagenomic analysis of complex clinical infection samples yields optimized viable microbiome profiles. Cell Rep. 2019;26:2227–40.
Marotz CA, Sanders JG, Zuniga C, Zaramela LS, Knight R, Zengler K. Improving saliva shotgun metagenomics by chemical host DNA depletion. Microbiome. 2018;6:42.
Paradis E, Schliep K. ape 5.0: An environment for modern phylogenetics and evolutionary analyses in R. Bioinformatics. 2018;35:526–8.
Wilke CO. cowplot: Streamlined plot theme and plot annotations for ‘ggplot2’. (2019). https://cran.r-project.org/package=cowplot.
Wickham H, François R, Henry L, Müller K dplyr: A grammar of data manipulation. (2020). https://cran.r-project.org/package=dplyr.
Ogle DH, Wheeler P, Dinno A FSA: Fisheries Stock Analysis. (2020). https://github.com/droglenc/FSA.
Wickham H. ggplot2: Elegant graphics for data analysis. New York:Springer-Verlag; 2016.
de Vries A, Ripley BD. ggdendro: Create dendrograms and tree diagrams using ‘ggplot2’. (2016). https://cran.r-project.org/package=ggdendro.
Kassambara A ggpubr: ‘ggplot2’ based publication ready plots. (2020). https://cran.r-project.org/package=ggpubr.
Bodenhofer U, Bonatesta E, Horejs-Kainrath C, Hochreiter S. msa: An R package for multiple sequence alignment. Bioinformatics. 2015;31:3997–9.
Wickham H. The split-apply-combine strategy for data analysis. J Stat Softw. 2011;40:1–29.
Wickham H. Reshaping data with the reshape package. J Stat Softw. 2007;21:12.
Schliep K. phangorn: Phylogenetic analysis in R. Bioinformatics. 2011;27:592–3.
Schliep K, Potts AJ, Morrison DA, Grimm GW. Intertwining phylogenetic trees and networks. Methods Ecol Evol. 2017;8:1212–20.
Charif D, Lobry J. SeqinR 1.0-2: A contributed package to the R project for statistical computing devoted to biological sequences retrieval and analysis. In: Bastolla U, Porto M, Roman H, Vendruscolo M, editors. Structural approaches to sequence evolution: Molecules, networks, populations. New York:Springer-Verlag; 2007. p. 207–32.
Wickham H stringr: Simple, consistent wrappers for common string operations. (2018). https://cran.r-project.org/package=stringr.
Wickham H, Henry L tidyr: Tidy messy data. (2020). https://cran.r-project.org/package=tidyr.
Oksanen J, Blanchet FG, Friendly M, Kindt R, Legendre P, McGlinn D et al. vegan: Community ecology package. (2018). https://cran.r-project.org/package=vegan.
Cabrera-Rubio R, Collado MC, Laitinen K, Salminen S, Isolauri E, Mira A. The human milk microbiome changes over lactation and is shaped by maternal weight and mode of delivery. Am J Clin Nutr. 2012;96:544–51.
Cabrera-Rubio R, Mira-Pascual L, Mira A, Collado MC. Impact of mode of delivery on the milk microbiota composition of healthy women. J Dev Orig Health Dis. 2016;7:54–60.
Urbaniak C, Angelini M, Gloor GB, Reid G. Human milk microbota profiles in relation to birthing method, gestation and infant gender. Microbiome. 2016;4:1.
Patel SH, Vaidya YH, Patel RJ, Pandit RJ, Joshi CG, Kunjadiya AP. Culture independent assessment of human milk microbial community in lactational mastitis. Sci Rep. 2017;7:7804.
Browne PD, Aparicio M, Alba C, Hechler C, Beijers R, Rodríguez JM, et al. Human milk microbiome and maternal postnatal psychosocial distress. Front Microbiol. 2019;10:2333.
Drago L, Toscano M, De Grandi R, Grossi E, Padovani EM, Peroni DG. Microbiota network and mathematic microbe mutualism in colostrum and mature milk collected in two different geographic areas: Italy versus Burundi. ISME J. 2017;11:875–84.
Seferovic MD, Mohammad M, Pace RM, Engevik M, Versalovic J, Bode L, et al. Maternal diet alters human milk oligosaccharide composition with implications for the milk metagenome. Sci Rep. 2020;10:22092.
Costea PI, Zeller G, Sunagawa S, Pelletier E, Alberti A, Levenez F, et al. Towards standards for human fecal sample processing in metagenomic studies. Nat Biotechnol. 2017;35:1069–76.
Teng F, Sankar Darveekaran Nair S, Zhu P, Li S, Huang S, Li X, et al. Impact of DNA extraction method and targeted 16S-rRNA hypervariable region on oral microbiota profiling. Sci Rep. 2018;8:16321.
Douglas CA, Ivey KL, Papanicolas LE, Best KP, Muhlhausler BS, Rogers GB. DNA extraction approaches substantially influence the assessment of the human breast milk microbiome. Sci Rep. 2020;10:123.
Ojo-Okunola A, Claassen-Weitz S, Mwaikono KS, Gardner-Lubbe S, Zar HJ, Nicol MP, et al. The influence of DNA extraction and lipid removal on human milk bacterial profiles. Methods Protoc. 2020;3:39.
Cheema AS, Stinson LF, Lai CT, Geddes DT, Payne MS. DNA extraction method influences human milk bacterial profiles. J Appl Microbiol. 2020;130:142–56.
Eisenhofer R, Minich JJ, Marotz C, Cooper A, Knight R, Weyrich LS. Contamination in low microbial biomass microbiome studies: Issues and recommendations. Trends Microbiol. 2019;27:105–17.
Yap M, Feehily C, Walsh CJ, Fenelon M, Murphy EF, McAuliffe FM, et al. Evaluation of methods for the reduction of contaminating host reads when performing shotgun metagenomic sequencing of the milk microbiome. Sci Rep. 2020;10:21665.
Graf J, Ledala N, Caimano MJ, Jackson E, Gratalo D, Fasulo D, et al. High-resolution differentiation of enteric bacteria in premature infant fecal microbiomes using a novel rRNA amplicon. MBio. 2021;12:e03656–20.
Abellan-Schneyder I, Matchado MS, Reitmeier S, Sommer A, Sewald Z, Baumbach J, et al. Primer, pipelines, parameters: Issues in 16S rRNA gene sequencing. mSphere. 2021;6:e01202–20.
Rozas M, Brillet F, Callewaert C, Paetzold B. MinIONTM Nanopore sequencing of skin microbiome 16S and 16S-23S rRNA gene amplicons. Front Cell Infect Microbiol. 2022;11:806476.
Jost T, Lacroix C, Braegger CP, Rochat F, Chassard C. Vertical mother-neonate transfer of maternal gut bacteria via breastfeeding. Environ Microbiol. 2014;16:2891–904.
Drell T, Štšepetova J, Simm J, Rull K, Aleksejeva A, Antson A, et al. The influence of different maternal microbial communities on the development of infant gut and oral microbiota. Sci Rep. 2017;7:9940.
Murphy K, Curley D, O’Callaghan TF, O’Shea C-A, Dempsey EM, O’Toole PW, et al. The composition of human milk and infant faecal microbiota over the first three months of life: A pilot study. Sci Rep. 2017;7:40597.
Corona-Cervantes K, García-González I, Villalobos-Flores LE, Hernández-Quiroz F, Piña-Escobedo A, Hoyo-Vadillo C, et al. Human milk microbiota associated with early colonization of the neonatal gut in Mexican newborns. PeerJ. 2020;8:e9205.
Schanche M, Avershina E, Dotterud C, Øien T, Storrø O, Johnsen R, et al. High-resolution analyses of overlap in the microbiota between mothers and their children. Curr Microbiol. 2015;71:283–90.
Johnson JS, Spakowicz DJ, Hong B-Y, Petersen LM, Demkowicz P, Chen L, et al. Evaluation of 16S rRNA gene sequencing for species and strain-level microbiome analysis. Nat Commun. 2019;10:5029.
Avershina E, Angell IL, Simpson M, Storrø O, Øien T, Johnsen R, et al. Low maternal microbiota sharing across gut, breast milk and vagina, as revealed by 16S rRNA gene and reduced metagenomic sequencing. Genes (Basel). 2018;9:231.
Stinson LF, Trevenen ML, Geddes DT. The viable microbiome of human milk differs from the metataxonomic profile. Nutrients. 2021;13:4445.
Ahrabi AF, Handa D, Codipilly CN, Shah S, Williams JE, McGuire MA, et al. Effects of extended freezer storage on the integrity of human milk. J Pediatr. 2016;177:140–3.
Lyons KE, Fouhy F, O’ Shea CA, Ryan CA, Dempsey EM, Ross RP, et al. Effect of storage, temperature, and extraction kit on the phylogenetic composition detected in the human milk microbiota. Microbiologyopen. 2021;10:e1127.
Stinson LF, Trevenen ML, Geddes DT. Effect of cold storage on the viable and total bacterial populations in human milk. Nutrients. 2022;14:1875.
Carroll IM, Ringel-Kulka T, Siddle JP, Klaenhammer TR, Ringel Y. Characterization of the fecal microbiota using high-throughput sequencing reveals a stable microbial community during storage. PLoS ONE. 2012;7:e46953.
Kia E, Mackenzie BW, Middleton D, Lau A, Waite DW, Lewis G, et al. Integrity of the human faecal microbiota following long-term sample storage. PLoS ONE. 2016;11:e0163666.
Furuhashi H, Takayasu L, Isshi K, Hara Y, Ono S, Kato M, et al. Effect of storage temperature and flash-freezing on salivary microbiota profiles based on 16S rRNA-targeted sequencing. Eur J Oral Sci. 2022;130:e12852.
Stinson LF, Ma J, Rea A, Dymock M, Geddes DT. Centrifugation does not remove bacteria from the fat fraction of human milk. Sci Rep. 2021;11:572.
Acknowledgements
This study was supported by funds from the Dutch Research Council (NWO-VIDI grant 016.178.056 to AZ, NWO-VICI grant VI.C.202.022 to JF, NWO gravitation grant Exposome-NL 024.004.017 to AK and AZ, NWO gravitation grant Netherlands Organ-on-Chip Initiative 024.003.001 to JF), the Dutch Heart Foundation (IN-CONTROL CVON2018-27 to JF), the European Research Council (ERC starting grant 715772 to AZ, ERC consolidator grant 101001678 to JF), an EASI-Genomics grant (PID7780 to TS and AZ), the De-Cock Hadders foundation (2021-57 to JES, 2021-08 to SG), the International Society for Research in Human Milk and Lactation (ISRHML, personal grant to JES), the Winston Bakker Fonds (WB-08, granted to TS), and the European Union’s Horizon 2020 research innovation program (824110). SG and TS hold scholarships from the Graduate School of Medical Sciences and the Junior Scientific Masterclass of the University of Groningen, the Netherlands, respectively. The Lifelines NEXT cohort study received funds from the University Medical Center Groningen Hereditary Metabolic Diseases Fund, Health~Holland (Top Sector Life Sciences and Health), the Ubbo Emmius Foundation, the European Union, the Northern Netherlands Alliance (SNN), the provinces of Friesland and Groningen, the municipality of Groningen, Philips, and the Société des Produits Nestlé. We are grateful for the participation of all parents and infants in the Lifelines NEXT study. We thank the Lifelines NEXT team for their continuous efforts in project management, administration, participant recruitment, and data and sample collection and registration. We also thank the staff of the Microbiome Laboratory at the Institute of Clinical Molecular Biology at the Christian-Albrecht University of Kiel in Germany, especially Corinna Bang, and at BaseClear in Leiden, the Netherlands, for sequencing our samples. We thank the Genomics Coordination Center at the University Medical Center Groningen, the Netherlands, for providing data infrastructure and access to high performance computing clusters. We thank Katherine Mc Intyre for revising the paper.
Author information
Authors and Affiliations
Contributions
Study design, JES, RG, and AZ; project administration, JES and AZ; supervision, JF, RG, and AZ; funding acquisition, AZ; sample preparation, JES and MK; bioinformatic processing, JES, AFP, AK, and RG; statistical analyses, JES, AFP, and RG; data interpretation, JES, AFP, RG, and AZ; writing —original draft preparation, JES; writing—review and editing, AFP, MK, AK, SG, TS, HG, HH, JF, RG, and AZ. All authors have read and agreed to the published version of the paper.
Corresponding author
Ethics declarations
Competing interests
JES, AFP, MK, AK, SG, TS, HG, HH, JF, RG, and AZ have no competing interests. The funders had no role in study design, data analysis, data interpretation, writing of the paper, and the decision to publish.
Ethical approval
This study was conducted according to the guidelines of the Declaration of Helsinki. The Lifelines NEXT study has been approved by the Ethics Committee of the University Medical Center Groningen (METC UMCG METc2015/600). Participants or their parents/legal guardians gave written informed consent before participation.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Spreckels, J.E., Fernández-Pato, A., Kruk, M. et al. Analysis of microbial composition and sharing in low-biomass human milk samples: a comparison of DNA isolation and sequencing techniques. ISME COMMUN. 3, 116 (2023). https://doi.org/10.1038/s43705-023-00325-6
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1038/s43705-023-00325-6
- Springer Nature Limited