Introduction

Most cells in an organism carry the same genetic information, but not all cell types simultaneously activate all genes, a phenomenon closely linked to epigenetic regulation. Epigenetic regulation refers to heritable changes in gene function without altering the DNA sequence, ultimately causing phenotypic changes in an individual (Waddington 2012). This regulation includes DNA methylation, histone acetylation, and RNA methylation. DNA methylation, a common DNA modification, is crucial in controlling various biological processes in organisms. The advent and widespread use of high-throughput sequencing technologies in recent years have drawn increasing attention to the role of DNA methylation in regulating human diseases, shedding light on the mechanisms behind disease occurrence. In several studies, it was observed that the development of many diseases is accompanied, to varying degrees, by abnormal DNA methylation within the host genome. Intervention targeting this aberrant methylation can be highly beneficial in treating certain diseases. For instance, 5-aza-2’-deoxycytidine (decitabine, DAC) has gained widespread use as an FDA-approved DNA methyltransferase inhibitor in the treatment of diseases such as acute myeloid leukemia (AML) and myelodysplastic syndromes (MDS) (Cruijsen et al. 2014). In addition to cancer, numerous pathogens can also manipulate host immunity through DNA methylation modifications, thereby enhancing their survival within the host. Host-related genes associated with processes like cell cycle progression, senescence, inflammation, and immunity are critical targets for pathogens involved in this regulatory process (Jaenisch and Bird 2003). Pero et al. 2011 discovered that Helicobacter pylori (Hp) induces abnormal methylation of the host COX-2 gene, leading to the development of associated intestinal diseases (Pero et al. 2011). The capacity of pathogen infections to introduce novel methylation patterns into the host's DNA suggests the need for a deeper comprehension of pathogen-organism interactions from an epigenetic perspective. Furthermore, it underscores that inhibiting abnormal methylation for therapeutic purposes also applies to infection immunity.

Mycobacterium tuberculosis (Mtb) is a pathogenic bacterium that spreads through respiratory droplets, leading to tuberculosis (TB) infection. The World Health Organization (WHO) classified TB as a major global pandemic in 1993. According to the newly released survey, TB will cause about 1.3 million deaths globally in 2022, second only to COVID-19. Although the number of deaths caused by TB in 2022 is lower than the 1.4 million in 2021 and 2020, there will be approximately 7.5 million new cases of TB globally in 2022 (WHO 2023). This is the highest number counted in a year since WHO began testing for TB globally in 2019. In addition, about a quarter of the world's population is infected with Mtb, with an additional 10 percent of them suffering from lifelong active tuberculosis. (Kinsella et al. 2021). The tuberculin skin test and the γ-interferon (IFN-γ) release assay (IGRA) are the primary methods employed to detect Mtb infection, relying on a positive T-cell response to Mtb antigens. Nevertheless, some individuals test seropositive but do not develop active TB. These cases are categorized as latent TB infection (LTBI). While latently infected individuals may not exhibit overt clinical symptoms, they face a significant risk of developing active TB and potentially contributing to the TB epidemic in their respective regions, especially when their immune systems are compromised. Recognizing the vast scale of this issue, the World Health Organization has prioritized the treatment of latent TB-infected individuals as a pivotal component of its strategy to eradicate TB. Currently widely used LTBI regimens include rifapentine plus isoniazid once weekly for 3 months, rifampicin once daily for 4 months, or isoniazid plus rifampicin once daily for 3 months. Isoniazid once daily for 6 or 9 months is also an alternative (Shah and Dorman 2021). In addition to latent TB infections, the emergence of drug-resistant Mtb strains has further undermined the effectiveness of traditional Mtb detection and treatment methods. The types of TB caused by infections with drug-resistant mutant Mtb strains include multi-drug-resistant (MDR), pre-extensively drug-resistant (Pre-XDR, and extensively drug-resistant (XDR) (Mazurek et al. 2005). The reference treatment regimens provided by the WHO for these two characteristics of Mtb are generally extended treatment cycles or multi-drug combinations. However, developing host-directed therapies (HDT) to enhance host immunity is a must if TB is to be completely eradicated. And this requires a deeper understanding of the Mtb infection process. Currently, two primary methods for diagnosing TB are in vivo or in vitro assessments of the body's response to Mtb antigens, detected using the tuberculin skin test (TST) or interferon-gamma release assays (IGRAs). However, recent research indicates that differences in IGRA responses are more valuable for assessing TB development in children, whereas these differences hold less significance in adults (Gupta et al. 2020). Furthermore, some individuals continue to test negative for TST or IGRAs despite exposure to areas with a high prevalence of TB or environments with elevated levels of Mtb. For example, a study by Catherine M. Stein revealed that approximately 10% of South African miners remained TST-negative after many years of work, even though the environments they worked in likely had some of the world's highest Mtb concentrations (Stein et al. 2018). Simultaneously, Mtb infection can introduce DNA methylation patterns into host chromosomes, affecting biological processes such as host gene expression, DNA replication, and the immune response to promote its survival. It has been established that Mtb infection introduces novel DNA methylation patterns into the host genome, and several differentially expressed methylation sites in latent versus active TB have been reported (Du et al. 2022). Given its prevalence as an epigenetic regulation mechanism in both eukaryotic and prokaryotic organisms, the role of DNA methylation in immunity to Mtb infection presents promising research avenues. This review explores DNA methylation modifications and their roles in the human body, focusing on their significance during Mtb infection. Additionally, we discuss how DNA methylation influences host-related immune modulation during Mtb infection and the potential applications of DNA methylation in TB research.

DNA methylation

DNA methylation is commonly defined as the process wherein the methyl group (–CH3) from S-adenosylmethionine (SAM) is transferred to the fifth carbon atom (C5) of cytosine, resulting in the formation of C5-methylcytosine, often referred to as 5-methylcytosine (5mC). This biochemical reaction is catalyzed by DNA methyltransferase (DNMT) (Moore et al. 2013) (Fig. 1). Apart from the C5 position of cytosine, which serves as the primary site for methylation, the sixth nitrogen atom of adenine can also undergo methylation, forming N6-methyladenine (6 mA). Additionally, the fourth nitrogen atom of cytosine can be methylated, yielding N4-methylcytosine (4mC) as a modification product (Ponger and Li 2005). While the first two DNA methylation modifications are present in prokaryotic and eukaryotic genomes, 4mC is currently exclusively identified in specific prokaryotic genomes. The predominant form of DNA methylation within eukaryotic genomes involves cytosine methylation at the C5 position, typically occurring at cytosine-guanine dinucleotides (CpG) (Suzuki and Bird 2008). CpGs are sparsely distributed in most regions of the genome and densely distributed in only a few regions of the genome, and among these densely distributed regions of CpGs, the regions that satisfy certain criteria for the density and proportion of CpGs are known as CpG islands (CGls) (Bird 2002; Illingworth and Bird 2009). Genes associated with CpG islands in their promoter regions are more likely to have their expression regulated by methylation levels. An analysis of CpG distribution across the human genome shows that 72% of gene promoter regions are CpG-rich. These genes exhibit expression in most tissues (Saxonov et al. 2006), thereby underscoring the pivotal role of DNA methylation in governing genome expression levels and highlighting that external factors, such as pathogen infections, which can induce abnormal DNA methylation in the human genome, often exert global effects.

Fig. 1
figure 1

Mechanism of DNA methylation. In the presence of DNA methyltransferase (DNMT), the methyl group (red colour) of S-adenosylmethionine (SAM) is transferred to the fifth carbon atom of a cytosine of CpG to produce s-adenosyl- l-homocysteine (SAH)and 5-methylcytosine (5-mC) (color figure online)

DNMTs in humans

DNMTs typically consist of two distinct structural domains: a regulatory structural domain at the N-terminus and a catalytic structural domain at the C-terminus. The C-terminal catalytic structural domain is highly evolutionarily conserved, primarily because the evolutionarily significant motifs of DNMT are predominantly situated in this region (Wilson 1992). These distinctive motifs of DNMT encompass motif I, motif IV, motif VI, motif IX, and motif X. Among these motifs, motif IV plays a central role in catalytic activity, mediating the nucleophilic attack on cytosine C6 to initiate the methylation reaction (Du et al. 2016). Six classes of DNMTs have been identified in eukaryotes based on these conserved motifs: DNMT1, DNMT2, DNMT3, DNMT4, DNMT5, and DNMT6, with the first three being widely distributed in animals, and the latter three identified in algae and fungi (Huff and Zilberman 2014). The human genes encod five DNMTs: DNMT1, DNMT2, DNMT3A, DNMT3B, and DNMT3L. While all these DNMTs can introduce methyl groups from SAM into nucleic acids through a base-flipping mechanism, they can be categorized into at least two groups based on their temporal activity: “de novo” DNA methylases and “maintenance” DNA methylases(Riggs 1975). DNMT3A and DNMT3B, as “de novo” DNA methylases, are responsible for adding methylation marks to previously unmodified DNA (Fig. 2a). The “de novo” catalytic function of DNMT3A and DNMT3B is particularly pronounced during embryonic and extraembryonic development (Smith et al. 2012). DNMT3L, a truncated form of DNMT3, lacks catalytic activity due to the absence of a portion of the N-terminal regulatory structural domain and certain motifs in the C-terminal catalytic structural domain. Despite its lack of catalytic activity, DNMT3L is a vital non-enzymatic cofactor for DNMT3A and DNMT3B during “de novo” methylation. Two DNMT3L molecules form a complex with two DNMT3A molecules, enhancing their affinity for DNA and thereby increasing the efficiency of DNMT3A in catalyzing DNA methylation. This complex is crucial in establishing genetic imprinting in gametes (Bourc'his et al. 2001; Jia et al. 2007). DNMT3L is also essential for DNMT3B to establish a distinct DNA methylation pattern in placental progenitor cells (Smith et al. 2017). It is worth noting that certain catalytically inactive isoforms of DNMT3B, generated through selective splicing, can also act as cofactors similar to DNMT3L under specific circumstances to promote DNA methylation (Duymich et al. 2016).

Fig. 2
figure 2

De novo” versus “maintenance” DNA methylation. (a) Catalyzed by DNMT3a/3b, the methyl group of SAM is transferred to the cytosine of unmethylated CpG dinucleotides in a process known as “de novo” DNA methylation. (b) When double-stranded DNA containing methylated CpG dinucleotides is used as a template strand for DNA synthesis, the CpG dinucleotides in the newly synthesized daughter DNA strand (green) at positions corresponding to the template strand are catalyzed by DNMT1 to add methyl, resulting in the double-stranded daughter DNA replicating the DNA methylation pattern of the template strand accurately, which is known as the process of “maintaining” DNA methylation

Compared to “de novo” DNA methylation, the process of “maintenance” DNA methylation appears to be more complex (Fig. 2b). DNA methylation is an important genetic marker that regulates various genomic processes, including transcription and splicing. Consequently, replicating methylation patterns from parental DNA to offspring DNA is essential to maintain genomic stability and cell specificity. However, the raw material required for DNA replication, dNTP, is typically unmethylated. If the parental methylation mark is not copied into the zygote DNA before the initiation of the next S phase following replication, the methylation mark can be lost, leading to dysregulation of gene expression. Therefore, to preserve genome stability, cells require a DNA methyltransferase capable of introducing methyl groups into the newly replicated daughter DNA strands – the “maintenance” DNA methyltransferase. The concept of “maintenance” DNA methylases was hypothesized in the 1970s (Holliday and Pugh 1975). In the 1980s, Reuven Stein and Arthur Weissbach demonstrated that DNMT1 exhibits preferential catalytic activity for hemimethylated sites (Stein et al. 1982; Pedrali-Noy and Weissbach 1986). Although DNMT1 can also mediate methylation of unmethylated CpG dinucleotides in vitro, its activity in this regard is only a fraction (1/50–1/5) of what it achieves when catalyzing hemimethylated substrates (Yoder et al. 1997; Pradhan et al. 1999). Further supporting evidence for the “maintenance” role of DNMT1 was reported by Chen et al. 2003 who found that overexpression of DNMT1 in the absence of DNMT3A and DNMT3B did not induce de novo methylation in mouse embryonic stem cells (Chen et al. 2003). DNMT2 possesses the characteristic catalytic structural domain found in DNMT family members. However, it lacks the N-terminal regulatory structural domain in other family members, and its specific function needs to be more clearly defined.

Although it is expected to categorize DNMTs in the human body as “de novo” and “maintenance” methyltransferases based on distinct functional roles, this classification appears to oversimplify the complex and multifaceted roles DNMTs play in an organism. Notably, the well-established “de novo” methyltransferase, DNMT3, has also been found to maintain methylation patterns in differentiated cells (Arand et al. 2012; Liao et al. 2015). Hence, the current classification may need to capture the full range of DNMT functions within the organism accurately. The precise in vivo roles of DNMTs warrant further investigation and further research into the molecular mechanisms of DNA methylation, and assessing the subtle changes in genome methylation patterns that occur during various stages of growth and development could provide valuable insights, which may, in turn, offer essential clues for the development of more sophisticated, stable and effective therapeutic strategies targeting DNA methylation in various diseases.

DNA methylation during individual development

DNA methylation is highly dynamic during mammalian embryogenesis, and the genome's methylation pattern undergoes reprogramming, particularly during the germ cell and pre-implantation stages of development. However, the genome's methylation pattern remains unaltered and heritable after somatic differentiation. In somatic genomes, the promoters of housekeeping genes typically feature unmethylated CpG sites, while CpG sites in the non-regulatory regions are mostly methylated. This methylation pattern is usually established early and stably inherited within cells of the same tissue (Bird 2002; Suzuki and Bird 2008). In addition to this relatively stable DNA methylation pattern, minor alterations in CpG methylation status play a role in certain intricate regulatory events within the cell (Weber et al. 2005). Modern studies have challenged the traditional Mendelian theory of inheritance, which posits that each parent transmits an equal set of chromosomes to the offspring, resulting in equal genetic expression from both parents. Phenomena such as X-chromosome inactivation (XCI) and chromosome imprinting in mammals demonstrate that offspring DNA does not carry equal genetic information from both parents. Both X-chromosome inactivation and chromosome imprinting involve differential DNA methylation, especially in the case of X-chromosome inactivation (Lee 2003). Methylation tags are completely removed from the genome at the time of primary germ cell formation to reduce the probability of mutations in future embryonic cells. Subsequent formation of new methylation patterns from scratch by DNMT3A and DNMT3L leads to gender-specific gamete methylation patterns (Gopinathan and Diekwisch 2022). At the end of gametogenesis, approximately 80% of genomic CpGs in sperm are methylated, with a genomic distribution similar to that of somatic cells (Wang et al. 2014). However, after fertilization, the loss of methylation imprinting in the paternal genome was completed during the first six hours of fertilised egg formation (Li et al. 2018), reaching a minimum at the blastocyst stage (Reik et al. 2001). Methylation reprogramming of genomic DNA during this period may be relevant to establishing cellular totipotency. Re-establishment of methylation patterns occurs at the blastocyst stage and is mediated by DNMT3A and DNMT3B (Breton-Larrivee et al. 2019). During the blastula stage, the inner cell mass forms, giving rise to the embryo. Subsequently, as the embryo implants in the endometrium, DNMT3B-mediated DNA methylation becomes active. This process coincides with the initiation of somatic cell differentiation and leads to the silencing of CGl promoters associated with germline-specific genes (Borgel et al. 2010; Auclair et al. 2014). DNA methylation during this developmental phase is closely associated with the loss of cellular plasticity and lineage restriction. Since the introduction of the first DNMT1 knockout mouse in 1992, our understanding of the roles of DNMTs in individual growth and development has advanced significantly. For example, deleting DNMT1 leads to embryonic death (Li et al. 1992). Subsequent research has elucidated the functions of DNMT2 and DNMT3. Deletion of DNMT3A leads to impaired postnatal development in mice, while DNMT3B deletion results in direct embryonic death (Okano et al. 1999). The findings of Andrews' team reveal the specific mechanism by which DNMT3B knockdown leads to embryonic death—loss of DNMT3B results in de-repression of germline genes in trophoblast lineages and impaired formation of the maternal-foetal interface in the placental labyrinth (Andrews et al. 2023). Although DNMT2 deletion is not lethal, it does cause abnormal hematopoiesis in mice (Tuorto et al. 2015). A recent study has shown that 5-hydroxymethylcytosine (5hmC) may play an important role in regulating gene expression in the placenta and that 5hmC also plays a dynamic role in the differentiation of trophoblast cells during pregnancy (Vasconcelos et al. 2023). These findings underscore the essential role of DNA methylation in biological development. Apart from its role in establishing cellular totipotency during embryonic development and governing the generation of cellular lineages and restriction genes, DNA methylation is a critical mechanism for repressing transposons. Transposons, unique DNA sequences capable of autonomous replication and displacement, constitute about 44% of the human genome. Typically, transposon insertions have harmful effects on the host. DNA methylation temporarily inhibits transposable element activity and facilitates permanent genetic inactivation through mutagenic deamination. Additionally, DNA methylation helps maintain genomic stability by limiting recombination between non-allelic copies of transposable elements that share high sequence similarity (Greenberg and Bourc'his 2019). Studies conducted by Mireia Jordà's team, which revealed a positive correlation between increased levels of chromosomal rearrangements and genome-wide hypomethylation in various human cancers (Jorda et al. 2017), provide further support for the idea that DNA methylation constrains gene rearrangements, implying a potential role for DNA methylation in the development of diseases.

DNA methylation in human diseases

The reprogramming of genomic methylation patterns is essential during specific periods such as embryonic development and is necessary for proper development. After somatic cell differentiation, DNA methylation patterns are adjusted to finely regulate specific events within the cell, typically involving only a small set of genes. However, large-scale changes in genome methylation patterns, unrelated to developmental needs, are often linked to human diseases. Different diseases can have distinct effects on host genome methylation patterns, resulting in significant variations in the observed disease manifestations. In the 1980s, researchers noticed that cancer cells exhibited abnormal genome hypomethylation. Conversely, hypermethylation of DNA at the promoters of tumor suppressor genes (TSGs) is common, leading to improper gene expression (Fardi et al. 2018). Hypermethylation of TSG affects various intracellular biological processes: cell cycle, apoptosis, DNA repair, etc. (Pan et al. 2021).This phenomenon has led researchers to speculate that altered methylation patterns may drive tumor formation. Additionally, tumor cells often exhibit the silencing of cell adhesion regulators such as CDH1 (E-cadherin) (Grady et al. 2000) and CDH13 (H-cadherin) (Kim et al. 2005), suggesting that aberrant DNA methylation may not only play a role in tumor initiation but also contribute to the acquisition of characteristics like migration and invasion in tumor cells. DNMT has a dual role in tumourigenesis, acting as both an oncogene and tumor suppressor gene. For example, DNMT3B is an oncogene in breast, liver, colon, and ovarian cancers but acts as an oncogene in lymphomas; DNMT3a is an oncogene in pituitary adenomas, lung cancer and melanoma, but acts as a TSG in lung cancer and chronic lymphocytic leukaemia (Zhang et al. 2020). Characteristic alterations in DNA methylation levels have been found in peripheral immune cells from a variety of cancers, such as hepatocellular carcinoma (Zhang et al. 2018), breast cancer (Wang et al. 2023), etc. As the elderly population continues to grow, neurodegenerative diseases such as Alzheimer's disease (AD), Huntington's chorea (HD), Parkinson's disease (PD), and amyotrophic lateral sclerosis are gaining widespread attention due to their increasing prevalence, considering that the most significant risk factor for these diseases is age. As individuals age, their genomes accumulate epigenetic changes that disrupt gene expression, leading to tissue dysfunction, reduced regenerative capacity, and an increased risk of developing neurodegenerative diseases (Lu et al. 2020). Changes in DNA methylation patterns during aging underlie individual senescence. Methylation levels of ribosomal DNA in somatic cells increase with age, leading to impaired ribosome biogenesis, which in turn triggers kernel stress and, ultimately, aging-related diseases (Wang et al. 2015). In addition, methylation of rDNA transcription units, including the upstream control element (UCE), core promoter, 18S rDNA, and 28S rDNA in spermatozoa, increased significantly with donor age (Potabattula et al. 2020). In addition to rDNA, the level of CpG island methylation in the promoter regions of genes associated with regulating cell growth and development increases with age (Pirazzini et al. 2021). Apart from cancer and neurodegenerative diseases, abnormal alterations in genomic DNA methylation patterns are observed in various human diseases, including those related to the cardiovascular system, autoimmune responses, inflammation, and metabolism. These alterations generally involve switching genes from their normal expression state to being turned off or vice versa. However, the specific genes and epigenetic information affected can vary among diseases. The strong link between DNA methylation and disease has driven research into targeted drugs and therapies. However, due to the lack of research to identify effective candidate genes for DNA methylation modification, epigenetic therapies currently only have some application in cancer. Additionally, epigenetic changes are closely tied to immune and metabolic processes in the body. Non-targeted approaches often result in genome-wide alterations in epigenetic patterns, which can lead to serious side effects. Therefore, future research should focus on identifying specific epigenetic factors and their underlying molecular mechanisms, which could provide a more comprehensive understanding of DNA methylation's role in regulating various diseases, enabling the development of more targeted and effective therapies.

DNA methylation and Mycobacterium tuberculosis

Epigenetic regulation in eukaryotes can operate at various levels, including nucleosome localization, DNA methylation, histone variation, and histone modification. In contrast, bacteria lack components such as nucleosomes and histones, relying primarily on DNA methylation for epigenetic regulation. DNA methylation has been observed in over 93% of prokaryotic genomes (Blow et al. 2016). Unlike eukaryotes, prokaryotic genomes primarily employ 6 mA as their main mode of DNA methylation. Mycobacterium tuberculosis, from approximately 70,000 ago in Africa (Comas et al. 2013), has evolved extensively from a common environmental microorganism into one of the most successful pathogens. DNA methylation, a form of epigenetic regulation present in both prokaryotes and eukaryotes, has played a significant role in both the evolution of Mycobacterium tuberculosis itself and in the modulation of the immune response during Mycobacterium tuberculosis infection in the host.

DNA methylation in Mtb

The combination of mycobacterial latency and drug resistance represents the two major contributors to the devastating impact of tuberculosis, presenting a significant challenge to the academic community. The acquisition of latent properties by Mtb can be traced back to a time before it had evolved into a pathogen. It is suggested that a critical event in the extensive evolution of Mycobacterium tuberculosis was its development of the ability to persist within certain protists that prey on bacteria (Jang et al. 2008). While genetic mutations may have contributed to Mtb’s tolerance to these internal conditions, the role of DNA methylation regulation in the evolution of Mycobacterium tuberculosis appears to be more crucial. DNA methylation offers a faster and more adaptable mechanism for altering gene expression in Mtb, enabling it to acquire specific resistances upon ingestion. DNA methylation is a primary mode of epigenetic regulation in bacteria. This capability to persist within the host also raises the possibility of subsequent infections by Mycobacterium tuberculosis in mammals, where the bacterium can persist and multiply. The academic community remains deeply concerned about Mtb’s ability to persist within the host organism, a trait commonly called “latency.” Through extensive research, we have come to understand that tuberculosis often undergoes a latent phase within the host that can span several decades. During this period, Mtb exhibits characteristics such as slow growth, enhanced resistance to oxidative stress, and reduced sensitivity to drugs (Kinsella et al. 2021). However, when the immune system becomes compromised due to factors like old age or other illnesses (i.e., HIV infection or type II diabetes), the previously suppressed Mycobacterium tuberculosis becomes reactivated, contributing to the local epidemiological trends of tuberculosis in a given region (Cadena et al. 2017). The acquisition of latency represents a foundational adaptation for Mtb’s survival. In the harsh natural environment, successful species evolution requires changes in gene expression to develop better traits for adaptation and, perhaps more crucially, the preservation of genetic material stability. This stability ensures the consistent inheritance of advantageous traits acquired by parents from the external environment, passing them down to offspring and securing the species’ continuity. In addition to assisting Mtb in altering genomic expression patterns in response to external stresses, DNA methylation modifications also play a pivotal role in maintaining the stability of Mtb’s genetic material. Prokaryotic DNA methyltransferases often collaborate with restriction nucleic acid endonucleases, forming a system known as the restriction-modification (RM) system (Anton and Roberts 2021). This system is considered a primitive immune defense mechanism. DNA methyltransferases safeguard their chromosomes and certain beneficial exogenous DNAs from degradation by preventing restriction endonucleases from binding to DNA by adding a methylation imprint at the binding site of the restriction endonuclease (Fig. 3). This primitive defense maintains the stability of the Mtb genome and enhances Mtb survival to some extent. DNA methyltransferases are sometimes referred to as DNA protectors. In contrast, restriction endonucleases degrade exogenous DNA lacking specific methylation marks. This division of labor maintains the Mycobacterium tuberculosis genome's stability while providing some protection against phage invasion. It is worth noting that not all exogenous DNA is harmful to bacteria. Horizontal gene transfer (HGT) has played a significant role in prokaryotic evolution. The ancestors of Mycobacterium tuberculosis acquired genes, including those encoding transferase enzymes and genes adapted to anaerobic conditions, from other bacteria (Boritsch et al. 2014; Reva et al. 2015), t which might have contributed to the ambiguity of Mtb's evolution into a specialized pathogen capable of persisting in the host’s cytosol. Plasmids often serve as carriers for these mobile elements, containing genes essential for bacterial survival and “helper” genes that degrade toxic substances or provide the bacteria with new metabolic capabilities (Attere et al. 2017; Ares-Arroyo et al. 2018). The array of advantageous traits encoded by plasmids expands the range of environmental conditions where the host can thrive and plays a crucial role in bacterial clade evolution (Rodriguez-Beltran et al. 2021). However, it is important to note that while HGT has been instrumental in Mtb’s evolution, transferring exogenous genes beyond a certain range can be detrimental to one’s genome. Therefore, RM systems often selectively add methylation markers to exogenous DNA entering the cytosol, which, to some extent, restricts the HGT of mobile elements, maintaining the relative stability of the genome (Murray 2002).

Fig. 3
figure 3

Defence mechanisms of the restriction-modification (R-M) system. The recognition site of Mtb’s own genome is methylated by DNA methyltransferase (MTase), thus avoiding degradation by restriction endonucleases (REase). Exogenous DNA (e.g. phage DNA) is degraded by REase cleavage because the recognition site is not methylated

The study of DNA methylation in immunity to Mycobacterium tuberculosis infection began with the discovery of methylation imprinting in the Mtb genome in 1925. Three DNA methyltransferases responsible for 6 mA modifications—MamA, MamB, and Hsdm—have since been identified in the Mtb genome(Zhu et al. 2016; Phelan et al. 2018). In addition to regulating Mtb’s expected growth, these enzymes mediate the methylation of relevant genes, which plays a crucial role in Mtb’s intracellular survival and drug resistance. A study by Scarlet S. Shell revealed the presence of transcription start sites (TSS) for genes like Rv0102, Rv0142, corA, and whiB7 at 4 or 5 base pairs from the MamA methylation site. Deletion of MamA results in reduced viability of Mtb under hypoxic conditions (Shell et al. 2013). Given that Mycobacterium tuberculosis is an aerobic bacterium specialized for survival, it must possess the capability to adapt to hypoxic conditions, which is especially important for Mtb’s latency within an intracellular environment where oxygen levels are lower than in the external environment. MamA-mediated DNA methylation of relevant genes is crucial for Mtb to thrive in hypoxic conditions (Shell et al. 2013). In addition to Mtb’s ability to cope with hypoxia, methyltransferases have been linked to Mycobacterium tuberculosis' resistance to antibiotics. Hongqian Chu's research team observed a significant reduction in the growth rate of the Mycobacterium tuberculosis multidrug-resistant strain 11,826 when its Hsdm gene was knocked out, varying degrees of increased sensitivity to drugs such as rifampicin, levofloxacin, and ethambutol were also observed (Chu et al. 2021). Bioinformatics analysis of the Mtb genome revealed that many Hsdm-modified genes were enriched in redox and energy metabolism-related pathways (Hu 2021). Previous reports have demonstrated that bacterial susceptibility to antibiotics is closely tied to the redox state (Lobritz et al. 2015). This suggests that the deletion of Hsdm results in an abnormal methylation status of Mtb metabolism-related genes, impairing the bacterium's normal metabolic function within host cells. Most antibiotics work by inhibiting or killing bacteria by suppressing their energy production. Therefore, the deletion of Hsdm enhances the inhibitory effect of antibiotics, potentially making the bacteria more susceptible to these drugs.

The methylation-modified locus of 6 mA methyltransferase from Mycobacterium tuberculosis contains genes closely related to Mtb’s energy metabolism(such as NuoI, ctaE, fadD, etc.); knockdown of this enzyme has been demonstrated to inhibit Mtb’s ability to survive in low-oxygen environments and increase its sensitivity to drugs (Chu et al. 2021). These findings suggest that DNA methylation holds significant research potential for investigating drug resistance and latent properties of Mtb. It may be a critical factor in the future eradication of TB. However, it is unfortunate that current research on DNA methylation in TB still needs to be improved. Many aspects of DNA methylation during Mtb infection remain unknown and unexplored, and there is a considerable distance to cover before DNA methylation can be applied in the clinical diagnosis and treatment of TB.

DNA methylation in the host during Mtb infection

Studies investigating the interaction between pathogenic bacteria and mammals have revealed that pathogenic bacteria trigger immune responses and lead to changes in gene expression and specific cellular signaling pathways (Bhavsar et al. 2007). DNA methylation, a regulatory mechanism that plays a crucial role in various biological processes of the immune system, is also involved in the infection processes of many pathogenic bacteria, leading to alterations in the host genome's methylation status. For example, Porphyromonas gingivalis infection affects gingival cytokine release through DNA methylation (Drury and Chung 2015), and Helicobacter pylori infection leads to abnormal DNA methylation in the gastric mucosa (Maekita et al. 2006). The impact of Mtb infection on the host genome's methylation status has been a significant area of research due to the long-standing battle between humans and Mtb. As the first line of immune defense against Mtb infection, macrophages activate various signaling pathways to initiate immune responses against invading pathogens. Toll-like receptors (TLRs) expressed on macrophages recognize pathogen-associated molecular patterns (PAMPs) on Mtb and subsequently induce the production of immune-related cytokines such as tumor necrosis factor (TNF) and type I interferon (IFN) (Xu et al. 2014). During Mycobacterium tuberculosis infection, cytokines such as tumor necrosis factor-alpha (TNF-α) and interleukin-1 beta (IL-1β) play pivotal roles in the host immune response. When Mtb is inhaled into the body, it binds to phagocytic receptors and gains entry into alveolar macrophages, which are subsequently recruited into the bloodstream (Bhatt and Salgame 2007). a Following the phagocytosis of Mtb, these alveolar macrophages migrate to the interstitial spaces within the lungs. This migration is accompanied by the infiltration of other immune cells, leading to the formation of structures known as granulomas (Cohen et al. 2018). Lung granulomas are commonly regarded as the focal point of the immune response against Mtb. However, animal model studies have revealed that individual granulomas exhibit varying abilities to control Mtb based on their unique microenvironments. While some granulomas can completely clear Mtb, the overall outcome typically results in a state of equilibrium where the immune system manages Mtb sustainably while allowing it to persist for an extended period (Cohen et al. 2022). Individuals in this state of equilibrium are referred to as latently infected with Mtb. Although there are no obvious clinical signs, once active TB develops in a sudden situation, these individuals are highly likely to transmit TB unknowingly. Genes impacted by DNA methylation changes induced by Mtb are primarily related to the immune system. TB patients typically exhibit a diminished immune response compared to the general population. For instance, one study found that Mtb infection led to methylation of the IL-6 promoter, potentially resulting in reduced IL-6 levels (Zheng et al. 2016). IL-6 is associated with the differentiation of macrophages and cytotoxic T-cells, and a decrease in this cytokine can facilitate Mtb survival within host cells. Another study compared methylation levels at 20 CpG sites in the promoter region of the Toll-like receptor 2 (TLR2) gene in TB patients and the normal population. TB patients exhibited higher methylation levels at five sites, leading to reduced TLR2 protein expression in monocytes (Chen et al. 2014). TLR2 plays a crucial role in innate immunity against Mtb infection. Bobak et al. observed aberrant DNA methylation in guinea pigs' spleen and lung genomes four days after Mtb infection. Bioinformatics analyses showed that hypermethylated genes were enriched in immune-related signaling pathways such as the PI3K-AKT signaling pathway, the MAPK signaling pathway, and the mTOR signaling pathway (Bobak et al. 2022). Additionally, the risk of reinfection with Mtb in the TB-healed group was significantly higher than in the healthy population (about 13 times higher than in normal individuals) (Millet et al. 2013). Furthermore, DiNardo et al. discovered that TB patients exhibit a hyporesponsive immune phenotype, characterized by reduced immune proliferation and diminished production of cytokines in response to Mycobacterium and mitogens (DiNardo et al. 2020). Genomic DNA methylation sequencing revealed these phenomena were linked to DNA hypermethylation of key immune genes and pathways. This hypermethylation persisted even after the successful completion of treatment. These findings indicate that Mtb can suppress the host immune response through DNA methylation regulation. The detrimental methylation changes imposed by Mtb persist not only during infection but also post-treatment, which can partly explain why bacterial culture tests for most TB patients typically become negative after two months of antibiotic treatment, with treatment typically lasting up to six months.

Mycobacterium tuberculosis has developed various strategies to enhance its survival within the host. Upon infecting and successfully parasitizing the host, Mtb faces pressure from the host's immune system, prompting it to evolve tactics like limiting the production of antimicrobial peptides, escaping to the cytosol, and inhibiting cellular autophagy to evade host immune clearance (Liu et al. 2017). These interference mechanisms employed by Mtb in host cells are often associated with the abnormal DNA methylation patterns it induces in the host genome. The DNA methylation imprints triggered by Mtb infection facilitate intracellular survival and dampen the host's immune response, prolonging the risk of reinfection long after TB healing (Fig. 4). This deleterious cycle disrupts the host's immune system, increasing susceptibility to Mtb and a variety of other pathogens and resulting in much higher individual mortality. Rapid and precise removal of Mtb-induced abnormal DNA methylation in the host genome is crucial to break this cycle. It holds great promise for developing more effective therapeutic and diagnostic methods.

Fig. 4
figure 4

The vicious cycle of Mycobacterium tuberculosis infection

Summary

The lag in TB diagnostics significantly hampers progress in the TB elimination strategy. TB treatment approaches have remained relatively unchanged for decades, with the mainstream methods currently in use, such as BCG and antibiotic regimens, dating back at least 40 years (Gagneux 2018). While these traditional TB treatments have been effective against common strains of Mycobacterium tuberculosis, they are less effective in combating the drug-resistant mutant strains that pose a significant global threat. There is a growing concern that drug-resistant Mtb strains potentially lead to a new global TB crisis. WHO-recommended treatment regimens for drug-resistant TB remain less effective for XDR-TB. Understanding the mechanisms behind Mycobacterium tuberculosis's rapid adaptation to antibiotic treatment and the host's immune responses is crucial. This knowledge is essential for advancing strategies to combat tuberculosis. In recent years, with the continuous development of high-throughput sequencing technology, the role of DNA methylation in Mtb’s interactions with the host has become a focal point of TB research. Evidence suggests that DNA methylation is intricately linked to Mtb’s metabolism and virulence. However it is clearly impractical to screen for latent TB infection worldwide by means of DNA sequencing. This approach consumes substantial resources, especially in TB’s high-burden countries, and requires significant time. Moreover, it does not contribute significantly to diagnosing dynamic Mtb infections. Nonetheless, the ongoing advancements in high-throughput sequencing technology can be harnessed to facilitate the study of DNA methylation during Mtb infection. This technology offers an opportunity to gain deeper insights into the specific mechanisms of methylation modification in both Mtb itself and the host genome. Such insights hold immense significance in enhancing the drug sensitivity of bacterial strains, identifying more promising drug targets, and developing cost-effective and user-friendly diagnostic methods with high sensitivity and specificity, which may even contribute to solving the challenge of Mtb latency in the future.