Background

In 1949, the American scholar Barr et al. found a deeply stained corpuscle in the nucleus of the mitotic interphase of female cats, but not in male cats. He named the corpus the “Barr body” [1]; later studies confirmed that the “Barr body” is an abnormally condensed and inactive X chromosome, the compact shape of which is thought to be the result of its nearly complete inactivation [2]. Further studies determined that while maintaining an active X chromosome in mammalian diploid cells, all remaining X chromosomes are inactivated—a phenomenon called X chromosome inactivation (XCI) [3, 4]. It is achieved by transcriptional silencing of one of the two X chromosomes in the early stages of embryonic development: one of the two X chromosomes in the female is silenced, resulting in the same gene expression as a male with only one X chromosome [3]. The dosage compensation effect based on XCI is a very delicate, systematic process, the core function of which involves the XCI center (XIC) as the main switch seat of XCI. The Xist (X-inactive specific transcript) gene located in the XIC plays a major regulatory role throughout the entire process of XCI.

XCI is a normal developmental regulatory process; however, if the process is abnormal, it will cause interference with X-linked gene silencing at the local and chromosomal levels. Thus, the expression of cancer-related and other genes is altered by XCI—a process that may ultimately cause tumors [5]. At the same time, sex differences in cancer occurrence and progression have long been known [6], with men experiencing a higher incidence of malignancies than women [7]; furthermore, male cancer mortality rates are significantly higher than those in females. These findings led us to consider whether sex differences in cancer may be related to abnormal XCI. This review will describe the mechanism of XCI and the relationship between XCI abnormalities and tumorigenesis, while investigating whether sex differences in cancer patients may also be closely related to abnormal inactivation of the X chromosome.

Mechanism of XCI

Traditional genetic studies have shown that the X chromosome is controlled by a single cis-acting total switching site known as the XIC; it is the main regulatory region for XCI. The X chromosome carrying the XIC can initiate inactivation by homeopath. On the contrary, the X chromosome lacking the XIC is not inactivated. The XIC ensures the appropriate random inactivation of the X chromosome. This locus produces a large non-coding RNA called Xist, which is the major regulatory gene of XCI. It engages in cis-binding and accumulates along the entire chromosome from the transcription site [8].

X chromosome dosage compensation mechanism

In XY sex-determined organisms, female individuals have two X chromosomes, while male individuals have only one X chromosome. Although the Y chromosome was the original homologous chromosome of the X chromosome, it has degraded with time, thus creating an imbalance of X-linked genes between the sexes. Dosage compensation is the mechanism that balances the expression of unequal doses of X-linked genes between males and females and ensures a balance between the expression of a single X chromosome and two autosomal genes. Female mammals contain two X chromosomes, one of which is "closed" to avoid gene overexpression. XCI follows the − 1 rule: no matter the number of X chromosomes, only one can be randomly retained, thereby balancing the dosage compensation effect between XX and XY chromosomes. The dosage compensation effect is widespread in the biological world. This phenomenon was first discovered in Drosophila melanogaster M. and subsequently confirmed in mammals and Caenorhabditis elegans M.; based on these findings, three possible dose compensation mechanisms were proposed [9]: (1) male single X chromosome expression is doubled in male Drosophila; (2) female mammals inactivate an X chromosome; (3) Caenorhabditis elegans reaches equilibrium by halving the expression of two X chromosomes in females. By virtue of this mechanism, not only is the expression of the X chromosome between the sexes balanced but the expression between the X chromosome and the autosomes is also balanced [10].

The regulatory mechanism of Xist in the process of XCI

Long non-coding RNA (lncRNA) is a class of genes that are longer than 200 nt and do not encode proteins [11]. Among them, lncRNA Xist is the main regulator of mammalian X inactivation [8]; it is necessary for transcriptional silencing of an X chromosome during the development of a female mammal [12, 13]. In normal human tissues, a slightly lower expression level of Xist may lead to a decrease in chromosomal coating efficiency, resulting in the destruction of silencing nuclear compartments in somatic cells [14]; its inappropriate silencing leads to qualitative abnormalities in stem cells [15,16,17,18]. The current view is that Xist is essential for the initiation of XCI in embryonic stem (ES) cell [19] and mouse models [20]; furthermore, Xist deletions lead to long-term inactive X (Xi) maintenance failure in vivo [21, 22].

The RNF12 gene acts as an activator of Xist upstream and its protein product—the ubiquitin ligase that causes the degradation of the Xist repressor protein Rex1—has been shown to play a key role in the XX dose-dependent activation of Xist [23, 24]. Xist is able to aggregate specific proteins and interact with them to cover and silence an X chromosome in each female cell [25], ultimately preventing women from possessing an additional functional X chromosome. Three proteins: SAFA (scaffold attachment factor-A), LBR (lamin B receptor), and SHARP (SMRT- and HDAC-associated repressor protein) are essential for XCI. All three proteins are indispensable; if even one is inactive, Xist will not silence the X chromosome during development. Further analysis has shown that these three proteins play three different but vital roles: (1) the localization of Xist is controlled by the hnRNP U/SAFA family of molecules [26], which bind directly to the transmembrane protein LBR anchored to the inner nuclear membrane [27]; (2) LBR is rich in silencing protein [28, 29] immobilized on the nuclear membrane and interacts with chromatin regulatory proteins and lamin B [30]; (3) LBR recruits inactive X chromosomes to the nuclear layer and changes the three-dimensional structure of DNA. LBR binds to Xist, allowing Xist and its silencing proteins to spread on the X chromosome to silence transcription, thereby remodeling the chromosome and making its genes less likely to be expressed. For a gene to produce a functional protein, the gene must first be transcribed into RNA by RNA polymerase II [13, 14, 25, 26]. The actual "silencing" work is done by the third protein SHARP, which excludes polymerase from DNA, thus preventing transcription and gene expression. The function of RNA-Xist, which does not encode proteins, on the XIC is multifaceted [31]. On the one hand, Xist acts as a modular RNA scaffold in the assembly of inhibitory protein factors. Polycomb repressive complex 1 (PRC1) maintains chromatin stability in a repressed state; it can catalyze the ubiquitination of tyrosine at position 119 of H2A histone9 (H2AK119ub) and participate in the transcriptional silencing of genes. Polycomb repressive complex 2 (PRC2) can catalyze the double or triple methylation of the 27th tyrosine on H3 histones. The methylation of H3K27 (H3K27me3) is a marker of gene silencing caused by PcG protein [32,33,34,35,36,37,38]. The Xist and polycomb complexes are interdependent and propagate during Xi [31]. On the other hand, Xist forms an inhibitory compartment by rejecting transcription and building factors to create a unique Xi chromosome conformation [2, 36, 39,40,41].

The antisense regulatory factor Tsix of Xist is localized in the XIC region, which plays a key role in the expression of Xist [42]. Tsix is upregulated on one of the X chromosomes and is responsible for recruiting the polycomb protein to cis-coat the X chromosome to induce X chromosome silencing. The accumulation of Xist RNA rapidly produces a silent nuclear compartment [43]. The mutual regulatory mechanism of Xist and Tsix has been studied and confirmed by many researchers. Both participate in the physiological process of XCI during early embryo development, including the three stages of initiation, transmission, and maintenance: first, the Xist gene encodes Xist RNA and then the Xist RNA encapsulates the X chromosome, which prepares to initiate XCI [44]. As Xist RNA expands on the X chromosome, DNA methylation and histone modifications (such as H3K9Ac, H3K4me2, and H3K4me3) occur immediately; H4 hypoacetylation occurs shortly thereafter [45], which establishes and maintains XCI. Importantly, inactivated chromosomes continue to synthesize Xist RNA to maintain their inactivation [42].

Xist expression loss and tumorigenesis

A large number of lncRNAs are reportedly associated with cancer [46]; indeed, a direct causal relationship between lncRNA Xist and tumors has been demonstrated [47]. Xist is a key regulator of dosage compensation that randomly inactivates an X chromosome in a mammal during embryonic development. Given that the X chromosome harbors important cell differentiation and proliferation genes—as well as cancer-related genes—genetic changes occurring on vulnerable sex chromosomes may cause immediate damage; moreover, such mutated cells are more likely to develop into tumor cells in the genetic process [3]. Xist-led XCI processes silence hundreds of genes (including oncogenes); therefore, the loss of Xist expression promotes tumor development [48]. Downregulation of Xist expression and loss of XCI are commonly observed in basal-like cancer, breast cancer susceptibility gene 1-null triple-negative breast cancer [49,50,51,52,53,54,55], and ovarian cancer cell lines. In addition, some patients with testicular germ cell tumors were found to have significantly higher levels of Xist demethylation [56,57,58].

Xist is critical in maintaining XCI, with the deletion of Xist expression causing the inactivated X chromosome to be reactivated, triggering a series of unfavorable genome-wide changes including involvement in DNA replication, chromosome segregation, cell cycle checkpoints, and hematopoietic genetic disorders. As mature hematopoietic cells accumulate, they can potentially cause a wide range of X-linked genes to include tumor-associated gene de-suppression, thereby driving tumorigenesis. Related studies have revealed that the loss of Xist in mouse hematopoietic cells leads to the reactivation of the inactivated X chromosome leading to a genome-wide change in cancer, inducing aggressive and fatal leukemia [47]. Xist has been found to be disordered in a variety of human cancers [59]; many cancer cell lines derived from female breast, cervical, and ovarian tumors show a loss of Xist expression levels [53, 60, 61]. In addition, the use of human induced pluripotent stem cells (iPSCs) to explore the role of Xist RNA shows that the loss of Xist expression in iPSCs is significantly associated with upregulation of X-linked oncogenes, with a higher rate of growth of oncogenes in vivo and poorer differentiation in vitro [17]. Therefore, we conclude that Xist is involved in epigenetic, transcriptional, and post-transcriptional regulation; furthermore, it is abnormally expressed in many diseases, especially in tumors. It is a novel molecule involved in carcinogenesis and tumor suppressor pathways in tumor pathology (Fig. 1).

Fig.1
figure 1

The regulation of Xist in X chromosome inactivation. a RNF12 acts as an activator of Xist and secretes a protein that degrades the Xist repressor protein Rex1. b The SAFA family controls the positioning of Xist and can prompt Xist to directly bind to the lamin B receptor; LBR binds to Xist, allowing Xist and its silencing protein to spread on the X chromosome to silence transcription, thereby remodeling the chromosome so that it does not express. c SHARP excludes polymerase from DNA, thus preventing transcription and gene expression; the Xist and polycomb complexes are interdependent, spread on Xi, and form inhibitory compartments by repelling transcription and building factors to create a unique Xi chromosome conformation. Xist guarantees that the X chromosome inactivation process is carried out in an orderly manner. If Xist is deleted, the originally inactivated X chromosome may be reactivated, causing the degeneration of tumor suppression-associated genes, thereby inducing cancer

Escape from X-inactivation tumor suppressor genes and tumorigenesis

Tumor suppressor genes escape XCI

Most of the genes located on the X chromosome of female mammals are silenced by the inactivation of the X chromosome [62, 63]. However, not all genes on Xi are silenced, as it is reported that approximately 15% of X-linked genes evade XCI to some extent and continue to be expressed on Xa and Xi [64, 65]. Many chromosomal regions of the X chromosome including Xp11-22, Xq25-26, and Xq27-28 have been proposed as potential loci for tumor suppressor genes and many tumor suppressor genes in this region can escape XCI [66,67,68,69,70,71,72,73,74,75,76,77,78]. These escaped genes are often located outside the Xist RNA-coating region [14, 79]. Given that they escape the XCI mechanism, the X chromosome tumor suppressor genes can be expressed and thus have a tumor suppressor function. We call these genes “escape X chromosome loss” or “escape from X-inactivation tumor-suppressor (EXITS)” genes.

Escaped gene sequences often exist in the pseudoautosomal region (PAR) on the X sex chromosome. PAR is a region of mammalian X and Y chromosomes that is homologous to autosomes, mainly at the two ends of the sex chromosome, and plays a crucial role in the pairing and genetic recombination of sex chromosomes during meiosis. There are two groups of major escapers on the X chromosome: one group contains escapers located in the PAR and has precise homologs on the Y chromosome, showing equal expression levels between the sexes [80]; the second group consists of genes located outside of the PAR, which have Y-chromosome homologs also located outside of the PAR and may represent evolutionary residues from the original sex chromosomes. There is a large difference in the actual degree of escape of genes from these different tissues or species in the XCI process [81,82,83].

Tumor suppressor gene mutation

The traditional Knudson’s two-hit mechanism assumes that the occurrence of tumors requires more than two mutations. As age increases, the probability of cancer increases; that is, the occurrence of cancer requires the accumulation of multiple mutations [84,85,86,87]. However, the study of X-linked tumor suppressor genes challenges the "two-hit inactivation" theory of tumor suppressor genes and introduces a new concept: a single genetic hit can also result in the loss of tumor suppressor function. The loss of function of the tumor suppressor locus is followed by a loss of heterozygosity (LOH), thus losing the wild-type tumor suppressor gene [88,89,90]. LOH is generally associated with tumor suppressor genes (such as p53) that inhibit the development of malignant tumors when both alleles are present. In a female cell, when an X chromosome has been inactivated, the tumor suppressor gene is in an inactive state; when the other tumor suppressor allele on the active X chromosome is mutated or deleted, the malignant state is no longer inhibited, resulting in a decrease in the level of tumor suppressor function during tumor development and progression, and the cells are transformed into cancer cells. This phenomenon is common in retinoblastoma, breast cancer, and other cancers caused by mutations in tumor suppressor genes [91,92,93]. LOH on the active X chromosome may result in the complete loss of tumor suppressor function in these X-linked genes, making the individual susceptible to cancer formation [3, 94]; this is in contrast to the biallelic inactivation of the autosomal tumor suppressor gene, the expression levels of which remain sufficient to function in human cancers.

Based on a comprehensive analysis of the mutation status of human cancer genes, genes such as ATRX (alpha-thalassemia retardation syndrome, X-linked), KDM6A (lysine demethylase 6A), CNKSR2 (connector enhancer of kinase suppressor of ras 2), DDX3X (DEAD-box helicase 3, X-linked), KDM5C (lysine demethylase 5C), and MAGEC3 (melanoma-associated antigen C3)—located on the X chromosome—can escape XCI and thus play a role in inhibiting tumorigenesis. However, these genes have a higher frequency of mutations in cancer and are presumed to be important candidate EXITS genes [95, 96]. Among these six genes, research on the protein KDM6A (also known as UTX) is currently more extensive. The KDM6A gene is mainly located in Xp11.2 and encodes histone H3 lysine 27 (H3K27) demethylase expressed by two X chromosomes [97]; it was found to be an EXITS gene in a mouse lymphoma model [98] that is mainly responsible for preventing the uncontrolled division of cells. In cancer patients, low expression of KDM6A results in a significant survival rate, while the absence of KDM6A not only accelerates the onset of lymphoma, but also strongly promotes tumor progression [99]. Ultimately, the mutation of the gene may weaken this constraint system and cause cancer (Fig. 2).

Fig.2
figure 2

The study of X-linked tumor suppressor genes challenges the two-hit inactivation theory. a The traditional Knudson’s two-hit hypothesis holds that malignant tumor occurs after two or more mutations, with the first mutation occurring in germ or somatic cells and the second occurring in somatic cells. b There are tumor suppressor genes on both the female and male X chromosomes. For females, if the tumor suppressor gene on the active X chromosome is mutated, the EXITS gene can function normally; males do not have the protection of the corresponding copy and a harmful mutation of the tumor suppressor gene can cause cancer

Sex differences in XCI and cancer

Abnormal inactivation of the X chromosome promotes the development of tumors, at the same time, tumors occur in different environments in males and females. An increasing number of studies report that the occurrence, progression, molecular phenotype, and response to treatment are generally biased based on sex [100]. The incidence of most tumor types in men is on the rise [101]; many malignant tumors including esophageal cancer are diagnosed more often in men and are characterized by worse prognosis leading to higher mortality [102]. The link between the X chromosome and cancer applies to men. For example, XXY males carrying the BRCA1 mutation have a 20- to 50-fold increased risk of breast cancer [103].

The biallelic expression of the EXITS gene may explain the reduction in cancer incidence in women [17]. When an X chromosome is inactivated, the tumor suppressor gene on it is inactivated; meanwhile, another tumor suppressor gene on another active X chromosome is also inactivated and a mutation or deletion is required to allow cancer to occur [104,105,106,107]. Although the EXITS gene is located on the X chromosome, it escapes from the inactivation of the X chromosome and continues to play a role in suppressing cancer, thereby reducing the risk of cancer in women. In males, there is only one X chromosome, and the inactivation of a single copy of the X chromosome tumor suppressor gene can allow cancer to occur; this may be one of the reasons why male cancer incidence and mortality are higher than those in females. Evidence has shown that these six important candidate EXITS genes evade XCI, leading to sex bias [108]. Compared to women, mutation in the EXITS gene is a common phenomenon in men [109] with ATRX, CNKSR2, DDX3X, KDM5C, KDM6A, and MAGEC3 having a higher frequency of mutation loss [95, 110,111,112]. To date, little is known about this mechanism that leads to differences in individuals and cell types [64, 80, 113,114,115]. We conclude that the expression of the biallelic gene of the female EXITS gene reduces female cancer mortality in different tumor types relative to men.

The lack of an idle X chromosome that bears a well-functioning tumor suppressor gene is one of the reasons why men have a greater propensity to develop cancer. The genes that are more frequently mutated in men are found only on the X chromosome, some of which are known to be tumor suppressor genes that evade X inactivation; the protection provided by working copies of these genes in female cells may help explain why many cancers have a lower incidence among women. One of the common causes of cancer in men is that male cells need only one harmful mutation in a tumor suppressor gene to cause cancer. In contrast, female cells need two copies to be mutated to cause cancer. In males, the homolog of KDM6A on the Y chromosome is called KDM6C (also known as UTY); its catalytic activity is very low or non-existent [116]. Furthermore, the inactivation mutation of UTX located on the male X chromosome is not compensated by UTY. An X chromosome in a female cell is closed during embryogenesis and remains inactive throughout life. The inactivated KDM6A gene on the X chromosome “avoids” this dormant state and functions normally. A “good” copy of the gene is sufficient to prevent the cell from becoming cancerous [95]. There is no doubt that the EXITS mutation is not the only explanation for the difference in the incidence of cancer in men and women. However, the environmental and hormonal factors associated with sex-specific differences in cancer may interact with the EXITS locus or its gene products to modulate cancer risk [95].

Conclusions and perspectives

XCI is closely related to the occurrence and development of tumors [5]. There are more than 1000 genes on the X chromosome, accounting for 5% of the entire genome. A large number of tumor suppressor genes are located on the X chromosome [117]. Epigenetic instability of inactivated X chromosomes appears to occur extensively in breast cancer types [43]. The heterochromatin structure of Barr bodies often disappears in invasive tumors such as breast cancer cells [103, 118, 119]; a common cause of its disappearance may be due to the overall disturbance of nuclear tissue in cancer cells and the destruction of its heterochromatic structure [43]. In theory, the use of IF and RNA FISH to detect X-linked gene reactivation and abnormal chromatin status in breast tumors can provide valuable biological indicators to assess epigenetic status and tumor response to drug therapy [120]. Given that the loss of Xist expression may potentially drive tumorigenesis [47, 121, 122], it would be reasonable to recommend Xist reactivation as a treatment strategy for cancer. In tumors, Xist expression can be reactivated by small molecules (such as Xist expression vectors), providing a novel therapeutic approach to target epigenetically functional lncRNA. In ovarian cancer cell lines, Xist expression can serve as a prognostic marker associated with chemotherapy response [123]. If a molecule that induces Xist expression is identified, improved patient prognosis may become feasible [124].

The EXITS gene escapes the inactivation of the X chromosome, allowing for the expression of X chromosome tumor suppressor genes and thereby invoking a tumor suppressing function. Perhaps during tumor treatment, the tumor suppressor gene on the X chromosome can be inactivated by artificial escape. It is believed that this will bring about a qualitative leap and breakthrough in related diseases, especially the diagnosis and treatment of female diseases.

Among the various individual differences, sex differences are one of the most interesting research topics. Sex is an important factor in the occurrence, diagnosis, and prognosis of many diseases [125]. Numerous clinical studies have shown that the incidence of many tumors involves significant sex differences. If we want to better understand different cancers, the key is to determine the reasons underlying sex differences. At present, there is a lack of targeted individualized prevention and treatment programs in clinical settings. To understand the different responses of men and women to cancer treatment due to their genetic differences, future clinical studies on cancer management should involve adequate numbers of patients and tumor tissue samples. To better understand the pathogenesis of certain diseases in humans, the sex ratio of the disease should be further clarified. This will help optimize treatment for both sexes, identify possible sex-biased protections or harmful factors, and may assist the development of new treatment strategies.