Introduction

Osteoporosis is a debilitating disease that often leads to fractures, disability, and even mortality among the elderly population [1, 2]. While it affects both genders, postmenopausal women are particularly susceptible to osteoporosis, with a prevalence six times higher than that in men [2]. Notably, gender differences may contribute to the development of osteoporosis [3]. However, most osteoporosis studies focus on the disease in postmenopausal women, neglecting gender-specific differences in osteoblasts [4, 5].

Bone metabolism is complex and is influenced by a variety of molecules and cell types, including osteoblasts, osteoclasts, and immune cells [6,7,8]. The integrity of the human skeleton is maintained through a series of repeated bone remodeling processes, which involves the dynamic balance between bone formation mediated by osteoblasts and bone resorption mediated by osteoclasts [9]. The interplay among these factors adds to the complexity of the disease. Disorders of bone metabolism can lead to various bone-related diseases, including osteoporosis and osteoarthritis. However, the intricate mechanisms underlying osteoporosis remain largely unknown due to its complexity.

Emerging therapeutic avenues, such as T-cell therapy and anti-inflammatory treatments, offer promise in osteoporosis treatment [9]. Encouraging results have been observed in animal models with anti-inflammatory therapies [10]. Furthermore, the significance of RANKL, expressed in T cells and essential for osteoclastogenesis [6], has led to the development of denosumab. It is an anti-RANKL antibody drug approved by the U.S. Food and Drug Administration, which has shown efficacy in osteoporosis treatment [9]. Therefore, elucidating the mechanisms and biomarkers of osteoporosis may enable the development of novel therapeutic approaches.

Despite significant advances in biotechnology, such as next-generation and single-cell sequencing, a comprehensive understanding of bone-related data remains a challenge. Notably, the ongoing Genotype-Tissue Expression (GTEx) project (https://www.gtexportal.org/home/) has profiled over 50 human normal tissues but still lacks bone-related transcriptomic data [11, 12]. The Human Protein Atlas (HPA) program was launched in 2003 to map all human proteins within cells, tissues, and organs [13]. HPA has profiled over 44 human tissues but also lacks bone-related data.

In this study, we compared the transcriptome data of osteoblasts from women and men, and identified differential expression genes (DEGs). We performed a weighted gene co-expression network analysis (WGCNA) to discover co-expressed modules associated with different biological processes. To identify robust key osteoporosis genes, we intersected the DEGs with known GWAS data and mouse bone mineral density-related genes. We identified nine key genes, including GMDS, SMOC2, SASH1, MMP2, AHCYL1, ARRDC2, IGHMBP2, ATP6V1A, and CTSK. The clinical relevance of these key genes was confirmed through various multi-omics data and qRT-PCR. Furthermore, we also uncovered the potential effects of the anti-RANKL antibody, denosumab. We observed its impact on the down-regulation of six key genes, suggesting a potential modulation of critical pathways. Additionally, we found the up-regulation of T cell proportions and down-regulation of fibroblast proportion in bone biopsies, prompting further exploration of their roles in osteoporosis development. In sum, our study offers not only insights into gender disparities within osteoblast transcriptomes but also illuminates potential biomarkers and therapeutic targets for osteoporosis diagnosis and treatment.

Materials and Methods

Data Collection and Preparation

Human trabecular bone osteoblast dataset GSE15678 (Illumina humanRef-8 v2.0 expression beadchip) contains 187 samples from 95 unrelated donors [7]. The dataset was used for differential expression gene identification. Blood methylation dataset GSE163970 (Illumina HumanMethylation450 BeadChip) contains 492 samples from 232 Paget’s disease and 260 controls [14]. Blood monocytes transcriptome dataset GSE56815 (Affymetrix Human Genome U133A Array) contains 40 high (20 pre- and 20 postmenopausal) and 40 low hip bone mineral density (BMD) (20 pre- and 20 postmenopausal) subjects [15]. Diversity Outbred mice femoral transcriptome dataset GSE152708 (Illumina NextSeq 500) contains 192 femoral samples with kinds of bone traits data [16]. These datasets were used for the validation of key osteoporosis genes. Denosumab treatment dataset GSE141610 (Illumina HiSeq 2000) contains 30 bone biopsies isolated three months after treatment with a placebo (n = 15) or denosumab (n = 15) in post-menopausal women [5]. The dataset was used to infer potential drug mechanisms. All these datasets were downloaded from NCBI GEO.

Osteoporosis-related GWAS data was downloaded from GWAS Catalog using the following keywords, “osteoporosis,” “bone density,” “bone mineral content measurement,” and “bone mineral density” [17]. After redundancy removal, 2250 genes were kept for GWAS. International Mouse Phenotyping Consortium (IMPC) data were downloaded using the following keywords, “abnormal bone mineral density,” “decreased bone mineral density,” “increased bone mineral density,” “abnormal bone mineralization,” “decreased bone mineral content,” “increased bone mineral content,” and “abnormal bone mineral content” [18]. After redundancy removal, 730 genes were kept for IMPC. Gene function information was derived from the NCBI Gene database.

Transcriptome and Methylation Data Analysis

For dataset GSE15678, we used limma with the adjusted P value 0.05 as a threshold to screen as many genes as possible. Gene functional enrichment was performed by clusterProfiler and g:Profiler [19, 20]. Transcription factor motif enrichment was analyzed by the TRANSFC database [21], and only the top results were retained. For dataset GSE163970, we used the ChAMP package to analyze the M value matrix [22]. The adjusted P value for the differential methylation probe was set at 0.01. For dataset GSE141610, limma-voom was used to analyze the gene counts matrix, and the significance threshold for differential expression was set at adjusted P < 0.01 [23]. The proportion of immune cells was estimated by the TIMER tool [24].

Weighted Gene Co-expression Network Analysis for Differential Expression Genes

Weighted gene co-expression network analysis (WGCNA) was conducted according to the package manual in R4.0.3 [25]. The parameters were set as power = 14, minModuleSize = 30, network type = "signed," and deepSplit = 4. The module expression values were correlated with gender, where women were assigned a value of 1 and men a value of 0. The module and pathway preservation was calculated by the modulePreservation function in WGCNA with the following parameters, networkType = "signed," nPermutations = 100, parallelCalculation = T [26]. Preservation refers to the assessment of the similarity/conservation of modules or pathways across different datasets or conditions [26]. The preservation statistic was summarized by Zsummary value with the following thresholds: if Zsummary > 10 there is strong evidence that the module/pathway is preserved; if 2 < Zsummary < 10 there is weak to moderate evidence of preservation; if Zsummary < 2, there is no evidence that the module/pathway preserved. All the results visualization was carried out in R.

Quantitative Reverse Transcription Polymerase Chain Reaction (qRT-PCR)

Eight postmenopausal women with osteoporosis and 3 healthy women were enrolled. The femur (1 g) was collected from each patient during hip arthroplasty and stored in liquid nitrogen as soon as possible. For the qRT-PCR assay, after RNA extraction, 1 ug of RNA was used for reverse transcription to cDNA. A PCR experiment was conducted using the TB GreenTM Premix ExTaqTMII (Takara, Dalian, China) and analyzed with the real-time ABI 7500 FAST PCR system (Applied Biosystems, Foster City, USA). The primers used for the qRT-PCR were SLC35D2 (forward 5ʹ-TCCACTGGAGACCTGCAACA-3ʹ, reverse 5ʹ-GCTGTCGTCAGGGCTGAAT-3ʹ), C1D (forward 5ʹ-TTGGTCAAAATGGCAGGTGGAGAAATGAAT-3ʹ, reverse 5ʹ- TTAATGTTTACTTTTTCCTTTATTAGCAAC-3ʹ), and SMOC2 (forward 5ʹ-ATGACGACGGCACCTACAG-3ʹ, reverse 5ʹ-TCGCGTTGGGGTAACTTTTCA-3ʹ), SASH1 (forward 5ʹ-CGGTCCCAGATCGAAGAGTC-3ʹ, reverse 5ʹ-GTTCTTTCGGAAGTTCTGCCA-3ʹ), MMP2 (forward 5ʹ -TACAGGATCATTGGCTACACACC-3ʹ, reverse 5ʹ -GGTCACATCGCTCCAGACT-3ʹ), SPP1 (forward 5ʹ-ATAAGCGGAAAGCCAATGATGAGAG-3ʹ, reverse 5ʹ -TTTGGGGTCTACAACCAGCATATCT-3ʹ), KIF25 (forward 5ʹ -TGTTATGGCGTATGGACAGACG-3ʹ, reverse 5ʹ -TCAGCCACTCTAGGGATAATTCC-3ʹ), LAMA5 (forward 5ʹ -CCCACCGAGGACCTTTACTG-3ʹ, reverse 5ʹ-GGTGTGCCTTGTTGCTGTT-3ʹ), ARNT (forward 5ʹ-TAGTGCCCTGGCTCGAAAAC-3ʹ, reverse 5ʹ-GGTTCAAAACAGGAGTCACGG-3ʹ), FBXO38 (forward 5ʹ-TATGGCCTTCACCCTCGATAC-3ʹ, reverse 5ʹ-ACACCCACTAAGTTTGGGCAT-3ʹ), and AHCYL1 (forward 5ʹ-TCGCTCGATCTCACAGTCCT-3ʹ, reverse 5ʹ-TCCCGGCGTCCAAATTCTG-3ʹ). The PCR conditions were as follows: initial denaturation at 95 °C for 5 s, followed by 40 cycles of 95 °C for 30 s, 60 °C for 30 s, and a final extension step at 72 °C for 5 min. The data were analyzed using the relative quantification (ΔΔCq) method, with GAPDH expression levels used for normalization.

Statistical Analysis

The ROC curve was plotted in R with packages ROCR and pROC. Clustering and dimension reduction of DEGs were conducted by heatmap and plotMDS functions in R. The enrichment of osteoporosis-related genes in DEGs was tested by the hypergeometric distribution analysis. The correlation coefficient of variables with different units was calculated by the Spearman correlation analysis. The mean difference of cell proportions was evaluated by the two-tailed t-test.

Results

1460 Differential Expression Genes were Identified in Women Compared to Men

We analyzed the GSE15678 human osteoblast dataset and found 1460 genes with differential expression (DEGs) in women compared to men. Women and men osteoblast clusters are similar, but they do not form two distinct clusters (Fig. 1A). To visualize the samples based on DEGs, we used Multidimensional Scaling (MDS) which showed most of the samples clustered into two regions (Fig. 1B). Enrichment analysis indicated DEGs were associated with immune response and neurogenesis genes (Fig. 1C). Neurogenesis genes such as CTHRC1, SERPINF1, and CDK5RAP2 were down-regulated, while LAMA5, SPP1, and EPB41L3 were up-regulated in women osteoblasts. Sex-specific DEGs were observed, including Y-linked genes RPS4Y1, CYORF15A, NLGN4Y, EIF1AY, and JARID1D, which are significantly down-regulated in women osteoblast. In contrast, some X chromosome genes, such as HDHD1A, UTX, RPS4X, GEMIN8, and EIF2S3, are up-regulated in women (Fig. 1D). Furthermore, we found that some autosomal genes showed top changes in expression, with CTHRC1, PTGDS, AKR1C3, CTSK, and PHLDA1 down-regulated and HLA-DRA, RNASE1, MYH11, ACTG2, and LYZ up-regulated (Fig. 1D).

Fig. 1
figure 1

The differential expression of genes in women's osteoblast compared to men's in the GSE15678 dataset. A The heatmap highlights the different expression patterns of osteoblasts in women (red) and men (green). Rows represent genes and columns represent samples. B The Multidimensional Scaling (MDS) plot shows the two-dimensional space with green and red dots representing men and women, respectively. C The plot depicts the results of the functional enrichment analysis of DEGs by clusterProfiler. D The barplot displays the top 5 up-regulated and down-regulated autosomal genes and the top 5 up-regulated X-linked and down-regulated Y-linked genes in osteoblasts

Differential Genes were Enriched with Osteoporosis-Related Genes

We collected data from the NHGRI-EBI GWAS Catalog, which is a curated collection of all human genome-wide association studies, and from IMPC, a database of gene-knockout mouse phenotypes. We compared our DEGs and osteoporosis-related genes in GWAS Catalog and IMPC and found significant gene overlap between DEGs and GWAS Catalog (P < 0.01, Fig. 2A). Of the 166 overlapping genes, ossification and collagen catabolism-related genes were enriched (Fig. 2B). The nine genes shared among all three lists were GMDS, SMOC2, and SASH1 (up-regulated), MMP2, AHCYL1, ARRDC2, IGHMBP2, ATP6V1A, and CTSK (down-regulated), which were identified as key osteoporosis genes using the TRANSFC database (Fig. 2C). These genes were found to be enriched with motifs targeted by transcription factors LEF1 and ETS2, both of which were important in bone formation and mass [27, 28].

Fig. 2
figure 2

Differential genes were enriched with osteoporosis-related genes. A The intersection of three gene lists, DEG, GWAS, and IMPC. B Functional enrichment analysis of the 166 DEGs that overlapped with GWAS or IMPC gene lists using clusterProfiler. C The 166 genes were enriched with the target motif of two transcription factors, LEF1 and ETS2

Gene Co-expression Network Analysis Identified 8 Functional Modules

We used WGCNA to identify co-expressed modules for the key osteoporosis genes. The network construction parameter β was set as 14, which resulted in a scale-free network (Fig. 3A–C). We identified eight gene co-expression modules (Fig. 3D), which showed different expression patterns (Fig. 3E). We then performed module preservation analysis to determine the similarity of modules between men and women. We observed a general similarity between them (Fig. 3F). Functional analysis indicates that these modules are associated with different functions (Fig. 3G). For example, M1 is significantly associated with immune response and vesicle, and M2 is associated with development and collagen metabolism. We also analyzed the correlation of these modules with sex and found that modules M2 and M3 are down-regulated in women, while the other 6 modules, including M1, M4, M5, M6, M7, and M8, were up-regulated in women (Fig. 3H).

Fig. 3
figure 3

Gene co-expression network analysis of the 1460 differential genes identified 8 functional modules in osteoblast in dataset GSE15678. A The plot shows the network construction parameter β set at 14, resulting in a scale-free network. B The plot shows the regression R2 between log10(k) and log10(p(k)), which reached 0.94 with β set as 14, indicating a good fit. C The network mean connectivity stabilized when β was set as 14. D The dendrogram shows the identification of co-expression modules. The color bar shows the assignment of genes to different modules. E The plot illustrates the clustering of the 8 identified modules. F The dot plot shows the module preservation analysis using men as a reference, and the Zsummary statistic indicates the degree of module preservation. Zsummary > 10 indicates the module is highly preserved; 2 < Zsummary < 10 indicates weak to moderate evidence of preservation; Zsummary < 2, indicates no preservation. (G) Functional enrichment analysis of the module genes by g:Profiler. H The heatmap shows the relationship between module expression and gender by correlation analysis

Pathway Preservation Analysis Indicates the General Similarity Between Men's and Women's Osteoblast Transcriptome

We used the WGCNA modulePreservation function to analyze the similarity of pathways between men and women for the DEGs, which were distributed across different pathways. We selected eight KEGG pathways, including phagosome, osteoclast differentiation, fructose and mannose metabolism, lysosome, GnRH signaling pathway, cysteine and methionine metabolism, glycosaminoglycan biosynthesis -heparan sulfate/heparin, and mTOR signaling pathway, and analyzed only DEGs within these pathways. We employed the preservation statistic Zsummary (Z) as the primary measurement for the similarity/preservation between two pathways. Z > 10 indicates strong preservation, whereas 2 < Z < 10 indicates weak-moderate preservation and Z < 2 indicates no preservation. Our analysis revealed varying levels of pathway similarity between genders (Fig. 4A–K). Notably, pathways such as osteoclast differentiation, lysosome, and phagosome demonstrated significant similarity (Fig. 4A, J). Conversely, the GnRH signaling pathway, cysteine and methionine metabolism, glycosaminoglycan biosynthesis—heparan sulfate/heparin, and mTOR signaling pathways indicated no preservation. Gene co-expression network plots were used to further visualize the differences between these pathways for the individual genes (Fig. S1). For example, in the phagosome pathway, the gene ATP6V0E2 displayed weaker and fewer connections with other genes in women than in men (Fig. S1A). Similarly, in the osteoclast differentiation pathway, the gene SIRPA had fewer connections with other genes (Fig. S1B), while in the GnRH signaling pathway, MMP14 displayed a weaker connection with ADCY3 (Fig. S1B).

Fig. 4
figure 4

KEGG pathway-based analysis of the preservation between women and men in dataset GSE15678. A–C Three preservation statistics for all eight pathways are presented. Z > 10 indicates strong preservation, whereas 2 < Z < 10 indicates weak-moderate preservation and Z < 2 indicates no preservation. The dot labels are extracted from the first word for each pathway name. D–K The kME correlation for women and men in the 8 pathways, including phagosome, osteoclast differentiation, fructose, and mannose metabolism, lysosome, GnRH signaling pathway, cysteine, and methionine metabolism, glycosaminoglycan biosynthesis—heparan sulfate/heparin, and mTOR signaling pathway

Blood Methylation and Transcriptome Data Analysis Identified Key Osteoporosis Genes

The findings of a previous study suggest that blood has the potential to reflect the bone methylome and encompass sites linked to bone regulation [29]. We analyzed blood methylation data and identified 451 differentially methylated positions (DMPs) among our DEGs with adjusted P < 0.05, corresponding to 62 genes (Fig. 5A). Some top genes, such as RNF39 and KIF25, were identified by several probes (Fig. 5B). The CpG sites were distributed across different genomic regions, with the highest proportion in the gene body and CpG island (Fig. 5C). Functional analysis revealed enrichment of two bone abnormality-related terms in the Human Phenotype Ontology (HPO) database, including spondylolysis and delayed closure of the anterior fontanela (Fig. 5D), with genes included in HPO such as CTSK, FLNB, MMP2, RUNX2, and SMC3. Eight of the 9 key osteoporosis genes were among the 62 genes, with SASH1, SMOC2, and GMDS having the highest number of detected probes (19, 14, and 10, respectively) among the 9 key osteoporosis genes (Fig. 5E, F).

Fig. 5
figure 5

Blood methylation is changed in osteoporosis patients in dataset GSE163970. A The heatmap of the top 94 significantly changed CpG sites. B Histogram for the top 70 genes enriched by significant CpGs. The y-axis represents the number of detected methylated probes. C The distribution of methylation sites in different genome regions. D Functional enrichment analysis of the 62 genes by g:Profiler. E, F The distribution of methylation proportion of genes SMOC2 and GMDS in different genome regions

In the analysis of blood monocyte transcriptomes of pre- and postmenopausal women with low or high bone mineral density, we examined whether the 9 key genes could separate high and low BMD. Our results showed that only ATP6V1A had a moderate performance in separating subjects with high and low BMD in the two cohorts (Fig. 6A). Therefore, we conducted a transcriptome-wide analysis to identify other genes. In premenopausal women, ACAP2, FBXO38, and ARNT were the top genes with high performance (AUC > 0.85). In postmenopausal women, SLC35D2, C1D, ATP5PD, CTSS, and PFDN4 were identified as the top genes with high performance (AUC > 0.9). CTSS is an overlapping gene between the top-performance genes in both cohorts and is included in our DEG list. However, its separating performance differs in the two cohorts (Fig. 6A).

Fig. 6
figure 6

Clinical relevance of the key osteoporosis genes. A Blood monocyte genes in pre- and postmenopausal women can separate subjects with high and low bone mineral density, as shown in ROC curves for ATP6V1A and CTSS in dataset GSE56815. B Validation of key genes in Diversity Outbred (DO) mice identified the significant correlations of SMOC2, MMP2, ATP6V1A, and GMDS with bone traits in dataset GSE152708. C The cell proportion of CD4+, CD8+, and fibroblast significantly changed after denosumab treatment in human bone biopsies in dataset GSE141610

Confirm the Key Genes in Diversity Outbred (DO) Mice and Denosumab Treatment

In the Diversity Outbred (DO) mouse model [16], we found that both SMOC2 and MMP2 were positively associated with trabecular separation and negatively associated with trabecular number, while SMOC2 was up-regulated in osteoblasts isolated from female human donors. ATP6V1A and GMDS were positively associated with osteoblasts number, while ATP6V1A was down-regulated in women osteoblasts (Fig. 6B). In a human dataset of needle bone biopsies from the posterior iliac crest [5], we discovered that 6 of the key genes were up-regulated after denosumab treatment, including AHCYL1, ATP6V1A, CTSK, GMDS, IGHMBP2, and SASH1. Additionally, we identified that the proportion of CD4+ and CD8+ T cells increased, and that of fibroblast decreased after treatment in bone biopsies (Fig. 6C).

Validation of DEGs

To verify the above results, gene expression levels of DEGs were detected using qRT-PCR in femur samples from both postmenopausal osteoporotic patients and healthy subjects (Fig. 7). The results indicated that the up-regulation of SMOC2, SASH1, SPP1, and LAMA5 in femur was consistent with the results of DEGs, while the expression SLC35D2 and C1D was consistent with the results of postmenopausal women. Therefore, the independent data validated the credibility of our result.

Fig. 7
figure 7

Confirmation of DEG results via quantitative PCR was performed. Based on the overlapping genes present in the three gene lists, 11 DEGs (SLC35D2, C1D, SMOC2, SASH1, MMP2, SPP1, KIF25, LAMA5, ARNT, FBXO38, and AHCYL1) were selected for further qPCR validation. Femurs were collected from both postmenopausal osteoporosis patients (n = 8) and postmenopausal healthy subjects (n = 3). **P < 0.01, *P < 0.05

Discussion

Gender disparities in osteoporosis have been widely recognized, with a higher prevalence of osteoporosis and risk of fracture in women than in men [30]. While the current research has recognized gender differences in osteoporosis to a certain degree, deeper exploration into this topic could yield valuable new perspectives [31]. Our study for the first time analyzed the differences in gene expression in men and women osteoblasts, which play a major role in bone formation. We identified 1460 DEGs in osteoblasts and found they are enriched with known GWAS genes of bone density and osteoporosis. By incorporating all three gene lists, we identified 9 potential key genes associated with osteoporosis. Using WGCNA, we discovered functional modules associated with sex, and pathway-level analysis suggested that although similar, the individual gene behaviors are different in men and women. The clinical implications of these key genes were confirmed in blood methylation, transcriptome, DO mice, and drug treatment.

The transcriptome of human cell types has been extensively studied, however, the human osteoblast is rarely analyzed. Our study used traditional differential gene expression analysis to identify DEGs and identified several sex-related genes, indicating that the transcriptome data is valid and our DEGs workflow is correct. Sex-specific differences, including increased transcription of RPS4Y1, have been proven to be associated with osteogenic differentiation in human adult neural crest-derived stem cells [32]. DEGs were enriched with immune response and neurogenesis genes. The associations between immune imbalance, neurogenesis, and osteoporosis have been established [9, 33]. For example, neurogenesis genes CTHRC1 can positively regulate osteogenic differentiation in rat alveolar bone, and CDK5RAP2 variation can promote osteoporosis in a large cohort of postmenopausal women [34, 35]. For the first time in women's osteoblasts, we identified autosome genes related to bone diseases, despite some of these genes being previously well-studied. For example, CTHRC1 was found to be down-regulated in women's osteoblasts, while it was known to increase bone mass as a positive regulator of osteoblastic bone formation in CTHRC1 transgenic mice [36]. Similarly, CTSK, whose knockout or inhibition could promote tooth alveolar bone regeneration in CTSK knockout mice [37], was found to be down-regulated in women osteoblasts. In the plasma samples of postmenopausal women, the expression of protein lysozyme C (LYZ) was negatively related to BMD [4], while it was down-regulated in women. IGHMBP2 mutation will compromise several critical parameters of bone quantity and architecture in IGHMBP2 mutation mice [38]. We found that IGHMBP2 was down-regulated in women. MMP2, which is associated with a human genetic disorder characterized by subcutaneous nodules, arthropathy, and focal osteolysis [39], was also found to be down-regulated in women. Our study also identified other significant genes, such as AHCYL1, ARRDC2, GMDS, and SASH1, as potential novel genes associated with bone biology that have not previously been reported to be involved in bone. Overall, these results highlight the potential molecular differences in osteoblasts that may contribute to the susceptibility of women to osteoporosis.

To gain a confident list of key osteoporosis-related genes, we incorporated several datasets, including GWAS Catalog and IMPC. Our identified genes were enriched with functions such as cell differentiation, collagen metabolism, and ossification, which are vital processes in bone homeostasis. Additionally, the DEGs we identified were enriched with known osteoporosis genes from the GWAS database. This suggests that sex differences in osteoblasts may contribute to osteoporosis susceptibility in women. The common genes of the three gene lists were also found to be enriched with targets of transcription factors LEF1 and ETS2, which are known to play critical roles in bone development and osteogenesis [40, 41]. The expression of LEF1 was detected in primary cranial osteoblasts. In female mice lacking LEF1, a decrease in the number of osteoblasts and a reduction in trabecular bone mass were observed. However, these defects were not observed in male mice lacking LEF1 [40].

To further understand the complex polygenic nature of osteoporosis, we used WGCNA to analyze gene expression patterns in men's and women's osteoblasts. Functional enrichment analysis revealed that collagen metabolism genes were down-regulated in women, despite its important roles in bone resorption and formation [42]. As each module contains many genes, pathway-level analysis was performed, which was mainly based on the pathways that involve the 9 key genes. Most of the pathways have been reported to be associated with bone resorption. This includes vital processes like osteoclast differentiation, which plays a pivotal role in osteolytic bone destruction [40]. At the pathway level, we found that SIRPA has more connections with other genes in women, which cannot be revealed by module-level analysis such as WGCNA. SIRPA is an osteoclast-associated marker and is involved in MMP-9 secretion, which is vital in endochondral ossification and bone formation [43, 44].

For complex diseases like osteoporosis, gene expression is just one aspect of the system, and many factors may contribute to its development, such as gene interactions, gene modifications, cell interactions, and cell proportions. The methylation status of genes in the blood can serve as a potential indicator of systemic factors that influence gene expression across multiple tissues, including osteoblasts [29]. To validate our DEGs, we used blood methylation and transcriptome data and identified that 62 methylated genes overlapped with DEGs and were associated with bone malfunctions, indicating the potential of epigenetics in studying osteoporosis. We also identified ATP6V1A and CTSS as potential biomarkers for osteoporosis in our study. We, for the first time, found that ATP6V1A could moderately differentiate subjects with high and low BMD in both pre- and postmenopausal women. Only one study reported the differentially methylated ATP6V1A in postmenopausal women [45]. However, as the current data only contains 40 samples, future studies with larger cohorts should be carried out to confirm it. We also found that CTSS could differentiate patients but with different performances in the two cohorts, indicating possible additional different mechanisms of osteoporosis in pre- and postmenopausal women. In our DEGs, CTSK was down-regulated, while CTSS was up-regulated. Cathepsin K and S (i.e., CTSK, CTSS) are two isoforms of cysteine proteases with different functions, whose roles in bone resorption have been well-established. Osteoclast-specific CTSK deletion stimulates S1P-dependent bone formation [46]. CTSS deficiency alters the balance between adipocyte and osteoblast differentiation [47]. Our analysis provides potential biomarkers for osteoporosis, which could help doctors to treat osteoporosis patients more effectively.

To validate the identified key genes, we used DO mice and denosumab treatment datasets. We found that SMOC2 had the highest correlation with trabecular number. However, its role is still controversial as research has shown that SMOC2 knockdown inhibited osteoblastogenesis while SMOC2 overexpression inhibited osteogenic differentiation and extracellular matrix mineralization [48, 49]. For denosumab treatment in osteoporosis, we first discovered that the immune cells such as T cells are increased after treatment, while fibroblast is decreased. Only recently, the role of T cells in osteoporosis is discussed [2]. Fibroblast is the main source of FGF2 which plays a critical role in the modulation of the cartilage collagen network [50]. Our data indicate the potential pharmacological mechanism of denosumab.

One limitation of our analysis pertains to the validation datasets, where bone biopsies could encompass various cell types, including osteoblasts, osteoclasts, and bone lining cells. This diversity complicates the direct attribution of gene expression changes to a specific cellular source. As a reference indicates, osteocytes dominate the bone matrix as the most abundant cell type, constituting 90–95% of all bone cells, while osteoblasts and osteoclasts account for the remaining proportion (4–6% and 1–2%, respectively) [51]. We found that only two of the nine genes were more abundantly expressed in osteoclasts than in osteoblasts, including CTSK and ATP6V1A in datasets GSE31156 and GSE190615, which is consistent with BioGPS database showing higher expression in mouse osteoclasts [52]. Another limitation is that mouse BMD-related genes might be sex-specific. The sex-agnostic analysis of mice may miss some sex-specific genes. It is important to note that the differentially expressed genes identified in this analysis only represent a subset of those altered between the study groups. The genes reported here should be considered as strong candidate biomarkers, rather than an exhaustive list of all osteoporosis genes. Further explorations using more comprehensive analytical approaches may help uncover additional genes underlying the observed phenotypes.

In summary, our study used multi-omics data to identify key genes associated with osteoporosis, focusing on gender differences in osteoblast transcriptomes. Through module and pathway-level analyses, we identified differences in gene expression patterns between men and women. Additionally, we confirmed the clinical relevance of these key genes using blood methylation, transcriptome, and Diversity Outbred mice data. Our analysis of the denosumab treatment transcriptome also revealed its potential mechanism of action. Our findings provide promising biomarkers for osteoporosis diagnosis and treatment, shedding light on the complex mechanisms and gender differences underlying the disease.