Introduction

With the rapid social and economic development together with medical service improvement in China, infectious diseases endangering children’s health have been gradually brought under control. However, birth defects and many other adverse pregnancy outcomes remain prominent. Internationally, the ‘thalidomide event’ in the 1960s was the wake-up alarm for imperative requirement of pregnancy medication safety research1. Since then, the pregnancy medication safety has been one of the major, however difficult, subjects in the international maternal-fetal medical research. It was estimated that 25%–99% of pregnant women globally had drug usage2, and about 80% of them were exposed to drugs of unknown safety information3. Due to the special physiological changes in maternal condition and fetal development during pregnancy, exposure to drugs of unknown safety information, including the drugs approved for using by regular patients, could result in a series of irreversible adverse effects on pregnancy outcomes, such as birth defects, abortion, stillbirth, and so on4,5,6. It was reported that malformations of vital and secondary organs in neonates were associated with drug exposure in pregnant women7. However, due to ethical reasons, pregnant women could not participate in regular premarket drug safety tests, and it is virtually impossible to assess the actual drug exposure to fetuses before delivery. In addition, animal safety tests cannot accurately reflect drug teratogenicity in human embryos8. Therefore, the effect of maternal drug exposure on pregnancy outcomes needs to be thoroughly studied in larger real-world human populations.

As an important resource platform for etiological research, biobanks play important roles in mechanistic studies and precision medicine research. In recent years, multiple special biobanks have been established from Chinese cohort studies, such as the Taizhou cohort by Fudan University, the Breast Cancer Cohort Study in Chinese Women (BCCS-CW), the Born in Guangzhou Cohort Study9,10,11. The drug exposure and birth cohort (DEBC) in this study has achieved the largest sample size so far in China, with the availability of high-quality and well-annotated human biospecimens besides clinical and epidemiological data, and is at the forefront of Chinese cohort research. The biospecimens of DEBC included maternal peripheral blood, urine, umbilical cord blood, and tissue samples. With these abundant biological samples, our cohort study can provide the opportunities to combine the genetic and environmental factors to study the association between drug exposure during pregnancy and pregnancy outcomes, to explore potential dose-response relationships, and to provide more powerful evidence for mechanistic research.

The DEBC study in China is a prospective, longitudinal, and multi-centered cohort study, which ultimately aims to construct a comprehensive database of drug exposure information and associated adverse pregnancy outcomes for Chinese population. The DEBC was launched in August, 2018, and expected to recruit at least 150,000 cases and construct a corresponding biobank. All participants underwent five follow-up visits: in the first trimester (<14 weeks), in the second trimester (22–26 weeks), in the third trimester (32-36 weeks), at delivery, and within 42 days after delivery, respectively. The specific objective was to establish and estimate the correlation between the maternal drug exposure during pregnancy, especially in the first trimester (<14 weeks), and the adverse pregnancy outcomes, such as miscarriage/abortion, stillbirth, preterm birth, low birth weight and significant birth defects.

In this work, a total of 112,986 pregnant women are included in the current analytical database of DEBC, and we show that the drug exposure rate in the first trimester of pregnant women in this database is 30.70%. Among the drugs used, dydrogesterone and progesterone have the highest exposure rates, which are 11.97% and 10.82%, respectively. The overall incidence of adverse pregnancy outcomes is 13.49%. Dydrogesterone exposure in the first trimester shows associations with increased incidence of stillbirth, preterm birth, low birth weight, and birth defects, while displaying a decreased incidence of miscarriage/abortion. There appears to be a lower incidence of birth defects in pregnant women exposed to aspirin during the first trimester compared to those who were not exposed. Causative conclusions cannot be drawn from this cohort study, and the follow-up period of the birth cohort is still limited. This cohort will be further followed up and the data will be analyzed in-depth in future studies.

Results

Baseline characteristics

Up until 31 December 2021, the DEBC had recruited 150,577 participants. After excluding pregnant women lost to follow-up, without quality audit or reaching the stage of pregnancy outcomes, or with multiple pregnancies, a total of 112,986 pregnant women were included in the current analytical database (Fig. 1). Table 1 shows the baseline characteristics of the study population. Of the recruited pregnant women, 11.42% were of advanced maternal age (≥35 years), 66.74% had a college degree or higher education level, 94.35% had natural conception, and 59.03% had planned pregnancy. Regarding lifestyle factors, 43.23% of pregnant women and/or their husbands smoked at least one cigarette daily, 14.50% consumed alcohol at least once during the perinatal period (from 3 months before to 3 months into pregnancy), and 88.96% took folic acid or multivitamins containing folic acid during pregnancy.

Fig. 1: Study design and the number of subjects.
figure 1

The DEBC has recruited 150,577 participants. After excluding pregnant women lost to follow-up, those without quality audit, reaching the stage of pregnancy outcomes, or with multiple pregnancies, 112,986 pregnant women are included in the current analytical database.

Table 1 The demographic characteristics of pregnant women in DEBC

Meanwhile, the biobank of this analytical cohort had preserved about 580,000 processed biological samples including about 467,600 processed blood samples (plasma, serum and buffy coat), and about 79,000 urine samples from the pregnant women. Furthermore, ~7400 processed umbilical cord blood samples (plasma, serum and buffy coat), amniotic fluids (supernatants and amniocytes), placenta samples (maternal and fetal surface tissues) and other tissues were collected from the cases with the adverse pregnancy outcomes. About 26,000 processed biological samples from corresponding controls were also stored in the biobank (data not shown). The types and numbers of specimens collected during each pregnancy period and specimens collected for cases with different delivery outcomes are shown in Fig. 2a, b. Among initially recruited 150,577 participants, about 83%, 22%, and 4% of pregnant women provided blood samples in the first trimester, the second trimester, and the third trimester, and about 43%, 4%, and 4% of pregnant women provided their urine samples in the first trimester, the second trimester, and the third trimester, respectively (Fig. 2c). These samples constituted the biobank with rich biological resources for future research.

Fig. 2: The histogram of biological samples in the biobank.
figure 2

a The histogram of maternal samples collected during pregnancy. b The histogram of biological samples collected from participants with adverse pregnancy outcomes. c The number (left, ×10000) and percentage (right, %) of maternal samples collected during pregnancy. Source data are provided as a Source Data file.

Drug exposure

Our survey in 112,986 pregnant women included in this analytical database revealed that over 20 kinds of drugs had high exposure rates during the early pregnancy (Table 2). Among the drugs, dydrogesterone, progesterone, Fuzheng recipe (traditional Chinese medicine), levothyroxine, aspirin, estradiol valerate, allylestrenol, chorionic gonadotrophin, heparin and menotrophin were the top ten frequently used drugs. Dydrogesterone and progesterone had the highest exposure rates, which were 11.97% and 10.82%, respectively. The average duration of medication exposure ranged from 4.1 days (cefaclor) to 58.9 days (progesterone). Of note, there were prominent exposures of Chinese patent medicine including Fuzheng recipe, Ganmaoling granules, Warming and tonifying kidney-yang recipe, and Ban-lan-gen granules (the single largest component is the Isatis indigotica root).

Table 2 The descriptive analyses of drug exposure rate, formulation, route, dosage, and reasons of drug use during the first trimester

About 30.70% of the pregnant women (34,690) took one or more kinds of drugs in the first trimester. Compared with the pregnant women who did not take any drugs in the first trimester, the pregnant women who took drugs in the first trimester tended to be older, have higher education levels, use assisted reproductive technology, have a planned pregnancy, take folic acid supplementation, and be non-smokers or non-drinkers. The detailed statistical descriptions for these variables are shown in Table 3.

Table 3 The comparison of demographic characteristics between the drug-exposed and unexposed groups in the first trimester of DEBC

Adverse pregnancy outcomes

The total incidence of adverse pregnancy outcomes was 13.49%, of which the miscarriage/abortion was 2.52%, the stillbirth 0.77%, the preterm birth 5.15%, the low birth weight 3.87%, and the birth defects 3.56% (Fig. 3). According to the standards of ‘Chinese Birth Defects Monitoring Network (CBDMN)’, birth defects exclude the micro-malformations, unclassified congenital heart diseases, and atrial septal defects, while these conditions/complications were included in the total incidence of ‘adverse pregnancy outcomes’ in our study. Maternal age, education level, and fertilization way were all significantly correlated with the occurrence of adverse pregnancy outcomes (p < 0.05; Supplementary Table 1). Compared to pregnant women without adverse outcomes, those with adverse pregnancy outcomes tended to be older, have lower education levels (high school or below), and use assisted reproductive technologies (Supplementary Table 1).

Fig. 3: The incidence rate (%) of adverse pregnancy outcomes.
figure 3

Data are presented as mean values ±standard deviations. The error bars represent 95% confidence intervals (n = 112,986). Source data are provided as a Source Data file.

Drug exposure during the first trimester and associated adverse pregnancy outcomes

We assessed the association between medication use during the first trimester and adverse pregnancy outcomes after adjusting maternal age and disease in our database (Table 4). In particular, the incidence of adverse pregnancy outcomes such as stillbirth, preterm birth, low birth weight, and birth defects, were higher in dydrogesterone-exposed group than those in unexposed group. Meanwhile, there was a decrease in the incidence of miscarriage/abortion in the group exposed to dydrogesterone. There was no observed increase in the incidence of adverse pregnancy outcomes associated with maternal exposure to progesterone during the first trimester. Preliminary findings suggested an association between the use of Fuzheng recipe and increased incidence of most adverse pregnancy outcomes, while the use of Warming and tonifying kidney-yang recipe showed the opposite. The incidence of adverse pregnancy outcomes in the groups exposed to the other two Chinese patent medicines (Ganmaoling granules and Ban-lan-gen granules) showed no significant difference from those in unexposed group. However, further research is needed to validate these findings due to limitations in data collection and the complex composition of proprietary Chinese medicines. There was no significant difference in the incidence of all forms of adverse pregnancy outcomes between levothyroxine-exposed and unexposed group. Infants born to mothers who were exposed to aspirin or amoxicillin during pregnancy had a reduced incidence of birth defects or stillbirth, respectively, compared to infants born to unexposed mothers. Prednisone-exposed group had higher incidence of preterm birth and low birth weight, yet a lower incidence of miscarriage/abortion.

Table 4 Adjusted relative risks (aRRs) with 95% confidence intervals (CI) from log-binomial multivariate regression model, of adverse pregnancy outcomes associated with maternal medication in the first trimester, adjusted for maternal age and the first trimester maternal diseases that needed treatments (n = 112,986)

Discussion

The study of medication usage during pregnancy and adverse pregnancy outcomes is complex and requires large, detailed and realistic databases to capture representative information that could prompt further studies. However, internationally, there have been very rare cohorts specifically focusing on maternal medication exposure during pregnancy. China Teratology Birth Cohort (CTBC), which was expected to recruit at least 300,000 participants12, was a hospital-based, open-ended prospective cohort study with the aim to assess the risk of birth defects and other adverse pregnancy outcomes associated with maternal environmental and behavioral exposures during pregnancy. The current maternal drug exposure birth cohort (DEBC) was a sub-cohort of CTBC, aiming to explore the impact of maternal drug exposure during pregnancy on pregnancy outcomes, and to establish a biobank of high-quality and well-annotated human biospecimens for further research on the mechanisms of fetal health and diseases. DEBC had recruited 150,577 pregnant women and established a biobank with about 580,000 processed biological samples. This cohort is so far the largest cohort in China that has obtained actual data on pregnancy outcomes and maternal drug exposure during pregnancy. The database collected not only comprehensive information on medication use (including medication start and end dates, accurate doses, accurate gestational ages, etc.), but also a large amount of information on maternal lives, behaviors, environment factors, laboratory test data during pregnancy, and infant health outcome information. Moreover, a large biobank with abundant and diverse biological specimens has been established. It is worth mentioning that this study included traditional Chinese medicine according to the medication habits of the Chinese pregnant population, which would enable us to fill the knowledge gap in evidence-based medicine study on the use of traditional Chinese medicines in Chinese pregnant women. The powerful database overcomes the limited information on medication usage during pregnancy and retrospective bias in previous studies, and is important for exploring the association of medication exposure during pregnancy with pregnancy outcomes. In addition, the establishment of a harmonized working mode among different cooperating institutions and a ‘green channel’ for fast-track diagnosis of suspected birth defects by ultrasound would be a good example to implement and develop other birth cohort projects.

The initial biospecimens were collected from 83% of the recruited 150,577 pregnant women in the DEBC. Because we considered that the first trimester was the relatively most important period for fetal growth and development, of which the mistakes would lead to birth defects, the maternal peripheral blood samples and their processed samples (plasma, serum and buffy coat) during the first trimester accounted for the majority of the biospecimens in the biobank. Fewer biospecimens were collected during the second trimester and the fewest during the third trimester. In the DEBC, 83%, 22%, and 4% of pregnant women provided blood samples in the first trimester, the second trimester, and the third trimester, respectively. Totally, 4% of pregnant women provided their blood samples across the three trimesters of pregnancy. Regarding the urine samples, 43% of pregnant women provided their samples in the first trimester, and 4% in the second and third trimester.

Since this specific study was focused on exploring the association between drug exposure and adverse pregnancy outcomes, the current analytic cohort only included 112,986 pregnant women whose data on maternal drug exposure and pregnancy outcomes were complete and met the quality audit criteria. This is to date the largest cohort with available actual data on maternal drug exposure during pregnancy and pregnancy outcomes in China, although the present study has not covered most pregnant women across China. Studies covering the entire or most pregnant population in China are not feasible. The factors such as limited financial resources and manpower increased the difficulty of further expanding the sample size for this project. Significantly, this study included collection of a large amount of examination results during pregnancy, collection and preservation of various biospecimens, and multiple time points questionnaire surveys covering a total of 1551 variables. However, based on the information about population demographics, incidence rates of adverse birth outcomes and sample size analysis, the results of this study potentially represented the relatively most accurate assessment of medication use among pregnant women in China. The sample size was calculated according to cohort study requirement during study design, and the actual sample size that we had achieved greatly exceeded the statistical requirement of this study.

Since we included 112,986 pregnant women out of initially recruited 150,577 pregnant women for the current analysis for the reasons further explained in Methods, the representativeness of the current analytical cohort was a concern. Our analysis showed that there were no statistically significant differences in most basic characteristics between the database with 150,577 pregnant women and the current analytical database with 112,986 pregnant women (Supplementary Table 2). The incidence rates of adverse birth outcomes observed in this study were comparable to those reported in other domestic Chinese investigations and East Asian studies. In particular, the stillbirth rate aligned with East Asian estimates13, whereas preterm birth and low birth weight incidences were marginally lower than those in Chinese surveillance statistics and research derived from East Asian data14,15 (Supplementary Fig. 1).

It is very common for pregnant women to use medications due to chronic diseases and in unknown pregnancy situations16,17. The proportions of pregnant women taking medications vary by countries or regions. A study using data from two large birth defect studies found that about 90% of women in the United States took at least one medication during pregnancy and 70% took at least one prescription drug18. Ninety-three percent of pregnant women received drug prescriptions and dispensations with the usage of an average of 7.4 ± 5.5 different drugs in France, and the 5 most frequently prescribed drugs were paracetamol, iron, folic acid, phloroglucinol and colecalciferol19. A survey of the records of the French Health Insurance Service of drug prescriptions during pregnancy in 1000 women living in Haute-Garonne, southwest France, showed that 79% of women were exposed to drugs for which information about safety in pregnancy was not available from animal or human studies3. A large sample-size study conducted in Italy showed that 73% of pregnant women in the cohort were prescribed at least one medication during pregnancy20. Another registry study showed that about 80% of registered pregnant women in Italy received at least one prescription during their pregnancy, with the most prescribed drugs being folic acid and progesterone21. An analysis of the China Health Insurance Association (CHIRA) database in 2015 showed that 11.1% of 7946 sampled pregnant women (in-patients) used at least one medication during pregnancy, and in particular 5.4% of 2896 pregnant women in the first trimester used at least one medication; and the most frequently used medications were intravenous solutions containing vitamins and minerals, progestogens, and antibiotics22. Considering that self-medications by pregnant women outside of hospitals were not included in the CHIRA database, there were significant limitations in the data source. In addition, the drug exposure rate might be higher after the implementation of the ‘comprehensive two-child’ policy since 2016, due to the increasing proportion of pregnant women with advanced maternal age and pregnancy complications. Our cohort study is the first in China to report that drug exposure rate during the first trimester was as high as 30.70% in Chinese pregnant population, with significant differences in the rates of drug exposure among pregnant women with different demographic characteristics. Notably, medications in this cohort did not include nutritional supplements such as folic acid, vitamins and minerals. This study categorized the ‘coded drugs’ exposure during pregnancy. It was very beneficial for grasping the information about the types of drug exposure to Chinese pregnant women, and would be a treasure trove for evaluation of drug safety in pregnant women.

Dydrogesterone is an orally active progestin, and previous studies indicated that dydrogesterone treatment reduced the risk of abortion23,24. A systematic review and a randomized controlled trial concluded that the use of dydrogesterone was effective in the treatment of preterm abortion25,26. These results were basically consistent with the finding of the present study. There was not much research on the association between dydrogesterone and adverse pregnancy outcomes. A review in 2009 summarized 28 reported cases of diverse congenital birth defects, with musculoskeletal system defects and complex birth defects being the most common types, followed by masculinization, genitourinary tract defects, neural tube defects, and eye defects27. The data did not provide evidence for an association between congenital malformations and dydrogesterone use27. In 2015, a case-control study from Gaza reported an association between the drug use and heart defects in offsprings, with an adjusted advantage ratio of 2.71 (95% CI, 1.54-4.24)28. However, that study did not adjust for other relevant factors. The present study showed that exposure to dydrogesterone during the first trimester was associated with increased incidence of stillbirth, preterm birth, low birth weight, and birth defects. As this analysis did not rule out the influence of other maternal environmental and behavioral factors during pregnancy, except for maternal age and diseases, the causal association remains to be further verified.

This study found that maternal exposure to progesterone in the first trimester did not increase the incidence of adverse pregnancy outcomes after adjusting for maternal age and diseases. However, previously reported findings on the association between progesterone exposure during pregnancy and birth defects were inconsistent. Teratogenicity studies in rats29,30 and monkeys31 showed that the treatment with progesterone did not cause birth defects. A report from West Jerusalem claimed that progesterone use during pregnancy increased the incidence of birth defects32, but the study did not control for subjects who had already used progesterone, and mothers of only two abnormal infants were exposed to progesterone during pregnancy. Another project investigated over 500 progesterone-exposed pregnant women, but did not find an association between progesterone exposure and birth defects33,34. A collaborative study in West Germany that included 186 progesterone-exposed pregnant women also did not confirm that the hormone increased the risk of birth defects35. Other reports from clinical centers did not find an increase in the incidences of birth defects in infants delivered by progesterone-exposed pregnant women36,37. A case-control study independently examining hypospadias showed that the use of hormones, including progestin, was associated with an increased risk of the possible abnormalities38,39, but there was a lack of the relationship between the duration of hormone therapy and the location of hypospadias and its severity. Another study on hypospadias found an odds ratio (OR) of 3.7 (95% CI, 2.3–6.0) in pregnant women with progestin exposure40, but the information about the doses or the routes of administration was not available from most participants, and the types of progestin were not specified in detail. Our present study did not exclude other maternal environmental and behavioral factors during pregnancy, except for maternal age and diseases that needed medications. We will conduct more in-depth studies in the future. We believe that the large cohort data will add more accurate evidence to future studies on this topic.

The use of proprietary Chinese medicines (pCms) is common in pregnant women in China, and there is no accurate evidence on the effects of these medicines on pregnancy outcomes. The complexity of the composition of pCms has led to few insightful studies on this topic41,42. Preliminary analysis of the data from this cohort suggested, for the first time, that the incidence of miscarriage/abortion, stillbirth, and birth defects were increased and the incidence of low birth weight was decreased in the group of pregnant women who took Fuzheng recipe (one of the classical pCms) in the first trimester. The incidence of stillbirth was decreased in the group who took Warming and tonifying kidney-yang recipe in the first trimester. It is worth noting that traditional Chinese medicine supplements are composed of various herbs from different suppliers, and quantitative testing of their active ingredients is still difficult. Furthermore, the dosages of pCms in this study were acquired via questionnaires, hence necessitating more in-depth investigations and additional evidence for substantiation. Despite the limitations of this research, the information reminds us of the urgency and importance of evaluating the safety of these pCms usage during pregnancy.

Thyroid hormones are crucial for fetal development. The frequency of pregnancy with overt hypothyroidism was reported to be 0.3%–0.5%43. The most commonly used thyroxine medication during pregnancy is levothyroxine in China. In this study, we found that the exposure rates of levothyroxine in the first trimester was 3.28% in DEBC, and no statistically significant difference was found in adverse pregnancy outcomes between levothyroxine-exposed and unexposed groups. An observational cohort study showed that levothyroxine had no effect on the rate of live births in patients with recurrent miscarriage associated with subclinical hypothyroidism (SCH)44. The use of levothyroxine did not result in a higher rate of live births in normal thyroid-functioning women with thyroid peroxidase antibodies45,46. These results are in line with the findings of this study. In contrast, data from 13 cohort studies and randomized controlled trials showed that pregnant women with SCH treated with levothyroxine had a lower rate of pregnancy losses (OR 0.78, 95%CI 0.66–0.94)47. Dhillon-Smith et al. concluded that the use of levothyroxine in women with SCH is a key therapeutic measure to prevent miscarriage based on the latest available evidence48. Therefore, further research is needed to evaluate the efficacy of levothyroxine treatment in pregnant women with SCH and its impact on pregnancy outcomes.

Additionally, the study indicated that the use of aspirin during the first trimester was associated with a decreased incidence of birth defects. The finding of the relatively high rate of exposure to aspirin and heparin in pregnant women in China is particularly interesting. Thrombophilia has been implicated in adverse obstetrical events such as miscarriage, recurrent miscarriage, intrauterine growth restriction, severe pre‐eclampsia, and placental abruption. There was also reasonable evidence to suggest that some cases of miscarriage and recurrent miscarriage were associated with thrombosis of placental vessels and infarction49. Heparin can exert anti‐clotting effects by increasing the action of antithrombin, the natural anticoagulant. According to Chinese experts consensus on prevention and treatment of spontaneous abortion with low molecular weight heparin (LMWH) (2018) and Chinese expert consensus on diagnosis and management of recurrent spontaneous abortion (2022), LMWH and low-dose aspirin (LDA) are recommended in China to reduce the risk of adverse pregnancy outcomes such as miscarriage, recurrent miscarriage and venous thromboembolism50,51. Therefore, the evaluation of the safety of these drugs during pregnancy is urgent. In the future, we intend to conduct specialized analyses on the safety of various drugs exposed during pregnancy.

The study also assessed the maternal exposure to other drugs. The incidence of stillbirth was decreased in pregnant women exposed to amoxicillin during the first trimester with an aRR of 0.28 (95% CI, 0.03-0.99), compared to that in unexposed group in the present study. Amoxicillin is widely used to treat infectious diseases during pregnancy. There is no direct evidence in the literature that amoxicillin taken during pregnancy improves stillbirths. However, infections during pregnancy are an important cause of stillbirth, and more than 40 bacterial, viral and other pathogens have been associated with stillbirth52. About 50% of stillbirths or more may be caused by infection in low- and middle-income countries53. Therefore, we speculate that taking amoxicillin during pregnancy may improve maternal infection status and have a protective effect against stillbirth.

According to a randomized controlled trial, exposure to prednisone seemed to be an independent factor contributing to the risk of preterm birth, because pregnant women treated with prednisone in combination with low-dose aspirin were more likely to have preterm labor than patients treated with aspirin alone54. Another multicenter randomized trial also found higher frequencies of ‘serious’ maternal morbidity and preterm delivery among women randomly assigned to prednisone group than those in the group using low-dose heparin55. Prednisone is a corticosteroid, and some studies have shown that the use of higher doses of corticosteroid was associated with an increase in incidence of preterm birth56,57. These findings are similar to our results. Yet, deeper studies on the independent effects of prednisone are required.

Furthermore, it is notable that we need to balance the risks of using medication against potential consequences of not using them. For example, untreated pregnant women with preeclampsia have a greatly increased risk of seizures, endangering the health of both themselves and the fetuses58. On the contrary, cautious use of anti-epileptic medication can reduce the risk59,60. Similarly, our findings indicated that the use of dydrogesterone was linked to a lower incidence of miscarriage or abortion. Careful consideration of risks and benefits, accounting for gestational timing and dosage, is warranted before taking medication. However, there is still limited research on the differences in maternal and fetal outcomes associated with different medication regimens (different medications, doses and durations) for the same disease, as well as in-depth research combining internal and external exposures. These aspects of research will be addressed in future studies based on this cohort.

The limitations of this study were the following: (i) Causative conclusions cannot be drawn from this cohort study, and the follow-up period of the birth cohort is still limited. Further follow-up is needed for the discovery of association of metabolic and functional birth defects with drug exposure during pregnancy and for assessment of the long-term correlation of early life medication exposure with human health. (ii) The present preliminary analysis did not rule out the influence of other maternal environmental and behavioral factors during pregnancy, except for maternal age and diseases that needed medications, thus the results of the current analysis should be viewed with caution. (iii) Based on our original research design, no blank samples (tubes) were stored in our biobank to check for sample contamination. Yet, because the collection and processing of samples were carried out in clinical laboratories, in which the current national clinical laboratory standards and regulations were strictly followed, we judged that the possibility of sample contamination was very small but could not be absolutely excluded. (iv) The findings of this study were based on reports of drug exposure during the first trimester of pregnancy. Some of the pregnancy outcomes might result from drug exposure later in pregnancy, requiring further analysis of medication use throughout gestation to precisely correlate drug exposures with pregnant outcomes. (v) The drug dosages in this study were obtained from questionnaires, lacking internal exposure data. Future research will integrate both internal and external drug exposure information to provide stronger evidence for the safety assessment of medication during pregnancy. (vi)While the current sample size satisfied this study’s requirement, the cohort of 150,577 was still relatively small compared to China’s approximately 10 million annual deliveries. Since comprehensive national data on the characteristics of pregnant women are unavailable, it remains uncertain whether the present cohort accurately represents the overall pregnant population in China. However, the present study might be the most rigorous investigation to date on medication use among pregnant women in China.

Future studies should focus on stratified analysis based on specific maternal diseases and investigation of the connections between medications during pregnancy and adverse pregnancy outcomes, accounting for exposures timings, durations, routes, and doses.

Methods

Ethics

The DEBC was approved by the Ethics Committee of Sichuan University, China (Protocol ID: K2017045), and followed the tenets of the Declaration of Helsinki. ‘Written informed consent” from the patients were collected by the collaborating hospitals of DEBC when the patients were admitted to the hospitals. This study has obtained all relevant approvals from the Ministry of Science and Technology of China for this work.

Cohort recruitment

The cohort study was launched on August 1, 2018, and aimed to recruit 150,000 pregnant women. The recruitment was implemented with the cooperation of 49 hospitals at different levels (including tertiary or secondary hospitals) from 15 Provinces in China, of which Sichuan Province made the major contribution12.

Subjects eligible for cohort recruitment must meet the following criteria: Pregnant women who (i) had gestational age less than 14 weeks upon first prenatal visit; (ii) planned to establish their health records and deliver in collaborating hospitals; and (iii) were well informed about this study and willing to cooperate with the research and sign the informed consent. Pregnant women were excluded from the study if: (i) they had mental health diseases and could not cooperate with questionnaire investigation; (ii) the fetus had been dead in utero at the time of the first prenatal visit; (iii) they were not willing to cooperate with the research; and (iv) they were HIV-positive.

The 49 collaborating hospitals care for the general population of pregnant women, rather than a selected sub-population. Approximately 190,000 pregnant women come for antenatal care per year, of which 95% receive antenatal care throughout their entire pregnancy and delivery period. Among those 95% of pregnant women, around 30% met the inclusion criteria for this study. Ultimately, 177,150 pregnant women were approached, of which 150,577 consented to DEBC data collection, and a biobank with about 580, 000 processed biological samples has been established.

Follow-up strategies

In DEBC, pregnant women (participants) were recruited in the first trimester (usually between 6 and 14 weeks of gestation). At this point, each recruited participant was asked to complete questionnaires, and donate 8 mL peripheral blood (4 mL with EDTA and 4 mL without EDTA), and 20 mL urine (Fig. 4, and Supplementary Table 3). About 17% of recruited participants did not donate any sample. The recruited participants were followed up in the second trimester (22-26 weeks), the third trimester (32-36 weeks), at delivery, and within 42 days after birth (Fig. 4). The designed structured questionnaires were given at each pregnancy stage through a mobile APP or face-to-face interviews12. Maternal medical charts, clinical laboratory measurements from each follow-up and the pregnancy outcomes were abstracted from medical records by our obstetricians or investigators. When the case with fetus of a birth defect was diagnosed, the investigator was asked to stamp a star on the mother’s medical records for the follow-ups. For a suspected birth defect, we established a ‘green channel’ for fast-track diagnosis, meaning that when the fetus was suspected of a birth defect, the doctor could apply for a ‘green channel’ for the mother for consultation with three skilled and experienced sonologist in the West China Second University Hospital, Sichuan University.

Fig. 4: Flow chart of measurements and time points in the cohort.
figure 4

All participants undergo five follow-up visits: in the first trimester (<14 weeks), in the second trimester (22–26 weeks), in the third trimester (32–36 weeks), at delivery, and within 42 days after delivery.

The birth cohort study also aimed to establish a biobank containing diverse biospecimens. The standard operating procedures (SOP) were followed when collecting, processing and storing biospecimens such as samples of peripheral blood, urine from pregnant women, and amniotic fluid, umbilical cord blood, tissues from cases of adverse pregnancy outcomes, and controls in all collaborating medical institutions (Fig. 5a and Supplementary Table 4). According to the SOP, plasma, serum, and buffy coat were prepared from each blood sample; amniocytes and supernatant were prepared from each amniotic fluid sample. All blood samples were aliquoted within 2 h and then frozen at −80 °C, and the dissected tissues were immediately aliquoted and frozen at −80 °C.

Fig. 5: The diagrams of epidemiology and biobank information in DEBC.
figure 5

a Flow chart of the biospecimen processing, storage, and management. b The presentation of the information platform. c The main content in the DEBC study.

When pregnant women had miscarriage/abortion, stillbirth, or very preterm delivery (<32 weeks), they would skip the rest of follow-ups, and the obstetricians or investigators would complete the pregnancy outcome records directly (Fig. 4). When adverse pregnancy outcomes occurred, the related biological samples were collected by experienced doctors or nurses, according to SOP (Supplementary Tables 3 and 4). For every case of adverse pregnancy outcome, one control (normal fetus whose mother or parents signed the informed consent) was identified in the same research institution.

Data collection

The DEBC collected a large amount of data, thoroughly covering the maternal, paternal and fetal conditions. The first structured questionnaire given in the first trimester included eight parts, covering the basic information about parents, pregnancy history, health status, working status, living environment, health behavior, diet and nutrition, and sleep condition. The health status part of the questionnaire focused on the detailed information about drug exposure, including drug name, exposure time and duration, route, dose, frequency, and reason of administration during pregnancy. The detailed information in this questionnaire has been reported12. The questionnaires given in the second and the third trimester mainly focused on the medication, pregnancy complications, nutraceuticals, and sleep condition. The investigators verified the drug information (including prescription drugs and over-the-counter drugs) in hospital medical records, and made supplements or modifications in the questionnaires (if necessary) to ensure that drug exposure information was collected comprehensively and accurately. The maternal clinical data, including fundal height, abdominal girth, blood pressure, ultrasonic inspection results and clinical laboratory results, were also abstracted from medical records during the pregnancy.

The pregnancy outcomes including delivery/termination way of pregnancy, gestational age, detailed discharge diagnosis, infant sex, birth weight, length, head circumference, birth defect status and neonatal disease screening results, were abstracted from medical records by trained investigators. The questionnaire given within 42 days after birth was about neonatal health and possible newly arisen neonatal diseases.

Biological samples were collected under different scenarios. All recruited pregnant women were asked to donate peripheral blood and urine samples on the first investigation. Chorionic tissues or placenta samples were collected when the miscarriage/abortion or stillbirth occurred with unknown reasons in pregnancy. When a fetus was diagnosed with a birth defect, amniocytes were collected through amniocentesis during the second trimester. In the case of an abortion or a birth defect, amniotic fluid, placenta tissues, umbilical cord blood and fetal muscular tissues, pathological tissues were collected. Umbilical cord blood and placenta tissues were collected from women with very preterm birth, with very low birth weight neonates (<1500 g), and from controls with normal fetuses (Supplementary Table 3).

Quality control

The recruitments, follow-ups and biospecimen collections were managed by an information platform (birth cohort management system V1.0). This system combined the collected questionnaire data, medical examination results, clinical laboratory results and biospecimen information (Fig. 5b, c). Questionnaire data could be collected through mobile phones, personal computers, or manual inputs. The platform used various automated and intelligent measures to ensure high-quality data entry, including the logic limit and intelligent reminder in data entry, traceability for data collection process, and real-time feedback and error correction. The compliance of participants in follow-ups was improved through timely sending of messages and reminders. There were specialized quality management teams responsible for the maintenance of database and the biospecimen bank, respectively. They would check the data entries regularly and contact the responsible investigators to fill in omissions and correct errors. During the project implementation period, the researchers and staff with access to the research data signed the confidentiality agreement to ensure the data security. Meanwhile, every management account had its own permissions for the access to the questionnaires, biospecimens, or other responsible data. The collected data on the platform were uploaded daily and stored in the designated server in West China Second University Hospital, Sichuan University. The platform had sufficient security measures in the network front-end, data transmission and data storage.

We also implemented strict quality control measures for our questionnaire data. Surveyors for questionnaires received standardized training. The questionnaires were filled by recruited pregnant women under the guidance of a surveyor during their one-on-one meetings. After uploading the questionnaire, the system would check for missing data and logical errors. Additionally, the contents of the questionnaires were reviewed daily to ensure that the information collected was complete. A time period was set for each interview node, requiring interview completion to reduce recall bias. For all maternal and fetal health outcomes, there was a special quality control team to assess the quality of the medical health records from each sub-center to ensure the data accuracy. All laboratory tests were conducted locally at each collaborating hospital with regular quality control measures. In addition, we established ultrasound examination cooperation channels among our partners, and all suspected birth defect cases that could not be diagnosed locally could be sent to the hospital with the strongest diagnostic capability for diagnosis to ensure the accuracy of diagnosis of all birth defect cases.

The participants excluded from this analytic cohort included the following: (i) the cases that were lost to follow-up, meaning that the four follow-up questionnaires including those in the first trimester, the second trimester, the third trimester, and pregnancy outcomes were incomplete; (ii) the participants whose data failed to pass quality audit, meaning that the important data such as medication, pregnancy complications, neonatal delivery, etc. had not yet been verified in the hospital information system; (iii) the cases that had not reached the stage of pregnancy outcomes, meaning that the cases had not yet reached the final follow-up cut-off dates for pregnancy outcomes, such as pregnant women who were still in the second or the third trimester, or had not yet delivered; (iv) the cases of multiple pregnancies, including twins and triplets. Pregnant women with multiple pregnancies were excluded from this analytic cohort because the unit of the analysis was one fetus, and including multiple pregnancies would duplicate the calculation of a pregnant woman’s exposure to a certain drug during pregnancy, which might introduce bias and affect the validity of the results.

Besides, we regularly export and analyze the data to identify potential outliers. Considering that most quantitative data collected in this study followed a normal distribution, outliers were defined based on the 3-σ principle (if the absolute residual error of a measurement exceeded 3σ, it was considered an outlier). For identified outliers, we first cross-checked against the hospital information system and national monitoring systems. For data that could not be verified, telephone follow-ups were conducted for confirmation and correction. For the small portion of outliers that simply could not be rectified through the above steps, given our large sample size and the rarity of such outliers, we treated them as missing data.

The biospecimens collected for this study were also subjected to strict quality control measures: (i) The operators in each sub-center received standardized training; (ii) Each sub-center followed the same set of SOP for biospecimen collection; (iii) All biospecimens were collected in ice boxes and processed in sub-centers within 2 h after collection. Then each type of processed biospecimen from the same participant was processed into different vials which were immediately stored in −80 °C refrigerators. The frozen biospecimens were placed in dry-ice boxes (temperature ranging from −70 °C to −78 °C) which were transported across cities; (iv) These dry-ice boxes and refrigerators were equipped with internal temperature control and monitoring systems to ensure the absolute safe storage temperature; (v) All processed peripheral blood samples and urine samples were finally stored in −80 °C refrigerators, and biospecimens from placenta, muscular tissues and pathological tissues were stored in liquid nitrogen tanks at temperatures below −150°C in West China Second University Hospital. For final storage, multiple vials containing each type of processed biospecimens such as plasma, serum, and buffy coat from the same pregnant woman were not placed in the same box or refrigerator in principle, but were stored in different boxes and refrigerators to prevent unexpected problems caused by possible sample losses or breakdown of one refrigerator.

Data standardization

The drugs taken by pregnant women in this study were coded individually using the Social Security Drug Classification and Codes (LD/T90-2012), by which the nutritional supplements such as folic acid, vitamins, calcium, and iron were not included. Thus these nutritional supplements were not included in the drug exposures in the present cohort study. Adverse pregnancy outcomes included five conditions: miscarriage/abortion, stillbirth, preterm birth, low birth weight, and birth defects. Stillbirth was defined as fetal death in utero during delivery after 22 weeks of gestation; preterm birth was defined as delivery before 37 weeks of gestation in live birth; low birth weight was defined as birth weight <2500 g in live birth; and birth defects were defined with reference to the Chinese Birth Defects Monitoring Network (CBDMN) and International Clearinghouse for Birth Defects Surveillance and Research (Supplementary Table 5 and Supplementary reference). All birth defects were coded using ICD-10 for standardization (Supplementary Table 6).

Statistical methods

This study used Pearson’s χ2 test to compare the differences in demographic characteristics between the drug-exposed and unexposed groups in the first trimester. We used log-binomial multivariate regression to estimate adjusted relative risks (aRRs) for the association of maternal medication in the first trimester with adverse pregnancy outcomes after adjusting maternal age and the first trimester maternal diseases that needed treatments (diabetes, threatened miscarriage, infertility, thyroid disorders, common cold and influenza, vaginitis, thrombosis or antiphospholipid syndrome, hypertension, hepatitis, and other inflammatory diseases). P < 0.05 was considered statistically significant. All statistical analyses were conducted using STATA® (Version 12.1, StataCorp LLC, College Station, TX, USA) and SAS® software (Version 9.4, SAS Institute Inc., Cary, NC, USA).

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.