Introduction

Breast cancer molecular diagnostic tests include in a growing number of countries the mutational analysis of PALB2 (Partner and Localizer of BRCA2), in addition to BRCA1 and BRCA2. Nevertheless, the debate about the use of PALB2 mutation status as a pre-symptomatic biomarker in genetic counselling is still open in several countries [1]. Interestingly, the frequency of PALB2 deleterious mutations appears to be variable among different populations. Thus, the precise knowledge of mutations spectrum and frequencies in each specific population is important in order to improve the molecular analysis strategy and genetic counselling.

PALB2 is one of the additional cancer predisposition genes for which germline loss-of-function mutations have been the most frequently identified, although some studies failed to identify PALB2 mutations in breast cancer series from specific populations [24]. The first PALB2 family-based association study, conducted in the UK, estimated the relative risk conferred by PALB2 mutations at 2.3 [CI 95 %: 1.4–3.9] [5]. Nevertheless, subsequent population-based studies estimated the risk associated with at least some PALB2 mutations as higher [6, 7]. Two recent analyses conducted on large series of PALB2 mutations carriers showed that PALB2 mutations confer to their carriers a risk overlapping with that estimated for BRCA2 mutation carriers, supporting the classification of PALB2 as a high-risk breast cancer gene [8]. Knowing the contribution of PALB2 mutations to familial breast cancer in specific populations is warranted to be able to implement PALB2 analysis in genetic testing, especially as the frequency of PALB2 mutations vary depending on the population [5, 915]. Several rare recurrent mutations have been described in Australia [7], the North of Italy [16], China [17], Poland [18], Quebec [19] and Finland [6]. Nevertheless, no data are available from French population.

In the present study, we screened for germline PALB2 mutations 835 breast cancer patients from breast/ovarian cancer families and 662 unrelated controls recruited in the French national study GENESIS and in the oncogenetic clinic of the Paul Strauss Cancer Centre of Strasbourg (CPS series).

Methods

Study subjects

The study was conducted on a subgroup of subjects from the GENESIS (GEne SISters) French national study (Sinilnikova et al. submitted) and a series of patients selected at the oncogenetic clinics of the Paul Strauss Cancer Centre in Strasbourg (CPS). GENESIS index cases are women diagnosed with infiltrating mammary adenocarcinoma with a family history of breast cancer, having at least one breast cancer-affected sister. The recruitment was done from April 2007 to December 2013 through the French national network of cancer genetics clinics (Groupe “Génétique et Cancer” (GGC)) covering the entire national territory. The controls are unaffected friends or colleagues of index cases matched by age (±3 years). Other family members were included in the study if they consented to participate. Information about ethnic origin is self-reported by study subjects. In the present study we analysed the first 641 index cases and 592 controls included for which blood samples were available.

The CPS series consist of a total of 194 patients selected from the routine clinical testing, on basis of familial breast and/or ovarian cancers. For four of them, the index case was a breast cancer-affected male. 70 unrelated anonymous controls were evaluated.

All index cases analysed were tested negative for BRCA1/2 point mutations and large rearrangements.

The characteristics of study subjects are presented in Table 1.

Table 1 Characteristics and distribution of study participants

Ethics statement

All participants gave written informed consent. The GENESIS study protocol was approved by the appropriate ethics committee (CCP Ile-de-France III) and by the French data protection authority (CNIL).

DNA extraction

For GENESIS, genomic DNA was extracted from blood samples using the DNA extractor Autopure-LS (Qiagen), and DNA handling (normalization and aliquoting) was done using a TECAN EVO instrument.

For CPS, genomic DNA was isolated from 3 ml of blood using the manual extraction method Flexigene (Qiagen).

PALB2 mutation screening

The 13 coding exons of PALB2 (NCBI reference sequence NM_024675.3) were screened using the EMMA technique (Enhanced Mismatch Mutation Analysis, Fluigent), based on heteroduplex analysis by capillary electrophoresis in a specific high-resolution polymer [20, 21] (see Supplementary methods for details).

In silico analyses

The in silico analyses of PALB2 missense variants were performed using the freely available web-based programs Sorting Intolerant From Tolerant (SIFT) [22, 23], Align Grantham Variation Grantham Deviation (Align-GVGD) [24, 25] and Polymorphism Phenotyping version 2 (PolyPhen2), HumDiv-trained model [26, 27]. The protein multiple sequence alignment (PMSA) that we used for SIFT and Align-GVGD is a manually curated alignment using 10 species in which the most divergent sequence is that of the fish Danio rerio. This alignment is available at the Align-GVGD website (http://agvgd.iarc.fr/alignments.php).

All variants with a Minor Allele Frequency (MAF) <1 % have been tested for their potential effect on splicing using five different splicing variant predictors included in the AlaMut program (Alamut, Interactive Biosoftware, Rouen, France): SSF [28], Max-EntScan (MES) [29], Splice site predictor by neural network (NNSPLICE) [30], GeneSplicer [31] and Human Splicing Finder (HSF) [32]. A prediction was considered positive when the score of the new consensus site was at least 15 % lower than the wild-type score using MES or 5 % lower using SSF and when at least one of the other tools showed a reduction in the consensus score [33]. We considered the prediction to be indicative of the creation of a new splice site if its score is at least equal to 50 % of the wild-type score.

Results

Subjects included in the analysis

Of the 1275 GENESIS samples screened, 42 (23 cases and 19 controls) were excluded from the analysis because their amplicon failure rate was greater than 20 %. The distribution of the remaining 641 cases and 592 controls by age and self-reported ethnic origin is detailed in Table 1a. All the CPS samples (194 index cases and 70 controls) were successfully screened. The characteristics of GENESIS and CPS families are detailed in Table 1a and b, respectively.

The cases studied belonged to breast cancer families, of which 75 % (74 % in GENESIS, 77 % in CPS) had at least 3 breast cancer cases (Table 1c). Families including women affected with ovarian cancer were more represented in CPS (55/194 = 28 %) versus GENESIS (32/641 = 5 %).

Mutations identified

Mutation screening of the PALB2 coding region and flanking intronic boundaries performed in 1497 subjects (835 cases, 662 controls) identified four new deleterious germline truncating mutations: c.1186insT, c.1857delT and c.2850delC in three index cases and c.3418dupT in a control (Table 2; Figs. 1 and 2).

Table 2 PALB2 deleterious mutations identified
Fig. 1
figure 1

PALB2 truncating mutations and potentially deleterious missense variants. Upper part Schematic diagram of the PALB2 coding sequence with the 4 truncating mutations (in red) and the 2 missense variants classified as deleterious (in black) with the in silico tools. Numbers correspond to nucleotides of the coding sequence. Lower part depiction of sequences coding for functional domains (ChAM (chromatin-association motif), WD40 domains) as well as for regions of interaction with DNA and the principal PALB2 protein partners as reported in Uniprot website (http://www.uniprot.org/uniprot/Q86YC2)

Fig. 2
figure 2

Pedigrees of truncating mutation carriers. The pedigrees of the 4 truncating mutation carriers are shown. A c.1186insT, p.Cys396Leufs*5, B c.1857delT, p.Phe619Leufs*9, C c.2850delC, p.Ser951Leufs*2, D c.3418dupT, p.Trp1140Leufs*17. For each individual, year of birth and age at inclusion (in brackets) are indicated, if known. Grey symbols with an upper black corner on the left indicate breast cancer patients; for women grey circles with a lower black corner on the right indicate ovarian cancer patients, full grey symbols indicate patients with other cancer types. The plus signs indicate the status of the tested individuals. For each cancer patient, the type of cancer is indicated with the International Statistical Classification of Diseases and Related Health Problems (ICD) codes (C50* breast cancer, C16 stomach cancer, C189 colon cancer, C56 ovarian cancer, C81 Hodgkin lymphoma). The age of diagnosis is specified in brackets. The arrows indicate the screened subjects (index cases or control)

We also identified 32 different exonic rare variants, 12 of which were novel: two in-frame (one insertion and one deletion), 19 missense, 9 synonymous and the remaining two were single-base-pair substitutions in the 5′-UTR sequence (Table 3; Supplementary Table S1). All the 19 exonic missense variants with a reported MAF < 1 % were analysed for their effect on protein function using SIFT [23], Align-GVGD [25] and Polyphen-2 [27] programs (Table 3). Two missense variants (Fig. 1) were classified as potentially deleterious by the three bioinformatics tools: c.2816T > G and c.3128G > C. The pedigrees of the carriers are shown in Supplementary Fig. S1 and S2A, respectively. c.2816T > G, p.Leu939Trp, was detected in 5 index cases (MAF = 0.30 %) and 2 unaffected controls (MAF = 0.15 %). The c.3128G > C, p.Gly1043Ala variant, localized at a position well conserved in the WD repeat (see Discussion), was found in a 51-year-old healthy control with no history of breast cancer in the family. These two missense mutations have already been reported in cases and/or controls with a relatively similar frequency [2, 5, 10, 14, 16, 3437]. One missense substitution, c.3410T > C, classified as potentially deleterious by two tools, has been found in an index case and never reported (Supplementary Fig. S2B).

Table 3 In silico assessment of the effect of PALB2 missense variants on protein function

In addition, we detected 28 intronic different rare variants, 16 reported here for the first time (Supplementary Table S1): 6 were small insertions and/or deletions and 22 were single-base changes.

All the variants, except the truncating mutations, have been tested for their potential effect on splicing using five different splicing variant predictors: Splice Site Finder (SSF) [28], MES [29], NNSPLICE [30], GeneSplicer [31] and HSF [32] (Supplementary Table S2). None of them was predicted to be a bona fide splicing variant. The exonic variant, c.3350G > A changes the last base of exon 12, potentially weakening the canonical donor splicing site, but only SSF predicts that the substitution could affect splicing (reduction of wild-type score by 13 %).

Eleven well-known SNPs with a reported Minor Allele Frequency (MAF) >=1 % were also found (Supplementary Table S3).

Discussion

Biallelic mutations in PALB2 (also called FANCN) cause Fanconi anaemia [37], a rare chromosome instability syndrome marked by congenital anomalies, bone marrow failure and severe paediatric cancer susceptibility. As it is the case for other Fanconi anaemia genes, monoallelic loss-of-function mutations in PALB2 have been shown to increase the risk of developing breast cancer [5, 13]. The PALB2 protein interacts with BRCA1 through its coiled-coil N-terminal domain and with BRCA2 through the seven-bladed WD40-type C-terminal region (Fig. 1), acting as a bridge between these two proteins [38]; PALB2 mediates BRCA2 recruitment to DNA damage sites and is therefore essential for BRCA2 function in double-strand break repair by homologous recombination [39]. As attested by numerous publications (Supplementary Table S4), PALB2 is one of the breast cancer susceptibility genes for which germline loss-of-function mutations have been the most frequently identified in breast cancer families of different populations.

This study is the first report of a complete mutation screening of PALB2 in French population. Four new germline truncating mutations were found: c.1186insT, c.1857delT, c.2850delC and c.3418dupT. The first three mutations are clearly pathogenic because they create a premature stop codon expected to trigger nonsense-mediated mRNA decay (NMD) [40]. The c.3418dupT mutation is located in the last exon, therefore it is not supposed to trigger NMD. However, it induces the loss of the last 45 amino acids that are part of the WD40 domain in the RAD51-interacting region, shown to be essential for a fully functional PALB2 protein (Fig. 1). At least two confirmed PALB2 pathogenic truncating mutations associated with breast cancer and/or Fanconi anaemia map to the last 150 nucleotides: c.3459C > G [41] and c.3497delG [42]. Thus, all four new truncating mutations detected in this study can be considered as pathogenic. One of these truncating mutations (c.3418dupT) has been found in a healthy control (55 years) whose sister was diagnosed with breast cancer at 45 (not tested) (Fig. 2d). The presence of truncating mutations at a low frequency (0.08–0.2 %) in healthy controls has already been described in two studies screening for the recurrent mutations c.509-510delGA in Poland [18] and c.1592delT in Finland [43] (Supplementary Table S4). In addition, the reported penetrance of PALB2 mutations is increasing from 45 to 80 years of age [42]. Thus, the presence of c.3418dupT in a 55-year-old control does not preclude its causality.

In our study, PALB2 mutations were detected in 2 breast cancer only families and 1 breast/ovarian cancer family but not in the 22 families with cases of male breast cancer. There are few reports of PALB2 pathogenic mutations in male breast cancer patients and no evidence of association with increased risk [34, 44, 45]. Similarly, PALB2 mutations in familial pancreatic cancer seem to be very rare [46], and indeed, we did not find any PALB2 mutation in four pancreatic cancer families that we also screened (data not shown).

To date, at least 37 publications reported PALB2 screening in breast cancer cases from different populations and cohorts (Supplementary Table S4). Ten studies analysed more than 500 cases in populations without founder mutation effect and revealed a mutation frequency spanning from 0.73 %, [CI95 %: 0.27–1.59], to 3.40 %, [CI95 %: 2.35–4.73]. In our screening, truncating PALB2 mutations have been found with a frequency of 0.36 %, [CI95 %: 0.07–1.05] (Supplementary Table S4). The lower frequencies we observed could be a consequence of ascertainment bias. On the other hand, the hypothesis that, in the French population, the frequency of PALB2 mutations in breast cancer families is lower than in other countries cannot be excluded, but further data are necessary to confirm these differences.

Two missense variants were classified as potentially deleterious with all the three in silico predictors used, but as yet, no genetic epidemiologic data support the possible pathogenicity of these variants, even if two of them mutate amino acids of the WD40 domain (c.3128G > C and c.3410T > C). It has to be noticed that to date, no PALB2 missense variants could be classified as definitely pathogenic. This does not rule out the possibility that some may be deleterious, but suggests that they should be very rare and very large cohorts of cases and controls will be needed to determine their associated risks [47].

Finally, we did not identify any founder mutation, thereby confirming the necessity of PALB2 full-gene sequencing in the diagnostic screening.

This study shows that PALB2 mutations make a small contribution to the heritable breast cancer susceptibility in French population. Nevertheless, screening for inherited loss-of-function mutations in PALB2 is recommended to enter clinical practice in France as in other countries. The generalization of inclusion of PALB2 in diagnostic gene screening panels will allow obtaining a precise evaluation of mutation frequency and will provide more data for penetrance and risk estimate for breast and ovarian cancer, an essential step to get an exhaustive cancer predisposition counselling and mutation-targeted personalized therapies.