Introduction

Lynch syndrome is the familial cancer syndrome caused by a germline mutation in one of the DNA mismatch repair (MMR) genes [1]. Cancers in this syndrome characteristically have microsatellite instability (MSI) and abnormal immunohistochemistry (IHC), in which the MMR gene with the germline mutation is not expressed in the tumor [2, 3]. MSH2 is one of the most important genes causing Lynch syndrome, but traditional exon-by-exon sequencing will not always detect the germline mutation. MSH2 incurs frequent Alu-mediated large deletions, which cannot be detected by exon sequencing [4]. Additionally, silencing of MSH2 can occur due to deletion of the polyadenylation signal of the EPCAM gene located 5′ to MSH2. Deletions of the 3′ end of the EPCAM gene abolish transcription termination, which results in transcription read-through into the MSH2 gene, and subsequent methylation-induced silencing of the MSH2 gene in tissues that express EPCAM [5]. The presence of MSI with the absence of MSH2 expression in a colorectal cancer (CRC) is highly suggestive of Lynch syndrome-MSH2 type, but in some instances, no germline mutation can be found in the MSH2 gene, even when testing for large deletions in MSH2 or EPCAM.

Our referral clinic focuses on hereditary CRC, and over the past 9 years we have identified germline mutations in DNA MMR genes in 101 patients using commercial genetic reference laboratories. We have also collected samples from twelve patients who had tumors with absent MSH2 protein expression, MSI, a family history suggestive of Lynch syndrome, but negative germline test results. The most likely explanation for this combination is that the germline testing was unable to detect true germline mutations in these patients. Even the best commercial testing methods do not detect mutations deep within introns or copy number-neutral inversions whose breakpoints are located in noncoding regions of the gene. Two of our patients with failed commercial testing were later found to have deletions in the 3′ region of EPCAM, but no explanation could be found for inactivation of MSH2 in the other ten [5].

One important type of mutation not examined by current testing methods is the presence of large inversions which result in rearrangement of the order of the exons of the gene. We sought to identify locations of potential inversion breakpoints in MSH2 by looking for allelic drop-out of single nucleotide polymorphisms (SNPs) in a series of long overlapping (~10 kb) PCR products. Starting with one patient with suspected MSH2-type Lynch syndrome, we found a long PCR product which was completely homozygous at all SNPs sequenced. This product encompassed exon 8 and part of intron 7 of MSH2. Upon review of the literature, we found two publications which had previously described a 10 Mb inversion in the MSH2 gene, in which the 3′ breakpoint was located in the same region as our long PCR amplicon [6, 7]. Thus, the objective of our study was to determine if this inversion of the MSH2 gene previously described was present in a group of patients with suspected MSH2-type Lynch syndrome of unknown origin.

Methods

Patients

Patients were selected from our familial CRC registry based upon suspicion of having MSH2-type Lynch syndrome. Patients previously had tested negative for MSH2 germline mutations by commercial genetic testing services. All patients had exhibited loss of MSH2 expression by IHC. Germline testing, IHC, and MSI results listed in the table were provided by CLIA certified labs. Family members of patients who tested positive for the inversion were subsequently enrolled in our study and tested for the inversion when possible. All patients provided written informed consent, and the study was approved by the Baylor Research Institute institutional Review Board.

Control Patients

Five control patients with no known family history of CRC were tested using the primers designed by Wagner et al. [6] for the 5′ inversion breakpoint. 22 controls with no known history of familial CRC were tested using the primers designed to amplify across the 3′ breakpoint.

Inversion PCR

Patients and controls were tested for the 5′ inversion breakpoint using primers F3 and R3 published by Wagner et al. [6]. Primers F4MV and B3MV were designed to amplify across the 3′ inversion breakpoint using MacVector (Cary, NC, USA). The forward primer sequence was 5′-GGGAGGGGAAAATGACTTACAAAG-3′. The reverse primer sequence was 5′-GCAAAAGGAACAGTCAGCAGAAGG-3′. PCR was performed using HotStar Taq (Qiagen, Valencia, CA, USA). Both inversion primer pairs only amplify a product in carriers of the inversion. Inversion PCR products were sequenced on an ABI 3100-Avant sequencer (Applied Biosystems, Foster City, CA, USA). An additional 1.6 kb PCR that amplifies exons 12–13 of MSH2 was included on all patient samples to exclude the possibility of false negative results due to poor DNA integrity (Fig. 1).

Fig. 1
figure 1

Inversion-specific PCR. Representation of PCR assays used to detect the MSH2 inversion. Primers F3 and R3 were described by Wagner et al. [6] and are used to amplify the 5′ inversion breakpoint. Primers F4MV and B3MV were designed in our lab to amplify the 3′ inversion breakpoint. Amplification is achieved in carriers of the inversion due to re-orientation of the primer directionality. No amplification occurs in patients with wild-type MSH2. Amplification of exons 12–13 of MSH2 using 12F and 13R was used to verify DNA integrity

SNP genotyping

Patients were genotyped at multiple SNPs in MSH2 by PCR and DNA sequencing and/or denaturing high performance liquid chromatography (dHPLC). Primers and dHPLC conditions are available upon request.

Allelic drop out PCR

Two PCRs were designed to look for allelic drop out in a long PCR product from inversion carriers. A short PCR product was designed to amplify and genotype SNP rs7607076, which is located in MSH2 intron 7 downstream of the 3′ inversion breakpoint. A second set of primers anneal to each side of the 3′ inversion breakpoint and only amplify the wild type allele. This results in allelic drop out in the long PCR product in carriers of the inversion who are heterozygous at SNP rs7607076. PCR products from both the short and long PCR products were sequenced and genotyped at rs7607076 (Fig. 2).

Fig. 2
figure 2

Design of PCR analyses used to detect allelic drop out in carriers of the MSH2 inversion. a Long and short PCR amplicons and their positions on chromosome 2 relative to the 3′ inversion breakpoint, MSH2 intron 8, and SNP rs7607076 are depicted. b Sequencing results of rs7607076 in the long and short amplicons for patients tested for the MSH-2 inversion by use of inversion-specific PCR. Only one allele is amplified in the long PCR of inversion carriers which results in the SNP appearing to be homozygous

Results and discussion

Starting with one patient with suspected MSH2-type Lynch syndrome, we found a long PCR product which was completely homozygous at all SNPs sequenced. Sequencing of short PCR products containing multiple SNPs confirmed allelic drop out in the long PCR product. This product encompassed exon 8 and part of intron 7 of MSH2. Upon review of the literature, we found a previously described 10 Mb inversion in the MSH2 gene, in which the 3′ breakpoint was located in the same region as our long PCR amplicon. PCR and sequencing using inversion-specific primers described by Wagner revealed that this patient carried the inversion previously reported [6].

We next performed PCR of the 5′ inversion breakpoint on our remaining suspected MSH2-type Lynch syndrome patients, and six patients were positive for the inversion (Table 1; Fig. 3). We designed additional primers to amplify the 3′ inversion breakpoint using the sequence data provided by Chen [7]. PCR products from both the 5′ and 3′ breakpoints were sequenced for confirmation (Fig. 1). No amplification of the 3′ inversion PCR product was seen in 22 controls. A 1.6 kb PCR amplification was included in all samples to exclude the possibility of false negative results due to problems with DNA integrity.

Table 1 Clinical characteristics of unexplained MSH2-type Lynch syndrome patients
Fig. 3
figure 3

Pedigrees of patients who are carriers of the MSH2 gene inversion. Cancer history is illustrated as reported by the probands. Only Lynch syndrome-related cancers are shown. Inconsistencies in cancer incidence and ages of onset were found when comparing related kindreds, emphasizing the necessity for caution when interpreting family histories provided by patients. CIS carcinoma-in situ. Plus sign indicates the individual is a carrier of the inversion. Minus sign indicates the individual tested does not carry the inversion

After identifying the six initial patients, we tested additional family members where possible, and 21 family members from five families revealed an additional seven inversion carriers (Fig. 3). To confirm our initial results we designed an additional PCR amplification to test for allelic drop-out in carriers of the 10 Mb inversion. Similar to the original long PCR, this PCR amplification relies upon a common SNP located near one of the inversion breakpoints. Primers were designed which amplified a region of the MSH2 gene encompassing both the common SNP and the neighboring region where the 3′ breakpoint is located. This strategy amplifies only the wild type allele of MSH2, and therefore patients who are heterozygous for the internal SNP are homozygous in the PCR product if they also carry the inversion in MSH2. Only carriers of the inversion displayed allelic drop out in the long PCR, and no inversion carriers had amplification of both alleles (Fig. 2).

Initial screening for mutations in the MSH2 gene had indicated that all carriers of the inversion also carried the C allele at SNP rs2303428. Genotyping of several additional SNPs within the MSH2 locus revealed that all inversion carriers may have had a common haplotype, suggesting the possibility that the inversion derived from a common ancestor (Table 2). Upon additional in-depth analysis of the pedigrees of the five affected families, we discovered that two of the carriers were first cousins. Based upon information in the pedigrees, we believe a third family may be related to these two families; however an exact connection could not be determined. No additional connections could be made among these three families and the other two families. None of these families was previously known to be related to the others.

Table 2 Patient genotypes at multiple SNPs within the MSH2 locus

Based upon our current information we are unable to determine if the mutation we have found represents a founder effect in our local population, or if it applies to a wider population of suspected Lynch syndrome cases. In order to address this issue, we are currently in collaboration with multiple groups to collect DNA samples from mutation negative Lynch syndrome patients throughout the United States.

Our results resolved a considerable proportion of previously unexplained MSH2-type Lynch syndrome families in our registry. Of 10 patients tested, six carried this specific inversion. Of the remaining four unexplained cases, two cases (213,474) had non-synonymous alterations in the coding region of the MSH2 gene which were categorized as “variants of uncertain significance” by the reference laboratory, which may have accounted for Lynch syndrome in these two patients, as both are predicted to be deleterious by SIFT analysis and “probably damaging” by PolyPhen analysis. Of the remaining two cases, one case (328) was negative for standard diagnostic criteria for Lynch syndrome (Amsterdam II criteria, Bethesda guidelines, and a low PREMM 1,2,6 score [8], but was included in the study because of loss of MSH2 expression in a Lynch syndrome-associated cancer and a family history of multiple cancers. Although it is generally assumed that loss of MSH2 expression occurs only in the setting of Lynch syndrome, it remains possible that in some cases the loss is due to somatic mutation, especially in cases with weak or no family history of CRC [9]. Our study did not include exhaustive testing for other inversions in MSH2 in the remaining patients. It is possible that other inversions may explain additional cases of Lynch syndrome in which genetic testing fails to find a causative mutation. Current genetic testing methodology uses convenient approaches and only looks for mutations which can be identified by direct exon sequencing and copy number variation. Even more labor intense methods such as Southern blots and sequencing of full length RT-PCR transcripts may fail to find more obscure mutations such as inversions. This limitation is exemplified in our study. Although Wagner initially discovered the 10 Mb inversion using Southern blotting, a commercial testing service performed Sothern blotting on one of our patients and failed to find this MSH2 gene inversion. New techniques need to be developed to capture all mutations causing Lynch syndrome, and other diseases in which the culpable mutation cannot be identified by current methods. In our analysis, we used allelic drop out in long PCR analysis as a method to find regions of possible inversions. This method could be further developed to look for other inversion in the MSH2 gene, as well as in other genes where current methods of testing fail to find a causative mutation. In the meantime, clinical testing services should add an analysis for the 10 Mb inversion to their clinical testing repertoire. This study also demonstrates the importance of enrolling patients with suspected Lynch syndrome, but negative germline testing results, into research registries where their DNA can be tested until an answer is found.