Introduction

Schneiderian inverted papillomas (IP) are rare sinonasal tumors comprising approximately 0.5% of all nasal neoplasia. Historically, the etiology of IP had been ascribed to allergy or chronic sinusitis. Evidence of an etiological role for human papillomavirus (HPV) in IP arose in the late 1980’s. Yet almost three decades later, and after major technological advances that have increased the detection sensitivity of HPV, the association between HPV and IP has yet to be universally accepted. This is probably due to the disparate HPV detection rates in IP reported in the literature.

Few would dispute the causal role of HPV in the development and progression of cervical cancer. Biological similarities between cervical and the upper aerodigestive tract epithelium, specifically the oropharynx, have led to studies of HPV in the head and neck. Early HPV studies were inspired by the presence of koilocytosis in IP and “viral-like” clinical features such as multicentricity, tendency towards recurrence, and potential for malignant transformation. IP bears some histological resemblance to exophytic papillomas, which commonly contains high copy numbers of HPV. The relationship of HPV to the etiology of IP is not merely of academic interest. The weight of published evidence supports that HPV typing is clinically significant with respect to surveillance and potential therapy for patients with cervical dysplasias and laryngotracheal papillomatosis. We assert that it is important to examine the relationship of HPV to IP, as significant associations between HPV, IP and outcome parameters (disease recurrence or malignant progression) would potentially impact clinical surveillance protocols for these patients also. Therefore, we analyzed the weight of published data [135] regarding HPV and IP. We sought to address the following questions: why do the HPV detection rates in IP vary so greatly? What is the relationship between low-risk and high-risk HPV types and HPV detection rates in IP? Is there a relationship between the presence and type of HPV in IP and recurrence and malignant progression? Is HPV also involved in sinonasal carcinogenesis unrelated to IP?

Methods

We conducted an exhaustive literature search using the NIH Pubmed search engine to identify citations published in English from January 1987 through December 2006 using the MeSH terms ‘‘HPV’’ and ‘‘Inverted”, “Exophytic”, “Oncocytic Schneiderian” or “Fungiform papilloma’’. Additional searches including terms “HPV” and “Nasal”, “Paranasal”, “Sinonasal”, or “Squamous Cell Carcinoma” were also used. Priority was given to studies with type-specific HPV results from biopsy or tissue resections, and that clearly described the HPV testing methods. No limit was set a priori on number of cases given the rare nature of the disease, and no restriction on upper aerodigestive anatomic site was imposed.

Data Abstraction

Data abstracted from publications included year of publication, study country, number of cases, method of specimen preservation (fresh frozen or paraffin embedded), method of case procurement (retrospective selection from archives or prospective collection from cohort), histology of specimens (grade of dysplasia, carcinoma, or recurrent papilloma), target for HPV analysis (DNA or RNA), method of HPV detection and typing (polymerase chain reaction [PCR], in-situ hybridization [ISH] or Southern blot [SB]), HPV primers used (for PCR-based assays), HPV types genotyped or probed for, and overall and type-specific detection of HPV. Cases was separated by histological description (no or low-grade dysplasia, moderate to severe dysplasia, carcinoma, and recurrent papilloma), and by nosological classification (fungiform/exophytic papilloma, inverted papilloma and oncocytic Schneiderian papilloma). The geographic region of each study was classified into one of three categories: Europe (UK, Denmark, Germany, Switzerland and Finland), North America (USA), East Asia and Middle East (Japan, Korea and Egypt).

Results were summarized for HPV types 6/11 and 16/18, although additional types were also included when data were provided. Overall HPV detection rate was defined as persons testing positive for any HPV type divided by the total population and summarized by method of detection (PCR vs. ISH) and whether consensus primers were used. Studies varied substantially with respect to method of HPV detection and genotyping. All but eight of the 32 studies assessed tested for the four main HPV types (6, 11, 16 and 18), of which two tested for HPV 6, 11 and 16 [1, 2], two tested for HPV6 and 11 [3, 4], one tested for HPV 16 and 18 [5], and three tested only for HPV 16 [6], HPV 11 [7], and HPV 57 [8], respectively. Type-specific HPV detection for 6/11 and 16/18 were assessed only among those specimens tested for the specific HPV types in question; therefore, the sample size varies between the type-specific analyses. Multiple infections were separated into constituent types; thus, type-specific detection represents types found alone or with other HPV types. Consensus PCR primers MY09/11, PGMY09/11, GP5+/6+, and SPF10 were considered to amplify the 18 HPV types most commonly associated with cervical and head and neck cancers (6, 11, 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 66, 68, 70, 73, 82, and 83) as well as additional high- and low-risk types. It is important to note that an HPV type was only considered tested for if it was, a) amplified by the primers and subsequently genotyped in analyses using PCR-based methods, or b) probed for using sequences specific to those types in the DNA-based (SB) and RNA-based (ISH) analyses.

Statistical Analysis

HPV results were stratified by histology and other variables believed to potentially impact HPV prevalence, including method of detection, geographic region, and specimen histology. Tests for heterogeneity (between-study variability) were conducted, and weighted prevalence estimates and 95% confidence intervals (CI) were calculated using a random-effects inverse-variance model stratified on study [36, 37]. Analysis of individual-level data (e.g., by gender, age, and smoking status) was not possible, as these were not available by HPV status in the studies evaluated. The association between HPV and occurrence of recurrent IP was estimated by random-effects inverse-variance weighted odds ratio (OR).

Results

There were 32 genotype-based investigations published in English peer-reviewed full manuscripts between 1987 and 2006, studying HPV in IP; these publications represent the basis of our literature review [19, 1235]. These studies are summarized in the Appendix. Two studies using only Southern blot [7, 15] and one study probing only for HPV 57 [8] were omitted. Table 1 presents pooled detection rates from our review studies by methodology. Results were tabulated specifically to each technique; Brandsma studied cases by both SB and ISH, [13], and six studies used both ISH and PCR techniques [12, 1618, 21, 31]. The inclusion of IP with dysplasia or carcinoma was slightly more common among studies using PCR (12/18) vs. ISH (10/18), although several (n = 4) studies used both methods of detection (see Appendix). A wide range of detection rates are seen for studies utilizing ISH techniques (0–79%), and PCR, either with type specific probes (0–77%) or consensus probes (2–67%). Interestingly, as detection thresholds have decreased, and the number of published reports using PCR-based techniques has increased, we see no increase in the overall range of HPV detection rates. Weighted estimates revealed similar detection rates across methods, 26.8% (95%CI 16.4–37.2%) by ISH, 25.2% (95%CI 14.7–35.6%) by consensus PCR, and 23.6% (95%CI 12.2–35.0%) by type-specific PCR. Analysis of heterogeneity by PCR primers used was not possible, as many of the PCR-based studies used several combinations of different primers.

Table 1 HPV detection rates in IP––by method

In order to assess the sources for differences in HPV detection rates between individual studies, we further stratified the results from these studies [19, 1235] by geographic location, histology (benign, recurrent, dysplastic, malignant) and viral subtype. There was insufficient reported data to determine whether age and gender influence HPV prevalence in IP, or by combined geographic and histologic categories. All studies demonstrated a male predominance with a median age in the sixth to seventh decade of life.

The location of each study was classified into one of three geographic locations: North America, Europe, and Asia. Given the large differences in diversity between regions across Asia (e.g., between East Asia and the Middle-East), countries from Asia were restricted to the Orient (East Asia) including Japan and Korea. After omitting a single study from the Middle East (n = 10 cases) [6], we see no significant differences in detection rates in studies conducted in North America (117 positive of 433 cases; weighted prevalence = 27.8%, 95%CI 18.2–37.3%) [4, 1214, 16, 1820, 2227, 29, 30, 33] when compared with Europe (45 positive of 166 cases; weighted prevalence = 36.7%, 95CI 10.9–62.5%) [2, 3, 9, 21, 31, 32, 35] and East Asia (35 positive of 108 cases; weighted prevalence = 34.5%, 95%CI 6.3–62.7%) [1, 5, 28, 34].

HPV Types Distribution in Inverted Papillomas

A preponderance of HPV 6/11 has been detected in IP as compared to HPV 16/18; the overall unadjusted ratio of low-risk to high-risk HPV types is 2.8:1. Low-risk/high-risk coinfections are rare and found in three studies [2, 9, 29]. Several studies also specifically tested for HPV 31/33/35 [12, 16, 2022, 24, 28, 31] as well as HPV 42–45, 51, 52, 56, 58 [1, 22, 24, 31] with negative results. Ogura [1] found HPV 57 in 1 of 3 cases and Wu [8] detected 57b in 9 of 15 IP studied; however this could not be confirmed by others [30].

HPV is More Commonly Detected in Exophytic Papillomas

HPV is detected more often in exophytic papillomas (EP) than IP. Table 2 summarizes the 15 studies comparing HPV in both IP and EP; some studies also included oncocytic Schneiderian papillomas (OSP). This included studies using only ISH [13, 20, 29, 34] as we believe there is no significant loss of detection sensitivity in this context (Table 1). Table 2 demonstrates a range of detection rates in IP from 0–72%, HPV was found in 61 of 375 cases with a weighted prevalence of 16.9% (95%CI 9.5–24.2%). The HPV detection in EP ranged from 0–100%, HPV was found in 63 of 104 cases with a weighted prevalence of 65.3% (95%CI 44.7–85.9%); and the overwhelming majority detected was HPV 6/11. Eight studies analyzed OSP and found HPV to be absent in all 25 cases tested.

Table 2 HPV detection rates in different sinonasal papillomas

HPV is More Often Detected in Dysplastic, Malignant and Recurrent IP

Table 3 summarizes the reported data on HPV positive IPs that included sufficient information regarding dysplasia and malignant progression with respect HPV viral type. We see that HPV detection rates significantly increase (Wald t-test p [ 0.02) in IPs with high-grade dysplasia (weighted prevalence of 55.8%, 95%CI 30.5–81.0%) and carcinoma (weighted prevalence of 55.1%, 95%CI 37.0–73.2%) as compared to IPs with no dysplasia or mild dysplasia (weighted prevalence of 22.3%, 95%CI 15.9–28.6%). Furthermore, the preponderance of low-risk HPV in benign IP (ratio low-risk/high-risk = 4.8:1) shifts in dysplastic and malignant IP. The low-risk/high-risk ratio is 1.1:1 for IPs with high-grade dysplasias, this ratio is inverted to favor high-risk HPV (1:2.4) for malignant IP.

Table 3 Association of HPV in benign versus dysplastic versus malignant IP [2, 5, 12, 14, 2125, 2629, 31, 33]

We identified a total of 230 sinonasal squamous carcinomas, unrelated to IP, that were studied for HPV; these reports included sufficient information regarding the histological or clinical absence of IP [2, 3, 5, 9, 19, 25, 26, 35, 3843]. HPV was detected in 46/230 of these carcinomas unrelated to IP, with a weighted prevalence of 21.8% (95%CI 11.6–31.9%). In most cases, high-risk HPV 16/18 are detected. This proportion is significantly lower than the rate of HPV detection in sinonasal squamous cell carcinomas that were associated with IP (weighted prevalence 55.1%, 95%CI 37.0–73.2).

We also investigated the association between HPV detection and disease recurrence in IP. Table 4 summarizes the data with respect to (benign) disease recurrence and HPV status from 9 studies that also included genotype data for recurrent IP [1, 2, 9, 12, 17, 25, 27, 28, 44]. Recurrences developed in 44 of 236 patients; HPV was detected in 27 of 44 IPs (weighted proportion of 57.9%, 95%CI 31.6–84.2%) that developed recurrences and in 24 of 192 IPs (weighted proportion of 9.7%, 95%CI 4.4–15.0%) that did not develop recurrence. The presence of HPV was significantly associated with the likelihood of developing recurrence (weighted OR of 10.2, 95%CI 3.2–32.8).

Table 4 Association of HPV in IP with Recurrence [1, 2, 9, 12, 17, 25, 27, 28, 44]

Discussion

Methodology Does Not Impact HPV Detection Rates for IP

The widely disparate HPV detection rates reported for IP served as the impetus for this review. Our detailed literature analysis confirmed a wide range of detection rates for studies utilizing ISH, and PCR, either with type-specific probes or consensus probes. Interestingly, detection sensitivity has increased over the decades and the number of published reports has also increased, yet we see no increase in the overall range of HPV detection rates. Attempts to explain these divergent HPV infection rates generally focus on technical issues such as the nature of the specimens and detection method used. Technology has progressed from direct genome detection (dot blot, ISH) to target amplification (PCR) with the potential for tremendous increased sensitivity. For example, ISH is said to require 50 viral copies per nucleus for detection, as compared to PCR, which can detect a single viral copy per sample. ISH has the advantage of confirming target localization. PCR is generally touted as being superior to ISH due to its inherent increased detection sensitivity, yet this is not evident from our review. Five studies from Table 1 directly compared HPV detection in IP by ISH and PCR; two studies were uniformly negative [16, 18], two demonstrated increased HPV detection by ISH [17, 21] and one study demonstrated a non-significant increase in HPV detection by PCR as compared to ISH (23% versus 20% respectively) [12]. It should be noted that we cannot rule out the possibility that the differences in prevalence with respect to detection method may be due to geographic or histologic differences between study populations. Nonetheless, prevalence overall was slightly higher for ISH despite that studies involving IP with dysplasia or carcinoma were more likely to have used PCR than ISH.

Theoretically, PCR with consensus primers would cast a “wider detection net” for exploratory studies, as compared to type-specific oligonucleotide primer pairs. The use of HPV consensus primers, amplifies a highly conserved L1 region of HPV 1A, 2, 5, 6, 8, 11, 13, 16/18, 26, 27, 30, 31/33, 35, 39, 40, 41, 42, 44, 45, 47, 48, 51, 52, 53, 54, 55, 57, 58, 59, as well as uncharacterized types. If IPs were generally associated with HPV types other than HPV 6/11 or 16/18, then PCR with consensus primers would result in a significantly higher detection rate. Table 1 demonstrates that the use of consensus primers did not increase detection rates as compared to type-specific primers. However, a caveat is that HPV typing following PCR with consensus primers was usually performed only in cases positive by consensus primers. In total, there have been 11 PCR studies using consensus primers, all for the L1 domain. As the L1/L2 regions, as well as E1 through E5 regions may be lost upon viral integration, using only L1 primer (such as MY09/MY11) may not detect integrated HPV. The E6 region of HPV, however, is not usually lost. Therefore, a combination of early and late region primers has been recommended to increase detection sensitivity [45].

Specimen type and fixation, and length of tissue storage, are factors that affect DNA preservation and retrieval for PCR. DNA extraction from snap frozen tissue generally leads to better samples, and thus better greater sensitivity for molecular studies. Since IP are rare, collection of frozen tissue is not feasible and virtually all studies used formalin-fixed paraffin embedded archival tissue. The protein-DNA and DNA-DNA cross-links formed with formalin fixation hamper DNA extraction for the initial steps of PCR. Only two PCR based studies used prospectively collected frozen tissue samples [32, 35] with differing detection rates (4/6 or 67% and 3/26 or 12% respectively).

Primer pairs that target large DNA fragments (e.g. MY09/MY11 which form 450 bp amplicons) tend to perform poorly in archival samples compared with primers targeting shorter segments (e.g. GP5/GP6, which form 140 bp amplicons). Only one study compared these two primer sets for IP, and confirmed the increased detection sensitivity using the GP5/GP6 probes [30]. Increasing specimen age also impacts detection sensitivity; Kashima reported the HPV detection rate in sinonasal papillomas to decline from 20% in one-year old archival specimens to 2% in 6 year-old archival specimens [19].

It is unlikely that institutional factors impact the HPV detection rates in IPs. Head and neck SCC, for instance, have inherent biological variations due to tumor subsite. Referral patterns for different medical institutions and inherent population differences can impact stage at diagnosis and outcome. By contrast, IP tend to be an anatomically and histologically uniform group of neoplasia. They are uncommon and most studies are retrospective, studying archival tissue, which has inherent limitations, as discussed above.

Latent Sinonasal HPV Infection is Rare

Studies addressing HPV latency in the oral cavity demonstrate a bimodal age distribution, peaking among neonates, and then again in adolescents and young adults. Asymptomatic HPV infection in children younger than one year of age most likely represents vertical transmission [46]. A recent meta-analysis investigating neonatal oral and/or genital HPV recovery rates at birth found a pooled mother-to-child transmission rate of 18% for vaginal delivery and 8% for Cesarean section for HPV positive mothers [47]. The pooled relative risk for neonatal HPV exposure is 4.8 (95% confidence interval: 2.1–10.9) for HPV positive mothers. However the actual HPV infection rate following vertical transmission is unknown, but assumed to be very low. With respect to sinonasal infection, HPV has been recovered from neonatal nasopharyngeal secretions of 47% of vaginally delivered babies with HPV positive mothers [48]. As infants become older, the rate of latent oral HPV infection decreases, therefore, latent neonatal infection is probably cleared or the viral load is reduced to undetectable levels.

A number of studies have addressed the issue of latent sinonasal HPV infection beyond the neonatal period. HPV was found in only one of a total of 216 sinonasal mucosal samples, and in none of the 91 sinonasal polyps or sinusitis samples studied [2, 5, 8, 12, 13, 15, 17, 24, 30, 33, 35, 4951]. The study by Bryan appears to be an outlier with an unusually high rate of HPV 6/11 detection in nasopharyngeal mucosa (9/15 or 60%) [3]. Only two studies from patients with IP specifically document the presence of HPV in Schneiderian mucosa in two samples procured adjacent to IP [13, 33]. Overall, we can conclude that latent HPV infection is highly uncommon to normal adult Schneiderian mucosa.

HPV is More Often Detected in Dysplastic and Malignant IP

HPV detection rates increase dramatically, when one stratifies HPV detection in IP for the presence of moderate-severe dysplasia and carcinomas arising in IP, (Ca-ex-IP). Caruana detected HPV in 29% (2/7) benign IP, 50% (4/8) dysplastic IP and in 50% (2/4) of Ca-ex-IP [26]. Likewise, Beck detected HPV in 54% (12/22) of IP, 71% (5/7) of dysplastic IP, and 70% (7/10) of Ca-ex-IP [22]. By contrast, the rate of HPV detection in sinonasal SCC that are unrelated to IP is lower (22.3%, 95%CI 15.9–28.6). Interestingly, El-Mofty noted that the HPV positive carcinomas tended to have a nonkeratinizing basaloid histomorphology, akin to the histology of HPV positive tonsillar carcinomas [42].

HPV Detection as a Predictor of Recurrence

It has been stated that HPV detection correlates with IP recurrence. All 4 recurrent cases of Siivonen were HPV positive (3 HPV 11, 1 HPV 16) [10]. Beck reported recurrence in 13 of 15 HPV positive IP patients and in none of 10 HPV negative patients [44]. Investigating the association between HPV detection and disease recurrence across studies in this review, we find that the presence of HPV in an IP is significantly associated with the likelihood of developing recurrence in IP.

Inverted Papilloma Represents a Benign Neoplastic Process

IP represent a benign proliferative neoplastic process. Califano demonstrated clonality with respect to X chromosome inactivation in all 4 informative IPs arising in women [52]. Loss of heterozygosity (LOH) was not detected in any IP, including those with dysplasia. Six patients in this study progressed to develop SCC; all of which demonstrated LOH of at least one microsatellite locus whereas the corresponding IPs did not contain any LOH.

In malignancies, the p53 gene status and HPV status can discern different pathways of carcinogenesis (wild type p53/high-risk HPV positive tumors versus mutated p53/HPV negative tumors). One might query as to whether genetic differences exist between benign IPs containing HPV and those which do not, as that might speak to different etiologies. However, there is no evidence for benign IP containing p53 mutations. Caruana studied 6 benign IP, and 2 of them were HPV positive, yet all had wild-type p53 sequenced; p53 mutations were found in dysplastic (3 of 8) and malignant (1 of 4) IPs [26].

HPV Integration in IP

The role of HPV integration in IP has been addressed in one study that found evidence supporting integration in one benign IP and two SCC-ex-IP [33]. Additional studies validating the issue of viral integration in IP would be interesting and necessary. Taken together, the above data support the hypothesis that IP represents a clonal tumor process, which requires additional promoting steps prior to malignant transformation. There is no data to explain why many benign IP remain HPV negative.

Does HPV produce a “Hit and Run” Phenomenon in the Pathogenesis of IP?

Viruses can transform or immortalize cell lines, but are not necessary for the maintenance of this phenotype, this has been referred to as the “Hit and Run” phenomenon [53]. Loss of HPV 18 from previously transfected, immortalized, transformed cell lines is consistent with this, vis-à-vis the maintenance of malignant phenotype [53]. The inability to detect either viral protein or genome does not necessarily negate a viral role in initiation.

We hypothesize that HPV is responsible for the initiation of IP but is not necessary for its maintenance. We offer the following explanation for the differences in HPV detection rates as detailed above. (Fig. 1) Low-risk HPV is detected in the majority of exophytic papillomas. The hyperkeratotic superficial epithelium of EP is permissive for HPV replication. We hypothesize that HPV reinfection of EP by assembled virions is a common event accounting for the high detection rate of low-risk HPV in EP. By contrast, for unknown reasons, IP tend to be nonkeratinizing; thus we would not expect much viral replication. As superficial IP epithelium is shed, HPV can be lost, accounting for the low HPV 6/11 detection rate in benign IP. Other steps are necessary for malignant IP progression; this would include either primary or secondary infection with high-risk (oncogenic) HPV infection and HPV integration. If viral integration is more common in dysplastic and malignant IP, this would explain the greater HPV 16/18 detection rates demonstrated in these lesions.

Fig. 1
figure 1

Hit and Run Hypothesis for HPV infection and Schneiderian papillomas. Figure hypothesized sequence of events in the pathogenesis of IP and EP. (A) Establishment of sinonasal HPV 6/11 infection is necessary for initiation. The biological and environmental factors which predispose an individual and particular sinonasal subsite towards infection are unknown. (B) In the nasal septum, the lamina propria is not loose and distensible, and the seromucinous ducts and glands are less concentrated as compared to the lateral nasal wall and paranasal sinus mucosa. HPV-induced epithelial hyperplasia of septal mucosa results in an exophytic growth pattern with fibrovascular cores, as seen in this diagram. EP tends to be keratinizing, the reasons for this are uncertain. The hyperkeratotic environment is permissive for viral replication (represented by grey spheres). Shedding superficial epithelial cells release assembled virions which reinfect the EP (black curved arrows); thus HPV 6/11 is commonly detected in EP. (C) The architecture of the lateral nasal wall and paranasal sinuses allows for HPV infection here to cause polypoid masses with squamous metaplasia and hyperplasia of the underlying seromucinous ducts, as seen in this diagram. For unknown reasons, IP tend to be nonkeratinizing; thus we would not predict viral replication and re-infection. As superficial epithelial cells are shed, HPV (grey spheres) can be lost from IP (black arrows), accounting for the low HPV 6/11 detection rate in IP. (D) Other steps are necessary for the progression to dysplasia and malignancy; they would include secondary infection with HPV 16/18 and/or integration of HPV 6/11 and/or HPV 16/18

Conclusion

The HPV detection rates in IPs vary widely. Increased assay sensitivity has not substantially changed the HPV detection rate in IP; we believe that this is not a function of limited detection sensitivity and it seems unlikely that future technologies will reveal new paradigms with respect to HPV detection rates in IP.

This review confirms that EP is associated with consistently higher HPV detection rates than IP. We believe that as EP are permissive for viral replication, resulting in epithelial re-infection and accounting for the high HPV DNA detection rates. As IP are generally non-keratinizing, they may not be able to maintain HPV infection. Therefore we hypothesize that low-risk HPV may induce IP formation, and then are lost as infected cells are shed. Interestingly, the HPV detection rates increase in dysplastic IP and SCC-ex-IP with increasing ratio of high-risk to low-risk HPV types, compared to nondysplastic IP. We believe that one explanation for the variation in HPV detection rates between different studies may be the actual histologic composition of the cohort. That is, if one series contains a higher frequency of dysplastic and malignant IP, it may have a higher detection rate than another series which contains only nondysplastic IP.

Dysplasia in IP results from either primary or secondary infection with high-risk HPV. Progression to malignancy requires additional steps such as loss of tumor suppressor genes and viral integration. The rate of integration may explain why HPV is detected in the majority of tonsillar SCC, despite the fact that these cancers are usually nonkeratinizing. Although HPV integration is not necessary for carcinogenesis, it is a common event in tonsillar SCC [54, 55]. Additional studies are needed to demonstrate the temporal association between viral integration and progression to dysplasia in IP. The implication of the present study is that HPV sub-type testing may identify patients at risk for recurrence, or progression to dysplasia and malignancy, and thus may impact surveillance protocols.