Introduction

It is well recognized that transcriptionally active human papillomavirus (HPV)-associated oropharyngeal squamous cell carcinoma (OPSCC) is a distinct cohort of head and neck squamous cell carcinomas that, as a group, portend a better prognosis than HPV-independent tumors [1,2,3,4]. HPV-associated OPSCC tends to affect younger, healthy men who often present with cervical lymph node metastases despite small primary tumors. The histologic features of these tumors have not been well characterized and correlated to patient disease free survival (DFS) and overall survival (OS) in a large group of patients managed in a standardized approach by a single health care delivery system [5,6,7,8,9,10,11,12]. It is the goal of this study to evaluate the clinical and histologic features of 390 patients with OPSCC and identify potential statistically significant factors which may predict patient outcome.

Materials and Methods

All patients from the 11 Southern California Permanente Medical Group hospitals with locally advanced OPSCC diagnosed between 2006 and 2012 were evaluated. Patients were excluded if: (1) they had received any chemoradiation prior to biopsy (i.e., no recurrent tumor or different tumor type) or did not receive initial treatment within our organization; (2) they had a history of head and neck cancer or prior head and neck radiotherapy; or (3) the patient had left the care of Southern California Permanente Medical Group before completion of therapy. An oropharyngeal centered tumor must have involved the base of tongue, palatine tonsillar fossa, pharyngeal wall and/or soft palate [13].

This clinical investigation was conducted in accordance with all guidelines of an Internal Review Board authorization (#5968 and 11178). It was in part supported by a grant from the Regional Research Committee of Kaiser Permanente Southern California (KP-RRC-20161103). Electronic medical records were reviewed with additional information obtained as needed.

The resulting 390 patients’ records were analyzed. All patients were treated with curative intent after initial surgery with definitive concurrent chemoradiation (n = 345), induction chemotherapy followed by chemoradiation (n = 44), or radiotherapy alone (n = 1). Specific chemotherapy protocols were employed and are separately reported [14,15,16,17]. Importantly, all patients were retrospectively clinically restaged using American Joint Committee on Cancer (AJCC) 8th edition staging, including any subsequent corrections, for both HPV-associated and HPV-independent oropharyngeal squamous cell carcinomas [18].

Review was performed of all histology materials available with preference given to excisional samples of the primary tumor (i.e., if primary and metastatic tumor were both available, core biopsy versus resection, the evaluation focused on the primary resection tumor). The histologic features were evaluated based on hematoxylin and eosin stained slides from surgical pathology material. This was done with the understanding of the limitations of identifying pertinent features in a biopsy versus resection sample. The data extracted included patient demographics along with tobacco use at the time of diagnosis (ever or never; if ever, current or former; pack-year history was aggregated into < or ≥ 10 pack-years); alcohol use (ever or never; if ever, light, moderate or heavy use), anatomic centering of the primary tumor and laterality, and metastatic disease with location, number, and size of involved lymph nodes. Imaging findings, types of treatment, current patient status (alive or dead, with or without disease and whether local, regional or disseminated) were also recorded. While alcohol use was recorded, non-standardized reporting of light, moderate, or heavy use made this error prone, thus these data were not incorporated into the reported results.

For imaging findings, the largest lymph node was documented in addition to the presence of low-neck lymphadenopathy, retropharyngeal lymphadenopathy, overt radiographic extranodal extension (ORENE), and matted lymphadenopathy. Low-neck lymphadenopathy was defined as radiographic involvement of level IV and/or Vb in the neck. ORENE was defined radiographically as a clear loss of the integrity of the nodal capsule with infiltration of tumor into the adjacent fat planes or musculature. Matted lymph nodes were defined radiographically as multiple abutting lymph nodes with the loss of intervening fat planes.

Pathology features included histologic type (as previously described [11, 19]), tumor pattern or subtype [8], crypt versus surface centered, nuclear anaplasia and tumor cell multinucleation (as previously defined [19]), comedonecrosis, type of infiltration (jagged/single cell, broad front, blended), desmoplastic stromal reaction (present or absent), extent of lymphocytic infiltrate (absent, mild, moderate, brisk), mitotic figures (number of mitotic figures per 2 mm2 [magnification at × 40 with a × 10 objective lens using Olympus BX41 microscope and a field of view of 22]), and other features. p16-negative tumors were graded as well (grade 1), moderately (grade 2), or poorly (grade 3) differentiated; p16-positive tumors were not graded by definition (although they would be considered grade 1 when nonkeratinizing, since they are similar to normal lymphoepithelium).

p16 immunohistochemistry was performed on a 4-µm-thick section of a formalin-fixed, paraffin-embedded (FFPE) tumor block with sufficient material for successful evaluation using the E6H4 Ventana (Tucson, AZ) monoclonal antibody to p16INK4a at a 1:1 dilution on a Ventana Benchmark LT automated immunostainer according to the manufacturer supplied standard protocol. This included antigen retrieval using a Ventana epitope retrieval solution. Positive and negative controls (normal tonsil) were used throughout. The tumors were interpreted to be either p16-positive or negative using established cutoffs: positive defined as > 70% strong, diffuse nuclear and cytoplasmic reactivity in the neoplastic cells and negative defined as anything less than this cutoff. Of the 48 cases interpreted to be negative, all showed staining in < 20% of the neoplastic cells, including no staining at all. Evaluated prospectively, the criteria match those published in the College of American Pathologists Guidelines [20].

Statistical Evaluation

Fischer’s Exact test and unpaired t test were used to compare frequencies of histopathologic subtypes by p16 status. The OS and DFS were defined as the time interval between the date of initial pathologic diagnosis of the patient’s tumor and the date of death due to any cause (primary end point) or the date of first tumor recurrence (secondary end point). Recurrences included locoregional failure or distant metastasis in a patient who had successfully completed primary treatment and was identified as being disease free. If a patient failed therapy and thus was never disease free, they were classified as having persistent disease and locoregional failure for statistical purposes. These parameters were identified in patients with a minimum follow-up of 60 months or to death. OS and DFS were determined by the Kaplan–Meier method with log-rank tests used to examine survival differences between cohorts. Time intervals were computed from the date of treatment completion to event occurrence. Multivariable analyses were performed using a Cox proportional hazard model to estimate the risk of disease recurrence or death. A regression model was constructed to identify histopathologic prognostic factors using a stepwise selection algorithm after accounting for confounding factors of patient age, tumor stage, and smoking history (≥ 10 pack-years vs. < 10 pack-years). All histopathologic predictors eligible for inclusion in the regression model demonstrated a p-value < 0.10 on univariable analysis. Factors in the final regression model demonstrated a p-value of < 0.05 on multivariable analysis.

Results

Clinical Findings

The clinical findings are summarized in Table 1. The vast majority of patients were male (87.4%) with a median age of 59 years at presentation (Fig. 1). Most were white (88.7%) with blacks (9.2%), Asians (1.8%), and Native Americans (0.3%) comprising the remaining patients. There were more black patients proportionately who had p16-negative tumors (33.3%) than white patients (10.4%) (p = 0.0005). Patients presented with a variety of symptoms, referable to the primary tumor or metastatic disease in the cervical lymph nodes. A neck mass was the presenting clinical finding in 49.5% of patients. The lesions were evaluated by fine needle aspiration (n = 169, 43%) or a core needle biopsy (n = 29, 7%), the result of which informed additional management. The majority of patients were ever smokers (n = 259, 66.4%), with 89 current and 170 former smokers. 131 patients were never smokers. As previously described [21,22,23,24], all patients with < 10 pack-years of smoking history, including the never smokers, were combined to a cohort of 207 patients for statistical purposes. The remaining 183 patients reported a smoking history of ≥ 10 pack-years. Based on criteria described in the materials and methods, 98 patients were never users and 290 patients were ever users of alcohol. Overall, p16-positive OPSCC patients who smoked ≥ 10 pack-years had inferior OS (HR 1.76 [1.10–2.82], p = 0.02) compared to < 10 pack-years smokers (see Table 2).

Table 1 Clinical findings of 390 oropharyngeal squamous cell carcinoma patients
Fig. 1
figure 1

Patient quintile age distribution between men and women. The majority are between 51 and 65 years

Table 2 Clinical and histopathologic prognostic factors on multivariable analysis of patients with p16-positive oropharyngeal squamous cell carcinomas

The majority of OPSCCs arose in the tonsil (n = 201, 51.5%) or the base of tongue (n = 169, 43.3%). Only a few tumors affected the pharynx or soft palate (Table 1). Most tumors were centered in a single site (n = 364, 93.3%) but may have expanded to involve more than one site (likely the case with the soft palate tumors). The remaining tumors presented with multiple separate sites including 2.6% (n = 10) that presented with bilateral disease. The particular subsite affected, even when controlled for p16 reactivity and stage, did not yield a statistically significant difference in DFS or OS (Supplemental Table A).

Overall, using the AJCC 8th edition, the majority of tumors were clinically low stage (Supplemental Table A). Specifically, for cT-categories, 211 patients (54%) were T1 or T2, comprising 57% of p16-positive tumors and 31.3% of p16-negative tumors. Of p16-positive OPSCC patients, only 18 (5.3%) were cN0. Thus 94.7% of all p16-positive OPSCC patients had lymph node metastases at the time of diagnosis even though only 49.5% presented with a neck mass clinically. Similarly, 8 patients (16.7%) with p16-negative OPSCC tumors were cN0. The remaining 83.3% had lymph node metastases although only 54% presented with a neck mass clinically.

Imaging Findings

At least an MRI or CT scan was available for evaluation with concurrent PET/CT in many patients. The definition of ORENE was used to evaluate the 342 patients with p16-positive tumors. There were 22 patients with clinical stage I extranodal extension (ENE), 7 patients with clinical stage II-ENE, and 23 patients with clinical stage III-ENE (see Supplemental Table B; Fig. 2). Analyzing only the patients with p16-positive tumors, ORENE was a poor prognostic factor for DFS and OS (HR = 2.88 [1.83–4.54]; p < 0.001). Clinical stage-matched analysis of p16-positive cases demonstrated worse survival among patients with ORENE compared to patients in the same clinical stage group without ORENE (mean years): Stage I: 132 (7.6) vs. I-ENE: 22 (6.2); II: 76 (7.5) vs. II-ENE: 7 (5.9); III: 52 (6.1) vs. III-ENE: 23 (4.2) (HR = 2.35 [1.43-3.87]; p = 0.0007) (Supplemental Table B).

Fig. 2
figure 2

a A 72 year old white female never smoker with light alcohol use history had a left tonsil nonkeratinizing p16-positive pT4 OPSCC and demonstrates a 3 cm left neck lymph node with extranodal extension (white arrow) in a setting of bilateral neck disease (arrowhead). b A 39 year old black woman with a negative alcohol and tobacco use history had a left base of tongue nonkeratinizing, spindled, p16-positive pT4 OPSCC and shows a 7 cm, matted lymph node group involving level II–Vb with extranodal extension (white arrow)

Evaluating the entire cohort, clinical low-neck disease (lymph node levels IV/Vb; Fig. 3) was a poor prognostic factor for survival after adjusting for p16 status (HR 2.60 [1.64–4.11], p < 0.0001, n = 45). Analyzing only those patients with p16-positive disease, low-neck disease remained prognostic for worse survival (HR  2.95 [1.81–4.80], p < 0.0001). Furthermore, clinical stage-matched analysis of p16-positive cases demonstrated worse survival among patients with low-neck disease compared to patients in the same stage group without this feature (see Supplemental Table C).

Fig. 3
figure 3

a A 56 year old Asian man with a 40 pack-year smoking history had a right tonsil nonkeratinizing p16-positive pT3 OPSCC and shows a 1.7 cm low level neck lymph node (white arrow), along with bilateral disease. b A 62 year old white man with a 10 pack-year smoking history and light alcohol use had a left base of tongue nonkeratinizing (lymphoepithelial pattern), p16-positive pT3 OPSCC and shows a 2.6 cm level Vb enlarged lymph node (white arrow), along with a 3 cm level II lymph node

Pathologic Features

OPSCCs were separated into nonkeratinizing (n = 309), nonkeratinzing with maturation (n = 37), and keratinizing (n = 44) (Table 3 and Supplemental Table D). Importantly, separation was based on the largest volume specimen submitted, but the majority of patients did not have surgical resections. The vast majority of tumors were nonkeratinizing, with 290 p16-positive and 301 tumors of the tonsil and base of tongue (see Supplemental Table D) showing a “blue-cell” appearance. This was characterized by large, well defined lobules, nests and trabeculae of tumor that often showed a sharp edge or pushing border with the surrounding stroma or lymphoid tissue (Fig. 4). An “inside-out” or “reverse maturation” appearance, with central basaloid cells surrounded at the periphery by cells with squamous maturation (Fig. 5), was quite common. The cells were arranged in a syncytial pattern of intermediate sized polygonal cells with a relatively high nuclear to cytoplasmic ratio. The cytoplasm was eosinophilic to amphophilic and surrounded round to oval nuclei with heavy chromatin and inconspicuous nucleoli. Mitoses were abundant, including atypical forms. Central comedonecrosis was common. Nonkeratinizing tumors still displayed areas with dyskeratosis, squamous eddies, or pearls. By contrast, keratinizing SCC showed a jagged, angulated infiltration by variably sized nests of neoplastic cells that elicited a strong desmoplastic stromal reaction. These neoplastic cells were large and polygonal with abundant, opacified, eosinophilic cytoplasm surrounding remarkably irregular nuclei with coarse, inky-black, hyperchromatic chromatin (Fig. 6). Distinct cell borders with easily identified spinous processes (intercellular bridges) and dyskeratosis, keratin pearl formation, and keratin debris were usually present. Most of these keratinizing tumors were p16-negative (59.5%) but still affected the tonsil and base of tongue (83.3%). The “nonkeratinizing with maturation” term was employed to cover hybrid tumors that showed a “basaloid” appearance but with easily identified areas of maturing squamous differentiation (Fig. 7). The majority of these tumors (92.1%) were p16-positive, and, again, involved the tonsil and base of tongue most commonly (89.5%).

Table 3 Histopathologic features in p16-positive and p16-negative oropharyngeal squamous cell carcinomas
Fig. 4
figure 4

Nonkeratinizing OPSCC p16-positive. a The majority of the tumor shows a deep crypt origin/centering and a basaloid morphology with multiple well defined nests of tumor with a sharp border. b This nonkeratinizing OPSCC shows well developed surface derivation with extension into the underlying stroma. c The surface epithelium is immediately overlying the basaloid proliferation and intimately associated with it

Fig. 5
figure 5

Nonkeratinizing OPSCC p16-positive. a Uninvolved surface epithelium is seen (top and bottom) with islands of neoplastic cells showing an “inside-out” pattern of central basaloid cells surrounded by maturing squamous cells. b The inside out pattern shows a syncytial architecture of cells with a high nuclear to cytoplasmic ratio, frequently showing a cleft from the surrounding tissue

Fig. 6
figure 6

Keratinizing OPSCC p16-positive. a Surface centered tumor with areas of central keratinization. b Irregular infiltration into stroma with desmoplasia and limited inflammation. c Jagged infiltration into the stroma of a keratinizing epithelium. d Nuclear pleomorphism, including multinucleation and giant cells, in this keratinizing OPSCC

Fig. 7
figure 7

Nonkeratinizing with maturation OPSCC p16-positive. a The surface is uninvolved, while limited keratinization can be seen. b A basaloid appearance predominates, but keratinization is noted. c Comedonecrosis with areas of keratinization and profound pleomorphism. d Keratinization is easily identified in a basaloid neoplasm, with areas of lymphovascular invasion identified

Several different histological subtypes were identified including basaloid, papillary, lymphoepithelial [undifferentiated], spindle cell, and adenosquamous (Table 3 and Supplemental Table D). Basaloid SCC is a distinct entity, different from the basaloid features in OPSCC, and six tumors showed this pattern. Specifically, the basaloid SCC has more rounded tumor cells arranged within islands that show central comedonecrosis. The cells form a jigsaw-puzzle pattern of nests that mold to one another around hyalinized basement membrane type material. The stroma is paucicellular (Fig. 8) [7, 19, 25, 26]. Papillary SCCs (n = 22) were histologically identical to previous descriptions [27] and showed delicate fibrovascular cores lined by cytologically pleomorphic squamous cells, all of which were p16-positive (Fig. 9). The lymphoepithelial pattern was detected in 53 tumors (n = 52 p16-positive; n = 1 p16-negative). This histologic presentation was identical to nasopharyngeal undifferentiated carcinoma, with solid sheets and irregular islands of discohesive cells intimately blended with lymphocytes and plasma cells. There was usually a syncytial appearance with indistinct cell borders and large, round to oval nuclei with vesicular-open chromatin and large central nucleoli (Fig. 10). Adenosquamous carcinoma, a combination of both adenocarcinoma and squamous cell carcinoma in the same tumor, was present only in a single p16-negative tumor.

Fig. 8
figure 8

Basaloid SCC p16-positive. a A jigsaw-puzzle configuration with central comedonecrosis and basement membrane-type material. b Reduplicated basement membrane separates the tumor islands and is seen as small droplets within the tumor nests. c Central comedonecrosis surrounded by basaloid cells with basement membrane material easily identified

Fig. 9
figure 9

Papillary OPSCC p16-positive. a There are numerous, delicate, filiform papillary projections of atypical squamous epithelium, showing b strong and diffuse nuclear and cytoplasmic p16 immunoreactivity

Fig. 10
figure 10

Lymphoepithelial OPSCC p16-positive. a A predominantly deep, crypt-centered tumor. b The neoplastic cells are intimately associated with the lymphoid elements. c The intimate relationship of the neoplastic cells with the lymphoid cells makes definitive separation between the elements difficult. d There is a strong and diffuse, nuclear and cytoplasmic reaction of the neoplastic cells with p16, with negative areas representing lymphoid cells

Of all tumors, 312 were crypt-centered (Fig. 4) while 78 were surfaced-centered (Table 3; Fig. 6). Of the p16-positive tumors, 293 were crypt-centered and 49 were surface-centered and of the p16-negative tumors, 19 were crypt-centered and 29 were surface-centered. This difference was statistically significant (p < 0.0001). Further, the vast majority of p16-positive tumors were crypt-centered irrespective of subsite (see Supplemental Table D). There was no statistically significant difference in patient outcome within subsite when stratified by crypt-versus surface-centered tumors, controlling for stage and p16 status.

Tumor surface ulceration was identified more commonly in p16-negative tumors (28/48) than in p16-positive tumors (48/342). This undoubtedly correlated to the surface versus crypt-centering of the tumor, and thus ulceration was associated with p16-negative tumors (p < 0.0001, Table 3 and Supplemental Table D). Similarly, concurrent dysplasia of the adjacent epithelium was more common in p16-negative than in p16-positive tumors (30/48 vs. 95/342, p < 0.0001) and, again, related to tumor centering and keratinizing versus nonkeratinizing histology. Squamous pearls/eddies were more likely to be present in the p16-negative than p16-positive tumors (39/48 vs. 219/342, p = 0.02). Tumor cell profound anaplasia and tumor cell multinucleation (Fig. 11), as previously defined [19], were easily identified in many tumors but were not specifically associated with p16-positive versus p16-negative tumors (p = 0.87 and p = 0.85, respectively). Furthermore, their presence did not yield any statistically significant difference in OS. Central comedonecrosis within the neoplastic islands (Figs. 7 and 8) was a common occurrence and unrelated to p16 status (p = 0.11). The different patterns of stromal infiltration correlated to p16 status: jagged and/or single cell infiltration significantly correlated to p16-negative tumors (p < 0.0001) and a broad-pushing front to p16-positive tumors (p < 0.0001, Table 3 and Supplemental Table D; Figs. 4, 6 and 7). A strong desmoplastic stromal reaction was seen in both tumor types when extending beyond the lymphoid stroma but was much more common in p16-negative tumors than in p16-positive tumors (p = 0.004). In crypt-centered tumors, there was always a brisk lymphoid infiltrate, usually in p16-positive tumors, while a scant to limited response was more common in the p16-negative and surface origin tumors (p < 0.0001; Fig. 6).

Fig. 11
figure 11

Multinucleation and profound anaplasia in OPSCC. a There is profound nuclear pleomorphism and multinucleated in this field. b In this keratinizing OPSCC, there are numerous multinucleated and anaplastic cells

Using the criteria established for mitoses per 2 mm2, there was a range of 3–387 mitoses/2 mm2 with an average of 69.9 in p16-positive tumors and 49.4 in p16-negative tumors. Thus, there were more mitoses in the p16-positive tumors (p = 0.01) (Table 3). When using incremental increasing groups of 20 mitoses/2 mm2, however, there was no impact on patient DFS or OS.

Clinical Treatment

See Supplemental Table E for management protocols. Patients underwent treatment after initial biopsy for curative intent with definitive concurrent chemoradiation (n = 345), induction chemotherapy followed by chemoradiation (n = 44), or radiotherapy alone (n = 1). Patients who underwent upfront chemoradiation received high-dose cisplatin (n = 143), triweekly carboplatin (n = 140), cetuximab (n = 32), weekly cisplatin (n = 25), weekly carboplatin (n = 1), and carboplatin/docetaxel (n = 1). Patients who received induction chemotherapy received either docetaxel and platinum-based induction chemotherapy with (n = 39) or without (n = 5) 5-fluorouracil. Following induction chemotherapy, concurrent regimens consisted of weekly carboplatin (n = 27), triweekly carboplatin (n = 9), weekly cisplatin (n = 4), high-dose cisplatin (n = 3), and cetuximab (n = 1). Patient follow-up for all patients was 7.2 years (average 6.7, range 0.02 up to 13.0 years) and 7.8 years for surviving patients (range 1.2–13.0 years). When separated by p16 status, the average OS was 7.0 and 4.9 years for p16-positive and p-16 negative patients, respectively (Fig. 12a). Within the p16-positive patient group, 284 were without disease at last contact (whether alive or dead), with an average follow-up of 7.8 years, and 58 patients were with disease at last contact (whether alive or dead), with an average of 2.9 years (see Supplemental Table B). The vital status at the time of last follow-up and the presence of recurrent/metastatic disease is included in Table 4. By Kaplan–Meier analysis, the 5-year DFS was 76% for p16-positive disease versus 46% for p16-negative disease (p < 0.0001; Fig. 12b). Five-year OS was 81% for p16-positive disease vs. 48% for p16-negative disease (p < 0.0001). Recurrences occurred in 84 patients, 65 of 342 (19.0%) p16-positive patients and 21 of 48 (43.8%) p16-negative patients. This shows that recurrences are more common in p16-negative patients (p = 0.0003). Locoregional recurrence developed in 47 patients (12.1%), 31 (9%) and 16 (33%) each in p16-positive and p16-negative, respectively (p < 0.0001). Distant metastases developed in 58 patients (14.9%), 49 (14%) and 9 (19%) in p16-positive and p16-negative, respectively (p = 0.39). Thus, the frequency of distant metastases was no different between p16-positive or negative tumors. There was a median time to recurrence of 0.74 years, and a median time to distant metastasis of 1.03 years; 96% of recurrences developed within 4 years, and 97% of distant metastases developed within 4 years.

Fig. 12
figure 12

Kaplan Meier analysis showing the overall survival (a) and disease free survival (b) for patients with p16-positive and p16-negative tumors

Table 4 Overall patient outcomes for patients with oropharyngeal squamous cell carcinomas by clinical and histopathologic parameters

On univariate analysis, baseline clinical features predictive of OS and DFS with statistical significance are included in Table 4. For p16-positive tumors, the DFS was worse and the OS shorter in older patients (using ≥ 55 years as a cutoff) compared to younger patients: hazard ratio 2.65 (1.62–4.33), p < 0.0001 and 3.21 (1.84–5.61), p < 0.0001, respectively. There was no difference in DFS or OS between males and females (n = 341 vs. 49; HR 0.67 [CI 0.42–1.08], p = 0.095 and HR 0.71 [0.43–1.18], p = 0.19, respectively), nor between races (white, black, Asian, Native Indian; p = 0.57 and p = 0.42, respectively).

Among all-comers, the 5-year DFS and OS was 77% vs. 67% (p = 0.003) and 83% vs. 71% (p = 0.003) for < 10 pack-year smokers vs. ≥ 10-pack-year smokers, respectively. There was no patient outcome difference between ever (n = 290) versus never (n = 98) drinkers for DFS (p = 0.23) or OS (p = 0.18).

For patients with p16-positive disease, 5-year OS was 88% for clinical stage I disease, 86% for stage II disease, and 63% for stage III disease (p < 0.0001; Fig. 13a); and 5-year DFS was 84% for clinical stage I disease, 81% for stage II disease, and 55% for stage III disease (p < 0.0001; Fig. 13b). Among patients with p16-negative disease, 5-year OS was 55% for clinical stage III disease, 61% for stage IVA disease, and 21% for stage IVB disease (p = 0.04; Fig. 14a), and 5-year DFS was 55% for clinical stage III disease, 57% for stage IVA disease, and 21% for stage IVB disease (p = 0.02; Fig. 14b).

Fig. 13
figure 13

Kaplan Meier analysis showing the overall survival (a) and disease free survival (b) for patients with p16-positive tumors separated by stage

Fig. 14
figure 14

Kaplan Meier analysis showing the overall survival (a) and disease free survival (b) for patients with 16-negative tumors separated by stage

For patients with p16-positive tumors, ORENE was a poor prognostic factor for OS (HR 2.78 [1.68–4.46], p < 0.0001) seen within stage-matched patients without ORENE (HR 2.35 [1.43–3.87], p = 0.0007). ORENE remained a prognostically significant factor on multivariate analysis (HR 2.16 [1.29–3.63], p = 0.004; Fig. 15).

Fig. 15
figure 15

Kaplan Meier analysis showing the overall survival (a) and disease free survival (b) for patients with p16-positive tumors with and without overt radiographic extranodal extension

There was a worse survival outcome for patients with low level IV/Vb lymph node involvement when matched to same stage patients and controlled for p16 status; univariate analysis: HR 2.31 [1.47–3.63], p = .0003; controlling for p16 status HR 2.60 [1.65–4.11], p < 0.0001; controlling for stage: HR 2.01 [1.28–3.17], p < 0.001; and multivariate analysis: HR 2.37 [1.36–4.15], p = 0.002 (Fig. 16).

Fig. 16
figure 16

Kaplan Meier analysis showing the overall survival (a) and disease free survival (b) for patients with p16-positive tumors with and without low level IV/Vb lymph node involvement

There was no statistically significant difference in patient DFS (p = 0.54) or OS (p = 0.10) when stratified by tumor site on univariable analysis. In p16-positive tumors, 293 were crypt centered and 49 were surface centered, but there was no clinically significant difference in DFS or OS (average 7.0 years overall survival). Histopathologic subtypes are listed in Table 3 and Supplemental Table D by p16 status. Overall, tumors that were p16-positive were more likely to be nonkeratinizing and crypt-centered with a broad front of invasion, moderate-to-brisk lymphocytic infiltrate, and higher mitotic index. Tumors that were p16-negative were more likely to be surface-centered, ulcerated, and keratinzing. They more commonly showed jagged/single cell infiltration, squamous pearl formation, a desmoplastic stromal reaction, concurrent dysplasia, mild lymphocytic infiltration, and a lower mitotic index.

On multivariable analysis of patients with p16-positive tumors, age (HR 1.07 [1.04–1.10], p < 0.0001), smoking history ≥ 10 pack-years (HR 1.76 [1.10–2.82], p = 0.02), stage (HR 1.77 [1.32–2.38], p = 0.0002), ORENE (HR 2.16 [1.29–3.63], p = 0.004), low-neck lymphadenopathy (HR = 2.37 [1.36-4.15], p = 0.002), and lymphocytic responses (mild vs. absent: HR 0.01 [0.002–0.05], p < 0.0001; moderate vs. absent: HR 0.01 [0.002–0.06], p < 0.0001; brisk vs. absent: HR 0.01 [0.002–0.05], p < 0.0001; Fig. 17a) were prognostic for OS. TNM stage (HR 1.58 [1.13–2.21], p = 0.007), the presence of ORENE (HR 2.90 [1.66–5.07], p = 0.0002), low-neck lymphadenopathy (HR 2.61 [1.42–4.80], p = 0.0001), and lymphocytic response (mild vs. absent: HR 0.01 [0.002–0.05], p < 0.0001; moderate vs. absent: HR 0.02 [0.004–0.10], p < 0.0001; brisk vs. absent: HR 0.01 [0.002–0.06], p < 0.0001; Fig. 17b) were prognostic for DFS. Five-year DFS/OS was 0%/0% for absent lymphocytic response, 66%/71% for mild lymphoid infiltrate, 72%/78% for moderate lymphoid infiltrate, and 80%/86% for brisk lymphoid response (Fig. 17).

Fig. 17
figure 17

Kaplan Meier analysis showing the overall survival (a) and disease free survival (b) for patients with p16-positive tumors with variably lymphocytic infiltrate responses

On multivariable analysis of p16-negative disease, low-neck lymphadenopathy (HR 5.59 [1.01–31.05], p = 0.049) was prognostic for DFS, with 5-year DFS of 33% vs. 44% with and without low-neck lymphadenopathy, respectively. Other histopathologic factors did not demonstrate prognostic significance.

Discussion

Tumors that were p16-positive were more likely to be crypt-centered and nonkeratinizing with a broad, pushing front of invasion, moderate-to-brisk lymphocytic infiltrate, and high mitotic index [11]. In fact, nonkeratinizing tumors were nearly all (94%) p16-positive, a finding that may have been even higher if further high-risk HPV mRNA testing had been performed [28]. Tumors that were p16-negative were more likely to be ulcerated and surface-centered. Their histologic features included jagged/single cell infiltration, squamous pearls/eddies, a desmoplastic stromal reaction, concurrent dysplasia, mild lymphocytic infiltration, and a lower mitotic index. As already described, the lack of a well-developed basement membrane in the lymphoepithelium of the oropharynx [13] results in a high incidence of lymphovascular invasion and early lymph node metastasis. This finding was seen in 364/390 (93.3%) of the patients in this cohort. Even though the lack of an inflammatory infiltrate was statistically correlated with a poor prognosis (DFS and OS), it was identified in only 2 patients, and thus may not be an event with sufficient power to warrant such a statement.

Reported 5-year overall survival for p16-positive OPSCC ranges from 60 to 94% for all comers [4, 5, 29,30,31,32]. Disease free or progression free survival ranges from 57 to 85% [5, 30,31,32]. In this study, recurrence occurred in 86 (22%) patients with 47 (12%) patients experiencing locoregional disease and 58 (15%) with distant metastases. Nineteen patients (5%) experienced both. The average time to recurrence for all comers was 15.0 months; 7.2 months for p16-negative tumors and 17.6 months for p16-positive tumors, with an average time to locoregional recurrence and distant metastasis of 11.2 months and 18.2 months, respectively. Specifically, 96% of recurrences developed within 48 months and 97% of distant metastases developed within 48 months. These findings suggest that post-treatment oncologic surveillance for at least 48 months is prudent [30, 33].

As previously reported, patient age (defined by < 55 and ≥ 55 years) is a significant factor in DFS and OS, but only for p16-positive patients in this cohort of patients (p < 0.0001). These findings are similar to those reported by others, although with different cutoffs amongst studies [6, 29,30,31,32, 34,35,36,37,38,39,40,41,42,43,44,45,46].

The majority of patients were white (88.7%), although black patients disproportionately had p16-negative tumors (50%) when compared to white patients (9%). Patients in this series were stratified based on smoking status [4]. Recognizing that pack year history, smoking duration, and interval from quitting impact patient outcomes [4, 5, 22, 24, 36, 38, 47, 48], the patients were further categorized into never smokers or < 10 pack-year smokers as a single group while all other patients were placed in the ≥ 10 pack-year smoking history. This stratification identified 46.9% of all patients had a ≥ 10 pack-year smoking history in this cohort, a significant disease cofactor. Specifically in the p16-positive tumors, a smoking history ≥ 10 pack-years was an independent predictor of inferior survival (HR 1.76 [1.10–2.82], p = 0.02), confirming smoking as an adverse prognostic factor in HPV-associated OPSCC.

The AJCC 8th edition separated the HPV-associated OPSCC from HPV-independent tumors in recognition of the different variables that need to be considered in management as treatment options are modified based on HPV-status. For HPV-associated cancers, this is predicated on whether the disease is clinically unilateral (N1), bilateral (N2), or the size of any node being over 6.0 cm (N3). Based on findings in this patient cohort, specifically a worse outcome associated with the presence of ORENE and low level IV/Vb neck lymphadenopathy, the lymph node categories may need to be further stratified.

In p16-positive OPSCC primary tumors, additional studies have identified clinical extranodal extension/matted nodes to be a strong predictor of worse outcome [49,50,51,52,53,54,55] although size within 6 cm and number of affected lymph nodes does not seem to adversely affect outcome [56, 57]. One suggested reason is that large cystic metastases, a common finding in these patients, do not carry the same disease burden as solid metastases of a similar size [58]. In this cohort, this clinical finding was encompassed by ORENE. Patients who had no ORENE had the best overall survival. Analyzing only those patients with p16-positive tumors, ORENE was a poor prognostic factor for overall survival (HR 2.74 [1.68–4.46], p < 0.0001). The prognostic impact of ORENE in the cohort of p16-positive tumors remained significant on multivariate analysis (HR 2.16 [1.29–3.63], p = 0.004), even after adjusting for potential confounding factors (age, stage, tobacco status) [58]. This suggests the cN categories may need to be modified or upstaged to account for ORENE or that de-intensified therapy be selectively applied to patients without ORENE if other findings are similar.

In p16-positive OPSCC primary tumors, involvement of low lying level cervical lymph nodes, specifically IV/Vb, is a predictor of worse outcome and development of distant metastasis [14, 59]. In this cohort of patients, especially in patients with p16-positive tumors, level IV/Vb disease was prognostic for inferior survival (HR 2.95 [1.81–4.80], p < 0.001). The prognostic impact of level IV/Vb lymph node involvement in p16-positive disease remained significant on multivariate analysis (HR 2.37 [1.36–4.15], p = 0.002). Again, this finding suggests that the cN categories may need to be modified or upstaged to account for level IV/Vb lymph node disease.

In a large national registry report of OPSCC, 1.1% of tumors presented as bilateral primaries [60]. In this cohort of patients, 2.6% (10 of 390) presented with bilateral tumors, 8 of which were p16-positive and 2 were p16-negative. Therefore, it would seem that bilateral presentation is uncommon, but not rare.

Nearly all of the tumors arose from the tonsil (n = 201) or base of tongue (n = 169). The remaining 20 cases arose from the pharynx (n = 12) or soft palate (n = 8). When broken down further, a significant proportion of tumors were p16-negative in the pharynx (75%; 9/12) and soft palate (37.5%; 3/8) when compared to the base of tongue and tonsil (9.5%; 16/169 and 10%; 20/201). However, large primary tumors may have been centered with the bulk of tumor in an anatomic site, even though adjacent sites may also be involved. As recently suggested [61], regions of the oropharynx that do not harbor tonsillar tissues, such as the soft palate, may be separated from this anatomic designation [13] and included in the oral cavity instead.

All our patients were treated in a unified and consistent approach and all were centrally reviewed prior to evaluation. Thus, even though retrospective cohort analysis from large groups of patients suggest there is a difference in OS by subsite, it is probably due to other confounding factors that are not consistently evaluated, treatment changes over the past two decades, or perhaps aggregated groupings that are not easily evaluated [62,63,64]. Further, several authors have suggested that within p16-positive tonsillar tumors, the specific location of crypt (referred to as specified tonsillary SCC [STSCC] based on tonsillar crypts and lymphoid tissue with germinal centers) versus surface (referred to as non-specified tonsillary SCC [NSTSCC] when tonsillary crypts and lymphoid tissue are absent) may be associated with a better outcome when crypt-centered versus surface located [35, 64,65,66].

In this cohort of patients, the nonkeratinizing histologic appearance was strongly correlated with p16-positive tumors (p < 0.0001) and was shown to be associated with patient outcome [9, 11, 28, 67]. There was no difference between p16-positive tumors classified as nonkeratinizing and nonkeratinizing with maturation when evaluating patient outcome, thus, while a recognized group, they can be collapsed into a single category of nonkeratinizing for management purposes [11, 19, 28, 68].

Several different histological subtypes were identified including basaloid, papillary, lymphoepithelial (undifferentiated) [8, 69], spindled cell, and adenosquamous [70, 71]. The true basaloid squamous cell carcinoma is a distinct pathological entity [7]. In this cohort of patients, this subtype did not experience a worse outcome than the remaining patients, a similar finding for any of the other variants.

Conceptually, many of the tumors were originally diagnosed as “poorly differentiated,” but these nonkeratinizing SCC are actually the best differentiated, most closely resembling the normal lymphoepithelium of the tonsil/base of tongue region. As such, tumor grade is conceptually no longer an issue for HPV-associated (p16-positive) OPSCC. The lymphoepithelium, and especially the deep crypts of the tonsil, have a morphologic appearance that is indistinguishable from the vast majority of tumors that arise from this epithelium. Thus, by definition, they are “well differentiated” tumors as they so closely approximate and resemble the epithelium native to the region [11, 19, 20, 28, 67, 68, 72,73,74,75]. However, by classical histologic standards for other anatomic sites, these squamous cell carcinomas have historically been interpreted as poorly differentiated [76] due to a basaloid appearance and cells with a high nuclear to cytoplasmic ratio, prominent nucleoli, vesicular to coarse nuclear chromatin, and increased mitoses. It has been shown in this cohort that the traditional tumor grading for p16-positive tumors does not inform prognosis, and thus no differentiation or tumor grade is used for these tumors. Instead, grade is only assigned to p16-negative tumors.

By univariate analysis, old age (≥ 55 years) , smoking status (≥ 10 pack-year history), and advanced stage were all statistically significant clinical features associated with a shorter OS. These findings are also documented in other smaller studies [22, 30, 31, 50, 77]. Other potential variables, such as sex [31, 37, 41], race [37], lymphovascular invasion [50], and more than 4 lymph nodes with metastatic disease [30, 50] have been suggested as factors associated with treatment failure and a worse outcome. In this cohort of 390 patients managed centrally, these other factors were not significant.

By multivariable analysis, p16-positive tumors, older age (≥ 55 years), ≥ 10 pack-year history smoking status, overt radiographic extranodal extension, low neck (level IV/Vb) lymph node involvement, and high tumor stage remained prognostically predictive. These findings are similar to those previously reported in smaller cohorts of patients, irrespective of geographic distribution [22, 37, 38, 50, 77,78,79].

The specific pattern of the tumor is not associated with patient outcome. Specifically, when using the term “lymphoepithelial-like” or “undifferentiated” carcinoma in the oropharynx, a concept similar to the pattern of growth seen in nasopharyngeal tumors (nasopharyngeal carcinoma, undifferentiated type) [69], there is no difference in patient outcome, as seen in the 53 patients in this series who showed this pattern. All except one were p16-positive, showing an overall 7.8 years of follow-up without a statistically significant difference in outcome in comparison to the remaining OPSCC, findings that are similar to other authors [8].

In cases demonstrating a high tumor inflammatory response with low stromal desmoplasia, patients seemed to have a better disease specific 5-year survival, a finding that we did not encounter in this cohort of patients [35, 80]. This may be due to the limitation of examining biopsy samples rather than resection samples. Because the desmoplastic stromal reaction is usually only detected in HPV-associated tumors that have invaded all the way through the lymphoid stroma before expanding into the soft tissue, the biopsy sample may not have included the area where the desmoplastic stromal reaction is identified [35, 80].

The specific treatment modalities and comparisons between therapies in this cohort of patients and smaller subsets of the cohort have already been published, specifically addressing cisplatin-based chemoradiotherapy and cetuximab-based bioradiotherapy [15], high-dose cisplatin versus triweekly carboplatin-based chemotherapy [17], induction chemotherapy followed by concurrent chemoradiation versus concurrent chemoradiation alone in the definitive management of p16-positive oropharyngeal squamous cell carcinoma with low-neck or N3 disease [14], and radiographic nodal prognostic factors in stage I HPV-associated oropharyngeal squamous cell carcinoma [58].

Conclusion

OPSCC is strongly associated with p16 positivity, and, by extension, HPV-associated. The majority of the patients are younger, white, and likely to present with lymph node metastatic disease. There is a worse OS in patients with ≥ 10 pack-year smoking history, overt radiographic extranodal extension, and low-neck (IV/Vb) lymph node involvement. Histologically, among patients with p16-positive disease, OS is worse in patients with an absent lymphocytic infiltrate and improved in p16-negative patients with non-keratinizing histology. Thus, it seems that the AJCC 8th edition criteria pose potential problems in classification, treatment, and patient outcome as it relates to clinical lymph node status. Specifically, radiographically detectable overt extranodal extension and low-neck lymph node disease are parameters which are associated with a worse overall survival and disease-free survival.