Introduction

Human papillomavirus (HPV) infection is the most common sexually transmitted disease in the world, primarily causing cervical cancer (CC) through persistent infections in the cervix [10]. Over 100 known HPV types exist, with 13 classified as high-risk genotypes (HPV 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 68, and 59) [20]. The prevalence of HPV is high among women diagnosed with invasive cervical cancers, with HPV 16 and HPV 18 being the predominant genotypes among CC cases in Sri Lanka (80.6%) [15]. Therefore, it is recommended that even women who are clinically healthy and have normal cervical cytology, but test positive for HPV 16/18, undergo additional evaluations to prevent CC [6, 28].

The HPV cannot be grown easily in tissue culture systems or detected with serology based methods. Therefore, most diagnostic methods of HPV rely on detection of viral nucleic acids. Molecular testing for oncogenic HPV infections using commercial kits is the common early screening method for CC in high-resource settings [7, 11, 26, 32]. However, this is not feasible in resource-limited healthcare systems in developing countries due to high costs and a lack of skilled personnel. Polymerase chain reaction (PCR) based HPV detection is highly sensitive and can detect a broad spectrum of mucosal HPV genotypes in a single PCR using only a small amount of specimen [5, 21]. It is widely used in epidemiological studies and can be adopted for clinical setting where resources are limited and inputs are low. HPV DNA amplified in PCR can then be further analyzed with multiple HPV genotyping methods, such as sequencing [31], hybridization with HPV specific probes [12], microarray platforms [3], multiplexed PCR using matrix assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) [24] or restriction fragment length polymorphism (RFLP) [5, 21]. RFLP analysis following PCR amplification is a simple, inexpensive, robust, reproducible, and rapid method of HPV genotyping. It is based on digestion of the PCR products using one or several specific restriction enzymes. Previously, a study has described a PCR–RFLP analysis method to distinguish five clinically relevant HPV genotypes in clinical samples, HPV 16, 18, 11, 6, and 33, solely using the RsaI restriction enzyme [21]. Nevertheless, the study was constrained by its limited sample size and lack of confirmation through direct sequencing. In the present study, we intended to use the same PCR–RFLP protocol, but results will be verified through sequencing and by testing a substantially larger number of clinical specimens. For the first time we evaluated suitability of the above PCR–RFLP method for routine HPV screening.

Cytology-based HPV screening is the norm in government clinical settings for monitoring HPV or CC in Sri Lanka due to limited access to molecular diagnostics. Regular prevalence monitoring for HPV or cervical cancer is lacking due to unawareness and unavailability as well. Self-collected sampling and urine based less-invasive sampling have emerged as alternatives to clinician collected samples for HPV screening with increased acceptability and increased rate of compliance among women with social stigma for screening in remote and low resource setting [1, 4, 16, 18, 23]. However, the validity of such a sampling technique for HPV screening has not yet been evaluated in Sri Lanka.

Therefore, this study aimed to assess the feasibility and performance of a previously described PCR–RFLP based HPV screening workflow to detect HPV in low resource routine diagnostic settings with further emphasis on self-sampling HPV detection to enhance the CC prevention in Sri Lanka.

Materials and methods

Control cell lines, HPV recombinant plasmid vectors and method validation

HPV screening and genotyping workflow was established in-house using a model system consisting of 4 HPV infected cancer cell lines (DNA from CasKi, SiHa, HeLa, C4 cells) and with plasmids containing full length HPV genomes of HPV type 16 and HPV 18, kindly gifted by Dr. Stephanie Liu, Department of Obstetrics and Gynaecology, Faculty of Medicine, University of Hong Kong, Hong Kong SAR [19]. In addition, HPV genotyping based on PCR–RFLP was validated using 20 known HPV positive samples (positive with CareHPV™ kit) collected at the National Cancer Control Program, Colombo, Sri Lanka.

Primers and PCR protocols for HPV detection and genotyping

All DNA samples from HPV infected cancer cell lines and clinical material collected in this study were evaluated by amplification of the human β-globin gene using primers PCO4 (5′-CAACTTCATCCACGTTCACC-3′), and GH20 (5′-GAAGAGCCAAGGACAGGTAC-3′) as an internal control. The general primer GP5 + (5′-TTTGTTACTGTGGTAGATACTAC-3′), and GP6 + (5′-TTTGTTACTGTGGTAGATACTAC-3′), which produces a 150 bp PCR product were used for primary screening of HPV DNA in all clinical samples as described previously (GP-PCR) [13]. Further genotyping of HPV positive cases was performed using PCR–RFLP and the method was verified for use in routine diagnostic set-up by sequencing 20 known HPV positive cases.

HPV genotyping analysis by restriction fragment length polymorphism (RFLP)

Twenty HPV positive cases were selected for HPV genotyping with RFLP analysis. HPV DNA was first amplified with degenerate consensus primers MY09 (5′-CGTCCMARRGGAWACTGATC-3′) and MY11 (5′-GCMCAGGGWCATAAYAATGG-3′) which amplified a 450 bp region in the viral L1 gene region (MY-PCR) [9]. RFLP analysis of clinical samples positive for HPV was performed using the RsaI restriction enzyme as described earlier [21]. A 10 µL of the MY-PCR products were digested in a total volume of 20 µL with 10 units of RsaI at 37 °C for 2 h. Digested products were then separated electrophoretically in 10% polyacrylamide gels and stained with ethedium bromide. The banding pattern in the restriction profile of each sample was evaluated against the standard banding pattern in controls for HPV 16 and HPV 18 (HPV 16 and HPV 18 plasmid control vectors).

Confirmation of genotyping by direct sequencing

To confirm the results obtained by PCR–RFLP analysis, HPV positive GP-PCR products from 20 selected cases were sequenced. The sequence data obtained were compared with all sequences available in GeneBank database (www.ncbi.nlm.nih.gov) using the BLAST tool.

Detection of HPV prevalence in clinically normal female cohort

Ethical clearance was obtained from the Ethical Review Committee Faculty of Medicine, University of Peradeniya, Peradeniya, Sri Lanka (2017/EC/20). Cervical cytology samples for the standard Papanicolaou test (Pap smear), and Cytobrush (EDM, USA) sampling for screening for HPV infections were collected from the clinically normal females after obtaining informed written consent. Cervical brush samples were collected in 10 mL ice-cold sterile Phosphate buffered saline (PBS) (Sigma-Aldrich, USA) and DNA was extracted and purified from the cellular specimen by proteinase K digestion followed by a column clean up method (ReliaPrepTM gDNA Tissue Miniprep System, Promega, USA).

In total, 433 samples were collected from a clinically normal female cohort between 20 and 65 years of age attending CC screening in Teaching Hospitals Peradeniya and Kandy after obtaining informed written consent from 2017 to 2019. Demographic data, history for exposure to risk factors of CC, and clinical assessment of the cervix were collected from the participants. Data on age, occupation, monthly family income, parity, number of sexual partners, previous exposure to HPV vaccination, and methods of contraception were also collected. All the samples were screened using blinded cervical cytology. HPV prevalence was determined in the entire cohort of clinically normal females using the established PCR–RFLP method (Fig. 1).

Fig. 1
figure 1

Workflow for detection of HPV prevalence in clinically normal female cohort. n = number of samples collected. Categories for cervical cytology sample evaluation: NILM, negative for Intraepithelial Lesion or Malignancy; ASCUS-L, atypical squamous cells of undetermined significance; ASCUS-H, atypical squamous cells cannot exclude high-grade squamous intraepithelial lesion; LSIL, Low Grade Intraepithelial Neoplasia; HSIL, High Grade Intraepithelial Neoplasia; CC, cervical cancer

Evaluation of self-sampling for HPV testing using PCR–RFLP method

The validity of the self-collected HPV screening compared to physician-collected samples was determined using the in-house developed HPV detection assay in a cohort of females participated in CC screening program (n = 68) (Fig. 2). During their scheduled appointment for cervical smear collection with the physician, the participants were provided with a Qvintip® self-sampling device (Aprovix AB, Uppsala, Sweden) along with proper instructions. On the same day, the physician collected the samples after the participants collected their own samples. Self-collected cervico-vaginal samples were placed in 500 µL of sterile normal saline and the supernatant containing DNA was obtained using a direct lysis buffer. Sensitivity, specificity, negative predictive value, and positive predictive value were calculated using physician collected HPV detection and genotyping as the gold standard method.

Fig. 2
figure 2

Workflow for evaluation of validity of self-sample HPV screening compared to physician collected HPV screening

Statistical analysis

Statistical analysis was performed using Statistical Software for Social Sciences (SPSS version 20.0®). Mean and standard deviations were calculated for continuous variables. The kappa statistic was calculated to determine the level of chance agreement between different methods. Statistical significance was set at p < 0.05. The overall percentage of agreement between paired samples was calculated as the proportion of concordant sample sets divided by the total number of samples.

Results

Primary HPV screening and genotyping using PCR–RFLP

A successful PCR–RFLP-based strategy for HPV detection and genotyping (for HPV 16 and 18) was implemented, using a model system comprising HPV-infected cancer cell lines and HPV 16 and 18 recombinant plasmids (Fig. 3).

Fig. 3
figure 3

HPV DNA detection using PCR was established using cell culture model system consisting four HPV infected cancer cell lines (Caski, HeLa, SiHa, C4) and HPV recombinant vectors for HPV 16 and HPV 18. a Amplification of the internal control globin gene was used to confirm the integrity of the samples. b, c Amplification of the HPV DNA using GP5+/6+ and MY09/11 primers. d H16 and H18 are showing the banding pattern for HPV 16 and 18 genotyping with RsaI based RFLP. The theoretical sizes of each band of the HPV 16 reference banding pattern are 310 bp, 72 bp and 70 bp. The theoretical sizes of each band of the HPV 18 reference banding pattern are 135 bp, 125 bp, 85 bp, 72 bp and 38 bp. e PCR–RFLP analysis of HPV infected clinical specimen. Lane 1, 2, 5 and 7 are showing HPV 16 genotype isolated from clinical samples. Lane 6 showing HPV 18 isolated from clinical samples. H16: HPV 16 reference banding patter. H18: HPV 18 reference banding pattern

The RsaI enzyme showed high digestion efficiency, and no partial digestions were visualized under the RFLP conditions mentioned above (Fig. 3D).

Comparative genotyping of HPV positive clinical samples with PCR–RFLP and DNA sequencing

Genotyping results of 20 selected HPV positive samples with in-house developed assay were further verified by sequencing of GP-PCR products. There was a 100% concordance for detecting HPV 16/18 genotyping using PCR–RFLP and direct sequencing. However, the overall concordance between the two methods was 90%. Two samples that could not be genotyped using RFLP because of faint bands in PCR were sequenced and confirmed as non-HPV 16/18 types. The exact genotypes could not be determined in multiple infections with either method (Table 1).

Table 1 Comparative genotyping results of RFLP analysis and DNA sequencing HPV genotype identified by sequencing

HPV prevalence among clinically normal female cohort

The mean age of the 433 participants in primary HPV screening was 40.65 ± 9.5 years. The mean age at first sexual intercourse was 25.4 ± 4.8 years. All were non-smokers. Of these women, 18.3% were nulliparous, and 64.7% had no contraceptive use. None of the women had been previously vaccinated against HPV. Of the 433 women, 64.7% had no any contraceptive use, 12.8% had an intrauterine contraceptive device (IUCD), 14.3% were on oestro-progestins, and 8.1% had undergone tubal ligation. About 42% of the women were housewives, and 48.2% were employed (Table 3).

Only 426 samples were considered satisfactory for evaluation using the HPV DNA test (426/433), and the overall prevalence of HPV was 5.8% among clinically normal women (Table 2). The highly oncogenic HPV 16/18 genotypes constituted 52% of high-risk HPV infections in the study cohort. Two cases of multiple infections were reported. Two samples could not be genotyped using RFLP because of faint band in the MY PCR. However, they were confirmed to be non-HPV 16/18 genotypes using sequencing. Normal Pap cytology was observed in 80.5% of the cohort; however, minor cytological abnormalities like 4% of ASCUS, 1.5%, ASC-H and 1.1% LSIL were observed. Only 1.8% of the women had glandular cell atypia and 11.4% of the PAP samples were unsatisfactory for evaluation.

Table 2 HPV prevalence and type distribution among clinically normal female cohort

The prevalence of HPV was highest in the 20–29 age group. The HPV prevalence decreased with increasing age from 30 to 49 yrs. The HPV prevalence again seen higher in the 50–60 age group. There was no statistically significant relationship between HPV prevalence and age, employability, parity, or ethnicity (p > 0.05). However, high HPV prevalence rates have been reported among employed women and women of high parity. A High HPV prevalence was also reported among females who had their first sexual exposure below the age of 19 yrs (Table 3). The HPV DNA test developed in-house detected all high-grade lesions (high grade squamous cell abnormality (HSIL)/cervical intraepithelial neoplasia 2 (CIN2) or worse) with 100% sensitivity and 90% specificity.

Table 3 Distribution of HPV infectious status by socio demographic factors

Comparison of self-sampling vs physician collected samples

The HPV screening assay was further evaluated for 68 self-collected samples. The mean age of study cohort was 45 ± 7 years. The mean age at the first sexual intercourse was 25 ± 4 yrs. The peak age of respondents was 40 years. Normal Pap cytology was reported in 75% of the cohort; however minor cytological abnormalities of 10.3% of ASCUS, 1.5% of LSIL were observed. A small number (7.4%) of the samples were unsatisfactory for further evaluation. Two females had HSIL/CC in the studied cohort.

All the females who were invited to collect a self-sample for HPV testing accepted to perform self-sampling. In total 68 females provided both physicians collected cervical brush sample and self-collected cervico-vaginal fluid sample for HR-HPV testing. Eight of the self-sampled specimens were excluded from the analysis, as the specimens tested negative for β-globin. The HPV prevalence reported in self and physician collected sample types was 8.3% and 5.9% respectively (Supplementary Table 1). The overall concordance between self-collected and the physician collected samples was 93%. However, only a moderate agreement was reported between self and physician collected sample HPV detection with a κ (Kappa) = 0.47, 95% (CI 0.434–0.492). The overall sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) for HPV detection in self-samples were 75%, 96%, 60%, and 98%, respectively. The sensitivity and specificity, PPV and NPV for detecting CIN2+ with self-sample HPV testing were 100%, 90%, 40% and 100%, respectively. Therefore, no underlying cervical high-grade lesions were missed. Out of five samples HPV positive in self-collected samples, four were HPV 16/18 infected. Among the four physician-collected samples, only two were HPV 16/18 infected. There was an overall 60% concordance between HPV genotyping results of self and physician collected samples. However, self-sampling HPV testing and genotyping had 100% concordance in detecting HSIL or worse, compared to physician collected sample HPV testing and genotyping (Supplementary Fig. 1).

Discussion

There is a limited provision for HPV screening in many developing countries worldwide, including Sri Lanka and the current screening rates remain unsatisfactory. Therefore, there is a need to introduce simpler sampling methods and to implement rapid and cost-effective molecular diagnostics for HPV detection. PCR-based RFLP analysis, as tested in this study, seems to be a straightforward and economical tool for characterizing clinically important HPV subtypes, such as HPV 16 and 18, in genital HPV infections. Notably, this method doesn’t demand sophisticated or expensive equipment, making it applicable to settings with limited financial resources.

The GP-PCR assay method utilized for primary HPV screening demonstrated the capability to detect a wide range of both known and unknown HPV genotypes [29]. In the present study, the results of the GP-PCR were in good agreement with the cytological diagnosis of HSIL/CC, reporting a clinical sensitivity of 100% and specificity of 90% for detecting CIN 2+. Importantly, prior research has already established the non-inferiority of this method compared to commercial HPV detection platforms [30]. This study evaluated an RsaI based PCR–RFLP to distinguish the two most common high oncogenic risk HPV types, HPV 16 and 18, from other high-risk types as described previously [21], a capability often not achievable even with many proprietary kit-based HPV detection methods. The patterns generated were clearly distinguishable in common, inexpensive polyacrylamide gels, making it comparable with direct sequencing of PCR products or commercial qPCR methods. All HPV 16/18 genotypes identified using the PCR–RFLP method were consistent with the DNA sequencing results. Several previous studies have performed PCR–RFLP analysis to distinguish high-risk genotypes [21, 22, 27]. However, many of these studies used multiple restriction enzymes which complicated the analysis for routine diagnostic application [27]. In the present study, the use of a single restriction enzyme reduced the complexity of the analysis. The banding patterns were not shared with any other known high-risk types and hence were distinguishable among known and common HR-HPV genotypes [22]. Similar to our study, the PCR–RFLP method using the RsaI restriction enzyme was employed in a study in Brazil to identify high risk HPV genotypes in cervical fluid samples. It also showed that this technique is highly sensitive and can be used to discriminate the presence of HR-HPV types such as 16, 31, 39, 51, 58, 68, and LR-HPV types 6b, 11, 44, and 55 [2]. Thus, the RsaI based PCR–RFLP can be used for accurate genotyping the HPV 16 and 18 types in a limited-resource setting, without the need for additional costly sequencing or hybridization procedures. According to the sequencing results, HPV genotyping results can also be used to identify other genotypes such as HPV 33, HPV 58 and HPV 90 (Table 1). However, it is necessary to maintain stringent quality control measures to avoid any risk of DNA carry over contamination while performing this technique. We used sterilization methods such as ethanol disinfection and UV radiation of biosafety cabinets before and after sample processing. For each batch of samples, DNA extraction and PCR negative controls were included to monitor for any contamination. Furthermore, samples were processed in small batches, typically comprising around 10 samples per batch, to enhance control over the process and minimize the likelihood of contamination. These rigorous procedures were essential components of our methodology to ensure the reliability of the results. The crude prevalence of cervico-vaginal HPV infection among the studied cohort consisting of married women 21–65 years of age was 5.8%, and the HPV prevalence among females with normal cytology was 3.2%. Interestingly, there are no existing reports on population-based HPV prevalence in Sri Lanka or on the prevalence of HPV among females with normal cytology. A community-based study conducted in Gampaha district, Sri Lanka in 2012, involving 2000 females, reported an HPV prevalence of 3.3% among married women aged 20–59. This study remains the sole investigation into HPV prevalence in Sri Lanka to date [8]. The current study revealed a 3.2% prevalence of HPV 16/18 types in clinically normal females, while the study in Gampaha district reported a 1.2% prevalence. Among the reported HPV infections, HPV 16 and 18 were the most common genotypes identified, both of which are vaccine-preventable. We also detected significant number of non HPV 16/18 genotypes in the clinically normal screening cohort. Therefore it will be interesting to study the contribution of these genotypes to occurrence of cervical lesions or cancer among Sri Lankan females in a future study. Nevertheless, both studies agreed to the fact that the highest HPV prevalence is in the 19–29 age group.

The HPV screening and genotyping workflow was extended to include HPV detection in the self-collected samples. In this study, a higher level of agreement was observed between the self-collected cervico-vaginal specimens tested for HPV DNA and physician-collected cervical samples tested for HSIL or worse (CIN2+). These findings suggest that a self-sampling HPV DNA test could serve as a primary CC screening method, especially in settings with limited resources or where there is reluctance for physician-directed screening due to social taboos. Self-collected samples HPV analysis also circumvents the need for DNA extraction, offering a more streamlined approach to HPV genotyping utilizing PCR–RFLP, further eliminating the laboratory risk of DNA carry over. Some self-samples tested negative for beta globin (12%, 8/68) relative to physician-collected samples, possibly because they were not familiar with the procedure, as none had used the device before. Therefore, strategies to enhance awareness of the sampling procedure among Sri Lankan females are crucial before introducing self-sample-based HPV screening into routine clinical setups. There are no prior reports on the validity of self-sampling HPV detection compared to physician-collected sample HPV detection among a cohort of Sri Lankan females. Notably, the cervical cancer screening coverage remains around 10% among high risk groups in Sri Lanka [14]. One main reason for such low coverage is due to the social stigma for cervical cancer screening with physician collected samples. Therefore, the findings of this study show that self-sampling presents an intriguing alternative for cervical cancer screening, particularly in enhancing screening coverage. This approach could potentially reach underserved populations and contribute to improving overall cervical cancer detection rates in Sri Lanka.The HPV prevalence reported among the self-samples was higher than physician collected samples, and the results were consistent those of previous studies that reported higher HPV prevalence among self-collected samples using the Cobas 4800 PCR based assay [16]. The higher prevalence of HPV among self-sampling compared to physician-collected samples may be attributed to self-samples containing clinical material from the vaginal epithelium, which can include many low-risk HPV types, subsequently reducing the specificity of self-sampling HPV testing [16, 18]. False-positive HPV results with self-collected samples may lead to increased referral rates of HPV-positive but cytology-normal females to colposcopy. Secondary tests, such as triaging with HPV mRNA expression or other molecular markers with higher performance that assess not only the presence of HPV infection but also the risk for oncogenesis, could help reduce the number of such colposcopies [25].

According to the results, self-sampling based HPV detection using GP5+/6+ PCR followed by RFLP is non-inferior to physician collected samples in detecting HSIL or worse. The self-sampling HPV detection showed an overall concordance of 93% with a moderate agreement of Kappa value of 0.47 with physician collected sample HPV detection. According to the literature, this is among the few studies that used a clinically validated GP-PCR based RFLP method for HPV screening and genotyping among self-collected samples in low-resource clinical settings. Interestingly, a study conducted in South India among women with histologically confirmed CC using PCR based screening showed similar agreement values (93%) between physician and self-collected sample [17]. The self-sampling has shown increased acceptance among non-attendees around the world. However, conducting a large-scale population-based study for HPV screening using self-collected cervico-vaginal samples is recommended to gain more insight into employing self-sampling HPV testing in resource-limited settings as a primary CC screening method.