Introduction

The current outbreak of COVID-19 has instigated the need for adaptation of several aspects of modern life with consideration for social distancing, since the aerosolized particles of SARS-Cov 2 virus can remain airborne for up to 3 hours and through speaking, coughing, and sneezing contaminate surfaces for several hours/days [1]. Mitigation strategies include horizontal isolation, which directly affects the functionality of several primary services, including health care. Dental practices usually require the use of high-speed rotational drills cooled with water, which widely spreads aerosols all over a small office with limited ventilation. Infectious aerosols were considered a key etiologic factor in prior coronavirus outbreaks [2]. In addition, direct or indirect contact with exposed mucosa is related to a higher risk for transmission [3]. Previous studies reported a high rate (91.7%) of SARS-Cov 2 in the saliva [4] even before lung tissue involvement [5]. Given these circumstances, elective dental procedures are strongly recommended to be postponed.

Recently, recommendations had been made regarding the urgent need of reorganizing services enrolled in the diagnosis of head and neck cancer since the delay histological diagnosis can delay treatment and have an adverse affect on patient prognosis [6]. In this context, incisional/scalpel biopsies remain the standard procedure and should be prioritized as usual despite the risk of SARS-Cov 2 transmission. The need for histological processing of surgical specimens requires pathology laboratories to maintain and monitor their workload, operating with a few staff members (dependent upon availability of safe space). Technology can mitigate this issue with a single additional step of glass slide digitalization which can be easily conducted by one technician. Evidence shows that whole slide images (WSI) are suitable for histopathological diagnosis with a non-inferior performance to light microscopy in several subspecialties of human pathology, including head and neck pathology [4, 7, 8]. Digital pathology can also be a really useful tool for remote discussion of these cases in Multidisciplinary Team Meetings (MDTMs) to plan patient management, in addition to sharing with trainees and consultation with other pathologists.

Despite the strong evidence regarding the diagnostic usefulness of WSI systems in several subspecialties of human pathology, its utilization for primary diagnosis remains limited to only a few services in the UK [9,10,11], Netherlands [12], and Spain [13]. However, the need for maintaining pathology services despite disruptive challenges such as the COVID-19 pandemic has reinforced the need for digital pathology as an alternative and reliable diagnostic method. Subsequently, a recent survey of experience in the UK [14] shows that several services are already conducting validation studies for the implementation of digital pathology. In addition to digital pathology validation (of histology), the current outbreak has also accelerated remote reporting by pathologists outside their usual work environment, highlighting the need for remote report validation pathology [15, 16].

We have previously validated the use of WSI for primary diagnosis in oral and maxillofacial pathology [4]. In this study, we propose validation of remote WSI assessment and reporting for the diagnosis of oral and maxillofacial pathologies to substantiate the creation of the first fully oral digital pathology laboratory in Brazil.

Materials and methods

Adaptations required in laboratory infrastructure and workflow

Professionals enrolled in laboratory workflow tasks had their temperature routinely monitored and any occasional symptoms were kept under close surveillance, and staff rotation was adopted to avoid over exposure to the virus, since the correlation among viral load and symptoms/prognosis is still not clear [17, 18]. Pathology labs are usually designed to constrain fire incidences and are equipped with refrigeration and air-conditioning making them poorly ventilated which can contribute to aerosol suspension. Therefore, some adaptations to the infrastructure and workflow of the pathology laboratory were made, and additional precautions were taken to provide a safe work condition for the staff as seen in Fig. 1: (1) reduction of the number and circulation of staff in the laboratory (one pathologist (1a) and one technician (1b) were enrolled in gross analysis), reduction of the frequency of staff exposure (the team should address a schedule with rotational shifts); (2) exhaust systems—fume hood and exhaust vents—were kept functioning to minimize the potential suspension of microdroplets and a HEPA filter was installed. (3) Disinfection of surfaces, floor, and shoes of the staff prior laboratory use were encouraged; (4) disinfection of plastic containers containing biopsy tissue was reinforced due to the high risk of contamination while the clinician manipulates these recipients within surgery and given the time SARS-Cov 2 remains viable on this material [19]. (5) After processing the material, masks were also disinfected before being discarded [20]. For appropriate disinfection, a solution with 62–71% ethanol or 0.1% sodium hypochlorite within 1 minute is indicated [17, 21]. (6) Stringent and regular handwashing by all staff members was also reinforced. Further steps of processing only required one experienced technician including (7) automated tissue processing; (8) embedding within paraffin blocks; (9) tissue sectioning; (10) staining; and (11) scanning.

Fig. 1
figure 1

Adaptations on pathology laboratory workflow and suggested precautions for the staff members

The majority of adaptations in the laboratory work flow were related to work security. There were no changes in quality control procedures for gross examination, processing, embedding, microtomy, and staining. To ensure digital image quality control, an additional step of macroscopic evaluation of glass slides previous to scanning was conducted including assessment of clean, well placed and dried coverslips, as well as artifacts/bubble-free slides. After scanning, the digitalized images were checked for out of focus, missing tissue, and striping artifacts. The scanned slides were then uploaded to the institutional server and made available through a link, which redirected the pathologist to a web based WSI viewer enabling online visualization, navigation, and flipping functions.

Study design

The methodology for this study is based on the guidance for remote reporting of digital pathology slides during periods of exceptional service pressure [22], previously detailed in the RCPath guidance for digital pathology implementation [23]. We undertook the present learning process of validation for nine months to simultaneously train 8 PhD students for remote reporting of oral and maxillofacial pathologies. This report, however, is based on the learning process of 3 trainees and the lead pathologist, which evaluated the same set of cases during a 5-week period experience with a washout period of 1 month [24], as we considered our team sufficiently trained and aligned after almost 9 months of digital pathology diagnostic experience for primary diagnosis purposes. To further consolidate the learning curve and to ensure risk reduction strategies, digital pathology diagnosis and conventional microscope diagnosis were compared when challenging cases or cases with malignant clinical suspicion were assessed. These cases were deferred to glass and reviewed by the lead pathologist using conventional light microscopy (CLM). These risk reduction strategies were associated with expert interconsultation when needed [22].

The sample comprised oral biopsies, consecutively and prospectively selected from the cases processed in the Pathology Laboratory of FOP-UNICAMP for 5 weeks. Only first-time evaluation cases were considered in this analysis. Cases previously assessed and, those which had required further evaluation were only included when the entire set of digital slides (hematoxylin and eosin (H&E), immunohistochemical and special stains) for each such case were evaluated within the 5 weeks period. A total of 162 glass slides from 109 patients were included in this validation and were assessed by all 4 pathologists in both analyses (digital and conventional) to allow intraobserver comparison.

The glass slides were scanned using the Aperio Digital Pathology System (Leica Biosystems, Wetzlar, Germany) with a spatial sampling of 0.47μm per pixel, with automated focusing and magnification at ×20. WSI included all of the tissue present on conventional glass slides [24]. The workstation specifications are listed in Table 1. Prints of each screen are shown in Fig. 2 to demonstrate visual differences in the pathologist’s perception of colors and the brightness of the images. All equipment enrolled in this validation passed the University of Leeds Point of Use QA tool test (http://www.virtualpathology.leeds.ac.uk/research/systems/pouqa/) [22]

Table 1 Workstations’ specifications (hardware and graphics cards)
Fig. 2
figure 2

Prints of each screen enrolled in the present validation. a Lead pathologist; b pathologist 2; c pathologist 3; d pathologist 4

Histopathological routine workflow previous to the COVID-19 pandemic was conducted in a multi-header CLM 5 position and enhanced by the possibility to share the glass slide visualization via a television set, which receives live video images through a camera attached to the CLM. After the glass slide evaluation, a brief description of histological characteristics is usually provided by the lead pathologist and annotated by one trainee in a temporary sheet and by another trainee in the FOP-UNICAMP Anatomopathological Examination Request Website For Biopsies’ Records and Histopathological Reports (https://w2.fop.unicamp.br/patologia/index.php?sid=2) in the histological description field. Our online system provides a totally-digital environment, which enables pathologists to see the biopsy request form (as well as clinical photos, serological exams, and radiographs), to consult clinicians about details regarding the patient evolution and disease course, as well as to assess pending cases awaiting for reporting.

The diagnostic method used to validate remote WSI assessment was an online meeting platform (Google Meet). The lead pathologist shared his screen with pathology trainees, allowing a fully digital workflow for case discussion and remote reporting. Digital slides and clinical information registered on the website were also shown to all participants for clinicopathological correlation closely resembling the histopathological routine workflow previous to COVID-19 pandemic.

The digital assessment was conducted via an online meeting (Google Meet) for 5 weeks and, after a 1-month washout period, glass slides were analyzed to compare both methods. Diagnoses were classified as (1) concordant; (2) slightly discordant: no clinical or prognostic implications; or (3) discordant. Discordant cases were reviewed by the lead pathologist to reach a preferred diagnosis using both digital and conventional methods, and a stratification of the discordant cases was made as in (1) low degree of difficulty, (2) moderate degree of difficulty, and (3) high degree of difficulty. Additionally, the pathologists were required to list any problems (technical or case related) which may have impaired the digital and/or glass slide evaluation. The timing for each analysis was not assessed as a performance metric because it was necessary to register time individually (for each case) to provide median values as variables and, ask participants to assess their own analysis time would have added bias in this study.

Statistics

We assessed the global percentage of agreement (Po), and κ statistics (Fleiss’s Kappa) to establish the agreement between DM and CM since intraobserver agreement is the primary form of analysis and preferred measurement [24]. κ values of < 0.00 indicates poor agreement, 0.0–0.2 slight agreement, 0.2–0.4 fair agreement, 0.4–0.6 moderate agreement, 0.6–0.8 substantial or good agreement, and > 0.8 excellent or almost perfect agreement. The interobserver concordance was not considered in this analysis, since it is more suitable for evaluating a pathologist’s performance instead of the method’s performance. Statistical analyses were conducted using Real Statistics Resource Pack for Excell.

Results

The intraobserver metrics were calculated based on fixed categories of the final diagnosis and the interpretation of descriptive diagnosis was based on agreement on certain aspects of the report such as grading dysplasia, microinvasion, and specific structures as perineural invasion, pigmentation of basal layer, vascular structures, and presence of odontogenic components. The intraobserver agreement between DM and CM was considered almost perfect (κ ranged from 0.85 to 0.98, with 95% CI, ranging from 0.81 to 1). All intraobserver metrics—the percentage of global agreements for each pathologist, kappa statistics, disagreements, and error rate, as well as pathologist’s experiences and reported reasons for diagnostic agreements for each pathologist by method—are shown in Table 2.

Table 2 Intraobserver agreement metrics, pathologists’ experience and reported technical/case-related impairments

Among the lead pathologist’s disagreements cases, one was regarding a borderline case of severe dysplasia grade/OSCC diagnosis (which was deferred to glass and reported as OSCC), and the other encompassed a case of squamous cell carcinoma with a kerathoacantoma-like configuration, which was referred for remote interconsultation to a dermathopathologist. The most frequent pitfall from using this form of DM was lag in screen mirroring, followed by lack of radiographic and clinical photo/information. Despite not being frequent, some reported pitfalls caught included (1) necrosis, pointed as a pitfall for the diagnosis of OSCC for pathologist 2; (2) the need for a higher magnification to assess dysplasia for pathologist 3; (3) inflammation, pointed as a pitfall for the diagnosis of orthokeratinized odontogenic cyst (misinterpreted as dentigerous cyst) for pathologist 4. Among all 39 discordant diagnoses, the majority of preferred diagnoses were reached out by CM 42 (65.6%), while 30 (46.8%) were reached by DM. Individual percentage of preferred diagnosis is shown in Table 2.

The most significant and frequent disagreements within trainees encompassed epithelial dysplasia grading and differentiation between severe dysplasia (carcinoma in situ) and microinvasive OSCC. Among the 12 slides which required dysplasia grading for diagnosis, pathologist 2 had an intraobserver disagreement in 8 (66.6%) slides, pathologist 3 in 6 (50%) slides, and pathologist 4 in 9 (75%) slides. Special attention was required when assessing 7 borderline cases to establish a difference between a severe dysplasia (carcinoma in situ) or a microinvasive OSCC—in 2 slides for pathologist 2, none for pathologist 3, and in 2 slides for pathologist 4. Among all discrepant slides, 6 were common to all pathologists Table 3. Other disagreements common to all pathologists were related to the differentiation among (1) paracoccidioidomycosis and coccidioidomycosis fungal particles; (2) periapical granuloma and periapical inflammatory cyst; and (3) odontogenic keratocyst and dentigerous cyst.

Table 3 Discordant cases for all trainees, correspondent preferred diagnoses, and technical/case-related pitfalls

Discussion

The role of digital pathology in remote reporting has increased largely due to the COVID-19 pandemic. Prior to the pandemic, the extent of use of WSI in our institution was limited to teaching [25], pathology conferences, and occasionally consulting experts around the world. Additionally, cases enrolled in MDTM discussion or referred for interconsultations are not usually included in the digital workflow [14]. In our routine practice, interconsultations referred to us are almost always glass slide-based, while our cases referred to external pathologists are digitally shared through a secure link. MDTMs for expert interconsultation have broken down conventional geographic barriers, providing a more definitive diagnosis in a reduced time for challenging cases, and reducing costs associated with transportation of paraffin-embedded blocks/glass slides [16]. In this dataset, interconsultation was needed for one case in which a squamous cell carcinoma presenting in the neck was interpreted as a keratoacanthoma. According to Patel and coworkers [21], a wide variety of hardware specifications have been used worldwide for interconsultations (telepathology), even in suboptimal conditions. In the present validation, display specification was not ideal as stated by Williams (resolution of 3 megapixels or greater, 24 in. or more for a desktop display) but all equipment passed the University of Leeds Point of Use QA tool test (http://www.virtualpathology.leeds.ac.uk/research/systems/pouqa/) [24]. Experts advise that pathologists should be aware of display specifications (including luminance and contrast) and natural light interference in the digital image assessment not only for interconsultations but also for remote diagnosis in exceptional circumstances such as COVID-19.

Although we have previously conducted a validation study for primary diagnosis of oral pathology cases [4], the daily pathology practice has largely remained CLM-based. This scenario drastically changed when restrictive measures were introduced for staff safety in our institution. According to Browning, the number of pathologists using WSI for remote reporting is higher now than before the outbreak with validation processes due to start soon or almost completed in many institutions in the UK [14]. This highlights the need to maintain quality control standards while these essential health care services are being provided. Numerous reference/consult centers worldwide have already transitioned to or have introduced a fully digital pathology service with the advantages including the possibility of consulting an expert for difficult cases, better ergonomy, larger field of vision, improvement of the workflow, and quantification of prognostic parameters [12, 13]. In addition, the emergence of digital microscopy in a pandemic scenario not only offers a way to maintain essential health care but also contributes to a re-evaluation of the best practice in pathology. Back and neck pain associated with microscope use has been reported by pathologists as one specific issue which can be overcome by digitial pathology [14]. This same burden was also the initial point of a need for innovation on how reports are produced to further improve ergonomy [26].

Some studies have also stated that digitalization, despite being an extra step, allows better quality control, and also streamlines histotechnical processing of special and immunohistochemical stains. Continued education is also another important role of digital pathology when considering this unprecedented scenario [14, 15] and it must be based on best practice recommendations [27]. Pathology trainees conventional training and education now have the opportunity to be engaged in international meetings and discussions with experts around the world in their respective specialties and are likely to remain a valuable resource even in a post-pandemic scenario.

In the present validation study, the intraobserver agreement was considered almost perfect with Po = 98.7% for the lead pathologist. An excellent performance was also reported by Hanna and coworkers, who undertook a randomized, prospective study in which researchers had remote reporting validated with a high percentage of agreement (100%) among DM and CLM [15]. The error rate (ER) obtained when comparing DM diagnosis with a reference standard (in this study, the lead pathologist’s diagnosis after reviewing intraobserver discordant cases and establishing a preferred diagnose between them) should be 3%, and is acceptable if not higher than 4% [28]. The error rate of DM diagnosis is 1.2% for the lead pathologist, 8.6% for pathologist 2, 7.4% for pathologist 3, and 8% for pathologist 4. Despite the high ER for trainees, the lead pathologist’s ER suggests that experience may have a crucial role when evaluating WSI for primary diagnosis. Therefore, it is important not only to considerer individual aspects of lesions in such discordances (challenging, subjective, or borderline cases) [8] but also to take into account previous digital pathology experience.

The majority of disagreements identified in this validation enrolled borderline cases (differentiation between severe dysplasia (carcinoma in situ) or microinvasive OSCC, differentiation between squamous cell carcinoma and kerathoacanthoma), as well as dysplasia grading, similar to previous reports [7, 29]. These disagreements are related to the subjectivity of dysplasia analysis and may be explained by the experience of trainees and their previous background, especially if we consider the disagreements for each pathologist only in the dysplasia category (seven, eight, and five slides for pathologist 1, 2, and 3, respectively) and compare it with only one discordant slide for the lead pathologist (considered a challenging case with high degree of difficulty). Experience should be taken into account, as trainees had experience from different previous services and their interpretation may suffer from variations depending on how involved they were in routine diagnosis and their respective training time. The grade of difficulty in the interpretation of such cases is highly variable. One slide of epithelial dysplasia was classified as a high degree of difficulty case since it required new sections to allow a better assessment of dysplasia grade, while the other epithelial dysplasia WSI was considered a case of low degree of difficulty. It is also important to emphasize the importance of carefully analyze all the tissue present in the slide (conventional or digital) since it is expected that deeper sections would reveal different configurations of the epithelium in the same way that it is not unusual to observe different grades of dysplasia in two synchronous half of the same biopsy. Additionally, the presence of inflammation could lead to confusion in the differentiation between reactive epithelial atypia and oral epitelial dysplasia [30] (Fig. 3).

Fig. 3
figure 3

Discordant case number 50. The biopsy fragment presents extensive ulceration with associated inflammation in the dermis a. The amount of epithelial tissue in both extremities b and c presents prominent architectural and cytological changes that require higher power for a better assessment of mitotic figures, nuclear shape, number and arrangement of nucleoli (d and e), which can indicate a limitation of the current technology. Additionally, these hyperplastic edges can also present atypical changes that could be hard to differentiate among a re-epithelialization event or a true dysplasia [30], which could lead to different interpretations especially if we consider the learning curve of trainees, and the fact that the re-assessment of the cases occurs by CM. Discordant case number 161, the epithelium presents marked acanthosis and epithelial hyperplasia with cell pleomorphism and extensive inflammation in the dermis, which could be associated with more severe dysplasia changes (f and g). In this particular microenvironment, inflammation may be underestimated and some lesions interpreted as reactive epithelial atypia instead of oral epithelial dysplasia. The role of inflammation in this scenario is still unclear [30] and played a big role in the trainees’ interpretation. Additionally, this case required immunohistochemistry and special stains to rule out infections that could present a configuration of pseudoepitheliomatous hyperplasia, which lead to memorization bias of the correct diagnosis of this particular case, and consequently, a more accurate diagnosis by CM, the ultimate modality of cases re-assessment

Others disagreements were related to missing epithelium in digital slides (in those cases, an inflammatory odontogenic cyst was interpreted as a periapical granuloma by the DM), which is probably not related to misinterpretation, lack of knowledge or training time/experience, but to the lack of time and, sometimes, lack of patience to locate all fragments from a curettage biopsy on the screen, since this is not a case of high degree of difficulty. Molin and collaborators reported the lack of time as a reason for possible diagnostic discordance in one case of a missed fungus in as esophagus biopsy [26]. In this particular situation (missing epithelium), this slight discordance does not drastically interfere in treatment decisions. On the other hand, the differentiation among borderline cases such as severe epithelial dysplasia (carcinoma in situ) and OSCC (early stage), as well as dysplasia grading, presents a great dilemma, since it influences the treatment choice [31, 32] and management of pre-malignant lesions [32, 33].

An odontogenic keratocyst was misinterpreted as a dentigerous cyst on DM by pathologists 2 and 4 and on CM by pathologist 3 (as shown in Table 3). Despite trainees not having reported any challenges regarding this specific case, inflammation was identified in the reassessment of the case as a possible pitfall for this disagreement since it can cause epithelial changes. Another case of orthokeratinized odontogenic cyst, discordant only for pathologist 4, was also misinterpreted as a dentigerous cyst due to excessive inflammation that caused several reactive alterations [34, 35]. These cases are not of high complexity once the pathologist has the clinical course and information regarding previous procedures (curettage or marsupialization) that can cause inflammation and epithelium alterations.

Another disagreement common to all pathologists occurred in one case of paracoccidioidomycosis in which trainees diagnosed the fungal particles as coccidioidomycosis. A non-representative biopsy, with only a few larger than expected fungal structures (but still within the range of size compatible with the two hypothesis) without typical sprouts were raised as reasons for disagreement in this slide considered as a case of moderate degree of difficulty. Since both infections are clinically similar and present granulomatous inflammation, morphology of fungal structures and geographic patterns of endemic areas help to achieve the proper diagnosis. Sphere sizes in coccidioidomycosis are larger (10 to 80μm) with a double refractile outer wall containing several endospores (no sprouts) while in paracoccidioidomycosis, the sphere size is around 6 to 20μm, surrounded by sprouts [36].

In another case, discordant only for pathologist 3, the need for a higher magnification to assess dysplasia was reported. This technology pitfall has been reported by other groups in different evaluation aspects, for example, when assessing Helicobacter pylori in gastric biopsies or mitosis counting for breast [26]. Dysplasia grading is usually associated with its intrinsic subjectivity of analysis, but not currently associated with the need for higher magnification, since a contextual evaluation takes place based not only on cytological characteristics but also in architectural criteria and evaluation by comparing neighboring cells for pleomorphism. Cross et al. summarized a series of diagnostic difficulties pathologists may experience with assessing dysplasia, detecting metastasis and micrometastasis, identifying mitotic figures, eosinophils, and fine nuclear details highlighted as some of the challenges [22].

All participants in our study reported lag in screen mirroring as the most frequently reported pitfall. Participants in this study did not utilize the institutional virtual private network (VPN). The average home broadband connection in Brazil is around 4.84 Megabits per second (Mbps) and associated with the present resolution displays we achieved a good performance despite recurrent pixelation. The majority of challenges reported in the literature are related to internet speed and workstations limitations but without any impairment to the digital pathology workflow [14]. In one glass slide, the lead pathologist reported the lack of inflammatory cell details by DM [37]. According to Browning , the most reported challenge among students is related to technical difficulties in accessing WSI. Samueli and coworkers reported a survey in which students were trained in 4 modules: self-assigned reading, remote lecture (Zoom), quiz based on digital slide sets, and a frontal review of WSI (Zoom). Eighty percent of students reported technical issues while accessing slides as a challenge but despite that 67% stated they were more prone to attend online classes [14].

Conclusions

Given the challenging and unprecedented scenario of the current pandemic, adaptations in the laboratory workflow and digital pathology can aid to mitigate the impact of pandemic COVID-19 outbreak on oral and maxillofacial pathology laboratory routines. This validation study ensures the feasibility of the remote histopathology diagnosis while ensuring social distancing and limiting the spread of SARS-Cov 2.

Overview of results and pathologist perception

  • Flipping is a great advantage of DM (rotation of the image with a single click).

  • The wide view provided by a scanned image, automated focus, and easy navigation within different magnifications allows fast recognition of regions of interest, overcoming light, focus, and magnification handling issues, and characteristics of CLM.

  • Pathologists should be cautious to not miss important histological structures on DM when their confidence increases. By relying on the wide view provided by DM, pathologists may feel secure to give a diagnosis at a lower magnification, being prone to error—not a technology limitation.

  • Training time (experience) and calibration in pathology are crucial for good performance.

  • Reported pitfalls when using a digital environment were as follows:

    • Technology-related pitfalls: lag screen mirroring, lack of details of inflammatory cells, and need for a higher magnification to assess dysplasia.

    • Case-related pitfalls: bad quality clinical photo, challenging/borderline case, clinical information, and hypothesis do not relate with the histological characteristics, lack of clinical photo/information, lack of radiographs, misleading clinical diagnosis/hypothesis, necrosis, non-representative biopsy/small amount of tissue, need for special staining, the subjectivity of dysplasia analysis;

    • Technical processing-related pitfalls: artifact, fixation, the thickness of tissue section, inclusion, staining, and cases that required a deeper tissue sectioning.