Introduction

Accurate optic reproduction of the surgical field at orders of magnification is essential in many contemporary surgical procedures. Traditionally, the gold standard in otology surgery has been microscopy, which gives natural depth perception with binocular vision, excellent illumination and magnification, and the ability to work with two hands [1].

However, use of the traditional binocular eye-piece microscope requires fixed head positioning to optimise the microsurgical field, which has been demonstrated to compromise operating surgeon health [2]. Additionally, many procedures in otolaryngology are aerosol-generating, and during the COVID-19 pandemic, it has become necessary to wear personal protective equipment (PPE) to minimise risk in operating. Current PPE eye protection has been recently reported not to be compatible with the use of a microscope [3]. Therefore, current use of a microscope in many otology procedures requires consideration of mitigation strategies for preservation of the safety of the surgeon.

The Deep Reality Viewer (DRV) platform integrates an optical stereo microscope with a digital stereo camera module and the 3D DRV display (as shown in Fig. 1). This novel technology offers a “glasses-free 3D experience with true depth perception, extra wide field of view, high-definition resolution and excellent subject clarity” (Vision TriTeQ3) [4]. Optical and digital technologies in the 3D display combine to project two independent optical channels, so that each eye sees the subject at a slightly different angle. This replicates natural stereo cues and depth perception, providing the surgeon with a bright, high-resolution and ghosting-free stereo image, whilst allowing the user to wear prescriptive eyewear freely. The technology comprises two LCD panels, one showing the left eye’s view and the other the right, which are enclosed inside the device and, using a series of sophisticated optics, projects the stereo image to the user’s eyes. The platform allows the surgeons to operate with both hands, as with the microscope, but facilitates ergonomic posture through a heads-up display setup similar to endoscopic surgery. This allows a more natural, confident and comfortable experience. The DRV platform has not yet been used intraoperatively by any surgical specialty.

Fig. 1
figure 1

A Participant performing tympanoplasty graft placement, ossiculoplasty placement and mastoid drilling on a human cadaveric head using the DRV. B DRV display

This study aims to evaluate a novel 3D digital stereo viewer, the Deep Reality Viewer (DRV), in otology surgery, in comparison to both a 2D monitor and the gold standard of microscopy.

Methods

Study Design

This was a prospective clinical research study assessing the visual and practical applications of the DRV in comparison to microscopy and a 2D monitor display.

ENT consultants and trainees from the west of Scotland (NHS Greater Glasgow & Clyde and NHS Lanarkshire) were invited to participate in the validation study across two arms.

Participants completed a pre-study questionnaire detailing their grade and level of experience with otological procedures. The model that each candidate assessed first was also recorded.

Study Participants and Procedures

In visual assessment, 11 ENT consultants and trainees viewed pre-recorded in vivo mastoid exploration displayed on a 2D monitor and the DRV screen (Vision Triteq Ltd, Woking, UK. https://www.visiontriteq.com). Participants were randomised in which order the videos were viewed.

In practical assessment, nine ENT consultants and trainees performed otology surgical tasks on a cadaveric human head using both the microscope (Zeiss OPMI 11) and DRV. Participants were randomised in which order the systems were used first using a random number generator. An image of the DRV system in use is attached in Fig. 1.

Outcome measures

Face, task-specific and global content outcome assessments using five-point Likert scale questionnaires were the primary outcome measure in both arms. Construct validity was assessed separately.

For face validation, all participants were asked to rate each display using five-point Likert scale (‘very poor’ to ‘very good’) in each of five face validity outcomes (Table 1), and to provide qualitative assessment of strengths and weaknesses.

Table 1 Validation outcomes for face, task-specific, and global content analysis in both arms of study

Task-specific validation differed between arms of the study. In visual assessment, participants were asked to rate the statement, “This display gives effective visualization of the surgical field during mastoid exploration to allow good visualization of the following steps” using five-point Likert scale (‘strongly disagree’ to ‘strongly agree’) for each of six task-specific validity outcomes (Table 1), and to provide qualitative assessment of strengths and weaknesses. In practical assessment, participants were asked to rate the statement, “This display gives effective visualization of the surgical field during the operation to allow the following steps to be performed” using five-point Likert scale (‘strongly disagree’ to ‘strongly agree’) for each of six task-specific validity outcomes (Table 1), and to provide qualitative assessment of strengths and weaknesses.

For global content validation, all participants were asked to rate the statement, “The quality of image display effectively allowed the following actions” using five-point Likert scale (‘strongly disagree’ to ‘strongly agree’) for each of five global content validity outcomes (Table 1).

Construct validity was assessed by measuring time taken to perform ossiculoplasty and tympanoplasty by microscopy and using the DRV in a cadaveric human head (practical assessment). Data collection and analysis was stratified according to surgeon experience.

Analysis

There was a pre-determined validation threshold of four out of five on the Likert scale.

In visual assessment, descriptive statistics and Wilcoxon signed rank tests were used. In practical assessment, descriptive statistics, and Wilcoxon signed rank tests were used for face, task-specific content and global content validity assessments, and paired t tests were used for construct validity assessments, after confirming normality of the data. Statistical analyses were conducted using R statistical software through R studio (version 2021.09.0) [5]. P values < 0.05 were considered significant.

Compliance with ethical standards

This study used de-identified samples of human tissue provided by the University of Glasgow Clinical Anatomy Skills Centre (CASC). This study was deemed exempt from specific approval. CASC protocols are in accordance with the ethical standards of University of Glasgow, with the Human Tissue (Scotland) Act (2006), and with the 1964 Helsinki Declaration and its later amendments or comparable ethical standards.

Results

This study had two arms, with the visual assessment having eleven participants, and the practical assessment having nine participants. Participants were either consultants or trainee surgeons (ranging from junior Foundation year two trainees to senior ST8 Specialist registrars).

The visual assessment had four consultants and seven trainees participating. The practical assessment had five consultants and four trainees.

2D screen vs 3D DRV: visual assessment

Participants compared the face, task-specific content and global content validity between the 3D DRV viewer and a 2D display screen when observing a simulated mastoidectomy. All parameters for 3D viewer achieved the validation threshold of 4 for face, task-specific and global content assessments. The mean overall scores for the 3D viewer were 4.7, 4.9 and 4.9, for face, task-specific and global content validity, respectively.

The 2D viewer achieved lower validation scores and failed to reach the equivalent threshold of 4 set for the 3D viewer when assessing ‘colour contrast’, ‘depth perception’, ‘mastoid drilling’, ‘economy of movement’, ‘operative progression’ and ‘surgical planning’. In addition, overall face validation scores were below the required threshold (Likert score = 3.9), while overall validation scores for task-specific and global content validity were 4.2 and 4, respectively.

In direct comparison to the 2D viewer, the 3D viewer significantly outperformed it in all parameters except ‘ease of viewing’, and ‘raising a posterior tympanomeatal flap’.

Data are represented in bar charts for face (Fig. 2A), task-specific content (Fig. 2B) and global content validity (Fig. 2C).

Fig. 2
figure 2

Visual assessment outcomes for face validity (A), task-specific content validity (B) and global content validity (C) outcomes: 2D vs. 3D DRV display demonstrating mean Likert scale scores. Error bars represent the maximum to minimum values. The dotted red line represents the validation threshold. Significant results are marked with a red asterisk

Of note, there was no difference between trainees and consultants in terms of their opinion on the 3D viewer.

Microscope vs. 3D DRV: practical assessment

The 3D display achieved the validation threshold of 4 for all face, task-specific and global content parameters assessed.

In comparison to the microscope, there was no significant difference in face validity outcomes using the 3D DRV: image quality, colour contrast, depth perception, ease of viewing and overall face validity assessment were all deemed to be of equal quality in comparison to the microscope (Fig. 3A). Of note, the 3D viewer scored higher in all parameters, however, no statistical significance was demonstrated.

Fig. 3
figure 3

Practical assessment outcomes for face validity (A), task-specific content validity (B) and global content validity (C) outcomes: microscope vs. 3D DRV display demonstrating mean Likert scale scores. Error bars represent the maximum to minimum values. The dotted red line represents the validation threshold. Significant results are marked with a red asterisk

In task-specific validity outcomes, the 3D viewer was significantly better for ‘identifying middle ear anatomy’ (microscope mean = 4.2, 3D mean = 4.9, W = 0, p = 0.02) and ‘overall’ (microscope mean = 4.3, 3D mean = 4.9, W = 0, p = 0.037) (Fig. 3B). The 3D viewer performed better in other tasks ‘EUA of EAC’, ‘mastoid drilling’ and ‘ossiculoplasty PORP placement’ although these differences were not statistically significant. No difference was found in tympanoplasty graft placement between microscope and 3D viewer.

In global content validity outcomes, the 3D viewer was significantly better at ‘defining anatomy’ (microscope mean = 4.3, 3D mean = 4.9, W = 0, p = 0.037) (Fig. 3C). The 3D viewer also scored higher for ‘economy of movement’, ‘operative progression’, ‘surgical planning’ and ‘overall’, although these were not statistically significant.

Both tympanoplasty graft placement and ossiculoplasty PORP placement procedures were timed for each participant. While both of these procedures were performed more quickly with the microscope in comparison to the DRV, these outcomes were not statistically significant.

Construct validity assessment

Construct validity aims to determine if the assessment of the DRV differed according to level of experience of the participant. The timed assessments demonstrated no difference between trainees and consultants in terms of speed performing tympanoplasty either using the microscope (trainees = 86.5 s, consultants = 63.8 s, t = 0.87, df = 3.38, p = 0.44), or using the DRV (trainees = 116 s, consultants = 95.2 s, t = 0.43, df = 4.38, p = 0.69). Similarly, speed performing ossiculoplasty using the microscope (trainees = 104.5 s, consultants = 87.8 s, t = 0.78, df = 6.38, p = 0.46), and the DRV (trainees = 137.25 s, consultants = 78.2 s, t = 1.02, df = 3.31, p = 0.37), were also not significantly different.

In addition, when analysing questionnaire Likert scale scoring outcomes, there was no significant difference between trainees and consultants in terms of their assessment of the DRV. Construct validity outcomes are demonstrated in Fig. 4.

Fig. 4
figure 4

Comparison of time to completion of tympanoplasty and ossiculoplasty using a microscope and the 3D display, respectively.

Discussion

This study has demonstrated superiority of the DRV system in comparison to the traditional microscope view in several key parameters when performing simulated otological procedures. It was found to be superior to a 2D screen when visualising a procedure in almost all domains. In comparison to microscope use during a practical assessment, the DRV was rated higher in all domains measured. It was deemed particularly useful for defining anatomy (task-specific—defining middle ear anatomy, global content—defining anatomy), and improving overall progression of operative steps (overall task-specific evaluation), having demonstrated significance in these parameters.

The improved performance demonstrated by the 3D DRV in this study, particularly in comparison to the 2D visual assessment, and when defining anatomy, demonstrates the importance of 3D visual assessment intraoperatively. Preservation of 3D visualisation of the surgical field is essential in establishing an effective alternative to traditional microscopy. There is clear evidence of superior operator performance in comparison to two-dimensional (2D) visualisation. A 2018 meta-analysis showed significantly reduced operating time, less blood loss, and shorter hospital stays with 3D versus 2D endoscopic laparoscopic or thoracoscopic surgery [6]. Wilhelm et al. [7] used electromagnetic instrument pathway tracking to demonstrate an approximate 20% improvement in task performance for surgeons using a 3D compared to 2D display during laparoscopy [7]. Two systemic reviews comparing 3D and 2D laparoscopy suggest improved speed, and critically, reduced incidence of performance error with 3D technologies [8, 9]. Similarly, the benefit of 3D operating has been demonstrated within the field of ENT. In simulated functional endoscopic sinus surgery, 3D endoscopy resulted in improved operative speed, and a reduction in operative errors [10]. Similarly, in endoscopic ear surgery, a developing field within ENT, 3D endoscopy has demonstrated improved perceived stereoscopic vision and depth perception in comparison to 2D endoscopy [11, 12].

There were other important results found. No significant difference was found between the time taken to perform tasks on the DRV compared to the microscope. This is an important finding, as this was the first time any of the study participants had used the DRV, whereas all had used a microscope before. This suggests that DRV allows users to quickly become accustomed to using it. In addition to this, there was no difference found between the time it took consultants to complete the procedure compared to trainees when using the DRV. The range in experience and operative skill amongst our cohort is beneficial to this study, because it highlights the applicability of this technology to novice trainees and experts alike. It is important to note that these tasks only assessed the physical act of carrying out the procedure and did not assess operative planning as a whole. Felisati et al. [13] also identified a very rapid learning curve, with surgeons quickly developing satisfaction with a 3D system during endoscopic skull base surgery [13].

Although not specifically assessed in this study, a device must allow its user to carry out tasks repeatedly without causing injury to the user. Technologies facilitating heads-up surgery aim to avoid health issues associated with use of binocular eye-piece microscopes, particularly in specialties which rely heavily on operating microscopes. A 2015 survey of 518 UK consultant ophthalmologists reported that 62.4% of respondents had neck or back pain, and 78.7% found that operating exacerbated their pain. Of those with pain, 57.1% had used non-steroidal anti-inflammatory drugs (NSAIDs), 40% had received physiotherapy, and 4.9% had undergone surgery [2]. A recent comparative assessment of ergonomic experience in heads-up display (HUD) versus conventional microscopy showed a significant reduction in pain and discomfort with increased use of HUD, and identified heads-up operating as an important tool for surgeon wellness [14]. This replicated findings of a 2020 systematic review [15].

3D endoscopy is currently the only 3D alternative to microscopy in otology surgery but has a number of limitations both in instrument design and existing viewing technologies. The primary shortfall of endoscopy is that it requires dissection with one hand without suction, which is more challenging [1]. It also requires user experience in fulcrum effect of the endoscope, with a learning curve of approximately 70 cases evident [11, 16]. In 3D endoscopy, there is the added requirement to achieve and maintain a “stereoscopic window” or “comfort zone” to gain maximum benefit [17]. Current heads-up displays also rely on the use of polarising glasses, which are awkward to wear with prescription eyewear, and can result in deterioration of image quality. Additionally, a proportion of surgeons have been reported to suffer from side effects of incongruency in digital stereopsis (fatigue, dizziness, headache, eye strain) [17, 18]. Recent studies in otology have suggested exoscopes may be a useful alternative to endoscopy in heads-up surgery, with the added benefit of avoiding a ‘one handed’ approach. However, these report significantly worse visibility in deeper parts of the surgical field, or requirement to wear 3D eyewear [19, 20]. The use of DRV removes all of these barriers, providing an ergonomic heads-up display, with dual-hand operating, without the need for 3D glasses and the consequent eye strain.

The main limitation of our study was its small cohort size, reducing the statistical power of the analyses. However, smaller cohort sizes are common in simulation-based studies such as this one, and the study still demonstrated significance in several fields. Our study also only analysed two procedural steps within a tympanoplasty procedure and results, therefore, may not be able to be extrapolated to entire otological operations.

Conclusion

The 3D DRV achieved the validation threshold for all parameters assessed, and it outperformed both the 2D screen as well as the microscope in several of these validation parameters. This indicates that the DRV is a safe, and in some criteria, a more effective operative tool for performing otological procedures. It also suggests that it would be a useful alternative to the gold standard of microscopy in otology surgery. This study supports the assessment of the DRV system in the operating theatre for otology surgery.