Introduction

Retinopathy of prematurity (ROP) is primarily a vasoproliferative disorder affecting low birth weight premature infants. Despite advances in treatment strategies, including laser photocoagulation or anti-VEGF agents, ROP continues to be a leading cause of childhood blindness in the world due to inadequate screening [1]. Barriers to inadequate screening are many, including shortage of ophthalmologists trained and willing to screen for ROP and the lack of patient access to trained ophthalmologists in resource-poor communities.

Although binocular indirect ophthalmoscopy (BIO) by an experienced ophthalmologist remains the gold standard for staging ROP, alternative methods such as retinal imaging and telemedicine have been proposed to improve the screening for treatment-warranted ROP. Commercially available wide-field retinal photography devices, such as the RetCam (Clarity Medical Systems, Pleasanton, CA) or Phoenix Icon, have been used in several large clinical trials to evaluate the utility of retinal photography for ROP screening. For example, in the Stanford University Network for the Diagnosis of Retinopathy of Prematurity (SUNDROP) [2,3,4] initiative, 608 premature newborns requiring ROP screening were evaluated via remote retinal photography using RetCam II/III over a 6-year period. The images were interpreted by a masked ROP specialist for treatment-warranted disease. During the 6 years of the study, no cases of treatment-warranted ROP went undetected as confirmed by bedside BIO and no adverse anatomical outcomes were observed. Similarly, the e-ROP study was a multicenter clinical trial where trained non-physician imagers acquired photographs of the retina using RetCam and independent graders evaluated the retinal photographs for evidence of ROP and the image-based assessment was compared with clinical examination [5]. Of the 1257 infants screened, 19.4% infants had characteristics of referral-warranted ROP and the sensitivity of remote grading of images was 81.9% and specificity 90.1%.

These studies established the accuracy of telemedicine as a screening tool using ophthalmoscopy as the reference standard. However, many other studies have also suggested that significant variability exists in ROP grading among experienced examiners [6,7,8]. The Imaging and Informatics in ROP Research Consortium evaluated whether telemedicine or ophthalmoscopy is more accurate in diagnosing clinically significant ROP when compared to a reference standard diagnosis [9]. In this study, a reference standard diagnosis was generated by integration of the telemedicine diagnosis by 3 independent image graders and the ophthalmoscopic diagnosis. There was no difference in sensitivity between ophthalmoscopy and telemedicine for detecting type 2 or worse ROP.

These results support the use of telemedicine for the diagnosis of clinically significant ROP. Although RetCam is able to acquire high-quality, wide-field retinal photographs, there are several design features that may limit its accessibility in resource-poor communities. These include the high cost of the device, the need for an experienced photographer or technician to operate the device, and the need to make contact with the ocular surface in order to acquire an image. We have developed a novel, portable, non-contact, handheld smartphone-based retinal camera (RetinaScope) capable of capturing high-quality, wide-field fundus images [10, 11]. The use of the mobile phone platform creates a fully embedded system for acquisition, storage, and wireless transfer of images for remote evaluation. Additionally, the familiarity of users with smartphone photography technology creates an intuitive and easy-to-use system for fundus photography. In this study, we investigated the utility of RetinaScope in screening high-risk treatment-warranted ROP, compared to the gold standard of dilated fundus examination with scleral depression by indirect ophthalmoscopy. We focus on accurate detection of plus disease, defined as arterial tortuosity and venous dilation in the posterior pole, because it is a critical component of the International Classification of ROP (ICROP) [12]. The presence of plus disease is a necessary feature for threshold disease and sufficient feature for type I ROP, both of which have been shown to warrant prompt treatment with either laser photocoagulation or cryotherapy. Therefore, early and accurate diagnosis of plus disease is essential to prevent vision-threatening disease.

Methods

Ethical considerations

This study was approved by the Institutional Review Board at the University of Michigan and Sparrow Hospital, Michigan State University. All research was conducted in compliance with human subjects regulations and adhered to the tenets of the Declaration of Helsinki. This study was registered on ClinicalTrials.gov, identifier NCT03076697.

Participants and photography protocol

Infants who met the American Academy of Pediatrics and American Academy of Ophthalmology ROP screening criteria were recruited from the neonatal intensive care unit and the outpatient clinic at the C.S. Mott Children’s Hospital at the University of Michigan (Ann Arbor, Michigan) and Sparrow Hospital (Lansing, Michigan). Patients’ parents or legal guardians provided informed consent for photography. The study was conducted from May 2018 to September 2018. Each patient underwent a dilated fundus examination with scleral depression by an ophthalmologist experienced in ROP screening (PL, CB). A single drop of mydriatic and anesthetic agents consisting of phenylephrine 2.5%, cyclopentolate 0.5%, tropicamide 1%, and proparacaine 0.5% was used for dilation. An eyelid speculum was used for clinical examination and fundus photography. Following clinical examination, photographs of the retina were acquired with RetinaScope by an ophthalmology resident (TPP) and a research assistant (MTA) from standard 5 fields of view—central (posterior pole), nasal, temporal, superior, and inferior. Briefly, the RetinaScope device weighs less than a third of a kilogram and consists of a 3D-printed plastic housing containing optics for illuminating and imaging the retina using the camera of a smartphone (iPhone 5s, Apple, Cupertino, CA) [13]. The device was powered by a rechargeable lithium battery (Fig. 1A, B). The retina was imaged through a 54-diopter ophthalmic lens (Ocular Instruments OI54-A, Bellevue, WA) which forms an intermediate image that was then relayed by a 20-mm focal length achromatic lens (Edmund Optics 47661, Barrington, NJ) to the camera of an iPhone 5s. The auto-focus feature of the iPhone camera was used to correct for refractive error. Each photograph captured approximately 50° of the retina with a resolution of 52.3 pixels/retinal degree. In many cases, a continuous video recording of the fundus was acquired as the child spontaneously shifted gaze. The user then scanned through the video and selected useful frames for image grading.

Fig. 1
figure 1

(A, B) Front and back views of RetinaScope—a handheld, smartphone-based fundus photography device. The illumination and optics are compactly encased in a 3D-printed case. The retina is diffusely illuminated via an annulus of light and the retinal photograph captured using the camera of an iPhone 5s

During exams in the NICU, the nurses closely monitored infant’s vital signs, cardiopulmonary status, and oxygen saturations. If bradycardia, apnea, or other abnormalities developed during retinal imaging, the examination was halted and the NICU staff immediately stabilized the infant. The demographic data, clinical findings, and diagnoses of all participants were recorded in a Health Insurance Portability and Accountability Act (HIPAA) compliant database.

Image grading

All photographs were graded by two independent masked pediatric retina specialists who had not participated in the patients’ clinical care (VSD, CX). No additional information such as birth weight, systemic comorbidities, or postmenstrual age was provided to ensure that the graders focused only on retinal features. The images were presented in a randomized order. The graders assessed the overall photograph quality (excellent, acceptable, and not gradable) using the following criteria: a photograph was considered excellent if it was in focus and the entire posterior pole was visualized and the grader could easily discern the dilation or tortuosity of vessels. A photograph was considered acceptable if it was overexposed, underexposed, or out of focus but adequate to determine the presence or absence of vessel dilation or tortuosity. An ungradable photograph was one where the image was out of focus or obscured by glare or motion artifact.

For each photograph, graders determined the presence or absence of plus disease, defined as the presence of sufficient vascular dilation and tortuosity present in at least 2 quadrants of the eye as compared to a standard photograph. A photograph was deemed equivocal if the grader could not definitively make a determination of plus disease. Photograph was deemed “referable” if graders noted evidence of a ridge or retinal hemorrhages, regardless of location. Cohen’s kappa coefficient was computed to determine the agreement between photographic assessment and clinical assessment of plus disease, as well as intergrader agreement whether a photograph met criteria for referral.

Statistical analysis

Statistical analysis was performed using JMP version 13 (SAS Institute, Cary, NC). All variables were graphically examined for normal distributions and outliers to determine the appropriate statistical tests. Measures of central tendency and variation were used to describe the study population. Infants with plus disease were compared with non-plus cohort with respect to baseline characteristics using t test and chi-squared analysis as appropriate. Statistical significance level was set as a two-tailed test with alpha < 0.05. The graders’ assessments were compared to each other and to the gold standard clinical exam using Cohen’s kappa statistic with 95% confidence intervals (CI), which corrects the observed percentage of agreements between the raters for the effect of chance. A value of 0 implies no agreement beyond chance, whereas a value of 1 corresponds to a perfect agreement between the two graders. Clinical examination with indirect ophthalmoscopy served as the gold standard.

Results

The primary outcome measure of this study was to determine whether a handheld, portable, non-contact, smartphone-based fundus photography device can acquire photographs of diagnostic value for assessment of plus disease in premature infants undergoing ROP screening. A total of 27 premature infants (14 males, 13 females) were recruited in this study (Table 1). The mean birth weight of all patients was 838 g (standard deviation (SD) 318 g); mean gestational age was 27 weeks and 6 days (SD 3.3 weeks). All of the consented patients were included in the analysis. No patients were excluded. No patients developed bradycardia, apnea, or other abnormalities during smartphone photography requiring premature termination of the photography session.

Table 1 Patient demographics

Of the 27 infants examined, 2 patients (3 eyes) were classified as having plus disease and 5 patients (9 eyes) had ROP in zone I or II based on clinical examination by indirect ophthalmoscopy. The two infants with plus disease had a lower birth weight (645 ± 177 g) and earlier gestational age (25.7 ± 2.8 weeks) compared to the cohort of infants without plus disease.

Fundus photographs or video recordings of both eyes of each infant were acquired with RetinaScope. Average time for image acquisition was 2.2 ± 1.1 min. A single image acquired with RetinaScope captures approximately 50° of the retina. The images from different fields of view were aligned and stitched together to generate an approximately 90° wide-field montage photograph on the smartphone as described before (Fig. 2A) [11]. The overall image quality of RetinaScope was similar to commercially available RetCam3 (Fig. 2B). Representative images acquired with RetinaScope of normal, retinopathy with plus disease, and vascular abnormalities such as presence of ridge and retinal hemorrhage are shown in Fig. 3.

Fig. 2
figure 2

Comparison of wide-field montage retinal image of an infant with plus disease acquired using RetinaScope (A) and RetCam (B).

Fig. 3
figure 3

Representative photographs acquired with RetinaScope of infants undergoing ROP screening. (A, B) No posterior pole pathology. (C, D) Vessel tortuosity and dilation consistent with plus disease. (E) Ridge and (F) intraretinal hemorrhages outside the posterior pole

Image quality as assessed by the two masked graders was 2% and 5% as not gradable, 49% and 49% as acceptable, and 49% and 46% as excellent (Table 2). Images deemed “not gradable” were excluded from subsequent analysis. No infant with clinically determined plus disease was missed by the masked graders (false negative 0 for each grader). Cohen’s κ as a measure of interrater agreement between the two methods of screening for plus disease (photograph-assessment vs. indirect ophthalmoscopy examination) was 1.0 and 0.85 for the two graders, respectively. Similarly, the intergrader agreement on the presence or absence of any retinopathy in photographs was high (κ = 0.92). There were 5 and 2 eyes where the graders noted posterior pole pathology that did not meet their criteria for plus disease. Three patients had intraretinal hemorrhages in zone 2 and one patient had a ridge in zone 2. Both graders accurately noted these pathologies on photographs. Two patients had a ridge in zone 3, which was not well photographed with RetinaScope and neither grader was able to identify.

Table 2 Performance of image-based diagnosis of plus disease

Discussion

Retinopathy of prematurity screening is a global health problem. Fundus photography and remote grading by experienced ophthalmologists have the potential to drastically increase access to care for many patients in resource-poor communities. While several studies have been conducted using the RetCam system, the high cost of the device, the need for experienced photographer to operate the device, and the need to make contact with the ocular surface to acquire an image may limit its accessibility in resource-poor communities. We have developed a portable, cost-effective, smartphone-based fundus imaging device capable of acquiring diagnostic quality retinal photographs for screening of plus ROP disease in prematurely born infants. Prior work demonstrated the use of Volk Pictor device (Volk Optical, Mentor, OH) for ROP screening [14]. Our work adds to the growing body of literature on the feasibility and utility of portable non-contact fundus photography devices for ROP screening. Although RetinaScope has a smaller field of view (50°), compared to RetCam (130°), the view afforded by RetinaScope was adequate to evaluate the posterior pole for the presence of vessel dilation and tortuosity consistent with plus disease. There may be instances where abnormal vessel morphology outside the posterior pole, especially near a ridge, may meet criteria for plus disease for a clinical examiner. These cases would be missed by a grader assessing vessel morphology in the posterior pole. However, we did not encounter this scenario in this small study sample. Furthermore, a wider field image can be generated by montaging retinal photographs from different fields of view which allowed assessment of hemorrhages or ridge beyond the posterior pole. Due to worse image quality of the peripheral retina, we did not attempt to localize the transition between avascular and vascularized retina.

Graders were asked to determine whether posterior pole vessel morphology appeared abnormal but did not meet their definition of plus disease; these images were labeled “equivocal.” In a screening model, these infants would require a clinical examination by an ophthalmologist. The presence of posterior pole vessel pathology has been associated with the development of treatment-requiring ROP [15] and similarly, normal posterior pole vessel morphology is a reliable marker for the absence of stage 3 ROP [16].

Smartphone technology has advanced significantly with better cameras, faster processors, and intuitive easy-to-use software design. The availability of smartphones has increased in both developed and developing countries, which makes smartphone-based screening for ophthalmic diseases a particularly attractive target. Preventable causes of blindness, such as cataract, glaucoma, diabetic retinopathy, and ROP may benefit from the use of teleophthalmology with smartphones [17, 18]. In addition, the health care cost of ROP screening for infants born at rural and regional hospitals is exceedingly high [19], while telemedicine-based remote screening has been shown to be a more cost-effective option [20, 21]. To our knowledge, this is the first study to report the feasibility and accuracy of screening for plus ROP using smartphone-based fundus photography.

The number of ophthalmologists willing and able to manage ROP is insufficient for a number of reasons, including low reimbursement [22], high liability, and specialized training required to properly screen ROP [23]. Meanwhile, the incidence of ROP worldwide is rising because of advances in neonatology [24]. Fraught with these challenges, there is a need to develop quantitative, reliable, affordable, and easy to use approach to ROP diagnosis [25]. Additionally, clinical diagnosis of plus disease is highly variable, and there is a high interobserver variability on the diagnosis even among ROP experts [7, 26]. This is because the diagnosis of plus disease is based on interpretation of venous dilation and arteriolar tortuosity, which are both continuous variables. The grader relies on an internal threshold to transform these continuous variables into a categorical outcome. Different graders may have a variable operating threshold for what they consider to be plus disease and may focus on different pathologic features or have different interpretations of the same features. Imaging and automated analysis will play a critical role in the diagnosis of ROP. Pour et al. described an algorithm for automated assessment of plus disease by mathematically computing vessel curvature and tortuosity [27]. Recently, Brown et al. [28] reported a deep learning–based algorithm for automated detection of plus disease using retinal images from premature infants with ROP. Their algorithm diagnosed plus disease with comparable or better proficiency than ROP experts. Incorporation of such automated system into a smartphone-based photography device could provide advice at the point of care and has the potential to improve the quality, cost, and accessibility of ROP screening.