Abstract
Purpose
To assess the reliability of a tear film (TF) viscosity video grading system.
Methods
Thirty-four dynamic TF viscosity videos were obtained by a clinically available TF analyzer and objectively sorted according to the movement speed of three arbitrary reflective light particles. A 4-grade system was constructed on a specially designed window for simultaneous comparison with the three standard videos. Two masked graders were invited to grade these videos under a randomized procedure. Observer reliabilities were determined by Spearman’s correlation analysis and Bland-Altman agreement analysis.
Results
For this four-grade system, the intra-observer correlation was very strong in the two graders (ρ = 0.96 and 0.82; both P < 0.0001). However, the inter-observer correlation showed moderate strength in normal playback speed (ρ = 0.53, P = 0.002 and ρ = 0.52, P = 0.003 for 1st and 2nd gradings, respectively). In slower playback videos, the inter-observer correlation of the two graders was higher (ρ = 0.70 and P < 0.0001) when reduced to 0.8-times playback speed. Moreover, the 0.8-times mode was also significantly better than normal playback mode (P = 0.0204) in terms of inter-observer agreement.
Conclusions
The dynamic 4-grade system has an excellent intra-observer reliability and a good inter-observer reliability under 0.8-times playback speed. The grading system established in this study provides a promising solution for rapidly determining the level of TF viscosity.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Tear film (TF) acts as a buffering interface between the outer environment and ocular surface and is essential for lubricating and protecting the ocular surface [1]. Loss of TF homeostasis is the core pathophysiological mechanism of dry eye disease (DED), which is highly prevalent and multifactorial [2]. The DED should meet a symptomatic survey (e.g., Ocular Surface Disease Index) and one of the three homeostasis markers of TF: noninvasive tear breakup time (NITBUT), TF osmolarity, and ocular surface staining [2]. However, only NIKBUT could really assess the dynamic expression of TF homeostasis, but it assesses in an unnatural way in which the subject must forcefully keep his or her eyes open for at least 10 s. Children, in particular, often fails to complete this test. Thus, a better method is needed for evaluating TF homeostasis in children and other populations.
We previously proposed a novel model using TF viscosity examination to determine the dynamic TF homeostasis based on a commercialized TF analyzer (Keratograph® 5M, Oculus) [3]. This examination determined the TF viscosity by tracking the reflective light particles spreading on the cornea, of which the speed of reflective light particle was inversely proportional to the TF viscosity. This method was initially proposed by Owens and Phillips [4] to evaluate TF viscosity of DED and applied alternatively by Varikooty [5] to evaluate the hydrodynamics of pre-contact lens TF, both of which were assessed with a self-made slit lamp camera platform. It possessed noninvasive and dynamic evaluation properties and therefore could naturally reflect the TF homeostasis with the least interference. However, obtaining the results using this examination was a tedious process that involves decomposing videos to image frames, tracking the position of light particle for each image frame, and estimating the movement speed of light particle from these annotated coordinates. Consequently, it was preferred for research purposes over routine clinical practices.
A grading system based on a set of standard images was adopted in several clinical tests, including conjunctival bulbar redness, corneal epithelial dye staining, meibography, and tear ferning test [6,7,8,9]. These have provided objective assessment in our clinical practice for the diagnosis and follow-ups of ocular surface diseases. Hence, in order to make the TF viscosity test applicable in clinical practices, a grading approach may be a solution. However, there was no dynamic grading systems proposed previously.
Thus, we aimed to validate our newly proposed dynamic 4-grade system on TF homeostasis with the focus on myopic children and to evaluate the clinical feasibility of this test as a routine examination.
Methods
Participants
Dynamic TF videos were obtained from 34 children who participated in a prospective observational study for investigating the tear film stability in children asking for myopic assessment and treatment between December 2017 and May 2018 in Kaohsiung Chang Gung Memorial Hospital (CGMH). All procedures involving human participants adhered to the tenets of the Declaration of Helsinki and the ARVO statement on human subjects. Institutional review board/ethics committee approval (No. 201701393B0) was obtained from the committee of medical ethics and human experiments of CGMH. These participants were healthy individuals aged from 6 to 15 years old without ocular or systemic diseases. All participants and their legal representatives were clearly informed about the aim and procedure of this study and signed the informed consent. Each subject received complete ocular examination and dynamic TF examinations by masked examiners.
Examination of the dynamic TF viscosity
The right eye of each subject was selected for the TF viscosity examination. Before the examination, each subject was instructed to blink naturally during examination. With a TF analyzer (Keratograph® 5M, Oculus), the moving reflective light particles were observed under two illumination sources of white diode. At the beginning of this examination, the examiner focused on the anterior corneal surface under × 1.4 magnification to make mobile reflective light particles clearly visible on the TF. The recorder was then started to obtain a dynamic TF viscosity video. Generally, three to four blink cycles were recorded in a 10-s video record.
Determination of the objective moving speed of reflective light particles
The movement speeds of reflective light particles were determined according to our previous study [3]. In brief, the first qualified blink cycle was adopted for assessment, and the trajectories of particles were analyzed by decomposed image frames with a time interval of 0.1 s. The qualified sequence was defined as clearly visible TF particles under stable and undistorted superior and inferior reflective ellipses. By tracking three reflective light particles on the cornea starting from opening cycle of the qualified blink for 1 s, the movement speeds of three reflective light particles were averaged to obtain the momentary movement speed (MMS) of each subject. A power-law fitting operation MMS (t) = α × t−β was used to obtain the objective movement speed of reflective light particles, MMS (0.5), for each subject. The 34 videos from 34 subjects were then sorted from fastest to slowest objective movement speed for the purpose of establishing the subjective grading system in the section below.
Subjective grading system for the TF viscosity examination
The first, second, and third quartiles (Q1, Q2, and Q3) of the sorted videos were adopted as standard videos for comparison during subjective grading. This 4-grade system was constructed on a specially designed window environment (application video viewer in the Supplementary file 1 and instructions for use in the Supplementary file 2) for simultaneous comparison (demonstration in the Supplementary file 3) with the three standard videos (Supplementary file 4–6). Subjects were labeled as grades 1, 2, and 3 (lowest TF viscosity) if their particles moved faster than that of the Q1, Q2, and Q3 videos, respectively. Subjects were labeled as grade 4 if their particles moved slower than the Q3 (Fig. 1). The remaining videos (n = 31) were rearranged under a randomized procedure. After instructing the above grading rule and establishing the grading environment, two masked graders who were novel to this study were recruited to determine the grading of TF viscosity for each subject. After 2 weeks, the videos were regraded by the same masked graders after rearrangement of the video sequence under the same randomized procedure. In addition, the two graders were asked to regrade the same videos under slower playback speeds of 0.8-, 0.5-, and 0.4-times after the same rearrangement procedure on 4th–6th, 8th–10th, and 12th–14th weeks, respectively.
Statistical analysis
The objective movement speeds of reflective light particles, MMS (0.5), were estimated by the Microsoft Excel 2016, where the add-in program, Solver, was used to perform the power-law fitting and extract the model parameters α and β. The free online statistical software (Social Science Statistics; https://www.socscistatistics.com/) was used to calculate Spearman’s correlation coefficient ρ, which revealed the intra- and inter-observer reliabilities by correlating the grading scores of the same video for each subject. In addition, the Bland-Altman plot was used to determine the mean deviation and 95% limits of agreement (LoA) between two observers. We chose the Bland-Altman plot to directly visualize both the magnitude of bias and 95% limits of agreement between the two methods. The free online software GraphPad QuickCalcs was used to perform Fisher exact test for determining the statistical improvement of intra- and inter-observer grading concordance in lower playback speeds. P < 0.05 was recognized statistically significant.
Result
The ocular surface performance of the participants
The 34 dynamic TF viscosity videos were obtained from 34 children. The basic characteristics and representative ocular surface parameters of the participant are shown in Table 1. There were no significant differences between boys and girls in the tear meniscus height, bulbar redness, and tear breakup time. In addition, there was no correlation among age and the ocular surface parameters of tear meniscus height, bulbar redness, and tear breakup time.
Intra-observer reliability in normal playback speed of the TF viscosity video
After eliciting three videos as the grading standards, the Spearman correlation coefficients between first grading scores and second grading scores of 31 remaining TF viscosity videos of both graders (ρ = 0.96, P value < 0.001 for grader no. 1, and ρ = 0.82, p value < 0.001 for grader no. 2, respectively) were very strong (Fig. 2a, b). All videos were graded within the upper and lower limits of the Bland-Altman plot, and therefore, a high intra-observer agreement was concluded (Fig. 2c, d). Although both graders had good intra-observer reliability, they had discrepant grading performance. Grader no. 1 was apparently more consistent than grader no. 2.
Inter-observer reliability in normal playback speed of the TF viscosity video
For the 31 TF viscosity videos, the grading scores obtained from the two graders showed significantly positive correlation with Spearman’s correlation coefficient and good inter-observer agreement with Bland-Altman plot (Fig. 3). However, the inter-grader correlation was moderate and consistent in the first and second gradings (ρ = 0.53 and 0.52, respectively) in the normal playback speed (Fig. 3a, b). Only one video was outside the agreement limits of the Bland-Altman plot in the first and second gradings (Fig. 3c, d).
Intra- and inter-observer reliabilities in slower playback speed of the TF viscosity video
Intra- and inter-observer reliabilities were reassessed by using slower playback modes. For the intra-observer correlation, the grading by grader no. 1 did not change at slower playback speeds (Fig. 4a, d, g), while that of grader no. 2 had stronger correlation at 0.8-times but was decreased at 0.5-times and 0.4-times playback speeds (Fig. 4b, e, h). The inter-observer correlation of the two graders was increased at 0.8- and 0.5-times playback speeds but decreased at 0.4-times playback speed (Fig. 4c, f, i).
Although there was no significant intra-observer difference in different playback modes, grader no. 1 showed a relation between decreased playback speeding and increased consistency, of which the absolute values of MD approached zero and the amplitudes of LoA were less than that of the normal playback mode (Fig. 5). As for grader no. 2, the intra-observer agreement was significantly increased (P = 0.0485) at the 0.8-times mode, whereas no significant improvement was reached at both the 0.5-times and 0.4-times modes. Moreover, the inter-observer agreement was also significantly improved (P = 0.0204) in the 0.8-times mode instead of the two slower modes. Therefore, the 0.8-times playback mode had the best intra- and inter-observer reliability and is recommended as a routine grading setting for this video grading system.
Discussion
Currently available TF tests cannot guarantee a comprehensive determination for TF homeostasis. In addition, present modalities for monitoring TF homeostasis may not be suitable for children due to their poor cooperation during tear breakup time measurements and tear collection for osmolality analysis. Consequently, we proposed a 4-grade system to rapidly determine the level of TF viscosity of a subject by an examiner through the comparison of TF particles with three standard videos under a specially designed window environment. We found this novel grading system to be highly reliable in both intra- and inter-observer tests especially under 0.8-times playback mode, of which we recommend for routine clinical practice.
We found discrepant performance between graders using intra-observer reliability test (Fig. 2). Tatsuto et al. found that visual motion perception is different in each individual [10]. There are some factors that may influence motion perception. For example, the attention and age would influence motion perception. Massimo et al. revealed that the perceived speed of moving object is directly proportional to a person’s visual attention [11]. Moreover, previous studies showed that the ability to detect fast visual motion develops earlier in life than that of the slower visual motion [12]. The state of birth also affects the processing of visual information. The prospective control study showed that pre-term infants had poor visual perception [13]. The above evidences may be the reasons that two graders have discrepant intra-observer reliabilities in this video grading system, which may also lead to the moderate inter-observer reliability (Fig. 3). Though the inter-observer correlation showed moderate strength in normal playback speed, we could improve the strength from moderate to strong using slower playback speed of 0.8-times. We could utilize the slower playback function in this specially designed window environment to advance the inter-observer reliability to make this method more clinically dependable and available.
To minimize the difference of visual motion perception in graders, we compared the grading reliabilities of the two graders in decelerated video. Grader no. 1 had very strong intra-observer reliability and the correlation was slightly increased when the playback speed was reduced (Fig. 4a, d, g). However, grader no. 2 had inconsistent intra-grader reliabilities, which were improved in 0.8-times playback modes (Fig. 4b) but worse in 0.5- and 0.4-times playback modes (Fig. 4e–h). The possible reason is that the decrease in video speed is associated with worsening of video resolution, resulting in blurred light particles that are more difficult to track. Above all, clinically, we could do the repeatability analysis first to find better graders to perform this examination and use slower playback speeds of 0.8-times to advance inter-observer reliability.
The reliability of this dynamic grading system was highly dependent on each individual’s visual motion perception, which possibly led to discrepancies in intra- and inter-observer agreements. However, both intra- and inter-observer agreements improved under 0.8-times playback speed. For graders with weaker visual motion perception, the specially designed window environment could provide a customized adjustment to improve their grading reliability. There was currently no agreement on the average movement speed of light particles to represent TF viscosity. The MMS (0.5) was used as the standard light particle speed in this study for sorting all TF viscosity videos and selecting the three standard videos to determine the grading of TF viscosity in orthokeratology wearers. For subjects with other ocular surface disease, such as DED, the optimal speed of TF viscosity videos may need to be adjusted accordingly. However, this grading system provides a highly flexible environment to adopt new standard videos for grading subjects with different ocular surface disorders.
Diagnosing dry eye disease or evaluating TF homeostasis in children is particularly challenging due to children’s poor cooperation during examinations [14]. Thus, a novel and clinically feasible technique to easily evaluate TF homeostasis is needed for children with myopia for pre-treatment assessment and long-term monitoring. Our newly proposed grading system can rapidly determine the level of TF viscosity. We believe this system can assist eye care practitioners in the assessment of TF homeostasis for myopic children in routine clinical settings.
One of the potential limitations is that we chose the right eye of each subject for the TF viscosity examination in this study. Many studies investigating dry eye disease chose unilateral eye for analysis and drew conclusions based on monocular results [15, 16]. It may be argued that the evaluation of only one eye per subject may result in a potential waste of information or cause a bias due to non-random selection. However, inclusion of both eyes may also cause overlapping results due to high similarity of both eyes in many cases [17]. We previously found that the assessment of tear film stability of the first eye could influence that of the fellow eye [18]. Thus, we ultimately decided to choose the right eye, which is the first eye that we routinely examine. Second, the participants were all children aged from 6 to 15 years, which may have different TF presentation compared to the adults. Third, the participants were all children for myopic assessment and treatment, which results in selection bias. In conclusion, the grading system established in this study provides a promising solution for rapidly determining the level of TF viscosity in myopic children necessitating interventions. This system has an excellent intra-observer reliability and a good inter-observer reliability under 0.8-times playback speed. We believe it can be used as one of the routine examinations for assessing the TF homeostasis in myopic children.
Data availability
Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.
Code availability
Not applicable.
References
Gipson IK (2007) The ocular surface: the challenge to enable and protect vision: the Friedenwald lecture. Invest Ophthalmol Vis Sci 48(4390):4391–4398. https://doi.org/10.1167/iovs.07-0770
Craig JP, Nichols KK, Akpek EK, Caffery B, Dua HS, Joo CK, Liu Z, Nelson JD, Nichols JJ, Tsubota K, Stapleton F (2017) TFOS DEWS II definition and classification report. Ocul Surf 15:276–283. https://doi.org/10.1016/j.jtos.2017.05.008
Lai HY, Kuo MT, Fang PC, Lin CC, Chien CC, Cho WH, Chen A, Lai IC (2019) Tracking the reflective light particles spreading on the cornea: an emerging assessment for tear film homeostasis. Transl Vis Sci Technol 8:32. https://doi.org/10.1167/tvst.8.3.32
Owens H, Phillips J (2001) Spreading of the tears after a blink: velocity and stabilization time in healthy eyes. Cornea 20:484–487
Varikooty J, Keir N, Simpson T (2012) Estimating tear film spread and stability through tear hydrodynamics. Optom Vis Sci 89:E1119–E1124. https://doi.org/10.1097/OPX.0b013e3182644cb7
Macchi I, Bunya VY, Massaro-Giordano M, Stone RA, Maguire MG, Zheng Y, Chen M, Gee J, Smith E, Daniel E (2018) A new scale for the assessment of conjunctival bulbar redness. Ocul Surf 16:436–440. https://doi.org/10.1016/j.jtos.2018.06.003
Schiffman RM, Christianson MD, Jacobsen G, Hirsch JD, Reis BL (2000) Reliability and validity of the ocular surface disease index. Arch Ophthalmol 118:615–621. https://doi.org/10.1001/archopht.118.5.615
Arita R (2018) Meibography: a Japanese perspective. Invest Ophthalmol Vis Sci 59:DES48–DES55. https://doi.org/10.1167/iovs.17-23631
Masmali AM, Purslow C, Murphy PJ (2014) The tear ferning test: a simple clinical technique to evaluate the ocular tear film. Clin Exp Optom 97:399–406. https://doi.org/10.1111/cxo.12160
Takeuchi T, Yoshimoto S, Shimada Y, Kochiyama T, Kondo HM (2017) Individual differences in visual motion perception and neurotransmitter concentrations in the human brain. Philos Trans R Soc Lond B Biol Sci 372:20160111. https://doi.org/10.1098/rstb.2016.0111
Turatto M, Vescovi M, Valsecchi M (2007) Attention makes moving objects be perceived to move faster. Vision Res 47:166–178. https://doi.org/10.1016/j.visres.2006.10.002
Manning C, Aagten-Murphy D, Pellicano E (2012) The development of speed discrimination abilities. Vision Res 70:27–33. https://doi.org/10.1016/j.visres.2012.08.004
Agyei SB, van der Weel FR, van der Meer ALH (2016) Development of visual motion perception for prospective control: brain and behavioral studies in infants. Front Psycho 7:100. https://doi.org/10.3389/fpsyg.2016.00100
Mac Cord Medina F, Silvestre de Castro R, Leite SC, Rocha EM, de Melo RG (2007) Management of dry eye related to systemic diseases in childhood and longterm follow-up. Acta Ophthalmol Scand 85:739–744. https://doi.org/10.1111/j.1600-0420.2007.00934.x
Arita R, Mizoguchi T, Kawashima M, Fukuoka S, Koh S, Shirakawa R, Suzuki T, Morishige N (2019) Meibomian gland dysfunction and dry eye are similar but different based on a population-based study: the Hirado-Takushima study in Japan. Am J Ophthalmol. 207:410–418. https://doi.org/10.1016/j.ajo.2019.02.024
Vu CHV, Kawashima M, Yamada M, Suwaki K, Uchino M, Shigeyasu C et al (2018) Influence of meibomian gland dysfunction and friction-related disease on the severity of dry eye. Ophthalmology 125:1181–1188. https://doi.org/10.1016/j.ophtha.2018.01.025
Murdoch IE, Morris SS, Cousens SN (1998) People and eyes: statistical approaches in ophthalmology. Br J Ophthalmol. 82:971–973. https://doi.org/10.1136/bjo.82.8.971
Kuo MT, Fang PC, Kuo SF, Chen A, Huang YT (2021) Tear proteomics study of dry eye disease: which eye do you adopt as the representative eye for the study? Int J Mol Sci. 22:422. https://doi.org/10.3390/ijms22010422
Acknowledgements
The authors thank Ming-En Hsieh, Institute of Data Science and Engineering, and Yi-Hsuan Lin, Institute of Computer Science and Engineering, Department of Computer Science, National Chiao Tung University, for their programming assistance in establishing the grading environment.
Funding
This work was supported by Chang Gung Research Proposal (CMRPG8J1091, CMRPG8K0111). The sponsors or funding organizations had no role in the design or conduct of this research.
Author information
Authors and Affiliations
Contributions
Conceptualization, MTK; methodology, HYL and MTK; validation, PCF and AC; formal analysis, HYL; investigation, HYL and PCF; resources, PCF and MTK; data curation, HYL; writing—original draft preparation, HYL; writing—review and editing, MTK and AC; visualization, HYL and AC; supervision, MTK; project administration, MTK; funding acquisition, MTK
Corresponding author
Ethics declarations
Ethics approval
Institutional review board/ethics committee approval (no. 201701393B0) was obtained from the committee of medical ethics and human experiments of CGMH.
Consent to participate
All participants and their legal representatives were clearly informed about the aim and procedure of this study and signed the informed consent.
Consent for publication
All authors have read and agreed to the published version of the manuscript.
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original version of this article was revised. ESM 1 is now corrected.
Supplementary information
Supplementary file 1
A simple grading application, video viewer. (PDF 228 kb)
Supplementary file 2
Operating instructions for the grading software. (PDF 316 kb)
Supplementary file 3
Demonstration of the grading scenario, in which the upper row of videos were video standards Q1, Q2, and Q3 (from left to right) and the lower row of videos were three copies of a graded video. (MP4 5400 kb)
Supplementary file 4
The Q1 standard video. (MP4 664 kb)
Supplementary file 5
The Q2 standard video. (MP4 567 kb)
Supplementary file 6
The Q3 standard video. (MP4 645 kb)
Supplementary 7
The Bland-Altman plots for analyzing intra- and inter-observer agreement of the tear film viscosity grading system at different playback speeds. (PNG 1111 kb)
Rights and permissions
About this article
Cite this article
Lai, HY., Fang, PC., Chen, A. et al. Grading reliability of the tear film viscosity examination. Graefes Arch Clin Exp Ophthalmol 259, 2287–2294 (2021). https://doi.org/10.1007/s00417-021-05176-2
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00417-021-05176-2