Login / Signup

Test retest variability in stereoacuity measurements.

Jignasa MehtaAnna R O'Connor
Published in: Strabismus (2023)
Background : A clinician's choice of stereotest is influenced by the robustness of the measurement, in terms of sensitivity, specificity and test-retest variability. In relation to the latter aspect, there are limited data on the test-retest variability of these new tests and how they compare to the more commonly used stereotests. Therefore, the aim of the study was to determine the test-retest variability of four different measures of stereoacuity (TNO, Frisby, Lang Stereopad and Asteroid (Accurate STEReotest On a mobIle Device)) and to compare the stereoacuity measurements between the tests in an adult population. Methods : Stereoacuity was measured twice using TNO, Frisby, Lang Stereopad and Asteroid. Inclusion criteria included adult participants (18 years and older), no known ophthalmic condition and VA (Visual Acuity) equal to or better than 0.3 logMAR (Logarithm of the Minimum Angle of Resolution) with interocular difference of less than 0.2 logMAR. Bland-Altman analysis was used to assess agreement within and between stereotests. Differences in stereo thresholds were compared using signed Wilcoxon tests. Results : Fifty-four adults (male: 23 and female: 31) with VA equal to or better than 0.3 logMAR in either eye and interocular difference less than 0.2 logMAR were assessed (mean age: 38 years, SD: 12.7, range: 18-72). The test-retest variability of all the clinical stereotests, with the exception of the Lang Stereopad ( p  = .03, Wilcoxon signed-rank test), was clinically insignificant as the mean bias was equal or less than 0.06 log seconds of arc (equivalent to 1.15 seconds of arc). While the Asteroid test had the smallest variation between repeated measures (mean bias: -0.01 log seconds of arc), the Frisby and Lang Stereopad tests had the narrowest and widest limits of agreement respectively. When comparing results between tests, the biggest mean bias was between Frisby and Lang Stereopad (-0.62 log seconds of arc), and 64.8% and 31.5% of differences were in the medium (21-100" of arc) and larger (>100" of arc) ranges respectively. Conclusion : The TNO and Frisby tests have good reliability but measure stereoacuity over a narrower range compared to the Asteroid which shows less variation on repeated testing but has a larger testing range. The data reported here show varying degrees of agreement in a cohort of visually normal participants, and further investigation is required to determine if there is further variability when stereoacuity is reduced.
Keyphrases
  • electronic health record
  • physical activity
  • artificial intelligence
  • deep learning