Fairness and generalisability in deep learning of retinopathy of prematurity screening algorithms: a literature review.
Luis Filipe Nakayama, William Greig Mitchell, Lucas Zago Ribeiro, Robyn Gayle Dychiao, Warachaya Phanphruk, Leo Anthony Celi, Khumbo Kalua, Alvina Pauline Dy Santiago, Caio Vinicius Saito Regatieri, Nilva Simeren Bueno Moraes
Published in: BMJ Open Ophthalmology (2023)
The reviewed articles included 180 228 images and reported good performance metrics, but fairness, generalisability and bias control remained limited. Reproducibility is also a critical limitation: few articles shared code and none shared data. Fair and generalisable studies of AI for retinopathy of prematurity (ROP) are needed, with diverse datasets, data and code sharing, collaborative research, and bias control, to avoid unpredictable and harmful deployments.