Crowdsourcing Skin Demarcations of Chronic Graft-Versus-Host Disease in Patient Photographs: Training Versus Performance Study.
Andrew J McNeilKelsey ParksXiaoqi LiuBohan JiangJoseph CocoKira McCoolDaniel FabbriErik P DuhaimeBenoit M DawantEric R TkaczykPublished in: JMIR dermatology (2023)
Crowds of nonexpert raters can demarcate cGVHD images with good overall performance. Tracking the top 5 most reliable raters provided optimal results, obtaining the best performance with the lowest number of expert demarcations required for adequate training. However, the agreement amongst individual nonexperts does not help predict whether the crowd has provided an accurate result. Future work should explore the performance of crowdsourcing in standard clinical photos and further methods to estimate the reliability of consensus demarcations.