Data infrastructures for AI in medical imaging: a report on the experiences of five EU projects.
Haridimos KondylakisVarvara KalokyriStelios SfakianakisKostas MariasManolis TsiknakisAna Jimenez-PastorEduardo Camacho-RamosIgnacio BlanquerJ Damian SegrellesSergio López-HuguetCaroline BarelleMagdalena Kogut-CzarkowskaGianna TsakouNikolaos SiopisZisis SakellariouPaschalis BizopoulosVicky DrossouAntonios LalasKonstantinos VotisPedro MallolLuis Marti-BonmatiLeonor Cerdá AlberichKarine SeymourSamuel BoucherEsther CiarrocchiLauren FromontJordi RamblaAlexander HarmsAndrea GutierrezMartijn P A StarmansFred PriorJosep Ll GelpiKarim LekadirPublished in: European radiology experimental (2023)
Artificial intelligence (AI) is transforming the field of medical imaging and has the potential to bring medicine from the era of 'sick-care' to the era of healthcare and prevention. The development of AI requires access to large, complete, and harmonized real-world datasets, representative of the population, and disease diversity. However, to date, efforts are fragmented, based on single-institution, size-limited, and annotation-limited datasets. Available public datasets (e.g., The Cancer Imaging Archive, TCIA, USA) are limited in scope, making model generalizability really difficult. In this direction, five European Union projects are currently working on the development of big data infrastructures that will enable European, ethically and General Data Protection Regulation-compliant, quality-controlled, cancer-related, medical imaging platforms, in which both large-scale data and AI algorithms will coexist. The vision is to create sustainable AI cloud-based platforms for the development, implementation, verification, and validation of trustable, usable, and reliable AI models for addressing specific unmet needs regarding cancer care provision. In this paper, we present an overview of the development efforts highlighting challenges and approaches selected providing valuable feedback to future attempts in the area.Key points• Artificial intelligence models for health imaging require access to large amounts of harmonized imaging data and metadata.• Main infrastructures adopted either collect centrally anonymized data or enable access to pseudonymized distributed data.• Developing a common data model for storing all relevant information is a challenge.• Trust of data providers in data sharing initiatives is essential.• An online European Union meta-tool-repository is a necessity minimizing effort duplication for the various projects in the area.