Training confounder-free deep learning models for medical applications.
Qingyu ZhaoEhsan AdeliKilian M PohlPublished in: Nature communications (2020)
The presence of confounding effects (or biases) is one of the most critical challenges in using deep learning to advance discovery in medical imaging studies. Confounders affect the relationship between input data (e.g., brain MRIs) and output variables (e.g., diagnosis). Improper modeling of those relationships often results in spurious and biased associations. Traditional machine learning and statistical models minimize the impact of confounders by, for example, matching data sets, stratifying data, or residualizing imaging measurements. Alternative strategies are needed for state-of-the-art deep learning models that use end-to-end training to automatically extract informative features from large set of images. In this article, we introduce an end-to-end approach for deriving features invariant to confounding factors while accounting for intrinsic correlations between the confounder(s) and prediction outcome. The method does so by exploiting concepts from traditional statistical methods and recent fair machine learning schemes. We evaluate the method on predicting the diagnosis of HIV solely from Magnetic Resonance Images (MRIs), identifying morphological sex differences in adolescence from those of the National Consortium on Alcohol and Neurodevelopment in Adolescence (NCANDA), and determining the bone age from X-ray images of children. The results show that our method can accurately predict while reducing biases associated with confounders. The code is available at https://github.com/qingyuzhao/br-net .
Keyphrases
- deep learning
- machine learning
- artificial intelligence
- big data
- convolutional neural network
- magnetic resonance
- high resolution
- electronic health record
- healthcare
- depressive symptoms
- hiv positive
- antiretroviral therapy
- small molecule
- human immunodeficiency virus
- oxidative stress
- hepatitis c virus
- resting state
- virtual reality
- data analysis
- multiple sclerosis
- magnetic resonance imaging
- computed tomography
- hiv infected
- hiv aids
- hiv testing
- functional connectivity
- high throughput
- men who have sex with men
- soft tissue