Benchmarking the generalizability of brain age models: Challenges posed by scanner variance and prediction bias.
Robert J JirsaraieTobias KaufmannVishnu BashyamGuray ErusJoan L LubyLars Tjelta WestlyeChristos DavatzikosDeanna M BarchAristeidis SotirasPublished in: Human brain mapping (2022)
Machine learning has been increasingly applied to neuroimaging data to predict age, deriving a personalized biomarker with potential clinical applications. The scientific and clinical value of these models depends on their applicability to independently acquired scans from diverse sources. Accordingly, we evaluated the generalizability of two brain age models that were trained across the lifespan by applying them to three distinct early-life samples with participants aged 8-22 years. These models were chosen based on the size and diversity of their training data, but they also differed greatly in their processing methods and predictive algorithms. Specifically, one brain age model was built by applying gradient tree boosting (GTB) to extracted features of cortical thickness, surface area, and brain volume. The other model applied a 2D convolutional neural network (DBN) to minimally preprocessed slices of T1-weighted scans. Additional model variants were created to understand how generalizability changed when each model was trained with data that became more similar to the test samples in terms of age and acquisition protocols. Our results illustrated numerous trade-offs. The GTB predictions were relatively more accurate overall and yielded more reliable predictions when applied to lower quality scans. In contrast, the DBN displayed the most utility in detecting associations between brain age gaps and cognitive functioning. Broadly speaking, the largest limitations affecting generalizability were acquisition protocol differences and biased brain age estimates. If such confounds could eventually be removed without post-hoc corrections, brain age predictions may have greater utility as personalized biomarkers of healthy aging.
Keyphrases
- resting state
- machine learning
- white matter
- functional connectivity
- computed tomography
- cerebral ischemia
- multiple sclerosis
- magnetic resonance
- convolutional neural network
- early life
- big data
- randomized controlled trial
- magnetic resonance imaging
- deep learning
- risk assessment
- gene expression
- body composition
- resistance training
- drinking water
- artificial intelligence
- mass spectrometry
- human health
- genome wide
- data analysis
- high intensity
- image quality