Using Wikipedia to learn semantic feature representations of concrete concepts in neuroimaging experiments.
Francisco PereiraMatthew BotvinickGreg DetrePublished in: Artificial intelligence (2012)
In this paper we show that a corpus of a few thousand Wikipedia articles about concrete or visualizable concepts can be used to produce a low-dimensional semantic feature representation of those concepts. The purpose of such a representation is to serve as a model of the mental context of a subject during functional magnetic resonance imaging (fMRI) experiments. A recent study [19] showed that it was possible to predict fMRI data acquired while subjects thought about a concrete concept, given a representation of those concepts in terms of semantic features obtained with human supervision. We use topic models on our corpus to learn semantic features from text in an unsupervised manner, and show that those features can outperform those in [19] in demanding 12-way and 60-way classification tasks. We also show that these features can be used to uncover similarity relations in brain activation for different concepts which parallel those relations in behavioral data from human subjects.
Keyphrases
- machine learning
- resting state
- magnetic resonance imaging
- endothelial cells
- functional connectivity
- deep learning
- electronic health record
- induced pluripotent stem cells
- neural network
- pluripotent stem cells
- computed tomography
- mental health
- data analysis
- magnetic resonance
- cerebral ischemia
- brain injury
- smoking cessation