Using sequences of life-events to predict human lives.
Germans SavcisensTina Eliassi-RadLars Kai HansenLaust Hvas MortensenLau LilleholtAnna RogersIngo ZettlerSune LehmannPublished in: Nature computational science (2023)
Here we represent human lives in a way that shares structural similarity to language, and we exploit this similarity to adapt natural language processing techniques to examine the evolution and predictability of human lives based on detailed event sequences. We do this by drawing on a comprehensive registry dataset, which is available for Denmark across several years, and that includes information about life-events related to health, education, occupation, income, address and working hours, recorded with day-to-day resolution. We create embeddings of life-events in a single vector space, showing that this embedding space is robust and highly structured. Our models allow us to predict diverse outcomes ranging from early mortality to personality nuances, outperforming state-of-the-art models by a wide margin. Using methods for interpreting deep learning models, we probe the algorithm to understand the factors that enable our predictions. Our framework allows researchers to discover potential mechanisms that impact life outcomes as well as the associated possibilities for personalized interventions.
Keyphrases
- endothelial cells
- deep learning
- healthcare
- induced pluripotent stem cells
- public health
- pluripotent stem cells
- autism spectrum disorder
- physical activity
- mental health
- cardiovascular disease
- type diabetes
- coronary artery disease
- cardiovascular events
- metabolic syndrome
- convolutional neural network
- risk factors
- health information
- single molecule
- artificial intelligence
- human health
- glycemic control