Using sequences of life-events to predict human lives

Research output: Contribution to journalJournal articleResearchpeer-review

Here we represent human lives in a way that shares structural similarity to language, and we exploit this similarity to adapt natural language processing techniques to examine the evolution and predictability of human lives based on detailed event sequences. We do this by drawing on a comprehensive registry dataset, which is available for Denmark across several years, and that includes information about life-events related to health, education, occupation, income, address and working hours, recorded with day-to-day resolution. We create embeddings of life-events in a single vector space, showing that this embedding space is robust and highly structured. Our models allow us to predict diverse outcomes ranging from early mortality to personality nuances, outperforming state-of-the-art models by a wide margin. Using methods for interpreting deep learning models, we probe the algorithm to understand the factors that enable our predictions. Our framework allows researchers to discover potential mechanisms that impact life outcomes as well as the associated possibilities for personalized interventions.

Original languageEnglish
JournalNature Computational Science
Volume4
Pages (from-to)43–56
Number of pages14
DOIs
Publication statusPublished - 2024

Bibliographical note

Publisher Copyright:
© 2023, The Author(s), under exclusive licence to Springer Nature America, Inc.

ID: 378520874