
Dataset Information


Constructing Causal Life-Course Models: Comparative Study of Data-Driven and Theory-Driven Approaches.


ABSTRACT: Life-course epidemiology relies on specifying complex (causal) models that describe how variables interplay over time. Traditionally, such models have been constructed by perusing existing theory and previous studies. By comparing data-driven and theory-driven models, we investigated whether data-driven causal discovery algorithms can help in this process. We focused on a longitudinal data set on a cohort of Danish men (the Metropolit Study, 1953-2017). The theory-driven models were constructed by 2 subject-field experts. The data-driven models were constructed by use of the temporal Peter-Clark (TPC) algorithm. The TPC algorithm utilizes the temporal information embedded in life-course data. We found that the data-driven models recovered some, but not all, causal relationships included in the theory-driven expert models. The data-driven method was especially good at identifying direct causal relationships that the experts had high confidence in. Moreover, in a post hoc assessment, we found that most of the direct causal relationships proposed by the data-driven model but not included in the theory-driven model were plausible. Thus, the data-driven model may propose additional meaningful causal hypotheses that are new or have been overlooked by the experts. In conclusion, data-driven methods can aid causal model construction in life-course epidemiology, and combining both data-driven and theory-driven methods can lead to even stronger models.
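The comparison described in the abstract can be illustrated with a minimal sketch. This is not the TPC algorithm or the authors' code: it shows only the temporal-ordering idea (an edge may not point from a later period to an earlier one) and a simple set comparison between a data-driven and a theory-driven edge list. All variable names, periods, and edges below are hypothetical toy values, not taken from the Metropolit Study.

```python
from typing import Dict, Set, Tuple

Edge = Tuple[str, str]  # (cause, effect)


def respects_time_order(edge: Edge, period: Dict[str, int]) -> bool:
    """Return True if the cause is not measured in a later period than the effect."""
    cause, effect = edge
    return period[cause] <= period[effect]


def compare_edge_sets(
    data_driven: Set[Edge], theory_driven: Set[Edge]
) -> Dict[str, Set[Edge]]:
    """Split edges into those both models share and those unique to each."""
    return {
        "recovered": data_driven & theory_driven,      # in both models
        "missed": theory_driven - data_driven,         # expert edges not found
        "proposed_only": data_driven - theory_driven,  # new candidate hypotheses
    }


# Hypothetical toy example: three variables in three life-course periods.
period = {"birth_weight": 0, "childhood_iq": 1, "adult_income": 2}

# Hypothetical candidate edges; one of them points backwards in time.
candidates = {
    ("birth_weight", "childhood_iq"),
    ("adult_income", "childhood_iq"),  # violates the temporal order
    ("childhood_iq", "adult_income"),
}
data_driven = {e for e in candidates if respects_time_order(e, period)}

# Hypothetical expert model.
theory_driven = {
    ("birth_weight", "childhood_iq"),
    ("birth_weight", "adult_income"),
}

result = compare_edge_sets(data_driven, theory_driven)
```

In this toy setup, `result["proposed_only"]` plays the role of the edges that the abstract says were assessed post hoc for plausibility.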

SUBMITTER: Petersen AH 

PROVIDER: S-EPMC11004942 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature


Publications

Constructing Causal Life-Course Models: Comparative Study of Data-Driven and Theory-Driven Approaches.

Petersen AH, Ekstrøm CT, Spirtes P, Osler M

American Journal of Epidemiology, 2023-11-01 (11)



Similar Datasets

| S-EPMC8494078 | biostudies-literature
| S-EPMC8924895 | biostudies-literature
| S-EPMC11760626 | biostudies-literature
| S-EPMC6647547 | biostudies-literature
| S-EPMC3490823 | biostudies-literature
| S-EPMC10115127 | biostudies-literature
| S-EPMC5660883 | biostudies-literature
| S-EPMC8555780 | biostudies-literature
| S-EPMC1388097 | biostudies-literature
| S-EPMC4472167 | biostudies-literature