
Dataset Information


Machine learning enables prediction of metabolic system evolution in bacteria.


ABSTRACT: Evolution prediction is a long-standing goal in evolutionary biology, with potential impacts on strategic pathogen control, genome engineering, and synthetic biology. While laboratory evolution studies have shown the predictability of short-term and sequence-level evolution, that of long-term and system-level evolution has not been systematically examined. Here, we show that the gene content evolution of metabolic systems is generally predictable by applying ancestral gene content reconstruction and machine learning techniques to ~3000 bacterial genomes. Our framework, Evodictor, successfully predicted gene gain and loss evolution at the branches of the reference phylogenetic tree, suggesting that evolutionary pressures and constraints on metabolic systems are universally shared. Investigation of pathway architectures and meta-analysis of metagenomic datasets confirmed that these evolutionary patterns have physiological and ecological bases as functional dependencies among metabolic reactions and bacterial habitat changes. Last, pan-genomic analysis of intraspecies gene content variations proved that even "ongoing" evolution in extant bacterial species is predictable in our framework.

SUBMITTER: Konno N 

PROVIDER: S-EPMC9833677 | biostudies-literature | 2023 Jan

REPOSITORIES: biostudies-literature


Publications

Machine learning enables prediction of metabolic system evolution in bacteria.

Konno Naoki, Iwasaki Wataru

Science Advances, 2023-01-11, issue 2



Similar Datasets

| S-EPMC7538910 | biostudies-literature
2013-01-01 | E-GEOD-29210 | biostudies-arrayexpress
| S-EPMC3146072 | biostudies-literature
| S-EPMC6923510 | biostudies-literature
| S-EPMC11838293 | biostudies-literature
| S-EPMC7593340 | biostudies-literature
| S-EPMC10112389 | biostudies-literature
| S-EPMC7886887 | biostudies-literature
2013-01-01 | GSE29210 | GEO
| S-EPMC11439005 | biostudies-literature