Unknown

Dataset Information

0

Recurrent evolution of vertebrate transcription factors by transposase capture.


ABSTRACT: Genes with novel cellular functions may evolve through exon shuffling, which can assemble novel protein architectures. Here, we show that DNA transposons provide a recurrent supply of materials to assemble protein-coding genes through exon shuffling. We find that transposase domains have been captured-primarily via alternative splicing-to form fusion proteins at least 94 times independently over the course of ~350 million years of tetrapod evolution. We find an excess of transposase DNA binding domains fused to host regulatory domains, especially the Krüppel-associated box (KRAB) domain, and identify four independently evolved KRAB-transposase fusion proteins repressing gene expression in a sequence-specific fashion. The bat-specific KRABINER fusion protein binds its cognate transposons genome-wide and controls a network of genes and cis-regulatory elements. These results illustrate how a transcription factor and its binding sites can emerge.

SUBMITTER: Cosby RL 

PROVIDER: S-EPMC8186458 | biostudies-literature | 2021 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Recurrent evolution of vertebrate transcription factors by transposase capture.

Cosby Rachel L RL   Judd Julius J   Zhang Ruiling R   Zhong Alan A   Garry Nathaniel N   Pritham Ellen J EJ   Feschotte Cédric C  

Science (New York, N.Y.) 20210201 6531


Genes with novel cellular functions may evolve through exon shuffling, which can assemble novel protein architectures. Here, we show that DNA transposons provide a recurrent supply of materials to assemble protein-coding genes through exon shuffling. We find that transposase domains have been captured-primarily via alternative splicing-to form fusion proteins at least 94 times independently over the course of ~350 million years of tetrapod evolution. We find an excess of transposase DNA binding  ...[more]

Similar Datasets

2020-05-08 | GSE148789 | GEO
2020-05-08 | GSE148788 | GEO
2020-05-08 | GSE148787 | GEO
| PRJNA625721 | ENA
| PRJNA625724 | ENA
| PRJNA625725 | ENA
| S-EPMC2151751 | biostudies-literature
| S-EPMC2671141 | biostudies-literature
| S-EPMC5494522 | biostudies-literature
| S-EPMC8657788 | biostudies-literature