Unknown

Dataset Information

0

Mutagenesis of human genomes by endogenous mobile elements on a population scale.


ABSTRACT: Several large-scale Illumina whole-genome sequencing (WGS) and whole-exome sequencing (WES) projects have emerged recently that have provided exceptional opportunities to discover mobile element insertions (MEIs) and study the impact of these MEIs on human genomes. However, these projects also have presented major challenges with respect to the scalability and computational costs associated with performing MEI discovery on tens or even hundreds of thousands of samples. To meet these challenges, we have developed a more efficient and scalable version of our mobile element locator tool (MELT) called CloudMELT. We then used MELT and CloudMELT to perform MEI discovery in 57,919 human genomes and exomes, leading to the discovery of 104,350 nonredundant MEIs. We leveraged this collection (1) to examine potentially active L1 source elements that drive the mobilization of new Alu, L1, and SVA MEIs in humans; (2) to examine the population distributions and subfamilies of these MEIs; and (3) to examine the mutagenesis of GENCODE genes, ENCODE-annotated features, and disease genes by these MEIs. Our study provides new insights on the L1 source elements that drive MEI mutagenesis and brings forth a better understanding of how this mutagenesis impacts human genomes.

SUBMITTER: Chuang NT 

PROVIDER: S-EPMC8647825 | biostudies-literature | 2021 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Mutagenesis of human genomes by endogenous mobile elements on a population scale.

Chuang Nelson T NT   Gardner Eugene J EJ   Terry Diane M DM   Crabtree Jonathan J   Mahurkar Anup A AA   Rivell Guillermo L GL   Hong Charles C CC   Perry James A JA   Devine Scott E SE  

Genome research 20211112 12


Several large-scale Illumina whole-genome sequencing (WGS) and whole-exome sequencing (WES) projects have emerged recently that have provided exceptional opportunities to discover mobile element insertions (MEIs) and study the impact of these MEIs on human genomes. However, these projects also have presented major challenges with respect to the scalability and computational costs associated with performing MEI discovery on tens or even hundreds of thousands of samples. To meet these challenges,  ...[more]

Similar Datasets

| S-EPMC2943760 | biostudies-literature
| S-EPMC2987831 | biostudies-literature
| S-EPMC10675110 | biostudies-literature
| S-EPMC2527708 | biostudies-literature
| S-EPMC2818285 | biostudies-literature
| S-EPMC6299635 | biostudies-literature
| S-EPMC4178727 | biostudies-literature
| S-EPMC5512259 | biostudies-literature
| S-EPMC3141000 | biostudies-literature
| S-EPMC11529989 | biostudies-literature