Unknown

Dataset Information

0

Characterizing mobile element insertions in 5675 genomes.


ABSTRACT: Mobile element insertions (MEIs) are a major class of structural variants (SVs) and have been linked to many human genetic disorders, including hemophilia, neurofibromatosis, and various cancers. However, human MEI resources from large-scale genome sequencing are still lacking compared to those for SNPs and SVs. Here, we report a comprehensive map of 36 699 non-reference MEIs constructed from 5675 genomes, comprising 2998 Chinese samples (∼26.2×, NyuWa) and 2677 samples from the 1000 Genomes Project (∼7.4×, 1KGP). We discovered that LINE-1 insertions were highly enriched in centromere regions, implying the role of chromosome context in retroelement insertion. After functional annotation, we estimated that MEIs are responsible for about 9.3% of all protein-truncating events per genome. Finally, we built a companion database named HMEID for public use. This resource represents the latest and largest genomewide study on MEIs and will have broad utility for exploration of human MEI findings.

SUBMITTER: Niu Y 

PROVIDER: S-EPMC8934628 | biostudies-literature | 2022 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Characterizing mobile element insertions in 5675 genomes.

Niu Yiwei Y   Teng Xueyi X   Zhou Honghong H   Shi Yirong Y   Li Yanyan Y   Tang Yiheng Y   Zhang Peng P   Luo Huaxia H   Kang Quan Q   Xu Tao T   He Shunmin S  

Nucleic acids research 20220301 5


Mobile element insertions (MEIs) are a major class of structural variants (SVs) and have been linked to many human genetic disorders, including hemophilia, neurofibromatosis, and various cancers. However, human MEI resources from large-scale genome sequencing are still lacking compared to those for SNPs and SVs. Here, we report a comprehensive map of 36 699 non-reference MEIs constructed from 5675 genomes, comprising 2998 Chinese samples (∼26.2×, NyuWa) and 2677 samples from the 1000 Genomes Pro  ...[more]

Similar Datasets

| S-EPMC7035633 | biostudies-literature
| S-EPMC4785551 | biostudies-literature
| S-EPMC8219704 | biostudies-literature
| S-EPMC7102983 | biostudies-literature
| S-EPMC4218655 | biostudies-literature
| S-EPMC4338948 | biostudies-literature
| S-EPMC7385971 | biostudies-literature
| PRJNA632952 | ENA
| PRJNA632951 | ENA
| S-EPMC3698510 | biostudies-literature