Unknown

Dataset Information

0

Characterization of full-length LINE-1 insertions in 154 genomes.


ABSTRACT: Long interspersed nuclear elements (LINEs) are retrotransposons that contribute to genetic variation in the human genome. LINE-1 elements in larger-scale studies are challenging to identify using sequencing technologies due to cost and scalability. We developed an approach using optical mapping for detection of full-length LINE-1 insertions and 10× sequencing for confirmation. We found 51 true positive full-length LINE-1 insertions, of which 4 are novel insertions, in NA12878. Repeating our analysis on a larger sample set representing 26 populations, we identified 329 full-length LINE-1 elements, of which 123 are novel. 24.8% of these 329 LINE-1 insertions were shared amongst all 5 superpopulations (AFR, AMR, EUR, EAS, SAS). The African superpopulation has a higher percentage of population-specific LINE-1 insertions than any other superpopulation. These data indicate that our approach can provide high-speed, cost-effective, and increased accuracy for LINE-1 detection. These data also provide an insight into variations of LINE-1 elements between different populations.

SUBMITTER: Wong JS 

PROVIDER: S-EPMC8671192 | biostudies-literature | 2021 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

Characterization of full-length LINE-1 insertions in 154 genomes.

Wong Jessica S JS   Jadhav Tanaya T   Young Eleanor E   Wang Yilin Y   Xiao Ming M  

Genomics 20210915 6


Long interspersed nuclear elements (LINEs) are retrotransposons that contribute to genetic variation in the human genome. LINE-1 elements in larger-scale studies are challenging to identify using sequencing technologies due to cost and scalability. We developed an approach using optical mapping for detection of full-length LINE-1 insertions and 10× sequencing for confirmation. We found 51 true positive full-length LINE-1 insertions, of which 4 are novel insertions, in NA12878. Repeating our anal  ...[more]

Similar Datasets

| S-EPMC9252281 | biostudies-literature
| S-EPMC4589665 | biostudies-literature
| S-EPMC7077291 | biostudies-literature
| S-EPMC3471996 | biostudies-literature
| S-EPMC4376965 | biostudies-literature
| S-EPMC7671390 | biostudies-literature
| S-EPMC4056306 | biostudies-literature
| S-EPMC7354598 | biostudies-literature
| S-EPMC6162474 | biostudies-literature
| S-EPMC4473231 | biostudies-literature