Unknown

Dataset Information

0

Characterization of genome-wide STR variation in 6487 human genomes.


ABSTRACT: Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3'UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.

SUBMITTER: Shi Y 

PROVIDER: S-EPMC10097659 | biostudies-literature | 2023 Apr

REPOSITORIES: biostudies-literature

altmetric image

Publications

Characterization of genome-wide STR variation in 6487 human genomes.

Shi Yirong Y   Niu Yiwei Y   Zhang Peng P   Luo Huaxia H   Liu Shuai S   Zhang Sijia S   Wang Jiajia J   Li Yanyan Y   Liu Xinyue X   Song Tingrui T   Xu Tao T   He Shunmin S  

Nature communications 20230412 1


Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1  ...[more]

Similar Datasets

| S-EPMC4216929 | biostudies-literature
| S-EPMC3504113 | biostudies-literature
| S-EPMC7181556 | biostudies-literature
| S-EPMC7547914 | biostudies-literature
| S-EPMC5482724 | biostudies-literature
| S-EPMC5026262 | biostudies-literature
| S-EPMC7479135 | biostudies-literature
| S-EPMC3773904 | biostudies-literature
| S-EPMC2966974 | biostudies-literature
| S-EPMC4239369 | biostudies-literature