Genomics

Dataset Information

0

Cataloging human PRDM9 variability utilizing long-read sequencing technologies reveals PRDM9 population-specificity and two distinct groupings of related alleles


ABSTRACT: The PRDM9 protein determines sites of meiotic recombination in humans by directing meiotic DNA double strand breaks to specific loci. Targeting specificity is encoded by a long array of C2H2 zinc fingers that binds to DNA. This zinc finger array is hypervariable, resulting in innumerable alleles, each with a potentially different DNA binding preference. Assessment of PRDM9 diversity is important for understanding the complexity of human population genetics, inheritance linkage patterns, and the predisposition to genetic disease. Due to the repetitive nature of the PRDM9 zinc finger array, large-scale sequencing of human PRDM9 is challenging. We therefore developed a long-read sequencing strategy to infer the diploid PRDM9 zinc finger array genotype in a high-throughput manner. From an unbiased study of PRDM9 allelic diversity in 716 individuals from 7 human populations, we detected 70 high-confidence PRDM9 alleles. Several alleles differ in frequency among human populations and 33 alleles had not been identified by previous studies, which were heavily biased to European populations. PRDM9 alleles are distinguished by their DNA binding site preferences and fall into two major categories related to the most common PRDM9-A and PRDM9-C alleles. We also find that inter-conversion between allele types is likely rare. By mapping meiotic DSBs in testis, we find that small variations in PRDM9 can substantially alter the meiotic recombination landscape, demonstrating that minor PRDM9 variants may play an under-appreciated role in shaping patterns of human recombination. In summary, our data greatly expands our knowledge of PRDM9 diversity in humans.

ORGANISM(S): Homo sapiens

PROVIDER: GSE166483 | GEO | 2021/10/26

REPOSITORIES: GEO

Similar Datasets

2015-09-21 | E-GEOD-67673 | biostudies-arrayexpress
2020-05-12 | GSE139289 | GEO
2015-09-21 | GSE67673 | GEO
2019-06-07 | GSE109874 | GEO
2020-12-03 | PXD017337 | Pride
2018-04-27 | GSE104850 | GEO
2020-02-20 | GSE138915 | GEO
2021-03-25 | GSE163474 | GEO
2016-06-30 | E-GEOD-76416 | biostudies-arrayexpress
2016-06-30 | GSE76416 | GEO