Unknown

Dataset Information

0

The Relationship Between Haplotype-Based F ST and Haplotype Length.


ABSTRACT: The population-genetic statistic [Formula: see text] is used widely to describe allele frequency distributions in subdivided populations. The increasing availability of DNA sequence data has recently enabled computations of [Formula: see text] from sequence-based "haplotype loci." At the same time, theoretical work has revealed that [Formula: see text] has a strong dependence on the underlying genetic diversity of a locus from which it is computed, with high diversity constraining values of [Formula: see text] to be low. In the case of haplotype loci, for which two haplotypes that are distinct over a specified length along a chromosome are treated as distinct alleles, genetic diversity is influenced by haplotype length: longer haplotype loci have the potential for greater genetic diversity. Here, we study the dependence of [Formula: see text] on haplotype length. Using a model in which a haplotype locus is sequentially incremented by one biallelic locus at a time, we show that increasing the length of the haplotype locus can either increase or decrease the value of [Formula: see text], and usually decreases it. We compute [Formula: see text] on haplotype loci in human populations, finding a close correspondence between the observed values and our theoretical predictions. We conclude that effects of haplotype length are valuable to consider when interpreting [Formula: see text] calculated on haplotypic data.

SUBMITTER: Mehta RS 

PROVIDER: S-EPMC6727796 | biostudies-literature | 2019 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

The Relationship Between Haplotype-Based <i>F</i><sub><i>ST</i></sub> and Haplotype Length.

Mehta Rohan S RS   Feder Alison F AF   Boca Simina M SM   Rosenberg Noah A NA  

Genetics 20190708 1


The population-genetic statistic [Formula: see text] is used widely to describe allele frequency distributions in subdivided populations. The increasing availability of DNA sequence data has recently enabled computations of [Formula: see text] from sequence-based "haplotype loci." At the same time, theoretical work has revealed that [Formula: see text] has a strong dependence on the underlying genetic diversity of a locus from which it is computed, with high diversity constraining values of [For  ...[more]

Similar Datasets

| S-EPMC5565725 | biostudies-literature
| S-EPMC8804896 | biostudies-literature
| S-EPMC2581416 | biostudies-literature
| S-EPMC7961563 | biostudies-literature
| S-EPMC5006993 | biostudies-literature
2014-06-27 | GSE48386 | GEO
| S-EPMC4093846 | biostudies-literature
2014-06-27 | E-GEOD-48386 | biostudies-arrayexpress
| S-EPMC3614594 | biostudies-literature