Unknown

Dataset Information

0

Accounting for population structure in genetic studies of cystic fibrosis.


ABSTRACT: CFTR F508del (c.1521_1523delCTT, p.Phe508delPhe) is the most common pathogenic allele underlying cystic fibrosis (CF), and its frequency varies in a geographic cline across Europe. We hypothesized that genetic variation associated with this cline is overrepresented in a large cohort (N > 5,000) of persons with CF who underwent whole-genome sequencing and that this pattern could result in spurious associations between variants correlated with both the F508del genotype and CF-related outcomes. Using principal-component (PC) analyses, we showed that variation in the CFTR region disproportionately contributes to a PC explaining a relatively high proportion of genetic variance. Variation near CFTR was correlated with population structure among persons with CF, and this correlation was driven by a subset of the sample inferred to have European ancestry. We performed genome-wide association studies comparing persons with CF with one versus two copies of the F508del allele; this allowed us to identify genetic variation associated with the F508del allele and to determine that standard PC-adjustment strategies eliminated the significant association signals. Our results suggest that PC adjustment can adequately prevent spurious associations between genetic variants and CF-related traits and are therefore effective tools to control for population structure even when population structure is confounded with disease severity and a common pathogenic variant.

SUBMITTER: Kingston H 

PROVIDER: S-EPMC9136666 | biostudies-literature | 2022 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications


<i>CFTR</i> F508del (c.1521_1523delCTT, p.Phe508delPhe) is the most common pathogenic allele underlying cystic fibrosis (CF), and its frequency varies in a geographic cline across Europe. We hypothesized that genetic variation associated with this cline is overrepresented in a large cohort (N > 5,000) of persons with CF who underwent whole-genome sequencing and that this pattern could result in spurious associations between variants correlated with both the F508del genotype and CF-related outcom  ...[more]

Similar Datasets

| S-EPMC2995003 | biostudies-literature
| 2726863 | ecrin-mdr-crc
2010-08-16 | E-GEOD-23636 | biostudies-arrayexpress
2010-08-16 | GSE23636 | GEO
| S-EPMC4090102 | biostudies-literature
| S-EPMC3552343 | biostudies-literature
| S-EPMC8477396 | biostudies-literature
| S-EPMC3781476 | biostudies-literature
| S-EPMC7548908 | biostudies-literature
| S-EPMC4778803 | biostudies-literature