Unknown

Dataset Information

0

Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.


ABSTRACT: Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Trans-Omics for Precision Medicine (TOPMed) program to impute genotypes into admixed African and Hispanic/Latino samples with genome-wide genotyping array data. We demonstrated that using TOPMed sequencing data as the imputation reference panel improves genotype imputation quality in these populations, which subsequently enhanced gene-mapping power for complex traits. For rare variants with minor allele frequency (MAF) < 0.5%, we observed a 2.3- to 6.1-fold increase in the number of well-imputed variants, with 11-34% improvement in average imputation quality, compared to the state-of-the-art 1000 Genomes Project Phase 3 and Haplotype Reference Consortium reference panels. Impressively, even for extremely rare variants with minor allele count <10 (including singletons) in the imputation target samples, average information content rescued was >86%. Subsequent association analyses of TOPMed reference panel-imputed genotype data with hematological traits (hemoglobin (HGB), hematocrit (HCT), and white blood cell count (WBC)) in ~21,600 African-ancestry and ~21,700 Hispanic/Latino individuals identified associations with two rare variants in the HBB gene (rs33930165 with higher WBC [p = 8.8x10-15] in African populations, rs11549407 with lower HGB [p = 1.5x10-12] and HCT [p = 8.8x10-10] in Hispanics/Latinos). By comparison, neither variant would have been genome-wide significant if either 1000 Genomes Project Phase 3 or Haplotype Reference Consortium reference panels had been used for imputation. Our findings highlight the utility of the TOPMed imputation reference panel for identification of novel rare variant associations not previously detected in similarly sized genome-wide studies of under-represented African and Hispanic/Latino populations.

SUBMITTER: Kowalski MH 

PROVIDER: S-EPMC6953885 | biostudies-literature | 2019 Dec

REPOSITORIES: biostudies-literature

altmetric image

Publications

Use of >100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations.

Kowalski Madeline H MH   Qian Huijun H   Hou Ziyi Z   Rosen Jonathan D JD   Tapia Amanda L AL   Shan Yue Y   Jain Deepti D   Argos Maria M   Arnett Donna K DK   Avery Christy C   Barnes Kathleen C KC   Becker Lewis C LC   Bien Stephanie A SA   Bis Joshua C JC   Blangero John J   Boerwinkle Eric E   Bowden Donald W DW   Buyske Steve S   Cai Jianwen J   Cho Michael H MH   Choi Seung Hoan SH   Choquet Hélène H   Cupples L Adrienne LA   Cushman Mary M   Daya Michelle M   de Vries Paul S PS   Ellinor Patrick T PT   Faraday Nauder N   Fornage Myriam M   Gabriel Stacey S   Ganesh Santhi K SK   Graff Misa M   Gupta Namrata N   He Jiang J   Heckbert Susan R SR   Hidalgo Bertha B   Hodonsky Chani J CJ   Irvin Marguerite R MR   Johnson Andrew D AD   Jorgenson Eric E   Kaplan Robert R   Kardia Sharon L R SLR   Kelly Tanika N TN   Kooperberg Charles C   Lasky-Su Jessica A JA   Loos Ruth J F RJF   Lubitz Steven A SA   Mathias Rasika A RA   McHugh Caitlin P CP   Montgomery Courtney C   Moon Jee-Young JY   Morrison Alanna C AC   Palmer Nicholette D ND   Pankratz Nathan N   Papanicolaou George J GJ   Peralta Juan M JM   Peyser Patricia A PA   Rich Stephen S SS   Rotter Jerome I JI   Silverman Edwin K EK   Smith Jennifer A JA   Smith Nicholas L NL   Taylor Kent D KD   Thornton Timothy A TA   Tiwari Hemant K HK   Tracy Russell P RP   Wang Tao T   Weiss Scott T ST   Weng Lu-Chen LC   Wiggins Kerri L KL   Wilson James G JG   Yanek Lisa R LR   Zöllner Sebastian S   North Kari E KE   Auer Paul L PL   Raffield Laura M LM   Reiner Alexander P AP   Li Yun Y  

PLoS genetics 20191223 12


Most genome-wide association and fine-mapping studies to date have been conducted in individuals of European descent, and genetic studies of populations of Hispanic/Latino and African ancestry are limited. In addition, these populations have more complex linkage disequilibrium structure. In order to better define the genetic architecture of these understudied populations, we leveraged >100,000 phased sequences available from deep-coverage whole genome sequencing through the multi-ethnic NHLBI Tr  ...[more]

Similar Datasets

| S-EPMC7875770 | biostudies-literature
| S-EPMC8394596 | biostudies-literature
| S-EPMC3524415 | biostudies-literature
2010-04-08 | GSE21248 | GEO
| PRJNA434034 | ENA