<HashMap><database>biostudies-literature</database><scores><citationCount>0</citationCount><reanalysisCount>0</reanalysisCount><viewCount>55</viewCount><searchCount>0</searchCount></scores><additional><submitter>Saini S</submitter><funding>NIMH NIH HHS</funding><funding>NIH HHS</funding><pagination>4397</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC6199332</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>9(1)</volume><pubmed_abstract>Short tandem repeats (STRs) are involved in dozens of Mendelian disorders and have been implicated in complex traits. However, genotyping arrays used in genome-wide association studies focus on single nucleotide polymorphisms (SNPs) and do not readily allow identification of STR associations. We leverage next-generation sequencing (NGS) from 479 families to create a SNP + STR reference haplotype panel. Our panel enables imputing STR genotypes into SNP array data when NGS is not available for directly genotyping STRs. Imputed genotypes achieve mean concordance of 97% with observed genotypes in an external dataset compared to 71% expected under a naive model. Performance varies widely across STRs, with near perfect concordance at bi-allelic STRs vs. 70% at highly polymorphic repeats. Imputation increases power over individual SNPs to detect STR associations with gene expression. Imputing STRs into existing SNP datasets will enable the first large-scale STR association studies across a range of complex traits.</pubmed_abstract><journal>Nature communications</journal><pubmed_title>A reference haplotype panel for genome-wide imputation of short tandem repeats.</pubmed_title><pmcid>PMC6199332</pmcid><funding_grant_id>R01 MH113715</funding_grant_id><funding_grant_id>DP5 OD024577</funding_grant_id><pubmed_authors>Gymrek M</pubmed_authors><pubmed_authors>Mitra I</pubmed_authors><pubmed_authors>Fotsing SF</pubmed_authors><pubmed_authors>Mousavi N</pubmed_authors><pubmed_authors>Saini S</pubmed_authors><view_count>55</view_count></additional><is_claimable>false</is_claimable><name>A reference haplotype panel for genome-wide imputation of short tandem repeats.</name><description>Short tandem repeats (STRs) are involved in dozens of Mendelian disorders and have been implicated in complex traits. However, genotyping arrays used in genome-wide association studies focus on single nucleotide polymorphisms (SNPs) and do not readily allow identification of STR associations. We leverage next-generation sequencing (NGS) from 479 families to create a SNP + STR reference haplotype panel. Our panel enables imputing STR genotypes into SNP array data when NGS is not available for directly genotyping STRs. Imputed genotypes achieve mean concordance of 97% with observed genotypes in an external dataset compared to 71% expected under a naive model. Performance varies widely across STRs, with near perfect concordance at bi-allelic STRs vs. 70% at highly polymorphic repeats. Imputation increases power over individual SNPs to detect STR associations with gene expression. Imputing STRs into existing SNP datasets will enable the first large-scale STR association studies across a range of complex traits.</description><dates><release>2018-01-01T00:00:00Z</release><publication>2018 Oct</publication><modification>2024-11-09T21:50:51.536Z</modification><creation>2019-03-27T00:04:22Z</creation></dates><accession>S-EPMC6199332</accession><cross_references><pubmed>30353011</pubmed><doi>10.1038/s41467-018-06694-0</doi></cross_references></HashMap>