<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Fearnley LG</submitter><funding>DHB Foundation</funding><funding>Medical Research Council</funding><funding>National Health and Medical Research Council</funding><funding>Citizens United for Research in Epilepsy</funding><pagination>13124</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC9338934</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>12(1)</volume><pubmed_abstract>Bioinformatic methods for detecting short tandem repeat expansions in short-read sequencing have identified new repeat expansions in humans, but require alignment information to identify repetitive motif enrichment at genomic locations. We present superSTR, an ultrafast method that does not require alignment. superSTR is used to process whole-genome and whole-exome sequencing data, and perform the first STR analysis of the UK Biobank, efficiently screening and identifying known and potential disease-associated STRs in the exomes of 49,953 biobank participants. We demonstrate the first bioinformatic screening of RNA sequencing data to detect repeat expansions in humans and mouse models of ataxia and dystrophy.</pubmed_abstract><journal>Scientific reports</journal><pubmed_title>Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment.</pubmed_title><pmcid>PMC9338934</pmcid><funding_grant_id>MC_PC_17228</funding_grant_id><funding_grant_id>Senior Reearch Fellowship 1102971</funding_grant_id><funding_grant_id>Centenary Postdoctoral Fellowship in Neurogenetic Systems Biology</funding_grant_id><funding_grant_id>Taking Flight Award</funding_grant_id><funding_grant_id>MC_QA137853</funding_grant_id><pubmed_authors>Bahlo M</pubmed_authors><pubmed_authors>Bennett MF</pubmed_authors><pubmed_authors>Fearnley LG</pubmed_authors></additional><is_claimable>false</is_claimable><name>Detection of repeat expansions in large next generation DNA and RNA sequencing data without alignment.</name><description>Bioinformatic methods for detecting short tandem repeat expansions in short-read sequencing have identified new repeat expansions in humans, but require alignment information to identify repetitive motif enrichment at genomic locations. We present superSTR, an ultrafast method that does not require alignment. superSTR is used to process whole-genome and whole-exome sequencing data, and perform the first STR analysis of the UK Biobank, efficiently screening and identifying known and potential disease-associated STRs in the exomes of 49,953 biobank participants. We demonstrate the first bioinformatic screening of RNA sequencing data to detect repeat expansions in humans and mouse models of ataxia and dystrophy.</description><dates><release>2022-01-01T00:00:00Z</release><publication>2022 Jul</publication><modification>2024-12-03T16:12:02.751Z</modification><creation>2024-12-03T16:12:02.751Z</creation></dates><accession>S-EPMC9338934</accession><cross_references><pubmed>35907931</pubmed><doi>10.1038/s41598-022-17267-z</doi></cross_references></HashMap>