Genomics

Dataset Information

0

RNA-seq alignment to individualized genomes


ABSTRACT: The source of most errors in RNA sequencing (RNA-seq) read alignment is in the repetitive structure of the genome and not with the alignment algorithm. Genetic variation away from the reference sequence exacerbates this problem causing reads to be assigned to the wrong location. We developed a method, implemented as the software package Seqnature, to construct the imputed genomes of individuals (individualized genomes) of experimental model organisms including inbred mouse strains and genetically unique outbred animals. Alignment to individualized genomes increases read mapping accuracy and improves transcript abundance estimates. In an application to expression QTL mapping, this approach corrected erroneous linkages and unmasked thousands of hidden associations. Individualized genomes accounting for genetic variation will be useful for human short-read sequencing and other sequencing applications including ChIP-seq.

ORGANISM(S): Mus musculus

PROVIDER: GSE45684 | GEO | 2014/06/10

SECONDARY ACCESSION(S): PRJNA196894

REPOSITORIES: GEO

Similar Datasets

2014-06-10 | E-GEOD-45684 | biostudies-arrayexpress
2013-07-15 | E-MTAB-1728 | biostudies-arrayexpress
| PRJEB4265 | ENA
| PRJNA196894 | ENA
2017-03-15 | GSE92568 | GEO
2016-12-28 | GSE92928 | GEO
2022-09-05 | GSE147454 | GEO
2015-04-17 | GSE67979 | GEO
2019-05-25 | GSE81866 | GEO
2017-08-17 | GSE86594 | GEO