Dataset Information

ABSTRACT: Targeted sequencing reveals candidate causal variants for dairy bull subfertility

REPOSITORIES: ENA

Similar Datasets

Project description:Transcriptome studies in patients with rare genetic diseases can potentially aid in the interpretation of likely causal genetic variation through identification of altered transcript abundance and/or structure. RNA-Seq is the most sensitive assay for both investigating transcript structure and abundance. The primary aim of this pilot project is to investigate to what degree integrating exome-Seq and RNA-Seq data on the same individual can accelerate the identification of causal alleles for rare genetic diseases. There are two main strands to this: (i) identifying which variants discovered in exome-seq appear to be having a functional impact on transcripts, and (ii)identifying transcript outliers, especially among known causal genes, that may not necessarily have a causal variant identified from exome sequencing. The latter may identify the presence of causal variants that lie far from coding regions (e.g. the formation of cryptic splice sites deep within introns, or loss of long range regulatory elements), which can be confirmed with further targeted genetic assays. Just over 50% of all disease-causing variants recorded in theHuman Gene Mutation Database (HGMD) affect transcript structure and abundance (e.g.nonsense SNVs, essential splice site SNVs, frame shifting indels, CNVs).This pilot project will study RNA from lymphoblastoid cell-lines from 12 patients with primordial dwarfism syndromes, for 10 of these samples we have previously generate exome data as part of our collaboration with the group of Prof Andrew Jackson. The two remaining samples are positive controls where the causal mutation is known, and is known to affect transcript structure and/or abundance. Primordial dwarfism is a prime candidate for these RNA-seq studies because all known causal mutations to date have key roles in DNA replication and thus, unsurprisingly, the products of the causal genes are typically ubiquitously expressed. Each RNA will be sequenced, with two technical replicates (independent RT-PCR and libraries) per sample, and each replicate run in 1/2 of a HiSeq lane using 100bp paired reads. Samples preparation was as follows :The cells were grown to confluency, then pellets frozen at -80. RNA samples were prepared using the Qiagen RNeasy kit, then nanodropped and analyzed using the bioanalyzer to determine concentration and purity. This data is part of a pre-publication release. For information on the proper use of pre-publication data shared by the Wellcome Trust Sanger Institute (including details of any publication moratoria), please see http://www.sanger.ac.uk/datasharing/

Dataset Information

Similar Datasets

OmicsDI is part of the ELIXIR infrastructure