Project description:We have completed the high quality reference genome for domestic sheep (Oar v3.1). Early-stage Illumina GA sequence platform sequenced less reads in high GC content regions than in other regions. To read through higher GC content regions, we generated 2 Gb MeDIP-seq data for filling gaps in sheep reference genome assembly.
Project description:28 Streptomyces strains isolated from common scab lesions of potato tubers from a wide geographic range in Norway, were selected for microarray analysis. The selected strains were subjected to species identification by microarray, 16S phylogenetic analysis and PCR; and microarray-based comparative genome analysis. To our knowledge, this is the first report of S. turgidiscabies and S. europaeiscabiei in Norway.
Project description:We have sequenced a wild Prunus mume and constructed a reference sequence for this genome. In order to improve quality of gene models, RNA samples of five tissues (bud, leaf, root, stem, fruit) were extracted from the Prunus mume. To investigate tissue specific expression using the reference genome assembly and annotated genes, we extracted RNA samples of different tissues and conducted transcriptome sequencing and DEG analysis.
Project description:Centromeres are chromosomal regions that serve as platforms for kinetochore assembly and spindle attachments, ensuring accurate chromosome segregation during cell division. Despite functional conservation, centromeric sequences are diverse and usually repetitive across species, making them challenging to assemble and identify. Here, we describe centromeres in the model oomycete Phytophthora sojae by combining long-read sequencing-based genome assembly and chromatin immunoprecipitation for the centromeric histone CENP-A followed by high-throughput sequencing (ChIP-seq). P. sojae centromeres cluster at a single focus in the nucleus at different life stages and during nuclear division. We report a highly contiguous genome assembly of the P. sojae reference strain, which enabled identification of 15 highly enriched CENP-A binding regions as putative centromeres. By focusing on 10 intact regions, we demonstrate that centromeres in P. sojae are regional, spanning 211 to 356 kb. Most of these regions are transposon-rich, poorly transcribed, and lack the euchromatin mark H3K4me2 but are embedded within regions with the heterochromatin marks H3K9me3 and H3K27me3.
Project description:Advances in sequencing and assembly technology has led to the creation of genome assemblies for a wide variety of non-model organisms. The rapid production and proliferation of updated, novel assembly versions can create create vexing problems for researchers when multiple genome as-sembly versions are available at once, requiring researchers to work with more than one reference genome. Multiple genome assemblies are especially problematic for researchers studying the genetic makeup of individual cells as single cell RNA sequencing (scRNAseq) requires sequenced reads to be mapped and aligned to a single reference genome. Using the Astyanax mexicanus this study highlights how the interpretation of a single cell dataset from the same sample changes when aligned to its two different available genome assemblies. We found that the number of cells and expressed genes detected were drastically different when aligning to the different assemblies. When the genome assemblies were used in isolation with their respective annotation, cell type identification was confounded as some classic cell type markers were assembly-specific, whilst other genes showed differential patterns of expression between the two assemblies. To overcome the problems posed by multiple genome assemblies, we propose that researchers align to each available assembly and then integrate the resultant datasets to produce a final dataset in which all genome alignments can be used simultaneously. We found this approach increased the accuracy of cell type identification and maximised the amount of data that could be extracted from our single cell sample by capturing all possible cells and transcripts. As scRNAseq becomes more widely available, it is imperative that the single cell community is aware how genome assembly alignment can alter single cell data and its interpretation, especially when reviewing studies on non-model organisms.
Project description:Custom exon aCGH analysis of copy number across the genomes of 16 horse breeds Two-condition experiment, All breed samples were compared to a single Thoroughbred reference, Reference was then compared to Twilight (DNA from horse used for reference genome assembly)