Project description:We have adapted a solution hybrid selection protocol to enrich pathogen DNA in clinical samples dominated by human genetic material. Using mock mixtures of human and Plasmodium falciparum malaria parasite DNA as well as clinical samples from infected patients, we demonstrate an average of approximately 40-fold enrichment of parasite DNA after hybrid selection. This approach will enable efficient genome sequencing of pathogens from clinical samples, as well as sequencing of endosymbiotic organisms such as Wolbachia that live inside diverse metazoan phyla.
Project description:In Europe, ticks are the most important vectors of diseases threatening humans, livestock, wildlife and companion animals. Nevertheless, genomic sequence information and functional annotation of proteins of the most important European tick, Ixodes ricinus, is limited. Here we present the first analysis of the I. ricinus genome and of the transcriptome of the unfed I. ricinus midgut. We combined and integrated data from genome, transcriptome and proteome. The de novo assembly of 1 billion paired-end sequences identified 6,415 putative genes providing an unprecedented insight into the I. ricinus genome. Mapping of our midgut mRNA reads to the assembled contigs let us estimate to cover around two third of the unique genomic sequences. In addition, more than 10,000 transcripts from naïve midgut were annotated functionally and/or locally. By combining the alignment-based with a motif-search based annotation approach, we could double the number of annotations throughout all groups without shifting the dataset. Moreover, 1,175 proteins expressed in the naïve midgut were identified by mass spectrometry confirming the high completeness of our transcriptome database, and 608 were significantly annotated for function and/or localization. This multiple-omics study vastly extends the publicly available DNA, RNA and protein databases for I. ricinus and ticks in general.
Project description:Samples from abortion material from sheep and cattle in Tanzania were subject to reverse-transcription PCR using pan-pestivirus primer sets covering the 5'UTR and Npro regions of the viral genome
Project description:Medicago truncatula endogenous small RNAs The dataset contains Medicago truncatula Gaertn. cv. Jemalong endogenous small RNA sequences in the range 18-28 nucleotides. High-throughput Solexa/Illumina sequencing was carried out at the Sainsbury Laboratory, Norwich, UK. Please see www.illumina.com for details of the technology. Small RNA sequences were mapped to Medicago truncatula genome release 2.0 (http://www.medicago.org/genome/), the number of matches to the unfinished genome, if any, is recorded in the Series supplementary file GSE13761_sequence_annotations.txt.gz.
Project description:We report the sequences bound to CENP-A in the dog genome (Canis familiaris) for high-throughput characterization of centromeric sequences. We compare these ChIPSeq reads (72 bp, single read) against a reference centromeric satellite DNA domain database for the dog genome, resulting in the annotation of sequence variation and estimated abundance of seven satellite families together with adjacent, non-satellite sequences. To study global patterns of sequence diversity and characterizing the subset of sequences correlated with centromere function, these sequences were evaluated relative to a comprehensive centromere sequence domain k-mer library. From this analysis, we identify functional sequence features from two satellite families (CarSat1 and CarSat2) that are defined by distinct arrays subtypes.