Project description:We performed whole genome single nucleotide polymorphism (SNP) based analysis of all available Venezuelan equine encephalitis (VEE) virus antigenic complex genomes and developed a high resolution genome-wide SNP microarray. We used the SNP microarray to analyze a broad panel of VEEV isolates, found excellent concordance between array and sequence based genotypes for previously sequenced strains, and genotyped unsequenced strains.
Project description:The horse, like a majority of animal species, has a limited amount of species-specific expressed sequence data available in public databases. As a result, structural models for a majority of genes defined in the equine genome are predictions based on ab initio sequence analysis or the projection of gene structures from other mammalian species. The current study used Illumina-based sequencing of messenger RNA (RNA-seq) to help refine structural annotation of equine protein-coding genes and for a preliminary assessment of gene expression patterns. Sequencing of mRNA from eight equine tissues generated 293,758,105 thirty five-base sequence tags, equaling 10.28 giga-basepairs of total sequence data. The tag alignments represent approximately 208X coverage of the equine mRNA transcriptome and confirmed transcriptional activity for roughly 90% of the protein-coding gene structures predicted by Ensembl and NCBI. Tag coverage was sufficient to define structural annotation for 11,356 genes, while also identifying an additional 456 transcripts with exon/intron features that are not listed by either Ensembl or NCBI. Genomic locus data and intervals for the protein-coding genes predicted by the Ensembl and NCBI annotation pipelines were combined with 75,116 RNA-seq derived transcriptional units to generate a consensus equine protein-coding gene set of 20,302 defined loci. Gene ontology annotation was used to compare the functional and structural categories of genes expressed in either a tissue-restricted pattern or broadly across all tissue samples. Examination of 8 equine RNA samples representing 6 distinct tissues
Project description:The horse, like a majority of animal species, has a limited amount of species-specific expressed sequence data available in public databases. As a result, structural models for a majority of genes defined in the equine genome are predictions based on ab initio sequence analysis or the projection of gene structures from other mammalian species. The current study used Illumina-based sequencing of messenger RNA (RNA-seq) to help refine structural annotation of equine protein-coding genes and for a preliminary assessment of gene expression patterns. Sequencing of mRNA from eight equine tissues generated 293,758,105 thirty five-base sequence tags, equaling 10.28 giga-basepairs of total sequence data. The tag alignments represent approximately 208X coverage of the equine mRNA transcriptome and confirmed transcriptional activity for roughly 90% of the protein-coding gene structures predicted by Ensembl and NCBI. Tag coverage was sufficient to define structural annotation for 11,356 genes, while also identifying an additional 456 transcripts with exon/intron features that are not listed by either Ensembl or NCBI. Genomic locus data and intervals for the protein-coding genes predicted by the Ensembl and NCBI annotation pipelines were combined with 75,116 RNA-seq derived transcriptional units to generate a consensus equine protein-coding gene set of 20,302 defined loci. Gene ontology annotation was used to compare the functional and structural categories of genes expressed in either a tissue-restricted pattern or broadly across all tissue samples.
Project description:Sequencing of equine mRNA (RNA-seq) identified 428 putative transcripts which do not map to any previously annotated or predicted horse genes. Most of these encode the equine homologs of known protein-coding genes described in other species, yet the potential exists to identify novel and perhaps equine-specific gene structures. A set of 36 transcripts were prioritized for further study by filtering for levels of expression (depth of RNA-seq read coverage), distance from annotated features in the equine genome, the number of putative exons, and patterns of gene expression between tissues. From these, four were selected for further investigation based on predicted open reading frames of greater than or equal to 50 amino acids and lack of detectable homology to known genes across species. Sanger sequencing of RT-PCR amplicons from additional equine samples confirmed expression and structural annotation of each transcript. Functional predictions were made by conserved domain searches. A single transcript, expressed in the cerebellum, contains a putative kruppel-associated box (KRAB) domain, suggesting a potential function associated with zinc finger proteins and transcriptional regulation. Overall levels of conserved synteny and sequence conservation across a 1MB region surrounding each transcript were approximately 73% compared to the human, canine, and bovine genomes; however, the four loci display some areas of low conservation and sequence inversion in regions that immediately flank these previously unannotated equine transcripts. Taken together, the evidence suggests that these four transcripts are likely to be equine-specific.
Project description:Equine lameller tissues were collected to compare normal vs laminitis generated differences in transcriptom level. Keywords: Laminitis, Equine, Diseased foot
Project description:While a first draft of the equine genome is available and predictions are made regarding resulting genes and proteins, little is known about the actual transcriptome. So far, published expressed sequence tags (ESTs) from different horse tissues were generally rather short (?600bp) and hardly annotated, reflecting the problem that good cDNA libraries are very difficult to analyse. In this approach, we aimed to establish and analyse a normalised immune cell cDNA library (using freshly isolated and activated lymphocytes, NK cells, monocytes and DC). In particular, we wanted to test next generation sequencing combined with a series of bioinformatic approaches. The resulting cDNA library contained 2x107 clones of which 1056 were used for an initial Sanger sequencing and 4x106 for the deep sequencing analysis. Through the latter we obtained >29k sequences for which more than 5000 matches where found on the equine reference sequences. Additionally we could identify more than 3500 sequences which had matches on both - non-equine RNA sequences as well as the equine genome. In these we find both extensions of existing RefSeq models and novel mRNAs alike. Less than 2% of sequences did not have any match in the mentioned databases. 1 pooled set of samples from one animal analysed
Project description:Purpose:The goal of this study was to evalute gene expression patterns of equine chorioallantoic membrane during different stages of the pregnancy Method: mRNA profile of equine chorioallantoic membrane (CAM) from 45days, 4months, 6months and 10months (4 samples for each time points) generated by RNA-sequencing,using a Illumina HiSeq 4000 ( HiSeq 4000 sequencing kit version 1). The sequence reads were trimmed for adapters and quality using TrimGalore Version 0.4.4,and then mapped to EquCab2.0 using STAR-2.5.2b. Final quantification at the gen level was performed by analyzing the BAM files in cufflinks using the Equus_caballus_ENSEMBL_88 gtf file as Guide.