Project description:Small RNAs (21-24 nt) are pivotal regulators of gene expression that guide both transcriptional and post-transcriptional silencing mechanisms in diverse eukaryotes, including most if not all plants. MicroRNAs (miRNAs) and short interfering RNAs (siRNAs) are the two major types, both of which have a demonstrated and important role in plant development, stress responses and pathogen resistance. In this work, we used a deep sequencing approach (Sequencing-By-Synthesis, or SBS) to develop sequence resources of small RNAs from Musa acuminata tissues (including leaves, male flowers and immature fruit). The high depth of the resulting datasets enabled us to examine in detail critical small RNA features as size distribution, tissue-specific regulation and sequence conservation between different organs in this species. We also developed database resources and a dedicated website (http://smallrna.udel.edu/) with computational tools for allowing other users to identify new miRNAs or siRNAs involved in specific regulatory pathways, verify the degree of conservation of these sequences in other plant species and map small RNAs on genes or larger regions of the maize genome under study.
Project description:Post-harvested Musa acuminata banana species from a local banana plantation in the Philippines are the subject of this article. All banana tier samples used were pre-classified into four classes by a local expert. These four classifications are extra class, class I, class II, and reject. There are six images captured per banana tier sample from the six different views. Each captured image underwent a three-step image transformation to finely extract the RGB numerical values while the size measurement feature was gathered through manual measurement. The dataset presented in this article provides a brief differentiation of the different classes of banana tiers for commercial use through image processing. This dataset can be useful in establishing an advanced intelligent system in a non-invasive approach through machine and deep learning techniques.
Project description:Small RNAs (21-24 nt) are pivotal regulators of gene expression that guide both transcriptional and post-transcriptional silencing mechanisms in diverse eukaryotes, including most if not all plants. MicroRNAs (miRNAs) and short interfering RNAs (siRNAs) are the two major types, both of which have a demonstrated and important role in plant development, stress responses and pathogen resistance. In this work, we used a deep sequencing approach (Sequencing-By-Synthesis, or SBS) to develop sequence resources of small RNAs from Musa acuminata tissues (including leaves, male flowers and immature fruit). The high depth of the resulting datasets enabled us to examine in detail critical small RNA features as size distribution, tissue-specific regulation and sequence conservation between different organs in this species. We also developed database resources and a dedicated website (http://smallrna.udel.edu/) with computational tools for allowing other users to identify new miRNAs or siRNAs involved in specific regulatory pathways, verify the degree of conservation of these sequences in other plant species and map small RNAs on genes or larger regions of the maize genome under study. Small RNA libraries were derived from leaves, male flowers and immature fruit of Musa acuminata. Total RNA was isolated using the Plant RNA Purification Reagent (Invitrogen), and submitted to Illumina (Hayward, CA, http://www.illumina.com) for small RNA library construction using approaches described in (Lu et al., 2007) with minor modifications. The small RNA libraries were sequenced with the Sequencing-By-Synthesis (SBS) technology by Illumina. PERL scripts were designed to remove the adapter sequences and determine the abundance of each distinct small RNA. We thank Nicolas Roux for providing the plant material as well as Kan Nobuta and Gayathri Mahalingam for assistance with the computational methods.
Project description:BackgroundMusa species contain the fourth most important crop in developing countries. Here, we report the analysis of 6,252 BAC end-sequences, in order to view the sequence composition of the Musa acuminata genome in a cost effective and efficient manner.ResultsBAC end sequencing generated 6,252 reads representing 4,420,944 bp, including 2,979 clone pairs with an average read length after cleaning and filtering of 707 bp. All sequences have been submitted to GenBank, with the accession numbers DX451975-DX458350. The BAC end-sequences, were searched against several databases and significant homology was found to mitochondria and chloroplast (2.6%), transposons and repetitive sequences (36%) and proteins (11%). Functional interpretation of the protein matches was carried out by Gene Ontology assignments from matches to Arabidopsis and was shown to cover a broad range of categories. From protein matching regions of Musa BAC end-sequences, it was determined that the GC content of coding regions was 47%. Where protein matches encompassed a start codon, GC content as a function of position (5' to 3') across 129 bp sliding windows generates a "rice-like" gradient. A total of 352 potential SSR markers were discovered. The most abundant simple sequence repeats in four size categories were AT-rich. After filtering mitochondria and chloroplast matches, thousands of BAC end-sequences had a significant BLASTN match to the Oryza sativa and Arabidopsis genome sequence. Of these, a small number of BAC end-sequence pairs were shown to map to neighboring regions of the Oryza sativa genome representing regions of potential microsynteny.ConclusionDatabase searches with the BAC end-sequences and ab initio analysis identified those reads likely to contain transposons, repeat sequences, proteins and simple sequence repeats. Approximately 600 BAC end-sequences contained protein sequences that were not found in the existing available Musa expressed sequence tags, repeat or transposon databases. In addition, gene statistics, GC content and profile could also be estimated based on the region matching the top protein hit. A small number of BAC end pair sequences can be mapped to neighboring regions of the Oryza sativa representing regions of potential microsynteny. These results suggest that a large-scale BAC end sequencing strategy has the potential to anchor a small proportion of the genome of Musa acuminata to the genomes of Oryza sativa and possibly Arabidopsis.