Project description:Identification of the transcription start sites (TSS) of genes is critical for understanding promoter regions and transcription initiation. One well characterized and widely used method to determine TSSs is CAGE (Cap Analysis of Gene Expression). CAGE has been commonly used in animal studies, yet in plants species, the precise TSS and core promoter landscape is not sufficient yet, no experiment has been taken on soybean to identify TSS neither. Soybean is one economically valuable species used for both food supply and oil. In this study, we present the main results of nanoCAGE-seq to reveal the genome-wide TSS conducted on shoot and root in soybean Williams 82.
Project description:Long non-coding RNAs (lncRNAs) are defined as non-protein-coding transcripts that are at least 200 nucleotides long. They are known to play pivotal roles in regulating gene expression, especially during stress responses in plants. We used a large collection of in-house transcriptome data from various soybean (Glycine max and Glycine soja) tissues treated under different conditions to perform a comprehensive identification of soybean lncRNAs. We also retrieved publicly available soybean transcriptome data that were of sufficient quality and sequencing depth to enrich our analysis. In total, RNA-seq data of 332 samples were used for this analysis. An integrated reference-based, de novo transcript assembly was developed that identified ~69,000 lncRNA gene loci. We showed that lncRNAs are distinct from both protein-coding transcripts and genomic background noise in terms of length, number of exons, transposable element composition, and sequence conservation level across legume species. The tissue-specific and time-specific transcriptional responses of the lncRNA genes under some stress conditions may suggest their biological relevance. The transcription start sites of lncRNA gene loci tend to be close to their nearest protein-coding genes, and they may be transcriptionally related to the protein-coding genes, particularly for antisense and intronic lncRNAs. A previously unreported subset of small peptide-coding transcripts was identified from these lncRNA loci via tandem mass spectrometry, which paved the way for investigating their functional roles. Our results also highlight the current inadequacy of the bioinformatic definition of lncRNA, which excludes those lncRNA gene loci with small open reading frames (ORFs) from being regarded as protein-coding.
2020-01-09 | PXD014553 | Pride
Project description:Identification of transcription start sites in soybean
Project description:This Soybean Floral Whorl Atlas is composed of RNA-Seq and ChIP-Seq data collected from individual floral whorls and floral developmental stages in Glycine max.
Project description:Hypocotyls of soybean (Glycine max) seedlings of cultivar Williams were inoculated with mycelia of the oomycete pathogen Phytophthora sojae grown in liquid V8 medium or the hypocotyls were mock inoculated. After 12 hours, the sites of inoculation were excised from the hypocotyls and frozen for RNA extraction. Phytophthora sojae mycelia used for inoculation was saved for RNA extraction also
Project description:The centromeric histone H3 (CENH3) is of great importance in centromere identification identity, chromosome mobilization during cell division and thus genome stability. The imbalance of CENH3 loading rate or dosage on parental centromeres often leads to uniparental chromosome elimination in the offspring. A body of studies on the function of CENH3 in genome stability have been reported in Arabidopsis, cotton, and many other monocots, but not in soybean (Glycine max), an important dicot crop. In our study, we identified the a single-copy functional CENH3 in soybean and found its role in genome stability and parent-of-origin effect caused by a conserved glycine site and parental genetic background. This study identified the functional GmCENH3 and would shed light on the future development of CENH3-based haploid induction system and centromere biology in soybean.
Project description:Aluminum (Al) toxicity is an important restraint to soybean (Glycine max L. Merr.) production on acid soils. However, little is known about the genes underlying Al tolerance in soybean. We used microarrays to detail the global programme of gene expression under control and Al stress in two soybean at 6, 12, and 24 h.
Project description:The nuclei of Glycine max from different tissues were collected. The samples were: soybean seed mid-maturation stage (10mm), seed late cotyledon stage (5mm), seed early cotyledon stage (3mm), seed heart stage (1mm), soybean green pods without seeds (stage), soybean flower bud (early flowering stage), soybean shoot apical meristem (stage), soybean trifoliate leaf (R5 stage), and soybean true leave (stage). The library construction was performed applying 10 Genomics technology.
Project description:Transcription start sites are the focal points of transcriptional regulation, where information from regulatory elements is integrated to stabilize initiation of transcription. In humans, most genes have more than one transcription start site, and these often exhibit different tissue specificity, serving as distinct regulatory frameworks for the same gene. Usage of such promoters can also result in differential gene function manifested on the protein level. Alternative promoter usage has been shown to be increased in several disease states, especially cancer. In this study, we have applied the nanoCAGE method to create a genome-wide map of TSS usage in sorted leukemic blasts from acute myeloid leukemia patients, and corresponding normal controls. We show that the nanoCAGE method can replace similar experiments made with microarrays in terms of expression, but also that it uniquely, allow for the identification of alternative promoter usage in cancer cells. We identify 2,162 putative promoters that are significantly differentially regulated between APL and controls. Interestingly, promoters whose usage is upregulated in APL have an increased propensity to be downstream alternative promoters, and conversely, the promoters producing the annotated longest gene variants are commonly downregulated in cancer. We show examples of genes with upregulated downstream promoters, and demonstrate protein domain loss that could contribute to leukemic induction and maintenance. In conclusion, we present the first genome wide promoterome study from rare purified human leukemic cells. nanoCAGE-seq of human acute myeloid leukemia samples vs. normal hematopoietic counterparts, both in replicates
Project description:Tropospheric ozone (O3) is a secondary air pollutant and anthropogenic greenhouse gas. Concentrations of tropospheric O3 have more than doubled since the Industrial Revolution, and are high enough to damage plant productivity. Soybean (Glycine max L. Merr.) is the worldâs most important legume crop and is sensitive to O3. Current ground-level O3 are estimated to reduce global soybean yields by 6% to 16%. In order to understand transcriptional mechanisms of yield loss in soybean, we examined the transcriptome of soybean flower and pod tissues exposed to elevated O3 using RNA-Sequencing.