Project description:Evaluation of short-read-only, long-read-only, and hybrid assembly approaches on metagenomic samples demonstrating how they affect gene and protein prediction which is relevant for downstream functional analyses. For a human gut microbiome sample, we use complementary metatranscriptomic, and metaproteomic data to evaluate the metagenomic-based protein predictions.
Project description:New tools for improved long-read transcript assembly and coalescence with its short-read counterpart are required. Using our short- and long-read measurements from different cell lines with spiked-in standards, we systematically compared key parameters and biases in the read alignment and assembly of transcripts. We report a cDNA synthesis artifact in long-read datasets that impacts the identity and quantitation of assembled transcripts. We developed a computational pipeline to strand long-read cDNA libraries that markedly improves assembly of transcripts from long-reads. Incorporating stranded long-reads in a new hybrid assembly approach, we demonstrate its efficacy for improved characterization of challenging lncRNA transcripts. Our workflow can be applied to a wide range of transcriptomics datasets for superior demarcation of transcript ends and refined isoform structure, which can enable better differential gene expression analyses and molecular manipulations of transcripts.
Project description:New tools for improved long-read transcript assembly and coalescence with its short-read counterpart are required. Using our short- and long-read measurements from different cell lines with spiked-in standards, we systematically compared key parameters and biases in the read alignment and assembly of transcripts. We report a cDNA synthesis artifact in long-read datasets that impacts the identity and quantitation of assembled transcripts. We developed a computational pipeline to strand long-read cDNA libraries that markedly improves assembly of transcripts from long-reads. Incorporating stranded long-reads in a new hybrid assembly approach, we demonstrate its efficacy for improved characterization of challenging lncRNA transcripts. Our workflow can be applied to a wide range of transcriptomics datasets for superior demarcation of transcript ends and refined isoform structure, which can enable better differential gene expression analyses and molecular manipulations of transcripts.
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:According to the key words, the gene set, including oxidation-reduction, RNA silence, disease resistance, phytohormone, phosphorylation, dephosphorylation, transcription factor, receptor, kinase, ubiquitination and RNA binding,etc. from sugarcane and the whole CDS sequence from smut genome, was achieved and used as targets in the present microarray assay. Based on smut infection samples from smut-susceptible sugarcane genotype YC71-374 and smut-resistant sugarcane genotype NCo376, the hybridization was conducted and validated by real-time fluorescent quantitative PCR. It would provide a basic data for the study on sugarcane-smut interaction mechanism, which referred to sugarcane smut resistance and smut pathogenesis.
Project description:Sugarcane (Saccharum hybrid, SP80-3280) was grown in the field in Araras (Brazil) for 9 months. Leaves +1 (F1), internodes 1&2 (I1), and internodes 5 (I5) were harvested every 2 h for 26 h, starting 2h before dawn.
Project description:We sequenced DNA from a bulk of Col x Ler F2 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. For nanopore sequencing of gDNA from 1,000 pooled seedlings, 10-day-old seedlings were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).
Project description:We present a long-read, single-molecule mapping technology that generates hybrid genetic/epigenetic profiles of native chromosomal DNA. The genome-wide distribution of 5-hmC in human peripheral blood cells correlates well with 5-hmC DNA immunoprecipitation (hMeDIP) sequencing. However, the long single-molecule read-length of 100 kbp-1 Mbp produces 5-hmC profiles across variable genomic regions that failed to show up in the sequencing data. In addition, optical 5-hmC mapping shows a strong correlation between the 5-hmC density in gene bodies and the corresponding level of gene expression.
Project description:We identified full-length complementary DNA by long-read RNA-seq (Iso-seq) in HBV-KMT2B integrated HCC sample. To compare the data of HCC sapmle with HBV-KMT2B integrated HPCs, which reproduces the sequence of the HCC sample, we performed long-read RNA-seq in HBV-KMT2B integrated HPCs, as well.