Project description:Adenovirus is a common human pathogen that relies on host cell processes for transcription and processing of viral RNA and protein production. Although adenoviral promoters, splice junctions, and cleavage and polyadenylation sites have been characterized using low-throughput biochemical techniques or short read cDNA-based sequencing, these technologies do not fully capture the complexity of the adenoviral transcriptome. By combining Illumina short-read and nanopore long-read direct RNA sequencing approaches, we mapped transcription start sites and cleavage and polyadenylation sites across the adenovirus genome. In addition to confirming the known canonical viral early and late RNA cassettes, our analysis of splice junctions within long RNA reads revealed an additional 35 novel viral transcripts. These RNAs include fourteen new splice junctions which lead to expression of canonical open reading frames (ORF), six novel ORF-containing transcripts, and fifteen transcripts encoding for messages that potentially alter protein functions through truncations or fusion of canonical ORFs. In addition, we also detect RNAs that bypass canonical cleavage sites and generate potential chimeric proteins by linking separate gene transcription units. Of these, an evolutionary conserved protein was detected containing the N-terminus of E4orf6 fused to the downstream DBP/E2A ORF. Loss of this novel protein, E4orf6/DBP, was associated with aberrant viral replication center morphology and poor viral spread. Our work highlights how long-read sequencing technologies can reveal further complexity within viral transcriptomes.
Project description:a chromosome-level nuclear genome and organelle genomes of the alpine snow alga Chloromonas typhlos were sequenced and assembled by integrating short- and long-read sequencing and proteogenomic strategy
Project description:Objectives: To perform long-read transcriptome and proteome profiling of pathogen-stimulated peripheral blood mononuclear cells (PBMCs) from healthy donors. We aim to discover new transcripts and protein isoforms expressed during immune responses to diverse pathogens. Methods: PBMCs were exposed to four microbial stimuli for 24 hours: the TLR4 ligand lipopolysaccharide (LPS), the TLR3 ligand Poly(I:C), heat-inactivated Staphylococcus aureus, Candida albicans, and RPMI medium as negative controls. Long-read sequencing (PacBio) of one donor and secretome proteomics and short-read sequencing of five donors were performed. IsoQuant was used for transcriptome construction, Metamorpheus/FlashLFQ for proteome analysis, and Illumina short-read 3’-end mRNA sequencing for transcript quantification. Results: Long-read transcriptome profiling reveals the expression of novel sequences and isoform switching induced upon pathogen stimulation, including transcripts that are difficult to detect using traditional short-read sequencing. We observe widespread loss of intron retention as a common result of all pathogen stimulations. We highlight novel transcripts of NFKB1 and CASP1 that may indicate novel immunological mechanisms. In general, RNA expression differences did not result in differences in the amounts of secreted proteins. Interindividual differences in the proteome were larger than the differences between stimulated and unstimulated PBMCs. Clustering analysis of secreted proteins revealed a correlation between chemokine (receptor) expression on the RNA and protein levels in C. albicans- and Poly(I:C)-stimulated PBMCs. Conclusion: Isoform aware long-read sequencing of pathogen-stimulated immune cells highlights the potential of these methods to identify novel transcripts, revealing a more complex transcriptome landscape than previously appreciated.
Project description:Transposon insertion site sequencing (TIS) is a powerful method for associating genotype to phenotype. However, all TIS methods described to date use short nucleotide sequence reads which cannot uniquely determine the locations of transposon insertions within repeating genomic sequences where the repeat units are longer than the sequence read length. To overcome this limitation, we have developed a TIS method using Oxford Nanopore sequencing technology that generates and uses long nucleotide sequence reads; we have called this method LoRTIS (Long Read Transposon Insertion-site Sequencing). This experiment data contains sequence files generated using Nanopore and Illumina platforms. Biotin1308.fastq.gz and Biotin2508.fastq.gz are fastq files generated from nanopore technology. Rep1-Tn.fastq.gz and Rep1-Tn.fastq.gz are fastq files generated using Illumina platform. In this study, we have compared the efficiency of two methods in identification of transposon insertion sites.
Project description:To identify aberrant splicing isoforms and potential neoantigens, we performed full-length cDNA sequencing of lung adenocarcinoma cell lines using a long-read sequencer MinION. We constructed a comprehensive catalog of aberrant splicing isoforms and detected isoform-specific peptides using proteome analysis.
Project description:We analyzed the chromatin accessibility and nucleosome positioning by ATAC-seq of both null and transgene-expressing strains of Komagataella phaffii (Pichia pastoris) under different growth conditions. These data enabled identification of the features that determine performance of various integration sites for transgene expression. Understanding chromatin accessibility and nucleosome positioning can provide further clarity into gene regulation and expression broadly in this organism.
Project description:We sequenced DNA from the leaves of ten Col x Ler F1 hybrid plants (WT and recq4) using Nanopore long-read sequencing and identified crossover sites with COmapper. These data were used as a negative control for COmapper, as no crossover sites were expected to be detected. For nanopore sequencing of gDNA from leaves, leaves from 10 5-week-old plants were ground in liquid nitrogen using a mortar and pestle. The ground tissue was resuspended in four volumes of CTAB buffer (1% [w/v] CTAB, 50 mM Tris-HCl pH 8.0, 0.7 M NaCl, 10 mM EDTA) and incubated at 65°C for 30 min. Following chloroform extraction, isopropanol precipitation and removal of RNAs as above, the gDNA pellet was resuspended in 150 μl TE (10 mM Tris-HCl pH 8.0, 0.1 mM EDTA) buffer and gDNA was quantified using a Qubit dsDNA Broad Range assay kit (Thermo Fisher, Q32853). Nine micrograms of gDNA from pollen or seedlings was used to construct a nanopore long-read sequencing library using a Ligation Sequencing Kit V14 (Nanopore, SQK-LSK114). The libraries were sequenced using a PromethION platform (BGI, Hong Kong).