Project description:We present a draft genome assembly that includes 200 Gb of Illumina reads, 4 Gb of Moleculo synthetic long reads and 108 Gb of Chicago libraries, with a final size matching the estimated genome size of 2.7 Gb and a scaffold N50 of 4.8 Mb. We also present an alternative assembly that includes 27 Gb of raw reads generated on the Pacific Biosciences platform. In addition, we sequenced the proteome of the same individual and RNA from three different tissue types from three other squid species (Onychoteuthis banksii, Dosidicus gigas, and Sthenoteuthis oualaniensis) to assist genome annotation. We annotated 33,406 protein-coding genes supported by evidence, and genome completeness estimated by BUSCO reached 92%. Repetitive regions cover 49.17% of the genome.
Project description:Purpose: The goal of this study is to compare endothelial small RNA transcriptomes in order to identify the target of OASL under basal or stimulated conditions using miRNA-seq. Methods: Endothelial miRNA profiles of siCTL- or siOASL-transfected HUVECs were generated by Illumina sequencing, in duplicate. After sequencing, the raw sequence reads were filtered based on quality, and adapter sequences were trimmed. After rRNA removal, the reads were aligned to the reference genome (GRCh38), and miRNA prediction was performed with miRDeep2. Results: We identified known miRNAs (miRDeep2) in HUVECs transfected with siCTL or siOASL. The expression profile of mature miRNAs was used to analyze differentially expressed miRNAs (DE miRNAs). Conclusions: Our study represents the first analysis of endothelial miRNA profiles affected by OASL knockdown with biological replicates.
Project description:A wheat × T. timopheevii pre-breeding population was analyzed using genotyping-by-sequencing (GBS) combined with a skim-seq pipeline to identify and characterize T. timopheevii introgressions. Read coverage analysis based on a combined T. aestivum–T. timopheevii reference genome enabled high-resolution detection of major chromosomal introgressions and copy-number changes. Sequencing reads were aligned to this combined assembly, from which chromosome identity and physical position could be extracted. An "in silico wheat × T. timopheevii hybrid" reference genome was constructed by combining the reference sequences of the donor and the recipient species. To identify wheat–T. timopheevii introgressions, we combined the Chinese Spring reference genome (IWGSC RefSeq v1.0) (IWGSC, 2018) with the draft genome assembly of T. timopheevii (GCA_963921465.1) (Grewal et al., 2024). During the assembly process, unique identifiers were assigned to all chromosomes or pseudomolecules to keep them distinct. Prior to alignment, the Illumina short reads from 42 lines, along with the previously described control genotypes, were demultiplexed and adapter-trimmed with Stacks v2.68 (Rochette et al., 2019). The processed paired-end reads were then mapped separately to the combined reference genome using HISAT2 v2.1.0 (Kim et al., 2019) with the --no-spliced-alignment and --no-unal parameters. Following alignment, concordant unique reads were retrieved by filtering the sequence alignment map (SAM) outputs for the YT:Z:CP and NH:i:1 tags.
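The final filtering step could be sketched as follows. This is a minimal illustration, not the pipeline used for this dataset: the function names `is_concordant_unique` and `filter_sam` are hypothetical, and the sketch assumes plain-text, tab-delimited SAM records in which the optional tags begin at column 12.

```python
def is_concordant_unique(sam_line):
    """Return True if a SAM alignment record carries both YT:Z:CP and NH:i:1."""
    fields = sam_line.rstrip("\n").split("\t")
    # The first 11 columns of a SAM record are mandatory; optional tags follow.
    tags = set(fields[11:])
    return "YT:Z:CP" in tags and "NH:i:1" in tags


def filter_sam(lines):
    """Yield header lines unchanged, plus alignments that pass the tag filter."""
    for line in lines:
        # Header lines start with "@" and are kept so the output stays valid SAM.
        if line.startswith("@") or is_concordant_unique(line):
            yield line
```

Matching whole tags rather than substrings avoids accidentally keeping a record such as `NH:i:10`, which a plain text search for `NH:i:1` would also hit.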
Project description:We report the first use of next-generation massively parallel sequencing and de novo transcriptome assembly to gain insight into the transcriptome of Hevea brasiliensis. Sequencing generated more than 12 million reads with an average length of 90 nt. In total, 48,768 unigenes (mean size = 488 bp) were assembled de novo, more than three times the number of Hevea brasiliensis sequences deposited in GenBank. Assembled sequences were annotated with gene descriptions, gene ontology terms and clusters of orthologous groups terms. In total, 37,373 unigenes were successfully annotated, and more than 10% of unigenes aligned to known proteins of Euphorbiaceae. The unigenes contain a nearly complete collection of known rubber-synthesis-related genes. Our data provide the most comprehensive sequence resource available for the study of the rubber tree and demonstrate the feasibility of Illumina sequencing and de novo transcriptome assembly in a species lacking genome information. The transcriptome of latex and leaf in Hevea brasiliensis
Project description:Nitrate-reducing iron(II)-oxidizing bacteria are widespread in the environment, contribute to nitrate removal, and influence the fate of the greenhouse gases nitrous oxide and carbon dioxide. The autotrophic growth of nitrate-reducing iron(II)-oxidizing bacteria is rarely investigated and poorly understood. The most prominent model system for this type of study is enrichment culture KS, which originates from a freshwater sediment in Bremen, Germany. To gain insights into the metabolism of nitrate reduction coupled to iron(II) oxidation in the absence of organic carbon and under oxygen-limited conditions, we performed metagenomic, metatranscriptomic and metaproteomic analyses of culture KS. Raw sequencing data from 16S rRNA amplicon sequencing and shotgun metagenomics (short reads: Illumina; long reads: Oxford Nanopore Technologies), the metagenome assembly, and raw sequencing data from shotgun metatranscriptomes (2 conditions, triplicates) can be found at SRA under https://www.ncbi.nlm.nih.gov/bioproject/PRJNA682552. This dataset contains proteomics data for 2 conditions (heterotrophic and autotrophic growth) in triplicate.
Project description:Here, we performed deep transcriptome sequencing of the aerial tissues and the roots of S. japonica, generating over 2 billion raw reads with an average length of 101 nt using Illumina paired-end sequencing on the HiSeq 2000 platform. Using a combined approach of three popular assemblers, a de novo transcriptome assembly for S. japonica was obtained, yielding 81,729 unigenes with an average length of 884 bp and an N50 of 1,452 bp, of which 46,963 unigenes were annotated based on sequence similarity against the NCBI nr protein database.
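For readers unfamiliar with the N50 statistic quoted for this assembly, the sketch below shows one common way to compute it: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the summed length. The record does not say which tool or exact definition was used, so the function `n50` is a hypothetical helper for illustration only.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumed definition: the length L such that sequences of length >= L
    together account for at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    # Only reached when no lengths were supplied.
    raise ValueError("lengths must not be empty")
```

For example, for lengths 2 through 10 the total is 54, and the three longest sequences (10, 9, 8) already sum to 27, so the N50 is 8.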