Project description:The goal of this study, started as a part of the modENCODE project, is to detect and characterize previously unannotated transcripts of the C. elegans genome. This dataset has been imported from the Sequence Read Archive and curated by the WormBase and ArrayExpress teams.
Project description:In order to evaluate the performance of CNV detection in next-generation sequencing platform in varied sample types, we employed chromosomal microarray analysis (CMA) for validation of the samples with NGS-based detection results (NCBI Sequence Read Archive with accession number SRA296708). Besides array Comparative Genomics Hybridization (aCGH, Agilent) , we used a commerical SNP-array (Illumina) including early abortus, induced termination, prenatal samples and postnatal samples. CMA results were compared with NGS-based detection results. 100% consistency was obtained between NGS-based approach and CMA in pathogenic or likely pathogenic CNVs detection.
Project description:To unravel the fine architecture of neocentromeres found in three well-differentiated liposarcoma (WDLPS) cell lines as patchworks of multiple short amplified sequences, disclosing a much more higher complexity than previously reported. Next generation sequencing data (WGS, RNA-seq, CENP-A/ChIP-seq) are available at the Sequence Read Archive (BioProject ID: PRJNA378952).
Project description:In order to evaluate the performance of CNV detection in next-generation sequencing platform in varied sample types, we employed chromosomal microarray analysis (CMA) for validation of the samples with NGS-based detection results (NCBI Sequence Read Archive with accession number SRA296708). Besides snp-array, we used a customized array Comparative Genomics Hybridization (aCGH, Agilent) approach for a cohort of clinical samples including early abortus, induced termination, prenatal samples and postnatal samples. CMA results were compared with NGS-based detection results. 100% consistency was obtained between NGS-based approach and CMA in pathogenic or likely pathogenic CNVs detection.
Project description:In order to elucidate the general rules for gene localization and regulation mediated by CpG islands, we reanalyzed published ChIP-seq data of CXXC domain, H3K9me3, KDM2A, SUV39H1, ATF4, MYBL1, MYOD1, SPI1, and CTCF. Raw data were downloaded from Sequence Read Archive (SRA) in National Center for Biotechnology Information (NCBI) database. FASTQ files were extracted with the SRA Toolkit version 2.5.5 and aligned using Bowtie 2.2.5 onto the mouse and human genome (mm9 and hg19, respectively). For the identification of factor binding sites, model-based analysis for ChIP-seq peak caller (MACS 1.4.2) was used with a p-value cutoff of 1e-5.
Project description:The goal of this study, started as a part of the modENCODE project, is to detect and characterize previously unannotated transcripts of the C. elegans genome. This dataset has been imported from the Sequence Read Archive and curated by the WormBase and ArrayExpress teams. It is a subset of study E-MTAB-2683 (SRP00401) and only contains assays on worms not infected by pathogens.
Project description:We reanalyzed published RNA-seq data to study 1) the genomic landscapes near surrounding regions of transcriptional start sites with regard to the gene expression activities and 2) the gene expression change upon transcription factor (MYBL1, ATF4) depletion. Raw data were downloaded from Sequence Read Archive (SRA) in National Center for Biotechnology Information (NCBI) database. FASTQ files were extracted with the SRA Toolkit version 2.5.5 and aligned using STAR 2.4.2 onto the mouse and human genome (mm9 and hg19, respectively). Gene expression was calculated as RPKM values using rpkmforgenes.py (Ramsköld et al., 2009).