Project description:Purpose: The purpose of this experiment is to expand the repertoire of C. elegans edited transcripts and identify the roles of ADR-1 as indirect regulator of editing and ADR-2 as the only active deaminase in vivo. Methods: Strand-specific RNA sequencing of wild-type and adr mutant worms, followed by a novel RNA variant calling and comparative analysis pipeline. Results: Despite lacking deaminase function, ADR-1 affects editing of over 60 adenosines within the 3’ UTRs of 16 different mRNAs. Furthermore, ADR-1 interacts directly with ADR-2 substrates, even in the absence of ADR-2; and mutations within its dsRNA binding domains abolished both binding and editing regulation. Conclusions: ADR-1 acts as a major regulator of editing by binding ADR-2 substrates in vivo and raises the possibility that other dsRNA binding proteins, including the inactive human ADARs, regulate RNA editing by deaminase-independent mechanisms. Strand-specific RNA sequencing of wild-type and adr mutant worms, followed by a novel RNA variant calling and comparative analysis pipeline.
Project description:More than 2x10E9 sequences made on Illumina platform derived from the genome of E14 embryonic stem cells cultured in our laboratory were used to build a database of about 2.7x10E6 single nucleotide variant. The database was validated using other two sequencing datasets from other laboratory and high overlap was observed. The identified variant are enriched on intergenic regions, but several thousands reside on gene exons and regulatory regions, such as promoters, enhancers, splicing site and untranslated regions of RNA, thus indicating high probability of an important functional impact on the molecular biology of this cells. We created a new E14 genome assembly including the new identified variants and used it to map reads from next generation sequencing data generated in our laboratory or in others on E14 cell line. We observed an increase in the number of mapped reads of about 5%. CpG dinucleotide showed the higher variation frequency, probably because of it could be target of DNA methylation. We performed a reduced representation bisulfite sequencing on E14 cell line to test our new genome assembly with respect to the mm9 genome reference. After mapping and methylation status calling, we obtained an increase of about 120,000 called CpG and we avoided about 20,000 wrong CpG calling. genotyping of E14 embryonic stem cells (ESCs) and Reduced representation Bisulfite Sequencing (RRBS) of E14 ESCs.
Project description:Calling Cards is a platform technology to record a cumulative history of transient protein-DNA interactions in the genome of genetically targeted cell types. The record of these interactions are recovered by next generation sequencing. Compared to other genomic assays, whose readout provides a snapshot at the time of harvest, Calling Cards enables correlation of historical molecular states to eventual outcomes or phenotypes. To achieve this, Calling Cards uses the piggyBac transposase to insert self-reporting transposon (SRT) “Calling Cards” into the genome, leaving permanent marks at interaction sites. Calling Cards can be deployed in a variety of in vitro and in vivo biological systems to study gene regulatory networks involved in development, aging, and disease. Out of the box, it assesses enhancer usage but can be adapted to profile specific transcription factor binding with custom transcription factor (TF)-piggyBac fusion proteins. The Calling Cards workflow has five main stages: delivery of Calling Card reagents, sample preparation, library preparation, sequencing, and data analysis. Here, we first present a comprehensive guide for experimental design, reagent selection, and optional customization of the platform to study additional TFs. Then, we provide an updated protocol for the five steps, using reagents that improve throughput and decrease costs, including an overview of a newly deployed computational pipeline. This protocol is designed for users with basic molecular biology experience to process samples into sequencing libraries in 1-2 days. Familiarity with bioinformatic analysis and command line tools is required to set up the pipeline in a high performance computing environment and to conduct downstream analyses.
Project description:PROTEOFORMER is a pipeline for generating a search space of candidate translation products based on ribosome profiling data. The pipeline was published in 2014, but since then, several features have been added. For this project, the PROTEOFORMER pipeline 2.0 was tested on human HCT116 and Jurkat cell ribosome profiling data with all its new features and then compared to raw MS data of both cell lines, validating its use as a proteogenomic tool. In this study, results of 3 proteoform calling methods were combined (classical PROTEOFORMER proteoform calling, PRICE and SPECtre). Afterwards, results were eventually combined with the canonical or splicing-included version of human UniProt. The different described combinations were exported as a FASTA file and used as search space for searching matching MS data.
Project description:The study included 15 patients (7 males, 8 females) with JMML. Peripheral blood and/or bone marrow aspirates were collected on EDTA at diagnosis. Non-hematopoietic tissues (fibroblasts) was derived from skin biopsy for each patient. Exome sequencing was performed in several distinct series between 2012 and 2017, which explains the differences in capture kit versions and reference genome version.Targeted enrichment and massive parallel sequencing were performed on paired genomic DNA from leukocytes and fibroblasts. Exome capture was carried out using the SureSelect Human All Exon V4+UTRs or V5 or V5+UTRs or SureSelect Clinical Research (Agilent Technologies, Santa Clara, CA, USA) according to manufacturer’s instruction and protocols by IntegraGen (Evry, France). Paired-end 75 bases sequencing was performed on a HiSeq2000 or HiSeq4000 instrument (Illumina, San Diego, CA, USA). Image analysis and base calling were performed using the Real Time Analysis (RTA) pipeline v. 1.14 (Illumina) with default parameters. The alignment of paired-end reads to the reference human genome (UCSC GRCh37/hg19 or UCSC GRCh38), variant calling and generation of Quality variants scores were carried out using the CASAVA v.1.8 pipeline (Illumina).
Project description:The CRC DRAW study will assess the sensitivity and specificity of the blood-based, Next-Gen CRC Screening Test for the detection of CRC.