ABSTRACT: Agilent whole exome hybridisation capture will be performed on genomic DNA derived from 50 Essential Thrombocythemia Myeloproliferative Disease samples and matched normal DNA from the same patients. Samples will be multiplexed in one lane of Illumina HiSeq. Sequencing to a coverage of at least 30x will be performed and mapped to build 37 of the human reference genome to facilitate the identification of novel cancer genes.
Project description:Agilent whole exome hybridisation capture will be performed on genomic DNA derived from ~3 Splenic Marginal Zone Lymphoma with villous lymphocytes samples and matched normal DNA from the same patients. Samples will be multiplexed in one lane of Illumina HiSeq. Sequencing to a coverage of at least 30x will be performed and mapped to build 37 of the human reference genome to facilitate the identification of novel cancer genes.
Project description:Reduced representation bisulfite sequencing (RRBS), which couples bisulfite conversion and next generation sequencing, is an innovative method that specifically enriches genomic regions with a high density of potential methylation sites and enables investigation of DNA methylation at single-nucleotide resolution. Recent advances in the Illumina DNA sample preparation protocol and sequencing technology have vastly improved sequencing throughput capacity. Although the new Illumina technology is now widely used, the unique challenges associated with multiplexed RRBS libraries on this platform have not been previously described. We have made modifications to the RRBS library preparation protocol to sequence multiplexed libraries on a single flow cell lane of the Illumina HiSeq 2000. Furthermore, our analysis incorporates a bioinformatics pipeline specifically designed to process bisulfite-converted sequencing reads and evaluate the output and quality of the sequencing data generated from the multiplexed libraries. We obtained an average of 42 million paired-end reads per sample for each flow-cell lane, with a high unique mapping efficiency to the reference human genome. Here we provide a roadmap of modifications, strategies, and trouble shooting approaches we implemented to optimize sequencing of multiplexed libraries on an a RRBS background.
Project description:Improving our ability to construct and functionally characterize DNA sequences would broadly accelerate progress in biology. Here, we introduce DropSynth, a scalable, low-cost method to build thousands of defined gene-length constructs in a pooled (multiplexed) manner. DropSynth uses a library of barcoded beads that pull down the oligonucleotides necessary for a gene's assembly, which are then processed and assembled in water-in-oil emulsions. We used DropSynth to successfully build more than 7000 synthetic genes that encode phylogenetically diverse homologs of two essential genes in Escherichia coli We tested the ability of phosphopantetheine adenylyltransferase homologs to complement a knockout E. coli strain in multiplex, revealing core functional motifs and reasons underlying homolog incompatibility. DropSynth coupled with multiplexed functional assays allows us to rationally explore sequence-function relationships at an unprecedented scale.
Project description:Sample index cross-talk can result in false positive calls when massively parallel sequencing (MPS) is used for sensitive applications such as low-frequency somatic variant discovery, ancient DNA investigations, microbial detection in human samples, or circulating cell-free tumor DNA (ctDNA) variant detection. Therefore, the limit-of-detection of an MPS assay is directly related to the degree of index cross-talk.Cross-talk rates up to 0.29% were observed when using standard, combinatorial adapters, resulting in 110,180 (0.1% cross-talk rate) or 1,121,074 (0.29% cross-talk rate) misassigned reads per lane in non-patterned and patterned Illumina flow cells, respectively. Here, we demonstrate that using unique, dual-matched indexed adapters dramatically reduces index cross-talk to ?1 misassigned reads per flow cell lane. While the current study was performed using dual-matched indices, using unique, dual-unrelated indices would also be an effective alternative.For sensitive downstream analyses, the use of combinatorial indices for multiplexed hybrid capture and sequencing is inappropriate, as it results in an unacceptable number of misassigned reads. Cross-talk can be virtually eliminated using dual-matched indexed adapters. These results suggest that use of such adapters is critical to reduce false positive rates in assays that aim to identify low allele frequency events, and strongly indicate that dual-matched adapters be implemented for all sensitive MPS applications.
Project description:Most DNA methylation studies in classic Philadelphia-negative myeloproliferative neoplasms have been performed on a gene-by-gene basis. Therefore, a more comprehensive methylation profiling is needed to study the implications of this epigenetic marker in myeloproliferative neoplasms. Here, we have analyzed 71 chronic (24 polycythemia vera, 23 essential thrombocythemia and 24 primary myelofibrosis) and 13 transformed myeloproliferative neoplasms using genome-wide DNA methylation arrays. The three types of chronic Philadelphia-negative myeloproliferative neoplasms showed a similar aberrant DNA methylation pattern when compared to control samples. Differentially methylated regions were enriched in a gene network centered on the NF-κB pathway, indicating that they may be involved in the pathogenesis of these diseases. In the case of transformed myeloproliferative neoplasms, we detected an increased number of differentially methylated regions with respect to chronic myeloproliferative neoplasms. Interestingly, these genes were enriched in a list of differentially methylated regions in primary acute myeloid leukemia and in a gene network centered around the IFN pathway. Our results suggest that alterations in the DNA methylation landscape play an important role in the pathogenesis and leukemic transformation of myeloproliferative neoplasms. The therapeutic modulation of epigenetically-deregulated pathways may allow us to design targeted therapies for these patients.
Project description:Multiplexing samples in sequencing experiments is a common approach to maximize information yield while minimizing cost. In most cases the number of samples that are multiplexed is determined by financial consideration or experimental convenience, with limited understanding on the effects on the experimental results. Here we set to examine the impact of multiplexing ChIP-seq experiments on the ability to identify a specific epigenetic modification. We performed peak detection analyses to determine the effects of multiplexing. These include false discovery rates, size, position and statistical significance of peak detection, and changes in gene annotation. We found that, for histone marker H3K4me3, one can multiplex up to 8 samples (7 IP + 1 input) at ~21 million single-end reads each and still detect over 90% of all peaks found when using a full lane for sample (~181 million reads). Furthermore, there are no variations introduced by indexing or lane batch effects and importantly there is no significant reduction in the number of genes with neighboring H3K4me3 peaks. We conclude that, for a well characterized antibody and, therefore, model IP condition, multiplexing 8 samples per lane is sufficient to capture most of the biological signal.
Project description:Second-generation sequencing is gradually becoming the method of choice for miRNA detection and expression profiling. Given the relatively small number of miRNAs and improvements in DNA sequencing technology, studying miRNA expression profiles of multiple samples in a single flow cell lane becomes feasible. Multiplexing strategies require marking each miRNA library with a DNA barcode. Here we report that barcodes introduced through adapter ligation confer significant bias on miRNA expression profiles. This bias is much higher than the expected Poisson noise and masks significant expression differences between miRNA libraries. This bias can be eliminated by adding barcodes during PCR amplification of libraries. The accuracy of miRNA expression measurement in multiplexed experiments becomes a function of sample number.
Project description:Agilent whole exome hybridisation capture will be performed on genomic DNA derived from FRCC patients. one lane of Illumina Hiseq sequencing will be performed on the resulting exome libraries and mapped to build 37 of the human reference genome to facilitate the identification of novel cancer genes.
Project description:Agilent whole exome hybridisation capture will be performed on genomic DNA derived from 50 Primary Myelofibrosis Myeloproliferative Disease samples and matched normal DNA from the same patients. Three lanes of Illumina GA sequencing will be performed on the resulting 100 exome libraries and mapped to build 37 of the human reference genome to facilitate the identification of novel cancer genes.
Project description:Agilent whole exome hybridisation capture will be performed on genomic DNA derived from 50 Polycythemia Vera Myeloproliferative Disease samples and matched normal DNA from the same patients. Three lanes of Illumina GA sequencing will be performed on the resulting 100 exome libraries and mapped to build 37 of the human reference genome to facilitate the identification of novel cancer genes.