Project description:ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used for identification of protein–DNA interactions in-vivo in the genome. In general, current algorithms for ChIP-seq reads employ artificial estimation of the average length of DNA fragments for peak finding, leading to uncertain prediction of DNA-protein binding sites. Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated from paired-end Solexa ChIP-Seq technology. SIPeS uses a dynamic baseline directly via ‘piling up’ the corresponding fragments defined by the paired reads to efficiently find peaks corresponding to binding sites. The performance of SIPeS is demonstrated by analyzing the ChIP-Seq data of the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS). The robustness of SIPeS was demonstrated in higher sensitivity and spatial resolution in peak finding compared to three existing peak detection algorithms. Keywords: transcription factors (protein-DNA interactions)
Project description:ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used for identification of proteinM-bM-^@M-^SDNA interactions in-vivo in the genome. In general, current algorithms for ChIP-seq reads employ artificial estimation of the average length of DNA fragments for peak finding, leading to uncertain prediction of DNA-protein binding sites. Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated from paired-end Solexa ChIP-Seq technology. SIPeS uses a dynamic baseline directly via M-bM-^@M-^Xpiling upM-bM-^@M-^Y the corresponding fragments defined by the paired reads to efficiently find peaks corresponding to binding sites. The performance of SIPeS is demonstrated by analyzing the ChIP-Seq data of the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS). The robustness of SIPeS was demonstrated in higher sensitivity and spatial resolution in peak finding compared to three existing peak detection algorithms. Keywords: transcription factors (protein-DNA interactions) Examination of protein-DNA interactions in buds of Arabidopsis anther cell
Project description:<p>We have developed FusionSeq to identify fusion transcripts from paired-end RNA-sequencing. FusionSeq includes filters to remove spurious candidate fusions with artifacts such as misalignments or random pairing of transcript fragments and it ranks candidates according to several statistics. It also has a module to identify exact sequences at breakpoint junctions. FusionSeq detected known and novel fusions in a specially sequenced calibration data set, including 8 cancers with and without known rearrangements.</p>