Dataset Information

DSAP: deep-sequencing small RNA analysis pipeline.


ABSTRACT: DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.
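The first two steps described above (cleanup and clustering) can be illustrated with a minimal Python sketch. This is not DSAP's actual implementation, which the abstract does not detail. It assumes the input is one `sequence<TAB>count` pair per line, that cleanup means trimming a 3' adaptor and discarding tags made of a single repeated nucleotide, and that clustering means collapsing identical cleaned tags while summing their counts. The adaptor sequence, the minimum overlap, the minimum length, and all function names are illustrative assumptions.

```python
from collections import Counter

# Assumed 3' adaptor for illustration only; the real adaptor depends on the library prep.
ADAPTOR = "TCGTATGCCGTCTTCTGCTTG"


def parse_tags(text):
    """Parse 'sequence<TAB>count' lines into (sequence, count) pairs."""
    pairs = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        seq, count = line.split("\t")
        pairs.append((seq.upper(), int(count)))
    return pairs


def trim_adaptor(seq, adaptor=ADAPTOR, min_overlap=6):
    """Cut at a full adaptor match, else at a partial adaptor prefix on the 3' end."""
    idx = seq.find(adaptor)
    if idx != -1:
        return seq[:idx]
    # Try the longest partial match first, down to min_overlap bases (assumed threshold).
    for k in range(min(len(adaptor), len(seq)), min_overlap - 1, -1):
        if seq.endswith(adaptor[:k]):
            return seq[:-k]
    return seq


def is_homopolymer(seq):
    """True for empty tags and for poly-A/T/C/G/N tags (one repeated nucleotide)."""
    return len(set(seq)) <= 1


def cleanup_and_cluster(pairs, min_len=15):
    """Trim adaptors, drop short or homopolymer tags, and sum counts per unique tag."""
    clusters = Counter()
    for seq, count in pairs:
        trimmed = trim_adaptor(seq)
        if len(trimmed) < min_len or is_homopolymer(trimmed):
            continue
        clusters[trimmed] += count
    return dict(clusters)


if __name__ == "__main__":
    example = (
        "TGAGGTAGTAGGTTGTATAGTTTCGTATGCCGTCTTCTGCTTG\t10\n"  # tag + full adaptor
        "TGAGGTAGTAGGTTGTATAGTTTCGTATGCC\t5\n"               # tag + partial adaptor
        "AAAAAAAAAAAAAAAAAAAAAA\t7\n"                         # poly-A tag, discarded
    )
    print(cleanup_and_cluster(parse_tags(example)))
```

In this sketch the two adaptor-bearing reads collapse into one cluster whose count is the sum of both, which is the kind of unique-sequence table the later ncRNA and miRNA matching steps would take as input.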

SUBMITTER: Huang PJ 

PROVIDER: S-EPMC2896168 | biostudies-literature | 2010 Jul

REPOSITORIES: biostudies-literature

Publications

DSAP: deep-sequencing small RNA analysis pipeline.

Huang Po-Jung, Liu Yi-Chung, Lee Chi-Ching, Lin Wei-Chen, Gan Richie Ruei-Chi, Lyu Ping-Chiang, Tang Petrus

Nucleic Acids Research, 2010-05-16, Web Server issue


Similar Datasets

S-EPMC4481843 | biostudies-other
S-EPMC5716150 | biostudies-literature
S-EPMC4103589 | biostudies-literature
S-EPMC3919602 | biostudies-literature
S-EPMC4228501 | biostudies-literature
S-EPMC3287988 | biostudies-literature
S-EPMC3467745 | biostudies-literature
S-EPMC2885365 | biostudies-literature
S-EPMC4811048 | biostudies-literature
S-EPMC3439898 | biostudies-other