Unknown

Dataset Information

0

The seeker R package: simplified fetching and processing of transcriptome data.


ABSTRACT: Transcriptome data have become invaluable for interrogating biological systems. Preparing a transcriptome dataset for analysis, particularly an RNA-seq dataset, entails multiple steps and software programs, each with its own command-line interface (CLI). Although these CLIs are powerful, they often require shell scripting for automation and parallelization, which can have a high learning curve, especially when the details of the CLIs vary from one tool to another. However, many individuals working with transcriptome data are already familiar with R due to the plethora and popularity of R-based tools for analyzing biological data. Thus, we developed an R package called seeker for simplified fetching and processing of RNA-seq and microarray data. Seeker is a wrapper around various existing tools, and provides a standard interface, simple parallelization, and detailed logging. Seeker's primary output-sample metadata and gene expression values based on Entrez or Ensembl Gene IDs-can be directly plugged into a differential expression analysis. To maximize reproducibility, seeker is available as a standalone R package and in a Docker image that includes all dependencies, both of which are accessible at https://seeker.hugheylab.org.

SUBMITTER: Schoenbachler JL 

PROVIDER: S-EPMC9648347 | biostudies-literature | 2022

REPOSITORIES: biostudies-literature

altmetric image

Publications

The seeker R package: simplified fetching and processing of transcriptome data.

Schoenbachler Joshua L JL   Hughey Jacob J JJ  

PeerJ 20221107


Transcriptome data have become invaluable for interrogating biological systems. Preparing a transcriptome dataset for analysis, particularly an RNA-seq dataset, entails multiple steps and software programs, each with its own command-line interface (CLI). Although these CLIs are powerful, they often require shell scripting for automation and parallelization, which can have a high learning curve, especially when the details of the CLIs vary from one tool to another. However, many individuals worki  ...[more]

Similar Datasets

| S-EPMC10064599 | biostudies-literature
| S-EPMC9513427 | biostudies-literature
| S-EPMC8963298 | biostudies-literature
| S-EPMC11226021 | biostudies-literature
| S-EPMC7544668 | biostudies-literature
| S-EPMC5870549 | biostudies-literature
| S-EPMC8398219 | biostudies-literature
| S-EPMC4432635 | biostudies-literature
| S-EPMC11767891 | biostudies-literature
| S-EPMC6137409 | biostudies-literature