Unknown

Dataset Information

0

Genome Modeling System: A Knowledge Management Platform for Genomics.


ABSTRACT: In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms.

SUBMITTER: Griffith M 

PROVIDER: S-EPMC4497734 | biostudies-literature | 2015 Jul

REPOSITORIES: biostudies-literature

altmetric image

Publications

Genome Modeling System: A Knowledge Management Platform for Genomics.

Griffith Malachi M   Griffith Obi L OL   Smith Scott M SM   Ramu Avinash A   Callaway Matthew B MB   Brummett Anthony M AM   Kiwala Michael J MJ   Coffman Adam C AC   Regier Allison A AA   Oberkfell Ben J BJ   Sanderson Gabriel E GE   Mooney Thomas P TP   Nutter Nathaniel G NG   Belter Edward A EA   Du Feiyu F   Long Robert L RL   Abbott Travis E TE   Ferguson Ian T IT   Morton David L DL   Burnett Mark M MM   Weible James V JV   Peck Joshua B JB   Dukes Adam A   McMichael Joshua F JF   Lolofie Justin T JT   Derickson Brian R BR   Hundal Jasreet J   Skidmore Zachary L ZL   Ainscough Benjamin J BJ   Dees Nathan D ND   Schierding William S WS   Kandoth Cyriac C   Kim Kyung H KH   Lu Charles C   Harris Christopher C CC   Maher Nicole N   Maher Christopher A CA   Magrini Vincent J VJ   Abbott Benjamin S BS   Chen Ken K   Clark Eric E   Das Indraniel I   Fan Xian X   Hawkins Amy E AE   Hepler Todd G TG   Wylie Todd N TN   Leonard Shawn M SM   Schroeder William E WE   Shi Xiaoqi X   Carmichael Lynn K LK   Weil Matthew R MR   Wohlstadter Richard W RW   Stiehr Gary G   McLellan Michael D MD   Pohl Craig S CS   Miller Christopher A CA   Koboldt Daniel C DC   Walker Jason R JR   Eldred James M JM   Larson David E DE   Dooling David J DJ   Ding Li L   Mardis Elaine R ER   Wilson Richard K RK  

PLoS computational biology 20150709 7


In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within i  ...[more]

Similar Datasets

| S-EPMC7145631 | biostudies-literature
| S-EPMC9795662 | biostudies-literature
| S-EPMC7297266 | biostudies-literature
| S-EPMC11366637 | biostudies-literature
| 2378933 | ecrin-mdr-crc
| S-EPMC2790312 | biostudies-literature
| S-EPMC261899 | biostudies-literature
| S-EPMC5774086 | biostudies-literature
| S-EPMC2744709 | biostudies-literature
| S-EPMC9349046 | biostudies-literature