Unknown

Dataset Information

0

Precision annotation of digital samples in NCBI's gene expression omnibus.


ABSTRACT: The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional genomics experiments amassed over almost two decades. However, individual sample meta-data remains poorly described by unstructured free text attributes preventing its largescale reanalysis. We introduce the Search Tag Analyze Resource for GEO as a web application (http://STARGEO.org) to curate better annotations of sample phenotypes uniformly across different studies, and to use these sample annotations to define robust genomic signatures of disease pathology by meta-analysis. In this paper, we target a small group of biomedical graduate students to show rapid crowd-curation of precise sample annotations across all phenotypes, and we demonstrate the biological validity of these crowd-curated annotations for breast cancer. STARGEO.org makes GEO data findable, accessible, interoperable and reusable (i.e., FAIR) to ultimately facilitate knowledge discovery. Our work demonstrates the utility of crowd-curation and interpretation of open 'big data' under FAIR principles as a first step towards realizing an ideal paradigm of precision medicine.

SUBMITTER: Hadley D 

PROVIDER: S-EPMC5604135 | biostudies-literature | 2017 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications


The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional genomics experiments amassed over almost two decades. However, individual sample meta-data remains poorly described by unstructured free text attributes preventing its largescale reanalysis. We introduce the Search Tag Analyze Resource for GEO as a web application (http://STARGEO.org) to curate better annotations of sample phenotypes uniformly across different studies, and to use these sample annotat  ...[more]

Similar Datasets

| S-EPMC1619899 | biostudies-other
| S-EPMC4944384 | biostudies-literature
| S-EPMC1619900 | biostudies-literature
| S-EPMC6333964 | biostudies-literature
2012-01-01 | GSE27240 | GEO
| S-EPMC5643580 | biostudies-literature
| S-EPMC7336680 | biostudies-literature
| S-EPMC8061458 | biostudies-literature
| S-EPMC5052684 | biostudies-literature