<HashMap><database>biostudies-literature</database><scores/><additional><submitter>Hadley D</submitter><funding>NIAID NIH HHS</funding><funding>NLM NIH HHS</funding><funding>NCI NIH HHS</funding><funding>NIAMS NIH HHS</funding><funding>NIGMS NIH HHS</funding><pagination>170125</pagination><full_dataset_link>https://www.ebi.ac.uk/biostudies/studies/S-EPMC5604135</full_dataset_link><repository>biostudies-literature</repository><omics_type>Unknown</omics_type><volume>4</volume><pubmed_abstract>The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional genomics experiments amassed over almost two decades. However, individual sample meta-data remains poorly described by unstructured free text attributes preventing its largescale reanalysis. We introduce the Search Tag Analyze Resource for GEO as a web application (http://STARGEO.org) to curate better annotations of sample phenotypes uniformly across different studies, and to use these sample annotations to define robust genomic signatures of disease pathology by meta-analysis. In this paper, we target a small group of biomedical graduate students to show rapid crowd-curation of precise sample annotations across all phenotypes, and we demonstrate the biological validity of these crowd-curated annotations for breast cancer. STARGEO.org makes GEO data findable, accessible, interoperable and reusable (i.e., FAIR) to ultimately facilitate knowledge discovery. Our work demonstrates the utility of crowd-curation and interpretation of open 'big data' under FAIR principles as a first step towards realizing an ideal paradigm of precision medicine.</pubmed_abstract><journal>Scientific data</journal><pubmed_title>Precision annotation of digital samples in NCBI's gene expression omnibus.</pubmed_title><pmcid>PMC5604135</pmcid><funding_grant_id>R01 GM079719</funding_grant_id><funding_grant_id>P30 AR070155</funding_grant_id><funding_grant_id>U01 LM012675</funding_grant_id><funding_grant_id>HHSN272201200028C</funding_grant_id><funding_grant_id>UH2 CA203792</funding_grant_id><pubmed_authors>Spatz J</pubmed_authors><pubmed_authors>Panahiazar M</pubmed_authors><pubmed_authors>Bhattacharya S</pubmed_authors><pubmed_authors>Sirota M</pubmed_authors><pubmed_authors>Rayikanti BA</pubmed_authors><pubmed_authors>Chen B</pubmed_authors><pubmed_authors>Hadley D</pubmed_authors><pubmed_authors>Hadied MO</pubmed_authors><pubmed_authors>El-Sayed O</pubmed_authors><pubmed_authors>Paik H</pubmed_authors><pubmed_authors>Pan J</pubmed_authors><pubmed_authors>Aljabban J</pubmed_authors><pubmed_authors>Musen MA</pubmed_authors><pubmed_authors>Aljabban I</pubmed_authors><pubmed_authors>Raza S</pubmed_authors><pubmed_authors>Azad TD</pubmed_authors><pubmed_authors>Aran D</pubmed_authors><pubmed_authors>Butte AJ</pubmed_authors><pubmed_authors>Himmelstein D</pubmed_authors></additional><is_claimable>false</is_claimable><name>Precision annotation of digital samples in NCBI's gene expression omnibus.</name><description>The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional genomics experiments amassed over almost two decades. However, individual sample meta-data remains poorly described by unstructured free text attributes preventing its largescale reanalysis. We introduce the Search Tag Analyze Resource for GEO as a web application (http://STARGEO.org) to curate better annotations of sample phenotypes uniformly across different studies, and to use these sample annotations to define robust genomic signatures of disease pathology by meta-analysis. In this paper, we target a small group of biomedical graduate students to show rapid crowd-curation of precise sample annotations across all phenotypes, and we demonstrate the biological validity of these crowd-curated annotations for breast cancer. STARGEO.org makes GEO data findable, accessible, interoperable and reusable (i.e., FAIR) to ultimately facilitate knowledge discovery. Our work demonstrates the utility of crowd-curation and interpretation of open 'big data' under FAIR principles as a first step towards realizing an ideal paradigm of precision medicine.</description><dates><release>2017-01-01T00:00:00Z</release><publication>2017 Sep</publication><modification>2024-10-18T11:59:41.883Z</modification><creation>2019-03-27T02:56:46Z</creation></dates><accession>S-EPMC5604135</accession><cross_references><pubmed>28925997</pubmed><doi>10.1038/sdata.2017.125</doi></cross_references></HashMap>