Dataset Information

BayGO: Bayesian analysis of ontology term enrichment in microarray data.


ABSTRACT: BACKGROUND: The search for enriched (also called over-represented or enhanced) ontology terms in a list of genes obtained from microarray experiments is becoming a standard procedure for system-level analysis. This procedure summarizes the information by focusing on classification schemes such as Gene Ontology, KEGG pathways, and so on, instead of on individual genes. Although it is well known in statistics that association and significance are distinct concepts, only the latter approach has been used to deal with the ontology term enrichment problem. RESULTS: BayGO implements a Bayesian approach to search for enriched terms from microarray data. The R source code is freely available at http://blasto.iq.usp.br/~tkoide/BayGO in three versions: Linux, which can be easily incorporated into pre-existing pipelines; Windows, to be controlled interactively; and a web tool. The software was validated using a bacterial heat shock response dataset, since this stress triggers known system-level responses. CONCLUSION: The Bayesian model accounts for the fact that not all the genes from a given category may be observable in microarray data, owing to low-intensity signal, quality filters, genes that were not spotted, and so on. Moreover, BayGO allows one to measure the statistical association between generic ontology terms and differential expression, instead of working only with the common significance analysis.
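The distinction the abstract draws between significance and association can be illustrated with a small sketch. The Python below is not the BayGO model and does not reproduce its R code; it is a simplified stand-in under stated assumptions. The function names, the Beta(1, 1) priors, and the example counts are all hypothetical. It contrasts a conventional hypergeometric significance test with a Monte Carlo estimate of the posterior probability that genes annotated with a term are differentially expressed more often than other genes. Unlike BayGO, the sketch simply restricts the counts to genes observed on the array and does not model unobserved genes.

```python
import math
import random


def hypergeom_sf(k, N, K, n):
    """Significance view: P(X >= k) when n differentially expressed genes
    are drawn from N observed genes, K of which carry the ontology term."""
    upper = min(K, n)
    total = math.comb(N, n)
    tail = sum(math.comb(K, i) * math.comb(N - K, n - i)
               for i in range(k, upper + 1))
    return tail / total


def posterior_prob_enriched(k, N, K, n, draws=20000, seed=0):
    """Association view (illustrative assumption, not BayGO's model):
    Monte Carlo estimate of P(p_in > p_out | data) with Beta(1, 1) priors,
    where p_in / p_out are the probabilities of differential expression
    for genes inside / outside the term."""
    rng = random.Random(seed)
    a_in, b_in = 1 + k, 1 + (K - k)
    a_out, b_out = 1 + (n - k), 1 + (N - K) - (n - k)
    hits = sum(
        rng.betavariate(a_in, b_in) > rng.betavariate(a_out, b_out)
        for _ in range(draws)
    )
    return hits / draws


if __name__ == "__main__":
    # Hypothetical counts: 1000 observed genes, 40 in the term,
    # 100 differentially expressed, 12 of those in the term.
    N, K, n, k = 1000, 40, 100, 12
    print("hypergeometric p-value:", hypergeom_sf(k, N, K, n))
    print("posterior P(p_in > p_out):", posterior_prob_enriched(k, N, K, n))
```

The first function answers "how surprising is this overlap under the null?", while the second gives a direct probability statement about the strength of the term's association with differential expression, which is closer in spirit to the quantity the abstract says BayGO reports.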

SUBMITTER: Vencio RZ 

PROVIDER: S-EPMC1440873 | biostudies-literature | 2006 Feb

REPOSITORIES: biostudies-literature

Publications

BayGO: Bayesian analysis of ontology term enrichment in microarray data.

Ricardo Z. N. Vêncio, Tie Koide, Suely L. Gomes, Carlos A. de B. Pereira

BMC Bioinformatics, 2006-02-23



Similar Datasets

| S-EPMC4103872 | biostudies-literature
| S-EPMC2703939 | biostudies-literature
| S-EPMC6541037 | biostudies-literature
| S-EPMC1851021 | biostudies-literature
| S-EPMC5079577 | biostudies-literature
| S-EPMC3392036 | biostudies-literature
| S-EPMC1382258 | biostudies-literature
| S-EPMC6760551 | biostudies-literature
| S-EPMC2736174 | biostudies-literature
| S-EPMC2474592 | biostudies-literature