Unknown

Dataset Information

0

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters.


ABSTRACT: Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem.Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Colorado Richly Annotated Full-Text (CRAFT) Corpus. Over 1,000 parameter combinations are examined, and best-performing parameters for each system-ontology pair are presented.Baselines for concept recognition by three systems on eight biomedical ontologies are established (F-measures range from 0.14-0.83). Out of the three systems we tested, ConceptMapper is generally the best-performing system; it produces the highest F-measure of seven out of eight ontologies. Default parameters are not ideal for most systems on most ontologies; by changing parameters F-measure can be increased by up to 0.4. Not only are best performing parameters presented, but suggestions for choosing the best parameters based on ontology characteristics are presented.

SUBMITTER: Funk C 

PROVIDER: S-EPMC4015610 | biostudies-literature | 2014 Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Large-scale biomedical concept recognition: an evaluation of current automatic annotators and their parameters.

Funk Christopher C   Baumgartner William W   Garcia Benjamin B   Roeder Christophe C   Bada Michael M   Cohen K Bretonnel KB   Hunter Lawrence E LE   Verspoor Karin K  

BMC bioinformatics 20140226


<h4>Background</h4>Ontological concepts are useful for many different biomedical tasks. Concepts are difficult to recognize in text due to a disconnect between what is captured in an ontology and how the concepts are expressed in text. There are many recognizers for specific ontologies, but a general approach for concept recognition is an open problem.<h4>Results</h4>Three dictionary-based systems (MetaMap, NCBO Annotator, and ConceptMapper) are evaluated on eight biomedical ontologies in the Co  ...[more]

Similar Datasets

| S-EPMC7237030 | biostudies-literature
| S-EPMC3849280 | biostudies-literature
| S-EPMC4263461 | biostudies-literature
| S-EPMC2276192 | biostudies-other
| S-EPMC5018193 | biostudies-literature
| S-EPMC4343077 | biostudies-literature
| S-EPMC7090229 | biostudies-literature
| S-EPMC5705296 | biostudies-literature
| S-EPMC4366016 | biostudies-literature