Unknown

Dataset Information

0

Improving precision in concept normalization.


ABSTRACT: Most natural language processing applications exhibit a trade-off between precision and recall. In some use cases for natural language processing, there are reasons to prefer to tilt that trade-off toward high precision. Relying on the Zipfian distribution of false positive results, we describe a strategy for increasing precision, using a variety of both pre-processing and post-processing methods. They draw on both knowledge-based and frequentist approaches to modeling language. Based on an existing high-performance biomedical concept recognition pipeline and a previously published manually annotated corpus, we apply this hybrid rationalist/empiricist strategy to concept normalization for eight different ontologies. Which approaches did and did not improve precision varied widely between the ontologies.

SUBMITTER: Boguslav M 

PROVIDER: S-EPMC5730334 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Improving precision in concept normalization.

Boguslav Mayla M   Cohen K Bretonnel KB   Baumgartner William A WA   Hunter Lawrence E LE  

Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing 20180101


Most natural language processing applications exhibit a trade-off between precision and recall. In some use cases for natural language processing, there are reasons to prefer to tilt that trade-off toward high precision. Relying on the Zipfian distribution of false positive results, we describe a strategy for increasing precision, using a variety of both pre-processing and post-processing methods. They draw on both knowledge-based and frequentist approaches to modeling language. Based on an exis  ...[more]

Similar Datasets

| S-EPMC7415240 | biostudies-literature
| S-EPMC6435659 | biostudies-literature
| S-EPMC7666894 | biostudies-literature
| S-EPMC3316937 | biostudies-literature
| S-EPMC6774303 | biostudies-literature
| S-EPMC5597853 | biostudies-literature
| S-EPMC3951655 | biostudies-literature
| S-EPMC8570806 | biostudies-literature
| S-EPMC5753872 | biostudies-other
| S-EPMC5122615 | biostudies-literature