Unknown

Dataset Information

0

Natural Language Processing Techniques for Extracting and Categorizing Finding Measurements in Narrative Radiology Reports.


ABSTRACT:

Background

Accumulating quantitative outcome parameters may contribute to constructing a healthcare organization in which outcomes of clinical procedures are reproducible and predictable. In imaging studies, measurements are the principal category of quantitative para meters.

Objectives

The purpose of this work is to develop and evaluate two natural language processing engines that extract finding and organ measurements from narrative radiology reports and to categorize extracted measurements by their "temporality".

Methods

The measurement extraction engine is developed as a set of regular expressions. The engine was evaluated against a manually created ground truth. Automated categorization of measurement temporality is defined as a machine learning problem. A ground truth was manually developed based on a corpus of radiology reports. A maximum entropy model was created using features that characterize the measurement itself and its narrative context. The model was evaluated in a ten-fold cross validation protocol.

Results

The measurement extraction engine has precision 0.994 and recall 0.991. Accuracy of the measurement classification engine is 0.960.

Conclusions

The work contributes to machine understanding of radiology reports and may find application in software applications that process medical data.

SUBMITTER: Sevenster M 

PROVIDER: S-EPMC4586346 | biostudies-literature | 2015

REPOSITORIES: biostudies-literature

altmetric image

Publications

Natural Language Processing Techniques for Extracting and Categorizing Finding Measurements in Narrative Radiology Reports.

Sevenster M M   Buurman J J   Liu P P   Peters J F JF   Chang P J PJ  

Applied clinical informatics 20150930 3


<h4>Background</h4>Accumulating quantitative outcome parameters may contribute to constructing a healthcare organization in which outcomes of clinical procedures are reproducible and predictable. In imaging studies, measurements are the principal category of quantitative para meters.<h4>Objectives</h4>The purpose of this work is to develop and evaluate two natural language processing engines that extract finding and organ measurements from narrative radiology reports and to categorize extracted  ...[more]

Similar Datasets

| S-EPMC3811072 | biostudies-literature
| S-EPMC8176715 | biostudies-literature
| S-EPMC7686874 | biostudies-literature
| S-EPMC6659158 | biostudies-literature