Unknown

Dataset Information

0

Hypertension identification using inpatient clinical notes from electronic medical records: an explainable, data-driven algorithm study.


ABSTRACT:

Background

Case identification is important for health services research, measuring health system performance and risk adjustment, but existing methods based on manual chart review or diagnosis codes can be expensive, time consuming or of limited validity. We aimed to develop a hypertension case definition in electronic medical records (EMRs) for inpatient clinical notes using machine learning.

Methods

A cohort of patients 18 years of age or older who were discharged from 1 of 3 Calgary acute care facilities (1 academic hospital and 2 community hospitals) between Jan. 1 and June 30, 2015, were randomly selected, and we compared the performance of EMR phenotype algorithms developed using machine learning with an algorithm based on the Canadian version of the International Statistical Classification of Diseases and Related Health Problems, 10th Revision (ICD), in identifying patients with hypertension. Hypertension status was determined by chart review, the machine-learning algorithms used EMR notes and the ICD algorithm used the Discharge Abstract Database (Canadian Institute for Health Information).

Results

Of our study sample (n = 3040), 1475 (48.5%) patients had hypertension. The group with hypertension was older (median age of 71.0 yr v. 52.5 yr for those patients without hypertension) and had fewer females (710 [48.2%] v. 764 [52.3%]). Our final EMR-based models had higher sensitivity than the ICD algorithm (> 90% v. 47%), while maintaining high positive predictive values (> 90% v. 97%).

Interpretation

We found that hypertension tends to have clear documentation in EMRs and is well classified by concept search on free text. Machine learning can provide insights into how and where conditions are documented in EMRs and suggest nonmachine-learning phenotypes to implement.

SUBMITTER: Martin EA 

PROVIDER: S-EPMC9933992 | biostudies-literature | 2023 Jan-Feb

REPOSITORIES: biostudies-literature

altmetric image

Publications

Hypertension identification using inpatient clinical notes from electronic medical records: an explainable, data-driven algorithm study.

Martin Elliot A EA   D'Souza Adam G AG   Lee Seungwon S   Doktorchik Chelsea C   Eastwood Cathy A CA   Quan Hude H  

CMAJ open 20230101 1


<h4>Background</h4>Case identification is important for health services research, measuring health system performance and risk adjustment, but existing methods based on manual chart review or diagnosis codes can be expensive, time consuming or of limited validity. We aimed to develop a hypertension case definition in electronic medical records (EMRs) for inpatient clinical notes using machine learning.<h4>Methods</h4>A cohort of patients 18 years of age or older who were discharged from 1 of 3 C  ...[more]

Similar Datasets

| S-EPMC11493107 | biostudies-literature
| PRJNA158491 | ENA
| S-EPMC8818824 | biostudies-literature
| S-EPMC6482406 | biostudies-literature
| S-EPMC6613290 | biostudies-literature
| S-EPMC5566092 | biostudies-other
| S-EPMC7203618 | biostudies-literature
| S-EPMC10865188 | biostudies-literature
| S-EPMC7651929 | biostudies-literature
| S-EPMC5072168 | biostudies-literature