Dataset Information

Evaluation of the portability of computable phenotypes with natural language processing in the eMERGE network.


ABSTRACT: The electronic Medical Records and Genomics (eMERGE) Network assessed the feasibility of deploying portable phenotype rule-based algorithms with natural language processing (NLP) components added to improve performance of existing algorithms using electronic health records (EHRs). Based on scientific merit and predicted difficulty, eMERGE selected six existing phenotypes to enhance with NLP. We assessed performance, portability, and ease of use. We summarized lessons learned by: (1) challenges; (2) best practices to address challenges based on existing evidence and/or eMERGE experience; and (3) opportunities for future research. Adding NLP resulted in improved, or the same, precision and/or recall for all but one algorithm. Portability, phenotyping workflow/process, and technology were major themes. With NLP, development and validation took longer. Besides portability of NLP technology and algorithm replicability, factors to ensure success include privacy protection, technical infrastructure setup, intellectual property agreement, and efficient communication. Workflow improvements can improve communication and reduce implementation time. NLP performance varied mainly due to clinical document heterogeneity; therefore, we suggest using semi-structured notes, comprehensive documentation, and customization options. NLP portability is possible with improved phenotype algorithm performance, but careful planning and architecture of the algorithms is essential to support local customizations.

SUBMITTER: Pacheco JA 

PROVIDER: S-EPMC9898520 | biostudies-literature | 2023 Feb

REPOSITORIES: biostudies-literature

Publications

Evaluation of the portability of computable phenotypes with natural language processing in the eMERGE network.

Jennifer A Pacheco, Luke V Rasmussen, Ken Wiley, Thomas Nate Person, David J Cronkite, Sunghwan Sohn, Shawn Murphy, Justin H Gundelach, Vivian Gainer, Victor M Castro, Cong Liu, Frank Mentch, Todd Lingren, Agnes S Sundaresan, Garrett Eickelberg, Valerie Willis, Al'ona Furmanchuk, Roshan Patel, David S Carrell, Yu Deng, Nephi Walton, Benjamin A Satterfield, Iftikhar J Kullo, Ozan Dikilitas, Joshua C Smith, Josh F Peterson, Ning Shang, Krzysztof Kiryluk, Yizhao Ni, Yikuan Li, Girish N Nadkarni, Elisabeth A Rosenthal, Theresa L Walunas, Marc S Williams, Elizabeth W Karlson, Jodell E Linder, Yuan Luo, Chunhua Weng, WeiQi Wei

Scientific Reports, 2023-02-03, issue 1


Similar Datasets

| S-EPMC6241736 | biostudies-other
| S-EPMC9835770 | biostudies-literature
| S-EPMC6301375 | biostudies-literature
| S-EPMC10466442 | biostudies-literature
| S-EPMC9952043 | biostudies-literature
| S-EPMC9434462 | biostudies-literature
| S-EPMC10919678 | biostudies-literature
| S-EPMC7224173 | biostudies-literature
| S-EPMC10902457 | biostudies-literature
| S-EPMC10822845 | biostudies-literature