Unknown

Dataset Information

0

LeafAI: query generator for clinical cohort discovery rivaling a human programmer.


ABSTRACT:

Objective

Identifying study-eligible patients within clinical databases is a critical step in clinical research. However, accurate query design typically requires extensive technical and biomedical expertise. We sought to create a system capable of generating data model-agnostic queries while also providing novel logical reasoning capabilities for complex clinical trial eligibility criteria.

Materials and methods

The task of query creation from eligibility criteria requires solving several text-processing problems, including named entity recognition and relation extraction, sequence-to-sequence transformation, normalization, and reasoning. We incorporated hybrid deep learning and rule-based modules for these, as well as a knowledge base of the Unified Medical Language System (UMLS) and linked ontologies. To enable data-model agnostic query creation, we introduce a novel method for tagging database schema elements using UMLS concepts. To evaluate our system, called LeafAI, we compared the capability of LeafAI to a human database programmer to identify patients who had been enrolled in 8 clinical trials conducted at our institution. We measured performance by the number of actual enrolled patients matched by generated queries.

Results

LeafAI matched a mean 43% of enrolled patients with 27 225 eligible across 8 clinical trials, compared to 27% matched and 14 587 eligible in queries by a human database programmer. The human programmer spent 26 total hours crafting queries compared to several minutes by LeafAI.

Conclusions

Our work contributes a state-of-the-art data model-agnostic query generation system capable of conditional reasoning using a knowledge base. We demonstrate that LeafAI can rival an experienced human programmer in finding patients eligible for clinical trials.

SUBMITTER: Dobbins NJ 

PROVIDER: S-EPMC10654856 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

altmetric image

Publications

LeafAI: query generator for clinical cohort discovery rivaling a human programmer.

Dobbins Nicholas J NJ   Han Bin B   Zhou Weipeng W   Lan Kristine F KF   Kim H Nina HN   Harrington Robert R   Uzuner Özlem Ö   Yetisgen Meliha M  

Journal of the American Medical Informatics Association : JAMIA 20231101 12


<h4>Objective</h4>Identifying study-eligible patients within clinical databases is a critical step in clinical research. However, accurate query design typically requires extensive technical and biomedical expertise. We sought to create a system capable of generating data model-agnostic queries while also providing novel logical reasoning capabilities for complex clinical trial eligibility criteria.<h4>Materials and methods</h4>The task of query creation from eligibility criteria requires solvin  ...[more]

Similar Datasets

| S-EPMC5210639 | biostudies-literature
| S-EPMC8179539 | biostudies-literature
| S-EPMC4677828 | biostudies-literature
2019-05-30 | GSE129551 | GEO
| S-EPMC7709163 | biostudies-literature
| S-EPMC3408576 | biostudies-literature
2020-02-20 | GSE112099 | GEO
| S-EPMC5553072 | biostudies-other
| S-EPMC4619869 | biostudies-literature
| PRJNA531709 | ENA