Unknown

Dataset Information

0

Inductive reasoning with large language models: a simulated randomized controlled trial for epilepsy.


ABSTRACT:

Importance

The analysis of electronic medical records at scale to learn from clinical experience is currently very challenging. The integration of artificial intelligence (AI), specifically foundational large language models (LLMs), into an analysis pipeline may overcome some of the current limitations of modest input sizes, inaccuracies, biases, and incomplete knowledge bases.

Objective

To explore the effectiveness of using an LLM for generating realistic clinical data and other LLMs for summarizing and synthesizing information in a model system, simulating a randomized clinical trial (RCT) in epilepsy to demonstrate the potential of inductive reasoning via medical chart review.

Design

An LLM-generated simulated RCT based on a RCT for treatment with an antiseizure medication, cenobamate, including a placebo arm and a full-strength drug arm, evaluated by an LLM-based pipeline versus a human reader.

Setting

Simulation based on realistic seizure diaries, treatment effects, reported symptoms and clinical notes generated by LLMs with multiple different neurologist writing styles.

Participants

Simulated cohort of 240 patients, divided 1:1 into placebo and drug arms.

Intervention

Utilization of LLMs for the generation of clinical notes and for the synthesis of data from these notes, aiming to evaluate the efficacy and safety of cenobamate in seizure control either with a human evaluator or AI-pipeline.

Measures

The AI and human analysis focused on identifying the number of seizures, symptom reports, and treatment efficacy, with statistical analysis comparing the 50%-responder rate and median percentage change between the placebo and drug arms, as well as side effect rates in each arm.

Results

AI closely mirrored human analysis, demonstrating the drug's efficacy with marginal differences (<3%) in identifying both drug efficacy and reported symptoms.

Conclusions and relevance

This study showcases the potential of LLMs accurately simulate and analyze clinical trials. Significantly, it highlights the ability of LLMs to reconstruct essential trial elements, identify treatment effects, and recognize reported symptoms, within a realistic clinical framework. The findings underscore the relevance of LLMs in future clinical research, offering a scalable, efficient alternative to traditional data mining methods without the need for specialized medical language training.

SUBMITTER: Goldenholz DM 

PROVIDER: S-EPMC10984041 | biostudies-literature | 2024 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Inductive reasoning with large language models: a simulated randomized controlled trial for epilepsy.

Goldenholz Daniel M DM   Goldenholz Shira R SR   Habib Sara S   Westover M Brandon MB  

medRxiv : the preprint server for health sciences 20240319


<h4>Importance</h4>The analysis of electronic medical records at scale to learn from clinical experience is currently very challenging. The integration of artificial intelligence (AI), specifically foundational large language models (LLMs), into an analysis pipeline may overcome some of the current limitations of modest input sizes, inaccuracies, biases, and incomplete knowledge bases.<h4>Objective</h4>To explore the effectiveness of using an LLM for generating realistic clinical data and other  ...[more]

Similar Datasets

| S-EPMC11326321 | biostudies-literature
| S-EPMC11519755 | biostudies-literature
| S-EPMC11554647 | biostudies-literature
| S-EPMC11605890 | biostudies-literature
| S-EPMC10980135 | biostudies-literature
| S-EPMC10766525 | biostudies-literature
| S-EPMC10409727 | biostudies-literature
| S-EPMC11250216 | biostudies-literature
| S-EPMC11393503 | biostudies-literature
2024-06-13 | PXD046061 | Pride