Dataset Information

Language models outperform cloze predictability in a cognitive model of reading.

ABSTRACT: Although word predictability is commonly considered an important factor in reading, sophisticated accounts of predictability in theories of reading are lacking. Computational models of reading traditionally use cloze norming as a proxy of word predictability, but what cloze norms precisely capture remains unclear. This study investigates whether large language models (LLMs) can fill this gap. Contextual predictions are implemented via a novel parallel-graded mechanism, where all predicted words at a given position are pre-activated as a function of contextual certainty, which varies dynamically as text processing unfolds. Through reading simulations with OB1-reader, a cognitive model of word recognition and eye-movement control in reading, we compare the model's fit to eye-movement data when using predictability values derived from a cloze task against those derived from LLMs (GPT-2 and LLaMA). Root Mean Square Error between simulated and human eye movements indicates that LLM predictability provides a better fit than cloze. This is the first study to use LLMs to augment a cognitive model of reading with higher-order language processing while proposing a mechanism on the interplay between word predictability and eye movements.
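The comparison metric named in the abstract, Root Mean Square Error, can be illustrated with a minimal sketch. The function name `rmse`, the variable names, and all numbers below are hypothetical and chosen only for illustration; they are not data or results from the study, and this is not the OB1-reader evaluation code.

```python
import math


def rmse(simulated, observed):
    """Root Mean Square Error between two equal-length sequences."""
    if len(simulated) != len(observed) or len(simulated) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(s - o) ** 2 for s, o in zip(simulated, observed)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Made-up per-word fixation durations in milliseconds (NOT study data).
human = [210.0, 245.0, 190.0, 260.0]
sim_cloze = [230.0, 220.0, 215.0, 235.0]  # hypothetical run with cloze predictability
sim_llm = [215.0, 240.0, 200.0, 250.0]    # hypothetical run with LLM predictability

rmse_cloze = rmse(sim_cloze, human)
rmse_llm = rmse(sim_llm, human)

# A lower RMSE means the simulated values lie closer to the human values.
print(f"cloze RMSE: {rmse_cloze:.2f} ms, LLM RMSE: {rmse_llm:.2f} ms")
```

Under this reading, a condition "provides a better fit" when its RMSE against the human eye-movement measure is lower than that of the other condition.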

SUBMITTER: Lopes Rego AT

PROVIDER: S-EPMC11458034 | biostudies-literature | 2024 Sep

REPOSITORIES: biostudies-literature

Publications

Language models outperform cloze predictability in a cognitive model of reading.

Lopes Rego AT (Adrielli Tina), Snell J (Joshua), Meeter M (Martijn)

PLoS Computational Biology, 2024-09-25, issue 9

PMID: 39321153

Similar Datasets

Nucleotide context models outperform protein language models for predicting antibody affinity maturation. | S-EPMC12685221 | biostudies-literature

Nucleotide context models outperform protein language models for predicting antibody affinity maturation. | S-EPMC12262217 | biostudies-literature

Large language models can outperform humans in social situational judgments. | S-EPMC11551142 | biostudies-literature

Large language models outperform traditional natural language processing methods in extracting patient-reported outcomes in IBD. | S-EPMC11398594 | biostudies-literature

Large language models outperform general practitioners in identifying complex cases of childhood anxiety. | S-EPMC11648044 | biostudies-literature

Large language models outperform traditional structured data-based approaches in identifying immunosuppressed patients. | S-EPMC11759841 | biostudies-literature

Large Language Models Outperform Traditional Natural Language Processing Methods in Extracting Patient-Reported Outcomes in Inflammatory Bowel Disease. | S-EPMC11772946 | biostudies-literature

Word Frequency and Predictability Dissociate in Naturalistic Reading | S-EPMC10932590 | biostudies-literature

The relationships between oral language and reading instruction: Evidence from a computational model of reading. | S-EPMC7612124 | biostudies-literature

Guideline-enhanced large language models outperform physician-test takers on EASL Campus quizzes multiple choice questions. | S-EPMC12478250 | biostudies-literature