Unknown

Dataset Information

0

Language models outperform cloze predictability in a cognitive model of reading.


ABSTRACT: Although word predictability is commonly considered an important factor in reading, sophisticated accounts of predictability in theories of reading are lacking. Computational models of reading traditionally use cloze norming as a proxy of word predictability, but what cloze norms precisely capture remains unclear. This study investigates whether large language models (LLMs) can fill this gap. Contextual predictions are implemented via a novel parallel-graded mechanism, where all predicted words at a given position are pre-activated as a function of contextual certainty, which varies dynamically as text processing unfolds. Through reading simulations with OB1-reader, a cognitive model of word recognition and eye-movement control in reading, we compare the model's fit to eye-movement data when using predictability values derived from a cloze task against those derived from LLMs (GPT-2 and LLaMA). Root Mean Square Error between simulated and human eye movements indicates that LLM predictability provides a better fit than cloze. This is the first study to use LLMs to augment a cognitive model of reading with higher-order language processing while proposing a mechanism on the interplay between word predictability and eye movements.

SUBMITTER: Lopes Rego AT 

PROVIDER: S-EPMC11458034 | biostudies-literature | 2024 Sep

REPOSITORIES: biostudies-literature

altmetric image

Publications

Language models outperform cloze predictability in a cognitive model of reading.

Lopes Rego Adrielli Tina AT   Snell Joshua J   Meeter Martijn M  

PLoS computational biology 20240925 9


Although word predictability is commonly considered an important factor in reading, sophisticated accounts of predictability in theories of reading are lacking. Computational models of reading traditionally use cloze norming as a proxy of word predictability, but what cloze norms precisely capture remains unclear. This study investigates whether large language models (LLMs) can fill this gap. Contextual predictions are implemented via a novel parallel-graded mechanism, where all predicted words  ...[more]

Similar Datasets

| S-EPMC12685221 | biostudies-literature
| S-EPMC12262217 | biostudies-literature
| S-EPMC11551142 | biostudies-literature
| S-EPMC11648044 | biostudies-literature
| S-EPMC10932590 | biostudies-literature