Dataset Information

Augmenting interpretable models with large language models during training.


ABSTRACT: Recent large language models (LLMs), such as ChatGPT, have demonstrated remarkable prediction performance for a growing array of tasks. However, their proliferation into high-stakes domains and compute-limited settings has created a burgeoning need for interpretability and efficiency. We address this need by proposing Aug-imodels, a framework for leveraging the knowledge learned by LLMs to build extremely efficient and interpretable prediction models. Aug-imodels use LLMs during fitting but not during inference, allowing complete transparency and often a speed/memory improvement of greater than 1000x for inference compared to LLMs. We explore two instantiations of Aug-imodels in natural-language processing: Aug-Linear, which augments a linear model with decoupled embeddings from an LLM, and Aug-Tree, which augments a decision tree with LLM feature expansions. Across a variety of text-classification datasets, both outperform their non-augmented, interpretable counterparts. Aug-Linear can even outperform much larger models, e.g. a 6-billion parameter GPT-J model, despite having 10,000x fewer parameters and being fully transparent. We further explore Aug-imodels in a natural-language fMRI study, where they generate interesting interpretations from scientific data.
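The abstract's claim that the LLM is used during fitting but not during inference can be illustrated with a rough sketch of the Aug-Linear idea. This is not the authors' implementation. It assumes that each n-gram is embedded on its own (the "decoupled" part), that the embeddings are summed, and that a linear model is fit on the sum. Because the model is linear, each n-gram's contribution then collapses to one scalar that can be cached in a lookup table. The names toy_embed, featurize, build_table, and predict_score are hypothetical, and toy_embed is a hash-based stand-in for a real LLM embedding.

```python
import hashlib
import math

DIM = 16  # toy embedding size (assumption; real LLM embeddings are much larger)

def toy_embed(ngram):
    """Hypothetical stand-in for an LLM embedding: a deterministic vector from a hash."""
    digest = hashlib.sha256(ngram.encode("utf-8")).digest()
    return [(byte - 127.5) / 127.5 for byte in digest[:DIM]]

def ngrams(text, max_n=2):
    """All unigrams and bigrams of a whitespace-tokenized, lowercased text."""
    tokens = text.lower().split()
    return [" ".join(tokens[i:i + n])
            for n in range(1, max_n + 1)
            for i in range(len(tokens) - n + 1)]

def featurize(text):
    """Sum of per-n-gram embeddings; each n-gram is embedded independently."""
    vec = [0.0] * DIM
    for gram in ngrams(text):
        for j, value in enumerate(toy_embed(gram)):
            vec[j] += value
    return vec

def fit_logistic(X, y, epochs=200, lr=0.1):
    """Plain logistic regression trained with stochastic gradient descent."""
    w, b = [0.0] * DIM, 0.0
    for _ in range(epochs):
        for x, target in zip(X, y):
            z = b + sum(wi * xi for wi, xi in zip(w, x))
            z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
            grad = 1.0 / (1.0 + math.exp(-z)) - target
            w = [wi - lr * grad * xi for wi, xi in zip(w, x)]
            b -= lr * grad
    return w, b

def build_table(texts, w):
    """Fitting-time step: cache one scalar contribution per n-gram seen in training."""
    return {gram: sum(wi * ei for wi, ei in zip(w, toy_embed(gram)))
            for text in texts for gram in ngrams(text)}

def predict_score(text, table, b):
    """Inference-time step: only dictionary lookups, no embedding calls.
    Unseen n-grams contribute 0 in this sketch (an assumption)."""
    return b + sum(table.get(gram, 0.0) for gram in ngrams(text))

# Made-up toy data, for illustration only.
TEXTS = ["great movie", "really great acting", "terrible movie", "really terrible plot"]
LABELS = [1, 1, 0, 0]

w, b = fit_logistic([featurize(t) for t in TEXTS], LABELS)
table = build_table(TEXTS, w)
```

For any text made up only of n-grams seen during fitting, predict_score(text, table, b) matches the score of the linear model applied to featurize(text), up to floating-point rounding. The table is also what makes such a model inspectable: each entry is the additive contribution of one n-gram to the logit.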

SUBMITTER: Singh C 

PROVIDER: S-EPMC10689442 | biostudies-literature | 2023 Nov

REPOSITORIES: biostudies-literature

Publications

Augmenting interpretable models with large language models during training.

Chandan Singh, Armin Askari, Rich Caruana, Jianfeng Gao

Nature Communications, 2023-11-30, issue 1



Similar Datasets

| S-EPMC10689487 | biostudies-literature
| S-EPMC10904143 | biostudies-literature
| S-EPMC10153281 | biostudies-literature
| S-EPMC11761653 | biostudies-literature
| S-EPMC10591138 | biostudies-literature
| S-EPMC11501434 | biostudies-literature
| S-EPMC11654935 | biostudies-literature
| S-EPMC11261925 | biostudies-literature
| S-EPMC10396962 | biostudies-literature
| S-EPMC11669866 | biostudies-literature