Unknown

Dataset Information

0

GPT-4V exhibits human-like performance in biomedical image classification.


ABSTRACT: We demonstrate that GPT-4V(ision), a large multimodal model, exhibits strong one-shot learning ability, generalizability, and natural language interpretability in various biomedical image classification tasks, including classifying cell types, tissues, cell states, and disease status. Such features resemble human-like performance and distinguish GPT-4V from conventional image classification methods, which typically require large cohorts of training data and lack interpretability.

SUBMITTER: Hou W 

PROVIDER: S-EPMC10802384 | biostudies-literature | 2024 Jan

REPOSITORIES: biostudies-literature

altmetric image

Publications

Assessing large multimodal models for one-shot learning and interpretability in biomedical image classification.

Hou Wenpin W   Liu Qi Q   Ma Huifang H   Qu Yilong Y   Ji Zhicheng Z  

bioRxiv : the preprint server for biology 20250104


Image classification plays a pivotal role in analyzing biomedical images, serving as a cornerstone for both biological research and clinical diagnostics. We demonstrate that large multimodal models (LMMs), like GPT-4, excel in one-shot learning, generalization, interpretability, and text-driven image classification across diverse biomedical tasks. These tasks include the classification of tissues, cell types, cellular states, and disease status. LMMs stand out from traditional single-modal class  ...[more]

Similar Datasets

| S-EPMC11773068 | biostudies-literature
| S-EPMC11836096 | biostudies-literature
| S-EPMC11009855 | biostudies-literature
| S-EPMC11326574 | biostudies-literature
| S-EPMC11780673 | biostudies-literature
| S-EPMC9852451 | biostudies-literature
| S-EPMC11336503 | biostudies-literature
| S-EPMC11250047 | biostudies-literature
| S-EPMC10365615 | biostudies-literature
| S-EPMC11419952 | biostudies-literature