Unknown

Dataset Information

0

GeneTuring tests GPT models in genomics.


ABSTRACT: Generative Pre-trained Transformers (GPT) are powerful language models that have great potential to transform biomedical research. However, they are known to suffer from artificial hallucinations and provide false answers that are seemingly correct in some situations. We developed GeneTuring, a comprehensive QA database with 600 questions in genomics, and manually scored 10,800 answers returned by six GPT models, including GPT-3, ChatGPT, and New Bing. New Bing has the best overall performance and significantly reduces the level of AI hallucination compared to other models, thanks to its ability to recognize its incapacity in answering questions. We argue that improving incapacity awareness is equally important as improving model accuracy to address AI hallucination.

SUBMITTER: Hou W 

PROVIDER: S-EPMC10054955 | biostudies-literature | 2023 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Benchmarking large language models for genomic knowledge with GeneTuring.

Hou Wenpin W   Shang Xinyi X   Ji Zhicheng Z  

bioRxiv : the preprint server for biology 20250105


Large language models have demonstrated great potential in biomedical research. However, their ability to serve as a knowledge base for genomic research remains unknown. We developed GeneTuring, a comprehensive Q&A database containing 1,200 questions in genomics, and manually scored 25,200 answers provided by six GPT models, including GPT-4o, Claude 3.5, and Gemini Advanced. GPT-4o with web access showed the best overall performance and excelled in most tasks. However, it still failed to correct  ...[more]

Similar Datasets

| S-EPMC11842050 | biostudies-literature
| S-EPMC8049133 | biostudies-literature
| S-EPMC10829255 | biostudies-literature
| S-EPMC11419952 | biostudies-literature
| S-EPMC6951249 | biostudies-literature
| S-EPMC10640689 | biostudies-literature
| S-EPMC11240076 | biostudies-literature
| S-EPMC1488818 | biostudies-literature
| S-EPMC10733745 | biostudies-literature
| S-EPMC11364944 | biostudies-literature