Ontology highlight
ABSTRACT:
SUBMITTER: Alkaoud M
PROVIDER: S-EPMC10909174 | biostudies-literature | 2024
REPOSITORIES: biostudies-literature

PeerJ. Computer science 20240229
This work introduces a new benchmark for the bilingual evaluation of large language models (LLMs) in English and Arabic. While LLMs have transformed various fields, their evaluation in Arabic remains limited. This work addresses this gap by proposing a novel evaluation method for LLMs in both Arabic and English, allowing for a direct comparison between the performance of the two languages. We build a new evaluation dataset based on the General Aptitude Test (GAT), a standardized test widely used ...[more]