Ontology highlight
ABSTRACT:
SUBMITTER: Kirch NM
PROVIDER: S-EPMC12373810 | biostudies-literature | 2025 Aug
REPOSITORIES: biostudies-literature

Scientific reports 20250822 1
We present the TRIAGE benchmark, a novel machine ethics benchmark designed to evaluate the ethical decision-making abilities of large language models (LLMs) in mass casualty scenarios. TRIAGE uses medical dilemmas created by healthcare professionals to evaluate the ethical decision-making of AI systems in real-world, high-stakes scenarios. We evaluated six major LLMs on TRIAGE, examining how different ethical and adversarial prompts influence model behavior. Our results show that most models con ...[more]