...

Unveiling the reasoning behaviour of medical Large Language Models


View a PDF of the paper titled Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models, by Shamus Sim and Tyrone Chen

View PDF

Abstract:Background: Despite the current ubiquity of Large Language Models (LLMs) across the medical domain, there is a surprising lack of studies which address their reasoning behaviour. We emphasise the importance of understanding reasoning behaviour as opposed to high-level prediction accuracies, since it is equivalent to explainable AI (XAI) in this context. In particular, achieving XAI in medical LLMs used in the clinical domain will have a significant impact across the healthcare sector. Results: Therefore, in this work, we adapt the existing concept of reasoning behaviour and articulate its interpretation within the specific context of medical LLMs. We survey and categorise current state-of-the-art approaches for modeling and evaluating reasoning reasoning in medical LLMs. Additionally, we propose theoretical frameworks which can empower medical professionals or machine learning engineers to gain insight into the low-level reasoning operations of these previously obscure models. We also outline key open challenges facing the development of Large Reasoning Models. Conclusion: The subsequent increased transparency and trust in medical machine learning models by clinicians as well as patients will accelerate the integration, application as well as further development of medical AI for the healthcare system as a whole.

Submission history

From: Shamus Zi Yang Sim [view email]
[v1]
Fri, 20 Dec 2024 10:06:52 UTC (429 KB)
[v2]
Mon, 28 Jul 2025 13:13:02 UTC (677 KB)

Source link

#Unveiling #reasoning #behaviour #medical #Large #Language #Models