New Study Puts Claude3 and GPT-4 up Against a Medical Knowledge Pressure Test

Kahun, the evidence-based clinical AI engine for healthcare providers, shares the findings from a new study on the medical capabilities of readily available large language models (LLMs). The study compared the medical accuracy of OpenAI’s GPT-4 and Anthropic’s Claude3-Opus with each other and with human medical experts, using questions based on objective medical knowledge drawn from Kahun’s Knowledge Graph.