Google Research, in collaboration with Google DeepMind, has conducted a study unveiling the development of a Language Model (LLM) designed to enhance clinicians' diagnostic capabilities.
This LLM, equipped with conversational and collaborative features, specifically focuses on providing accurate differential diagnoses (DDx) for complex medical conditions. Built upon Med-PaLM 2, Google's generative AI technology, the DDx-focused LLM underwent fine-tuning using medical domain data, resulting in significant performance improvements.
In a study involving 20 clinicians assessing 302 challenging medical cases, the LLM for DDx demonstrated superior performance, achieving 59.1% accuracy compared to unassisted clinicians at 33.6%. Clinicians aided by the LLM also generated a more comprehensive list of differential diagnoses.
However, the study acknowledges certain limitations, such as redacted case reports and the selection of challenging conditions, suggesting the need for further real-world evaluation to explore the tool's potential in clinical settings.