Analyzing evaluation methods for large language models in the medical field: a scoping review

Exploring foci of: BMC Medical Informatics and Decision Making • Vol 24 • No 1 Analyzing evaluation methods for large language models in the medical field: a scoping review November 2024 • Junbok Lee, Sungkyung Park, Jaeyong Shin, Belong Cho Abstract Background Owing to the rapid growth in the popularity of Large Language Models (LLMs), various performance evaluation studies have been conducted to confirm their applicability in the medical field. However, there is still no clear framework for evaluating LLMs. Objective This study reviews studies on LLM evaluations in the medical field and analyzes the research methods used in these studies. It aims to provide a reference for future researchers designing LLM studies. Methods &amp; materials We conducte… Open Article Page

Health Informatics Medicine Computer Science Medical Education Public Health Histopathology Mathematics Social Psychology Mathematics Education Open Article