BMC Medical Informatics and Decision Making • Vol 24 • No 1
Analyzing evaluation methods for large language models in the medical field: a scoping review
November 2024 • Junbok Lee, Sungkyung Park, Jaeyong Shin, Belong Cho
Abstract Background Owing to the rapid growth in the popularity of Large Language Models (LLMs), various performance evaluation studies have been conducted to confirm their applicability in the medical field. However, there is still no clear framework for evaluating LLMs. Objective This study reviews studies on LLM evaluations in the medical field and analyzes the research methods used in these studies. It aims to provide a reference for future researchers designing LLM studies. Methods & materials We conducte…