Explanipedia

Operationalizing machine-assisted translation in healthcare Open

Iván López, David Velásquez, Jonathan H. Chen, Jorge A. Rodriguez · 2025

Over 25 million U.S. patients with a non-English language preference face unsafe care because discharge instructions and other materials are rarely translated in time. Advances in translation assisted by large language models can close thi…

Optimizing large language models for detecting symptoms of depression/anxiety in chronic diseases patient communications Open

Jiyeong Kim, P. Stephen, Michael L. Chen, Isaac R. Galatzer‐Levy, John Torous, et al. · 2025

Patients with diabetes are at increased risk of comorbid depression or anxiety, complicating their management. This study evaluated the performance of large language models (LLMs) in detecting these symptoms from secure patient messages. W…

Fine-Tuning Methods for Large Language Models in Clinical Medicine by Supervised Fine-Tuning and Direct Preference Optimization: Comparative Evaluation Open

Thomas Savage, P. Stephen, Abdessalem Boukil, Ekanath Rangan, Vishwesh Patel, et al. · 2025

Background: Large language model (LLM) fine-tuning is the process of adjusting out-of-the-box model weights using a dataset of interest. Fine-tuning can be a powerful technique to improve model performance in fields like medicine, where LLM…

MedFactEval and MedAgentBrief: A Framework and Workflow for Generating and Evaluating Factual Clinical Summaries Open

François Grolleau, Emily Alsentzer, Timothy Keyes, Philip Chung, Akshay Swaminathan, et al. · 2025

Evaluating factual accuracy in Large Language Model (LLM)-generated clinical text is a critical barrier to adoption, as expert review is unscalable for the continuous quality assurance these systems require. We address this challenge with …

Quantization-aware matrix factorization for low bit rate image compression Open

Pooya Ashtari, Pourya Behmandpoor, Fateme Nateghi Haredasht, Jonathan H. Chen, Panagiotis Patrinos, et al. · 2025

Predicting treatment retention in medication for opioid use disorder: a machine learning approach using NLP and LLM-derived clinical features Open

Fateme Nateghi Haredasht, Iván López, Steven Tate, Pooya Ashtari, Min Min Chan, et al. · 2025

Objective: Building upon our previous work on predicting treatment retention in medications for opioid use disorder, we aimed to improve 6-month retention prediction in buprenorphine-naloxone (BUP-NAL) therapy by incorporating features deri…

Real time machine learning prediction of next generation sequencing test results in live clinical settings Open

Grace Y E Kim, Matthew Schwede, Conor K. Corbin, Sajjad Fouladvand, Rondeep Brar, et al. · 2025

Leveraging Large Language Models and Patient Portal Messages for Early Identification of Depression Open

Jiyeong Kim, John Torous, Julia Adler‐Milstein, Peter van Roessel, Fátima Rodríguez, et al. · 2025

Importance: Large language model (LLM)-assisted early warning system may help overcome existing barriers to timely depression diagnosis in patients with cardiovascular disease (CVD). This novel application of LLMs to screen patient messages…

MedAgentBench: A Virtual EHR Environment to Benchmark Medical LLM Agents Open

Yixing Jiang, Kameron Collin Black, Gloria Geng, Dae-Gyun Park, James Zou, et al. · 2025

Contrast-induced acute kidney injury and nephrogenic systemic fibrosis in children Open

Alice Ming-jie Chuah, Jonathan H. Chen, Alison Lap‐tak, Kin Fen Kevin Fung, Eugene Yu-hin Chan · 2025

Intravascular contrast media plays an important role in improving tissue and vascular characterisation in diagnostic imaging and image-guided intervention. Iodinated contrast media are commonly used in imaging modalities which utilise ioni…

Antibiotic Resistance Microbiology Dataset (ARMD): A Resource for Antimicrobial Resistance from EHRs Open

Fateme Nateghi Haredasht, Fatemeh Amrollahi, Manoj V. Maddali, N. J. Marshall, P. Stephen, et al. · 2025

The Antibiotic Resistance Microbiology Dataset (ARMD) is a de-identified resource derived from electronic health records (EHR) that facilitates research in antimicrobial resistance (AMR). ARMD encompasses big data from adult patients colle…

A typology of physician input approaches to using AI chatbots for clinical decision-making: a mixed methods study Open

Rachel Siden, Hannah Kerman, Robert J. Gallo, Joséphine A. Cool, Jason Hom, et al. · 2025

Background: Large language model (LLM) chatbots demonstrate high degrees of accuracy, yet recent studies found that physicians using these same chatbots may score no better to worse on clinical reasoning tests compared to the chatbot perfo…

From Tool to Teammate: A Randomized Controlled Trial of Clinician-AI Collaborative Workflows for Diagnosis Open

Selin Everett, Bryan Bunning, Priyank Jain, Iván López, Anup Agarwal, et al. · 2025

Early studies of large language models (LLMs) in clinical settings have largely treated artificial intelligence (AI) as a tool rather than an active collaborator. As LLMs now demonstrate expert-level diagnostic performance, the focus shift…

MedHELM: Holistic Evaluation of Large Language Models for Medical Tasks Open

Suhana Bedi, Hejie Cui, Miguel Fuentes, Alyssa Unell, Michael Wornow, et al. · 2025

While large language models (LLMs) achieve near-perfect scores on medical licensing exams, these evaluations inadequately reflect the complexity and diversity of real-world clinical practice. We introduce MedHELM, an extensible evaluation …

Discrete-Event Simulation Modeling Framework for Cancer Interventions and Population Health in R (DESCIPHR): An Open-Source Pipeline Open

Selina Pi, Carolyn M. Rutter, Carlos Pineda‐Antúnez, Jonathan H. Chen, Jeremy D. Goldhaber‐Fiebert, et al. · 2025

Simulation models inform health policy decisions by integrating data from multiple sources and forecasting outcomes when there is a lack of comprehensive evidence from empirical studies. Such models have long supported health policy for ca…

Artificial intelligence tools in supporting healthcare professionals for tailored patient care Open

Jiyeong Kim, Michael L. Chen, Shawheen J. Rezaei, Tina Hernandez‐Boussard, Jonathan H. Chen, et al. · 2025

Artificial intelligence (AI) tools to support clinicians in providing patient-centered care can contribute to patient empowerment and care efficiency. We aimed to draft potential AI tools for tailored patient support corresponding to patie…

Antibiotic Resistance Microbiology Dataset (ARMD): A Resource for Antimicrobial Resistance from EHRs Open

Fateme Nateghi Haredasht, Fatemeh Amrollahi, Manoj V. Maddali, N. J. Marshall, Shihong Ma, et al. · 2025

The Antibiotic Resistance Microbiology Dataset (ARMD) is a de-identified resource derived from electronic health records (EHR) that facilitates research in antimicrobial resistance (AMR). ARMD encompasses big data from adult patients colle…

Red teaming ChatGPT in medicine to yield real-world insights on model behavior Open

Crystal Chang, Hodan Farah, Haiwen Gui, Shawheen J. Rezaei, Charbel Bou-Khalil, et al. · 2025

Physician clinical decision modification and bias assessment in a randomized controlled trial of AI assistance Open

Ethan Goh, Bryan Bunning, Elaine C. Khoong, Robert J. Gallo, Arnold Milstein, et al. · 2025

Background: Artificial intelligence assistance in clinical decision making shows promise, but concerns exist about potential exacerbation of demographic biases in healthcare. This study aims to evaluate how physician clinical decisions and …

The improved prognosis of FLT3-internal tandem duplication but not tyrosine kinase domain mutations in acute myeloid leukemia in the era of targeted therapy: a real-world study using large-scale electronic health record data Open

Matthew Schwede, Gladys Rodriguez, Vanessa E. Kennedy, Solomon Henry, Douglas Wood, et al. · 2025

Embedding-Driven Diversity Sampling to Improve Few-Shot Synthetic Data Generation Open

Iván López, Fateme Nateghi Haredasht, Kaitlin Caoili, Jonathan H. Chen, Akshay Chaudhari · 2025

Accurate classification of clinical text often requires fine-tuning pre-trained language models, a process that is costly and time-consuming due to the need for high-quality data and expert annotators. Synthetic data generation offers an a…

Clinical entity augmented retrieval for clinical information extraction Open

Iván López, Akshay Swaminathan, Karthik S. Vedula, Sanjana Narayanan, Fateme Nateghi Haredasht, et al. · 2025

Large language models (LLMs) with retrieval-augmented generation (RAG) have improved information extraction over previous methods, yet their reliance on embeddings often leads to inefficient retrieval. We introduce CLinical Entity Augmente…

Toward expert-level medical question answering with large language models Open

K. K. Singhal, Tao Tu, Juraj Gottweis, Rory Sayres, Ellery Wulczyn, et al. · 2025

powerROC: An Interactive Web Tool for Sample Size Calculation in Assessing Models' Discriminative Abilities Open

François Grolleau, Robert Tibshirani, Jonathan H. Chen · 2025

Rigorous external validation is crucial for assessing the generalizability of prediction models, particularly by evaluating their discrimination (AUROC) on new data. This often involves comparing a new model's AUROC to that of an establish…

Constrained Design of a Binary Instrument in a Partially Linear Model Open

Tim Morrison, Minh Nguyen, Jonathan H. Chen, Michael Baiocchi, Art B. Owen · 2025

We study the question of how best to assign an encouragement in a randomized encouragement study. In our setting, units arrive with covariates, receive a nudge toward treatment or control, acquire one of those statuses in a way that need n…

Feasibility of Automated Precharting using GPT-4 in New Specialty Referrals. Open

April S. Liang, Juan M. Banda, Thomas Savage, Abby Pandya, R. Carey, et al. · 2025

This study evaluates the feasibility of using GPT-4 to automate precharting for specialty referrals, focusing on new patients referred to an otolaryngology clinic for nasal congestion. We describe the design decisions and strategies tested…

Establishing best practices in large language model research: an application to repeat prompting Open

Robert J. Gallo, Michael Baiocchi, Thomas Savage, Jonathan H. Chen · 2024

Objectives: We aimed to demonstrate the importance of establishing best practices in large language model research, using repeat prompting as an illustrative example. Materials and Methods: Using data from a prior study investigating potenti…

Learning from the EHR to implement AI in healthcare Open

Christian Rose, Jonathan H. Chen · 2024

Recommendations for Clinicians, Technologists, and Healthcare Organizations on the Use of Generative Artificial Intelligence in Medicine: A Position Statement from the Society of General Internal Medicine Open

Byron Crowe, Shreya Shah, Derek Teng, P. Stephen, Matthew DeCamp, et al. · 2024

Large Language Model Influence on Diagnostic Reasoning Open

Ethan Goh, Robert Gallo, Jason Hom, Eric Strong, Yingjie Weng, et al. · 2024

Importance: Large language models (LLMs) have shown promise in their performance on both multiple-choice and open-ended medical reasoning examinations, but it remains unknown whether the use of such tools improves physician diagnostic reaso…

Jonathan H. Chen