Explanipedia

CrochetBench: Can Vision-Language Models Move from Describing to Doing in Crochet Domain? Open

Peiyu Li, Xiaobao Huang, Nitesh V. Chawla · 2025

We present CrochetBench, a benchmark for evaluating the ability of multimodal large language models to perform fine-grained, low-level procedural reasoning in the domain of crochet. Unlike prior benchmarks that focus on high-level descript…

Think it Image by Image: Multi-Image Moral Reasoning of Large Vision-Language Models Open

Chujie Gao, Yue Huang, Xiangqi Wang, Siyuan Wu, Nitesh V. Chawla, et al. · 2025

Proto-Yield: An Uncertainty-Aware Prototype Network for Yield Prediction in Real-world Chemical Reactions Open

Kehan Guo, Zhen Liu, Zhichun Guo, Bozhao Nan, Olexandr Isayev, et al. · 2025

SPECTRA: Spectral Target-Aware Graph Augmentation for Imbalanced Molecular Property Regression Open

Brenda Nogueira, Meng Jiang, Nitesh V. Chawla, Nuno Moniz · 2025

In molecular property prediction, the most valuable compounds (e.g., high potency) often occupy sparse regions of the target space. Standard Graph Neural Networks (GNNs) commonly optimize for the average error, underperforming on these unc…

Adaptive Testing for LLM Evaluation: A Psychometric Alternative to Static Benchmarks Open

Xiuxiu Tang, Ying Cheng, Ronald Metoyer, Hua Ting, Nitesh V. Chawla · 2025

Large language model evaluation requires thousands of benchmark items, making evaluations expensive and slow. Existing methods compute average accuracy across fixed item sets, treating all items equally despite varying quality and informat…

LLMs4All: A Review of Large Language Models Across Academic Disciplines Open

Yanfang Ye, Zheyuan Zhang, Tianyi Ma, Zehong Wang, Yiyang Li, et al. · 2025

Cutting-edge Artificial Intelligence (AI) techniques keep reshaping our view of the world. For example, Large Language Models (LLMs) based applications such as ChatGPT have shown the capability of generating human-like conversation on exte…

Explanation Difference: Bridging Procedural and Distributional Fairness Open

Joe Germino, Yuying Zhao, Tyler Derr, Nuno Moniz, Nitesh V. Chawla · 2025

Fairness in Machine Learning (Fair ML) is often presented as a trade-off between predictive performance and equality of predicted values. This view of fairness, commonly referred to as distributional fairness, fails to consider how a model…

KEO: Knowledge Extraction on OMIn via Knowledge Graphs and RAG for Safety-Critical Aviation Maintenance Open

Kuangshi Ai, Jonathan A. Karr, Meng Jiang, Nitesh V. Chawla, Chaoli Wang · 2025

We present Knowledge Extraction on OMIn (KEO), a domain-specific knowledge extraction and reasoning framework with large language models (LLMs) in safety-critical contexts. Using the Operations and Maintenance Intelligence (OMIn) dataset, …

The Reasoning Boundary Paradox: How Reinforcement Learning Constrains Language Models Open

Phuc Nguyen, Chinh La, Duy M. H. Nguyen, Nitesh V. Chawla, Binh T. Nguyen, et al. · 2025

Reinforcement Learning with Verifiable Rewards (RLVR) has emerged as a key method for improving Large Language Models' reasoning capabilities, yet recent evidence suggests it may paradoxically shrink the reasoning boundary rather than expa…

ChemOrch: Empowering LLMs with Chemical Intelligence via Synthetic Instructions Open

Yi Huang, Zhicheng Jiang, Xiaonan Luo, K. H. Guo, Haomin Zhuang, et al. · 2025

Empowering large language models (LLMs) with chemical intelligence remains a challenge due to the scarcity of high-quality, domain-specific instruction-response datasets and the misalignment of existing synthetic data generation pipelines …

Exploring Conversational Design Choices in LLMs for Pedagogical Purposes: Socratic and Narrative Approaches for Improving Instructor's Teaching Practice Open

Si Chen, Isabel R. Molnar, Peiyu Li, Adam Acunin, Ting Hua, et al. · 2025

Large language models (LLMs) typically generate direct answers, yet they are increasingly used as learning tools. Studying instructors' usage is critical, given their role in teaching and guiding AI adoption in education. We designed and e…

AI Academy: Building Generative AI Literacy in Higher Ed Instructors Open

Si Chen, Alison Cheng, Nitesh V. Chawla, Ronald Metoyer · 2025

Generative AI is reshaping higher education, yet research has focused largely on students, while instructors remain understudied despite their central role in mediating adoption and modeling responsible use. We present the AI Acade…

National Running Club Database: Assessing Collegiate Club Athletes' Cross Country Race Results Open

Jonathan A. Karr, Ben Darden, Nicholas Pell, Roland G. Fryer, Kayla Ambrose, et al. · 2025

The National Running Club Database (NRCD) aggregates 15,397 race results of 5,585 athletes from the 2023 and 2024 cross country seasons. This paper introduces the NRCD dataset, which provides insights into individual athlete progressions, …

Combating Homelessness Stigma with LLMs: A New Multi-Modal Dataset for Bias Detection Open

Jonathan A. Karr, Benjamin F. Herbst, Hua Ting, Matthew Hauenstein, Georgina Curto, et al. · 2025

Homelessness is a persistent social challenge, impacting millions worldwide. Over 770,000 people experienced homelessness in the U.S. in 2024. Social stigmatization is a significant barrier to alleviation, shifting public perception, and i…

8th Workshop on Machine Learning in Finance Open

Saurabh Nagrecha, Isha Chaturvedi, Senthil Kumar, Nitesh V. Chawla, Mahashweta Das, et al. · 2025

Graph Foundation Models: Challenges, Methods, and Open Questions Open

Zehong Wang, Chuxu Zhang, Jundong Li, Nitesh V. Chawla, Yanfang Ye · 2025

Large Language Models as Innovators: A Framework to Leverage Latent Space Exploration for Novelty Discovery Open

Grzegorz Piotrowski, Nitesh V. Chawla, Tomasz Kajdanowicz · 2025

Innovative idea generation remains a core challenge in AI, as large language models (LLMs) often struggle to produce outputs that are both novel and relevant. Despite their fluency, LLMs tend to replicate patterns seen during training, lim…

Spectral Manifold Harmonization for Graph Imbalanced Regression Open

Brenda Nogueira, Gabriel dos Passos Gomes, Meng Jiang, Nitesh V. Chawla, Nuno Moniz · 2025

Graph-structured data is ubiquitous in scientific domains, where models often face imbalanced learning settings. In imbalanced regression, domain preferences focus on specific target value ranges that represent the most scientifically valu…

Class-aware contrastive optimization for imbalanced text classification Open

Grigorii Khvatskii, Nuno Moniz, Khoa D. Doan, Nitesh V. Chawla · 2025

The unique characteristics of text data make classification tasks a complex problem. Advances in unsupervised and semi-supervised learning and autoencoder architectures addressed several challenges. However, they still struggle with imbala…

Context Attribution with Multi-Armed Bandit Optimization Open

Deng Pan, Keerthiram Murugesan, Nuno Moniz, Nitesh V. Chawla · 2025

Understanding which parts of the retrieved context contribute to a large language model's generated answer is essential for building interpretable and trustworthy generative QA systems. We propose a novel framework that formulates context …

On The Design Choices of Next Level LLMs Open

Yijun Tian, Xingjian Diao, Ming Cheng, Chunhui Zhang, Jiang Gui, et al. · 2025

Differentially-private data synthetisation for efficient re-identification risk control Open

Tânia Carvalho, Nuno Moniz, Luís Antunes, Nitesh V. Chawla · 2025

Protecting user data privacy can be achieved via many methods, from statistical transformations to generative models. However, they all have critical drawbacks. For example, creating a transformed data set using traditional techniques is h…

ChemHGNN: A Hierarchical Hypergraph Neural Network for Reaction Virtual Screening and Discovery Open

Xiaobao Huang, Yihong Ma, Anjali Gurajapu, Jules Schleinitz, Zhichun Guo, et al. · 2025

Reaction virtual screening and discovery are fundamental challenges in chemistry and materials science, where traditional graph neural networks (GNNs) struggle to model multi-reactant interactions. In this work, we propose ChemHGNN, a hype…

Graph Foundation Models: A Comprehensive Survey Open

Zehong Wang, Zheyuan Liu, Tianyi Ma, Zheyuan Zhang, Xingbo Fu, et al. · 2025

Graph-structured data pervades domains such as social networks, biological systems, knowledge graphs, and recommender systems. While foundation models have transformed natural language processing, vision, and multimodal learning through la…

Intersectional Divergence: Measuring Fairness in Regression Open

Joe Germino, Nuno Moniz, Nitesh V. Chawla · 2025

Fairness in machine learning research is commonly framed in the context of classification tasks, leaving critical gaps in regression. In this paper, we propose a novel approach to measure intersectional fairness in regression tasks, going …

MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation Open

Zheyuan Zhang, Zehong Wang, Tianyi Ma, Varun Sameer Taneja, Steven W. Nelson, et al. · 2025

Social and economic predictors of under-five stunting in Mexico: a comprehensive approach through the XGB model Open

Brian J. Fogarty, Angélica Garcia-Martínez, Nitesh V. Chawla, Edson Serván‐Mori · 2025

Keywords: social and economic deprivation, stunting, children, machine learning, XGB model, Mexico.

Rethinking Evaluation in Compound Potency Prediction Open

Brenda Nogueira, Nuno Moniz, Connor W. Coley, Nitesh V. Chawla · 2025

Regression tasks are essential in many fields, including chemistry, where property prediction models are used to prioritize chemical compounds for experimental testing. In this context, it is common to maximize properties, such as potency,…

Ventana a la Verdad (Window to the Truth): A Chatbot Application for Navigating The Colombian Truth Commission's Archives Open

Anna A. Sokol, Matthew Sisk, Josefina A. Echavarria, Nitesh V. Chawla · 2025

Nitesh V. Chawla