Fact or Facsimile? Evaluating the Factual Robustness of Modern Retrievers

Hong Wei Wu, Qingcheng Zeng, Kaize Ding · 2025

Uncertainty Quantification for Multiple-Choice Questions is Just One-Token Deep

Qingcheng Zeng, Mingyu Jin, Qinkai Yu, Z.G. Wang, Wenyue Hua, et al. · 2025

Revisiting Multivariate Time Series Forecasting with Missing Values

Jie Yang, Y. Hu, Kexin Zhang, L. L. Niu, Yushun Dong, et al. · 2025

Missing values are common in real-world time series, and multivariate time series forecasting with missing values (MTSF-M) has become a crucial area of research for ensuring reliable predictions. To address the challenge of missing data, c…

AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering

Ziqing Wang, Chengsheng Mao, Xiaole Wen, Yuan Luo, Kaize Ding · 2025

Medical Multimodal Large Language Models (Med-MLLMs) have shown great promise in medical visual question answering (Med-VQA). However, when deployed in low-resource settings where abundant labeled data are unavailable, existing Med-MLLMs c…

Fact or Facsimile? Evaluating the Factual Robustness of Modern Retrievers

Haoyu Wu, Qingcheng Zeng, Kaize Ding · 2025

Dense retrievers and rerankers are central to retrieval-augmented generation (RAG) pipelines, where accurately retrieving factual information is crucial for maintaining system trustworthiness and defending against RAG poisoning. However, l…

RelKD 2025: The Third International Workshop on Resource-Efficient Learning for Knowledge Discovery

Chuxu Zhang, Kaize Ding, Jundong Li, Dongkuan Xu, Haoyu Wang, et al. · 2025

A Survey on Model Extraction Attacks and Defenses for Large Language Models

Kaixiang Zhao, Lincan Li, Kaize Ding, Neil Zhenqiang Gong, Yue Zhao, et al. · 2025

Data-Efficient Graph Learning

Kaize Ding · 2025

A Survey on Model Extraction Attacks and Defenses for Large Language Models

Lincan Li, Kaize Ding, Neil Zhenqiang Gong, Yushun Dong · 2025

Model extraction attacks pose significant security threats to deployed language models, potentially compromising intellectual property and user privacy. This survey provides a comprehensive taxonomy of LLM-specific extraction attacks and d…

Cross-Domain Conditional Diffusion Models for Time Series Imputation

Kexin Zhang, Baoyu Jing, K. Selçuk Candan, Dawei Zhou, Qingsong Wen, et al. · 2025

Cross-domain time series imputation is an underexplored data-centric research task that presents significant challenges, particularly when the target domain suffers from high missing rates and domain shifts in temporal dynamics. Existing t…

Resource-Efficient Learning for the Web

Chuxu Zhang, Kaize Ding, Jundong Li, Dongkuan Xu, Haoyu Wang, et al. · 2025

RelWeb 2025: The International Workshop on Resource-Efficient Learning for the Web

Chuxu Zhang, Kaize Ding, Jundong Li, Dongkuan Xu, Haoyu Wang, et al. · 2025

Histone methyltransferase SMYD2 regulates the activation of hepatic stellate cells by activating TLR4 signaling

Kaize Ding, Rujia Xie, Bing Han, Huiling Zheng, Tian Tian · 2025

Liver fibrosis represents a pathological outcome in the progression of chronic liver diseases, primarily driven by the activation of hepatic stellate cells (HSCs) induced by various chronic liver injury factors. Substantial evidence indica…

Survey of Uncertainty Estimation in Large Language Models - Sources, Methods, Applications, and Challenge

Jianfeng He, Linlin Yu, Changbin Li, Runing Yang, Fanglan Chen, et al. · 2025

Large Language Models (LLMs) have demonstrated exceptional performance across a wide range of domains, including everyday life, finance, law, and healthcare. However, inaccurate LLM generation has led to significant penalties in sensitive…

A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments

Kaixiang Zhao, Lincan Li, Kaize Ding, Neil Zhenqiang Gong, Yue Zhao, et al. · 2025

Model Extraction Attacks (MEAs) threaten modern machine learning systems by enabling adversaries to steal models, exposing intellectual property and training data. With the increasing deployment of machine learning models in distributed co…

AD-LLM: Benchmarking Large Language Models for Anomaly Detection

Tiankai Yang, Yi Nian, Li Li, Ruiyao Xu, Yuangang Li, et al. · 2025

ALERT: An LLM-powered Benchmark for Automatic Evaluation of Recommendation Explanations

Yichuan Li, Xinyang Zhang, Chenwei Zhang, Mao Li, Tianyi Liu, et al. · 2025

Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey

Ruiyao Xu, Kaize Ding · 2025

Explaining Length Bias in LLM-Based Preference Evaluations

Zhengyu Hu, Linxin Song, Jieyu Zhang, Zheyuan Xiao, Baiying Lei, et al. · 2025

AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering

Ziqing Wang, Chengsheng Mao, Xiaole Wen, Yuan Luo, Kaize Ding · 2025

RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response

Junyu Luo, Xiao Luo, Kaize Ding, Jingyang Yuan, Zhiping Xiao, et al. · 2024

Supervised fine-tuning (SFT) plays a crucial role in adapting large language models (LLMs) to specific domains or tasks. However, as demonstrated by empirical experiments, the collected data inevitably contains noise in practical applicati…

AD-LLM: Benchmarking Large Language Models for Anomaly Detection

Tiankai Yang, Yi Nian, Songnian Li, Ruiyao Xu, Yuangang Li, et al. · 2024

Anomaly detection (AD) is an important machine learning task with many real-world uses, including fraud detection, medical diagnosis, and industrial monitoring. Within natural language processing (NLP), AD helps detect issues like spam, mi…

Political-LLM: Large Language Models in Political Science

Lincan Li, Jiaqi Li, Catherine Chen, Fred Gui, Hongjia Yang, et al. · 2024

In recent years, large language models (LLMs) have been widely adopted in political science tasks such as election prediction, sentiment analysis, policy impact assessment, and misinformation detection. Meanwhile, the need to systematicall…

Fusion Matters: Learning Fusion in Deep Click-through Rate Prediction Models

Kexin Zhang, Fuyuan Lyu, Xing Tang, Dugang Liu, Chen Ma, et al. · 2024

The evolution of previous Click-Through Rate (CTR) models has mainly been driven by proposing complex components, whether shallow or deep, that are adept at modeling feature interactions. However, there has been less focus on improving fus…

A Survey of Deep Graph Learning under Distribution Shifts: from Graph Out-of-Distribution Generalization to Adaptation

Kexin Zhang, Shuhan Liu, Song Wang, Weili Shi, Chen Chen, et al. · 2024

Distribution shifts on graphs -- the discrepancies in data distribution between training and employing a graph machine learning model -- are ubiquitous and often unavoidable in real-world scenarios. These shifts may severely deteriorate mo…

LEGO-Learn: Label-Efficient Graph Open-Set Learning

Haoyan Xu, Kay Liu, Z. P. Yao, Philip S. Yu, Kaize Ding, et al. · 2024

How can we train graph-based models to recognize unseen classes while keeping labeling costs low? Graph open-set learning (GOL) and out-of-distribution (OOD) detection aim to address this challenge by training models that can accurately cl…

Data‐efficient graph learning: Problems, progress, and prospects

Kaize Ding, Yixin Liu, Chuxu Zhang, Jian-Ling Wang · 2024

Graph‐structured data, ranging from social networks to financial transaction networks, from citation networks to gene regulatory networks, have been widely used for modeling a myriad of real‐world systems. As a prevailing model architectur…

Let's Ask GNN: Empowering Large Language Model for Graph In-Context Learning

Zhengyu Hu, Yichuan Li, Zhengyu Chen, Jingang Wang, Han Liu, et al. · 2024

Textual Attributed Graphs (TAGs) are crucial for modeling complex real-world systems, yet leveraging large language models (LLMs) for TAGs presents unique challenges due to the gap between sequential text processing and graph-structured da…

Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey

Ruiyao Xu, Kaize Ding · 2024

Detecting anomalies or out-of-distribution (OOD) samples is critical for maintaining the reliability and trustworthiness of machine learning systems. Recently, Large Language Models (LLMs) have demonstrated their effectiveness not only in …

Mastering Long-Tail Complexity on Graphs: Characterization, Learning, and Generalization

Haohui Wang, Baoyu Jing, Kaize Ding, Yada Zhu, Wei Cheng, et al. · 2024

In the context of long-tail classification on graphs, the vast majority of existing work primarily revolves around the development of model debiasing strategies, intending to mitigate class imbalances and enhance the overall performance. D…
