Suhang Wang
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
Large Language Models (LLMs) trained with reinforcement learning and verifiable rewards have achieved strong results on complex reasoning tasks. Recent work extends this paradigm to a multi-agent setting, where a meta-thinking agent propos…
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph
Test-Time Scaling (TTS) improves large language models (LLMs) by allocating additional computation during inference, typically through parallel, sequential, or hybrid scaling. However, prior studies often assume fixed collaboration archite…
xTime: Extreme Event Prediction with Hierarchical Knowledge Distillation and Expert Fusion
Extreme events frequently occur in real-world time series and often carry significant practical implications. In domains such as climate and healthcare, these events, such as floods, heatwaves, or acute medical episodes, can lead to seriou…
Seeing but Not Believing: Probing the Disconnect Between Visual Attention and Answer Correctness in VLMs
Vision-Language Models (VLMs) achieve strong results on multimodal tasks such as visual question answering, yet they can still fail even when the correct visual evidence is present. In this work, we systematically investigate whether these…
TRAJECT-Bench: A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Large language model (LLM)-based agents increasingly rely on tool use to complete real-world tasks. While existing works evaluate the LLMs' tool use capability, they largely focus on the final answers yet overlook the detailed tool usage t…
SpecDetect: Simple, Fast, and Training-Free Detection of LLM-Generated Text via Spectral Analysis
The proliferation of high-quality text from Large Language Models (LLMs) demands reliable and efficient detection methods. While existing training-free approaches show promise, they often rely on surface-level statistics and overlook funda…
Generalize across Homophily and Heterophily: Hybrid Spectral Graph Pre-Training and Prompt Tuning
Graph "pre-training and prompt-tuning" aligns downstream tasks with pre-trained objectives to enable efficient knowledge transfer under limited supervision. However, existing methods rely on homophily-based low-frequency knowledge, faili…
A Survey on Small Language Models in the Era of Large Language Models: Architecture, Capabilities, and Trustworthiness
Are You Using Reliable Graph Prompts? Trojan Prompt Attacks on Graph Neural Networks
Bradley-Terry and Multi-Objective Reward Modeling Are Complementary
Reward models trained on human preference data have demonstrated strong effectiveness in aligning Large Language Models (LLMs) with human intent under the framework of Reinforcement Learning from Human Feedback (RLHF). However, RLHF remain…
Image Corruption-Inspired Membership Inference Attacks against Large Vision-Language Models
Large vision-language models (LVLMs) have demonstrated outstanding performance in many downstream tasks. However, LVLMs are trained on large-scale datasets, which can pose privacy risks if training images contain sensitive information. The…
SUA: Stealthy Multimodal Large Language Model Unlearning Attack
Multimodal Large Language Models (MLLMs) trained on massive data may memorize sensitive personal information and photos, posing serious privacy risks. To mitigate this, MLLM unlearning methods are proposed, which fine-tune MLLMs to reduce …
BioMol-MQA: A Multi-Modal Question Answering Dataset For LLM Reasoning Over Bio-Molecular Interactions
Retrieval-augmented generation (RAG) has shown great power in improving Large Language Models (LLMs). However, most existing RAG-based LLMs are dedicated to retrieving single-modality information, mainly text; while for many real-world pro…
Attention Knows Whom to Trust: Attention-based Trust Management for LLM Multi-Agent Systems
Large Language Model-based Multi-Agent Systems (LLM-MAS) have demonstrated strong capabilities in solving complex tasks but remain vulnerable when agents receive unreliable messages. This vulnerability stems from a fundamental gap: LLM age…
Unlearning Inversion Attacks for Graph Neural Networks
Graph unlearning methods aim to efficiently remove the impact of sensitive data from trained GNNs without full retraining, assuming that deleted information cannot be recovered. In this work, we challenge this assumption by introducing the…
Keeping an Eye on LLM Unlearning: The Hidden Risk and Remedy
Although Large Language Models (LLMs) have demonstrated impressive capabilities across a wide range of tasks, growing concerns have emerged over the misuse of sensitive, copyrighted, or harmful data during training. To address these concer…
GPR: Empowering Generation with Graph-Pretrained Retriever
Graph retrieval-augmented generation (GRAG) places high demands on graph-specific retrievers. However, existing retrievers often rely on language models pretrained on plain text, limiting their effectiveness due to domain misalignment and …
Bridging Source and Target Domains via Link Prediction for Unsupervised Domain Adaptation on Graphs
Graph neural networks (GNNs) have shown great ability for node classification on graphs. However, the success of GNNs relies on abundant labeled data, while obtaining high-quality labels is costly and challenging, especially for newly emer…
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking
Knowledge Graph Question Answering (KGQA) systems rely on high-quality benchmarks to evaluate complex multi-hop reasoning. However, despite their widespread use, popular datasets such as WebQSP and CWQ suffer from critical quality issues, …
Fairness-aware Prompt Tuning for Graph Neural Networks
Towards Graph Foundation Models: A Transferability Perspective
In recent years, Graph Foundation Models (GFMs) have gained significant attention for their potential to generalize across diverse graph domains and tasks. Some works focus on Domain-Specific GFMs, which are designed to address a variety o…
A General Framework to Enhance Fine-tuning-based LLM Unlearning
Unlearning has been proposed to remove copyrighted and privacy-sensitive data from Large Language Models (LLMs). Existing approaches primarily rely on fine-tuning-based methods, which can be categorized into gradient ascent-based (GA-based…
How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities
Search plays a fundamental role in problem-solving across various domains, with most real-world decision-making problems being solvable through systematic search. Drawing inspiration from recent discussions on search and learning, we syste…
Stepwise Perplexity-Guided Refinement for Efficient Chain-of-Thought Reasoning in Large Language Models
Chain-of-Thought (CoT) reasoning, which breaks down complex tasks into intermediate reasoning steps, has significantly enhanced the performance of large language models (LLMs) on challenging tasks. However, the detailed reasoning process i…
LanP: Rethinking the Impact of Language Priors in Large Vision-Language Models
Large Vision-Language Models (LVLMs) have shown impressive performance in various tasks. However, LVLMs suffer from hallucination, which hinders their adoption in the real world. Existing studies have emphasized that the strong language priors …
Graph-based Molecular In-context Learning Grounded on Morgan Fingerprints
In-context learning (ICL) effectively conditions large language models (LLMs) for molecular tasks, such as property prediction and molecule captioning, by embedding carefully selected demonstration examples into the input prompt. This appr…
Counterfactual Learning on Graphs: A Survey
Graph-structured data are pervasive in the real world, such as social networks, molecular graphs, and transaction networks. Graph neural networks (GNNs) have achieved great success in representation learning on graphs, facilitating various d…
GAMIC: Graph-Aligned Molecular In-context Learning for Molecule Analysis via LLMs
Divide-Verify-Refine: Can LLMs Self-align with Complex Instructions?
Learning with Less: Knowledge Distillation from Large Language Models via Unlabeled Data