Fali Wang
Diagnosing and Addressing Pitfalls in KG-RAG Datasets: Toward More Reliable Benchmarking
Knowledge Graph Question Answering (KGQA) systems rely on high-quality benchmarks to evaluate complex multi-hop reasoning. However, despite their widespread use, popular datasets such as WebQSP and CWQ suffer from critical quality issues, …
SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
This work revisits the dominant supervised fine-tuning (SFT) then reinforcement learning (RL) paradigm for training Large Vision-Language Models (LVLMs), and reveals a key finding: SFT can significantly undermine subsequent RL by inducing …
HC-GST: Heterophily-aware Distribution Consistency based Graph Self-training
Graph self-training (GST), which selects and assigns pseudo-labels to unlabeled nodes, is a popular approach to tackling label sparsity in graphs. However, recent studies on homophilous graphs show that GST methods can introduce and amplify distributi…
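For context, the pseudo-labeling step that GST methods share can be sketched in a few lines of PyTorch. This is a minimal, generic illustration (the function name, confidence threshold, and toy sizes are invented here), not HC-GST itself, whose heterophily-aware distribution-consistency selection is the paper's contribution:

```python
import torch

def select_pseudo_labels(logits, unlabeled_idx, threshold=0.9):
    # Generic self-training step: keep unlabeled nodes whose top
    # predicted class probability clears a confidence threshold,
    # and use the argmax class as the pseudo-label for the next
    # training round.
    probs = torch.softmax(logits[unlabeled_idx], dim=-1)
    conf, pred = probs.max(dim=-1)
    keep = conf >= threshold
    return unlabeled_idx[keep], pred[keep]

# Toy usage: 10 nodes, 3 classes, nodes 4-9 unlabeled.
logits = torch.randn(10, 3)
unlabeled_idx = torch.arange(4, 10)
new_idx, new_labels = select_pseudo_labels(logits, unlabeled_idx)
```

The distribution-shift issue the abstract raises arises exactly here: thresholding by confidence alone can select a set of pseudo-labeled nodes whose class or structural distribution diverges from the true one.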
Enhance Graph Alignment for Large Language Models
Graph-structured data is prevalent in the real world. Recently, owing to their powerful emergent capabilities, Large Language Models (LLMs) have shown promising performance in modeling graphs. The key to effectively applying LLMs on graphs is …
Distribution Consistency based Self-Training for Graph Neural Networks with Sparse Labels
Few-shot node classification poses a significant challenge for Graph Neural Networks (GNNs) due to insufficient supervision and potential distribution shifts between labeled and unlabeled nodes. Self-training has emerged as a widely pop…
InfuserKI: Enhancing Large Language Models with Knowledge Graphs via Infuser-Guided Knowledge Integration
Large Language Models (LLMs) have achieved exceptional capabilities in open generation across various domains, yet they encounter difficulties with knowledge-intensive tasks. To address these challenges, methods for integratin…
Maximum Entropy Loss, the Silver Bullet Targeting Backdoor Attacks in Pre-trained Language Models
Pre-trained language models (PLMs) can be stealthily misled to produce target outputs by backdoor attacks when they encounter poisoned samples, with no performance degradation on clean samples. The stealthiness of backdoor attacks is commonly attained…
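As a rough, generic sketch of the loss named in the title, a maximum-entropy term can be written in a few lines of PyTorch. This is not necessarily the exact form or training schedule the paper uses (the function name and toy batch are invented here); it only shows the standard idea of penalizing overconfident predictions:

```python
import torch
import torch.nn.functional as F

def max_entropy_loss(logits):
    # Negative Shannon entropy of the predicted distribution.
    # Minimizing this term maximizes entropy, pushing predictions
    # toward uniform; the minimum value is -log(num_classes),
    # attained at the uniform distribution.
    log_probs = F.log_softmax(logits, dim=-1)
    return (log_probs.exp() * log_probs).sum(dim=-1).mean()

# Toy usage: confident logits give a value near 0 (entropy near 0);
# uniform logits give the minimum, -log(num_classes).
confident = torch.tensor([[8.0, 0.0, 0.0]])
uniform = torch.zeros(1, 3)
print(max_entropy_loss(confident))  # close to 0
print(max_entropy_loss(uniform))    # approx -log(3) = -1.0986
```

The intuition for the defense setting: a backdoored model is extremely confident on trigger-bearing inputs, so an entropy-maximizing term directly counteracts the overconfident shortcut the trigger exploits.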