Explanipedia

On Evaluating LLM Alignment by Evaluating LLMs as Judges Open

Liu, Yixin, Liu Peng-fei, Cohan, Arman · 2025

Alignment with human preferences is an important evaluation aspect of LLMs, requiring them to be helpful, honest, safe, and to precisely follow human instructions. Evaluating large language models' (LLMs) alignment typically involves direc…

On Evaluating LLM Alignment by Evaluating LLMs as Judges Open

Liu, Yixin, Liu Peng-fei, Cohan, Arman · 2025

Alignment with human preferences is an important evaluation aspect of LLMs, requiring them to be helpful, honest, safe, and to precisely follow human instructions. Evaluating large language models' (LLMs) alignment typically involves direc…

Signatures of magnetism in zigzag graphene nanoribbon embedded in h-BN lattice Open

Jiang Cheng-xin, Wang Hui-shan, Chen Chen, Chen Ling-Xiu, Wang Xiujun , et al. · 2025

Zigzag edges of graphene have long been predicted to exhibit magnetic electronic state near the Fermi level, which can cause spin-related phenomena and offer unique potentials for graphene-based spintronics. However, the magnetic conductio…

Correcting False Alarms from Unseen: Adapting Graph Anomaly Detectors at Test Time Open

Pan Junjun, Liu Yixin, Zhou, Chuan, Xiong Fei, Liew, Alan Wee-Chung , et al. · 2025

Graph anomaly detection (GAD), which aims to detect outliers in graph-structured data, has received increasing research attention recently. However, existing GAD methods assume identical training and testing distributions, which is rarely …

Correcting False Alarms from Unseen: Adapting Graph Anomaly Detectors at Test Time Open

Pan Junjun, Liu, Yixin, Zhou Chuan, Xiong, Fei, Liew, Alan Wee-Chung , et al. · 2025

Graph anomaly detection (GAD), which aims to detect outliers in graph-structured data, has received increasing research attention recently. However, existing GAD methods assume identical training and testing distributions, which is rarely …

Correcting False Alarms from Unseen: Adapting Graph Anomaly Detectors at Test Time Open

Pan Jun-jun, Liu Yixin, Zhou Chuan, Xiong Fei, Liew Alan Wee-Chung , et al. · 2025

Graph anomaly detection (GAD), which aims to detect outliers in graph-structured data, has received increasing research attention recently. However, existing GAD methods assume identical training and testing distributions, which is rarely …

Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems Open

Association for Computational Linguistics 2025, Dai, Yiwei, Liu, Yixin, Miao Rui, Pan, Shiri , et al. · 2025

The communication topology in large language model-based multi-agent systems fundamentally governs inter-agent collaboration patterns, critically shaping both the efficiency and effectiveness of collective decision-making. While recent stu…

CourtReasoner: Can LLM Agents Reason Like Judges? Open

Association for Computational Linguistics 2025, Cohan, Arman, Han, Simeng, Knowlton, Sonia, Liu, Yixin , et al. · 2025

LLMs are increasingly applied in the legal domain in tasks such as summarizing legal texts and providing basic legal advice. Yet, their capacity to draft full judicial analyses in U.S. court opinions is still largely uncharted, such as gen…

Agentic AutoSurvey: Let LLMs Survey LLMs Open

Liu, Yixin, Wu, Yonghui, Zhang Deng-hui, Sun, Lichao · 2025

The exponential growth of scientific literature poses unprecedented challenges for researchers attempting to synthesize knowledge across rapidly evolving fields. We present \textbf{Agentic AutoSurvey}, a multi-agent framework for automated…

BlindGuard: Safeguarding LLM-based Multi-Agent Systems under Unknown Attacks Open

Miao Rui, Liu, Yixin, Wang Yi-li, Shen Xu, Tan Yue , et al. · 2025

The security of LLM-based multi-agent systems (MAS) is critically threatened by propagation vulnerability, where malicious agents can distort collective decision-making through inter-agent message interactions. While existing supervised de…

Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent Systems Open

Shen Xu, Liu, Yixin, Dai, Yiwei, Wang Yi-li, Miao Rui , et al. · 2025

The communication topology in large language model-based multi-agent systems fundamentally governs inter-agent collaboration patterns, critically shaping both the efficiency and effectiveness of collective decision-making. While recent stu…

NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes Open

Xu Tianyang, Zheng Haojie, Li, Chengze, Chen, Haoxiang, Liu, Yixin , et al. · 2025

Retrieval-augmented generation (RAG) empowers large language models to access external and private corpus, enabling factually consistent responses in specific domains. By exploiting the inherent structure of the corpus, graph-based RAG met…

Could AI Trace and Explain the Origins of AI-Generated Images and Text? Open

Fang Hongchao, Liu, Yixin, Du, Jiangshu, Qin, Can, Xu, Ran , et al. · 2025

AI-generated content is becoming increasingly prevalent in the real world, leading to serious ethical and societal concerns. For instance, adversaries might exploit large multimodal models (LMMs) to create images that violate ethical or le…

PHYSICS: Benchmarking Foundation Models on University-Level Physics Problem Solving Open

Feng Kaiyue, Zhao Yilun, Liu, Yixin, Yang Tian-yu, Zhao Chen , et al. · 2025

We introduce PHYSICS, a comprehensive benchmark for university-level physics problem solving. It contains 1297 expert-annotated problems covering six core areas: classical mechanics, quantum mechanics, thermodynamics and statistical mechan…

Liu Yixin YOU? Author Swipe