Xuansheng Wu
Soundness-Aware Level: A Microscopic Signature that Predicts LLM Reasoning Potential
Reinforcement learning with verifiable rewards (RLVR) can elicit strong reasoning in large language models (LLMs), while their performance after RLVR varies dramatically across different base models. This raises a fundamental question: wha…
Enhancing LLM Steering through Sparse Autoencoder-Based Vector Refinement
Steering has emerged as a promising approach in controlling large language models (LLMs) without modifying model parameters. However, most existing steering methods rely on large-scale datasets to learn clear behavioral information, which …
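For intuition, the simplest form of activation steering adds a scaled concept direction to a hidden state at inference time, leaving model weights untouched. The sketch below is a generic illustration only; the hook point, the scale alpha, and the source of the concept vector are assumptions, not this paper's refinement method:

```python
import numpy as np

def steer(hidden_state: np.ndarray, concept_vector: np.ndarray, alpha: float = 4.0) -> np.ndarray:
    """Shift a hidden state along a concept direction without modifying model parameters."""
    # Normalize the direction so alpha directly controls the magnitude of the shift.
    direction = concept_vector / np.linalg.norm(concept_vector)
    return hidden_state + alpha * direction

# Toy usage: an 8-dim "hidden state" nudged toward a random concept direction.
h = np.random.randn(8)
v = np.random.randn(8)
print(steer(h, v))
```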
Concept-Centric Token Interpretation for Vector-Quantized Generative Models
Vector-Quantized Generative Models (VQGMs) have emerged as powerful tools for image generation. However, the key component of VQGMs -- the codebook of discrete tokens -- is still not well understood, e.g., which tokens are critical to gene…
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering
Linear concept vectors effectively steer LLMs, but existing methods suffer from noisy features in diverse datasets that undermine steering robustness. We propose Sparse Autoencoder-Denoised Concept Vectors (SDCV), which selectively keep th…
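The abstract truncates before naming the selection rule, so the following is only a hypothetical sketch of SAE-based vector denoising under one common assumption: project the concept vector into the SAE latent space, keep the top-k latents by activation magnitude, and decode back. Random matrices stand in for a trained SAE:

```python
import numpy as np

def denoise_concept_vector(v, W_enc, W_dec, k=32):
    """Encode a concept vector with an SAE, keep the top-k latents
    by magnitude, and decode the sparsified code back to model space."""
    z = np.maximum(W_enc @ v, 0.0)      # ReLU SAE encoding
    keep = np.argsort(np.abs(z))[-k:]   # indices of the top-k latents
    mask = np.zeros_like(z)
    mask[keep] = 1.0
    return W_dec @ (z * mask)           # decode only the kept latents

# Toy usage with random weights standing in for a trained SAE.
d_model, d_sae = 64, 512
rng = np.random.default_rng(0)
W_enc = rng.normal(size=(d_sae, d_model))
W_dec = rng.normal(size=(d_model, d_sae))
v = rng.normal(size=d_model)
print(denoise_concept_vector(v, W_enc, W_dec).shape)  # (64,)
```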
Self-Regularization with Sparse Autoencoders for Controllable LLM-based Classification
Modern text classification methods heavily rely on contextual embeddings from large language models (LLMs). Compared to human-engineered features, these embeddings provide automatic and effective representations for classification model tr…
Beyond Input Activations: Identifying Influential Latents by Gradient Sparse Autoencoders
A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models
The prevalence of vision-threatening eye diseases is a significant global burden, with many cases remaining undiagnosed or diagnosed too late for effective treatment. Large vision-language models (LVLMs) have the potential to assist in und…
Unveiling Scoring Processes: Dissecting the Differences between LLMs and Human Graders in Automatic Scoring
Large language models (LLMs) have demonstrated strong potential in performing automatic scoring for constructed response assessments. While constructed responses graded by humans are usually based on given grading rubrics, the methods by w…
DIRECT: Dual Interpretable Recommendation with Multi-aspect Word Attribution
Recommending products to users with intuitive explanations helps improve the system in transparency, persuasiveness, and satisfaction. Existing interpretation techniques include post hoc methods and interpretable modeling. The former categ…
Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering
Large Language Models (LLMs) have shown proficiency in question-answering tasks but often struggle to integrate real-time knowledge, leading to potentially outdated or inaccurate responses. This problem becomes even more challenging when d…
InFoBench: Evaluating Instruction Following Ability in Large Language Models
This paper introduces the Decomposed Requirements Following Ratio (DRFR), a new metric for evaluating Large Language Models' (LLMs) ability to follow instructions. Addressing a gap in current methodologies, DRFR breaks down complex instruc…
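As a ratio over decomposed requirements, the metric itself is simple to compute. A minimal sketch, assuming a per-response ratio over boolean requirement judgments (the paper's exact aggregation across instructions is not visible in the truncated abstract):

```python
def drfr(requirement_judgments: list[bool]) -> float:
    """Decomposed Requirements Following Ratio: the fraction of an
    instruction's decomposed requirements that a response satisfies."""
    if not requirement_judgments:
        raise ValueError("need at least one requirement judgment")
    return sum(requirement_judgments) / len(requirement_judgments)

# Toy usage: 3 of 4 decomposed requirements satisfied -> DRFR of 0.75.
print(drfr([True, True, False, True]))
```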
Applying Large Language Models and Chain-of-Thought for Automatic Scoring
This study investigates the application of large language models (LLMs), specifically GPT-3.5 and GPT-4, with Chain-of-Thought (CoT) in the automatic scoring of student-written responses to science assessments. We focused on overcoming the …
Could Small Language Models Serve as Recommenders? Towards Data-centric Cold-start Recommendations
Recommendation systems help users find matched items based on their previous behaviors. Personalized recommendation becomes challenging in the absence of historical user-item interactions, a practical problem for startups known as the syst…
AGI: Artificial General Intelligence for Education
Artificial general intelligence (AGI) has gained global recognition as a future technology due to the emergence of breakthrough large language models and chatbots such as GPT-4 and ChatGPT, respectively. Compared to conventional AI models,…
Black-box Backdoor Defense via Zero-shot Image Purification
Backdoor attacks inject poisoned samples into the training data, resulting in the misclassification of the poisoned input during a model's deployment. Defending against such attacks is challenging, especially for real-world black-box model…
A Survey of Graph Prompting Methods: Techniques, Applications, and Challenges
The recent "pre-train, prompt, predict training" paradigm has gained popularity as a way to learn generalizable models with limited labeled data. The approach involves using a pre-trained model and a prompting function that applies a templ…
NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence Representation
We propose a novel non-parametric, untrainable language model, named the Non-Parametric Pairwise Attention Random Walk Model (NoPPA), to generate sentence embeddings using only pre-trained word embeddings and pre-counted word frequencies. To the be…
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education
Developing models to automatically score students' written responses to science problems is critical for science education. However, collecting and labeling sufficient student responses for training models is time-consuming and costly. Recen…