Gholamreza Haffari
Towards Inference-time Scaling for Continuous Space Reasoning
Inference-time scaling through multiple sample generation in combination with Process- or Outcome-Reward Model (PRM or ORM) re-ranking has proven effective for text-based reasoning in large language models. This paper investigates whether …
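As background for this abstract, here is a minimal, self-contained sketch of the general paradigm it refers to (generate multiple candidates, then re-rank them with a reward model); it is not the paper's method, and `generate_candidate` / `score_with_reward_model` are hypothetical stand-ins rather than functions from the paper.

```python
# Best-of-N sampling with reward-model re-ranking: the generic
# inference-time-scaling recipe referenced in the abstract above.
import random
from typing import Callable, List


def best_of_n(
    prompt: str,
    generate_candidate: Callable[[str], str],
    score_with_reward_model: Callable[[str, str], float],
    n: int = 8,
) -> str:
    """Sample n candidate solutions and return the one the reward model scores highest."""
    candidates: List[str] = [generate_candidate(prompt) for _ in range(n)]
    scores = [score_with_reward_model(prompt, c) for c in candidates]
    best_index = max(range(n), key=lambda i: scores[i])
    return candidates[best_index]


if __name__ == "__main__":
    # Toy stand-ins so the sketch runs end to end. In practice the generator is an LLM,
    # and the scorer is an ORM (scores the final answer) or a PRM (scores reasoning steps).
    demo_generate = lambda p: f"answer-{random.randint(0, 100)}"
    demo_score = lambda p, c: random.random()
    print(best_of_n("2 + 2 = ?", demo_generate, demo_score))
```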
Beyond Imitation: Recovering Dense Rewards from Demonstrations
Conventionally, supervised fine-tuning (SFT) is treated as a simple imitation learning process that only trains a policy to imitate expert behavior on demonstration datasets. In this work, we challenge this view by establishing a fundament…
G-reasoner: Foundation Models for Unified Reasoning over Graph-structured Knowledge
Large language models (LLMs) excel at complex reasoning but remain limited by static and incomplete parametric knowledge. Retrieval-augmented generation (RAG) mitigates this by incorporating external knowledge, yet existing RAGs struggle w…
Physics-Grounded Motion Forecasting via Equation Discovery for Trajectory-Guided Image-to-Video Generation
Recent advances in diffusion-based and autoregressive video generation models have achieved remarkable visual realism. However, these models typically lack accurate physical alignment, failing to replicate real-world dynamics in object mot…
Table-r1: Self-Supervised and Reinforcement Learning for Program-Based Table Reasoning in Small Language Models
Table reasoning (TR) requires structured reasoning over semi-structured tabular data and remains challenging, particularly for small language models (SLMs, e.g., LLaMA-8B) due to their limited capacity compared to large LMs (LLMs, e.g., GP…
Continual Speech Learning with Fused Speech Features
Rapid growth in speech data demands adaptive models, as traditional static methods fail to keep pace with dynamic and diverse speech information. We introduce continuous speech learning, a new setup aimed at bridging the adaptation ga…
Reshaping Representation Space to Balance the Safety and Over-rejection in Large Audio Language Models
Large Audio Language Models (LALMs) have extended the capabilities of Large Language Models (LLMs) by enabling audio-based human interactions. However, recent research has revealed that LALMs remain vulnerable to harmful queries due to ins…
RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars
Alignment tuning is crucial for ensuring large language models (LLMs) behave ethically and helpfully. Current alignment approaches require high-quality annotations and significant training resources. This paper proposes a low-cost, tuning-…
ACCESS: A Benchmark for Abstract Causal Event Discovery and Reasoning
Identifying cause-and-effect relationships is critical to understanding real-world dynamics and ultimately causal reasoning. Existing methods for identifying event causality in NLP, including those based on Large Language Models (LLMs), ex…
Unbiased Sliced Wasserstein Kernels for High-Quality Audio Captioning
Teacher-forcing training for audio captioning usually leads to exposure bias due to the mismatch between training and inference. Prior work proposes the contrastive method to deal with caption degeneration. However, the contrastive method ignores the …
GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation
Retrieval-augmented generation (RAG) has proven effective in integrating knowledge into large language models (LLMs). However, conventional RAGs struggle to capture complex relationships between pieces of knowledge, limiting their performa…
SCAR: Data Selection via Style Consistency-Aware Response Ranking for Efficient Instruction-Tuning of Large Language Models
CultureInstruct: Curating Multi-Cultural Instructions at Scale
IRIS: An Iterative and Integrated Framework for Verifiable Causal Discovery in the Absence of Tabular Data
Conversational SimulMT: Efficient Simultaneous Translation with Large Language Models
Continual Learning of Large Language Models
SurveyPilot: an Agentic Framework for Automated Human Opinion Collection from Social Media
Zero-Shot Privacy-Aware Text Rewriting via Iterative Tree Search
NAP2: A Benchmark for Naturalness and Privacy-Preserving Text Rewriting by Learning from Human
SG-FSM: A Self-Guiding Zero-Shot Prompting Paradigm for Multi-Hop Question Answering Based on Finite State Machine
(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts
Literary translation remains one of the most challenging frontiers in machine translation due to the complexity of capturing figurative language, cultural nuances, and unique stylistic elements. In this work, we introduce TransAgents, a n…
Discrete Minds in a Continuous World: Do Language Models Know Time Passes?
Extending LLMs to New Languages: A Case Study of Llama and Persian Adaptation
Large language models (LLMs) have made great progress in classification and text generation tasks. However, they are mainly trained on English data and often struggle with low-resource languages. In this study, we explore adding a new lang…
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models
Large Multimodal Models (LMMs) have achieved strong performance across a range of vision and language tasks. However, their spatial reasoning capabilities are under-investigated. In this paper, we construct a novel VQA dataset, Spatial-MM,…
Audio Is the Achilles' Heel: Red Teaming Audio Large Multimodal Models
Large Multimodal Models (LMMs) have demonstrated the ability to interact with humans under real-world conditions by combining Large Language Models (LLMs) and modality encoders to align multimodal information (visual and auditory) with tex…
The Best of Both Worlds: Bridging Quality and Diversity in Data Selection with Bipartite Graph
The performance of large language models (LLMs) is strongly influenced by the quality and diversity of data used during supervised fine-tuning (SFT). However, current data selection methods often prioritize one aspect over the other, resul…
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models
Large language models (LLMs) have demonstrated impressive reasoning abilities, but they still struggle with faithful reasoning due to knowledge gaps and hallucinations. To address these issues, knowledge graphs (KGs) have been utilized to …