Yunpu Ma
Beyond Magic Words: Sharpness-Aware Prompt Evolving for Robust Large Language Models with TARE
The performance of Large Language Models (LLMs) hinges on carefully engineered prompts. However, prevailing prompt optimization methods, ranging from heuristic edits and reinforcement learning to evolutionary search, primarily target point…
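Although the abstract is truncated here, the "sharpness-aware" framing suggests preferring prompts that remain strong under small perturbations rather than prompts tuned to a single point. A minimal sketch of that general idea, assuming hypothetical `score` and `perturb` helpers (e.g. an accuracy probe and a paraphraser); this is not the paper's algorithm:

```python
def sharpness_aware_score(prompt, task_examples, score, perturb, k=8):
    """Score a prompt by its worst case over a neighborhood of rewrites.

    `score(prompt, examples) -> float` and `perturb(prompt) -> str` are
    hypothetical helpers. The idea is to favor prompts sitting in flat,
    robust regions of prompt space instead of sharp single points.
    """
    base = score(prompt, task_examples)
    neighborhood = [perturb(prompt) for _ in range(k)]
    worst = min(score(p, task_examples) for p in neighborhood)
    # A large gap between base and worst signals a "sharp", fragile prompt.
    return worst, base - worst
```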
First Experience with Real-Time Control Using Simulated VQC-Based Quantum Policies
This paper investigates the integration of quantum computing into offline reinforcement learning and the deployment of the resulting quantum policy in a real-time control hardware realization of the cart-pole system. Variational Quantum Ci…
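For readers unfamiliar with variational quantum circuits, a generic VQC policy for the four-dimensional cart-pole observation might look like the following PennyLane sketch. The ansatz, layer count, and action rule are illustrative assumptions, not the paper's design:

```python
import pennylane as qml
from pennylane import numpy as np

n_qubits = 4
dev = qml.device("default.qubit", wires=n_qubits)

@qml.qnode(dev)
def policy_circuit(obs, weights):
    # Angle-encode the 4 cart-pole observations onto 4 qubits.
    for i in range(n_qubits):
        qml.RY(obs[i], wires=i)
    # Variational layers: trainable single-qubit rotations plus entanglement.
    for layer in weights:
        for i in range(n_qubits):
            qml.Rot(*layer[i], wires=i)
        for i in range(n_qubits):
            qml.CNOT(wires=[i, (i + 1) % n_qubits])
    # Expectation value in [-1, 1]; its sign selects push-left vs push-right.
    return qml.expval(qml.PauliZ(0))

weights = np.random.uniform(0, 2 * np.pi, size=(2, n_qubits, 3))
action = int(policy_circuit([0.1, -0.2, 0.05, 0.0], weights) > 0)
```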
SwarmAgentic: Towards Fully Automated Agentic System Generation via Swarm Intelligence
The rapid progress of Large Language Models has advanced agentic systems in decision-making, coordination, and task execution. Yet, existing agentic system generation frameworks lack full autonomy, missing from-scratch agent generation, se…
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge
Retrieval systems are central to many NLP pipelines, but often rely on surface-level cues such as keyword overlap and lexical semantic similarity. To evaluate retrieval beyond these shallow signals, recent benchmarks introduce reasoning-he…
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
Multimodal large language models (MLLMs) frequently hallucinate by over-committing to spurious visual cues. Prior remedies, Visual and Instruction Contrastive Decoding (VCD, ICD), mitigate this issue, yet the mechanism remains opaque. We fir…
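As background, the VCD baseline contrasts next-token logits conditioned on the original image with logits conditioned on a distorted one, which suppresses tokens driven purely by the language prior. An illustrative numpy version of the standard VCD combination rule (not ASCD's attention-steerable variant):

```python
import numpy as np

def contrastive_decode_step(logits_clean, logits_distorted, alpha=1.0):
    """One step of visual contrastive decoding (VCD-style).

    logits_clean: next-token logits given the original image.
    logits_distorted: logits given a noised/blurred image, which amplifies
    the language prior; subtracting it penalizes tokens the model would
    emit without actually attending to the image.
    """
    contrastive = (1 + alpha) * logits_clean - alpha * logits_distorted
    probs = np.exp(contrastive - contrastive.max())   # stable softmax
    return probs / probs.sum()
```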
FedNano: Toward Lightweight Federated Tuning for Pretrained Multimodal Large Language Models
Multimodal Large Language Models (MLLMs) excel in tasks like multimodal reasoning and cross-modal retrieval but face deployment challenges in real-world scenarios due to distributed multimodal data and strict privacy requirements. Federate…
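The general recipe behind lightweight federated tuning is to keep the pretrained backbone frozen and exchange only small adapter updates between clients and server. A minimal FedAvg-style sketch over adapter state dicts, illustrative rather than FedNano's exact aggregation scheme:

```python
import torch

def fedavg_adapters(client_states, weights=None):
    """Average only the lightweight adapter tensors across clients.

    `client_states` is a list of state dicts containing adapter parameters
    only; the frozen MLLM backbone never leaves the server, which keeps
    communication cost and privacy exposure low.
    """
    n = len(client_states)
    weights = weights or [1.0 / n] * n       # uniform weighting by default
    avg = {}
    for key in client_states[0]:
        avg[key] = sum(w * s[key] for w, s in zip(weights, client_states))
    return avg
```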
Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation
Leveraging multiple Large Language Models (LLMs) has proven effective for addressing complex, high-dimensional tasks, but current approaches often rely on static, manually engineered multi-agent configurations. To overcome these constraints…
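A minimal sketch of what one textual-backpropagation step could look like, assuming a hypothetical `llm()` completion helper: instead of numeric gradients, a critic LLM produces textual feedback that is propagated backwards to rewrite each agent's prompt. The real framework's feedback signals and update schedule are more elaborate:

```python
def textual_backprop_step(llm, agent_prompts, task, transcript):
    """One self-evolution step in the spirit of textual backpropagation.

    `llm(text) -> str` is a hypothetical chat-completion helper.
    """
    # "Backward pass": a critic turns the run transcript into feedback.
    feedback = llm(
        f"Task: {task}\nTranscript: {transcript}\n"
        "Critique the pipeline and give one improvement per agent."
    )
    # "Parameter update": rewrite each agent prompt against the feedback.
    revised = {}
    for name, prompt in agent_prompts.items():
        revised[name] = llm(
            f"Current prompt for agent '{name}':\n{prompt}\n"
            f"Feedback:\n{feedback}\nRewrite the prompt to address it."
        )
    return revised
```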
Improving LLM Reasoning through Interpretable Role-Playing Steering
Role-playing has emerged as an effective technique for enhancing the reasoning capabilities of large language models (LLMs). However, existing methods primarily rely on prompt engineering, which often lacks stability and interpretability. …
Language Mixing in Reasoning Language Models: Patterns, Impact, and Internal Causes
Reasoning language models (RLMs) excel at complex tasks by leveraging a chain-of-thought process to generate structured intermediate steps. However, language mixing, i.e., reasoning steps containing tokens from languages other than the pro…
CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process
Recent Large Reasoning Models (LRMs) significantly improve the reasoning ability of Large Language Models by learning to reason, exhibiting promising performance on complex tasks. LRMs solve tasks that require complex reasoning by exp…
WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration
LLM-based autonomous agents often fail to execute complex web tasks that require dynamic interaction, largely due to the inherent uncertainty and complexity of these environments. Existing LLM-based web agents typically rely on rigid, expe…
In-depth Analysis of Graph-based RAG in a Unified Framework
Graph-based Retrieval-Augmented Generation (RAG) has proven effective in integrating external knowledge into large language models (LLMs), improving their factual accuracy, adaptability, interpretability, and trustworthiness. A number of g…
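A common denominator of graph-based RAG methods is retrieving a query-relevant subgraph and serializing it into the LLM context. A minimal networkx sketch of that retrieval-and-linearization step, illustrative of the family rather than any specific framework analyzed in the paper:

```python
import networkx as nx

def retrieve_subgraph_context(graph, query_entities, hops=1):
    """Linearize a k-hop neighborhood into text for RAG prompting.

    Pulls the neighborhood of entities mentioned in the query and
    serializes its edges as (head, relation, tail) facts.
    """
    nodes = set(query_entities)
    for _ in range(hops):
        nodes |= {n for u in list(nodes) for n in graph.neighbors(u)}
    sub = graph.subgraph(nodes)
    facts = [
        f"({u}, {d.get('relation', 'related_to')}, {v})"
        for u, v, d in sub.edges(data=True)
    ]
    return "\n".join(facts)
```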
PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection
Visual instruction tuning adapts pre-trained Multimodal Large Language Models (MLLMs) to follow human instructions for real-world applications. However, the rapid growth of these datasets introduces significant redundancy, leading to incre…
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering
Multimodal Large Language Models (MLLMs) have significantly advanced visual tasks by integrating visual representations into large language models (LLMs). The textual modality, inherited from LLMs, equips MLLMs with abilities like instruct…
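Linear representation steering generally means shifting a layer's hidden states along a learned direction, h' = h + αv. A generic PyTorch forward-hook sketch of that mechanism; how the steering vector is obtained is assumed here (e.g. trained or derived from activation differences), not taken from the paper:

```python
import torch

def add_steering_hook(layer, steering_vector, alpha=1.0):
    """Register a hook that shifts hidden states: h' = h + alpha * v."""
    def hook(module, inputs, output):
        # Many transformer blocks return tuples; steer the hidden states.
        hidden = output[0] if isinstance(output, tuple) else output
        steered = hidden + alpha * steering_vector.to(hidden.dtype)
        return (steered, *output[1:]) if isinstance(output, tuple) else steered
    return layer.register_forward_hook(hook)
```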
PERFT: Parameter-Efficient Routed Fine-Tuning for Mixture-of-Expert Model
The Mixture-of-Experts (MoE) paradigm has emerged as a powerful approach for scaling transformers with improved resource utilization. However, efficiently fine-tuning MoE models remains largely underexplored. Inspired by recent works on Pa…
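One way to picture parameter-efficient routed fine-tuning is to bolt tiny trainable bottleneck experts, gated by a learned router, onto a frozen layer. The sketch below is an illustrative configuration under that assumption, not PERFT's exact design:

```python
import torch
import torch.nn as nn

class RoutedAdapter(nn.Module):
    """Tiny routed bottleneck experts added to a frozen layer's output."""

    def __init__(self, d_model=768, n_experts=4, r=8, top_k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)
        self.down = nn.ModuleList([nn.Linear(d_model, r) for _ in range(n_experts)])
        self.up = nn.ModuleList([nn.Linear(r, d_model) for _ in range(n_experts)])
        self.top_k = top_k

    def forward(self, h):
        gates = self.router(h).softmax(dim=-1)          # (..., n_experts)
        vals, idx = gates.topk(self.top_k, dim=-1)      # keep top-k experts
        out = torch.zeros_like(h)
        for j in range(self.top_k):
            for e in range(len(self.down)):
                mask = (idx[..., j] == e).unsqueeze(-1)
                out = out + mask * vals[..., j : j + 1] * self.up[e](
                    torch.relu(self.down[e](h))
                )
        return h + out  # residual: frozen output plus adapter correction
```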
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs
In the video-language domain, recent works leveraging zero-shot Large Language Model-based reasoning for video understanding have become competitive challengers to previous end-to-end models. However, long video understanding presents u…
SA-DQAS: Self-attention Enhanced Differentiable Quantum Architecture Search
We introduce SA-DQAS, a novel framework that enhances Differentiable Quantum Architecture Search (DQAS) by integrating a self-attention mechanism, enabling more effective quantum circuit design for variational quantum algorithms. Unlike DQ…
Quantum Architecture Search with Unsupervised Representation Learning
Unsupervised representation learning presents new opportunities for advancing Quantum Architecture Search (QAS) on Noisy Intermediate-Scale Quantum (NISQ) devices. QAS is designed to optimize quantum circuits for Variational Quantum Algori…
Differentiable Quantum Architecture Search For Job Shop Scheduling Problem
The job shop scheduling problem (JSSP) plays a pivotal role in industrial applications such as signal processing (SP) and steel manufacturing, and involves sequencing machines and jobs to maximize scheduling efficiency. Previously, JSSP was solv…
zrLLM: Zero-Shot Relational Learning on Temporal Knowledge Graphs with Large Language Models
Modeling evolving knowledge over temporal knowledge graphs (TKGs) has attracted growing attention. Various methods have been proposed to forecast links on TKGs. Most of them are embedding-based, where hidden representations are learned to repres…
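As an example of the embedding-based family the abstract refers to, a TTransE-style model scores a quadruple (s, r, o, t) by translating the subject with relation and timestamp embeddings and measuring the distance to the object; smaller distance means a more plausible fact. A one-function sketch:

```python
import torch

def ttranse_score(e_s, e_r, e_t, e_o):
    """TTransE-style plausibility of a temporal fact (s, r, o, t).

    Higher (less negative) score = more plausible quadruple.
    """
    return -torch.norm(e_s + e_r + e_t - e_o, p=1, dim=-1)
```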
GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models
While multi-modal models have successfully integrated information from image, video, and audio modalities, integrating graph modality into large language models (LLMs) remains unexplored. This discrepancy largely stems from the inherent di…
GenTKG: Generative Forecasting on Temporal Knowledge Graph with Large Language Models
The rapid advancements in large language models (LLMs) have ignited interest in the temporal knowledge graph (tKG) domain, where conventional embedding-based and rule-based methods dominate. It remains an open question whether pre-trained…
Differentiable Quantum Architecture Search for Quantum Reinforcement Learning
Differentiable quantum architecture search (DQAS) is a gradient-based framework to design quantum circuits automatically in the NISQ era. It is motivated by challenges such as the low fidelity of quantum hardware and the low flexibility of circuit architecture…
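The core trick in differentiable architecture search is to relax the discrete choice of gates into continuous weights so that both the architecture distribution and the gate angles receive gradients. A minimal one-slot, DARTS-style sketch in PennyLane with a torch interface, illustrative of the relaxation rather than the paper's exact estimator:

```python
import pennylane as qml
import torch

ops = [qml.RX, qml.RY, qml.RZ]                  # candidate gate set
dev = qml.device("default.qubit", wires=1)

@qml.qnode(dev, interface="torch")
def circuit(op_idx_sequence, theta):
    for slot, k in enumerate(op_idx_sequence):
        ops[k](theta[slot], wires=0)            # place the chosen gate
    return qml.expval(qml.PauliZ(0))

def expected_loss(alpha, theta, target=-1.0):
    """Softmax-relaxed architecture objective: average the loss of every
    candidate gate, weighted by softmax(alpha), so both the architecture
    logits and the angles get gradients. One slot keeps the sketch small;
    with several slots the sum runs over gate sequences."""
    probs = torch.softmax(alpha, dim=-1)
    loss = 0.0
    for k in range(len(ops)):
        out = circuit([k], theta)
        loss = loss + probs[k] * (out - target) ** 2
    return loss

alpha = torch.zeros(len(ops), requires_grad=True)
theta = torch.tensor([0.3], requires_grad=True)
expected_loss(alpha, theta).backward()          # gradients for both
```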
Benchmarking Robustness of Adaptation Methods on Pre-trained Vision-Language Models
Various adaptation methods, such as LoRA, prompts, and adapters, have been proposed to enhance the performance of pre-trained vision-language models in specific domains. The robustness of these adaptation methods against distribution shift…
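Of the adaptation methods named, LoRA is the easiest to sketch: freeze the pretrained weight and learn a low-rank update, W x + (alpha / r) * B A x. A generic PyTorch version with illustrative hyperparameters:

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Frozen linear layer with a trainable low-rank update (LoRA)."""

    def __init__(self, base: nn.Linear, r=8, alpha=16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False             # freeze pretrained weights
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))  # start at 0
        self.scale = alpha / r

    def forward(self, x):
        # Base output plus the scaled low-rank correction B @ A @ x.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)
```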
QNEAT: Natural Evolution of Variational Quantum Circuit Architecture
Quantum Machine Learning (QML) is a recent and rapidly evolving field where the theoretical framework and logic of quantum mechanics are employed to solve machine learning tasks. Various techniques with different levels of quantum-classica…