Explanipedia

Parameter-free Algorithms for the Stochastically Extended Adversarial Model Open

Shuche Wang, Adarsh Barik, Peng Zhao, Vincent Y. F. Tan · 2025

We develop the first parameter-free algorithms for the Stochastically Extended Adversarial (SEA) model, a framework that bridges adversarial and stochastic online convex optimization. Existing approaches for the SEA model require prior kno…

Muon Outperforms Adam in Tail-End Associative Memory Learning Open

Shuche Wang, Fengzhuo Zhang, Jiaxiang Li, Canwei Du, Chao‐Hai Du , et al. · 2025

The Muon optimizer is consistently faster than Adam in training Large Language Models (LLMs), yet the mechanism underlying its success remains unclear. This paper demystifies this mechanism through the lens of associative memory. By ablati…

Memory Limitations of Prompt Tuning in Transformers Open

Maxime Meyer, Mario Michelessa, Caroline Chaux, Vincent Y. F. Tan · 2025

Despite the empirical success of prompt tuning in adapting pretrained language models to new tasks, theoretical analyses of its capabilities remain limited. Existing theoretical work primarily addresses universal approximation properties, …

Algorithm unrolling for solving inverse problems in signal and image processing Open

Caroline Chaux, Abhijit Singh, Emmanuel Soubies, Vincent Y. F. Tan · 2025

International audience

Automatic Rank Determination for Low-Rank Adaptation via Submodular Function Maximization Open

Yihang Gao, Vincent Y. F. Tan · 2025

In this paper, we propose SubLoRA, a rank determination method for Low-Rank Adaptation (LoRA) based on submodular function maximization. In contrast to prior approaches, such as AdaLoRA, that rely on first-order (linearized) approximations…

Immune Checkpoint Inhibitors for Metastatic Colorectal Cancer: A Systematic Review Open

A. S. Gilson, Vincent Y. F. Tan, Thibaud Koessler, Jérémy Meyer, G. Meurette , et al. · 2025

Medicine Chemistry

Background: Colorectal cancer is a significant health concern. Immunotherapy has become a promising approach in colorectal cancer, offering a wider array of therapeutic strategies. This study aims to summarize the current evidence regardin…

Finite-Time Minimax Bounds and an Optimal Lyapunov Policy in Queueing Control Open

Yujie Liu, Vincent Y. F. Tan, Yunbei Xu · 2025

We introduce an original minimax framework for finite-time performance analysis in queueing control and propose a surprisingly simple Lyapunov-based scheduling policy with superior finite-time performance. The framework quantitatively char…

Log-Sum-Exponential Estimator for Off-Policy Evaluation and Learning Open

Armin Behnamnia, Gholamali Aminian, Alireza Afzal Aghaei, Chengchun Shi, Vincent Y. F. Tan , et al. · 2025

Off-policy learning and evaluation leverage logged bandit feedback datasets, which contain context, action, propensity score, and feedback for each data point. These scenarios face significant challenges due to high variance and poor perfo…

Asymptotically Optimal Linear Best Feasible Arm Identification with Fixed Budget Open

Jie Bian, Vincent Y. F. Tan · 2025

The challenge of identifying the best feasible arm within a fixed budget has attracted considerable interest in recent years. However, a notable gap remains in the literature: the exact exponential rate at which the error probability appro…

Best Arm Identification with Possibly Biased Offline Data Open

Le Yang, Vincent Y. F. Tan, Wang Chi Cheung · 2025

We study the best arm identification (BAI) problem with potentially biased offline data in the fixed confidence setting, which commonly arises in real-world scenarios such as clinical trials. We prove an impossibility result for adaptive a…

BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms Open

Y. R. Hou, Fengzhuo Zhang, Canwei Du, Xuan Zhang, Jiachun Pan , et al. · 2025

Speculative decoding has emerged as a popular method to accelerate the inference of Large Language Models (LLMs) while retaining their superior text generation performance. Previous methods either adopt a fixed speculative decoding configu…

p-Mean Regret for Stochastic Bandits Open

Anand Krishna, Philips George John, Adarsh Barik, Vincent Y. F. Tan · 2025

Mathematics Economics Computer science

In this work, we extend the concept of the p-mean welfare objective from social choice theory to study p-mean regret in stochastic multi-armed bandit problems. The p-mean regret, defined as the difference between the optimal mean among the…

Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework Open

Jing Wang, Fengzhuo Zhang, Xiaoli Li, Vincent Y. F. Tan, Tianyu Pang , et al. · 2025

Auto-Regressive Video Diffusion Models (AR-VDMs) have shown strong capabilities in generating long, photorealistic videos, but suffer from two key limitations: (i) history forgetting, where the model loses track of previously generated con…

Low Tensor-Rank Adaptation of Kolmogorov--Arnold Networks Open

Yihang Gao, Michael K. Ng, Vincent Y. F. Tan · 2025

Mathematics Computer science Psychology

Kolmogorov--Arnold networks (KANs) have demonstrated their potential as an alternative to multi-layer perceptions (MLPs) in various domains, especially for science-related tasks. However, transfer learning of KANs remains a relatively unex…

Ensemble-Tight Second-Order Asymptotics and Exponents for Guessing-Based Decoding with Abandonment Open

Vincent Y. F. Tan, Hamdi Joudeh · 2025

Mathematics Physics Economics

This paper considers guessing-based decoders with abandonment for discrete memoryless channels in which all codewords have the same composition. This class of decoders rank-orders all input sequences in the codebook's composition class fro…

Optimal Multi-Objective Best Arm Identification with Fixed Confidence Open

Zhirui Chen, P. N. Karthik, Yeow Meng Chee, Vincent Y. F. Tan · 2025

Computer science Mathematics Biology

We consider a multi-armed bandit setting with finitely many arms, in which each arm yields an $M$-dimensional vector reward upon selection. We assume that the reward of each dimension (a.k.a. {\em objective}) is generated independently of …

A General Framework for Clustering and Distribution Matching With Bandit Feedback Open

Recep Can Yavas, Yuqi Huang, Vincent Y. F. Tan, Jonathan Scarlett · 2025

Computer science Mathematics

We develop a general framework for clustering and distribution matching problems with bandit feedback. We consider a $K$-armed bandit model where some subset of $K$ arms is partitioned into $M$ groups. Within each group, the random variabl…

Enhancing Multi-Text Long Video Generation Consistency without Tuning: Time-Frequency Analysis, Prompt Alignment, and Theory Open

X X Li, Fengzhuo Zhang, Jiachun Pan, Y. Thomas Hou, Vincent Y. F. Tan , et al. · 2024

Computer science

Despite the considerable progress achieved in the long video generation problem, there is still significant room to improve the consistency of the videos, particularly in terms of smoothness and transitions between scenes. We address these…

p-Mean Regret for Stochastic Bandits Open

Anand Krishna, Philips George John, Adarsh Barik, Vincent Y. F. Tan · 2024

Mathematics Economics Computer science

In this work, we extend the concept of the $p$-mean welfare objective from social choice theory (Moulin 2004) to study $p$-mean regret in stochastic multi-armed bandit problems. The $p$-mean regret, defined as the difference between the op…

Towards Understanding Why FixMatch Generalizes Better Than Supervised Learning Open

Jingyang Li, Jiachun Pan, Vincent Y. F. Tan, Kim-Chuan Toh, Pan Zhou · 2024

Computer science Psychology

Semi-supervised learning (SSL), exemplified by FixMatch (Sohn et al., 2020), has shown significant generalization advantages over supervised learning (SL), particularly in the context of deep neural networks (DNNs). However, it is still un…

On the Convergence of (Stochastic) Gradient Descent for Kolmogorov--Arnold Networks Open

Yihang Gao, Vincent Y. F. Tan · 2024

Computer science Mathematics Economics

Kolmogorov--Arnold Networks (KANs), a recently proposed neural network architecture, have gained significant attention in the deep learning community, due to their potential as a viable alternative to multi-layer perceptrons (MLPs) and the…

Almost Minimax Optimal Best Arm Identification in Piecewise Stationary Linear Bandits Open

Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong · 2024

Mathematics Computer science Biology

We propose a {\em novel} piecewise stationary linear bandit (PSLB) model, where the environment randomly samples a context from an unknown probability distribution at each changepoint, and the quality of an arm is measured by its return av…

Stochastic Bandits for Egalitarian Assignment Open

Eugene A. Lim, Vincent Y. F. Tan, Harold Soh · 2024

Economics Computer science

We study EgalMAB, an egalitarian assignment problem in the context of stochastic multi-armed bandits. In EgalMAB, an agent is tasked with assigning a set of users to arms. At each time step, the agent must assign exactly one arm to each us…

Best Arm Identification with Minimal Regret Open

Junwen Yang, Vincent Y. F. Tan, Tianyuan Jin · 2024

Computer science Biology

Motivated by real-world applications that necessitate responsible experimentation, we introduce the problem of best arm identification (BAI) with minimal regret. This innovative variant of the multi-armed bandit problem elegantly amalgamat…

A Sample Efficient Alternating Minimization-based Algorithm For Robust Phase Retrieval Open

Adarsh Barik, Anand Krishna, Vincent Y. F. Tan · 2024

Computer science Mathematics Physics

In this work, we study the robust phase retrieval problem where the task is to recover an unknown signal $θ^* \in \mathbb{R}^d$ in the presence of potentially arbitrarily corrupted magnitude-only linear measurements. We propose an alternat…

LEARN: An Invex Loss for Outlier Oblivious Robust Online Optimization Open

Adarsh Barik, Anand Krishna, Vincent Y. F. Tan · 2024

Computer science Mathematics

We study a robust online convex optimization framework, where an adversary can introduce outliers by corrupting loss functions in an arbitrary number of rounds k, unknown to the learner. Our focus is on a novel setting allowing unbounded d…

A Mirror Descent-Based Algorithm for Corruption-Tolerant Distributed Gradient Descent Open

Shuche Wang, Vincent Y. F. Tan · 2024

Computer science Engineering Art

Distributed gradient descent algorithms have come to the fore in modern machine learning, especially in parallelizing the handling of large datasets that are distributed across several workers. However, scant attention has been paid to ana…

Influence Maximization via Graph Neural Bandits Open

Yuting Feng, Vincent Y. F. Tan, Bogdan Cautis · 2024

Computer science Mathematics

We consider a ubiquitous scenario in the study of Influence Maximization (IM), in which there is limited knowledge about the topology of the diffusion network. We set the IM problem in a multi-round diffusion campaign, aiming to maximize t…

Order-Optimal Instance-Dependent Bounds for Offline Reinforcement Learning with Preference Feedback Open

Zhirui Chen, Vincent Y. F. Tan · 2024

Computer science Mathematics Psychology

We consider offline reinforcement learning (RL) with preference feedback in which the implicit reward is a linear function of an unknown parameter. Given an offline dataset, our objective consists in ascertaining the optimal action for eac…

MIMO Capacity Analysis and Channel Estimation for Electromagnetic Information Theory Open

Jieao Zhu, Vincent Y. F. Tan, Linglong Dai · 2024

Computer science Environmental science Engineering

Electromagnetic information theory (EIT) is an interdisciplinary subject that serves to integrate deterministic electromagnetic theory with stochastic Shannon's information theory. Existing EIT analysis operates in the continuous space dom…

Vincent Y. F. Tan YOU? Author Swipe