Explanipedia

A Saddle Point Remedy: Power of Variable Elimination in Non-convex Optimization Open

Min Gan, Guangyong Chen, Lin F. Yang · 2025

The proliferation of saddle points, rather than poor local minima, is increasingly understood to be a primary obstacle in large-scale non-convex optimization for machine learning. Variable elimination algorithms, like Variable Projection (…

ARMOR: High-Performance Semi-Structured Pruning via Adaptive Matrix Factorization Open

L. Liu, Alexander Liu, Mengdi Wang, Tuo Zhao, Lin F. Yang · 2025

Large language models (LLMs) present significant deployment challenges due to their immense computational and memory requirements. While semi-structured pruning, particularly 2:4 sparsity, offers a path to practical hardware acceleration, …

Research on a Controlled Knife Recognition System Based on YOLOv11s Open

Ya Xu, Jian‐Bing Zeng, Lin F. Yang, R.Z. Wang, X. Li , et al. · 2025

With the rapid development of computer technology, the importance of object detection technology in the field of dangerous item detection is increasingly highlighted. This paper focuses on the precise detection of dangerous knives in publi…

Near-Optimal Sample Complexity Bounds for Constrained Average-Reward MDPs Open

Yuyang Wei, Xudong Li, Lin F. Yang · 2025

Recent advances have significantly improved our understanding of the sample complexity of learning in average-reward Markov decision processes (AMDPs) under the generative model. However, much less is known about the constrained average-re…

Research on the influencing factors of generative artificial intelligence usage intent in post-secondary education: an empirical analysis based on the AIDUA extended model Open

Xue Bai, Lin F. Yang · 2025

Objective Generative Artificial Intelligence (AIGC) presents a profound dialectic in higher education: its transformative potential is challenged by deep-seated psychological and ethical barriers. Traditional adoption models fail to captur…

Supplemental Material: Two episodes of orogenic gold mineralization at Chaihulanzi, NE China, in response to superimposed orogeny associated with subduction of the Paleo-Asian and Paleo-Pacific Ocean plates Open

Lin F. Yang, et al. · 2025

Tables S1–S6

Advancing Large Language Models for Tibetan with Curated Data and Continual Pre-Training Open

Leiyu Pan, Bojian Xiong, Lin F. Yang, Renren Jin, Shaowei Zhang , et al. · 2025

Large language models have achieved remarkable progress across many languages. However, Tibetan, as a representative low-resource language, is particularly underrepresented in existing models due to the scarcity of high-quality training co…

Research on Capacitated Multi-Ship Replenishment Path Planning Problem Based on the Synergistic Hybrid Optimization Algorithm Open

Lin F. Yang, Qinghua Chen, Jie Mu, Tangying Liu, Xiaoxiao Li , et al. · 2025

Ship replenishment path planning is a critical problem in the field of maritime logistics. This study proposes a novel synergistic hybrid optimization algorithm (SHOA) that effectively integrates ant colony optimization (ACO), the Clarke–W…

Integrating Generative AI-Based Assistance Tool in Programming Education for Medical Students: A Cross-Sectional Study Open

Xiaowei Xu, Si Zheng, Lin F. Yang, Jie Hao, Xuwen Wang , et al. · 2025

Backgroud With the increasing importance of computational skills in healthcare, there is a growing need to equip medical students with programming knowledge to address complex healthcare challenges effectively. Traditional programming meth…

Effective equidistribution in rank 2 homogeneous spaces and values of quadratic forms Open

Elon Lindenstrauss, Amir Mohammadi, Zhiren Wang, Lin F. Yang · 2025

We establish effective equidistribution theorems, with a polynomial error rate, for orbits of unipotent subgroups in quotients of quasi-split, almost simple Linear algebraic groups of absolute rank 2. As an application, inspired by the res…

Research on Ship Replenishment Path Planning Based on the Modified Whale Optimization Algorithm Open

Qinghua Chen, Gang Yao, Lin F. Yang, Tangying Liu, Jin Sun , et al. · 2025

Ship replenishment path planning has always been a critical concern for researchers in the field of security. This study proposes a modified whale optimization algorithm (MWOA) to address single-task ship replenishment path planning proble…

Transition Transfer $Q$-Learning for Composite Markov Decision Processes Open

John Chai, Elynn Chen, Lin F. Yang · 2025

To bridge the gap between empirical success and theoretical understanding in transfer reinforcement learning (RL), we study a principled approach with provable performance guarantees. We introduce a novel composite MDP framework where high…

Nearly Linear Row Sampling Algorithm for Quantile Regression Open

Yi Li, Ruosong Wang, Lin F. Yang, Hanrui Zhang · 2025

We give a row sampling algorithm for the quantile loss function with sample complexity nearly linear in the dimensionality of the data, improving upon the previous best algorithm whose sampling complexity has at least cubic dependence on t…

Hyper: Hyperparameter Robust Efficient Exploration in Reinforcement Learning Open

Yiran Wang, Chenshu Liu, Yunfan Li, Sanae Amani, Bolei Zhou , et al. · 2024

The exploration \& exploitation dilemma poses significant challenges in reinforcement learning (RL). Recently, curiosity-based exploration methods achieved great success in tackling hard-exploration problems. However, they necessitate exte…

Misspecified $Q$-Learning with Sparse Linear Function Approximation: Tight Bounds on Approximation Error Open

Ally Yalei Du, Lin F. Yang, Ruosong Wang · 2024

The recent work by Dong & Yang (2023) showed for misspecified sparse linear bandits, one can obtain an $O\left(ε\right)$-optimal policy using a polynomial number of samples when the sparsity is a constant, where $ε$ is the misspecification…

Confident Natural Policy Gradient for Local Planning in $q_π$-realizable Constrained MDPs Open

Tian Tian, Lin F. Yang, Csaba Szepesvári · 2024

The constrained Markov decision process (CMDP) framework emerges as an important reinforcement learning approach for imposing safety or other critical objectives while maximizing cumulative reward. However, the current understanding of how…

Learning for Bandits under Action Erasures Open

Osama A. Hanna, Merve Karakas, Lin F. Yang, Christina Fragouli · 2024

We consider a novel multi-arm bandit (MAB) setup, where a learner needs to communicate the actions to distributed agents over erasure channels, while the rewards for the actions are directly available to the learner through external sensor…

Don't Forget to Connect! Improving RAG with Graph-based Reranking Open

Jialin Dong, Bahare Fatemi, Bryan Perozzi, Lin F. Yang, Anton Tsitsulin · 2024

Retrieval Augmented Generation (RAG) has greatly improved the performance of Large Language Model (LLM) responses by grounding generation with context from existing documents. These systems work well when documents are clearly relevant to …

Research on acoustic methods for buried PE pipeline detection based on LSTM neural networks Open

Yongsheng Qi, Xinhua Wang, Xuyun Yang, Tao Sun, Izzat Razzaq , et al. · 2024

As an essential component of urban infrastructure construction, polyethylene (PE) pipelines face the challenging task of underground detection due to the complex and dynamic nature of the subsurface environment, diverse installation paths,…

Uniform Last-Iterate Guarantee for Bandits and Reinforcement Learning Open

Junyan Liu, Yunfan Li, Lin F. Yang · 2024

Existing metrics for reinforcement learning (RL) such as regret, PAC bounds, or uniform-PAC (Dann et al., 2017), typically evaluate the cumulative performance, while allowing the agent to play an arbitrarily bad policy at any finite time t…

Modeling Bellman Error with Logistic Distribution with Applications in Reinforcement Learning Open

Outongyi Lv, Bingxin Zhou, Lin F. Yang · 2024

Ir-Ids: A Network Intrusion Detection Method Based on Causal Feature Selection and Explainable Model Optimization Open

Yazhuo Gao, Lin F. Yang, Ran Zhu, Yixuan Wu, Feng Yang , et al. · 2024

Multi-Agent Bandit Learning through Heterogeneous Action Erasure Channels Open

Osama A. Hanna, Merve Karakas, Lin F. Yang, Christina Fragouli · 2023

Multi-Armed Bandit (MAB) systems are witnessing an upswing in applications within multi-agent distributed environments, leading to the advancement of collaborative MAB algorithms. In such settings, communication between agents executing ac…

Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation Open

Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang · 2023

To tackle long planning horizon problems in reinforcement learning with general function approximation, we propose the first algorithm, termed as UCRL-WVTR, that achieves both \emph{horizon-free} and \emph{instance-dependent}, since it eli…

Adaptive Liquidity Provision in Uniswap V3 with Deep Reinforcement Learning Open

Haochen Zhang, Xi Chen, Lin F. Yang · 2023

Decentralized exchanges (DEXs) are a cornerstone of decentralized finance (DeFi), allowing users to trade cryptocurrencies without the need for third-party authorization. Investors are incentivized to deposit assets into liquidity pools, a…

Scaling Distributed Multi-task Reinforcement Learning with Experience Sharing Open

Sanae Amani, Khushbu Pahwa, Vladimir Braverman, Lin F. Yang · 2023

Recently, DARPA launched the ShELL program, which aims to explore how experience sharing can benefit distributed lifelong learning agents in adapting to new challenges. In this paper, we address this issue by conducting both theoretical an…

On the Model-Misspecification in Reinforcement Learning Open

Yunfan Li, Lin F. Yang · 2023

The success of reinforcement learning (RL) crucially depends on effective function approximation when dealing with complex ground-truth models. Existing sample-efficient RL algorithms primarily employ three approaches to function approxima…

Low-Switching Policy Gradient with Exploration via Online Sensitivity Sampling Open

Yunfan Li, Yiran Wang, Yu Cheng, Lin F. Yang · 2023

Policy optimization methods are powerful algorithms in Reinforcement Learning (RL) for their flexibility to deal with policy parameterization and ability to handle model misspecification. However, these methods usually suffer from slow con…

Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds Open

Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang · 2023

While numerous works have focused on devising efficient algorithms for reinforcement learning (RL) with uniformly bounded rewards, it remains an open question whether sample or time-efficient algorithms for RL with large state-action space…

MetaVL: Transferring In-Context Learning Ability From Language Models to Vision-Language Models Open

Masoud Monajatipoor, Liunian Harold Li, Mozhdeh Rouhsedaghat, Lin F. Yang, Kai-Wei Chang · 2023

Large-scale language models have shown the ability to adapt to a new task via conditioning on a few demonstrations (i.e., in-context learning). However, in the vision-language domain, most large-scale pre-trained vision-language (VL) model…

Lin F. Yang YOU? Author Swipe