Explanipedia

Offline Imitation Learning upon Arbitrary Demonstrations by Pre-Training Dynamics Representations Open

Hao Ma, Bo Dai, Zhaolin Ren, Yebin Wang, Na Li · 2025

Limited data has become a major bottleneck in scaling up offline imitation learning (IL). In this paper, we propose enhancing IL performance under limited expert data by introducing a pre-training stage that learns dynamics representations…

Regression-Based Single-Point Zeroth-Order Optimization Open

Xin Chen, Zhaolin Ren · 2025

Zeroth-order optimization (ZO) is widely used for solving black-box optimization and control problems. In particular, single-point ZO (SZO) is well-suited to online or dynamic problem settings due to its requirement of only a single functi…

An AI-Cyborg System for Adaptive Intelligent Modulation of Organoid Maturation Open

Ren Liu, Zhaolin Ren, Xinhe Zhang, Qiang Li, Wenbo Wang , et al. · 2024

Recent advancements in flexible bioelectronics have enabled continuous, long-term stable interrogation and intervention of biological systems. However, effectively utilizing the interrogated data to modulate biological systems to achieve s…

Scalable spectral representations for multi-agent reinforcement learning in network MDPs Open

Zhaolin Ren, Runyu, Zhang, Bo Dai · 2024

Computer science

Network Markov Decision Processes (MDPs), a popular model for multi-agent control, pose a significant challenge to efficient learning due to the exponential growth of the global state-action space with the number of agents. In this work, u…

Distributed Thompson sampling under constrained communication Open

Saba Zerefa, Zhaolin Ren, Hao Ma, Na Li · 2024

Computer science

In Bayesian optimization, a black-box function is maximized via the use of a surrogate model. We apply distributed Thompson sampling, using a Gaussian process as a surrogate model, to approach the multi-agent Bayesian optimization problem.…

Enhancing Preference-based Linear Bandits via Human Response Time Open

Li Shen, Yuyang Zhang, Zhaolin Ren, C. H. Liang, Na Li , et al. · 2024

Computer science Economics

Interactive preference learning systems infer human preferences by presenting queries as pairs of options and collecting binary choices. Although binary choices are simple and widely used, they provide limited information about preference …

Skill Transfer and Discovery for Sim-to-Real Learning: A Representation-Based Viewpoint Open

Haitong Ma, Zhaolin Ren, Bo Dai, Na Li · 2024

Computer science Psychology Political science

We study sim-to-real skill transfer and discovery in the context of robotics control using representation learning. We draw inspiration from spectral decomposition of Markov decision processes. The spectral decomposition brings about repre…

TS-RSR: A provably efficient approach for batch Bayesian Optimization Open

Zhaolin Ren, Na Li · 2024

Computer science Mathematics Engineering

This paper presents a new approach for batch Bayesian Optimization (BO) called Thompson Sampling-Regret to Sigma Ratio directed sampling (TS-RSR), where we sample a new batch of actions by minimizing a Thompson Sampling approximation of a …

Research On The Performance Test Method Of Elevator Brake Open

Zhaolin Ren · 2023

Computer science Engineering Physics

The elevator brake is an important part to ensure the safe operation of the elevator, and its performance directly determines the safety of the whole machine. This paper describes a brake performance test scheme and test principle, and mak…

Explainable multi-task learning for multi-modality biological data analysis Open

Xin Tang, Jiawei Zhang, Yichun He, Xinhe Zhang, Zuwan Lin , et al. · 2023

Computer science Sociology Economics

Current biotechnologies can simultaneously measure multiple high-dimensional modalities (e.g., RNA, DNA accessibility, and protein) from the same cells. A combination of different analytical tasks (e.g., multi-modal integration and cross-m…

Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding Open

Tongzheng Ren, Zhaolin Ren, Na Li, Bo Dai, Dai, Bo · 2023

Computer science Mathematics Physics

This paper proposes an approach, Spectral Dynamics Embedding Control (SDEC), to optimal control for nonlinear stochastic systems. This method reveals an infinite-dimensional feature representation induced by the system's nonlinear stochast…

The Failure Analysis of Unintended Elevator Car Movement Protection Open

Zhaolin Ren · 2023

Computer science Engineering

The unintended movement protection device of elevator car can prevent the accident of passenger shearing and extrusion caused by the fracture of elevator traction wheel shaft and the failure of control system components. However, due to th…

On Controller Reduction in Linear Quadratic Gaussian Control with Performance Bounds Open

Zhaolin Ren, Yang Zheng, Maryam Fazel, Na Li · 2022

Computer science Mathematics Biology

The problem of controller reduction has a rich history in control theory. Yet, many questions remain open. In particular, there exist very few results on the order reduction of general non-observer based controllers and the subsequent quan…

Escaping saddle points in zeroth-order optimization: the power of two-point estimators Open

Zhaolin Ren, Yujie Tang, Na Li · 2022

Mathematics Physics Economics

Two-point zeroth order methods are important in many applications of zeroth-order optimization, such as robotics, wind farms, power systems, online optimization, and adversarial robustness to black-box attacks in deep neural networks, wher…

FedDAR: Federated Domain-Aware Representation Learning Open

Aoxiao Zhong, Hao He, Zhaolin Ren, Na Li, Quanzheng Li · 2022

Computer science Mathematics Political science

Cross-silo Federated learning (FL) has become a promising tool in machine learning applications for healthcare. It allows hospitals/institutions to train models with sufficient data while the data is kept private. To make sure the FL model…

Gradient Play in Stochastic Games: Stationary Points and Local Geometry Open

Runyu Zhang, Zhaolin Ren, Na Li · 2022

Mathematics Biology

We study the stationary points and local geometry of gradient play for stochastic games (SGs), where each agent tries to maximize its own total discounted reward by making decisions independently based on current state information which is…

Analysis on mechanical characteristics of brake wheel and brake shoe of elevator traction machine Open

Tao Jiang, Ziwei Wang, Zhaolin Ren, Guangjun Liu, Facai Ren · 2021

Engineering Physics

This paper analyzes the change of brake torque during normal stop and emergency braking of elevator. Taking the permanent magnet synchronous elevator traction machine as an example, the mechanical characteristics of the brake wheel and bra…

Gradient play in stochastic games: stationary points, convergence, and sample complexity Open

Runyu Zhang, Zhaolin Ren, Na Li · 2021

Mathematics Computer science Biology

We study the performance of the gradient play algorithm for stochastic games (SGs), where each agent tries to maximize its own total discounted reward by making decisions independently based on current state information which is shared bet…

Gradient Play in Multi-Agent Markov Stochastic Games: Stationary Points and Convergence Open

Runyu Zhang, Zhaolin Ren, Na Li · 2021

Mathematics Computer science Economics

We study the performance of the gradient play algorithm for multi-agent tabular Markov decision processes (MDPs), which are also known as stochastic games (SGs), where each agent tries to maximize its own total discounted reward by making …

Zeroth-Order Feedback Optimization for Cooperative Multi-Agent Systems Open

Yujie Tang, Zhaolin Ren, Na Li · 2020

Computer science Mathematics Economics

We study a class of cooperative multi-agent optimization problems, where each agent is associated with a local action vector and a local cost, and the goal is to cooperatively find the joint action profile that minimizes the average of the…

LQR with Tracking: A Zeroth-order Approach and Its Global Convergence Open

Zhaolin Ren, Aoxiao Zhong, Na Li · 2020

Mathematics Computer science Economics

There has been substantial recent progress on the theoretical understanding of model-free approaches to Linear Quadratic Regulator (LQR) problems. Much attention has been devoted to the special case when the goal is to drive the state clos…

Federated LQR: Learning through Sharing. Open

Zhaolin Ren, Aoxiao Zhong, Zhengyuan Zhou, Na Li · 2020

Computer science Mathematics Materials science

In many multi-agent reinforcement learning applications such as flocking, multi-robot applications and smart manufacturing, distinct agents share similar dynamics but face different objectives. In these applications, an important question …

Delay-Adaptive Distributed Stochastic Optimization Open

Zhaolin Ren, Zhengyuan Zhou, Linhai Qiu, Ajay Deshpande, Jayant Kalagnanam · 2020

Computer science Mathematics Economics

In large-scale optimization problems, distributed asynchronous stochastic gradient descent (DASGD) is a commonly used algorithm. In most applications, there are often a large number of computing nodes asynchronously computing gradient info…

Zhaolin Ren YOU? Author Swipe