Kaixiang Lin
YOU?
Author Swipe
View article: Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents
Proposer-Agent-Evaluator(PAE): Autonomous Skill Discovery For Foundation Model Internet Agents Open
The vision of a broadly capable and goal-directed agent, such as an Internet-browsing agent in the digital world and a household humanoid in the physical world, has rapidly advanced, thanks to the generalization capability of foundation mo…
View article: Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk Open
Large language models (LLMs) are powerful dialogue agents, but specializing them towards fulfilling a specific function can be challenging. Instructing tuning, i.e. tuning models on instruction and sample responses generated by humans (Ouy…
View article: A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg
A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg Open
Federated learning (FL) learns a model jointly from a set of participating devices without sharing each other’s privately held data. The characteristics of non-i.i.d. data across the network, low device participation, high communication co…
View article: Transfer Learning in Deep Reinforcement Learning: A Survey
Transfer Learning in Deep Reinforcement Learning: A Survey Open
Reinforcement learning is a learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in reinforcement learning upon the fast development of deep neural networks. Along with the prom…
View article: Automated Few-shot Classification with Instruction-Finetuned Language Models
Automated Few-shot Classification with Instruction-Finetuned Language Models Open
A particularly successful class of approaches for few-shot learning combines language models with prompts -- hand-crafted task descriptions that complement data samples. However, designing prompts by hand for each task commonly requires do…
View article: Automated Few-Shot Classification with Instruction-Finetuned Language Models
Automated Few-Shot Classification with Instruction-Finetuned Language Models Open
A particularly successful class of approaches for few-shot learning combines language models with prompts - hand-crafted task descriptions that complement data samples. However, designing prompts by hand for each task commonly requires dom…
View article: Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic
Parameter and Data Efficient Continual Pre-training for Robustness to Dialectal Variance in Arabic Open
The use of multilingual language models for tasks in low and high-resource languages has been a success story in deep learning. In recent times, Arabic has been receiving widespread attention on account of its dialectal variance. While pri…
View article: CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning
CH-MARL: A Multimodal Benchmark for Cooperative, Heterogeneous Multi-Agent Reinforcement Learning Open
We propose a multimodal (vision-and-language) benchmark for cooperative and heterogeneous multi-agent learning. We introduce a benchmark multimodal dataset with tasks involving collaboration between multiple simulated heterogeneous robots …
View article: DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following
DialFRED: Dialogue-Enabled Agents for Embodied Instruction Following Open
Language-guided Embodied AI benchmarks requiring an agent to navigate an environment and manipulate objects typically allow one-way communication: the human user gives a natural language command to the agent, and the agent can only follow …
View article: Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations
Self-Adaptive Imitation Learning: Learning Tasks with Delayed Rewards from Sub-optimal Demonstrations Open
Reinforcement learning (RL) has demonstrated its superiority in solving sequential decision-making problems. However, heavy dependence on immediate reward feedback impedes the wide application of RL. On the other hand, imitation learning (…
View article: Learning to Act with Affordance-Aware Multimodal Neural SLAM
Learning to Act with Affordance-Aware Multimodal Neural SLAM Open
Recent years have witnessed an emerging paradigm shift toward embodied artificial intelligence, in which an agent must learn to solve challenging tasks by interacting with its environment. There are several challenges in solving embodied m…
View article: Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning
Learning Two-Step Hybrid Policy for Graph-Based Interpretable Reinforcement Learning Open
We present a two-step hybrid reinforcement learning (RL) policy that is designed to generate interpretable and robust hierarchical policies on the RL problem with graph-based input. Unlike prior deep reinforcement learning policies paramet…
View article: Research on Real-Time Analysis of Combustion State of Swirl Burner Based on High-Temperature Image Acquisition Probe
Research on Real-Time Analysis of Combustion State of Swirl Burner Based on High-Temperature Image Acquisition Probe Open
View article: DSResSol: A Sequence-Based Solubility Predictor Created with Dilated Squeeze Excitation Residual Networks
DSResSol: A Sequence-Based Solubility Predictor Created with Dilated Squeeze Excitation Residual Networks Open
Protein solubility is an important thermodynamic parameter that is critical for the characterization of a protein’s function, and a key determinant for the production yield of a protein in both the research setting and within industrial (e…
View article: LUMINOUS: Indoor Scene Generation for Embodied AI Challenges
LUMINOUS: Indoor Scene Generation for Embodied AI Challenges Open
Learning-based methods for training embodied agents typically require a large number of high-quality scenes that contain realistic layouts and support meaningful interactions. However, current simulators for Embodied AI (EAI) challenges on…
View article: DSResSol: A sequence-based solubility predictor created with dilated squeeze excitation residual networks
DSResSol: A sequence-based solubility predictor created with dilated squeeze excitation residual networks Open
Protein solubility is an important thermodynamic parameter critical for the characterization of a protein’s function, and a key determinant for the production yield of a protein in both the research setting and within industrial (e.g. phar…
View article: RCA: A Deep Collaborative Autoencoder Approach for Anomaly Detection
RCA: A Deep Collaborative Autoencoder Approach for Anomaly Detection Open
Unsupervised anomaly detection plays a crucial role in many critical applications. Driven by the success of deep learning, recent years have witnessed growing interests in applying deep neural networks (DNNs) to anomaly detection problems.…
View article: PowerNet: Multi-Agent Deep Reinforcement Learning for Scalable Powergrid Control
PowerNet: Multi-Agent Deep Reinforcement Learning for Scalable Powergrid Control Open
This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative controls in powergrids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generato…
View article: Federated Learning's Blessing: FedAvg has Linear Speedup
Federated Learning's Blessing: FedAvg has Linear Speedup Open
Federated learning (FL) learns a model jointly from a set of participating devices without sharing each other's privately held data. The characteristics of non-\textit{i.i.d.} data across the network, low device participation, high communi…
View article: Off-Policy Imitation Learning from Observations
Off-Policy Imitation Learning from Observations Open
Learning from Observations (LfO) is a practical reinforcement learning scenario from which many applications can benefit through the reuse of incomplete resources. Compared to conventional imitation learning (IL), LfO is more challenging b…
View article: PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control
PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control Open
This paper develops an efficient multi-agent deep reinforcement learning algorithm for cooperative controls in powergrids. Specifically, we consider the decentralized inverter-based secondary voltage control problem in distributed generato…
View article: Transfer Learning in Deep Reinforcement Learning: A Survey
Transfer Learning in Deep Reinforcement Learning: A Survey Open
Reinforcement learning is a learning paradigm for solving sequential decision-making problems. Recent years have witnessed remarkable progress in reinforcement learning upon the fast development of deep neural networks. Along with the prom…
View article: A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg
A Unified Linear Speedup Analysis of Federated Averaging and Nesterov FedAvg Open
Federated learning (FL) learns a model jointly from a set of participating devices without sharing each other's privately held data. The characteristics of non-i.i.d. data across the network, low device participation, high communication co…
View article: Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations
Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations Open
Model-free deep reinforcement learning (RL) has demonstrated its superiority on many complex sequential decision-making problems. However, heavy dependence on dense rewards and high sample-complexity impedes the wide adoption of these meth…
View article: Ranking Policy Gradient
Ranking Policy Gradient Open
Sample inefficiency is a long-lasting problem in reinforcement learning (RL). The state-of-the-art estimates the optimal action values while it usually involves an extensive search over the state-action space and unstable optimization. Tow…
View article: Alkaline Quinone Flow Battery with Long Lifetime at pH 12
Alkaline Quinone Flow Battery with Long Lifetime at pH 12 Open
(Joule 2, 1894–1906; September 19, 2018) In the originally published version of this article, the first panel of Figure 1 showing the DBEAQ synthetic route was omitted. It has now been included and appears below. Additionally, one of the r…
View article: Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning
Efficient Large-Scale Fleet Management via Multi-Agent Deep Reinforcement Learning Open
Large-scale online ride-sharing platforms have substantially transformed our lives by reallocating transportation resources to alleviate traffic congestion and promote transportation efficiency. An efficient fleet management strategy not o…
View article: Differentially Private Generative Adversarial Network
Differentially Private Generative Adversarial Network Open
Generative Adversarial Network (GAN) and its variants have recently attracted intensive research interests due to their elegant theoretical foundation and excellent empirical performance as generative models. These tools provide a promisin…
View article: Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management
Efficient Collaborative Multi-Agent Deep Reinforcement Learning for Large-Scale Fleet Management Open
Large-scale online ride-sharing platforms have substantially transformed our lives by reallocating transportation resources to alleviate traffic congestion and promote transportation efficiency. An efficient fleet management strategy not o…
View article: Improving Mild Cognitive Impairment Prediction via Reinforcement Learning and Dialogue Simulation
Improving Mild Cognitive Impairment Prediction via Reinforcement Learning and Dialogue Simulation Open
Mild cognitive impairment (MCI) is a prodromal phase in the progression from normal aging to dementia, especially Alzheimers disease. Even though there is mild cognitive decline in MCI patients, they have normal overall cognition and thus …