Explanipedia

Deep Reinforcement Learning with Double Q-Learning Open

Hado van Hasselt, Arthur Guez, David Silver · 2016

The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can genera…

Q-Learning Algorithms: A Comprehensive Classification and Applications Open

Beakcheol Jang, Myeonghwi Kim, Gaspard Harerimana, Jong Wook Kim · 2019

Computer science

Q-learning is arguably one of the most applied representative reinforcement learning approaches and one of the off-policy strategies. Since the emergence of Q-learning, many studies have described its uses in reinforcement learning and art…

Conservative Q-Learning for Offline Reinforcement Learning Open

Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine · 2020

Computer science Mathematics Physics

Effectively leveraging large, previously collected datasets in reinforcement learning (RL) is a key challenge for large-scale real-world applications. Offline RL algorithms promise to learn effective policies from previously-collected, sta…

Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks Open

Jingjing Cui, Yuanwei Liu, Arumugam Nallanathan · 2019

Computer science Engineering Mathematics

Unmanned aerial vehicles (UAVs) are capable of serving as aerial base stations (BSs) for providing both cost-effective and on-demand wireless communications. This article investigates dynamic resource allocation of multiple UAVs enabled co…

Spectrum Sharing in Vehicular Networks Based on Multi-Agent Reinforcement Learning Open

Le Liang, Hao Ye, Geoffrey Ye Li · 2019

Computer science Biology

This paper investigates the spectrum sharing problem in vehicular networks based on multi-agent reinforcement learning, where multiple vehicle-to-vehicle (V2V) links reuse the frequency spectrum preoccupied by vehicle-to-infrastructure (V2…

Distributional Reinforcement Learning With Quantile Regression Open

Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos · 2018

Computer science Mathematics Engineering

In reinforcement learning (RL), an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in t…

A Multi-Agent Reinforcement Learning-Based Data-Driven Method for Home Energy Management Open

Xu Xu, Youwei Jia, Yan Xu, Zhao Xu, Songjian Chai, et al. · 2020

Computer science Engineering Mathematics

This paper proposes a novel framework for home energy management (HEM) based on reinforcement learning in achieving efficient home-based demand response (DR). The concerned hour-ahead energy consumption scheduling problem is duly formulate…

Continuous Deep Q-Learning with Model-based Acceleration Open

Shixiang Gu, Timothy Lillicrap, Ilya Sutskever, Sergey Levine · 2016

Computer science Mathematics Biology

Model-free reinforcement learning has been successfully applied to a range of challenging problems, and has recently been extended to handle large neural network policies and value functions. However, the sample complexity of model-free al…

Multi-Objective Workflow Scheduling With Deep-Q-Network-Based Multi-Agent Reinforcement Learning Open

Yuandou Wang, Hang Liu, Wanbo Zheng, Yunni Xia, Yawen Li, et al. · 2019

Computer science Mathematics

Cloud Computing provides an effective platform for executing large-scale and complex workflow applications with a pay-as-you-go model. Nevertheless, various challenges, especially its optimal scheduling for multiple conflicting objectives,…

Energy Management Strategy for a Hybrid Electric Vehicle Based on Deep Reinforcement Learning Open

Yue Hu, Weimin Li, Kun Xu, Taimoor Zahid, Feiyan Qin, et al. · 2018

Computer science Engineering Physics

An energy management strategy (EMS) is important for hybrid electric vehicles (HEVs) since it plays a decisive role on the performance of the vehicle. However, the variation of future driving conditions deeply influences the effectiveness …

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction Open

Aviral Kumar, Justin Fu, George Tucker, Sergey Levine · 2019

Computer science Mathematics Biology

Off-policy reinforcement learning aims to leverage experience collected from prior policies for sample-efficient learning. However, in practice, commonly used off-policy approximate dynamic programming methods based on Q-learning and actor…

DeepRMSA: A Deep Reinforcement Learning Framework for Routing, Modulation and Spectrum Assignment in Elastic Optical Networks Open

Xiaoliang Chen, Baojia Li, Roberto Proietti, Hongbo Lu, Zuqing Zhu, et al. · 2019

Computer science

This paper proposes DeepRMSA, a deep reinforcement learning framework for routing, modulation and spectrum assignment (RMSA) in elastic optical networks (EONs). DeepRMSA learns the correct online RMSA policies by parameterizing the policie…

Collision Avoidance in Pedestrian-Rich Environments With Deep Reinforcement Learning Open

Michael Everett, Yu Fan Chen, Jonathan P. How · 2021

Computer science Engineering

Collision avoidance algorithms are essential for safe and efficient robot operation among pedestrians. This work proposes using deep reinforcement (RL) learning as a framework to model the complex interactions and cooperation with nearb…

High-level Decision Making for Safe and Reasonable Autonomous Lane Changing using Reinforcement Learning Open

Branka Mirchevska, Christian Pek, Moritz Werling, Matthias Althoff, Joschka Boedecker · 2018

Computer science Engineering Political science

Machine learning techniques have been shown to outperform many rule-based systems for the decision-making of autonomous vehicles. However, applying machine learning is challenging due to the possibility of executing unsafe actions and slow…

Multi-Robot Path Planning Method Using Reinforcement Learning Open

Hyansu Bae, Gidong Kim, Jonguk Kim, Dianwei Qian, Suk-Gyu Lee · 2019

Computer science Mathematics Geography

This paper proposes a novel multi-robot path planning algorithm using deep Q-learning combined with a CNN (convolutional neural network) algorithm. In conventional path planning algorithms, robots need to search a comparatively wide area for n…

Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction Open

Aviral Kumar, Justin Fu, Matthew Soh, George Tucker, Sergey Levine · 2019

Computer science Mathematics Biology

Off-policy reinforcement learning aims to leverage experience collected from prior policies for sample-efficient learning. However, in practice, commonly used off-policy approximate dynamic programming methods based on Q-learning and actor…

Hyperparameter Optimization for Tracking with Continuous Deep Q-Learning Open

Xingping Dong, Jianbing Shen, Wenguan Wang, Yu Liu, Ling Shao, et al. · 2018

Computer science Psychology

Hyperparameters are numerical presets whose values are assigned prior to the commencement of the learning process. Selecting appropriate hyperparameters is critical for the accuracy of tracking algorithms, yet it is difficult to determine …

Quantum agents in the Gym: a variational quantum algorithm for deep Q-learning Open

Andrea Skolik, Sofiène Jerbi, Vedran Dunjko · 2022

Computer science Physics

Quantum machine learning (QML) has been identified as one of the key fields that could reap advantages from near-term quantum devices, next to optimization and quantum chemistry. Research in this area has focused primarily on variational q…

Reinforcement Learning with Parameterized Actions Open

Warwick Masson, Pravesh Ranchod, George Konidaris · 2016

Computer science Mathematics Physics

We introduce a model-free algorithm for learning in Markov decision processes with parameterized actions—discrete actions with continuous parameters. At each step the agent must select both which action to use and which parameters to use w…

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space Open

Jiechao Xiong, Qing Wang, Zhuoran Yang, Peng Sun, Lei Han, et al. · 2018

Computer science Mathematics Physics

Most existing deep reinforcement learning (DRL) frameworks consider either discrete action space or continuous action space solely. Motivated by applications in computer games, we consider the scenario with discrete-continuous hybrid actio…

Distributional Reinforcement Learning with Quantile Regression Open

Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos · 2024

Computer science Mathematics Psychology

In reinforcement learning an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the obs…

Cooperative Deep Q-Learning With Q-Value Transfer for Multi-Intersection Signal Control Open

Hongwei Ge, Yumei Song, Chunguo Wu, Jiankang Ren, Guozhen Tan · 2019

Computer science Engineering

The problem of adaptive traffic signal control in the multi-intersection system has attracted the attention of researchers. Among the existing methods, reinforcement learning has shown to be effective. However, the complex intersection fea…

Dynamic Offloading for Multiuser Muti-CAP MEC Networks: A Deep Reinforcement Learning Approach Open

Chao Li, Junjuan Xia, Fagui Liu, Dong Li, Lisheng Fan, et al. · 2021

Computer science Engineering Mathematics

In this paper, we study a multiuser mobile edge computing (MEC) network, where tasks from users can be partially offloaded to multiple computational access points (CAPs). We consider practical cases where task characteristics and computati…

Off-Policy Interleaved Q-Learning: Optimal Control for Affine Nonlinear Discrete-Time Systems Open

Jinna Li, Tianyou Chai, Frank L. Lewis, Zhengtao Ding, Yi Jiang · 2018

Computer science Mathematics Biology

In this paper, a novel off-policy interleaved Q-learning algorithm is presented for solving optimal control problem of affine nonlinear discrete-time (DT) systems, using only the measured data along the system trajectories. Affine nonlinea…

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels Open

Ilya Kostrikov, Denis Yarats, Rob Fergus · 2020

Computer science History Biology

We propose a simple data augmentation technique that can be applied to standard model-free reinforcement learning algorithms, enabling robust learning directly from pixels without the need for auxiliary losses or pre-training. The approach…

Relations between Model Predictive Control and Reinforcement Learning Open

Daniel Görges · 2017

Computer science Mathematics Biology

In this paper relations between model predictive control and reinforcement learning are studied for discrete-time linear time-invariant systems with state and input constraints and a quadratic value function. The principles of model predic…

A Theoretical Analysis of Deep Q-Learning Open

Jianqing Fan, Zhaoran Wang, Yuchen Xie, Zhuoran Yang · 2019

Computer science Mathematics Biology

Despite the great empirical success of deep reinforcement learning, its theoretical foundation is less well understood. In this work, we make the first attempt to theoretically understand the deep Q-network (DQN) algorithm (Mnih et al., 20…

Offline Reinforcement Learning with Implicit Q-Learning Open

Ilya Kostrikov, Ashvin Nair, Sergey Levine · 2021

Computer science Psychology

Offline reinforcement learning requires reconciling two conflicting aims: learning a policy that improves over the behavior policy that collected the dataset, while at the same time minimizing the deviation from the behavior policy so as t…

Path Planning via an Improved DQN-Based Learning Policy Open

Liangheng Lv, Sunjie Zhang, Derui Ding, Yongxiong Wang · 2019

Computer science Mathematics Economics

The path planning technology is an important part of navigation, which is the core of robotics research. Reinforcement learning is a fashionable algorithm that learns from experience by mimicking the process of human learning skills. When …

Comparative Analysis of Energy Management Strategies for HEV: Dynamic Programming and Reinforcement Learning Open

Heeyun Lee, Changhee Song, Namwook Kim, Suk Won · 2020

Computer science Engineering Mathematics

Energy management strategy is an important factor in determining the fuel economy of hybrid electric vehicles; thus, much research on how to distribute the required power to engines and motors of hybrid vehicles is required. Recently, vari…
