Explanipedia

A policy gradient approach for optimization of smooth risk measures Open

Nithia Vijayan, Prashanth L. A · 2022

We propose policy gradient algorithms for solving a risk-sensitive reinforcement learning (RL) problem in on-policy as well as off-policy settings. We consider episodic Markov decision processes, and model the risk using the broad class of…

Smoothed functional-based gradient algorithms for off-policy reinforcement learning: A non-asymptotic viewpoint Open

Nithia Vijayan, Prashanth L. A · 2021

Likelihood ratio-based policy gradient methods for distorted risk measures: A non-asymptotic analysis. Open

Nithia Vijayan, L. A. Prashanth · 2021

We propose policy-gradient algorithms for solving the problem of control in a risk-sensitive reinforcement learning (RL) context. The objective of our algorithm is to maximize the distorted risk measure (DRM) of the cumulative reward in an…

Policy Gradient Methods for Distortion Risk Measures Open

Nithia Vijayan, Prashanth L. A · 2021

We propose policy gradient algorithms which learn risk-sensitive policies in a reinforcement learning (RL) framework. Our proposed algorithms maximize the distortion risk measure (DRM) of the cumulative reward in an episodic Markov decisio…

Smoothed functional-based gradient algorithms for off-policy reinforcement learning. Open

Nithia Vijayan, L. A. Prashanth · 2021

We consider the problem of control in an off-policy reinforcement learning (RL) context. We propose a policy gradient scheme that incorporates a smoothed functional-based gradient estimation scheme. We provide an asymptotic convergence gua…

Smoothed functional-based gradient algorithms for off-policy\n reinforcement learning: A non-asymptotic viewpoint Open

Nithia Vijayan, Prashanth L. A · 2021

We propose two policy gradient algorithms for solving the problem of control\nin an off-policy reinforcement learning (RL) context. Both algorithms\nincorporate a smoothed functional (SF) based gradient estimation scheme. The\nfirst algori…

Nithia Vijayan YOU? Author Swipe