Explanipedia

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning Open

Younggyo Seo, Kimin Lee, Ignasi Clavera, Thanard Kurutach, Jinwoo Shin , et al. · 2020

Model-based reinforcement learning (RL) has shown great potential in various control tasks in terms of both sample-efficiency and final performance. However, learning a generalizable dynamics model robust to changes in dynamics remains a c…

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in\n Reinforcement Learning Open

Younggyo Seo, Kimin Lee, Ignasi Clavera, Thanard Kurutach, Jinwoo Shin , et al. · 2020

Computer science Mathematics Biology

Model-based reinforcement learning (RL) has shown great potential in various\ncontrol tasks in terms of both sample-efficiency and final performance.\nHowever, learning a generalizable dynamics model robust to changes in dynamics\nremains …

Model-Augmented Actor-Critic: Backpropagating through Paths Open

Ignasi Clavera, Violet Fu, Pieter Abbeel · 2020

Computer science Mathematics Biology

Current model-based reinforcement learning approaches use the model simply as a learned black-box simulator to augment the data for policy optimization or value function learning. In this paper, we show how to make more effective use of th…

Mutual Information Maximization for Robust Plannable Representations Open

Yiming Ding, Ignasi Clavera, Pieter Abbeel · 2020

Computer science Mathematics Biology

Extending the capabilities of robotics to real-world complex, unstructured environments requires the need of developing better perception systems while maintaining low sample complexity. When dealing with high-dimensional state spaces, cur…

Model-Augmented Actor-Critic: Backpropagating through Paths Open

Ignasi Clavera, Violet Fu, Pieter Abbeel · 2020

Computer science

Current model-based reinforcement learning approaches use the model simply as a learned black-box simulator to augment the data for policy optimization or value function learning. In this paper, we show how to make more effective use of th…

Asynchronous Methods for Model-Based Reinforcement Learning Open

Yunzhi Zhang, Ignasi Clavera, Boren Tsai, Pieter Abbeel · 2019

Computer science Psychology

Significant progress has been made in the area of model-based reinforcement learning. State-of-the-art algorithms are now able to match the asymptotic performance of model-free methods while being significantly more data efficient. However…

Asynchronous Methods for Model-Based Reinforcement Learning. Open

Yunzhi Zhang, Ignasi Clavera, Boren Tsai, Pieter Abbeel · 2019

Computer science

Significant progress has been made in the area of model-based reinforcement learning. State-of-the-art algorithms are now able to match the asymptotic performance of model-free methods while being significantly more data efficient. However…

Benchmarking Model-Based Reinforcement Learning Open

Tingwu Wang, Xuchan Bao, Ignasi Clavera, Jerrick Hoang, Yeming Wen , et al. · 2019

Computer science Engineering Business

Model-based reinforcement learning (MBRL) is widely seen as having the potential to be significantly more sample efficient than model-free RL. However, research in model-based RL has not been very standardized. It is fairly common for auth…

Sub-policy Adaptation for Hierarchical Reinforcement Learning Open

Alexander C. Li, Carlos Florensa, Ignasi Clavera, Pieter Abbeel · 2019

Computer science Mathematics Engineering

Hierarchical reinforcement learning is a promising approach to tackle long-horizon decision-making problems with sparse rewards. Unfortunately, most methods still decouple the lower-level skill acquisition process and the training of a hig…

ProMP: Proximal Meta-Policy Search Open

Jonas Rothfuss, Dennis Lee, Ignasi Clavera, Tamim Asfour, Pieter Abbeel · 2018

Computer science Economics Psychology

Credit assignment in Meta-reinforcement learning (Meta-RL) is still poorly understood. Existing methods either neglect credit assignment to pre-adaptation behavior or implement it naively. This leads to poor sample-efficiency during meta-t…

Model-Based Reinforcement Learning via Meta-Policy Optimization Open

Ignasi Clavera, Jonas Rothfuss, John Schulman, Yasuhiro Fujita, Tamim Asfour , et al. · 2018

Computer science Economics Physics

Model-based reinforcement learning approaches carry the promise of being data efficient. However, due to challenges in learning dynamics models that sufficiently match the real-world dynamics, they struggle to achieve the same asymptotic p…

Learning to Adapt in Dynamic, Real-World Environments Through Meta-Reinforcement Learning Open

Anusha Nagabandi, Ignasi Clavera, Simin Liu, Ronald S. Fearing, Pieter Abbeel , et al. · 2018

Computer science Engineering Physics

Although reinforcement learning methods can achieve impressive results in simulation, the real world presents two major challenges: generating samples is exceedingly expensive, and unexpected perturbations or unseen situations cause profic…

Learning to Adapt: Meta-Learning for Model-Based Control Open

Ignasi Clavera, Anusha Nagabandi, Ronald S. Fearing, Pieter Abbeel, Sergey Levine , et al. · 2018

Computer science Engineering Biology

Although reinforcement learning methods can achieve impressive results in simulation, the real world presents two major challenges: generating samples is exceedingly expensive, and unexpected perturbations can cause proficient but narrowly…

Learning to Adapt in Dynamic, Real-World Environments Through\n Meta-Reinforcement Learning Open

Anusha Nagabandi, Ignasi Clavera, Simin Liu, Ronald S. Fearing, Pieter Abbeel , et al. · 2018

Computer science Engineering Biology

Although reinforcement learning methods can achieve impressive results in\nsimulation, the real world presents two major challenges: generating samples is\nexceedingly expensive, and unexpected perturbations or unseen situations cause\npro…

Model-Ensemble Trust-Region Policy Optimization Open

Thanard Kurutach, Ignasi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel · 2018

Computer science Geography Chemistry

Model-free reinforcement learning (RL) methods are succeeding in a growing number of tasks, aided by recent advances in deep learning. However, they tend to suffer from high sample complexity, which hinders their use in real-world domains.…

Model-Ensemble Trust-Region Policy Optimization Open

Thanard Kurutach, Ignasi Clavera, Yan Duan, Aviv Tamar, Pieter Abbeel · 2018

Computer science Chemistry Geography

Model-free reinforcement learning (RL) methods are succeeding in a growing number of tasks, aided by recent advances in deep learning. However, they tend to suffer from high sample complexity, which hinders their use in real-world domains.…

Ignasi Clavera YOU? Author Swipe