Explanipedia

Steering Language Models with Game-Theoretic Solvers Open

Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, et al. · 2024

Computer science Mathematics

Mathematical models of interactions among rational agents have long been studied in game theory. However these interactions are often over a small set of discrete game actions which is very different from how humans communicate in natural …

Bayesian controller fusion: Leveraging control priors in deep reinforcement learning for robotics Open

Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, et al. · 2023

Computer science Engineering Biology

We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable …

Human-Timescale Adaptation in an Open-Ended Task Space Open

Adaptive Agent Team, Jakob Bauer, Kate Baumli, Satinder Baveja, Feryal Behbahani, et al. · 2023

Computer science Engineering Physics

Foundation models have shown impressive adaptation and scalability in supervised and self-supervised learning problems, but so far these successes have not fully translated to reinforcement learning (RL). In this work, we demonstrate that …

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments Open

Ian Gemp, T Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, et al. · 2022

Computer science

The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-agent learning ranging from computing approximations to fundamental concepts in game theory to simulating social dilemmas in rich spatial environments and trai…

The Challenges of Exploration for Offline Reinforcement Learning Open

Nathan Lambert, Markus Wulfmeier, William Dwight Whitney, Arunkumar Byravan, Michael Bloesch, et al. · 2022

Computer science Psychology Philosophy

Offline Reinforcement Learning (ORL) enables us to separately study the two interlinked processes of reinforcement learning: collecting informative experience and inferring optimal behaviour. The second step has been widely studied in the o…

Efficient and stable reinforcement learning for robotics Open

Vibhavari Dasagi · 2022

Computer science Geography Mathematics

Reinforcement Learning (RL) has long been used for learning behaviour through agent-collected experience, recently boosted by deep neural networks. However, typical deep RL agents require millions of training data samples, equating to days…

Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots Open

Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, et al. · 2021

Computer science Engineering Biology

While deep reinforcement learning (RL) agents have demonstrated incredible potential in attaining dexterous behaviours for robotics, they tend to make errors when deployed in the real world due to mismatches between the training and execut…

Is Curiosity All You Need? On the Utility of Emergent Behaviours from Curious Exploration Open

Oliver Groth, Markus Wulfmeier, Giulia Vezzani, Vibhavari Dasagi, Tim Hertweck, et al. · 2021

Computer science Psychology Engineering

Curiosity-based reward schemes can present powerful exploration mechanisms which facilitate the discovery of solutions for complex, sparse or long-horizon tasks. However, as the agent learns to reach previously unexplored spaces and the ob…

Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics Open

Krishan Rana, Vibhavari Dasagi, Jesse Haviland, Ben Talbot, Michael Milford, et al. · 2021

Computer science Engineering Biology

We present Bayesian Controller Fusion (BCF): a hybrid control strategy that combines the strengths of traditional hand-crafted controllers and model-free deep reinforcement learning (RL). BCF thrives in the robotics domain, where reliable …

Multiplicative Controller Fusion: Leveraging Algorithmic Priors for Sample-efficient Reinforcement Learning and Safe Sim-To-Real Transfer Open

Krishan Rana, Vibhavari Dasagi, Ben Talbot, Michael Milford, Niko Sünderhauf · 2020

Computer science Engineering Chemistry

Learning-based approaches often outperform hand-coded algorithmic solutions for many problems in robotics. However, learning long-horizon tasks on real robot hardware can be intractable, and transferring a learned policy from simulation to…

Learning Arbitrary-Goal Fabric Folding with One Hour of Real Robot Experience Open

Robert Lee, Daniel B. Ward, Akansel Cosgun, Vibhavari Dasagi, Peter Corke, et al. · 2020

Computer science Engineering Biology

Manipulating deformable objects, such as fabric, is a long standing problem in robotics, with state estimation and control posing a significant challenge for traditional methods. In this paper, we show that it is possible to learn fabric f…

Residual Reactive Navigation: Combining Classical and Learned Navigation Strategies For Deployment in Unknown Environments Open

Krishan Rana, Ben Talbot, Vibhavari Dasagi, Michael Milford, Niko Sünderhauf · 2020

Computer science Biology Physics

In this work we focus on improving the efficiency and generalisation of learned navigation strategies when transferred from its training environment to previously unseen ones. We present an extension of the residual reinforcement learning …

Multiplicative Controller Fusion: Leveraging Algorithmic Priors for Sample-efficient Reinforcement Learning and Safe Sim-To-Real Transfer Open

Krishan Rana, Vibhavari Dasagi, Ben Talbot, Michael Milford, Niko Sünderhauf · 2020

Computer science Engineering Mathematics

Learning-based approaches often outperform hand-coded algorithmic solutions for many problems in robotics. However, learning long-horizon tasks on real robot hardware can be intractable, and transferring a learned policy from simulation…

Evaluating task-agnostic exploration for fixed-batch learning of arbitrary future tasks Open

Vibhavari Dasagi, Robert Lee, Jake Bruce, Jürgen Leitner · 2019

Computer science Engineering Mathematics

Deep reinforcement learning has been shown to solve challenging tasks where large amounts of training experience is available, usually obtained online while learning the task. Robotics is a significant potential application domain for many…

Ctrl-Z: Recovering from Instability in Reinforcement Learning Open

Vibhavari Dasagi, Jake Bruce, Thierry Peynot, Jürgen Leitner · 2019

Psychology Computer science Physics

When learning behavior, training data is often generated by the learner itself; this can result in unstable training dynamics, and this problem has particularly important applications in safety-sensitive real-world control tasks such as ro…

Sim-to-Real Transfer of Robot Learning with Variable Length Inputs Open

Vibhavari Dasagi, Robert Lee, Serena Mou, Jake Bruce, Niko Sünderhauf, et al. · 2018

Computer science Mathematics Engineering

Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating …

Zero-shot Sim-to-Real Transfer with Modular Priors Open

Robert Lee, Serena Mou, Vibhavari Dasagi, Jake Bruce, Jürgen Leitner, et al. · 2018

Computer science Engineering

Current end-to-end deep Reinforcement Learning (RL) approaches require jointly learning perception, decision-making and low-level control from very sparse reward signals and high-dimensional inputs, with little capability of incorporating …
