Explanipedia

Walking and falling: Using robot simulations to model the role of errors in infant walking Open

Ori Ossmy, Danyang Han, Patrick MacAlpine, Justine E. Hoch, Peter Stone , et al. · 2023

What is the optimal penalty for errors in infant skill learning? Behavioral analyses indicate that errors are frequent but trivial as infants acquire foundational skills. In learning to walk, for example, falling is commonplace but appears…

Design and Optimization of an Omnidirectional Humanoid Walk: A Winning Approach at the RoboCup 2011 3D Simulation Competition Open

Patrick MacAlpine, Samuel Barrett, Daniel Urieli, Victor Vu, Peter Stone · 2021

Computer science Mathematics Biology

This paper presents the design and learning architecture for an omnidirectional walk used by a humanoid robot soccer agent acting in the RoboCup 3D simulation environment. The walk, which was originally designed for and tested on an actual…

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL Open

Bogdan Mazoure, Ahmed M. Ahmed, Patrick MacAlpine, R Devon Hjelm, Andrey Kolobov · 2021

Computer science Mathematics Economics

A highly desirable property of a reinforcement learning (RL) agent -- and a major difficulty for deep RL approaches -- is the ability to generalize policies learned on a few tasks over a high-dimensional observation space to similar tasks …

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark Open

Sharada P. Mohanty, Jyotish Poonganam, Adrien Gaidon, Andrey Kolobov, Blake Wulfe , et al. · 2021

Computer science Psychology Mathematics

The NeurIPS 2020 Procgen Competition was designed as a centralized benchmark with clearly defined tasks for measuring Sample Efficiency and Generalization in Reinforcement Learning. Generalization remains one of the most fundamental challe…

Special issue on adaptive and learning agents 2019 Open

Patrick Mannion, Patrick MacAlpine, Bei Peng, Roxana Rădulescu · 2020

Computer science Mathematics Physics

An abstract is not available for this content. As you have access to this content, full HTML content is provided on this page. A PDF of this content is also available in through the ‘Save PDF’ action button.

Multi-Preference Actor Critic Open

Ishan Durugkar, Matthew Hausknecht, Adith Swaminathan, Patrick MacAlpine · 2019

Computer science Mathematics Psychology

Policy gradient algorithms typically combine discounted future rewards with an estimated value function, to compute the direction and magnitude of parameter updates. However, for most Reinforcement Learning tasks, humans can provide additi…

Variety Wins: Soccer-Playing Robots and Infant Walking Open

Ori Ossmy, Justine E. Hoch, Patrick MacAlpine, Shohan Hasan, Peter Stone , et al. · 2018

Computer science

Although both infancy and artificial intelligence (AI) researchers are interested in developing systems that produce adaptive, functional behavior, the two disciplines rarely capitalize on their complementary expertise. Here, we used socce…

Multilayered skill learning and movement coordination for autonomous robotic agents Open

Patrick MacAlpine · 2017

Psychology Computer science Physics

With advances in technology expanding the capabilities of robots, while at the same time making robots cheaper to manufacture, robots are rapidly becoming more prevalent in both industrial and domestic settings. An increase in the number o…

Patrick MacAlpine YOU? Author Swipe