Explanipedia

An Optimisation Framework for Unsupervised Environment Design Open

Nathan Monette, Alistair Letcher, Michael Beukman, Matthew Jackson, Alexander R. Rutherford , et al. · 2025

For reinforcement learning agents to be deployed in high-risk settings, they must achieve a high level of robustness to unfamiliar scenarios. One method for improving robustness is unsupervised environment design (UED), a suite of methods …

Kinetix: Investigating the Training of General Agents through Open-Ended Physics-Based Control Tasks Open

Michael R. Matthews, Michael Beukman, Chris J. Lu, Jakob Foerster · 2024

While large models trained with self-supervised learning on offline datasets have shown remarkable capabilities in text and image domains, achieving the same generalisation for agents that act in sequential decision problems remains an ope…

JaxLife: An Open-Ended Agentic Simulator Open

Chris J. Lu, Michael Beukman, Michael R. Matthews, Jakob Foerster · 2024

Computer science Psychology

Human intelligence emerged through the process of natural selection and evolution on Earth. We investigate what it would take to re-create this process in silico. While past work has often focused on low-level processes (such as simulating…

RobocupGym: A challenging continuous control benchmark in Robocup Open

Michael Beukman, Branden Ingram, Geraud Nangue Tasse, Benjamin Rosman, Pravesh Ranchod · 2024

Computer science Geography

Reinforcement learning (RL) has progressed substantially over the past decade, with much of this progress being driven by benchmarks. Many benchmarks are focused on video or board games, and a large number of robotics benchmarks lack diver…

JaxUED: A simple and useable UED library in Jax Open

Samuel Coward, Michael Beukman, Jakob Foerster · 2024

Computer science Philosophy

We present JaxUED, an open-source library providing minimal dependency implementations of modern Unsupervised Environment Design (UED) algorithms in Jax. JaxUED leverages hardware acceleration to obtain on the order of 100x speedups compar…

Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning Open

Michael R. Matthews, Michael Beukman, Benjamin J. Ellis, Mikayel Samvelyan, Matthew Jackson , et al. · 2024

Computer science Engineering Geography

Benchmarks play a crucial role in the development and analysis of reinforcement learning (RL) algorithms. We identify that existing benchmarks used for research into open-ended learning fall into one of two categories. Either they are too …

Refining Minimax Regret for Unsupervised Environment Design Open

Michael Beukman, Samuel Coward, Michael R. Matthews, Mattie Fellows, Minqi Jiang , et al. · 2024

Computer science Mathematics Chemistry

In unsupervised environment design, reinforcement learning agents are trained on environment configurations (levels) generated by an adversary that maximises some objective. Regret is a commonly used objective that theoretically results in…

Dynamics Generalisation in Reinforcement Learning via Adaptive Context-Aware Policies Open

Michael Beukman, Devon Jarvis, Richard Klein, Steven James, Benjamin Rosman · 2023

Computer science Biology Art

While reinforcement learning has achieved remarkable successes in several domains, its real-world application is limited due to many methods failing to generalise to unfamiliar conditions. In this work, we consider the problem of generalis…

Hierarchical WaveFunction Collapse Open

Michael Beukman, Branden Ingram, Ireton Liu, Benjamin Rosman · 2023

Computer science Mathematics Philosophy

Video game developers are increasingly utilising procedural content generation (PCG) techniques in order to generate more content far quicker than if it were designed. Although promising, much of the successful work to date has been achiev…

Analysing Cross-Lingual Transfer in Low-Resourced African Named Entity Recognition Open

Michael Beukman, Manuel Fokam · 2023

Computer science Engineering Philosophy

Transfer learning has led to large gains in performance for nearly all NLP tasks while making downstream models easier and faster to train. This has also been extended to low-resourced languages, with some success. We investigate the prope…

Hierarchically Composing Level Generators for the Creation of Complex Structures Open

Michael Beukman, Manuel Fokam, Marcel Krüger, Guy Axelrod, Muhammad Umair Nasir , et al. · 2023

Computer science Mathematics Chemistry

Procedural content generation (PCG) is a growing field, with numerous applications in the video game industry and great potential to help create better games at a fraction of the cost of manual creation. However, much of the work in PCG is…

Analysing Cross-Lingual Transfer in Low-Resourced African Named Entity Recognition Open

Michael Beukman, Manuel Fokam · 2023

Computer science Engineering Philosophy

Michael Beukman, Manuel Fokam. Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (Volume 1: Long Papers…

MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition Open

David Ifeoluwa Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Michael Beukman , et al. · 2022

Computer science Geography Engineering

African languages are spoken by over a billion people, but are underrepresented in NLP research and development. The challenges impeding progress include the limited availability of annotated datasets, as well as a lack of understanding of…

Augmentative Topology Agents For Open-Ended Learning Open

Muhammad Umair Nasir, Michael Beukman, Steven James, Christopher W. Cleghorn · 2022

Computer science Mathematics Geology

In this work, we tackle the problem of open-ended learning by introducing a method that simultaneously evolves agents and increasingly challenging environments. Unlike previous open-ended approaches that optimize agents using a fixed neura…

Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African Languages Open

Idris Abdulmumin, Michael Beukman, Jesujoba O. Alabi, Chris Chinenye Emezue, Everlyn Asiko , et al. · 2022

Computer science Chemistry

We participated in the WMT 2022 Large-Scale Machine Translation Evaluation for the African Languages Shared Task. This work describes our approach, which is based on filtering the given noisy data using a sentence-pair classifier that was …

Procedural content generation using neuroevolution and novelty search for diverse video game levels Open

Michael Beukman, Christopher W. Cleghorn, Steven James · 2022

Computer science Mathematics Business

Procedurally generated video game content has the potential to drastically\nreduce the content creation budget of game developers and large studios.\nHowever, adoption is hindered by limitations such as slow generation, as well\nas low qua…

Adaptive Online Value Function Approximation with Wavelets Open

Michael Beukman, Michael Mitchley, Dean Wookey, Steven James, George Konidaris · 2022

Computer science Mathematics Biology

Using function approximation to represent a value function is necessary for continuous and high-dimensional state spaces. Linear function approximation has desirable theoretical guarantees and often requires less compute and samples than n…

Towards Objective Metrics for Procedurally Generated Video Game Levels Open

Michael Beukman, Steven James, Christopher Cleghorn · 2022

Computer science Mathematics Sociology

With increasing interest in procedural content generation by academia and game developers alike, it is vital that different approaches can be compared fairly. However, evaluating procedurally generated video game levels is often difficult,…

A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News Translation Open

David Ifeoluwa Adelani, Jesujoba O. Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen , et al. · 2022

Art Philosophy

David Adelani, Jesujoba Alabi, Angela Fan, Julia Kreutzer, Xiaoyu Shen, Machel Reid, Dana Ruiter, Dietrich Klakow, Peter Nabende, Ernie Chang, Tajuddeen Gwadabe, Freshia Sackey, Bonaventure F. P. Dossou, Chris Emezue, Colin Leong, Michael …

MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition Open

David Ifeoluwa Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Michael Beukman , et al. · 2022

Art Philosophy

David Adelani, Graham Neubig, Sebastian Ruder, Shruti Rijhwani, Michael Beukman, Chester Palen-Michel, Constantine Lignos, Jesujoba Alabi, Shamsuddeen Muhammad, Peter Nabende, Cheikh M. Bamba Dione, Andiswa Bukula, Rooweither Mabuya, Bonav…

Michael Beukman YOU? Author Swipe