Maxime Chevalier-Boisvert

Evaluating YJIT’s Performance in a Production Context: A Pragmatic Approach

Maxime Chevalier-Boisvert, Takashi Kokubun, Noah Gibbs, Si Xing Wu, Aaron Patterson, et al. · 2023

Ruby is a dynamically-typed programming language with a large breadth of features which has grown in popularity with the rise of the modern web, and remains at the core of the implementation of widely-used online platforms such as Shopify,…

Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks

Maxime Chevalier-Boisvert, Bolun Dai, Mark Towers, Rodrigo de Lazcano, Lucas Willems, et al. · 2023

Computer science, Engineering, History

We present the Minigrid and Miniworld libraries which provide a suite of goal-oriented 2D and 3D environments. The libraries were explicitly created with a minimalistic design paradigm to allow users to rapidly develop new environments for…

Proceedings of the 3rd Wordplay: When Language Meets Games Workshop (Wordplay 2022)

Shrimai Prabhumoye (Nvidia), Tim Rocktäschel, Hal Daumé, Peter Jansen, Lynn Cherny (Ghostweather), et al. · 2022

Computer science

Since the dawn of the digital age, interactive virtual environments and electronic games have played a huge role in shaping our lives. Not only are they a source of entertainment, but they also teach us important life skills such as strategi…

YJIT: a basic block versioning JIT compiler for CRuby

Maxime Chevalier-Boisvert, Noah Gibbs, Jean Boussier, Si Xing Wu, Aaron Patterson, et al. · 2021

Computer science, Psychology, Mathematics

Ruby is a dynamically typed programming language with a large breadth of features which has grown in popularity with the rise of the modern web, and remains at the core of the implementation of many widely-used websites.

Combating False Negatives in Adversarial Imitation Learning

Konrad Żołna, Chitwan Saharia, Léonard Boussioux, David Yu-Tung Hui, Maxime Chevalier-Boisvert, et al. · 2021

Computer science, Psychology, Engineering

In adversarial imitation learning, a discriminator is trained to differentiate agent episodes from expert demonstrations representing the desired behavior. However, as the trained policy learns to be more successful, the negative examples …

DeepDrummer: Generating Drum Loops using Deep Learning and a Human in the Loop

Guillaume Alain, Maxime Chevalier-Boisvert, Frédéric Osterrath, Remi Piché-Taillefer · 2020

Computer science, Mathematics, Engineering

DeepDrummer is a drum loop generation tool that uses active learning to learn the preferences (or current artistic intentions) of a human user from a small number of interactions. The principal goal of this tool is to enable an efficient e…

BabyAI 1.1

David Yu-Tung Hui, Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Yoshua Bengio · 2020

Business

The BabyAI platform is designed to measure the sample efficiency of training an agent to follow grounded-language instructions. BabyAI 1.0 presents baseline results of an agent trained by deep imitation or reinforcement learning. BabyAI 1.…

BabyAI 1.1.

David Yu-Tung Hui, Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Yoshua Bengio · 2020

Computer science, Psychology, Chemistry

The BabyAI platform is designed to measure the sample efficiency of training an agent to follow grounded-language instructions. BabyAI 1.0 presents baseline results of an agent trained by deep imitation or reinforcement learning. BabyAI 1.…

Combating False Negatives in Adversarial Imitation Learning (Student Abstract)

Konrad Żołna, Chitwan Saharia, Léonard Boussioux, David Yu-Tung Hui, Maxime Chevalier-Boisvert, et al. · 2020

Computer science, Psychology, Chemistry

We define the False Negatives problem and show that it is a significant limitation in adversarial imitation learning. We propose a method that solves the problem by leveraging the nature of goal-conditioned tasks. The method, dubbed Fake C…

Options of Interest: Temporal Abstraction with Interest Functions

Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre‐Luc Bacon, Doina Precup · 2020

Computer science, Mathematics, Philosophy

Temporal abstraction refers to the ability of an agent to use behaviours of controllers which act for a limited, variable amount of time. The options framework describes such behaviours as consisting of a subset of states in which they can…

Automated curriculum generation for Policy Gradients from Demonstrations

Anirudh Srinivasan, Dzmitry Bahdanau, Maxime Chevalier-Boisvert, Yoshua Bengio · 2019

Computer science, Political science, Psychology

In this paper, we present a technique that improves the process of training an agent (using RL) for instruction following. We develop a training curriculum that uses a nominal number of expert demonstrations and trains the agent in a manne…

Option-Critic in Cooperative Multi-agent Systems

Jhelum Chakravorty, Nadeem Ward, Julien Le Roy, Maxime Chevalier-Boisvert, Sumana Basu, et al. · 2019

Computer science, Mathematics, Economics

In this paper, we investigate learning temporal abstractions in cooperative multi-agent systems, using the options framework (Sutton et al., 1999). First, we address the planning problem for the decentralized POMDP represented by the multi-…

Robo-PlaNet: Learning to Poke in a Day

Maxime Chevalier-Boisvert, Guillaume Alain, Florian Golemo, Derek Nowrouzezahrai · 2019

Computer science, Physics, Economics

Recently, the Deep Planning Network (PlaNet) approach was introduced as a model-based reinforcement learning method that learns environment dynamics directly from pixel observations. This architecture is useful for learning tasks in which …

BabyAI: First Steps Towards Grounded Language Learning With a Human In the Loop.

Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Salem Lahlou, Lucas Willems, Chitwan Saharia, et al. · 2018

Computer science, Psychology, Chemistry

Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and scientific reasons, but given the poor data efficiency of the current learning methods, this goal may require …

BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning

Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Salem Lahlou, Lucas Willems, Chitwan Saharia, et al. · 2018

Computer science, Mathematics, Chemistry

Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and scientific reasons, but given the poor data efficiency of the current learning methods, this goal may require …

Interprocedural Type Specialization of JavaScript Programs Without Type Analysis

Maxime Chevalier-Boisvert, Marc Feeley · 2016

Computer science

Previous work proposed lazy basic block versioning, a technique for just-in-time compilation of dynamic languages which we believe represents an interesting point in the design space. Basic block versioning is simple to implement, simple e…

Interprocedural Type Specialization of JavaScript Programs Without Type Analysis

Maxime Chevalier-Boisvert, Marc Feeley · 2016

Computer science, Philosophy, Mathematics

Previous work proposed lazy basic block versioning, a technique for just-in-time compilation of dynamic languages which we believe represents an interesting point in the design space. Basic block versioning is simple to implement, simple e…
