Explanipedia

Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain

Léo Boisvert, Abhay Puri, Chandra Kiran Reddy Evuru, Nicolas Chapados, Quentin Cappart, et al. · 2025

The practice of fine-tuning AI agents on data from their own interactions (such as web browsing or tool use), while being a strong general recipe for improving agentic capabilities, also introduces a critical security vulnerability within…

DoomArena: A Framework for Testing AI Agents Against Evolving Security Threats

Léo Boisvert, Megha Bansal, Chandra Kiran Reddy Evuru, Gabriel Huang, Abhay Puri, et al. · 2025

We present DoomArena, a security evaluation framework for AI agents. DoomArena is designed on three principles: 1) It is a plug-in framework and integrates easily into realistic agentic frameworks like BrowserGym (for web agents) and $τ$-b…

Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-Supervised Learning

Swann Bessa, Darius Dabert, Max Bourgeat, Louis-Martin Rousseau, Quentin Cappart · 2025

Lagrangian decomposition (LD) is a relaxation method that provides a dual bound for constrained optimization problems by decomposing them into more manageable sub-problems. This bound can be used in branch-and-bound algorithms to prune the…

Improving Column Complementarity in a Restricted Master Heuristic with a GRASP-Guided Completion: Application to the Vehicle Routing Problem with Stochastic Demands

Gaël Reynal, Quentin Cappart, Guy Desaulniers, Louis-Martin Rousseau · 2025

The BrowserGym Ecosystem for Web Agent Research

Thibault Le Sellier De Chezelles, Maxime Gasse, Alexandre Drouin, M. Caccia, Léo Boisvert, et al. · 2024

The BrowserGym ecosystem addresses the growing need for efficient evaluation and benchmarking of web agents, particularly those leveraging automation and Large Language Models (LLMs). Many existing benchmarks suffer from fragmentation and …

Learning and fine-tuning a generic value-selection heuristic inside a constraint programming solver

Tom Marty, Léo Boisvert, Tristan François, Pierre Tessier, Louis Gautier, et al. · 2024

Constraint programming is known for being an efficient approach to solving combinatorial problems. Important design choices in a solver are the branching heuristics, designed to lead the search to the best solutions in a minimum amount of…

Winning the 2023 CityLearn Challenge: A Community-Based Hierarchical Energy Systems Coordination Algorithm

Andoni I. Garmendia, Francesco Morri, Quentin Cappart, Hélène Le Cadre · 2024

The effective management and control of building energy systems are crucial for reducing the energy consumption peak loads, CO2 emissions, and ensuring the stability of the power grid, while maintaining optimal comfort levels within buildi…

Learning Valid Dual Bounds in Constraint Programming: Boosted Lagrangian Decomposition with Self-Supervised Learning

Swann Bessa, Darius Dabert, Max Bourgeat, Louis-Martin Rousseau, Quentin Cappart · 2024

Lagrangian decomposition (LD) is a relaxation method that provides a dual bound for constrained optimization problems by decomposing them into more manageable sub-problems. This bound can be used in branch-and-bound algorithms to prune the…

Boost Embodied AI Models with Robust Compression Boundary

Andoni I. Garmendia, Quentin Cappart, Josu Ceberio, Alexander Mendiburu · 2024

The rapid improvement of deep learning models with the integration of the physical world has dramatically improved embodied AI capabilities. Meanwhile, the powerful embodied AI models and their scales place an increasing burden on deployme…

WorkArena++: Towards Compositional Planning and Reasoning-based Common Knowledge Work Tasks

Léo Boisvert, Megh Thakkar, Maxime Gasse, Massimo Caccia, Thibault Le Sellier De Chezelles, et al. · 2024

The ability of large language models (LLMs) to mimic human-like intelligence has led to a surge in LLM-based autonomous agents. Though recent LLMs seem capable of planning and reasoning given user instructions, their effectiveness in apply…

WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks?

Alexandre Drouin, Maxime Gasse, M. Caccia, Issam Laradji, Manuel Del Verme, et al. · 2024

We study the use of large language model-based agents for interacting with software via web browsers. Unlike prior work, we focus on measuring the agents' ability to perform tasks that span the typical daily work of knowledge workers utili…

Towards a Generic Representation of Combinatorial Problems for Learning-Based Approaches

Léo Boisvert, Hélène Verhaeghe, Quentin Cappart · 2024

In recent years, there has been a growing interest in using learning-based approaches for solving combinatorial problems, either in an end-to-end manner or in conjunction with traditional optimization algorithms. In both scenarios, the cha…

Deep Learning for Data-Driven Districting-and-Routing

Arthur Monteiro Ferraz, Quentin Cappart, Thibaut Vidal · 2024

Districting-and-routing is a strategic problem aiming to aggregate basic geographical units (e.g., zip codes) into delivery districts. Its goal is to minimize the expected long-term routing cost of performing deliveries in each district se…

Learning Lagrangian Multipliers for the Travelling Salesman Problem

Augustin Parjadis, Quentin Cappart, Bistra Dilkina, Aaron Ferber, Louis-Martin Rousseau · 2023

Lagrangian relaxation is a versatile mathematical technique employed to relax constraints in an optimization problem, enabling the generation of dual bounds to prove the optimality of feasible solutions and the design of efficient propagat…

Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

Heiko Hoppe, Tobias Enders, Quentin Cappart, Maximilian Schiffer · 2023

We study vehicle dispatching in autonomous mobility on demand (AMoD) systems, where a central operator assigns vehicles to customer requests or rejects these with the aim of maximizing its total profit. Recent approaches use multi-agent de…

An Exact Framework for Solving the Space-Time Dependent TSP

Isaac Rudich, Quentin Cappart, Manuel López‐Ibáñez, Michael Römer, Louis-Martin Rousseau · 2023

Many real-world scenarios involve solving bi-level optimization problems in which there is an outer discrete optimization problem, and an inner problem involving expensive or black-box computation. This arises in space-time dependent varia…

Improved Peel-and-Bound: Methods for Generating Dual Bounds with Multivalued Decision Diagrams

Isaac Rudich, Quentin Cappart, Louis-Martin Rousseau · 2023

Decision diagrams are an increasingly important tool in cutting-edge solvers for discrete optimization. However, the field of decision diagrams is relatively new, and is still incorporating the library of techniques that conventional solve…

Dynamic Routing and Wavelength Assignment with Reinforcement Learning

Peyman Kafaei, Quentin Cappart, Nicolas Chapados, Hamed Pouya, Louis-Martin Rousseau · 2023

With the rapid developments in communication systems, and considering their dynamic nature, all-optical networks are becoming increasingly complex. This study proposes a novel method based on deep reinforcement learning for the routing and…

Learning a Generic Value-Selection Heuristic Inside a Constraint Programming Solver

Tom Marty, Tristan François, Pierre Tessier, Louis Gauthier, Quentin Cappart, et al. · 2023

Constraint programming is known for being an efficient approach for solving combinatorial problems. Important design choices in a solver are the branching heuristics, which are designed to lead the search to the best solutions in a minimum…

Peel-and-Bound: Generating Stronger Relaxed Bounds with Multivalued Decision Diagrams

Isaac Rudich, Quentin Cappart, Louis-Martin Rousseau · 2022

Decision diagrams are an increasingly important tool in cutting-edge solvers for discrete optimization. However, the field of decision diagrams is relatively new, and is still incorporating the library of techniques that conventional solve…

Learning the travelling salesperson problem requires rethinking generalization

Chaitanya K. Joshi, Quentin Cappart, Louis-Martin Rousseau, Thomas Laurent · 2022

End-to-end training of neural network solvers for graph combinatorial optimization problems such as the Travelling Salesperson Problem (TSP) has seen a surge of interest recently, but remains intractable and inefficient beyond graphs with …

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

Maxime Gasse, Quentin Cappart, Jonas Charfreitag, Laurent Charlin, Didier Chételat, et al. · 2022

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring that they often stem from related data distr…

The Machine Learning for Combinatorial Optimization Competition (ML4CO): Results and Insights

Maxime Gasse, Quentin Cappart, Jonas Charfreitag, Laurent Charlin, Didier Chételat, et al. · 2022

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have focused on solving problem instances in isolation, ignoring that they often stem from related data distribu…

Peel-And-Bound: Generating Stronger Relaxed Bounds with Multivalued Decision Diagrams

Isaac Rudich, Quentin Cappart, Louis-Martin Rousseau · 2022

Decision diagrams are an increasingly important tool in cutting-edge solvers for discrete optimization. However, the field of decision diagrams is relatively new, and is still incorporating the library of techniques that conventional solve…

Efficient Minimum Weight Vertex Cover Heuristics Using Graph Neural Networks

Quentin Cappart, Didier Chételat, Elias L. Khalil, Andrea Lodi, Christopher G. Morris, et al. · 2022

Minimum weighted vertex cover is the NP-hard graph problem of choosing a subset of vertices incident to all edges such that the sum of the weights of the chosen vertices is minimum. Previous efforts for solving this in practice have typica…

On Causal Inference for Data-free Structured Pruning

Martin Ferianc, Anush Sankaran, Olivier Mastropietro, Ehsan Saboori, Quentin Cappart · 2021

Neural networks (NNs) are making a large impact both on research and industry. Nevertheless, as NNs' accuracy increases, it is followed by an expansion in their size, required number of compute operations and energy consumption. Increase i…

Combinatorial Optimization and Reasoning with Graph Neural Networks

Quentin Cappart, Didier Chételat, Elias B. Khalil, Andrea Lodi, Christopher Morris, et al. · 2021

Combinatorial optimization is a well-established area in operations research and computer science. Until recently, its methods have mostly focused on solving problem instances in isolation, ignoring the fact that they often stem from relat…

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Quentin Cappart, Thierry Moisan, Louis-Martin Rousseau, Isabeau Prémont-Schwarz, André A. Ciré · 2021

Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal is to find an optimal solution among a finite set of possibilities. The well-known challenge one faces …

SeaPearl: A Constraint Programming Solver guided by Reinforcement Learning

Félix Chalumeau, Ilan Coulon, Quentin Cappart, Louis-Martin Rousseau · 2021

The design of efficient and generic algorithms for solving combinatorial optimization problems has been an active field of research for many years. Standard exact solving approaches are based on a clever and complete enumeration of the …

Quentin Cappart