Explanipedia

Pilot Trainees Benefit from Modelling and Adaptive Feedback Open

Yalmaz Ali Abdullah, Michael Guevarra, Minghao Cai, Jingye Yan, Matthew E. Taylor , et al. · 2025

A Call to Arms: Automated Methods for Identifying Weapons in Social Media Analysis of Conflict Zones Open

Afia Fairoose Abedin, Abdul Bais, Cody Buntain, Laura Courchesne, Brian McQuinn , et al. · 2025

A Systematic Approach to Design Real-World Human-in-the-Loop Deep Reinforcement Learning: Salient Features, Challenges and Trade-offs Open

Jalal Arabneydi, Saiful Islam, Srijita Das, Sai Krishna Gottipati, William Duguay , et al. · 2025

With the growing popularity of deep reinforcement learning (DRL), human-in-the-loop (HITL) approach has the potential to revolutionize the way we approach decision-making problems and create new opportunities for human-AI collaboration. In…

An LLM-Guided Tutoring System for Social Skills Training Open

Michael Guevarra, Indronil Bhattacharjee, Srijita Das, Christabel Wayllace, Carrie Demmans Epp , et al. · 2025

Social skills training targets behaviors necessary for success in social interactions. However, traditional classroom training for such skills is often insufficient to teach effective communication — one-to-one interaction in real-world sc…

Model-Based Exploration in Monitored Markov Decision Processes Open

Alireza Kazemipour, Simone Parisi, Matthew E. Taylor, Michael Bowling · 2025

A tenet of reinforcement learning is that the agent always observes rewards. However, this is not true in many realistic settings, e.g., a human observer may not always be available to provide rewards, sensors may be limited or malfunction…

An LLM-Guided Tutoring System for Social Skills Training Open

Michael Guevarra, Indronil Bhattacharjee, Srijita Das, Christabel Wayllace, Carrie Demmans Epp , et al. · 2025

Social skills training targets behaviors necessary for success in social interactions. However, traditional classroom training for such skills is often insufficient to teach effective communication -- one-to-one interaction in real-world s…

Decentralized coordination of distributed energy resources through local energy markets and deep reinforcement learning Open

Daniel May, Matthew E. Taylor, Petr Musı́lek · 2024

As the energy landscape evolves towards sustainability, the accelerating integration of distributed energy resources poses challenges to the operability and reliability of the electricity grid. One significant aspect of this issue is the n…

Investigating the Benefits of Nonlinear Action Maps in Data-Driven Teleoperation Open

Michael Przystupa, Gauthier Gidel, Matthew E. Taylor, Martin Jägersand, Justus Piater , et al. · 2024

As robots become more common for both able-bodied individuals and those living with a disability, it is increasingly important that lay people be able to drive multi-degree-of-freedom platforms with low-dimensional controllers. One approac…

A novel framework for automated warehouse layout generation Open

Atefeh Shahroudnejad, Payam Mousavi, Oleksii Perepelytsia, Sahir, David Staszak , et al. · 2024

Optimizing warehouse layouts is crucial due to its significant impact on efficiency and productivity. We present an AI-driven framework for automated warehouse layout generation. This framework employs constrained beam search to derive opt…

CANDERE-COACH: Reinforcement Learning from Noisy Feedback Open

Yuxuan Li, Srijita Das, Matthew E. Taylor · 2024

In recent times, Reinforcement learning (RL) has been widely applied to many challenging tasks. However, in order to perform well, it requires access to a good reward function which is often sparse or manually engineered with scope for err…

A Novel Framework for Automated Warehouse Layout Generation Open

Atefeh Shahroudnejad, Payam Mousavi, Oleksii Perepelytsia, Sahir, David Staszak , et al. · 2024

Optimizing warehouse layouts is crucial due to its significant impact on efficiency and productivity. We present an AI-driven framework for automated warehouse layout generation. This framework employs constrained beam search to derive opt…

Video Occupancy Models Open

Manan Tomar, Philippe Hansen-Estruch, Philip Bachman, Alex Lamb, John Langford , et al. · 2024

We introduce a new family of video prediction models designed to support downstream control tasks. We call these models Video Occupancy models (VOCs). VOCs operate in a compact latent space, thus avoiding the need to make predictions about…

Boosting Robustness in Preference-Based Reinforcement Learning with Dynamic Sparsity Open

Calarina Muslimani, Bram Grooten, Deepak Ranganatha Sastry Mamillapalli, Mykola Pechenizkiy, Decebal Constantin Mocanu , et al. · 2024

To integrate into human-centered environments, autonomous agents must learn from and adapt to humans in their native settings. Preference-based reinforcement learning (PbRL) can enable this by learning reward functions from human preferenc…

Applying reinforcement learning to learn best net to rip and re-route in global routing Open

Upma Gandhi, Erfan Aghaeekiasaraee, Sahir, Payam Mousavi, Ismail Bustany , et al. · 2024

Physical designers typically employ heuristics to solve challenging problems in global routing. However, these heuristic solutions are not adaptable to the ever-changing fabrication demands, and the experience and creativity of designers c…

Leveraging Sub-Optimal Data for Human-in-the-Loop Reinforcement Learning Open

Calarina Muslimani, Matthew E. Taylor · 2024

To create useful reinforcement learning (RL) agents, step zero is to design a suitable reward function that captures the nuances of the task. However, reward engineering can be a difficult and time-consuming process. Instead, human-in-the-…

Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning Open

Daniel May, Matthew E. Taylor, Petr Musı́lek · 2024

As distributed energy resources (DERs) grow, the electricity grid faces increased net load variability at the grid edge, impacting operability and reliability. Transactive energy, facilitated through local energy markets, offers a decentra…

FPGA Divide-and-Conquer Placement using Deep Reinforcement Learning Open

Shang Wang, Deepak Ranganatha Sastry Mamillapalli, Tianpei Yang, Matthew E. Taylor · 2024

This paper introduces the problem of learning to place logic blocks in Field-Programmable Gate Arrays (FPGAs) and a learning-based method. In contrast to previous search-based placement algorithms, we instead employ Reinforcement Learning …

A Transfer Approach Using Graph Neural Networks in Deep Reinforcement Learning Open

Tianpei Yang, Heng You, Jianye Hao, Yan Zheng, Matthew E. Taylor · 2024

Transfer learning (TL) has shown great potential to improve Reinforcement Learning (RL) efficiency by leveraging prior knowledge in new tasks. However, much of the existing TL research focuses on transferring knowledge between tasks that s…

PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning Open

Jizhou Wu, Jianye Hao, Tianpei Yang, Xiaotian Hao, Yan Zheng , et al. · 2024

Despite many breakthroughs in recent years, it is still hard for MultiAgent Reinforcement Learning (MARL) algorithms to directly solve complex tasks in MultiAgent Systems (MASs) from scratch. In this work, we study how to use Automatic Cur…

Monitored Markov Decision Processes Open

Simone Parisi, Montaser Mohammedalamen, Alireza Kazemipour, Matthew E. Taylor, Michael Bowling · 2024

In reinforcement learning (RL), an agent learns to perform a task by interacting with an environment and receiving feedback (a numerical reward) for its actions. However, the assumption that rewards are always observable is often not appli…

Human-in-the-Loop Reinforcement Learning: A Survey and Position on Requirements, Challenges, and Opportunities Open

Carl Orge Retzlaff, Srijita Das, Christabel Wayllace, Payam Mousavi, Mohammad Afshari , et al. · 2024

Artificial intelligence (AI) and especially reinforcement learning (RL) have the potential to enable agents to learn and perform tasks autonomously with superhuman performance. However, we consider RL as fundamentally a Human-in-the-Loop (…

GLIDE-RL: Grounded Language Instruction through DEmonstration in RL Open

Chaitanya Kharyal, Sai Krishna Gottipati, Tanmay Kumar Sinha, Srijita Das, Matthew E. Taylor · 2024

One of the final frontiers in the development of complex human - AI collaborative systems is the ability of AI agents to comprehend the natural language and perform tasks accordingly. However, training efficient Reinforcement Learning (RL)…

LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models Open

Qianxi Li, Yingyue Cao, Jikun Kang, Tianpei Yang, Xi Chen , et al. · 2023

Fine-tuning Large Language Models (LLMs) adapts a trained model to specific downstream tasks, significantly improving task-specific performance. Supervised Fine-Tuning (SFT) is a common approach, where an LLM is trained to produce desired …

MaDi: Learning to Mask Distractions for Generalization in Visual Deep Reinforcement Learning Open

Bram Grooten, Tristan Tomilin, Gautham Vasan, Matthew E. Taylor, A. Rupam Mahmood , et al. · 2023

The visual world provides an abundance of information, but many input pixels received by agents often contain distracting stimuli. Autonomous agents need the ability to distinguish useful information from task-irrelevant perceptions, enabl…

Curriculum Learning for Cooperation in Multi-Agent Reinforcement Learning Open

Rupali Bhati, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor · 2023

While there has been significant progress in curriculum learning and continuous learning for training agents to generalize across a wide variety of environments in the context of single-agent reinforcement learning, it is unclear if these …

Human-Machine Teaming for UAVs: An Experimentation Platform Open

Laila El Moujtahid, Sai Krishna Gottipati, Clodéric Mars, Matthew E. Taylor · 2023

Full automation is often not achievable or desirable in critical systems with high-stakes decisions. Instead, human-AI teams can achieve better results. To research, develop, evaluate, and validate algorithms suited for such teaming, light…

A Call to Arms: AI Should be Critical for Social Media Analysis of Conflict Zones Open

Afia Fairoose Abedin, Abdul Bais, Cody Buntain, Laura Courchesne, Brian McQuinn , et al. · 2023

The massive proliferation of social media data represents a transformative opportunity for conflict studies and for tracking the proliferation and use of weaponry, as conflicts are increasingly documented in these online spaces. At the sam…

Can You Improve My Code? Optimizing Programs with Local Search Open

Fatemeh Abdollahi, Saqib Ameen, Matthew E. Taylor, Levi H. S. Lelis · 2023

This paper introduces a local search method for improving an existing program with respect to a measurable objective. Program Optimization with Locally Improving Search (POLIS) exploits the structure of a program, defined by its lines. POL…

Multi-Agent Advisor Q-Learning (Extended Abstract) Open

Sriram Ganapathi Subramanian, Matthew E. Taylor, Kate Larson, Mark Crowley · 2023

In the last decade, there have been significant advances in multi-agent reinforcement learning (MARL) but there are still numerous challenges, such as high sample complexity and slow convergence to stable policies, that need to be overcome…

Can You Improve My Code? Optimizing Programs with Local Search Open

Fatemeh Abdollahi, Saqib Ameen, Matthew E. Taylor, Levi H. S. Lelis · 2023

This paper introduces a local search method for improving an existing program with respect to a measurable objective. Program Optimization with Locally Improving Search (POLIS) exploits the structure of a program, defined by its lines. POL…

Matthew E. Taylor YOU? Author Swipe