Explanipedia

Observation Adaptation via Annealed Importance Resampling for Partially Observable Markov Decision Processes Open

Yunuo Zhang, Baiting Luo, Ayan Mukhopadhyay, Abhishek Dubey · 2025

Partially observable Markov decision processes (POMDPs) are a general mathematical model for sequential decision-making in stochastic environments under state uncertainty. POMDPs are often solved online, which enables the algorithm to adap…

Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction Open

Baiting Luo, Ava Pettet, Áron Lászka, Abhishek Dubey, Ayan Mukhopadhyay · 2025

Sequential decision-making in high-dimensional continuous action spaces, particularly in stochastic environments, faces significant computational challenges. We explore this challenge in the traditional offline RL setting, where an agent m…

NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes Open

Nathaniel S. Keplinger, Baiting Luo, Iliyas Bektas, Yunuo Zhang, Kyle Hollins Wray , et al. · 2025

Computer science Mathematics

In many real-world applications, agents must make sequential decisions in environments where conditions are subject to change due to various exogenous factors. These non-stationary environments pose significant challenges to traditional de…

Shrinking POMCP: A Framework for Real-Time UAV Search and Rescue Open

Yunuo Zhang, Baiting Luo, Ayan Mukhopadhyay, Dániel Stojcsics, Daniel Elenius , et al. · 2024

Computer science Business

Efficient path optimization for drones in search and rescue operations faces challenges, including limited visibility, time constraints, and complex information gathering in urban environments. We present a comprehensive approach to optimi…

Act as You Learn: Adaptive Decision-Making in Non-Stationary Markov Decision Processes Open

Baiting Luo, Yunuo Zhang, Abhishek Dubey, Ayan Mukhopadhyay · 2024

Computer science Mathematics Engineering

A fundamental (and largely open) challenge in sequential decision-making is dealing with non-stationary environments, where exogenous environmental conditions change over time. Such problems are traditionally modeled as non-stationary Mark…

Dynamic Simplex: Balancing Safety and Performance in Autonomous Cyber Physical Systems Open

Baiting Luo, Shreyas Ramakrishna, Ava Pettet, Christopher B. Kuhn, Gábor Karsai , et al. · 2023

Computer science Engineering Biology

Learning Enabled Components (LEC) have greatly assisted cyber-physical systems in achieving higher levels of autonomy. However, LEC's susceptibility to dynamic and uncertain operating conditions is a critical challenge for the safety of th…

Baiting Luo YOU? Author Swipe