Explanipedia

Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) Open

Noah Ford, Ryan W. Gardner, Austin Juhl, Nathan Larson · 2024

Mathematics Psychology Philosophy

Machine-learning paradigms such as imitation learning and reinforcement learning can generate highly performant agents in a variety of complex environments. However, commonly used methods require large quantities of data and/or a known rew…

Attacking the Diebold Signature Variant -- RSA Signatures with Unverified High-order Padding Open

Ryan W. Gardner, Tadayoshi Kohno, Alec Yasinsac · 2024

Computer science Mathematics Economics

We examine a natural but improper implementation of RSA signature verification deployed on the widely used Diebold Touch Screen and Optical Scan voting machines. In the implemented scheme, the verifier fails to examine a large number of th…

A Risk-Sensitive Approach to Policy Optimization Open

J. Markowitz, Ryan W. Gardner, Ashley J. Llorens, Raman Arora, I-Jeng Wang · 2023

Computer science Mathematics Economics

Standard deep reinforcement learning (DRL) aims to maximize expected reward, considering collected experiences equally in formulating a policy. This differs from human decision-making, where gains and losses are valued differently and outl…

A Risk-Sensitive Approach to Policy Optimization Open

J. Markowitz, Ryan W. Gardner, Ashley J. Llorens, Raman Arora, I-Jeng Wang · 2022

Computer science Mathematics Economics

Standard deep reinforcement learning (DRL) aims to maximize expected reward, considering collected experiences equally in formulating a policy. This differs from human decision-making, where gains and losses are valued differently and outl…

Adaptive Stress Testing: Finding Likely Failure Events with Reinforcement Learning Open

Ritchie Lee, Ole J. Mengshoel, Anshu Saksena, Ryan W. Gardner, Daniel Genin, et al. · 2020

Computer science Mathematics Physics

Finding the most likely path to a set of failure states is important to the analysis of safety-critical systems that operate over a sequence of time steps, such as aircraft collision avoidance systems and autonomous cars. In many app…

On the Complexity of Reconnaissance Blind Chess Open

J. Markowitz, Ryan W. Gardner, Ashley J. Llorens · 2018

Computer science Mathematics Philosophy

This paper provides a complexity analysis for the game of reconnaissance blind chess (RBC), a recently-introduced variant of chess where each player does not know the positions of the opponent's pieces a priori but may reveal a subset of t…

Adaptive Stress Testing: Finding Likely Failure Events with Reinforcement Learning Open

Ritchie Lee, Ole J. Mengshoel, Anshu Saksena, Ryan W. Gardner, Daniel Genin, et al. · 2018

Computer science Mathematics Physics

Finding the most likely path to a set of failure states is important to the analysis of safety-critical systems that operate over a sequence of time steps, such as aircraft collision avoidance systems and autonomous cars. In many applicati…

Adaptive Stress Testing: Finding Failure Events with Reinforcement Learning Open

Ritchie Lee, Ole J. Mengshoel, Anshu Saksena, Ryan W. Gardner, Daniel Genin, et al. · 2018

Computer science Mathematics Physics

Finding the most likely path to a set of failure states is important to the analysis of safety-critical systems that operate over a sequence of time steps, such as aircraft collision avoidance systems and autonomous cars. In many applicati…

Formal verification of ACAS X, an industrial airborne collision avoidance system Open

Jean-Baptiste Jeannin, Khalil Ghorbal, Yanni Kouskoulas, Ryan W. Gardner, Aurora Schmidt, et al. · 2015

Computer science

Formal verification of industrial systems is very challenging, due to reasons ranging from scalability issues to communication difficulties with engineering-focused teams. More importantly, industrial systems are rarely designed for verifi…

Ryan W. Gardner