Levi Lu

Boosted Off-Policy Learning

Ben London, Levi Lu, Ted Sandler, Thorsten Joachims · 2022

Computer science Chemistry

We propose the first boosting algorithm for off-policy learning from logged bandit feedback. Unlike existing boosting methods for supervised learning, our algorithm directly optimizes an estimate of the policy's expected reward. We analyze…
