Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms

Exploring foci of: arXiv (Cornell University) Syndicated Bandits: A Framework for Auto Tuning Hyper-parameters in Contextual Bandit Algorithms June 2021 • Qin Ding, Yiwei Liu, Cho‐Jui Hsieh, James Sharpnack The stochastic contextual bandit problem, which models the trade-off between exploration and exploitation, has many real applications, including recommender systems, online advertising and clinical trials. As many other machine learning algorithms, contextual bandit algorithms often have one or more hyper-parameters. As an example, in most optimal stochastic contextual bandit algorithms, there is an unknown exploration parameter which controls the trade-off between exploration and exploitation. A proper choice of … Open Article Page

Computer Science Multi-Armed Bandit Artificial Intelligence Machine Learning Recommender System Algorithm Mathematics Open Article