Citation Impact
Citing Papers
Addressing environment non-stationarity by repeating Q-learning updates
2016
Taking the Human Out of the Loop: A Review of Bayesian Optimization
2015 Standout
Optimal Experimental Design for Staggered Rollouts
2019 Standout, Nobel
Balanced Linear Contextual Bandits
2019 Standout, Nobel
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
2012
Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications
2020 Standout
Works of Varun Kanade being referenced
Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions
2013
Sleeping Experts and Bandits with Stochastic Action Availability and Adversarial Rewards
2009