Varun Kanade

25 papers · 208 indexed citations

Citation Impact

Addressing environment non-stationarity by repeating Q-learning updates

2016

Taking the Human Out of the Loop: A Review of Bayesian Optimization

2015 Standout

Optimal Experimental Design for Staggered Rollouts

2019 StandoutNobel

Balanced Linear Contextual Bandits

2019 StandoutNobel

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

2012

Deep Reinforcement Learning for Multiagent Systems: A Review of Challenges, Solutions, and Applications

2020 Standout

Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

2013

Sleeping Experts and Bandits with Stochastic Action Availability and Adversarial Rewards

2009