Standout Papers

Noisy Networks For Exploration 2018 2026 2020 2023 221
  1. Noisy Networks For Exploration (2018)
    Meire Fortunato, Mohammad Gheshlaghi Azar et al. arXiv (Cornell University)
  2. A distributional code for value in dopamine-based reinforcement learning (2020)
    Will Dabney, Zeb Kurth‐Nelson et al. Nature

Citation Impact

Citing Papers

Navigating the protein fitness landscape with Gaussian processes
2012 StandoutNobel
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
2018 StandoutScienceNobel
Bayesian inference in Monte-Carlo tree search
2010
On correlation and budget constraints in model-based bandit optimization with application to automatic machine learning
2014
Integrating opponent models with Monte-Carlo tree search in poker
2010
Multi-agent Monte Carlo Go
2011
Simulation-based approach to general game playing
2008
Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning
2007
Bayes meets bellman: the Gaussian process approach to temporal difference learning
2003
Hyperband: a novel bandit-based approach to hyperparameter optimization
2017
Feature Selection as a One-Player Game
2010
Optimistic Bayesian sampling in contextual-bandit problems
2012
QUIC-SVD: Fast SVD Using Cosine Trees
2008
Towards MCTS for Creative Domains.
2011
{Bayesian Multi-Scale Optimistic Optimization}
2014
Contextual Gaussian Process Bandit Optimization
2011
Convergence Rates of Efficient Global Optimization Algorithms
2011
Achieving master level play in 9×9 computer go
2008
Learning from Demonstrations for Real World Reinforcement Learning
2017
Neural Mechanisms of Hierarchical Planning in a Virtual Subway Network
2016 StandoutNobel
Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design
2009
Advancing mathematics by guiding human intuition with AI
2021 StandoutNatureNobel
Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network
2018 Standout
Mastering the game of Go without human knowledge
2017 StandoutNatureNobel
Backpropagation and the brain
2020 StandoutNobel
Gaussian Process Dynamical Models for Human Motion
2007 Standout
Dermatologist-level classification of skin cancer with deep neural networks
2017 StandoutNature
Human-level control through deep reinforcement learning
2015 StandoutNatureNobel
Reinforcement Learning, Fast and Slow
2019 StandoutNobel
Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
2005 Standout
Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
2017 StandoutNobel
Reinforcement Learning on Slow Features of High-Dimensional Input Streams
2010
A guide to deep learning in healthcare
2018 Standout
Mastering the game of Go with deep neural networks and tree search
2016 StandoutNatureNobel
Neuroscience-Inspired Artificial Intelligence
2017 StandoutNobel
Discovering faster matrix multiplication algorithms with reinforcement learning
2022 StandoutNatureNobel
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
2021 Standout
Artificial intelligence in radiology
2018 Standout
Deep learning in neural networks: An overview
2014 Standout
The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments
2009
Taking the Human Out of the Loop: A Review of Bayesian Optimization
2015 Standout
Machine learning: Trends, perspectives, and prospects
2015 StandoutScience
Machine learning for data-driven discovery in solid Earth geoscience
2019 StandoutScience
Learning Simulation Control in General Game-Playing Agents
2010
Computer poker: A review
2011
Optimization for Machine Learning
2011
All-optical machine learning using diffractive deep neural networks
2018 StandoutScience
Federated Learning in Mobile Edge Networks: A Comprehensive Survey
2020 Standout
Monte-Carlo Simulation Balancing Applied to 9x9 Go
2010
Active learning machine learns to create new quantum experiments
2018 StandoutNobel
An object-oriented representation for efficient reinforcement learning
2008
CadiaPlayer: A Simulation-Based General Game Player
2009
Reinforcement learning for building controls: The opportunities and challenges
2020 Standout
Deep learning with coherent nanophotonic circuits
2017 Standout
Big Data and Machine Learning in Health Care
2018 Standout
Evaluation of Game Tree Search Methods by Game Records
2010
Temporal-difference search in computer Go
2012
Monte-Carlo Tree Search for the game of Scotland Yard
2011
Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems
2012
Reconfigurable Intelligent Surfaces: Principles and Opportunities
2021 Standout
The Power of Forgetting: Improving the Last-Good-Reply Policy in Monte Carlo Go
2010
Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting
2012
Upper Confidence Trees and Billiards for Optimal Active Learning
2009
Monte-Carlo tree search and rapid action value estimation in computer Go
2011
Evaluating Root Parallelization in Go
2010
Knowledge Distillation: A Survey
2021 Standout
Using Intelligent Search Techniques to Play the Game Khet
2009
Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning
2016
Deep learning for AI
2021 StandoutNobel
PARAMETER TUNING BY SIMPLE REGRET ALGORITHMS AND MULTIPLE SIMULTANEOUS HYPOTHESIS TESTING
2010
COMPUTING “ELO RATINGS” OF MOVE PATTERNS IN THE GAME OF GO1
2007
Current Frontiers in Computer Go
2010
Human-level performance in 3D multiplayer games with population-based reinforcement learning
2019 StandoutScienceNobel
A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients
2012
Balancing MCTS by Dynamically Adjusting the Komi Value
2011
Monte-Carlo Tree Search in Backgammon
2007
Fast Approximate Max-n Monte Carlo Tree Search for Ms Pac-Man
2011
Applications of Deep Reinforcement Learning in Communications and Networking: A Survey
2019 Standout
Learning to Optimize via Posterior Sampling
2014
A Survey of Monte Carlo Tree Search Methods
2012 Standout
Deep Reinforcement Learning: A Survey
2022 Standout
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
2017 Standout
CROSS-ENTROPY FOR MONTE-CARLO TREE SEARCH
2008
PLAYING THE RIGHT ATARI
2007
Multi-armed bandits with episode context
2011
On hyperparameter optimization of machine learning algorithms: Theory and practice
2020 Standout
A Comprehensive Survey of Multiagent Reinforcement Learning
2008 Standout
A survey of deep neural network architectures and their applications
2016 Standout
Places: A 10 Million Image Database for Scene Recognition
2017 Standout
T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction
2019 Standout
Deep Reinforcement Learning: A Brief Survey
2017 Standout
Fuego—An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search
2010
CadiaPlayer: Search-Control Techniques
2011
Sample-Based Planning for Continuous Action Markov Decision Processes
2011
Selective search in games of different complexity
2011
A Shogi Program Based on Monte-Carlo Tree Search
2010
Infrared Patch-Image Model for Small Target Detection in a Single Image
2013 Standout
Deep learning with convolutional neural networks for EEG decoding and visualization
2017 Standout
Top-down design of protein architectures with reinforcement learning
2023 StandoutScienceNobel
Deep Learning in Medical Image Analysis
2017 Standout
Active Opening Book Application for Monte-Carlo Tree Search in 19×19 Go
2011
Utilisation de la recherche arborescente Monte-Carlo au Hex
2009
Monte Carlo Tree Search in Hex
2010
Approximate policy iteration: a survey and some new methods
2011
The Last-Good-Reply Policy for Monte-Carlo Go
2009
Gaussian Process Optimization with Mutual Information
2013

Works of Rémi Munos being referenced

X -Armed Bandits
2011
Noisy Networks For Exploration
2018 StandoutNobel
Online Optimization in X-Armed Bandits
2008
Finite-Sample Analysis of LSTD
2010
Error bounds for approximate policy iteration
2003
Error bounds for approximate value iteration
2005
Combining policy gradient and Q-learning
2017
Adaptive play in Texas Hold'em Poker
2008
Optimistic Optimization of a Deterministic Function without the Knowledge of its Smoothness
2011
Adaptive Bandits: Towards the best history-dependent strategy
2011
Algorithms for Infinitely Many-Armed Bandits
2008
Count-based exploration with neural density models
2017
Finite Time Analysis of Stratified Sampling for Monte Carlo
2011
A Study of Reinforcement Learning in the Continuous Case by the Means of Viscosity Solutions
2000
From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning
2014
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
2007
Distributional Reinforcement Learning With Quantile Regression
2018
A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences
2011
Pure exploration in finitely-armed and continuous-armed bandits
2010
Variable Resolution Discretization in Optimal Control
2002
Modification of UCT with Patterns in Monte-Carlo Go
2006
Performance Bounds in $L_p$‐norm for Approximate Value Iteration
2007
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits
2009
Distributional Reinforcement Learning with Quantile Regression
2017
Use of variance estimation in the multi-armed bandit problem
2006
Finite time bounds for sampling based fitted value iteration
2005
Rankless by CCL
2026