Rémi Munos

72 papers · 2.4k indexed citations

Standout Papers

Noisy Networks For Exploration (2018)
Meire Fortunato, Mohammad Gheshlaghi Azar et al. arXiv (Cornell University)
A distributional code for value in dopamine-based reinforcement learning (2020)
Will Dabney, Zeb Kurth‐Nelson et al. Nature

Citation Impact

Citing Papers

Navigating the protein fitness landscape with Gaussian processes

2012 Standout Nobel

A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play

2018 Standout Science Nobel

Bayesian inference in Monte-Carlo tree search

2010

On correlation and budget constraints in model-based bandit optimization with application to automatic machine learning

2014

Integrating opponent models with Monte-Carlo tree search in poker

2010

Multi-agent Monte Carlo Go

2011

Simulation-based approach to general game playing

2008

Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning

2007

Bayes meets bellman: the Gaussian process approach to temporal difference learning

2003

Hyperband: a novel bandit-based approach to hyperparameter optimization

2017

Feature Selection as a One-Player Game

2010

Optimistic Bayesian sampling in contextual-bandit problems

2012

QUIC-SVD: Fast SVD Using Cosine Trees

2008

Towards MCTS for Creative Domains

2011

Bayesian Multi-Scale Optimistic Optimization

2014

Contextual Gaussian Process Bandit Optimization

2011

Convergence Rates of Efficient Global Optimization Algorithms

2011

Achieving master level play in 9×9 computer go

2008

Learning from Demonstrations for Real World Reinforcement Learning

2017

Neural Mechanisms of Hierarchical Planning in a Virtual Subway Network

2016 Standout Nobel

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

2009

Advancing mathematics by guiding human intuition with AI

2021 Standout Nature Nobel

Cardiologist-level arrhythmia detection and classification in ambulatory electrocardiograms using a deep neural network

2018 Standout

Mastering the game of Go without human knowledge

2017 Standout Nature Nobel

Backpropagation and the brain

2020 Standout Nobel

Gaussian Process Dynamical Models for Human Motion

2007 Standout

Dermatologist-level classification of skin cancer with deep neural networks

2017 Standout Nature

Human-level control through deep reinforcement learning

2015 Standout Nature Nobel

Reinforcement Learning, Fast and Slow

2019 Standout Nobel

Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control

2005 Standout

Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization

2017 Standout Nobel

Reinforcement Learning on Slow Features of High-Dimensional Input Streams

2010

A guide to deep learning in healthcare

2018 Standout

Mastering the game of Go with deep neural networks and tree search

2016 Standout Nature Nobel

Neuroscience-Inspired Artificial Intelligence

2017 Standout Nobel

Discovering faster matrix multiplication algorithms with reinforcement learning

2022 Standout Nature Nobel

Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

2021 Standout

Artificial intelligence in radiology

2018 Standout

Deep learning in neural networks: An overview

2014 Standout

The Computational Intelligence of MoGo Revealed in Taiwan's Computer Go Tournaments

2009

Taking the Human Out of the Loop: A Review of Bayesian Optimization

2015 Standout

Machine learning: Trends, perspectives, and prospects

2015 Standout Science

Machine learning for data-driven discovery in solid Earth geoscience

2019 Standout Science

Learning Simulation Control in General Game-Playing Agents

2010

Computer poker: A review

2011

Optimization for Machine Learning

2011

All-optical machine learning using diffractive deep neural networks

2018 Standout Science

Federated Learning in Mobile Edge Networks: A Comprehensive Survey

2020 Standout

Monte-Carlo Simulation Balancing Applied to 9x9 Go

2010

Active learning machine learns to create new quantum experiments

2018 Standout Nobel

An object-oriented representation for efficient reinforcement learning

2008

CadiaPlayer: A Simulation-Based General Game Player

2009

Reinforcement learning for building controls: The opportunities and challenges

2020 Standout

Deep learning with coherent nanophotonic circuits

2017 Standout

Big Data and Machine Learning in Health Care

2018 Standout

Evaluation of Game Tree Search Methods by Game Records

2010

Temporal-difference search in computer Go

2012

Monte-Carlo Tree Search for the game of Scotland Yard

2011

Regret Analysis of Stochastic and Nonstochastic Multi-armed Bandit Problems

2012

Reconfigurable Intelligent Surfaces: Principles and Opportunities

2021 Standout

The Power of Forgetting: Improving the Last-Good-Reply Policy in Monte Carlo Go

2010

Information-Theoretic Regret Bounds for Gaussian Process Optimization in the Bandit Setting

2012

Upper Confidence Trees and Billiards for Optimal Active Learning

2009

Monte-Carlo tree search and rapid action value estimation in computer Go

2011

Evaluating Root Parallelization in Go

2010

Knowledge Distillation: A Survey

2021 Standout

Using Intelligent Search Techniques to Play the Game Khet

2009

Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning

2016

Deep learning for AI

2021 Standout Nobel

Parameter Tuning by Simple Regret Algorithms and Multiple Simultaneous Hypothesis Testing

2010

Computing “Elo Ratings” of Move Patterns in the Game of Go

2007

Current Frontiers in Computer Go

2010

Human-level performance in 3D multiplayer games with population-based reinforcement learning

2019 Standout Science Nobel

A Survey of Actor-Critic Reinforcement Learning: Standard and Natural Policy Gradients

2012

Balancing MCTS by Dynamically Adjusting the Komi Value

2011

Monte-Carlo Tree Search in Backgammon

2007

Fast Approximate Max-n Monte Carlo Tree Search for Ms Pac-Man

2011

Applications of Deep Reinforcement Learning in Communications and Networking: A Survey

2019 Standout

Learning to Optimize via Posterior Sampling

2014

A Survey of Monte Carlo Tree Search Methods

2012 Standout

Deep Reinforcement Learning: A Survey

2022 Standout

Efficient Processing of Deep Neural Networks: A Tutorial and Survey

2017 Standout

Cross-Entropy for Monte-Carlo Tree Search

2008

Playing the Right Atari

2007

Multi-armed bandits with episode context

2011

On hyperparameter optimization of machine learning algorithms: Theory and practice

2020 Standout

A Comprehensive Survey of Multiagent Reinforcement Learning

2008 Standout

A survey of deep neural network architectures and their applications

2016 Standout

Places: A 10 Million Image Database for Scene Recognition

2017 Standout

T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction

2019 Standout

Deep Reinforcement Learning: A Brief Survey

2017 Standout

Fuego—An Open-Source Framework for Board Games and Go Engine Based on Monte Carlo Tree Search

2010

CadiaPlayer: Search-Control Techniques

2011

Sample-Based Planning for Continuous Action Markov Decision Processes

2011

Selective search in games of different complexity

2011

A Shogi Program Based on Monte-Carlo Tree Search

2010

Infrared Patch-Image Model for Small Target Detection in a Single Image

2013 Standout

Deep learning with convolutional neural networks for EEG decoding and visualization

2017 Standout

Top-down design of protein architectures with reinforcement learning

2023 Standout Science Nobel

Deep Learning in Medical Image Analysis

2017 Standout

Active Opening Book Application for Monte-Carlo Tree Search in 19×19 Go

2011

Utilisation de la recherche arborescente Monte-Carlo au Hex

2009

Monte Carlo Tree Search in Hex

2010

Approximate policy iteration: a survey and some new methods

2011

The Last-Good-Reply Policy for Monte-Carlo Go

2009

Gaussian Process Optimization with Mutual Information

2013

Works of Rémi Munos being referenced

X-Armed Bandits

2011

Noisy Networks For Exploration

2018 Standout Nobel

Online Optimization in X-Armed Bandits

2008

Finite-Sample Analysis of LSTD

2010

Error bounds for approximate policy iteration

2003

Error bounds for approximate value iteration

2005

Combining policy gradient and Q-learning

2017

Adaptive play in Texas Hold'em Poker

2008

Optimistic Optimization of a Deterministic Function without the Knowledge of its Smoothness

2011

Adaptive Bandits: Towards the best history-dependent strategy

2011

Algorithms for Infinitely Many-Armed Bandits

2008

Count-based exploration with neural density models

2017

Finite Time Analysis of Stratified Sampling for Monte Carlo

2011

A Study of Reinforcement Learning in the Continuous Case by the Means of Viscosity Solutions

2000

From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning

2014

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

2007

Distributional Reinforcement Learning With Quantile Regression

2018

A Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences

2011

Pure exploration in finitely-armed and continuous-armed bandits

2010

Variable Resolution Discretization in Optimal Control

2002

Modification of UCT with Patterns in Monte-Carlo Go

2006

Performance Bounds in $L_p$‐norm for Approximate Value Iteration

2007

Exploration–exploitation tradeoff using variance estimates in multi-armed bandits

2009

Distributional Reinforcement Learning with Quantile Regression

2017

Use of variance estimation in the multi-armed bandit problem

2006

Finite time bounds for sampling based fitted value iteration

2005