Standout Papers

Policy Gradient Methods for Reinforcement Learning with Function Approximation (1.9k; chart spanning 1995 to 2026)
  1. Policy Gradient Methods for Reinforcement Learning with Function Approximation (1999)
    Richard S. Sutton, David McAllester et al. Neural Information Processing Systems
  2. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning (1999)
    Richard S. Sutton, Doina Precup et al. Artificial Intelligence
  3. Learning to act using real-time dynamic programming (1995)
    Andrew G. Barto, Steven J. Bradtke et al. Artificial Intelligence
  4. Reward is enough (2021)
    David Silver, Satinder Singh et al. Artificial Intelligence

Immediate Impact

29 by Nobel laureates, 54 from Science/Nature, 89 standout

Citing Papers

Machine Learning Aided Design and Optimization of Thermal Metamaterials
2024 Standout
Dense reinforcement learning for safety validation of autonomous vehicles
2023 Standout Nature
8 intermediate papers

Works of Satinder Singh being referenced

Reward is enough
2021 Standout
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning
2014
and 12 more

Author Peers

Author Last Decade Papers Cites
Satinder Singh 4845 1048 989 1168 115 7.4k
Leslie Pack Kaelbling 4911 833 860 1471 137 8.6k
Michael L. Littman 6670 1412 1139 1360 157 11.7k
Mohammed Azmi Al‐Betar 3682 467 1048 876 212 7.4k
Peter Stone 3677 841 440 2034 290 7.1k
Essam H. Houssein 5647 314 2001 1237 235 10.3k
Doina Precup 2596 424 487 680 148 4.5k
Mauro Birattari 4154 659 1847 1095 158 9.6k
Manuela Veloso 3320 400 390 1587 265 6.3k
Dongrui Wu 3302 1262 631 1229 183 7.6k
Hani Hagras 2856 1027 538 950 119 4.6k


Rankless by CCL
2026