Standout Papers
- Policy Gradient Methods for Reinforcement Learning with Function Approximation (1999)
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning (1999)
- Learning to act using real-time dynamic programming (1995)
- Reward is enough (2021)
Immediate Impact
29 by Nobel laureates 54 from Science/Nature 89 standout
Citing Papers
Machine Learning Aided Design and Optimization of Thermal Metamaterials
2024 Standout
Dense reinforcement learning for safety validation of autonomous vehicles
2023 StandoutNature
Works of Satinder Singh being referenced
Reward is enough
2021 Standout
Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning
2014
Author Peers
| Author | Last Decade | Papers | Cites | ||||
|---|---|---|---|---|---|---|---|
| Satinder Singh | 4845 | 1048 | 989 | 1168 | 115 | 7.4k | |
| Leslie Pack Kaelbling | 4911 | 833 | 860 | 1471 | 137 | 8.6k | |
| Michael L. Littman | 6670 | 1412 | 1139 | 1360 | 157 | 11.7k | |
| Mohammed Azmi Al‐Betar | 3682 | 467 | 1048 | 876 | 212 | 7.4k | |
| Peter Stone | 3677 | 841 | 440 | 2034 | 290 | 7.1k | |
| Essam H. Houssein | 5647 | 314 | 2001 | 1237 | 235 | 10.3k | |
| Doina Precup | 2596 | 424 | 487 | 680 | 148 | 4.5k | |
| Mauro Birattari | 4154 | 659 | 1847 | 1095 | 158 | 9.6k | |
| Manuela Veloso | 3320 | 400 | 390 | 1587 | 265 | 6.3k | |
| Dongrui Wu | 3302 | 1262 | 631 | 1229 | 183 | 7.6k | |
| Hani Hagras | 2856 | 1027 | 538 | 950 | 119 | 4.6k |
All Works
Login with ORCID to disown or claim papers
Loading papers...