Satinder Singh

22.5k total citations · 5 hit papers
150 papers, 11.2k citations indexed

About

Satinder Singh is a scholar working on Artificial Intelligence, Management Science and Operations Research and Computational Theory and Mathematics. According to data from OpenAlex, Satinder Singh has authored 150 papers receiving a total of 11.2k indexed citations (citations by other indexed papers that have themselves been cited), including 98 papers in Artificial Intelligence, 28 papers in Management Science and Operations Research and 18 papers in Computational Theory and Mathematics. Recurrent topics in Satinder Singh's work include Reinforcement Learning in Robotics (75 papers), Machine Learning and Algorithms (20 papers) and Evolutionary Algorithms and Applications (16 papers). Satinder Singh is often cited by papers focused on Reinforcement Learning in Robotics (75 papers), Machine Learning and Algorithms (20 papers) and Evolutionary Algorithms and Applications (16 papers). Satinder Singh collaborates with scholars based in United States, India and United Kingdom. Satinder Singh's co-authors include Richard S. Sutton, Doina Precup, David McAllester, Yishay Mansour, Michael Kearns, Andrew G. Barto, Tommi Jaakkola, Michael I. Jordan, Steven J. Bradtke and Richard L. Lewis and has published in prestigious journals such as Nature, CHEST Journal and Neuropsychologia.

In The Last Decade

Satinder Singh

140 papers receiving 10.3k citations

Hit Papers

Policy Gradient Methods for Reinforcement Learning with F... 1995 2026 2005 2015 1999 1999 1995 2002 2021 500 1000 1.5k 2.0k 2.5k

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Satinder Singh United States 40 7.5k 1.8k 1.5k 1.4k 1.4k 150 11.2k
Christopher J. Watkins United Kingdom 10 4.2k 0.6× 1.8k 1.0× 831 0.6× 989 0.7× 2.1k 1.5× 14 9.8k
Leslie Pack Kaelbling United States 44 8.6k 1.1× 3.1k 1.8× 1.3k 0.9× 1.4k 1.0× 2.5k 1.8× 221 14.9k
Manuela Veloso United States 51 6.7k 0.9× 3.4k 1.9× 808 0.5× 690 0.5× 2.0k 1.4× 459 13.5k
Hani Hagras United Kingdom 43 4.8k 0.6× 1.5k 0.9× 1.5k 1.0× 744 0.5× 712 0.5× 268 7.5k
Peter Stone United States 55 7.0k 0.9× 3.9k 2.2× 1.5k 1.0× 767 0.5× 1.4k 1.0× 443 13.3k
Mohammed Azmi Al‐Betar Jordan 52 4.4k 0.6× 1.1k 0.6× 632 0.4× 1.2k 0.9× 1.1k 0.8× 265 9.1k
Doina Precup Canada 34 4.0k 0.5× 1.0k 0.6× 671 0.5× 786 0.6× 640 0.5× 194 6.6k
Dongrui Wu China 55 5.0k 0.7× 1.6k 0.9× 1.8k 1.3× 943 0.7× 394 0.3× 256 10.4k
Joëlle Pineau Canada 39 4.6k 0.6× 1.2k 0.7× 437 0.3× 470 0.3× 928 0.7× 163 8.2k
Tom Erez United States 18 3.8k 0.5× 3.0k 1.7× 382 0.3× 782 0.6× 1.1k 0.8× 27 8.3k

Countries citing papers authored by Satinder Singh

Since Specialization
Citations

This map shows the geographic impact of Satinder Singh's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Satinder Singh with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Satinder Singh more than expected).

Fields of papers citing papers by Satinder Singh

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Satinder Singh. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Satinder Singh. The network helps show where Satinder Singh may publish in the future.

Co-authorship network of co-authors of Satinder Singh

This figure shows the co-authorship network connecting the top 25 collaborators of Satinder Singh. A scholar is included among the top collaborators of Satinder Singh based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Satinder Singh. Satinder Singh is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Singh, Satinder, et al.. (2020). To Study the Long Term Outcome of Endoscopic Septoplasty with Microdebrider Assisted Inferior Turbinoplasty (MAIT) Versus Medial Flap Turbinoplasty (MFT). Indian Journal of Otolaryngology and Head & Neck Surgery. 74(S2). 863–869. 1 indexed citations
2.
Oh, Junhyuk, Matteo Hessel, Wojciech Marian Czarnecki, et al.. (2020). Discovering Reinforcement Learning Algorithms. Neural Information Processing Systems. 33. 1060–1070. 1 indexed citations
3.
Zahavy, Tom, Zhongwen Xu, Vivek Veeriah, et al.. (2020). A Self-Tuning Actor-Critic Algorithm. Neural Information Processing Systems. 33. 20913–20924. 2 indexed citations
4.
Guo, Xiaoxiao, Tim Klinger, Joseph P. Bigus, et al.. (2017). Learning to Query, Reason, and Answer Questions On Ambiguous Texts. International Conference on Learning Representations. 7 indexed citations
5.
Jiang, Nan, Satinder Singh, & Ambuj Tewari. (2016). On structural properties of MDPs that bound loss due to shallow planning. International Joint Conference on Artificial Intelligence. 1640–1647. 2 indexed citations
6.
Jiang, Nan, Alex Kulesza, Satinder Singh, & Richard L. Lewis. (2015). The Dependence of Effective Planning Horizon on Model Accuracy. International Joint Conference on Artificial Intelligence. 1181–1189.
7.
Jiang, Nan, Satinder Singh, & Richard L. Lewis. (2014). Improving UCT planning via approximate homomorphisms. Adaptive Agents and Multi-Agents Systems. 1289–1296. 16 indexed citations
8.
Singh, Satinder, et al.. (2012). Strong mitigation: nesting search for good policies within search for good reward. Adaptive Agents and Multi-Agents Systems. 407–414. 11 indexed citations
9.
Sorg, Jonathan, Richard L. Lewis, & Satinder Singh. (2010). Reward Design via Online Gradient Ascent. Neural Information Processing Systems. 23. 2190–2198. 38 indexed citations
10.
Sorg, Jonathan, Satinder Singh, & Richard L. Lewis. (2010). Variance-based rewards for approximate Bayesian reinforcement learning. Uncertainty in Artificial Intelligence. 564–571. 15 indexed citations
11.
Precup, Doina, et al.. (2005). Off-policy Learning with Options and Recognizers. Neural Information Processing Systems. 18. 1097–1104. 5 indexed citations
12.
Isbell, Charles L., et al.. (2000). Cobot in LambdaMOO: A Social Statistics Agent. National Conference on Artificial Intelligence. 36–41. 41 indexed citations
13.
Kearns, Michael & Satinder Singh. (2000). Bias-Variance Error Bounds for Temporal Difference Updates. Conference on Learning Theory. 142–147. 27 indexed citations
14.
Precup, Doina, Richard S. Sutton, & Satinder Singh. (2000). Eligibility Traces for Off-Policy Policy Evaluation. Scholarworks (University of Massachusetts Amherst). 759–766. 172 indexed citations
15.
Sutton, Richard S., David McAllester, Satinder Singh, & Yishay Mansour. (1999). Policy Gradient Methods for Reinforcement Learning with Function Approximation. Neural Information Processing Systems. 12. 1057–1063. 2738 indexed citations breakdown →
16.
Kearns, Michael & Satinder Singh. (1998). Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms. Neural Information Processing Systems. 11. 996–1002. 107 indexed citations
17.
Singh, Satinder, Tommi Jaakkola, & Michael I. Jordan. (1994). Reinforcement Learning with Soft State Aggregation. Neural Information Processing Systems. 7. 361–368. 149 indexed citations
18.
Singh, Satinder. (1994). Reinforcement learning algorithms for average-payoff markovian decision processes. National Conference on Artificial Intelligence. 700–705. 52 indexed citations
19.
Singh, Satinder, Andrew G. Barto, Roderic A. Grupen, & Christopher I. Connolly. (1993). Robust Reinforcement Learning in Motion Planning. ScholarWorks@UMassAmherst (University of Massachusetts Amherst). 6. 655–662. 30 indexed citations
20.
Singh, Satinder. (1991). The Efficient Learning of Multiple Task Sequences. Neural Information Processing Systems. 4. 251–258. 16 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026