Satinder Singh

22.5k total citations · 5 hit papers
150 papers, 11.2k citations indexed

About

Satinder Singh is a scholar working on Artificial Intelligence, Management Science and Operations Research and Computational Theory and Mathematics. According to data from OpenAlex, Satinder Singh has authored 150 papers receiving a total of 11.2k indexed citations (citations by other indexed papers that have themselves been cited), including 98 papers in Artificial Intelligence, 28 papers in Management Science and Operations Research and 18 papers in Computational Theory and Mathematics. Recurrent topics in Satinder Singh's work include Reinforcement Learning in Robotics (75 papers), Machine Learning and Algorithms (20 papers) and Evolutionary Algorithms and Applications (16 papers). Satinder Singh is often cited by papers focused on Reinforcement Learning in Robotics (75 papers), Machine Learning and Algorithms (20 papers) and Evolutionary Algorithms and Applications (16 papers). Satinder Singh collaborates with scholars based in United States, India and United Kingdom. Satinder Singh's co-authors include Richard S. Sutton, Doina Precup, David McAllester, Yishay Mansour, Michael Kearns, Andrew G. Barto, Tommi Jaakkola, Michael I. Jordan, Steven J. Bradtke and Richard L. Lewis and has published in prestigious journals such as Nature, CHEST Journal and Neuropsychologia.

In The Last Decade

Satinder Singh

140 papers receiving 10.3k citations

Hit Papers

5 papers align trajectories

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.

Policy Gradient Methods for Reinforcement Learning with Function Approximation

1999 2.7k citations Richard S. Sutton, David McAllester et al. Neural Information Processing Systems profile →
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

1999 1.6k citations Richard S. Sutton, Doina Precup et al. profile →
Learning to act using real-time dynamic programming

1995 714 citations Andrew G. Barto, Satinder Singh et al. profile →
Near-Optimal Reinforcement Learning in Polynomial Time

2002 393 citations Michael Kearns, Satinder Singh profile →
Reward is enough

2021 219 citations David Silver, Satinder Singh et al. profile →

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name	h						Papers	Cites
Satinder Singh United States	40	7.5k	1.8k	1.5k	1.4k	1.4k	150	11.2k
Christopher J. Watkins United Kingdom	10	4.2k 0.6×	1.8k 1.0×	831 0.6×	989 0.7×	2.1k 1.5×	14	9.8k
Leslie Pack Kaelbling United States	44	8.6k 1.1×	3.1k 1.8×	1.3k 0.9×	1.4k 1.0×	2.5k 1.8×	221	14.9k
Manuela Veloso United States	51	6.7k 0.9×	3.4k 1.9×	808 0.5×	690 0.5×	2.0k 1.4×	459	13.5k
Hani Hagras United Kingdom	43	4.8k 0.6×	1.5k 0.9×	1.5k 1.0×	744 0.5×	712 0.5×	268	7.5k
Peter Stone United States	55	7.0k 0.9×	3.9k 2.2×	1.5k 1.0×	767 0.5×	1.4k 1.0×	443	13.3k
Mohammed Azmi Al‐Betar Jordan	52	4.4k 0.6×	1.1k 0.6×	632 0.4×	1.2k 0.9×	1.1k 0.8×	265	9.1k
Doina Precup Canada	34	4.0k 0.5×	1.0k 0.6×	671 0.5×	786 0.6×	640 0.5×	194	6.6k
Dongrui Wu China	55	5.0k 0.7×	1.6k 0.9×	1.8k 1.3×	943 0.7×	394 0.3×	256	10.4k
Joëlle Pineau Canada	39	4.6k 0.6×	1.2k 0.7×	437 0.3×	470 0.3×	928 0.7×	163	8.2k
Tom Erez United States	18	3.8k 0.5×	3.0k 1.7×	382 0.3×	782 0.6×	1.1k 0.8×	27	8.3k

Countries citing papers authored by Satinder Singh

Since Specialization

Citations

This map shows the geographic impact of Satinder Singh's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Satinder Singh with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Satinder Singh more than expected).

Fields of papers citing papers by Satinder Singh

Since Specialization

Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Satinder Singh. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Satinder Singh. The network helps show where Satinder Singh may publish in the future.

Co-authorship network of co-authors of Satinder Singh

This figure shows the co-authorship network connecting the top 25 collaborators of Satinder Singh. A scholar is included among the top collaborators of Satinder Singh based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Satinder Singh. Satinder Singh is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

Sort: Min cites: Since: Top N: Style:

20 of 20 papers shown

Singh, Satinder, et al.. (2020). To Study the Long Term Outcome of Endoscopic Septoplasty with Microdebrider Assisted Inferior Turbinoplasty (MAIT) Versus Medial Flap Turbinoplasty (MFT). Indian Journal of Otolaryngology and Head & Neck Surgery. 74(S2). 863–869. 1 indexed citations

Oh, Junhyuk, Matteo Hessel, Wojciech Marian Czarnecki, et al.. (2020). Discovering Reinforcement Learning Algorithms. Neural Information Processing Systems. 33. 1060–1070. 1 indexed citations

Zahavy, Tom, Zhongwen Xu, Vivek Veeriah, et al.. (2020). A Self-Tuning Actor-Critic Algorithm. Neural Information Processing Systems. 33. 20913–20924. 2 indexed citations

Guo, Xiaoxiao, Tim Klinger, Joseph P. Bigus, et al.. (2017). Learning to Query, Reason, and Answer Questions On Ambiguous Texts. International Conference on Learning Representations. 7 indexed citations

Jiang, Nan, Satinder Singh, & Ambuj Tewari. (2016). On structural properties of MDPs that bound loss due to shallow planning. International Joint Conference on Artificial Intelligence. 1640–1647. 2 indexed citations

Jiang, Nan, Alex Kulesza, Satinder Singh, & Richard L. Lewis. (2015). The Dependence of Effective Planning Horizon on Model Accuracy. International Joint Conference on Artificial Intelligence. 1181–1189.

Jiang, Nan, Satinder Singh, & Richard L. Lewis. (2014). Improving UCT planning via approximate homomorphisms. Adaptive Agents and Multi-Agents Systems. 1289–1296. 16 indexed citations

Singh, Satinder, et al.. (2012). Strong mitigation: nesting search for good policies within search for good reward. Adaptive Agents and Multi-Agents Systems. 407–414. 11 indexed citations

Sorg, Jonathan, Richard L. Lewis, & Satinder Singh. (2010). Reward Design via Online Gradient Ascent. Neural Information Processing Systems. 23. 2190–2198. 38 indexed citations

10.

Sorg, Jonathan, Satinder Singh, & Richard L. Lewis. (2010). Variance-based rewards for approximate Bayesian reinforcement learning. Uncertainty in Artificial Intelligence. 564–571. 15 indexed citations

11.

Precup, Doina, et al.. (2005). Off-policy Learning with Options and Recognizers. Neural Information Processing Systems. 18. 1097–1104. 5 indexed citations

12.

Isbell, Charles L., et al.. (2000). Cobot in LambdaMOO: A Social Statistics Agent. National Conference on Artificial Intelligence. 36–41. 41 indexed citations

13.

Kearns, Michael & Satinder Singh. (2000). Bias-Variance Error Bounds for Temporal Difference Updates. Conference on Learning Theory. 142–147. 27 indexed citations

14.

Precup, Doina, Richard S. Sutton, & Satinder Singh. (2000). Eligibility Traces for Off-Policy Policy Evaluation. Scholarworks (University of Massachusetts Amherst). 759–766. 172 indexed citations

15.

Sutton, Richard S., David McAllester, Satinder Singh, & Yishay Mansour. (1999). Policy Gradient Methods for Reinforcement Learning with Function Approximation. Neural Information Processing Systems. 12. 1057–1063. 2738 indexed citations breakdown →

16.

Kearns, Michael & Satinder Singh. (1998). Finite-Sample Convergence Rates for Q-Learning and Indirect Algorithms. Neural Information Processing Systems. 11. 996–1002. 107 indexed citations

17.

Singh, Satinder, Tommi Jaakkola, & Michael I. Jordan. (1994). Reinforcement Learning with Soft State Aggregation. Neural Information Processing Systems. 7. 361–368. 149 indexed citations

18.

Singh, Satinder. (1994). Reinforcement learning algorithms for average-payoff markovian decision processes. National Conference on Artificial Intelligence. 700–705. 52 indexed citations

19.

Singh, Satinder, Andrew G. Barto, Roderic A. Grupen, & Christopher I. Connolly. (1993). Robust Reinforcement Learning in Motion Planning. ScholarWorks@UMassAmherst (University of Massachusetts Amherst). 6. 655–662. 30 indexed citations

20.

Singh, Satinder. (1991). The Efficient Learning of Multiple Task Sequences. Neural Information Processing Systems. 4. 251–258. 16 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact