Ian Osband

4.8k total citations · 1 hit paper
20 papers, 1.4k citations indexed

About

Ian Osband is a scholar working on Artificial Intelligence, Management Science and Operations Research and Economics and Econometrics. According to data from OpenAlex, Ian Osband has authored 20 papers receiving a total of 1.4k indexed citations (citations by other indexed papers that have themselves been cited), including 18 papers in Artificial Intelligence, 9 papers in Management Science and Operations Research and 2 papers in Economics and Econometrics. Recurrent topics in Ian Osband's work include Reinforcement Learning in Robotics (13 papers), Advanced Bandit Algorithms Research (9 papers) and Machine Learning and Algorithms (4 papers). Ian Osband is often cited by papers focused on Reinforcement Learning in Robotics (13 papers), Advanced Bandit Algorithms Research (9 papers) and Machine Learning and Algorithms (4 papers). Ian Osband collaborates with scholars based in United States, United Kingdom and France. Ian Osband's co-authors include Benjamin Van Roy, Zheng Wen, Daniel Russo, Abbas Kazerouni, Olivier Pietquin, Bilal Piot, Tom Schaul, Todd Hester, Marc Lanctot and Joel Z. Leibo and has published in prestigious journals such as Journal of Machine Learning Research, now publishers, Inc. eBooks and arXiv (Cornell University).

In The Last Decade

Ian Osband

19 papers receiving 1.4k citations

Hit Papers

Deep Q-learning From Demonstrations 2018 2026 2020 2023 2018 100 200 300 400

Peers

Ian Osband
Comparison fields: 5 of 108
  • Artificial Intelligence 875
  • Management Science and Operations Research 360
  • Control and Systems Engineering 266
  • Computer Networks and Communications 253
  • Electrical and Electronic Engineering 234
Replace Mohammad Gheshlaghi Azar with:
Mohammad Gheshlaghi Azar United Kingdom
Frans A. Oliehoek Netherlands
Dan Horgan United Kingdom
Bilal Piot United Kingdom
Gregory Farquhar United Kingdom
Nantas Nardelli United Kingdom
Olivier Pietquin France
Daniel Whitehouse United Kingdom
Philipp Rohlfshagen United Kingdom
Robert Givan United States
Mohammad Gheshlaghi Azar United Kingdom View profile →
Citations per field, relative to Ian Osband
Ian Osband · 1×
Citations per year, relative to Ian Osband
Ian Osband · 1×

Countries citing papers authored by Ian Osband

Since Specialization
Citations

This map shows the geographic impact of Ian Osband's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Ian Osband with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Ian Osband more than expected).

Fields of papers citing papers by Ian Osband

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Ian Osband. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Ian Osband. The network helps show where Ian Osband may publish in the future.

Co-authorship network of co-authors of Ian Osband

This figure shows the co-authorship network connecting the top 25 collaborators of Ian Osband. A scholar is included among the top collaborators of Ian Osband based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Ian Osband. Ian Osband is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
# Work Indexed citations
1 7
2 2
3
Matrix games with bandit feedback
1
4 1
5 4
6
Deep Exploration via Randomized Value Functions
47
7
Randomized prior functions for deep reinforcement learning
27
8
Noisy Networks For Exploration
115
9
The Uncertainty Bellman Equation and Exploration.
13
10
Scalable Coordinated Exploration in Concurrent Reinforcement Learning
3
11 260
12 181
13
Deep Q-learning From Demonstrations breakdown →
485
14
Learning from Demonstrations for Real World Reinforcement Learning
43
15 123
16
Model-based Reinforcement Learning and the Eluder Dimension
7
17 20
18
Near-optimal Regret Bounds for Reinforcement Learning in Factored MDPs.
1
19 77
20
Deep Learning for Time Series Modeling CS 229 Final Project Report
14

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026