Shayne Longpre

2.1k total citations
15 papers, 177 citations indexed

About

Shayne Longpre is a scholar working on Artificial Intelligence, Computer Vision and Pattern Recognition and Safety Research. According to data from OpenAlex, Shayne Longpre has authored 15 papers receiving a total of 177 indexed citations (citations by other indexed papers that have themselves been cited), including 8 papers in Artificial Intelligence, 3 papers in Computer Vision and Pattern Recognition and 3 papers in Safety Research. Recurrent topics in Shayne Longpre's work include Topic Modeling (5 papers), Natural Language Processing Techniques (5 papers) and Multimodal Machine Learning Applications (3 papers). Shayne Longpre is often cited by papers focused on Topic Modeling (5 papers), Natural Language Processing Techniques (5 papers) and Multimodal Machine Learning Applications (3 papers). Shayne Longpre collaborates with scholars based in United States, Israel and United Kingdom. Shayne Longpre's co-authors include Yi Lu, Joachim Daiber, Anthony Chen, Sameer Singh, Niklas Muennighoff, Rishi Bommasani, Xiao Ling, Arvind Narayanan, Percy Liang and Sara Hooker and has published in prestigious journals such as Science, Nature Machine Intelligence and Transactions of the Association for Computational Linguistics.

In The Last Decade

Shayne Longpre

13 papers receiving 164 citations

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Shayne Longpre United States 9 125 30 28 19 11 15 177
John Aslanides United Kingdom 4 142 1.1× 18 0.6× 22 0.8× 27 1.4× 9 0.8× 5 182
Saffron Huang United Kingdom 2 103 0.8× 18 0.6× 15 0.5× 29 1.5× 9 0.8× 2 145
Amelia Glaese United States 4 100 0.8× 18 0.6× 15 0.5× 22 1.2× 5 0.5× 6 145
Joe Barrow United States 5 134 1.1× 26 0.9× 13 0.5× 17 0.9× 20 1.8× 9 210
Juliano Rabelo Canada 9 118 0.9× 26 0.9× 35 1.3× 5 0.3× 5 0.5× 20 186
Dmitry Ustalov Russia 7 111 0.9× 19 0.6× 40 1.4× 7 0.4× 10 0.9× 32 174
Josh Gardner United States 6 81 0.6× 15 0.5× 9 0.3× 24 1.3× 7 0.6× 13 148
Kawin Ethayarajh United States 8 202 1.6× 30 1.0× 26 0.9× 11 0.6× 19 1.7× 12 247
Md Mehrab Tanjim United States 5 135 1.1× 79 2.6× 22 0.8× 18 0.9× 20 1.8× 8 232
Vasileios Iosifidis Germany 5 134 1.1× 16 0.5× 11 0.4× 62 3.3× 11 1.0× 7 176

Countries citing papers authored by Shayne Longpre

Since Specialization
Citations

This map shows the geographic impact of Shayne Longpre's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Shayne Longpre with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Shayne Longpre more than expected).

Fields of papers citing papers by Shayne Longpre

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Shayne Longpre. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Shayne Longpre. The network helps show where Shayne Longpre may publish in the future.

Co-authorship network of co-authors of Shayne Longpre

This figure shows the co-authorship network connecting the top 25 collaborators of Shayne Longpre. A scholar is included among the top collaborators of Shayne Longpre based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Shayne Longpre. Shayne Longpre is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

15 of 15 papers shown
1.
McGregor, Sean, Allyson Ettinger, Liwei Jiang, et al.. (2025). To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices. Proceedings of the AAAI Conference on Artificial Intelligence. 39(28). 28938–28945.
2.
Longpre, Shayne, Anthony Chen, Damien Sileo, et al.. (2024). A large-scale audit of dataset licensing and attribution in AI. Nature Machine Intelligence. 6(8). 975–987. 18 indexed citations
3.
Bommasani, Rishi, Shayne Longpre, Sayash Kapoor, et al.. (2024). Foundation Model Transparency Reports. Proceedings of the AAAI/ACM Conference on AI Ethics and Society. 7. 181–195. 9 indexed citations
4.
Üstün, Ahmet, Zheng Yong, Wei-Yin Ko, et al.. (2024). Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model. 15894–15939. 17 indexed citations
6.
Longpre, Shayne, Emily Reif, Katherine Lee, et al.. (2024). A Pretrainer’s Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity. 3245–3276. 11 indexed citations
7.
Longpre, Shayne, et al.. (2024). Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models. 4334–4353. 6 indexed citations
8.
Bommasani, Rishi, Sayash Kapoor, Shayne Longpre, et al.. (2024). Considerations for governing open foundation models. Science. 386(6718). 151–153. 10 indexed citations
9.
Li, Hanlin, et al.. (2024). A Systematic Review of NeurIPS Dataset Management Practices. 32813–32827.
10.
Asai, Akari, Shayne Longpre, Jungo Kasai, et al.. (2022). MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages. 6 indexed citations
11.
12.
13.
Longpre, Shayne, Yi Lu, & Joachim Daiber. (2021). MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering. Transactions of the Association for Computational Linguistics. 9. 1389–1406. 51 indexed citations
14.
Longpre, Shayne, et al.. (2020). How Big Data Confers Market Power to Big Tech: Leveraging the Perspective of Data Science. The Antitrust Bulletin. 65(3). 459–485. 11 indexed citations
15.
Longpre, Shayne, et al.. (2019). An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering. 220–227. 14 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026