Jimmy Lin

22.3k citations

431 papers · 11.1k indexed · 1 hit paper · h-index 58

Artificial Intelligence top 0.05%
Information Systems top 0.05%
Computer Vision and Pattern Recognition top 0.5%
Computer Networks and Communications top 0.5%
Molecular Biology top 10%

Co-authors: Dina Demner‐Fushman, Chris Dyer, Boris Katz, Rodrigo Nogueira, Hua He, Raphael Tang, Michael C. Schatz, Aneesh Sharma
Topics: Topic Modeling (212 papers), Natural Language Processing Techniques (139 papers), Information Retrieval and Search Behavior (53 papers)
Cited by: Artificial Intelligence, Information Systems, Computer Vision and Pattern Recognition
Journals: Genome Biology, BMC Bioinformatics, Hydrological Processes
Partner nations: United States, Canada, United Kingdom

In The Last Decade

Jimmy Lin

405 papers receiving 10.2k citations

Hit Papers

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.
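The three criteria above combine with a logical "or", which can be sketched as a small predicate. This is only an illustration of the rule as worded here, not the site's actual implementation; the `Paper` class and its field names are hypothetical, and the threshold values would have to come from the underlying citation data.

```python
from dataclasses import dataclass, field


@dataclass
class Paper:
    # All field names are hypothetical, chosen for this sketch.
    citations: int
    # Minimum citations needed to enter the top 1% of the
    # paper's subfield-and-year cohort.
    top1_threshold: float
    # Top citation threshold for each of the paper's research topics.
    topic_thresholds: dict[str, float] = field(default_factory=dict)


def is_hit_paper(paper: Paper) -> bool:
    """Return True if any of the three stated criteria is met."""
    if paper.citations >= 500:
        return True
    if paper.citations >= 1.5 * paper.top1_threshold:
        return True
    # Reaching the threshold in at least one topic is sufficient.
    return any(paper.citations >= t for t in paper.topic_thresholds.values())
```

Under this reading, a paper with fewer than 500 citations can still qualify through the cohort or topic criterion, for example `is_hit_paper(Paper(citations=280, top1_threshold=150))`.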

Data-intensive text processing with MapReduce

2009 · 280 citations · Jimmy Lin et al.

Peers

Steffen Staab Germany

Anupam Joshi United States

Berthier Ribeiro‐Neto Brazil

Oren Etzioni United States

James Allan United States

Xueqi Cheng China

Bamshad Mobasher United States

Ji-Rong Wen China

ChengXiang Zhai United States

Francesco Ricci⋆ Italy

Jimmy Lin relative to Steffen Staab (Germany)

Citations per field (Steffen Staab = 1×)

AI ×1.0 7k/7k
IS ×0.9 4k/5k
CVPR ×2.5 2k/851
CNC ×0.9 2k/2k
MB ×0.8 1k/1k

Citations per year (chart covering 2016 to 2026)

Countries citing papers authored by Jimmy Lin

This map shows the geographic impact of Jimmy Lin's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Jimmy Lin with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Jimmy Lin more than expected).
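The comparison with an expected citation count can be read as an observed-to-expected ratio. The sketch below assumes, purely for illustration, that the expected count is the scholar's total citations multiplied by the country's share of global research output; the page does not state how the expectation is actually computed, and the function and parameter names are hypothetical.

```python
def citation_ratio(observed: int, total_citations: int, country_share: float) -> float:
    """Observed citations from a country divided by the expected number.

    Assumption for this sketch: expected = total_citations * country_share,
    where country_share is the country's fraction of global research output.
    """
    expected = total_citations * country_share
    if expected <= 0:
        raise ValueError("expected citation count must be positive")
    return observed / expected
```

A value above one, such as `citation_ratio(300, 1000, 0.2)`, would mean the country cites the scholar more than its research output alone would predict.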

Fields of papers citing papers by Jimmy Lin

Physical Sciences · Health Sciences · Life Sciences · Social Sciences

This network shows the impact of papers produced by Jimmy Lin. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Jimmy Lin. The network helps show where Jimmy Lin may publish in the future.

Co-authorship network of co-authors of Jimmy Lin

This figure shows the co-authorship network connecting the top 25 collaborators of Jimmy Lin. A scholar is included among the top collaborators of Jimmy Lin based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Jimmy Lin. Jimmy Lin is excluded from the visualization to improve readability, since they are connected to all nodes in the network.
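The construction described above can be sketched as follows. This is a minimal illustration, not the site's code: the input format (a list of `(authors, citations)` pairs) and the function name are assumptions, and the sketch counts every paper in the input when weighting edges, which the description does not specify.

```python
from collections import Counter
from itertools import combinations


def coauthor_network(papers, focal, top_n=25):
    """Build node and edge weights for the top collaborators of `focal`.

    `papers` is a list of (authors, citations) pairs (hypothetical format).
    Returns (nodes, edges): nodes maps each top collaborator to the number
    of papers shared with `focal`; edges maps author pairs to the number
    of papers they co-authored. The focal author is left out of both.
    """
    joint_citations = Counter()
    joint_papers = Counter()
    for authors, citations in papers:
        if focal in authors:
            for author in set(authors) - {focal}:
                joint_citations[author] += citations
                joint_papers[author] += 1

    # Rank collaborators by total citations of their joint publications.
    top = {author for author, _ in joint_citations.most_common(top_n)}

    edges = Counter()
    for authors, _ in papers:
        members = sorted(set(authors) & top)
        for pair in combinations(members, 2):
            edges[pair] += 1

    nodes = {author: joint_papers[author] for author in top}
    return nodes, dict(edges)
```

In a rendering of this data, the edge counts would drive edge width and the node counts would drive node border thickness.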

All Works

20 of 20 papers shown

#	Work	Indexed citations
1	VISA: Retrieval Augmented Generation with Visual Source Attribution · 2025 · Xueguang Ma, Shengyao Zhuang, Bevan Koopman, Guido Zuccon, Wenhu Chen, Jimmy Lin	0
2	AfroBench: How Good are Large Language Models on African Languages? · 2025 · (unknown), (unknown), (unknown), Kelechi Ogueji, Jimmy Lin, Pontus Stenetorp, David Ifeoluwa Adelani	0
3	The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models · 2025 · Ronak Pradeep, Nandan Thakur, (unknown), Daniel Campos, Nick Craswell, Ian Soboroff, Hoa Trang Dang, Jimmy Lin	0
4	Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models · 2024 · Raphael Tang, (unknown), Xueguang Ma, Jimmy Lin, Ferhan Türe	2
5	ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA Datasets with Large Language Models · 2024 · Ronak Pradeep, Daniel Lee, Ali Mousavi, Jeffrey Pound, (unknown), Jimmy Lin, Ihab F. Ilyas, Saloni Potdar, (unknown), Yunyao Li	3
6	EWEK-QA : Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems · 2024 · (unknown), (unknown), (unknown), (unknown), (unknown), (unknown), Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, (unknown), (unknown), Mehdi Rezagholizadeh	1
7	Unifying Multimodal Retrieval via Document Screenshot Embedding · 2024 · Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin	4
8	Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval · 2024 · Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer	0
9	Can Query Expansion Improve Generalization of Strong Cross-Encoder Rankers? · 2024 · (unknown), Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky	0
10	Towards Robust QA Evaluation via Open LLMs · 2024 · Ehsan Kamalloo, (unknown), Jimmy Lin	4
11	How Does Generative Retrieval Scale to Millions of Passages? · 2023 · Ronak Pradeep, (unknown), Jai Prakash Gupta, (unknown), Honglei Zhuang, Jimmy Lin, Donald Metzler, (unknown)	10
12	GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration · 2023 · IRIS Research product catalog (Sapienza University of Rome) · Aleksandra Piktus, (unknown), Christopher Akiki, (unknown), (unknown), (unknown), Stella Biderman, Martin Potthast, Jimmy Lin	0
13	Evaluating Embedding APIs for Information Retrieval · 2023 · Ehsan Kamalloo, (unknown), (unknown), Nandan Thakur, (unknown), Mehdi Rezagholizadeh, Jimmy Lin	7
14	One Blade for One Purpose: Advancing Math Information Retrieval using Hybrid Search · 2023 · Wei Zhong, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin	2
15	AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages · 2022 · (unknown), (unknown), (unknown), Kevin Duh, Jimmy Lin	7
16	Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval · 2022 · (unknown), Jheng-Hong Yang, Yuqing Xie, Jimmy Lin	9
17	The proper care and feeding of CAMELS: How limited training data affects streamflow prediction · 2020 · Environmental Modelling & Software · Martin Gauch, Juliane Mai, Jimmy Lin	109
18	H2oloo at TREC 2020: When all you got is a hammer... Deep Learning, Health Misinformation, and Precision Medicine. · 2020 · Text REtrieval Conference · Ronak Pradeep, Xueguang Ma, (unknown), Hang Cui, (unknown), Rodrigo Nogueira, Jimmy Lin	4
19	Aligning Cross-Lingual Entities with Multi-Aspect Information · 2019 · (unknown), Yanyan Zou, Peng Shi, Wei Lu, Jimmy Lin, Xu Sun	98
20	Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search · 2009 · Text REtrieval Conference · Jimmy Lin, Donald Metzler, Tamer Elsayed, Lidan Wang	33

About Jimmy Lin

Jimmy Lin is a scholar working on Artificial Intelligence, Information Systems, and Computer Vision and Pattern Recognition, having authored 431 papers that have together received 11.1k indexed citations. Recurring topics across this work include Topic Modeling (212 papers), Natural Language Processing Techniques (139 papers) and Information Retrieval and Search Behavior (53 papers). The work is most often cited by research in Artificial Intelligence (7.4k citations), Information Systems (4.1k citations) and Computer Vision and Pattern Recognition (2.1k citations). Jimmy Lin has collaborated with scholars based in the United States, Canada and the United Kingdom. Frequent co-authors include Dina Demner‐Fushman, Chris Dyer, Boris Katz, Rodrigo Nogueira, Hua He, Raphael Tang, Michael C. Schatz, Aneesh Sharma, W. John Wilbur and Pankaj Gupta. Their work appears in journals such as Genome Biology, BMC Bioinformatics and Hydrological Processes.

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it has inherent limitations, like all bibliographic datasets. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture a scholar's output or impact.