Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies
if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the
same subfield and year (this is the minimum needed to enter the top 1%, not the average
within it), or reaches the top citation threshold in at least one of its specific research
topics.
Crowdsourcing systems on the World-Wide Web
2011870 citationsAnHai Doan, Raghu Ramakrishnan et al.profile →
Deep Learning for Entity Matching
2018282 citationsSidharth Mudgal, AnHai Doan et al.profile →
Peers — A (Enhanced Table)
Peers by citation overlap · career bar shows stage (early→late)
cites ·
hero ref
This map shows the geographic impact of AnHai Doan's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by AnHai Doan with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites AnHai Doan more than expected).
This network shows the impact of papers produced by AnHai Doan. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by AnHai Doan. The network helps show where AnHai Doan may publish in the future.
Co-authorship network of co-authors of AnHai Doan
This figure shows the co-authorship network connecting the top 25 collaborators of AnHai Doan.
A scholar is included among the top collaborators of AnHai Doan based on the total number of
citations received by their joint publications. Widths of edges
represent the number of papers authors have co-authored together.
Node borders
signify the number of papers an author published with AnHai Doan. AnHai Doan is excluded from
the visualization to improve readability, since they are connected to all nodes in the network.
Chen, Chen, Behzad Golshan, Alon Halevy, Wang-Chiew Tan, & AnHai Doan. (2018). BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration.. IEEE Data(base) Engineering Bulletin. 41. 10–22.30 indexed citations
3.
Doan, AnHai, Pradap Konda, Adel Ardalan, et al.. (2018). Toward a System Building Agenda for Data Integration (and Data Science).. IEEE Data(base) Engineering Bulletin. 41. 35–46.3 indexed citations
Doan, AnHai. (2017). What is Our Agenda for Data Science. Conference on Innovative Data Systems Research.2 indexed citations
6.
Chai, Xiaoyong, Nikesh Garera, Lu Liu, et al.. (2013). Social Media Analytics: The Kosmix Story.. IEEE Data(base) Engineering Bulletin. 36. 4–12.10 indexed citations
7.
Doan, AnHai, Jeffrey F. Naughton, Xiaoyong Chai, et al.. (2009). The Case for a Structured Approach to Managing Unstructured Data.. Conference on Innovative Data Systems Research.10 indexed citations
8.
Shen, Warren, AnHai Doan, Jeffrey F. Naughton, & Raghu Ramakrishnan. (2007). Declarative information extraction using datalog with embedded extraction predicates. Very Large Data Bases. 1033–1044.117 indexed citations
9.
DeRose, Pedro, Warren Shen, Fei Chen, AnHai Doan, & Raghu Ramakrishnan. (2007). Building structured web community portals: a top-down, compositional, and incremental approach. Very Large Data Bases. 399–410.52 indexed citations
10.
Chu, Eric, et al.. (2007). A relational approach to incrementally extracting and querying structure in unstructured data. Very Large Data Bases. 1045–1056.39 indexed citations
11.
Burdick, Doug, AnHai Doan, Raghu Ramakrishnan, & Shivakumar Vaithyanathan. (2007). OLAP over imprecise data with domain constraints. Minds at UW (University of Wisconsin). 39–50.23 indexed citations
12.
Doan, AnHai, Raghu Ramakrishnan, Fei Chen, et al.. (2006). Community Information Management.. IEEE Data(base) Engineering Bulletin. 29. 64–72.63 indexed citations
13.
McCann, Robert, et al.. (2005). Mapping maintenance for data integration systems. Very Large Data Bases. 1018–1029.37 indexed citations
14.
Sayyadian, Mayssam, et al.. (2005). Tuning schema matching software using synthetic scenarios. Very Large Data Bases. 994–1005.20 indexed citations
15.
Doan, AnHai, Robert McCann, & Warren Shen. (2005). Collaborative Development of Information Integration Systems.. National Conference on Artificial Intelligence. 34–41.2 indexed citations
Doan, AnHai, Alon Halevy, & Natalya F. Noy. (2004). Semantic Integration Workshop at the 2nd International Semantic Web Conference (ISWC-2003).. International Conference on Management of Data. 33. 138–140.4 indexed citations
19.
Etzioni, Oren, Alon Halevy, AnHai Doan, et al.. (2003). Crossing the Structure Chasm. ScholarlyCommons (University of Pennsylvania).57 indexed citations
20.
Doan, AnHai & Robert McCann. (2003). Building data integration systems: a mass collaboration approach. 183–188.21 indexed citations
Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive
bibliographic database. While OpenAlex provides broad and valuable coverage of the global
research landscape, it—like all bibliographic datasets—has inherent limitations. These include
incomplete records, variations in author disambiguation, differences in journal indexing, and
delays in data updates. As a result, some metrics and network relationships displayed in
Rankless may not fully capture the entirety of a scholar's output or impact.