AnHai Doan

12.1k total citations · 2 hit papers
122 papers, 6.5k citations indexed

About

AnHai Doan is a scholar working on Artificial Intelligence, Management Science and Operations Research and Information Systems. According to data from OpenAlex, AnHai Doan has authored 122 papers receiving a total of 6.5k indexed citations (citations by other indexed papers that have themselves been cited), including 76 papers in Artificial Intelligence, 69 papers in Management Science and Operations Research and 54 papers in Information Systems. Recurrent topics in AnHai Doan's work include Data Quality and Management (68 papers), Semantic Web and Ontologies (50 papers) and Advanced Database Systems and Queries (49 papers). AnHai Doan is often cited by papers focused on Data Quality and Management (68 papers), Semantic Web and Ontologies (50 papers) and Advanced Database Systems and Queries (49 papers). AnHai Doan collaborates with scholars based in United States, Spain and United Kingdom. AnHai Doan's co-authors include Alon Halevy, Pedro Domingos, Raghu Ramakrishnan, Jayant Madhavan, Jeffrey F. Naughton, Warren Shen, Robin Dhamankar, Wensheng Wu, Robert McCann and Jude Shavlik and has published in prestigious journals such as Bioinformatics, Communications of the ACM and Machine Learning.

In The Last Decade

AnHai Doan

120 papers receiving 5.8k citations

Hit Papers

Crowdsourcing systems on the World-Wide Web 2011 2026 2016 2021 2011 2018 250 500 750

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
AnHai Doan United States 40 4.3k 2.9k 2.4k 2.2k 881 122 6.5k
Tom Heath United Kingdom 18 4.2k 1.0× 2.6k 0.9× 1.2k 0.5× 1.3k 0.6× 450 0.5× 44 5.4k
Sören Auer Germany 28 5.2k 1.2× 2.1k 0.7× 1.7k 0.7× 824 0.4× 375 0.4× 238 6.8k
Divesh Srivastava United States 58 7.5k 1.7× 3.2k 1.1× 2.8k 1.1× 6.2k 2.8× 4.7k 5.3× 352 12.4k
Christian Bizer Germany 37 9.7k 2.2× 4.9k 1.7× 3.2k 1.3× 2.4k 1.1× 950 1.1× 121 11.9k
Erhard Rahm Germany 35 5.9k 1.4× 4.3k 1.5× 3.1k 1.3× 3.4k 1.5× 968 1.1× 204 8.6k
Raghu Ramakrishnan United States 51 4.9k 1.1× 4.7k 1.6× 901 0.4× 7.1k 3.2× 2.6k 2.9× 190 11.7k
Laks V. S. Lakshmanan Canada 51 3.9k 0.9× 3.3k 1.2× 983 0.4× 3.4k 1.6× 2.6k 3.0× 209 9.1k
Natalya F. Noy United States 30 5.7k 1.3× 3.4k 1.2× 859 0.4× 1.3k 0.6× 399 0.5× 97 7.5k
Ora Lassila United States 14 4.5k 1.0× 3.9k 1.3× 543 0.2× 1.8k 0.8× 441 0.5× 34 6.6k
Gerhard Weikum Germany 57 7.9k 1.8× 4.0k 1.4× 1.3k 0.5× 4.9k 2.3× 2.2k 2.5× 472 12.8k

Countries citing papers authored by AnHai Doan

Since Specialization
Citations

This map shows the geographic impact of AnHai Doan's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by AnHai Doan with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites AnHai Doan more than expected).

Fields of papers citing papers by AnHai Doan

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by AnHai Doan. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by AnHai Doan. The network helps show where AnHai Doan may publish in the future.

Co-authorship network of co-authors of AnHai Doan

This figure shows the co-authorship network connecting the top 25 collaborators of AnHai Doan. A scholar is included among the top collaborators of AnHai Doan based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with AnHai Doan. AnHai Doan is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Li, Yuliang, et al.. (2023). Effective entity matching with transformers. The VLDB Journal. 32(6). 1215–1235. 9 indexed citations
2.
Chen, Chen, Behzad Golshan, Alon Halevy, Wang-Chiew Tan, & AnHai Doan. (2018). BigGorilla: An Open-Source Ecosystem for Data Preparation and Integration.. IEEE Data(base) Engineering Bulletin. 41. 10–22. 30 indexed citations
3.
Doan, AnHai, Pradap Konda, Adel Ardalan, et al.. (2018). Toward a System Building Agenda for Data Integration (and Data Science).. IEEE Data(base) Engineering Bulletin. 41. 35–46. 3 indexed citations
4.
Bernstein, Matthew N., AnHai Doan, & Colin N. Dewey. (2017). MetaSRA: normalized human sample-specific metadata for the Sequence Read Archive. Bioinformatics. 33(18). 2914–2923. 47 indexed citations
5.
Doan, AnHai. (2017). What is Our Agenda for Data Science. Conference on Innovative Data Systems Research. 2 indexed citations
6.
Chai, Xiaoyong, Nikesh Garera, Lu Liu, et al.. (2013). Social Media Analytics: The Kosmix Story.. IEEE Data(base) Engineering Bulletin. 36. 4–12. 10 indexed citations
7.
Doan, AnHai, Jeffrey F. Naughton, Xiaoyong Chai, et al.. (2009). The Case for a Structured Approach to Managing Unstructured Data.. Conference on Innovative Data Systems Research. 10 indexed citations
8.
Shen, Warren, AnHai Doan, Jeffrey F. Naughton, & Raghu Ramakrishnan. (2007). Declarative information extraction using datalog with embedded extraction predicates. Very Large Data Bases. 1033–1044. 117 indexed citations
9.
DeRose, Pedro, Warren Shen, Fei Chen, AnHai Doan, & Raghu Ramakrishnan. (2007). Building structured web community portals: a top-down, compositional, and incremental approach. Very Large Data Bases. 399–410. 52 indexed citations
10.
Chu, Eric, et al.. (2007). A relational approach to incrementally extracting and querying structure in unstructured data. Very Large Data Bases. 1045–1056. 39 indexed citations
11.
Burdick, Doug, AnHai Doan, Raghu Ramakrishnan, & Shivakumar Vaithyanathan. (2007). OLAP over imprecise data with domain constraints. Minds at UW (University of Wisconsin). 39–50. 23 indexed citations
12.
Doan, AnHai, Raghu Ramakrishnan, Fei Chen, et al.. (2006). Community Information Management.. IEEE Data(base) Engineering Bulletin. 29. 64–72. 63 indexed citations
13.
McCann, Robert, et al.. (2005). Mapping maintenance for data integration systems. Very Large Data Bases. 1018–1029. 37 indexed citations
14.
Sayyadian, Mayssam, et al.. (2005). Tuning schema matching software using synthetic scenarios. Very Large Data Bases. 994–1005. 20 indexed citations
15.
Doan, AnHai, Robert McCann, & Warren Shen. (2005). Collaborative Development of Information Integration Systems.. National Conference on Artificial Intelligence. 34–41. 2 indexed citations
16.
Doan, AnHai & Alon Halevy. (2005). Semantic-integration research in the database community. AI Magazine. 26(1). 83–94. 127 indexed citations
17.
Doan, AnHai, Alon Halevy, & Natalya F. Noy. (2004). Semantic integration workshop at the second international semantic web conference (ISWC-2003). AI Magazine. 25(1). 109–111. 2 indexed citations
18.
Doan, AnHai, Alon Halevy, & Natalya F. Noy. (2004). Semantic Integration Workshop at the 2nd International Semantic Web Conference (ISWC-2003).. International Conference on Management of Data. 33. 138–140. 4 indexed citations
19.
Etzioni, Oren, Alon Halevy, AnHai Doan, et al.. (2003). Crossing the Structure Chasm. ScholarlyCommons (University of Pennsylvania). 57 indexed citations
20.
Doan, AnHai & Robert McCann. (2003). Building data integration systems: a mass collaboration approach. 183–188. 21 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026