Warren Shen

1.6k total citations
22 papers, 981 citations indexed

About

Warren Shen is a scholar working on Information Systems, Management Science and Operations Research and Computer Networks and Communications. According to data from OpenAlex, Warren Shen has authored 22 papers receiving a total of 981 indexed citations (citations by other indexed papers that have themselves been cited), including 15 papers in Information Systems, 14 papers in Management Science and Operations Research and 8 papers in Computer Networks and Communications. Recurrent topics in Warren Shen's work include Data Quality and Management (14 papers), Web Data Mining and Analysis (14 papers) and Advanced Database Systems and Queries (7 papers). Warren Shen is often cited by papers focused on Data Quality and Management (14 papers), Web Data Mining and Analysis (14 papers) and Advanced Database Systems and Queries (7 papers). Warren Shen collaborates with scholars based in the United States, Denmark and Spain. Warren Shen's co-authors include AnHai Doan, Raghu Ramakrishnan, Jayant Madhavan, Alon Halevy, Pedro DeRose, Robert McCann, Jeffrey F. Naughton, Fei Wu, Héctor González and Marius Paşca. Warren Shen has published in prestigious journals such as Journal of Thoracic Oncology, Proceedings of the VLDB Endowment and ACM SIGMOD Record.

In The Last Decade

Warren Shen

21 papers receiving 888 citations

Peers

Peers by citation overlap · ratios (×) are relative to Warren Shen, the reference row

| Name | Country | h | | | | | | Papers | Cites |
|---|---|---|---|---|---|---|---|---|---|
| Warren Shen | United States | 16 | 551 | 499 | 431 | 327 | 183 | 22 | 981 |
| Yannis Tzitzikas | Greece | 13 | 569 (1.0×) | 315 (0.6×) | 147 (0.3×) | 213 (0.7×) | 118 (0.6×) | 109 | 794 |
| Giovanni Tummarello | Ireland | 14 | 632 (1.1×) | 442 (0.9×) | 154 (0.4×) | 298 (0.9×) | 104 (0.6×) | 36 | 796 |
| George Papadakis | Greece | 21 | 930 (1.7×) | 558 (1.1×) | 996 (2.3×) | 269 (0.8×) | 84 (0.5×) | 73 | 1.3k |
| Kris Ganjam | United States | 11 | 376 (0.7×) | 347 (0.7×) | 466 (1.1×) | 248 (0.8×) | 213 (1.2×) | 16 | 686 |
| Paul Ogilvie | United States | 14 | 631 (1.1×) | 485 (1.0×) | 161 (0.4×) | 127 (0.4×) | 142 (0.8×) | 23 | 977 |
| Zhengxiang Pan | United States | 9 | 687 (1.2×) | 330 (0.7×) | 153 (0.4×) | 437 (1.3×) | 128 (0.7×) | 17 | 898 |
| Olaf Hartig | Sweden | 13 | 440 (0.8×) | 290 (0.6×) | 200 (0.5×) | 267 (0.8×) | 69 (0.4×) | 48 | 652 |
| Alvaro A. A. Fernandes | United Kingdom | 18 | 249 (0.5×) | 313 (0.6×) | 141 (0.3×) | 584 (1.8×) | 181 (1.0×) | 84 | 862 |
| Atanas Kiryakov | United Kingdom | 13 | 791 (1.4×) | 473 (0.9×) | 126 (0.3×) | 197 (0.6×) | 106 (0.6×) | 25 | 923 |
| Johannes Hoffart | Germany | 12 | 1.2k (2.3×) | 358 (0.7×) | 293 (0.7×) | 144 (0.4×) | 152 (0.8×) | 26 | 1.4k |

Countries citing papers authored by Warren Shen

This map shows the geographic impact of Warren Shen's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Warren Shen with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Warren Shen more than expected).

Fields of papers citing papers by Warren Shen

Field groups: Physical Sciences · Health Sciences · Life Sciences · Social Sciences

This network shows the impact of papers produced by Warren Shen. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Warren Shen. The network helps show where Warren Shen may publish in the future.

Co-authorship network of co-authors of Warren Shen

This figure shows the co-authorship network connecting the top 25 collaborators of Warren Shen. A scholar is included among the top collaborators of Warren Shen based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Warren Shen. Warren Shen is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
2. Balakrishnan, S., Alon Halevy, Boulos Harb, et al. (2015). Applying WebTables in Practice. Conference on Innovative Data Systems Research. 36 indexed citations
3. Madhavan, Jayant, S. Balakrishnan, Héctor González, et al. (2012). Big Data Storytelling Through Interactive Maps. IEEE Data(base) Engineering Bulletin. 35. 46–54. 9 indexed citations
4. Venetis, Petros, Alon Halevy, Jayant Madhavan, et al. (2011). Recovering semantics of tables on the web. Proceedings of the VLDB Endowment. 4(9). 528–538. 195 indexed citations
5. González, Héctor, et al. (2010). Socialising Data with Google Fusion Tables. IEEE Data(base) Engineering Bulletin. 33. 25–32. 26 indexed citations
6. González, Héctor, et al. (2010). Google fusion tables. VBN Forskningsportal (Aalborg Universitet). 175–180. 63 indexed citations
7. González, Héctor, et al. (2010). Google fusion tables. VBN Forskningsportal (Aalborg Universitet). 1061–1066. 105 indexed citations
8. Doan, AnHai, Jeffrey F. Naughton, Xiaoyong Chai, et al. (2009). The Case for a Structured Approach to Managing Unstructured Data. Conference on Innovative Data Systems Research. 10 indexed citations
9. Bohannon, Philip, Srujana Merugu, Cong Yu, et al. (2009). Purple SOX extraction management system. ACM SIGMOD Record. 37(4). 21–27. 18 indexed citations
10. McCann, Robert, Warren Shen, & AnHai Doan. (2008). Matching Schemas in Online Communities: A Web 2.0 Approach. 110–119. 61 indexed citations
11. Shen, Warren, Pedro DeRose, Robert McCann, AnHai Doan, & Raghu Ramakrishnan. (2008). Toward best-effort information extraction. 1031–1042. 27 indexed citations
12. DeRose, Pedro, Xiaoyong Chai, Byron J. Gao, et al. (2008). Building Community Wikipedias: A Machine-Human Partnership Approach. 29. 646–655. 18 indexed citations
13. Shen, Warren, AnHai Doan, Jeffrey F. Naughton, & Raghu Ramakrishnan. (2007). Declarative information extraction using datalog with embedded extraction predicates. Very Large Data Bases. 1033–1044. 117 indexed citations
14. DeRose, Pedro, Warren Shen, Fei Chen, AnHai Doan, & Raghu Ramakrishnan. (2007). Building structured web community portals: a top-down, compositional, and incremental approach. Very Large Data Bases. 399–410. 52 indexed citations
15. Doan, AnHai, Philip Bohannon, Raghu Ramakrishnan, et al. (2007). User-Centric Research Challenges in Community Information Management Systems. IEEE Data(base) Engineering Bulletin. 30. 32–40. 6 indexed citations
16. DeRose, Pedro, Warren Shen, Fei Chen, et al. (2007). DBLife: A Community Information Management Platform for the Database Research Community (Demonstration). Conference on Innovative Data Systems Research. 169–172. 54 indexed citations
17. Shen, Warren, Pedro DeRose, Long Vu, AnHai Doan, & Raghu Ramakrishnan. (2007). Source-aware Entity Matching: A Compositional Approach. 29. 196–205. 19 indexed citations
18. Doan, AnHai, Raghu Ramakrishnan, Fei Chen, et al. (2006). Community Information Management. IEEE Data(base) Engineering Bulletin. 29. 64–72. 63 indexed citations
19. Doan, AnHai, Robert McCann, & Warren Shen. (2005). Collaborative Development of Information Integration Systems. National Conference on Artificial Intelligence. 34–41. 2 indexed citations
20. Shen, Warren, Xin Li, & AnHai Doan. (2005). Constraint-based entity matching. National Conference on Artificial Intelligence. 862–867. 51 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Rankless by CCL
2026