Michael Cafarella

7.8k total citations · 4 hit papers
98 papers, 4.9k citations indexed

About

Michael Cafarella is a scholar working on Artificial Intelligence, Information Systems and Computer Networks and Communications. According to data from OpenAlex, Michael Cafarella has authored 98 papers receiving a total of 4.9k indexed citations (citations by other indexed papers that have themselves been cited), including 48 papers in Artificial Intelligence, 39 papers in Information Systems and 38 papers in Computer Networks and Communications. Recurrent topics in Michael Cafarella's work include Advanced Database Systems and Queries (24 papers), Web Data Mining and Analysis (23 papers) and Data Quality and Management (22 papers). Michael Cafarella is often cited by papers focused on Advanced Database Systems and Queries (24 papers), Web Data Mining and Analysis (23 papers) and Data Quality and Management (22 papers). Michael Cafarella collaborates with scholars based in United States, Israel and Switzerland. Michael Cafarella's co-authors include Oren Etzioni, Stephen Soderland, Michele Banko, Alon Halevy, Alexander Yates, Doug Downey, Ana-Maria Popescu, Tal Shaked, Daniel S. Weld and Eugene Wu and has published in prestigious journals such as PLoS ONE, Chemistry of Materials and Communications of the ACM.

In The Last Decade

Michael Cafarella

94 papers receiving 4.4k citations

Hit Papers

align trajectories

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.

Open information extraction from the web

2007 831 citations Michele Banko, Michael Cafarella et al. International Joint Conference on Artificial Intelligence profile →
Unsupervised named-entity extraction from the Web: An experimental study

2005 710 citations Oren Etzioni, Michael Cafarella et al. profile →
Web-scale information extraction in knowitall

2004 498 citations Oren Etzioni, Michael Cafarella et al. profile →
WebTables

2008 409 citations Michael Cafarella, Alon Halevy et al. Proceedings of the VLDB Endowment profile →

Peers

Countries citing papers authored by Michael Cafarella

Since Specialization

Citations

This map shows the geographic impact of Michael Cafarella's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Michael Cafarella with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Michael Cafarella more than expected).

Fields of papers citing papers by Michael Cafarella

Since Specialization

Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Michael Cafarella. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Michael Cafarella. The network helps show where Michael Cafarella may publish in the future.

Co-authorship network of co-authors of Michael Cafarella

This figure shows the co-authorship network connecting the top 25 collaborators of Michael Cafarella. A scholar is included among the top collaborators of Michael Cafarella based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Michael Cafarella. Michael Cafarella is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

Sort: Min cites: Since: Top N: Style:

20 of 20 papers shown

#	Work	Indexed citations
1	Press ECCS to Doubt (Your Causal Graph) 2024 ·(unknown), (unknown), (unknown), (unknown), Chunwei Liu, Ibrahim Sabek, Michael Cafarella	0
2	Databases Unbound: Querying All of the World's Bytes with AI 2024 ·Proceedings of the VLDB Endowment ·Samuel Madden, Michael Cafarella, Michael J. Franklin, Tim Kraska	5
3	Cackle: Analytical Workload Cost and Performance Stability With Elastic Pools 2023 ·Proceedings of the ACM on Management of Data ·Matthew Perron, Raul Castro Fernandez, David J. DeWitt, Michael Cafarella, Samuel Madden	3
4	Using Machine Learning to Construct Hedonic Price Indices 2023 ·SSRN Electronic Journal ·Michael Cafarella, Gabriel Ehrlich, Tian Gao, John Haltiwanger, Matthew D. Shapiro, (unknown)	1
5	Constructing Expressive Relational Queries with Dual-Specification Synthesis. 2020 ·Conference on Innovative Data Systems Research ·(unknown), (unknown), Michael Cafarella, H. V. Jagadish	2
6	Sledgehammer: cluster-fueled debugging 2018 ·Operating Systems Design and Implementation ·(unknown), Jason Flinn, Michael Cafarella	3
7	HARE: Hardware accelerator for regular expressions 2016 ·(unknown), Aasheesh Kolli, Michael Cafarella, Loris D’Antoni, Thomas F. Wenisch	38
8	Runtime Support for Human-in-the-Loop Feature Engineering System. 2016 ·IEEE Data(base) Engineering Bulletin ·Michael R. Anderson, (unknown), Michael Cafarella	4
9	DQBarge: improving data-quality tradeoffs in large-scale internet services 2016 ·Operating Systems Design and Implementation ·Michael Chow, Kaushik Veeraraghavan, Michael Cafarella, Jason Flinn	7
10	Brainwash: A data system for feature engineering 2013 ·Conference on Innovative Data Systems Research ·Michael R. Anderson, (unknown), Victor Bittorf, Matthew Burgess, Michael Cafarella, (unknown), Feng Niu, Yongjoo Park, Christopher Ré, Ce Zhang	68
11	Ringtail: Feature Selection For Easier Nowcasting. 2013 ·(unknown), Michael Cafarella, Margaret C. Levenstein, Christopher Ré, Matthew D. Shapiro	7
12	Extracting and Querying a Comprehensive Web Database. 2009 ·Conference on Innovative Data Systems Research ·Michael Cafarella	22
13	Uncovering the Relational Web 2008 ·Michael Cafarella, Alon Halevy, Yang Zhang, Daisy Zhe Wang, Eugene Wu	91
14	Structured querying of web text 2007 ·Conference on Innovative Data Systems Research ·Michael Cafarella, Christopher Ré, Dan Suciu, Oren Etzioni, Michele Banko	21
15	Open information extraction from the web breakdown → 2007 ·International Joint Conference on Artificial Intelligence ·Michele Banko, Michael Cafarella, Stephen Soderland, (unknown), Oren Etzioni	831
16	Navigating Extracted Data with Schema Discovery. 2007 ·Michael Cafarella, Dan Suciu, Oren Etzioni	15
17	Structured Querying of Web Text Data: A Technical Challenge. 2007 ·Conference on Innovative Data Systems Research ·Michael Cafarella, Christopher Ré, Dan Suciu, Oren Etzioni	39
18	Machine reading 2006 ·National Conference on Artificial Intelligence ·Oren Etzioni, Michele Banko, Michael Cafarella	68
19	Ontology-driven information extraction with OntoSyphon 2006 ·Defense Technical Information Center (DTIC) ·Luke K. McDowell, Michael Cafarella	10
20	Methods for domain-independent information extraction from the web: an experimental comparison 2004 ·National Conference on Artificial Intelligence ·Oren Etzioni, Michael Cafarella, Doug Downey, Ana-Maria Popescu, Tal Shaked, Stephen Soderland, Daniel S. Weld, Alexander Yates	73

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact