Foster Provost

19.3k total citations · 8 hit papers
172 papers, 12.1k citations indexed

About

Foster Provost is a scholar working on Artificial Intelligence, Information Systems and Management Science and Operations Research. According to data from OpenAlex, Foster Provost has authored 172 papers receiving a total of 12.1k indexed citations (citations by other indexed papers that have themselves been cited), including 109 papers in Artificial Intelligence, 54 papers in Information Systems and 27 papers in Management Science and Operations Research. Recurrent topics in Foster Provost's work include Imbalanced Data Classification Techniques (42 papers), Machine Learning and Data Classification (41 papers) and Data Mining Algorithms and Applications (38 papers). Foster Provost is often cited by papers focused on Imbalanced Data Classification Techniques (42 papers), Machine Learning and Data Classification (41 papers) and Data Mining Algorithms and Applications (38 papers). Foster Provost collaborates with scholars based in United States, Switzerland and France. Foster Provost's co-authors include Tom Fawcett, Panagiotis G. Ipeirotis, Gary M. Weiss, Ron Kohavi, Victor S. Sheng, Jing Wang, Maytal Saar‐Tsechansky, Sofus A. Macskassy, Claudia Perlich and Pedro Domingos and has published in prestigious journals such as Management Science, MIS Quarterly and The American Historical Review.

In The Last Decade

Foster Provost

166 papers receiving 11.0k citations

Hit Papers

Data Science and its Relationship to B... 1997 2026 2006 2016 2013 2001 2008 1998 2010 250 500 750

Peers — A (Enhanced Table)

Peers by citation overlap · career bar shows stage (early→late) cites · hero ref

Name h Career Trend Papers Cites
Foster Provost United States 48 7.0k 2.8k 1.5k 1.4k 989 172 12.1k
Jie Lü Australia 67 6.8k 1.0× 4.2k 1.5× 2.7k 1.8× 465 0.3× 2.3k 2.3× 631 16.8k
Guangquan Zhang Australia 63 5.9k 0.8× 3.0k 1.1× 2.6k 1.7× 340 0.2× 1.8k 1.8× 521 13.2k
Micheline Kamber Canada 15 7.0k 1.0× 5.5k 1.9× 797 0.5× 586 0.4× 1.8k 1.8× 20 15.0k
Eric Horvitz United States 73 8.0k 1.1× 4.1k 1.4× 1.2k 0.8× 1.9k 1.3× 3.0k 3.1× 393 20.5k
Padhraic Smyth United States 62 8.3k 1.2× 4.6k 1.6× 904 0.6× 714 0.5× 1.9k 1.9× 271 17.6k
Hsinchun Chen United States 57 5.4k 0.8× 4.6k 1.6× 1.4k 0.9× 284 0.2× 1.2k 1.3× 299 11.8k
Enhong Chen China 63 7.9k 1.1× 5.2k 1.8× 871 0.6× 1.4k 1.0× 3.6k 3.7× 665 16.2k
Avrim Blum United States 50 9.0k 1.3× 1.5k 0.5× 2.4k 1.6× 561 0.4× 3.1k 3.1× 187 15.4k
Qing Li China 63 7.7k 1.1× 4.9k 1.7× 1.1k 0.7× 735 0.5× 5.6k 5.6× 1.3k 22.2k
Tie‐Yan Liu China 50 9.2k 1.3× 4.6k 1.6× 1.8k 1.2× 520 0.4× 3.6k 3.6× 258 18.9k

Countries citing papers authored by Foster Provost

Since Specialization
Citations

This map shows the geographic impact of Foster Provost's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Foster Provost with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Foster Provost more than expected).

Fields of papers citing papers by Foster Provost

Since Specialization
Physical SciencesHealth SciencesLife SciencesSocial Sciences

This network shows the impact of papers produced by Foster Provost. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Foster Provost. The network helps show where Foster Provost may publish in the future.

Co-authorship network of co-authors of Foster Provost

This figure shows the co-authorship network connecting the top 25 collaborators of Foster Provost. A scholar is included among the top collaborators of Foster Provost based on the total number of citations received by their joint publications. Widths of edges represent the number of papers authors have co-authored together. Node borders signify the number of papers an author published with Foster Provost. Foster Provost is excluded from the visualization to improve readability, since they are connected to all nodes in the network.

All Works

20 of 20 papers shown
1.
Fernández, Carlos, et al.. (2019). Counterfactual Explanations for Data-Driven Decisions. Journal of the Association for Information Systems. 8 indexed citations
2.
Provost, Foster, et al.. (2016). Matrix-Factorization-Based Dimensionality Reduction in the Predictive Modeling Process: A Design Science Perspective. The Faculty Digital Archive (New York University). 2 indexed citations
3.
D'Alessandro, B, et al.. (2015). Evaluating and Optimizing Online Advertising: Forget the Click, but There Are Good Proxies. Big Data. 3(2). 90–102. 6 indexed citations
4.
Provost, Foster, Geoffrey I. Webb, Ron Bekkerman, et al.. (2014). A Data Scientist's Guide to Start-Ups. Big Data. 2(3). 117–128. 1 indexed citations
5.
Provost, Foster & Tom Fawcett. (2014). Authors' Response to Gong's, “Comment on Data Science and its Relationship to Big Data and Data-Driven Decision Making”. Big Data. 2(1). 1–1. 16 indexed citations
6.
Provost, Foster & Tom Fawcett. (2013). Data Science and its Relationship to Big Data and Data-Driven Decision Making. Big Data. 1(1). 51–59. 868 indexed citations breakdown →
7.
Provost, Foster & Tom Fawcett. (2013). Data science for business. CERN Document Server (European Organization for Nuclear Research). 45 indexed citations
8.
Fortuny, Enric Junqué de, David Martens, & Foster Provost. (2013). Predictive Modeling With Big Data: Is Bigger Really Better ?. Big Data. 1(4). 215–226. 122 indexed citations
9.
Martens, David & Foster Provost. (2011). Pseudo-social network targeting from consumer transaction data. The Faculty Digital Archive (New York University). 18 indexed citations
10.
Sheng, Victor S., Foster Provost, & Panagiotis G. Ipeirotis. (2008). Get Another Label? Improving Data Quality and Data Mining Using Multiple, Noisy Labelers. The Faculty Digital Archive (New York University). 58 indexed citations
11.
Macskassy, Sofus A. & Foster Provost. (2007). Classification in Networked Data: A Toolkit and a Univariate Case Study. Journal of Machine Learning Research. 8(34). 935–983. 294 indexed citations
12.
Hill, Shawndra, Foster Provost, & Chris Volinsky. (2007). Learning and Inference in Massive Social Networks. The Faculty Digital Archive (New York University). 95(3). 501–14. 12 indexed citations
13.
Perlich, Claudia & Foster Provost. (2005). ACORA: Distribution-Based Aggregation for Relational Learning from Identifier Attributes. The Faculty Digital Archive (New York University). 16 indexed citations
14.
Macskassy, Sofus A. & Foster Provost. (2004). Simple Models and Classification in Networked Data. The Faculty Digital Archive (New York University). 6 indexed citations
15.
Bernstein, Abraham, et al.. (2002). Discovering Knowledge from Relational Data Extracted from Business News. The Faculty Digital Archive (New York University). 19 indexed citations
16.
Lee, Doheon, Mario Schkolnick, Foster Provost, & Ramakrishnan Srikant. (2001). Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining. Knowledge Discovery and Data Mining. 20 indexed citations
17.
Dhar, Vasant, et al.. (2000). Discovering Interesting Patterns for Investment Decision Making with Glower C - a Genetic Learner Overlaid with Entropy Reduction. The Faculty Digital Archive (New York University). 2 indexed citations
18.
Provost, Foster, Tom Fawcett, & Ron Kohavi. (1998). The Case against Accuracy Estimation for Comparing Induction Algorithms. International Conference on Machine Learning. 445–453. 715 indexed citations breakdown →
19.
Fawcett, Tom & Foster Provost. (1996). Combining data mining and machine learning for effective user profiling. Knowledge Discovery and Data Mining. 8–13. 115 indexed citations
20.
Aronis, John M. & Foster Provost. (1994). Efficiently constructing relational features from background knowledge for inductive machine learning. Knowledge Discovery and Data Mining. 347–358. 11 indexed citations

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.

Explore authors with similar magnitude of impact

Rankless by CCL
2026