Foster Provost

19.3k citations

172 papers · 12.1k indexed citations · 8 hit papers · h-index 48

Computer Science Applications top 0.1%
Artificial Intelligence top 0.05%
- Imbalanced Data Classification Techniques 42
- Machine Learning and Data Classification 41
- Machine Learning and Algorithms 29
- Anomaly Detection Techniques and Applications 13
- Bayesian Modeling and Causal Inference 12
Information Systems top 0.1%
- Data Mining Algorithms and Applications 38
Management Science and Operations Research top 0.2%
Management Information Systems top 0.5%
Statistical and Nonlinear Physics
- Complex Network Analysis Techniques 16
Marketing
- Consumer Market Behavior and Pricing 15

Co-authors: Tom Fawcett, Panagiotis G. Ipeirotis, Gary M. Weiss, Ron Kohavi, Victor S. Sheng, Jing Wang, Maytal Saar‐Tsechansky, Sofus A. Macskassy
Cited by: Computer Science Applications, Artificial Intelligence, Information Systems
Journals: Machine Learning (13 papers), Big Data (9 papers), Information Systems Research (6 papers)
Partner nations: United States, Switzerland, France

In The Last Decade

166 papers receiving 11.0k citations

Hit Papers

8 papers

What are hit papers?

Hit papers significantly outperform the citation benchmark for their cohort. A paper qualifies if it has ≥500 total citations, achieves ≥1.5× the top-1% citation threshold for papers in the same subfield and year (this is the minimum needed to enter the top 1%, not the average within it), or reaches the top citation threshold in at least one of its specific research topics.
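The qualification rule above can be sketched as a small predicate. This is only an illustration, assuming the three conditions are alternatives (any one is sufficient); the `Paper` class, its field names, and the example numbers are hypothetical and not part of Rankless or OpenAlex.

```python
from dataclasses import dataclass


@dataclass
class Paper:
    # All fields are hypothetical inputs used for illustration only.
    total_citations: int
    # Minimum citations needed to enter the top 1% for the same subfield and year.
    top1_threshold: float
    # True if the paper reaches the top citation threshold in at least one topic.
    reaches_topic_top_threshold: bool


def is_hit_paper(paper: Paper) -> bool:
    """Return True if any one of the three stated conditions holds (assumed to be alternatives)."""
    return (
        paper.total_citations >= 500
        or paper.total_citations >= 1.5 * paper.top1_threshold
        or paper.reaches_topic_top_threshold
    )


# Hypothetical example: 150 citations against a top-1% threshold of 100.
example = Paper(total_citations=150, top1_threshold=100.0, reaches_topic_top_threshold=False)
print(is_hit_paper(example))
```

Note that the second condition compares against the entry threshold of the top 1%, not the average citation count within it, as the definition specifies.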

2013 Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking
2013 Big Data
2010 Rare & Special e-Zone (The Hong Kong University of Science and Technology)
2008 Get another label? improving data quality and data mining using multiple, noisy labelers
2003 Journal of Artificial Intelligence Research
2001 Machine Learning
1998 International Conference on Machine Learning
1997 Data Mining and Knowledge Discovery

Peers

Jie Lü Australia

Padhraic Smyth United States

Micheline Kamber Canada

Qing Li China

Eric Horvitz United States

Hsinchun Chen United States

Enhong Chen China

Victor Chang United Kingdom

Hui Xiong China

Guangquan Zhang Australia

Foster Provost relative to Jie Lü (Australia)

Citations per field (Jie Lü = 1×)

Field	Ratio	Citations (Provost/Lü)
CSA	×3.1	1k/465
AI	×1.0	7k/7k
IS	×0.7	3k/4k
MSOR	×0.6	2k/3k
MIS	×1.2	777/641

Citations per year (chart; years shown: 2016 to 2025)

Countries citing papers authored by Foster Provost

This map shows the geographic impact of Foster Provost's research. It shows the number of citations coming from papers published by authors working in each country. You can also color the map by specialization and compare the number of citations received by Foster Provost with the expected number of citations based on a country's size and research output (numbers larger than one mean the country cites Foster Provost more than expected).
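The comparison described above comes down to an observed-to-expected ratio. The sketch below is a guess at how such a ratio could be computed, assuming the expected count is the scholar's total citations multiplied by the country's share of global research output; the page does not state the actual formula, and the function name, parameters, and numbers are hypothetical.

```python
def citation_ratio(observed: int, total_citations: int, country_share: float) -> float:
    """Ratio of observed to expected citations from one country.

    Assumption (not from the source): expected = total_citations * country_share,
    where country_share is the country's fraction of global research output.
    """
    expected = total_citations * country_share
    if expected <= 0:
        raise ValueError("expected citations must be positive")
    return observed / expected


# Hypothetical numbers: 300 citations observed from a country that
# accounts for 20% of global output, out of 1000 citations in total.
ratio = citation_ratio(observed=300, total_citations=1000, country_share=0.2)
print(ratio)  # 1.5
```

A value above one, as in this example, would mean the country cites the scholar more than its size and output would predict.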

Fields of papers citing papers by Foster Provost

Physical Sciences, Health Sciences, Life Sciences, Social Sciences

This network shows the impact of papers produced by Foster Provost. Nodes represent research fields, and links connect fields that are likely to share authors. Colored nodes show fields that tend to cite the papers produced by Foster Provost. The network helps show where Foster Provost may publish in the future.

Co-authorship network

The 25 scholars most cited alongside Foster Provost, linked wherever they have co-authored with each other.

Border = papers with Foster Provost. Line = papers co-authored together. Foster Provost links everyone and is therefore left out of the graph.

All Works

20 of 20 papers shown

#	Work	Year	Citations
1	The Impact of Cloaking Digital Footprints on User Privacy and Personalization Big Data ·Sofie Goethals,Sandra Matz,Foster Provost,David Martens,Y. Chen	2025	0
2	Counterfactual Explanations for Data-Driven Decisions Journal of the Association for Information Systems ·Carlos Fernández,Foster Provost,Ting Jyun Yan	2019	8
3	Data science for business CERN Document Server (European Organization for Nuclear Research) ·Foster Provost,Tom Fawcett	2013	45
4	Explaining Data-Driven Document Classifications Journal of the Association for Information Systems ·David Martens,Foster Provost	2013	8
5	Evaluating and Optimizing Online Advertising: Forget the click, but there are good proxies The Faculty Digital Archive (New York University) ·B D'Alessandro,Bizhan Koucheki Golfazani,Claudia Perlich,Foster Provost	2012	20
6	Explaining Documents' Classifications SSRN Electronic Journal ·David Martens,Foster Provost	2011	2
7	Pseudo-social network targeting from consumer transaction data The Faculty Digital Archive (New York University) ·David Martens,Foster Provost	2011	18
8	Proceedings of the First Workshop on Social Media Analytics Knowledge Discovery and Data Mining ·Prem Melville,Jure Leskovec,Foster Provost	2010	13
9	Get Another Label? Improving Data Quality and Data Mining Using Multiple, Noisy Labelers The Faculty Digital Archive (New York University) ·Victor S. Sheng,Foster Provost,Panagiotis G. Ipeirotis	2008	58
10	Classification in Networked Data: A Toolkit and a Univariate Case Study Journal of Machine Learning Research ·Sofus A. Macskassy,Foster Provost	2007	294
11	Handling Missing Values when Applying Classification Models Journal of Machine Learning Research ·Maytal Saar‐Tsechansky,Foster Provost	2007	204
12	ROC Confidence Bands: An Empirical Study The Faculty Digital Archive (New York University) ·J. Čeiková,Foster Provost,Saharon Rosset	2005	1
13	Classification in Networked Data: a Toolkit and a Univariate Case Study The Faculty Digital Archive (New York University) ·Sofus A. Macskassy,Foster Provost	2004	9
14	Intelligent Assistance for the Data Mining Process: An Ontology-based Approach The Faculty Digital Archive (New York University) ·Abraham Bernstein,Shawndra Hill,Foster Provost	2002	18
15	Tree Induction Vs. Logistic Regression: a Learning-Curve Analysis The Faculty Digital Archive (New York University) ·Claudia Perlich,Foster Provost,Jeffrey S. Simonoff	2001	114
16	Variance-based Active Learning The Faculty Digital Archive (New York University) ·Maytal Saar‐Tsechansky,Foster Provost	2000	1
17	Robust classification systems for imprecise environments National Conference on Artificial Intelligence ·Foster Provost,Tom Fawcett	1998	88
18	Analysis and visualization of classifier performance: comparison under imprecise class and cost distributions Knowledge Discovery and Data Mining ·Foster Provost,Tom Fawcett	1997	490
19	Scaling up inductive algorithms: an overview Knowledge Discovery and Data Mining ·Foster Provost,Venkateswarlu Kolluri	1997	12
20	Inductive policy National Conference on Artificial Intelligence ·Foster Provost,Bruce G. Buchanan	1992	14

About Foster Provost

Foster Provost is a scholar working on Artificial Intelligence, Information Systems and Management Science and Operations Research, having authored 172 papers that have together received 12.1k indexed citations. Recurring topics across this work include Imbalanced Data Classification Techniques (42 papers), Machine Learning and Data Classification (41 papers), Data Mining Algorithms and Applications (38 papers), Machine Learning and Algorithms (29 papers), Complex Network Analysis Techniques (16 papers), Consumer Market Behavior and Pricing (15 papers), Anomaly Detection Techniques and Applications (13 papers) and Bayesian Modeling and Causal Inference (12 papers). The work is most often cited by research in Computer Science Applications (1.4k citations), Artificial Intelligence (7.0k citations) and Information Systems (2.8k citations). Foster Provost has collaborated with scholars based in the United States, Switzerland and France. Frequent co-authors include Tom Fawcett, Panagiotis G. Ipeirotis, Gary M. Weiss, Ron Kohavi, Victor S. Sheng, Jing Wang, Maytal Saar‐Tsechansky, Sofus A. Macskassy, Claudia Perlich and Pedro Domingos. Foster Provost's work appears in journals such as Machine Learning, Big Data, Information Systems Research, Data Mining and Knowledge Discovery and MIS Quarterly.

Rankless uses publication and citation data sourced from OpenAlex, an open and comprehensive bibliographic database. While OpenAlex provides broad and valuable coverage of the global research landscape, it—like all bibliographic datasets—has inherent limitations. These include incomplete records, variations in author disambiguation, differences in journal indexing, and delays in data updates. As a result, some metrics and network relationships displayed in Rankless may not fully capture the entirety of a scholar's output or impact.
