Standout Papers

Bias in random forest variable importance measures: Illustrations, sources and a solution 2007 2026 2013 2019 2.3k
  1. Bias in random forest variable importance measures: Illustrations, sources and a solution (2007)
    Carolin Strobl, Anne‐Laure Boulesteix et al. BMC Bioinformatics
  2. Conditional variable importance for random forests (2008)
    Carolin Strobl, Anne‐Laure Boulesteix et al. BMC Bioinformatics
  3. An introduction to recursive partitioning: Rationale, application, and characteristics of classification and regression trees, bagging, and random forests. (2009)
    Carolin Strobl, James D. Malley et al. Psychological Methods

Citation Impact

Citing Papers

clusterProfiler: an R Package for Comparing Biological Themes Among Gene Clusters
2012 Standout
Warning times for species extinctions due to climate change
2014
Crippling life support for SARS-CoV-2 and other viruses through synthetic lethality
2020 StandoutNobel
Accelerating extinction risk from climate change
2015 StandoutScience
Supervised learning with decision tree-based methods in computational and systems biology
2009
Understanding variable importances in forests of randomized trees
2013
2016 American College of Rheumatology/European League Against Rheumatism Classification Criteria for Primary Sjögren's Syndrome: A Consensus and Data‐Driven Methodology Involving Three International Patient Cohorts
2016 Standout
Stability of person-specific blood-based infrared molecular fingerprints opens up prospects for health monitoring
2021 StandoutNobel
r2VIM: A new variable selection method for random forests in genome-wide association studies
2016
Deep learning algorithm predicts diabetic retinopathy progression in individual patients
2019
Object-oriented mapping of landslides using Random Forests
2011
A comparison of random forest and its Gini importance with standard chemometric methods for the feature selection and classification of spectral data
2009
Probability estimation with machine learning methods for dichotomous and multicategory outcome: Applications
2014
Data mining in the Life Sciences with Random Forest: a walk in the park or lost in the jungle?
2012
Genetic effects on gene expression across human tissues
2017 StandoutNature
Inferring Regulatory Networks from Expression Data Using Tree-Based Methods
2010
The evolving landscape of biomarkers for checkpoint inhibitor immunotherapy
2019 Standout
Stability and aggregation of ranked gene lists
2009
Empirical Bayes analysis of single nucleotide polymorphisms
2008
SNP interaction detection with Random Forests in high-dimensional genetic data
2012
Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning
2018 Standout
ilastik: interactive machine learning for (bio)image analysis
2019 Standout
AI in health and medicine
2022 Standout
2016 American College of Rheumatology/European League Against Rheumatism classification criteria for primary Sjögren's syndrome
2016 Standout
Applying Linear and Non-Linear Methods for Parallel Prediction of Volume of Distribution and Fraction of Unbound Drug
2013
SCENIC: single-cell regulatory network inference and clustering
2017 Standout
Cancer immunotherapy via dendritic cells
2012 Standout
Hepatocellular carcinoma
2021 Standout
Putative mechanisms of antitumor activity of cyano-substituted heteroaryles in HeLa cells
2010
Contamination features and health risk of soil heavy metals in China
2015 Standout
Predicting and interpreting large-scale mutagenesis data using analyses of protein stability and conservation
2022
Classification with correlated features: unreliability of feature ranking and solutions
2011
REVIGO Summarizes and Visualizes Long Lists of Gene Ontology Terms
2011 Standout
High-performance medicine: the convergence of human and artificial intelligence
2018 Standout
GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran
2015 Standout
Drug Concentration Thresholds Predictive of Therapy Failure and Death in Children With Tuberculosis: Bread Crumb Trails in Random Forests
2016
ADMETlab: a platform for systematic ADMET evaluation based on a comprehensively collected ADMET database
2018
Deep learning and process understanding for data-driven Earth system science
2019 StandoutNature
SoilGrids250m: Global gridded soil information based on machine learning
2017 Standout
GeneSrF and varSelRF: a web-based tool and R package for gene selection and classification using random forest
2007
Transcription factors: from enhancer binding to developmental control
2012 Standout
Chemogenomic model identifies synergistic drug combinations robust to the pathogen microenvironment
2018
Applications of machine learning in drug discovery and development
2019 Standout
Keystone taxa as drivers of microbiome structure and functioning
2018 Standout
Mapping Soil Properties of Africa at 250 m Resolution: Random Forests Significantly Improve Current Predictions
2015
Random Forests for Genetic Association Studies
2011
The Musicality of Non-Musicians: An Index for Assessing Musical Sophistication in the General Population
2014 Standout
Pyrazinamide clearance is impaired among HIV/tuberculosis patients with high levels of systemic immune activation
2017 StandoutNobel
Reinforcement Learning Trees
2015
An empirical assessment of validation practices for molecular classifiers
2011
Machine learning models in electronic health records can outperform conventional survival models for predicting patient mortality in coronary artery disease
2018
Hepatocellular carcinoma
2022 Standout
The microbial nitrogen-cycling network
2018 Standout
Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features
2016
Integrating gut microbiota immaturity and disease‐discriminatory taxa to diagnose the initiation and severity of shrimp disease
2017 Standout
Evaluation of variable selection methods for random forests and omics data sets
2017
Recently identified microbial guild mediates soil N2O sink capacity
2014
pkCSM: Predicting Small-Molecule Pharmacokinetic and Toxicity Properties Using Graph-Based Signatures
2015 Standout
Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI
2019 Standout
Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and Elaboration
2015 Standout
Correlation and variable importance in random forests
2016
A Bias Correction Algorithm for the Gini Variable Importance Measure in Classification Trees
2008
Variable selection using random forests
2010 Standout
A Genomic Approach to Human Autoimmune Diseases
2010
On safari to Random Jungle: a fast implementation of Random Forests for high-dimensional data
2010
Artificial neural network model for forecasting sub-hourly electricity usage in commercial buildings
2015
A review of data-driven building energy consumption prediction studies
2017 Standout
Free-riders to forerunners
2015 StandoutNobel
Economics- and policy-driven organic carbon input enhancement dominates soil organic carbon accumulation in Chinese croplands
2018 Standout
Improving the accuracy of rainfall rates from optical satellite sensors with machine learning — A random forests-based approach applied to MSG SEVIRI
2013
Forest Garrote
2009
Application of Stochastic Models in Identification and Apportionment of Heavy Metal Pollution Sources in the Surface Soils of a Large-Scale Region
2013
Predicting functionality of protein–DNA interactions by integrating diverse evidence
2009
On the Importance of Training Data Sample Selection in Random Forest Image Classification: A Case Study in Peatland Ecosystem Mapping
2015
A REVIEW OF STATISTICAL METHODS FOR DETERMINATION OF RELATIVE IMPORTANCE OF CORRELATED PREDICTORS AND IDENTIFICATION OF DRIVERS OF CONSUMER LIKING
2012
Stability Investigations of Multivariable Regression Models Derived from Low- and High-Dimensional Data
2011
A random forest guided tour
2016 Standout
Recent advances and applications of machine learning in solid-state materials science
2019 Standout
When does no-till yield more? A global meta-analysis
2015
Predictor correlation impacts machine learning algorithms: implications for genomic studies
2009
Dynamic Random Forests
2012
Hierarchic Interaction of Factors Associated With Liver Decompensation After Resection for Hepatocellular Carcinoma
2016
Over-optimism in bioinformatics: an illustration
2010
A review of sensitivity analysis methods in building energy analysis
2013
The Immune Response in Tuberculosis
2013 Standout
Tumor Immunity and Survival as a Function of Alternative Neopeptides in Human Cancer
2018
Consumer credit risk: Individual probability estimates using machine learning
2013
A review on time series forecasting techniques for building energy consumption
2017 Standout
A comparative study of different classification techniques for marine oil spill identification using RADARSAT-1 imagery
2013
A new variable selection approach using Random Forests
2012
Interpretation of nonlinear relationships between process variables by use of random forests
2012
Interpretability of deep learning models: A survey of results
2017
Overview of random forest methodology and practical guidance with emphasis on computational biology and bioinformatics
2012
EASL Clinical Practice Guidelines: Management of hepatocellular carcinoma
2018 Standout
Predictors of public climate change awareness and risk perception around the world
2015 Standout
ADMETlab 2.0: an integrated online platform for accurate and comprehensive predictions of ADMET properties
2021 Standout
In silico discovery of metal-organic frameworks for precombustion CO 2 capture using a genetic algorithm
2016 StandoutNobel
Random forest in remote sensing: A review of applications and future directions
2016 Standout
Spatial application of Random Forest models for fine-scale coastal vegetation classification using object based analysis of aerial orthophoto and DEM data
2015
Enriched random forests
2008
Definitions, methods, and applications in interpretable machine learning
2019 Standout
Variable importance analysis: A comprehensive review
2015 Standout
What Are the Best Materials To Separate a Xenon/Krypton Mixture?
2015
Navigating Random Forests and related advances in algorithmic modeling
2009
A computationally fast variable importance test for random forests for high-dimensional data
2016
Structural Analysis and Identification of Colloidal Aggregators in Drug Discovery
2019
The revival of the Gini importance?
2018 Standout
Modelling Forest α-Diversity and Floristic Composition — On the Added Value of LiDAR plus Hyperspectral Remote Sensing
2012
Transcription factor and chromatin features predict genes associated with eQTLs
2012
Permutation importance: a corrected feature importance measure
2010
Mining data with random forests: current options for real‐world applications
2013
Empirical characterization of random forest variable importance measures
2007
AIRBORNE LIDAR FEATURE SELECTION FOR URBAN CLASSIFICATION USING RANDOM FORESTS
2009
An integrated approach to reduce the impact of minor allele frequency and linkage disequilibrium on variable importance measures for genome-wide data
2012
Accurate proteome-wide missense variant effect prediction with AlphaMissense
2023 StandoutScienceNobel
Probability estimation with machine learning methods for dichotomous and multicategory outcome: Theory
2014
A peek into the black box: exploring classifiers by randomization
2014
Variable Importance Assessment in Regression: Linear Regression versus Random Forest
2009
Empirical comparison of tree ensemble variable importance measures
2010
Recursive partitioning on incomplete data using surrogate decisions and multiple imputation
2011

Works of Carolin Strobl being referenced

Multiple Testing for SNP-SNP Interactions
2007
Bias in random forest variable importance measures: Illustrations, sources and a solution
2007 Standout
Optimal classifier selection and negative bias in error rate estimation: an empirical study on high-dimensional prediction
2009
Conditional variable importance for random forests
2008 Standout
An introduction to recursive partitioning: Rationale, application, and characteristics of classification and regression trees, bagging, and random forests.
2009 Standout
Random forest Gini importance favours SNPs with large minor allele frequency: impact, sources and recommendations
2011
An AUC-based permutation variable importance measure for random forests
2013
The behaviour of random forest permutation-based variable importance measures under predictor correlation
2010
A new variable importance measure for random forests with missing data
2012
Random Forests with Missing Values in the Covariates
2010
Evaluating Microarray-based Classifiers: An Overview
2008
Unbiased split selection for classification trees based on the Gini Index
2006
Danger: High Power! - Exploring the Statistical Properties of a Test for Random Forest Variable Importance
2008
Rankless by CCL
2026