Ensemble Methods
Bagging, boosting, random forests, and model ensembles.
AdaBoost
Sequentially training weak learners that focus on previously misclassified examples – boosting accuracy through reweighting.
Bagging and Bootstrap
Training multiple models on bootstrapped samples and averaging predictions – reducing variance through diversity.
Gradient Boosting
Building an additive model by fitting each new tree to the residual errors of the ensemble – the most powerful tabular method.
Random Forests
Bagged decision trees with random feature subsets – robust, parallelizable, and hard to overfit with more trees.
Stacking and Blending
Training a meta-learner on base model predictions – combining diverse model families for competition-winning performance.
XGBoost, LightGBM, and CatBoost
Industrial-strength gradient boosting implementations with regularization, histogram binning, and GPU acceleration.