Observations not included in the bootstrap sample during bagging are called Out of Bag observations.

We can use the Out off Bag observations in a similar way to how validation sets are used during cross validation.

We can make predictions for where some data point in the Out of Bag set.

How many data points will not be selected at each point during bagging?

We can see here that approximately of the values will not be selected for very large values of in bootstrapping.

In Regression, average the predicted value across all models in ensemble. In Classification, we take the majority vote.

Key Ideas

  1. Out of Bag Estimate is an efficient and reliable alternative to cross-validation
  2. We use make predictions on the Out of Bag observations. See Out of Bag Observations