We have a binary class problem with output

Suppose we have a vector of predictors and a Classifier we can produce a single prediction .

We have the following expression for error

Error

This error expression is using an Indicator Function which counts the number of misclassified predictions. The error is a ratio of misclassified over values classified.

We define a weak classifier as one whose prediction is only slightly better than chance.

The core idea behind the AdaBoost algorithm is to chain these weak learners and sequentially pass the data through them. The algorithm tries to add more weight to training examples which the previous classifiers have incorrectly classified. Since the goal of each classifier is to minimize the error, examples with higher weight will be paid more “attention” to and hopefully fixed.

  1. We set weight for each training example to be for training examples.
  2. For to
    1. is fit to the training data with

For each classifier, we compute the weighted error and calculate alpha for the corresponding classifier. Then, update all the weights. Repeat this process for all the classifiers. The final result will the the sign produced by the classifier.

AdaBoost

AdaBoost is very good and can outperform large classification trees.