We consider a sequence of trees alongside some tuning parameter and wish to minimize the following

is the number of terminal nodes is the terminal node region

If we set then we will have . scales with the number of terminal nodes, so there is a price to pay for having many terminal nodes.

We can then obtain a sequence of subtrees as a function of . (For some we would get the best subtree)