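We consider a sequence of trees indexed by a nonnegative tuning parameter $\alpha$. For each value of $\alpha$ we wish to find the subtree $T \subset T_0$, where $T_0$ is the full unpruned tree, that minimizes the following:

$$\sum_{m=1}^{|T|} \sum_{i:\, x_i \in R_m} \left(y_i - \hat{y}_{R_m}\right)^2 + \alpha |T|$$

Here $|T|$ is the number of terminal nodes of $T$, $R_m$ is the region corresponding to the $m$th terminal node, and $\hat{y}_{R_m}$ is the mean response of the training observations in $R_m$.

If we set $\alpha = 0$ then we will have $T = T_0$, because the objective reduces to the training error alone. The penalty term $\alpha |T|$ scales with the number of terminal nodes, so there is a price to pay for having many terminal nodes.

We can then obtain a sequence of subtrees as a function of $\alpha$: as $\alpha$ increases, branches are pruned and the minimizing subtree shrinks. (For some $\alpha$ we would get the best subtree.)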
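
The trade-off between fit and tree size can be sketched numerically. The snippet below is a minimal illustration, not a pruning implementation: it assumes we already have a handful of candidate subtrees, each summarized by its number of terminal nodes and its training RSS. The candidate values, `penalized_cost`, and `best_subtree` are all hypothetical names and numbers chosen for the example.

```python
# Hypothetical candidate subtrees, each given as
# (number of terminal nodes, training RSS). The numbers are made up:
# a larger tree fits the training data at least as well as a smaller one.
candidates = [(1, 100.0), (2, 60.0), (3, 45.0), (5, 38.0), (8, 36.0)]


def penalized_cost(n_leaves, rss, alpha):
    """Training RSS plus a charge of alpha for every terminal node."""
    return rss + alpha * n_leaves


def best_subtree(candidates, alpha):
    """Return the (n_leaves, rss) pair with the smallest penalized cost.

    Ties are broken in favor of the tree with fewer terminal nodes.
    """
    return min(
        candidates,
        key=lambda c: (penalized_cost(c[0], c[1], alpha), c[0]),
    )


# Sweep the tuning parameter and report which candidate wins at each value.
for alpha in [0.0, 1.0, 5.0, 20.0, 50.0]:
    n_leaves, rss = best_subtree(candidates, alpha)
    print(f"alpha={alpha:5.1f}  terminal nodes={n_leaves}  RSS={rss:6.1f}")
```

With these made-up numbers, a penalty of zero selects the largest candidate (8 terminal nodes), and the selected tree shrinks step by step as the penalty grows, until at 50 only the single-node tree remains.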