What is an extreme case of bagging and model averaging?
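Dropout. For each training case, we randomly omit a subset of the hidden units, so each case is effectively trained on a different architecture, and all of these architectures share the same weights. Dropout should not be applied at inference time: the full network is used instead, with activations scaled so that its output approximates the average prediction of the sampled networks.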
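The train/inference difference can be sketched in a few lines of plain Python. This is only an illustration, not a reference implementation: the names `dropout`, `p_drop`, `training`, and `rng` are made up for this example, and it uses the "inverted dropout" variant, which rescales the surviving units during training so that inference needs no extra scaling.

```python
import random

def dropout(activations, p_drop=0.5, training=True, rng=random):
    """Inverted dropout over a list of hidden-unit activations (illustrative sketch)."""
    if not 0.0 <= p_drop < 1.0:
        raise ValueError("p_drop must be in [0, 1)")
    # At inference time, keep every unit and leave the activations unchanged.
    if not training or p_drop == 0.0:
        return list(activations)
    keep = 1.0 - p_drop
    # During training, each unit survives with probability `keep`.
    # Survivors are scaled by 1/keep so the expected activation is unchanged.
    return [a / keep if rng.random() < keep else 0.0 for a in activations]

hidden = [1.0, 2.0, 3.0, 4.0]
train_out = dropout(hidden, p_drop=0.5, rng=random.Random(0))  # a random "thinned" network
test_out = dropout(hidden, training=False)                     # the full network
```

Each training call samples a different mask, which is what makes every training case see a different sub-network. The inference call returns the activations untouched.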