How would you split up a data set in order to choose from multiple models?
How would you split up a data set in order to choose from multiple models?
Answer: In such a situation, you should split the data into three parts: a training set for building models, a validation set for choosing among trained models (called the cross-validation set), and a test set for judging the final model.