What impurity measures do you know?

What impurity measures do you know?



Gini
-Gini=1−∑jp2jGini=1−∑jpj2

Information Gain/Deviance
-InformationGain=∑jpjlog2pjInformationGain=∑jpjlog2pj
-Better than Gini when pjpj are very small: multiplying very small numbers leads to rounding errors, we can instead take logs.

Popular posts from this blog

After analyzing the model, your manager has informed that your regression model is suffering from multicollinearity. How would you check if he's true? Without losing any information, can you still build a better model?

Is rotation necessary in PCA? If yes, Why? What will happen if you don't rotate the components?

What does Latency mean?