Is mean imputation of missing data acceptable practice? Why or why not?


-Bad practice in general
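-If the only goal is to estimate a mean: mean imputation preserves the mean of the observed data (which is an unbiased estimate of the true mean only if values are missing completely at random)
-Leads to an underestimate of the standard deviation, because every imputed value sits exactly at the mean and adds no spread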
-Distorts relationships between variables by "pulling" estimates of the correlation toward zero
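
The points above can be checked with a small simulation. This is a minimal sketch, not a definitive benchmark: the data are synthetic, the 40% missingness rate and the true correlation of about 0.8 are arbitrary choices, and the values are assumed to be missing completely at random. All variable names are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)  # fixed seed so the run is repeatable

# Two correlated variables; the true correlation is about 0.8 (assumed setup).
n = 10_000
x = rng.normal(size=n)
y = 0.8 * x + 0.6 * rng.normal(size=n)

# Drop 40% of y completely at random (MCAR assumption).
missing = rng.random(n) < 0.4
y_observed = y[~missing]

# Mean imputation: fill every gap with the mean of the observed values.
y_imputed = y.copy()
y_imputed[missing] = y_observed.mean()

# Compare the observed cases with the mean-imputed data.
mean_observed = y_observed.mean()
mean_imputed = y_imputed.mean()
sd_observed = y_observed.std(ddof=1)
sd_imputed = y_imputed.std(ddof=1)
corr_observed = np.corrcoef(x[~missing], y_observed)[0, 1]
corr_imputed = np.corrcoef(x, y_imputed)[0, 1]

print(f"mean: observed={mean_observed:.3f}  imputed={mean_imputed:.3f}")
print(f"sd:   observed={sd_observed:.3f}  imputed={sd_imputed:.3f}")
print(f"corr: observed={corr_observed:.3f}  imputed={corr_imputed:.3f}")
```

Under these assumptions the two means should match, while the standard deviation and the correlation with x should both come out smaller after imputation.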
