Why is naive Bayes so bad? How would you improve a spam detection algorithm that uses naive Bayes?



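-Naïve: the features are assumed to be conditionally independent given the class (and therefore uncorrelated)
-This assumption rarely holds in practice; in spam detection, for example, certain words tend to occur together
-Improvement: decorrelate the features before fitting the model (transform them so that their covariance matrix becomes the identity matrix)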
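
The decorrelation step can be sketched as follows. This is only a minimal illustration, assuming continuous features stored in a NumPy array with one row per message. The function name `whiten`, the `eps` term and the toy data are hypothetical and not part of the original answer. Note that this removes linear correlation only, which is weaker than full independence.

```python
import numpy as np


def whiten(X, eps=1e-8):
    """Return a decorrelated copy of X (rows = samples, columns = features)."""
    # Center each feature so the covariance is computed around zero.
    X_centered = X - X.mean(axis=0)
    cov = np.cov(X_centered, rowvar=False)
    # eigh is meant for symmetric matrices such as a covariance matrix.
    eigvals, eigvecs = np.linalg.eigh(cov)
    # Rotate onto the eigenvectors, then rescale each axis to unit variance.
    # eps (an assumption) guards against division by a near-zero eigenvalue.
    return (X_centered @ eigvecs) / np.sqrt(eigvals + eps)


# Hypothetical toy data: two strongly correlated features.
rng = np.random.default_rng(0)
a = rng.normal(size=500)
X = np.column_stack([a, a + 0.1 * rng.normal(size=500)])

X_white = whiten(X)
cov_white = np.cov(X_white, rowvar=False)  # approximately the identity matrix
```

The whitened features could then be passed to a naive Bayes variant that accepts real-valued inputs, such as a Gaussian one. The transform should be estimated on the training data only and reused unchanged on new messages.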
