Why is naive Bayes so bad? How would you improve a spam detection algorithm that uses naive Bayes?
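-Naïve: the features are assumed to be conditionally independent given the class (and therefore uncorrelated)
-Assumption rarely holds in practice: in spam, words such as "free" and "offer" tend to co-occur, so correlated features get their evidence counted more than once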
-Improvement: decorrelate the features (whiten them, i.e. transform them so that the covariance matrix becomes the identity matrix)
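
The decorrelation step can be sketched as follows. This is a minimal illustration, not a full spam filter: it assumes continuous features, uses made-up toy data, and whitens with a Cholesky factor of the sample covariance (the helper names `covariance`, `cholesky`, and `whiten` are hypothetical). Whitening only removes linear correlation, and in practice the transform would be fitted on training data and then applied unchanged to new messages.

```python
import math
import random

def covariance(rows):
    """Return (mean vector, sample covariance matrix) for a list of feature rows."""
    n, d = len(rows), len(rows[0])
    mean = [sum(r[j] for r in rows) / n for j in range(d)]
    cov = [[sum((r[i] - mean[i]) * (r[j] - mean[j]) for r in rows) / (n - 1)
            for j in range(d)] for i in range(d)]
    return mean, cov

def cholesky(a):
    """Lower-triangular L with L L^T = a (a must be symmetric positive definite)."""
    d = len(a)
    L = [[0.0] * d for _ in range(d)]
    for i in range(d):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(a[i][i] - s)
            else:
                L[i][j] = (a[i][j] - s) / L[j][j]
    return L

def whiten(rows):
    """Map each row x to z = L^{-1} (x - mean), so that cov(z) is the identity."""
    mean, cov = covariance(rows)
    L = cholesky(cov)
    out = []
    for r in rows:
        z = []
        for i in range(len(r)):
            # Forward substitution: solve L z = (x - mean) one coordinate at a time.
            s = sum(L[i][k] * z[k] for k in range(i))
            z.append((r[i] - mean[i] - s) / L[i][i])
        out.append(z)
    return out

# Toy data (assumption): two strongly correlated features.
random.seed(0)
rows = []
for _ in range(500):
    a = random.gauss(0, 1)
    rows.append([a, 0.9 * a + 0.3 * random.gauss(0, 1)])

_, cov_before = covariance(rows)          # large off-diagonal entry
_, cov_after = covariance(whiten(rows))   # identity, up to rounding error
```

After whitening, a naive Bayes model with Gaussian likelihoods would be fitted on the transformed rows instead of the raw ones. The independence assumption is then at least consistent with the second-order statistics of the data.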