What is the maximal margin classifier? How this margin can be achieved?
What is the maximal margin classifier? How this margin can be achieved?
-When the data can be perfectly separated using a hyperplane, there actually exists an infinite number of these hyperplanes
-Intuition: a hyperplane can usually be shifted a tiny bit up, or down, or rotated, without coming into contact with any of the observations
-Large margin classifier: choosing the hyperplance that is farthest from the training observations
-This margin can be achieved using support vectors