Thursday, June 23, 2011

Data mining Blog, Preprocessing, Normalization

Data mining Blog, Preprocessing, Normalization

Outliers can be found by using a technique called box plot (http://en.wikipedia.org/wiki/Box_plot)

Before we normalize the data set, we need to check for outliers

There are two possible ways to handle outliers in the data
1. Ignore them, remove from the data set (or)
2. Reassign the value to of appropriate upper or lower threshold.

No comments:

Post a Comment