Data Discretization with K-Means Clustering
The K-Means clustering algorithm is first used to partition the input data values into clusters. The discretization intervals are then derived from the minimum and maximum values of the data set, the computed cluster centers, and the midpoints between each pair of neighboring cluster centers.
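The following is a minimal sketch of this idea for one-dimensional data, not the module's actual implementation. It assumes the midpoints are taken between neighboring cluster centers and used as bin boundaries, with the data minimum and maximum as the outer edges. The function names `kmeans_1d` and `discretize`, the parameter `k`, and the quantile-based initialization are all hypothetical choices made for illustration.

```python
from bisect import bisect_right


def kmeans_1d(values, k, iterations=100):
    """Run Lloyd's algorithm on 1-D data and return the sorted cluster centers."""
    ordered = sorted(values)
    n = len(ordered)
    # Assumption: start from evenly spaced quantiles so the result is deterministic.
    centers = [ordered[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    for _ in range(iterations):
        groups = [[] for _ in range(k)]
        for v in ordered:
            # Assign each value to its nearest center (ties go to the lower index).
            nearest = min(range(k), key=lambda i: abs(v - centers[i]))
            groups[nearest].append(v)
        # Recompute each center as the mean of its group; keep it if the group is empty.
        updated = [sum(g) / len(g) if g else centers[i] for i, g in enumerate(groups)]
        if updated == centers:
            break
        centers = updated
    return sorted(centers)


def discretize(values, k):
    """Map each value to a bin index using midpoints between neighboring centers."""
    centers = kmeans_1d(values, k)
    midpoints = [(a + b) / 2 for a, b in zip(centers, centers[1:])]
    # Bin edges: data minimum, the midpoints, data maximum.
    edges = [min(values)] + midpoints + [max(values)]
    # A value equal to a midpoint falls into the upper bin.
    labels = [bisect_right(midpoints, v) for v in values]
    return labels, edges


data = [1.0, 1.2, 0.8, 5.0, 5.3, 4.9, 9.8, 10.1, 10.4]
labels, edges = discretize(data, k=3)
print(labels)  # [0, 0, 0, 1, 1, 1, 2, 2, 2]
print(edges)
```

In this sketch the discretized output is the list of bin indices, and `edges` records the interval boundaries so the same binning can be applied to new values.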
Input Parameters
- Input data
Output Parameters
- Discretized data
Workflow
Algorithm
References
- J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed., Amsterdam: Morgan Kaufmann Publishers, 2012.