Data Discretization with Chi-Squared TestΒΆ
This step first preforms an initial discretization for the input discrete data, then repeats a bottom-up merging process continuously until a termination condition is fulfilled. The merging process consists of two steps: (1) perform the Chi-squared test for each pair of adjacent intervals, (2) merge the pair of adjacent intervals with the lowest Chi-square value. Merging continues until all pairs. For details refer to the ChiMerge algorithm.
Input Parameters
- Input data
- Chi-squared significance threshold
Output Parameters
- Discretized data
Workflow
Algorithm
References
- R. Kerber, ChiMerge: Discretization of Numeric Attributes, Learning: Inductive, AAAI 92, pp. 123-128, 1992.