Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets | A. Fernandez, M.J. del Jesus, F. Herrera, Hierarchical Fuzzy Rule Based Classification Systems with Genetic Rule Selection for Imbalanced Data-Sets. International Journal of Approximate Reasoning 50 (2009) 561-577, doi: 10.1016/j.ijar.2008.11.004 | |
Abstract: In many real application areas, the data used are highly skewed and the number of instances for some classes are much higher than that of the other classes. Solving a classification task using such an imbalanced data-set is difficult due to the bias of the training towards the majority classes.
The aim of this paper is to improve the performance of fuzzy rule based classification systems on imbalanced domains, increasing the granularity of the fuzzy partitions on the boundary areas between the classes, in order to obtain a better separability. We propose the use of a hierarchical fuzzy rule based classification system, which is based on the refinement of a simple linguistic fuzzy model by means of the extension of the structure of the knowledge base in a hierarchical way and the use of a genetic rule selection process in order to get a compact and accurate model. The good performance of this approach is shown through an extensive experimental study carried out over a large collection of imbalanced data-sets.
Summary: 1. Introduction
2. Imbalanced data-sets in classification
3. Hierarchical fuzzy rule based classification system
4. Experimental study
5. Concluding remarks
Experimental study: - Algorithms analyzed: Chi et al, Ishibuchi05, E-Algorithm, C4.5, HFRBCS.
- Data sets used: ZIP file
- Imbalanced: [5fcv] glass1, ecoli0vs1, wisconsin, pima, iris0, glass0, yeast1, vehicle1, vehicle2, vehicle3, haberman, glass0123vs456, vehicle0, ecoli1, new-thyroid2, new-thyroid1, ecoli2, segment0, glass6, yeast3, ecoli3, page-blocks0, yeast2vs4, yeast05679vs4, vowel0, glass016vs2, glass2, ecoli4, yeast1vs7, shuttle0vs4, glass4, page-blocks13vs2, abalone9vs18, glass016vs5, shuttle2vs4, yeast1458vs7, glass5, yeast2vs8, yeast4, yeast1289vs7, yeast5, ecoli0137vs26, yeast6, abalone19.
- Imbalanced (SMOTE): [5fcv] glass1, ecoli0vs1, wisconsin, pima, iris0, glass0, yeast1, vehicle1, vehicle2, vehicle3, haberman, glass0123vs456, vehicle0, ecoli1, new-thyroid2, new-thyroid1, ecoli2, segment0, glass6, yeast3, ecoli3, page-blocks0, yeast2vs4, yeast05679vs4, vowel0, glass016vs2, glass2, ecoli4, yeast1vs7, shuttle0vs4, glass4, page-blocks13vs2, abalone9vs18, glass016vs5, shuttle2vs4, yeast1458vs7, glass5, yeast2vs8, yeast4, yeast1289vs7, yeast5, ecoli0137vs26, yeast6, abalone19.
- Results obtained: ZIP file
|