|
KEEL (Knowledge Extraction based on Evolutionary Learning) is an open source (GPLv3) Java software tool which empowers the user to assess
the behavior of evolutionary learning and Soft Computing based techniques for different kinds of DM problems: regression, classification,
clustering, pattern mining and so on.
KEEL is being developed under the Spanish National Projects
TIC2002-04036-C05, TIN2005-08386-C05 and TIN2008-06681-C06 with the collaboration of the six following Spanish Research Groups:
If you want to refer to KEEL in a publication, please cite us using the following references:
KEEL description papers:
- J. Alcalá-Fdez, L. Sánchez, S. García, M.J. del Jesus, S. Ventura, J.M. Garrell, J. Otero, C. Romero, J. Bacardit, V.M. Rivas, J.C. Fernández, F. Herrera.
KEEL: A Software Tool to Assess Evolutionary Algorithms to Data Mining Problems.
Soft Computing 13:3 (2009) 307-318, doi: 10.1007/s00500-008-0323-y.

- J. Alcalá-Fdez, A. Fernandez, J. Luengo, J. Derrac, S. García, L. Sánchez, F. Herrera.
KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework.
Journal of Multiple-Valued Logic and Soft Computing 17:2-3 (2011) 255-287.

|
KEEL description
KEEL is a software tool to assess EAs for DM problems including regression,
classification, clustering, pattern mining and so on. The version of KEEL
presently available consists of the following function blocks:
- Data Management
This part is composed by a set of
tools that can be used to build new data, export and import data
in other formats to KEEL format, data edition and visualization, apply
transformations and partitioning to data, etc...
- Design of Experiments
The aim of this part is the
design of the desired experimentation over the selected data
sets. It provides options for many choices: type of validation,
type of learning (classification, regression, unsupervised
learning, subgroup discovery), etc...
- Design of Imbalanced Experiments
The aim of this part is the
design of the desired experimentation over the selected imbalanced data
sets. These experiments are created for 5cfv datasets and include specific
algorithms for imbalanced data and general classification algorithms.
- Statistical Tests
KEEL is one of the fewest Data Mining software tools that provides to the
researcher a complete set of statistical procedures for pairwise and multiple
comparisons. Inside the KEEL environment, several parametric and nonparametric
procedures have been coded, which should help to contrast the
results obtained in any experiment performed with the software tool.
- Educational Experiments
With a similar structure to
the Design of Experimets part, allows us to design an experiment which can
be step-by-step debugged in order to use this as a guideline to
show the learning process of a certain model by using the
platform with educational objectives.
Taking into account each one of the function blocks, KEEL can be
useful by different types of user, which expect to find determined
features in a Data Mining (DM) software.
In the following, we describe the user profiles who it is designed
for, its main features and the different ways of working integrated in
the software tool.
User Profiles
KEEL is an integration of an environment with a
defined architecture and a development of knowledge extraction as
expandable modules. It is mainly intended for two categories of
users: researchers and students. Either group has a different set
of needs:
- KEEL as a research tool
The most common use of this tool for a researcher will be the
automated execution of experiments, and the statistical analysis of
their results. Routinely, an experimental design includes a mix of
evolutionary algorithms, statistical and AI-related techniques.
Special care was taken to make possible that a researcher can use
KEEL to assess the relevance of his own procedures. Since the actual
standards in machine learning require heavy computational work, the
research tool is not designed to offer a real-time view of the
progress of the algorithms, it is designed to rather generate a script and be
batch-executed in a cluster of computers. The tool allows the
researcher to apply the same sequence of pre-processing, experiments
and analysis to large batteries of problems and focus his attention
in the summary of the results.
- KEEL as an educational tool
The needs of a student are quite different
to those of a researcher. Generally speaking,
the objective is no longer that of making statistically sound
comparisons between algorithms.
There is no need of repeating each experiment a large number of times.
If the tool is to be used in class, the execution time
must be short and a real-time view of the evolution
of the algorithms is needed, since the student will
use this information to learn how to adjust the parameters
of the algorithms. In this sense, the educational tool
is a simplified version of the research tool, where
only the most relevant algorithms are available.
The execution is made in real time. The user has
a visual feedback of the progress of the algorithms,
and can access the final results from the same interface
used to design the experimentation.
Both types of user require an availability of a set of features in
order to be interested in using KEEL. Then, this is when we
describe the main features of the KEEL software tool.
Main Features
KEEL is a software tool developed to ensemble and use different
DM models. We would like to remark that this is the first software
toolkit of this type containing a library of evolutionary learning
algorithms with open source code in Java. The main features of KEEL
are:
- Evolutionary Algorithms (EAs) are presented in predicting models,
pre-processing (evolutionary
feature and training set selection) and post-processing
(evolutionary tuning of fuzzy rules).
- It includes data pre-processing algorithms proposed in
specialized literature: data transformation, discretization, training set selection,
feature selection, imputation methods for missing values and noisy data filtering methods.
- It has a statistical library to analyze
algorithms' results. It comprises a set of statistical tests for
analyzing the normality and heteroscedasticity of the results
and performing parametric and non-parametric comparisons among
the algorithms.
- Some algorithms have been developed by using a Java Class Library for Evolutionary
Computation (JCLEC)
- It provides an user-friendly interface, oriented to
the analysis of algorithms.
- The software is aimed to create experimentations
containing multiple data sets and algorithms connected among
themselves to obtain a result expected. Experiments are
independently script-generated from the user interface for an off-line
run in the same or other machines.
- KEEL also allows to create experiments in on-line mode, aiming an educational support in order to learn the operation of the
algorithms included.
- It contains a Knowledge Extraction Algorithms
Library, remarking the incorporation of multiple evolutionary
learning algorithms, together with classical learning approaches. The main employment lines are:
- Different evolutionary rule learning models have been
implemented
- Fuzzy rule learning models with a good trade-off between
accuracy and interpretability.
- Evolution and pruning
in neural networks, product unit neural networks, and radial
base models.
- Genetic Programming: Evolutionary algorithms that
use tree representations for extracting knowledge.
- Algorithms for extracting
descriptive rules based on patterns subgroup discovery have been
integrated.
- Data reduction (training set selection, feature selection and discretization).
EAs for data reduction have been included.
|
|
University of Granada (Subproject TIN2008-06681-C06-01)
The head of group is Ph.D. Francisco Herrera Triguero of Department of Computer Science and Artificial Intelligence of the University of Granada.
This group is the coordinating node in the development of KEEL.
| Name |
E-mail |
Institution |
| Ph.D. Herrera Triguero, Francisco (IP) |
 |
University of Granada |
| Ph.D. Alcalá Fernández, Jesus |
 |
University of Granada |
| Ph.D. Alcalá Fernández, Rafael |
 |
University of Granada |
| Ph.D. Bull, Larry |
 |
University of the west of England, Bristol |
| Ph.D. Casillas Barranquero, Jorge |
 |
University of Granada |
| Mr Derrac Rus, Joaquin |
 |
University of Granada |
| Ph.D. Fernández Hilario, Alberto |
 |
University of Jaén |
| Ph.D. García López, Salvador |
 |
University of Jaén |
| Ms. López Morales, Victoria |
 |
University of Granada |
| Mr. Luengo Martín, Julián |
 |
University of Granada |
| Ph.D. Martínez López, Francisco José |
 |
University of Granada |
| Ph.D. Pérez Márquez, Elena |
 |
University of Valladolid |
| Mr. Sáez Muñoz, José Antonio |
 |
University of Granada |
| Mr. Triguero Velázquez, Isaac |
 |
University of Granada |
| Ph.D. Villar Castro, Pedro |
 |
University of Granada |
University of Jaen (Subproject TIN2008-06681-C06-02)
The head of group is Ph.D. María José Del Jesus of Department of Computer Science of the University of Jaen.
| Name |
E-mail |
Institution |
| Ph.D. Del Jesus, María José (IP) |
 |
University of Jaén |
| Mr. Aguilera García, José Joaquín |
 |
University of Jaén |
| Mr. Berlanga Rivera, Francisco José |
 |
University of Jaén |
| Ph.D. Cano de Amo, Jose Ramon |
 |
University of Jaén |
| Ph.D. González García, Pedro |
 |
University of Jaén |
| Ph.D. Pérez Godoy, Mº Dolores |
 |
University of Jaén |
| Ph.D. Rivas Santos, Víctor Manuel |
 |
University of Jaén |
| Ph.D. Rivera Rivas, Antonio Jesús |
 |
University of Jaen |
University of Cordoba (Subproject TIN2008-06681-C06-03)
The head of group is Ph.D. Sebastián Ventura Soto of Department of Computer Science and Numerical Analysis of the University of Cordoba.
| Name |
E-mail |
Institution |
| Ph.D. Ventura Soto, Sebastián (IP) |
 |
University of Córdoba |
| Mr. Fernández Caballero, Juan Carlos |
 |
University of Córdoba |
| Mr. González Espejo, Pedro |
 |
University of Córdoba |
| Mr. Gutiérrez Peña, Pedro Antonio |
 |
University of Córdoba |
| Ph.D. Hervás Martínez, César |
 |
University of Córdoba |
| Ph.D. Martínez Estudillo, Francisco |
 |
INSA-ETEA |
| Ph.D. Martínez Estudillo, Alfonso Carlos |
 |
INSA-ETEA |
| Ph.D. Pechenizkiy, Mykola |
 |
Eindhoven University of Technology |
| Ph.D. Romero Morales, Cristóbal |
 |
University of Córdoba |
| Ph.D. Romero Salguero, José Raúl |
 |
University of Cordoba |
| Ph.D. Zafra Gómez, Amelia |
 |
University of Córdoba |
University of Oviedo (Subproject TIN2008-06681-C06-04)
The head of group is Ph.D. Luciano Sánchez Ramos of Department of Computer Science of the University of Oviedo.
| Name |
E-mail |
Institution |
| Ph.D. Sánchez Ramos, Luciano (IP) |
 |
University of Oviedo |
| Ph.D. de la Cal, Enrique Antonio |
 |
University of Oviedo |
| Mr. Junco Navascues, Luis |
 |
University of Oviedo |
| Ph.D. Otero Rodríguez, José |
 |
University of Oviedo |
| Mr. Otero Rodríguez, Adolfo |
 |
University of Oviedo |
| Mrs. Suárez Fernández, M. de Rosario |
 |
University of Oviedo |
| Ph.D. Villar Flecha, Jose Ramon |
 |
University of Oviedo |
University Ramon Llull (Subproject TIN2008-06681-C06-05)
The head of group is Ph.D. Ester Bernadó Mansilla of Department of Computer Science of the University Ramon Llull.
| Name |
E-mail |
Institution |
| Ph.D. Bernadó Mansilla, Ester (IP) |
 |
University Ramon Llull |
| Ph.D. Bacardit Peñarroya, Jaume |
 |
University Ramon Llull |
| Mr. Camps Dausà, Joan |
 |
University Ramon Llull |
| Mr. Farguell Matesanz, Enric |
 |
University Ramon Llull |
| Ph.D. Garrel Guiu, Josep M. |
 |
University Ramon Llull |
| Ph.D. Ho, Tin Kam |
 |
Bell Laboratories |
| Mrs. Macià Antolínez , Núria |
 |
University Ramon Llull |
| Ph.D. Orriols Puig, Albert |
 |
University Ramon Llull |
| Mr. Rios Boutin, Joaquim |
 |
University Pompeu Fabra |
| Mr. Teixidó Navarro, Francesc |
 |
University Ramon Llull |
University of Huelva (Subproject TIN2008-06681-C06-06)
The head of group is Ph.D. Antonio Peregrin Rubio of Department of Computer Science of the University of Huelva.
| Name |
E-mail |
Institution |
| Ph.D. Peregrin Rubio, Antonio (IP) |
 |
University of Huelva |
| Mr. López Gómez, Ignacio |
 |
University of Huelva |
| Ph.D. Márquez Hernández, Francisco Alfredo |
 |
University of Huelva |
| Ph.D. Moreno Velo, Francisco José |
 |
University of Huelva |
| Mr. Rodríguez Román, Miguel Ángel |
 |
University of Huelva |
|
|