main main
Description

KEEL (Knowledge Extraction based on Evolutionary Learning) is an open source (GPLv3) Java software tool which empowers the user to assess the behavior of evolutionary learning and Soft Computing based techniques for different kinds of DM problems: regression, classification, clustering, pattern mining and so on. KEEL is being developed under the Spanish National Projects TIC2002-04036-C05, TIN2005-08386-C05 and TIN2008-06681-C06 with the collaboration of the six following Spanish Research Groups:

sci2s
SCI2S
(Spanish National Proyects TIC2002-04036-C05-01, TIN2005-08386-C05-01 and TIN2008-06681-C06-01)
ayrna
Ayrna
(Spanish National Proyects TIC2002-04036-C05-02, TIN2005-08386-C05-02 and TIN2008-06681-C06-03)
grsi
GRSI
(Spanish National Proyects TIC2002-04036-C05-03, TIN2005-08386-C05-04 and TIN2008-06681-C06-05)
simidat
Intelligent Systems and Data Mining
(Spanish National Proyects TIC2002-04036-C05-04, TIN2005-08386-C05-03 and TIN2008-06681-C06-02)
mm
Metrology and Models
(Spanish National Proyects TIC2002-04036-C05-05, TIN2005-08386-C05-05 and TIN2008-06681-C06-04)
simd
Intelligent Systems and Data Mining
(Spanish National Proyect TIN2008-06681-C06-06)

If you want to refer to KEEL in a publication, please cite us using the following references:

KEEL description papers:
  • J. Alcalá-Fdez, L. Sánchez, S. García, M.J. del Jesus, S. Ventura, J.M. Garrell, J. Otero, C. Romero, J. Bacardit, V.M. Rivas, J.C. Fernández, F. Herrera. KEEL: A Software Tool to Assess Evolutionary Algorithms to Data Mining Problems. Soft Computing 13:3 (2009) 307-318, doi: 10.1007/s00500-008-0323-y.   Pdf
  • J. Alcalá-Fdez, A. Fernandez, J. Luengo, J. Derrac, S. García, L. Sánchez, F. Herrera. KEEL Data-Mining Software Tool: Data Set Repository, Integration of Algorithms and Experimental Analysis Framework. Journal of Multiple-Valued Logic and Soft Computing 17:2-3 (2011) 255-287.   Pdf



Main KEEL description

KEEL is a software tool to assess EAs for DM problems including regression, classification, clustering, pattern mining and so on. The version of KEEL presently available consists of the following function blocks:

  • Data Management

    This part is composed by a set of tools that can be used to build new data, export and import data in other formats to KEEL format, data edition and visualization, apply transformations and partitioning to data, etc...


    KEEL Data Management

  • Design of Experiments

    The aim of this part is the design of the desired experimentation over the selected data sets. It provides options for many choices: type of validation, type of learning (classification, regression, unsupervised learning, subgroup discovery), etc...


    KEEL Research Module

  • Design of Imbalanced Experiments

    The aim of this part is the design of the desired experimentation over the selected imbalanced data sets. These experiments are created for 5cfv datasets and include specific algorithms for imbalanced data and general classification algorithms.


    KEEL Imbalanced Module

  • Statistical Tests

    KEEL is one of the fewest Data Mining software tools that provides to the researcher a complete set of statistical procedures for pairwise and multiple comparisons. Inside the KEEL environment, several parametric and nonparametric procedures have been coded, which should help to contrast the results obtained in any experiment performed with the software tool.


    KEEL Statistical Module

  • Educational Experiments

    With a similar structure to the Design of Experimets part, allows us to design an experiment which can be step-by-step debugged in order to use this as a guideline to show the learning process of a certain model by using the platform with educational objectives.

    KEEL Educational Module

Taking into account each one of the function blocks, KEEL can be useful by different types of user, which expect to find determined features in a Data Mining (DM) software.

In the following, we describe the user profiles who it is designed for, its main features and the different ways of working integrated in the software tool.


Main User Profiles

KEEL is an integration of an environment with a defined architecture and a development of knowledge extraction as expandable modules. It is mainly intended for two categories of users: researchers and students. Either group has a different set of needs:

  • KEEL as a research tool

    The most common use of this tool for a researcher will be the automated execution of experiments, and the statistical analysis of their results. Routinely, an experimental design includes a mix of evolutionary algorithms, statistical and AI-related techniques. Special care was taken to make possible that a researcher can use KEEL to assess the relevance of his own procedures. Since the actual standards in machine learning require heavy computational work, the research tool is not designed to offer a real-time view of the progress of the algorithms, it is designed to rather generate a script and be batch-executed in a cluster of computers. The tool allows the researcher to apply the same sequence of pre-processing, experiments and analysis to large batteries of problems and focus his attention in the summary of the results.

  • KEEL as an educational tool

    The needs of a student are quite different to those of a researcher. Generally speaking, the objective is no longer that of making statistically sound comparisons between algorithms. There is no need of repeating each experiment a large number of times. If the tool is to be used in class, the execution time must be short and a real-time view of the evolution of the algorithms is needed, since the student will use this information to learn how to adjust the parameters of the algorithms. In this sense, the educational tool is a simplified version of the research tool, where only the most relevant algorithms are available. The execution is made in real time. The user has a visual feedback of the progress of the algorithms, and can access the final results from the same interface used to design the experimentation.

Both types of user require an availability of a set of features in order to be interested in using KEEL. Then, this is when we describe the main features of the KEEL software tool.


Main Main Features

KEEL is a software tool developed to ensemble and use different DM models. We would like to remark that this is the first software toolkit of this type containing a library of evolutionary learning algorithms with open source code in Java. The main features of KEEL are:

  • Evolutionary Algorithms (EAs) are presented in predicting models, pre-processing (evolutionary feature and training set selection) and post-processing (evolutionary tuning of fuzzy rules).

  • It includes data pre-processing algorithms proposed in specialized literature: data transformation, discretization, training set selection, feature selection, imputation methods for missing values and noisy data filtering methods.

  • It has a statistical library to analyze algorithms' results. It comprises a set of statistical tests for analyzing the normality and heteroscedasticity of the results and performing parametric and non-parametric comparisons among the algorithms.

  • Some algorithms have been developed by using a Java Class Library for Evolutionary Computation (JCLEC)

  • It provides an user-friendly interface, oriented to the analysis of algorithms.

  • The software is aimed to create experimentations containing multiple data sets and algorithms connected among themselves to obtain a result expected. Experiments are independently script-generated from the user interface for an off-line run in the same or other machines.

  • KEEL also allows to create experiments in on-line mode, aiming an educational support in order to learn the operation of the algorithms included.

  • It contains a Knowledge Extraction Algorithms Library, remarking the incorporation of multiple evolutionary learning algorithms, together with classical learning approaches. The main employment lines are:

    • Different evolutionary rule learning models have been implemented

    • Fuzzy rule learning models with a good trade-off between accuracy and interpretability.

    • Evolution and pruning in neural networks, product unit neural networks, and radial base models.

    • Genetic Programming: Evolutionary algorithms that use tree representations for extracting knowledge.

    • Algorithms for extracting descriptive rules based on patterns subgroup discovery have been integrated.

    • Data reduction (training set selection, feature selection and discretization). EAs for data reduction have been included.

Top of the page  University of Granada (Subproject TIN2008-06681-C06-01)

   The head of group is Ph.D. Francisco Herrera Triguero of Department of Computer Science and Artificial Intelligence of the University of Granada.

   This group is the coordinating node in the development of KEEL.



Name E-mail Institution
Ph.D. Herrera Triguero, Francisco (IP) member_email University of Granada
Ph.D. Alcalá Fernández, Jesus member_email University of Granada
Ph.D. Alcalá Fernández, Rafael member_email University of Granada
Ph.D. Bull, Larry member_email University of the west of England, Bristol
Ph.D. Casillas Barranquero, Jorge member_email University of Granada
Mr Derrac Rus, Joaquin member_email University of Granada
Ph.D. Fernández Hilario, Alberto member_email University of Jaén
Ph.D. García López, Salvador member_email University of Jaén
Ms. López Morales, Victoria member_email University of Granada
Mr. Luengo Martín, Julián member_email University of Granada
Ph.D. Martínez López, Francisco José member_email University of Granada
Ph.D. Pérez Márquez, Elena member_email University of Valladolid
Mr. Sáez Muñoz, José Antonio member_email University of Granada
Mr. Triguero Velázquez, Isaac member_email University of Granada
Ph.D. Villar Castro, Pedro member_email University of Granada



Top of the page  University of Jaen (Subproject TIN2008-06681-C06-02)

   The head of group is Ph.D. María José Del Jesus of Department of Computer Science of the University of Jaen.



Name E-mail Institution
Ph.D. Del Jesus, María José (IP) member_email University of Jaén
Mr. Aguilera García, José Joaquín member_email University of Jaén
Mr. Berlanga Rivera, Francisco José member_email University of Jaén
Ph.D. Cano de Amo, Jose Ramon member_email University of Jaén
Ph.D. González García, Pedro member_email University of Jaén
Ph.D. Pérez Godoy, Mº Dolores member_email University of Jaén
Ph.D. Rivas Santos, Víctor Manuel member_email University of Jaén
Ph.D. Rivera Rivas, Antonio Jesús member_email University of Jaen



Top of the page  University of Cordoba (Subproject TIN2008-06681-C06-03)

   The head of group is Ph.D. Sebastián Ventura Soto of Department of Computer Science and Numerical Analysis of the University of Cordoba.



Name E-mail Institution
Ph.D. Ventura Soto, Sebastián (IP) member_email University of Córdoba
Mr. Fernández Caballero, Juan Carlos member_email University of Córdoba
Mr. González Espejo, Pedro member_email University of Córdoba
Mr. Gutiérrez Peña, Pedro Antonio member_email University of Córdoba
Ph.D. Hervás Martínez, César member_email University of Córdoba
Ph.D. Martínez Estudillo, Francisco member_email INSA-ETEA
Ph.D. Martínez Estudillo, Alfonso Carlos member_email INSA-ETEA
Ph.D. Pechenizkiy, Mykola member_email Eindhoven University of Technology
Ph.D. Romero Morales, Cristóbal member_email University of Córdoba
Ph.D. Romero Salguero, José Raúl member_email University of Cordoba
Ph.D. Zafra Gómez, Amelia member_email University of Córdoba



Top of the page  University of Oviedo (Subproject TIN2008-06681-C06-04)

   The head of group is Ph.D. Luciano Sánchez Ramos of Department of Computer Science of the University of Oviedo.



Name E-mail Institution
Ph.D. Sánchez Ramos, Luciano (IP) member_email University of Oviedo
Ph.D. de la Cal, Enrique Antonio member_email University of Oviedo
Mr. Junco Navascues, Luis member_email University of Oviedo
Ph.D. Otero Rodríguez, José member_email University of Oviedo
Mr. Otero Rodríguez, Adolfo member_email University of Oviedo
Mrs. Suárez Fernández, M. de Rosario member_email University of Oviedo
Ph.D. Villar Flecha, Jose Ramon member_email University of Oviedo



Top of the page  University Ramon Llull (Subproject TIN2008-06681-C06-05)

   The head of group is Ph.D. Ester Bernadó Mansilla of Department of Computer Science of the University Ramon Llull.



Name E-mail Institution
Ph.D. Bernadó Mansilla, Ester (IP) member_email University Ramon Llull
Ph.D. Bacardit Peñarroya, Jaume member_email University Ramon Llull
Mr. Camps Dausà, Joan member_email University Ramon Llull
Mr. Farguell Matesanz, Enric member_email University Ramon Llull
Ph.D. Garrel Guiu, Josep M. member_email University Ramon Llull
Ph.D. Ho, Tin Kam member_email Bell Laboratories
Mrs. Macià Antolínez , Núria member_email University Ramon Llull
Ph.D. Orriols Puig, Albert member_email University Ramon Llull
Mr. Rios Boutin, Joaquim member_email University Pompeu Fabra
Mr. Teixidó Navarro, Francesc member_email University Ramon Llull



Top of the page  University of Huelva (Subproject TIN2008-06681-C06-06)

   The head of group is Ph.D. Antonio Peregrin Rubio of Department of Computer Science of the University of Huelva.



Name E-mail Institution
Ph.D. Peregrin Rubio, Antonio (IP) member_email University of Huelva
Mr. López Gómez, Ignacio member_email University of Huelva
Ph.D. Márquez Hernández, Francisco Alfredo member_email University of Huelva
Ph.D. Moreno Velo, Francisco José member_email University of Huelva
Mr. Rodríguez Román, Miguel Ángel member_email University of Huelva





 
 Copyright 2004-2011, KEEL (Knowledge Extraction based on Evolutionary Learning)
About the Webmaster Team
Valid XHTML 1.1   Valid CSS!