Description
The data set contains classes of 150 instances each one, where each class talks about a type of plant rainbow. A class is linearly separable of the other two and the two last ones are not linearly separable one of another one.
Data Set
You can download the original UCI data file http://www.keel.es/dataset/uci/iris.data and the UCI description file of the dataset http://www.keel.es/dataset/uci/iris.names or you can visit the UCI web site of the dataset http://archive.ics.uci.edu/ml/datasets/Iris. In Keel, the original data set is partitioned using 10-fold cross-validation procedure. The initial data set T, is randomly divided into 10 disjoint sets of equal size T1,...,TN. We maintain the original class distribution (before partitioning) within each set when carrying out the partition process. We then conduct 10 pairs of training and test sets.
Moreover, the original data set of 150 examples has been randomly divided into 2 different subsets (each one with 75 examples) for 5 times. We have builded 5 different partitions of the original data set at 50%-50% to apply the 5x2cv paired t-test , namely we perform 5 replications of 2-fold cross-validation.
You can also download the whole data set in Keel format here.
|