main main
KEEL-dataset - data set description
dataset/images/breast.jpg



This section describes main characteristics of the breast data set and its attributes:

General information

Breast Cancer data set
TypeClassificationOriginReal world
Features 9(Real / Integer / Nominal)(0 / 0 / 9)
Classes 2 Missing values? Yes
Total instances 286 Instances without missing values 277

Attribute description

AttributeDomain
Age{10-19, 20-29, 30-39, 40-49, ... 90-99}
Menopause{lt40, ge40, premeno}
Tumor-size{0-4, 5-9, 10-14, 15-19, ... 55-59}
Inv-nodes{0-2, 3-5, 6-8, 9-11, ... , 36-39}
Node-caps{yes, no}
Deg-malig{1, 2, 3}
Breast{left, right}
Breast-quad{left_up, left_low, right_up, right_low, central}
Irradiated{yes, no}
Class{no-recurrence-events, recurrence-events}

Additional information

This is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature.

This data set includes 201 instances of one class and 85 instances of another class. The instances are described by 9 attributes, some of which are linear and some are nominal.




In this section you can download some files related to the breast data set:

  • The complete data set already formatted in KEEL formatcan be downloaded from herezip.gif.

  • A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from herezip.gif.

  • A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from herezip.gif.

  • The header file associated to this data set can be downloaded from heretxt.png.
  • This is not a native data set from the KEEL project. It has been obtained from the UCI Machine Learning Repository. The original page where the data set can be found is: http://archive.ics.uci.edu/ml/datasets/Breast+Cancer.


 
 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)
About the Webmaster Team
Valid XHTML 1.1   Valid CSS!