KEEL-dataset - data set description



This section describes the main characteristics of the mammographic data set and its attributes:

General information

Mammographic Mass data set
Type:                               Classification
Origin:                             Real world
Features:                           5
(Real / Integer / Nominal):         (0 / 5 / 0)
Classes:                            2
Missing values?                     Yes
Total instances:                    961
Instances without missing values:   830
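
The counts above imply that 961 - 830 = 131 instances contain at least one missing value. The following is a minimal sketch of how complete and incomplete instances might be separated once the data is loaded. It assumes missing values are represented as None in memory; the token used in the files themselves is not stated on this page. The function name and the toy rows are hypothetical and are not taken from the data set.

```python
from typing import List, Optional, Sequence, Tuple

Row = Sequence[Optional[int]]


def split_by_completeness(rows: Sequence[Row]) -> Tuple[List[Row], List[Row]]:
    """Split rows into (complete, incomplete) lists.

    A row is complete when none of its values is None.
    Assumption: missing values were mapped to None while loading.
    """
    complete = [r for r in rows if all(v is not None for v in r)]
    incomplete = [r for r in rows if any(v is None for v in r)]
    return complete, incomplete


# Invented toy rows in attribute order:
# BI-RADS, Age, Shape, Margin, Density, Severity
toy_rows = [
    (4, 50, 2, 1, 3, 0),
    (5, 70, 4, None, 3, 1),  # Margin is missing
    (3, 35, 1, 1, 2, 0),
]

complete_rows, incomplete_rows = split_by_completeness(toy_rows)
print(len(complete_rows), len(incomplete_rows))

# Arithmetic check on the counts reported for this data set.
instances_with_missing = 961 - 830
```

Applied to the full data set, the complete list would be expected to hold 830 rows if the loader maps every missing entry to None.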

Attribute description

Attribute   Domain
BI-RADS     [0,6]
Age         [18,96]
Shape       [1,4]
Margin      [1,5]
Density     [1,4]
Severity    {0, 1}
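
The domains above can be used as a simple sanity check on loaded records. The sketch below is illustrative only: the DOMAINS mapping copies the table, while the function name, the dictionary-based record layout and the use of None for missing values are assumptions made for this example.

```python
from typing import Dict, List, Optional, Tuple

# Inclusive integer bounds copied from the attribute table.
# Severity is the set {0, 1}, which coincides with the integer range 0..1.
DOMAINS: Dict[str, Tuple[int, int]] = {
    "BI-RADS": (0, 6),
    "Age": (18, 96),
    "Shape": (1, 4),
    "Margin": (1, 5),
    "Density": (1, 4),
    "Severity": (0, 1),
}


def out_of_domain(record: Dict[str, Optional[int]]) -> List[str]:
    """Return the names of attributes whose value lies outside its domain.

    Missing values (None, or an absent key) are skipped rather than flagged.
    """
    bad = []
    for name, (low, high) in DOMAINS.items():
        value = record.get(name)
        if value is None:
            continue
        if not low <= value <= high:
            bad.append(name)
    return bad


# Hypothetical record: Age is below the documented minimum, Margin is missing.
example = {"BI-RADS": 4, "Age": 17, "Shape": 2,
           "Margin": None, "Density": 3, "Severity": 0}
violations = out_of_domain(example)
print(violations)
```

Skipping missing values keeps the domain check separate from the completeness check, so the two conditions can be reported independently.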

Additional information

This data set can be used to predict the severity (benign or malignant) of a mammographic mass lesion from BI-RADS attributes and the patient's age.

It contains a BI-RADS assessment, the patient's age and three BI-RADS attributes together with the ground truth (the severity field, which is the target attribute).

The data was collected at the Institute of Radiology of the University of Erlangen-Nuremberg between 2003 and 2006.




In this section you can download some files related to the mammographic data set:

  • The complete data set, already formatted in KEEL format, can be downloaded from here.

  • A copy of the data set already partitioned by means of a 10-fold cross-validation procedure can be downloaded from here.

  • A copy of the data set already partitioned by means of a 5-fold cross-validation procedure can be downloaded from here.

  • The header file associated with this data set can be downloaded from here.

  • This is not a native data set from the KEEL project. It was obtained from the UCI Machine Learning Repository. The original page where the data set can be found is: http://archive.ics.uci.edu/ml/datasets/Mammographic+Mass.
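
For readers who prefer to build their own partitions, the following is a minimal sketch of a k-fold split over instance indices. It is a plain shuffled, unstratified split and is not necessarily the procedure used to produce the partitioned files listed above, which this page does not describe. The function name and the seed parameter are hypothetical.

```python
import random
from typing import List


def k_fold_indices(n_instances: int, k: int, seed: int = 0) -> List[List[int]]:
    """Shuffle the indices 0..n_instances-1 and deal them into k folds.

    Fold sizes differ by at most one. No stratification by class is done.
    """
    indices = list(range(n_instances))
    random.Random(seed).shuffle(indices)
    # Slice with stride k so that every index lands in exactly one fold.
    return [indices[i::k] for i in range(k)]


folds = k_fold_indices(961, 10)

# Use fold 0 as the test set and the remaining folds as the training set.
test_idx = folds[0]
train_idx = [i for fold in folds[1:] for i in fold]
print(len(train_idx), len(test_idx))
```

With 961 instances and 10 folds, one fold holds 97 indices and the other nine hold 96 each. A stratified variant would apply the same dealing step separately to the indices of each Severity class.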


 
 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)