This section describes main characteristics of the pima data set and its attributes:
General information
Pima (Imbalanced) data set |
Type | Imbalanced | Origin | Real world |
Features | 8 | (Real / Integer / Nominal) | (8 / 0 / 0) |
Instances | 768 |
IR | 1.87 |
% Positive instances | 34.84 | % Negative instances | 65.16 | Missing values? | No |
Attribute description
Attribute | Domain |
Preg | [0.0, 17.0] |
Plas | [0.0, 199.0] |
Pres | [0.0, 122.0] |
Skin | [0.0, 99.0] |
Insu | [0.0, 846.0] |
Mass | [0.0, 67.1] |
Pedi | [0.078, 2.42] |
Age | [21.0, 81.0] |
Class | {negative,positive} |
Additional information
A imbalanced version of the Pima data set, where the classes has been renamed to positive and negative.
In this section you can download some files related to the pima data set:
- The complete data set already formatted in KEEL format can be downloaded from
here.
- A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from here.
- The header file associated to this data set can be downloaded from here.
|