This section describes main characteristics of the thyroid data set and its attributes:
General information
Thyroid Disease (thyroid0387) Multi-class Imbalanced data set |
Type | Imbalanced | Origin | Real world |
Features | 21 | (Real / Integer / Nominal) | (6 / 0 / 15) |
Instances | 720 |
IR | 36.94 |
% Positive instances | 2.64 | % Negative instances | 97.36 | Missing values? | No |
Attribute description
Attribute | Domain | Attribute | Domain | Attribute | Domain |
Sintoma1 | [0.01, 0.97] | Sintoma8 | [0, 1] | Sintoma15 | [0, 1] |
Sintoma2 | [0, 1] | Sintoma9 | [0, 1] | Sintoma16 | [0, 1] |
Sintoma3 | [0, 1] | Sintoma10 | [0, 1] | Sintoma17 | [0.0, 0.53] |
Sintoma4 | [0, 1] | Sintoma11 | [0, 1] | Sintoma18 | [0.0005, 0.18] |
Sintoma5 | [0, 1] | Sintoma12 | [0, 1] | Sintoma19 | [0.0020, 0.6] |
Sintoma6 | [0, 1] | Sintoma13 | [0, 1] | Sintoma20 | [0.017, 0.233] |
Sintoma7 | [0, 1] | Sintoma14 | [0, 1] | Sintoma21 | [0.0020, 0.642] |
class | {1,2,3} |
Additional information
An imbalanced version of the Thyroid Disease (thyroid0387) data set, where there are some classes with a small number of examples while other classes have a large number of examples.
In this section you can download some files related to the thyroid data set:
- The complete data set already formatted in KEEL format can be downloaded from
here.
- A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from here.
- The header file associated to this data set can be downloaded from here.
|