KEEL-dataset - data set description



This section describes the main characteristics of the thyroid data set and its attributes:

General information

Thyroid Disease (thyroid0387) Multi-class Imbalanced data set
Type: Imbalanced
Origin: Real world
Features: 21 (Real / Integer / Nominal: 6 / 0 / 15)
Instances: 720
IR: 36.94
% Positive instances: 2.64
% Negative instances: 97.36
Missing values?: No

Attribute description

Attribute   Domain           Attribute   Domain    Attribute   Domain
Sintoma1    [0.01, 0.97]     Sintoma8    [0, 1]    Sintoma15   [0, 1]
Sintoma2    [0, 1]           Sintoma9    [0, 1]    Sintoma16   [0, 1]
Sintoma3    [0, 1]           Sintoma10   [0, 1]    Sintoma17   [0.0, 0.53]
Sintoma4    [0, 1]           Sintoma11   [0, 1]    Sintoma18   [0.0005, 0.18]
Sintoma5    [0, 1]           Sintoma12   [0, 1]    Sintoma19   [0.0020, 0.6]
Sintoma6    [0, 1]           Sintoma13   [0, 1]    Sintoma20   [0.017, 0.233]
Sintoma7    [0, 1]           Sintoma14   [0, 1]    Sintoma21   [0.0020, 0.642]
class       {1,2,3}

Additional information

An imbalanced version of the Thyroid Disease (thyroid0387) data set, in which some classes have only a small number of examples while others have a large number.
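
The degree of imbalance described above is commonly summarised as an imbalance ratio (IR). The sketch below assumes IR is the size of the largest class divided by the size of the smallest class; this page does not state the exact definition it uses. The function name `imbalance_ratio` and the toy labels are hypothetical and do not reproduce the real thyroid class distribution.

```python
from collections import Counter


def imbalance_ratio(labels):
    """Return (largest class count) / (smallest class count).

    Assumption: this is the IR definition; the source page does not
    spell out how its IR value was computed.
    """
    counts = Counter(labels)
    if len(counts) < 2:
        raise ValueError("need at least two classes to compute an imbalance ratio")
    return max(counts.values()) / min(counts.values())


# Hypothetical toy labels, NOT the real thyroid class counts.
toy_labels = [3] * 90 + [2] * 7 + [1] * 3
toy_ir = imbalance_ratio(toy_labels)
print(f"IR = {toy_ir:.2f}")
```

Applied to the class column of the downloaded data set, the same function would give the ratio under this assumed definition, which may or may not match the IR reported in the table.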




In this section you can download some files related to the thyroid data set:

  • The complete data set, already formatted in KEEL format, can be downloaded from here.
  • A copy of the data set, already partitioned by means of a 5-fold cross-validation procedure, can be downloaded from here.
  • The header file associated with this data set can be downloaded from here.
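
Once downloaded, a KEEL-format file can be read with a few lines of code. The sketch below is a minimal parser that assumes the usual KEEL layout: a header of `@relation`, `@attribute`, `@inputs` and `@outputs` lines, followed by `@data` and comma-separated rows. The function name `parse_keel` and the `SAMPLE` text are hypothetical; the sample declarations are illustrative and are not copied from the real thyroid header file.

```python
import re


def parse_keel(text):
    """Parse KEEL-style text into (attributes, inputs, outputs, rows).

    Values are kept as strings; converting them to numbers is left to the caller.
    """
    attributes = {}  # attribute name -> raw type/domain string from the header
    inputs, outputs, rows = [], [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if in_data:
            rows.append([value.strip() for value in line.split(",")])
            continue
        lower = line.lower()
        if lower.startswith("@attribute"):
            # Handles "@attribute name type [min, max]" and "@attribute name {a, b}".
            match = re.match(r"@attribute\s+([^\s{]+)\s*(.*)", line, re.IGNORECASE)
            if match:
                attributes[match.group(1)] = match.group(2).strip()
        elif lower.startswith("@inputs"):
            inputs = [n.strip() for n in line[len("@inputs"):].split(",") if n.strip()]
        elif lower.startswith("@outputs"):
            outputs = [n.strip() for n in line[len("@outputs"):].split(",") if n.strip()]
        elif lower.startswith("@data"):
            in_data = True
    return attributes, inputs, outputs, rows


# Hypothetical two-row sample, NOT taken from the real data set.
SAMPLE = """@relation toy
@attribute Sintoma1 real [0.01, 0.97]
@attribute Sintoma2 {0, 1}
@attribute class {1,2,3}
@inputs Sintoma1, Sintoma2
@outputs class
@data
0.73, 0, 3
0.24, 1, 1
"""

attributes, inputs, outputs, rows = parse_keel(SAMPLE)
print(inputs, outputs, len(rows))
```

To use it on a real file, pass the contents of the extracted `.dat` file to `parse_keel`; the file name depends on the archive and is not given on this page.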


 
 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)