main main
KEEL-dataset - data set description
dataset/images/phoneme.jpg



This section describes main characteristics of the phoneme data set and its attributes:

General information

Phoneme data set
TypeClassificationOriginReal world
Features 5(Real / Integer / Nominal)(5 / 0 / 0)
Instances5404 Classes2
Missing values?No

Attribute description

AttributeDomain
Aa[-1.7, 4.107]
Ao[-1.327, 4.378]
Dcl[-1.823, 3.199]
Iy[-1.581, 2.826]
Sh[-1.284, 2.719]
Class{0, 1}

Additional information

The aim of this dataset is to distinguish between nasal (class 0) and oral sounds (class 1). The class distribution is 3,818 samples in class 0 and 1,586 samples in class 1.

The phonemes are transcribed as follows: sh as in she, dcl as in dark, iy as the vowel in she, aa as the vowel in dark, and ao as the first vowel in water.




In this section you can download some files related to the phoneme data set:

  • The complete data set already formatted in KEEL format can be downloaded from herezip.gif.
  • A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from herezip.gif.
  • A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from herezip.gif.
  • The header file associated to this data set can be downloaded from heretxt.png.
  • This is not a native data set from the KEEL project. It has been obtained from the ELENA Project. The original page where the data set can be found is: https://www.elen.ucl.ac.be/neural-nets/Research/Projects/ELENA/databases/REAL/phoneme/.


 
 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)
About the Webmaster Team
Valid XHTML 1.1   Valid CSS!