main main
KEEL-dataset - data set description
dataset/images/yeast.jpg



This section describes main characteristics of the yeast data set and its attributes:

General information

Yeast data set
TypeClassificationOriginReal world
Features 8(Real / Integer / Nominal)(8 / 0 / 0)
Instances1484 Classes10
Missing values?No

Attribute description

AttributeDomain
Mcg[0.11, 1.0]
Gvh[0.13, 1.0]
Alm[0.21, 1.0]
Mit[0.0, 1.0]
Erl[0.5, 1.0]
Pox[0.0, 0.83]
Vac[0.0, 0.73]
Nuc[0.0, 1.0]
Class{MIT, NUC, CYT, ME1, ME2, ME3, EXC, VAC, POX, ERL}

Additional information

This database contains information about a set of Yeast cells. The task is to determine the localization site of each cell among 10 possible alternatives.

Attributes description:
- Mcg: McGeoch's method for signal sequence recognition.
- Gvh: von Heijne's method for signal sequence recognition.
- Alm: Score of the ALOM membrane spanning region prediction program.
- Mit: Score of discriminant analysis of the amino acid content of the N-terminal region (20 residues long) of mitochondrial and non-mitochondrial proteins.
- Erl: Presence of "HDEL" substring (thought to act as a signal for retention in the endoplasmic reticulum lumen). Binary attribute.
- Pox: Peroxisomal targeting signal in the C-terminus.
- Vac: Score of discriminant analysis of the amino acid content of vacuolar and extracellular proteins.
- Nuc: Score of discriminant analysis of nuclear localization signals of nuclear and non-nuclear proteins.




In this section you can download some files related to the yeast data set:

  • The complete data set already formatted in KEEL format can be downloaded from herezip.gif.
  • A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from herezip.gif.
  • A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from herezip.gif.
  • The header file associated to this data set can be downloaded from heretxt.png.
  • This is not a native data set from the KEEL project. It has been obtained from the UCI Machine Learning Repository. The original page where the data set can be found is: http://archive.ics.uci.edu/ml/datasets/Yeast.


 
 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)
About the Webmaster Team
Valid XHTML 1.1   Valid CSS!