This section describes main characteristics of the yeast data set and its attributes:
General information
Yeast data set |
Type | Classification | Origin | Real world |
Features | 8 | (Real / Integer / Nominal) | (8 / 0 / 0) |
Instances | 1484 |
Classes | 10 |
Missing values? | No |
Attribute description
Attribute | Domain |
Mcg | [0.11, 1.0] |
Gvh | [0.13, 1.0] |
Alm | [0.21, 1.0] |
Mit | [0.0, 1.0] |
Erl | [0.5, 1.0] |
Pox | [0.0, 0.83] |
Vac | [0.0, 0.73] |
Nuc | [0.0, 1.0] |
Class | {MIT, NUC, CYT, ME1, ME2, ME3, EXC, VAC, POX, ERL} |
Additional information
This database contains information about a set of Yeast cells. The task is to determine the localization site of each cell among 10 possible alternatives.
Attributes description:
- Mcg: McGeoch's method for signal sequence recognition.
- Gvh: von Heijne's method for signal sequence recognition.
- Alm: Score of the ALOM membrane spanning region prediction program.
- Mit: Score of discriminant analysis of the amino acid content of the N-terminal region (20 residues long) of mitochondrial and non-mitochondrial proteins.
- Erl: Presence of "HDEL" substring (thought to act as a signal for retention in the endoplasmic reticulum lumen). Binary attribute.
- Pox: Peroxisomal targeting signal in the C-terminus.
- Vac: Score of discriminant analysis of the amino acid content of vacuolar and extracellular proteins.
- Nuc: Score of discriminant analysis of nuclear localization signals of nuclear and non-nuclear proteins.
In this section you can download some files related to the yeast data set:
- The complete data set already formatted in KEEL format can be downloaded from
here.
- A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from here.
- A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from here.
- The header file associated to this data set can be downloaded from here.
- This is not a native data set from the KEEL project. It has been obtained from the UCI Machine Learning Repository. The original page where the data set can be found is: http://archive.ics.uci.edu/ml/datasets/Yeast.
|