main main
KEEL-dataset - data set description

This section describes main characteristics of the mutagenesis-atoms data set and its attributes:

General information

Mutagenesis-Atoms data set
TypeMulti instanceOriginReal world
Features 11(Real / Integer / Nominal)(10 / 0 / 1)
Instances1618 Classes2
Missing values?No

Attribute description

Atoms-bag-id{1646, ... , 1439}Type=f[0.0, 1.0]
Charge[-0.781, 1.002]Type=h[0.0, 1.0]
Quantatype[1.0, 232.0]Type=i[0.0, 1.0]
Type=br[0.0, 1.0]Type=n[0.0, 1.0]
Type=c[0.0, 1.0]Type=o[0.0, 1.0]
Type=cl[0.0, 1.0]Class{0, 1}

Additional information

The problem consists of predicting the mutagenicity of the molecules, that is, determining whether a molecule is mutagenic or non-mutagenic. The dataset for mutagenesis consists of 188 molecules, of which 125 are mutagenic (active) and 63 are non-mutagenic (inactive). From a MIL perspective different transformations are considered, concretely, mutagenesis-atoms represents as a bag all atoms.

In this section you can download some files related to the mutagenesis-atoms data set:

  • The complete data set already formatted in KEEL format can be downloaded from herezip.gif.
  • A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from herezip.gif.
  • The header file associated to this data set can be downloaded from heretxt.png.

 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)
About the Webmaster Team
Valid XHTML 1.1   Valid CSS!