KEEL-dataset - data set description

This section describes the main characteristics of the mutagenesis-atoms data set and its attributes:

General information

Mutagenesis-Atoms data set
Type: Multi instance
Origin: Real world
Features: 11
(Real / Integer / Nominal): (10 / 0 / 1)
Instances: 1618
Classes: 2
Missing values? No

Attribute description

Atoms-bag-id    {1646, ... , 1439}
Charge          [-0.781, 1.002]
Quantatype      [1.0, 232.0]
Type=br         [0.0, 1.0]
Type=c          [0.0, 1.0]
Type=cl         [0.0, 1.0]
Type=f          [0.0, 1.0]
Type=h          [0.0, 1.0]
Type=i          [0.0, 1.0]
Type=n          [0.0, 1.0]
Type=o          [0.0, 1.0]
Class           {0, 1}

Additional information

The problem consists of predicting the mutagenicity of molecules, that is, determining whether a molecule is mutagenic or non-mutagenic. The mutagenesis data set consists of 188 molecules, of which 125 are mutagenic (active) and 63 are non-mutagenic (inactive). From a multiple-instance learning (MIL) perspective, several transformations of the data are considered; in mutagenesis-atoms, each bag represents a molecule and contains all of its atoms as instances.
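
The bag structure can be illustrated with a minimal Python sketch. It groups atom rows into bags keyed by the bag identifier and turns each atom into a 10-value feature vector (Charge, Quantatype and eight Type=... indicators). The sample rows, the bag ids "m1" and "m2", and the helper names one_hot and build_bags are hypothetical and are not taken from the data set; the sketch also assumes that the Type=... attributes are 0/1 indicator columns and that every atom in a bag carries the same class label.

```python
from collections import defaultdict

# Atom types listed in the attribute description (Type=br ... Type=o).
ATOM_TYPES = ["br", "c", "cl", "f", "h", "i", "n", "o"]

# Hypothetical rows: (bag_id, charge, quantatype, atom_type, class_label).
# The values are made up for illustration only.
rows = [
    ("m1", -0.117, 22.0, "c", 1),
    ("m1", 0.142, 3.0, "h", 1),
    ("m1", -0.388, 40.0, "o", 1),
    ("m2", -0.117, 22.0, "c", 0),
    ("m2", 0.812, 38.0, "n", 0),
]


def one_hot(atom_type):
    """Encode an atom type as eight 0/1 indicators (assumed encoding)."""
    return [1.0 if atom_type == t else 0.0 for t in ATOM_TYPES]


def build_bags(rows):
    """Group atom rows into bags and collect one class label per bag."""
    bags = defaultdict(list)
    labels = {}
    for bag_id, charge, quantatype, atom_type, label in rows:
        bags[bag_id].append([charge, quantatype] + one_hot(atom_type))
        # Assumption: all instances of a bag share the bag's label.
        if labels.setdefault(bag_id, label) != label:
            raise ValueError(f"conflicting labels in bag {bag_id}")
    return dict(bags), labels


bags, labels = build_bags(rows)
for bag_id, instances in bags.items():
    print(bag_id, "atoms:", len(instances), "class:", labels[bag_id])
```

A MIL learner would then receive one label per bag (molecule) rather than one label per atom, which is why the data set lists 1618 instances but only 188 molecules.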

