This section describes main characteristics of the mutagenesis-atoms data set and its attributes:
General information
Mutagenesis-Atoms data set |
Type | Multi instance | Origin | Real world |
Features | 11 | (Real / Integer / Nominal) | (10 / 0 / 1) |
Instances | 1618 |
Classes | 2 |
Missing values? | No |
Attribute description
Attribute | Domain | Attribute | Domain |
Atoms-bag-id | {1646, ... , 1439} | Type=f | [0.0, 1.0] |
Charge | [-0.781, 1.002] | Type=h | [0.0, 1.0] |
Quantatype | [1.0, 232.0] | Type=i | [0.0, 1.0] |
Type=br | [0.0, 1.0] | Type=n | [0.0, 1.0] |
Type=c | [0.0, 1.0] | Type=o | [0.0, 1.0] |
Type=cl | [0.0, 1.0] | Class | {0, 1} |
Additional information
The problem consists of predicting the mutagenicity of the molecules, that is, determining whether a molecule is mutagenic or non-mutagenic. The dataset for mutagenesis consists of 188 molecules, of which 125 are mutagenic (active) and 63 are non-mutagenic (inactive). From a MIL perspective different transformations are considered, concretely, mutagenesis-atoms represents as a bag all atoms.
In this section you can download some files related to the mutagenesis-atoms data set:
- The complete data set already formatted in KEEL format can be downloaded from
here.
- A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from here.
- The header file associated to this data set can be downloaded from here.
|