![]() |
![]() |
|||||||||||
|
Table of ContentsKEEL Reference Manual
Data filesIn KEEL, the data sets are managed by plain ASCII text files, with the .dat extension. Usually, they are located under the ../dist/data directory, each one in its own folder (which also should contains the partitions created from the whole data set). In addition, preprocess methods will also create data files as its output, which will be placed on the ../datasets directory of its experiment. This section describes the format employed to define them (which is fairly similar to WEKA arff format). Each KEEL data file is composed by 2 sections:
In both sections it is possible to insert comments, by employing the "%"character. HeaderThe header is composed by the following metadata:
The @inputs and @outputs definitions are optional. If they are missing, all the attributes will be considered as input attributes, except the last, which will be considered as output attribute. DataThe data instances are represented as rows of comma separated values, where each value corresponds to one attribute, in the order defined by the header. Missing or null values are defined as <null> or ?. If the dataset corresponds to a classification problem, the output type must be nominal: If the dataset corresponds to a regression problem, the output type must be real: Example of useThis is a valid example of a data file: |
|||||||||||
|