KEEL: A software tool to assess evolutionary algorithms for Data Mining problems (regression, classification, clustering, pattern mining and so on)

KEEL-dataset - data set description

Marketing data set

Description
Files and additional references

Description

This section describes main characteristics of the marketing data set and its attributes:

General information

Marketing data set
Type	Classification	Origin	Real world
Features	13	(Real / Integer / Nominal)	(0 / 13 / 0)
Classes	9	Missing values?	Yes
Total instances	8993	Instances without missing values	6876

Attribute description

Attribute	Domain	Attribute	Domain
Sex	[1, 2]	HouseholdMembers	[1, 9]
MaritalStatus	[1, 5]	Under18	[0, 9]
Age	[1, 7]	HouseholdStatus	[1, 3]
Education	[1, 6]	TypeOfHome	[1, 5]
Occupation	[1, 9]	EthnicClass	[1, 8]
YearsInSf	[1, 5]	Language	[1, 3]
DualIncome	[1, 3]	Income	{1, 2, 3, 4, 5, 6, 7, 8, 9}

Additional information

This dataset contains questions from questionaries that were filled out by shopping mall customers in the San Francisco Bay area. The goal is to predict the Anual Income of Household from the other 13 demographics attributes.

Files and additional references

In this section you can download some files related to the marketing data set:

The complete data set already formatted in KEEL formatcan be downloaded from here .
A copy of the data set already partitioned by means of a 10-folds cross validation procedure can be downloaded from here .
A copy of the data set already partitioned by means of a 5-folds cross validation procedure can be downloaded from here .
The header file associated to this data set can be downloaded from here .
This is not a native data set from the KEEL project. It has been obtained from the Orange Repository. The original page where the data set can be found is: http://orange.biolab.si.