KEEL-dataset - data set description



This section describes the main characteristics of the california data set and its attributes:

General information

California Housing data set
Type: Regression
Origin: Real world
Features: 8
(Real / Integer / Nominal): (3 / 5 / 0)
Instances: 20640
Missing values? No

Attribute description

Attribute           Domain
Longitude           [-124.35,-114.31]
Latitude            [32.54,41.95]
HousingMedianAge    [1,52]
TotalRooms          [2,39320]
TotalBedrooms       [1,6445]
Population          [3,35682]
Households          [1,6082]
MedianIncome        [0.4999,15.0001]
MedianHouseValue    [14999,500001]
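
The domains listed above can double as a quick sanity check on loaded records. The following is a minimal sketch in Python, assuming each record is held as a dictionary keyed by attribute name; the helper name `out_of_domain` and the `example` record are hypothetical, and the example values are made up for illustration rather than taken from the data set.

```python
# Attribute domains copied from the attribute description: name -> (min, max).
DOMAINS = {
    "Longitude": (-124.35, -114.31),
    "Latitude": (32.54, 41.95),
    "HousingMedianAge": (1, 52),
    "TotalRooms": (2, 39320),
    "TotalBedrooms": (1, 6445),
    "Population": (3, 35682),
    "Households": (1, 6082),
    "MedianIncome": (0.4999, 15.0001),
    "MedianHouseValue": (14999, 500001),
}


def out_of_domain(record):
    """Return the names of attributes that are missing or outside their domain."""
    bad = []
    for name, (low, high) in DOMAINS.items():
        value = record.get(name)
        if value is None or not (low <= value <= high):
            bad.append(name)
    return bad


# Hypothetical record with made-up values (NOT a row from the data set).
example = {
    "Longitude": -122.0,
    "Latitude": 37.5,
    "HousingMedianAge": 30,
    "TotalRooms": 2000,
    "TotalBedrooms": 400,
    "Population": 1200,
    "Households": 380,
    "MedianIncome": 3.5,
    "MedianHouseValue": 200000,
}

print(out_of_domain(example))
```

An empty list means every attribute is present and inside its listed range; any other result names the attributes to inspect.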

Additional information

This data set contains information about all the block groups in California from the 1990 Census. In this sample, a block group includes 1425.5 individuals on average, living in a geographically compact area.

The task is to approximate the median house value of each block group from the values of the rest of the variables.
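
As a rough illustration of this regression task, the sketch below builds the simplest possible predictor, one that always returns the mean MedianHouseValue seen in training, and scores it with the root mean squared error. Both choices are assumptions made for illustration: this page does not prescribe a model or an evaluation metric, the function names are hypothetical, and the target values are made up rather than taken from the data set.

```python
import math


def rmse(actual, predicted):
    """Root mean squared error between two equally long sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mean_baseline(train_targets):
    """Return a predictor that ignores its inputs and outputs the training mean."""
    mean = sum(train_targets) / len(train_targets)
    return lambda inputs: mean


# Made-up MedianHouseValue targets (NOT rows from the data set).
train_targets = [100000, 200000, 300000]
test_targets = [150000, 250000]

predict = mean_baseline(train_targets)
# The baseline ignores the 8 input attributes, so None stands in for them here.
predictions = [predict(None) for _ in test_targets]
print(rmse(test_targets, predictions))
```

Any model that actually uses the eight input attributes should achieve a lower error than this baseline on held-out block groups.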




In this section you can download some files related to the california data set:

  • The complete data set, already formatted in KEEL format, can be downloaded from here.
  • A copy of the data set, already partitioned by means of a 5-fold cross-validation procedure, can be downloaded from here.
  • The header file associated with this data set can be downloaded from here.
  • This is not a native data set from the KEEL project. It has been obtained from the LIACC repository. The original page where the data set can be found is: http://www.liaad.up.pt/~ltorgo/Regression/DataSets.html.
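
Once downloaded, a KEEL-format file can be read with a few lines of Python. The sketch below assumes the usual KEEL layout: `@relation`, `@attribute`, `@inputs` and `@outputs` header lines followed by `@data` and comma-separated rows. The function name `parse_keel` is hypothetical, and `SAMPLE` is a shortened, made-up stand-in for the real file (three attributes only, invented rows), so the actual header may differ in detail.

```python
import re

# Matches lines such as: @attribute Longitude real [-124.35, -114.31]
ATTRIBUTE_RE = re.compile(
    r"@attribute\s+(\w+)\s+(\w+)\s*\[\s*([^,\]]+),\s*([^\]]+)\]",
    re.IGNORECASE,
)


def parse_keel(text):
    """Parse KEEL-style text into (attributes, inputs, outputs, rows)."""
    attributes = {}  # name -> (type, (low, high))
    inputs, outputs, rows = [], [], []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        if in_data:
            # Every attribute in this data set is numeric, so float() suffices.
            rows.append([float(v) for v in line.split(",")])
            continue
        lower = line.lower()
        if lower.startswith("@attribute"):
            match = ATTRIBUTE_RE.match(line)
            if match:
                name, kind, low, high = match.groups()
                attributes[name] = (kind.lower(), (float(low), float(high)))
        elif lower.startswith("@inputs"):
            inputs = [n.strip() for n in line[len("@inputs"):].split(",")]
        elif lower.startswith("@outputs"):
            outputs = [n.strip() for n in line[len("@outputs"):].split(",")]
        elif lower.startswith("@data"):
            in_data = True
    return attributes, inputs, outputs, rows


# Shortened, made-up sample (NOT the real file contents).
SAMPLE = """@relation california
@attribute Longitude real [-124.35, -114.31]
@attribute Latitude real [32.54, 41.95]
@attribute MedianHouseValue integer [14999, 500001]
@inputs Longitude, Latitude
@outputs MedianHouseValue
@data
-122.0, 37.5, 200000
-118.0, 34.0, 150000
"""

attributes, inputs, outputs, rows = parse_keel(SAMPLE)
print(inputs, outputs, len(rows))
```

For the real file, the text would be read from disk and passed to the same function; the file name depends on the downloaded archive and is not assumed here.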


 
 Copyright 2004-2018, KEEL (Knowledge Extraction based on Evolutionary Learning)