public class myDataset
extends java.lang.Object
Modifier and Type | Field and Description |
---|---|
static int |
INTEGER
Number to represent type of variable integer.
|
static int |
NOMINAL
Number to represent type of variable nominal.
|
static int |
REAL
Number to represent type of variable real or double.
|
Constructor and Description |
---|
myDataset()
Init a new set of instances
|
Modifier and Type | Method and Description |
---|---|
double |
average(int position)
It return the average of an specific attribute
|
java.lang.String[] |
clases()
It returns the name of the classes
|
void |
computeInstancesPerClass()
It computes the number the instances per class.
|
java.lang.String |
copyHeader()
It copies the header of the dataset
|
double[][] |
devuelveRangos()
Returns the minimum and maximum values of every attributes as a matrix.
|
double[] |
getemax()
It returns an array with the maximum values of the attributes
|
double[] |
getemin()
It returns an array with the minimum values of the attributes
|
double[] |
getExample(int pos)
Output a specific example
|
double[][] |
getExampleXf(int pos)
Output a specific example
|
double |
getMax(int variable)
It returns the maximum value of the attribute specified
|
double |
getMin(int variable)
It returns the minimum value of the attribute specified
|
int |
getnClasses()
It gets the number of output attributes of the data-set (for example number of classes in classification)
|
int |
getnData()
It gets the size of the data-set
|
int |
getnInputs()
It gets the number of input attributes of the data-set
|
int |
getnVars()
It gets the number of variables of the data-set (including the output)
|
int[] |
getOutputAsInteger()
Returns the output of the data-set as integer values
|
int |
getOutputAsInteger(int pos)
It returns the output value of the example "pos"
|
double[] |
getOutputAsReal()
Returns the output of the data-set as real values
|
double |
getOutputAsReal(int pos)
It returns the output value of the example "pos"
|
java.lang.String[] |
getOutputAsString()
Returns the output of the data-set as nominal values
|
java.lang.String |
getOutputAsString(int pos)
It returns the output value of the example "pos"
|
java.lang.String |
getOutputValue(int intValue)
It returns the name of the class of index intValue
|
int |
getTipo(int variable)
It returns the type of the attribute specified
|
double[][] |
getX()
Outputs an array of examples with their corresponding attribute values.
|
boolean |
hasMissingAttributes()
It checks if the data-set has any missing value
|
boolean |
hasNumericalAttributes()
It checks if the data-set has any numerical value
|
boolean |
hasRealAttributes()
It checks if the data-set has any real value
|
boolean |
isMissing(int i,
int j)
This function checks if the attribute value is missing
|
boolean |
isNominal(int i)
This function checks if the attribute value is nominal
|
java.lang.String[] |
names()
It returns the name of the attributes
|
void |
normalize()
It transform the input space into the [0,1] range
|
int |
numberInstances(int clas)
It returns the number of instances in the dataset of the given class
|
int |
numberValues(int attribute)
It returns the number of different values of an attribute
|
void |
readClassificationSet(java.lang.String datasetFile,
boolean train)
It reads the whole input data-set and it stores each example and its associated output value in
local arrays to ease their use.
|
int |
size()
It returns the number of examples
|
int |
sizeWithoutMissing()
It return the size of the data-set without having account the missing values
|
double |
stdDev(int position)
It returns the standard deviation of an specific attribute
|
void |
transform(DataBase dataBase)
Transform the data-set given as argument to correspond the fuzzy labels.
|
public static final int REAL
public static final int INTEGER
public static final int NOMINAL
public double[][] getX()
public double[] getExample(int pos)
pos
- position (id) of the example in the data-setpublic double[][] getExampleXf(int pos)
pos
- position (id) of the example in the data-setpublic int[] getOutputAsInteger()
public double[] getOutputAsReal()
public java.lang.String[] getOutputAsString()
public java.lang.String getOutputAsString(int pos)
pos
- the position (id) of the examplepublic int getOutputAsInteger(int pos)
pos
- the position (id) of the examplepublic double getOutputAsReal(int pos)
pos
- the position (id) of the examplepublic double[] getemax()
public double[] getemin()
public double getMax(int variable)
variable
- index of the attributepublic double getMin(int variable)
variable
- index of the attributepublic int getnData()
public int getnVars()
public int getnInputs()
public int getnClasses()
public boolean isMissing(int i, int j)
i
- Example idj
- Variable idpublic boolean isNominal(int i)
i
- attribute idpublic void readClassificationSet(java.lang.String datasetFile, boolean train) throws java.io.IOException
datasetFile
- name of the file containing the datasettrain
- It must have the value "true" if we are reading the training data-setjava.io.IOException
- If there ocurs any problem with the reading of the data-setpublic void transform(DataBase dataBase)
dataBase
- DataBase where the instances and the labels for each variables are stored and needed to transform the data.public java.lang.String copyHeader()
public void normalize()
public boolean hasRealAttributes()
public boolean hasNumericalAttributes()
public boolean hasMissingAttributes()
public int sizeWithoutMissing()
public int size()
public double stdDev(int position)
position
- attribute id (position of the attribute)public double average(int position)
position
- attribute id (position of the attribute)public void computeInstancesPerClass()
public int numberInstances(int clas)
clas
- the index of the classpublic int numberValues(int attribute)
attribute
- the index of the attributepublic java.lang.String getOutputValue(int intValue)
intValue
- the index of the classpublic int getTipo(int variable)
variable
- index of the attributepublic double[][] devuelveRangos()
public java.lang.String[] names()
public java.lang.String[] clases()