public class myDataset
extends java.lang.Object
It contains the methods to read a Dataset for the Association Rules Mining problem
Modifier and Type | Field and Description |
---|---|
static int |
INTEGER
Number to represent type of variable integer.
|
static int |
NOMINAL
Number to represent type of variable nominal.
|
static int |
REAL
Number to represent type of variable real or double.
|
Constructor and Description |
---|
myDataset(int nPartition)
Initialize a new set of instances
|
Modifier and Type | Method and Description |
---|---|
java.lang.String |
getAttributeName(int id_attr)
It returns the name of the attribute in "id_attr"
|
int |
getAttributeType(int id_attr)
It returns the type of the attribute in "id_attr"
|
double[] |
getemax()
It returns an array with the maximum values of the attributes
|
double[] |
getemin()
It returns an array with the minimum values of the attributes
|
int[][] |
getFakeTransactions()
Outputs an array of transactions with their recasted attribute values.
|
double |
getMax(int variable)
It returns the upper bound of the variable
|
double |
getMin(int variable)
It returns the lower bound of the variable
|
boolean[] |
getMissing(int pos)
It returns an array indicating the position of the missing values on a specific example
|
java.lang.String |
getNominalValue(int id_attr,
int id_val)
It returns the nominal value "id_val" within the attribute "id_attr"
|
int |
getnTrans()
It gets the size of the data-set
|
int |
getnVars()
It gets the number of variables of the data-set
|
double[] |
getSteps()
It returns an array with the step values of each attribute depending on the chosen number of partitions
|
java.util.Hashtable<java.lang.Integer,java.util.HashSet<java.lang.Integer>> |
getTIDList()
It outputs an array of attribute values with their corresponding TIDs
|
double[][] |
getTrueTransactions()
Outputs an array of transactions with their corresponding attribute values.
|
boolean |
hasMissingAttributes()
It checks if the data-set has any missing value
|
boolean |
hasNumericalAttributes()
It checks if the data-set has any numerical value (real or integer)
|
boolean |
hasRealAttributes()
It checks if the data-set has any real value
|
boolean |
isMissing(int i,
int j)
This function checks if the attribute value is missing
|
void |
readDataSet(java.lang.String datasetFile)
It reads the whole input data-set and it stores each transaction in
local array
|
int |
sizeWithoutMissing()
It return the size of the data-set without having account the missing values
|
public static final int REAL
public static final int INTEGER
public static final int NOMINAL
public myDataset(int nPartition)
Initialize a new set of instances
nPartition
- The number of partition in which numeric attributes are uniformly dividedpublic double[][] getTrueTransactions()
public int[][] getFakeTransactions()
public double[] getemax()
public double[] getemin()
public double getMax(int variable)
variable
- Id otf the attributepublic double getMin(int variable)
variable
- Id of the attributepublic int getnTrans()
public int getnVars()
public boolean isMissing(int i, int j)
i
- int Example idj
- int Variable idpublic void readDataSet(java.lang.String datasetFile) throws java.io.IOException
datasetFile
- String name of the file containing the data-setjava.io.IOException
- If there occurs any problem with the reading of the data-setpublic boolean hasRealAttributes()
public boolean hasNumericalAttributes()
public boolean hasMissingAttributes()
public int sizeWithoutMissing()
public boolean[] getMissing(int pos)
pos
- int Id of the examplepublic double[] getSteps()
public java.util.Hashtable<java.lang.Integer,java.util.HashSet<java.lang.Integer>> getTIDList()
public java.lang.String getAttributeName(int id_attr)
id_attr
- int Id of the attributepublic int getAttributeType(int id_attr)
id_attr
- int Id of the attributepublic java.lang.String getNominalValue(int id_attr, int id_val)
id_attr
- int Id of the attributeid_val
- int Id of the nominal value within the attribute