public class EventCovering
extends java.lang.Object
Based on the work of Wong et al., a mixed-mode probability model is approximated by a discrete one. First, they discretize the continuous components using a minimum loss of information criterion. Treating a mixed-mode feature n-tuple as a discrete-valued one, the authors propose a new statistical approach for synthesis of knowledge based on cluster analysis. As main advantage, this method does not require neither scale normalization nor ordering of discrete values. By synthesis of the data into statistical knowledge, they refer to the following processes: 1) synthesize and detect from data inherent patterns which indicate statistical interdependency; 2) group the given data into inherent clusters based on these detected interdependency; and 3) interpret the underlying patterns for each clusters identified.
The method of synthesis is based on author's eventcovering approach. With the developed inference method, we are able to estimate the MVs in the data. This method assumes the data is DISCRETIZED (but won't throw any error with continuous data).Constructor and Description |
---|
EventCovering(java.lang.String fileParam)
Creates a new instance of EventCovering
|
Modifier and Type | Method and Description |
---|---|
protected java.util.Vector |
clusterInitation(double[] Px)
Initializes the set of clusters using information of the data set
|
protected double[][] |
computeMutualInformation()
Estimates the mutual information between the instances in the data set
|
protected double[] |
computePx(java.util.Vector tree)
Computes the conjunctive probabilities using the second order probabilities.
|
protected java.util.Vector |
computeTree(double[][] I)
Computes the dependece Tree using Dijkstra algorithm
|
protected double |
D(Instance x,
java.util.Vector S)
Computes the minimum Hamming distance between the instance given and one
of the instances in the set given.
|
protected double |
dist(Instance i1,
Instance i2)
Computes the Hamming distance between 2 instances
|
protected double |
NS(Instance inst,
int numCluster,
int sizeCluster,
double[] R,
FreqListPair[] acj_xk,
java.util.Vector[] Ekc,
double NS_denom)
Returns the NS value of a instance given.
|
void |
process()
Process the training and test files provided in the parameters file to the constructor.
|
protected java.util.Vector |
refineClusters(java.util.Vector Clusters)
This method refines the initial clusters obtained by clusterInitiation()
|
public EventCovering(java.lang.String fileParam)
fileParam
- The path to the configuration file with all the parameters in KEEL formatprotected double dist(Instance i1, Instance i2)
Computes the Hamming distance between 2 instances
i1
- First Instancei2
- Second instanceprotected double[][] computeMutualInformation()
Estimates the mutual information between the instances in the data set
protected java.util.Vector computeTree(double[][] I)
Computes the dependece Tree using Dijkstra algorithm
I
- The paired-mutual information of this data setprotected double[] computePx(java.util.Vector tree)
Computes the conjunctive probabilities using the second order probabilities.
tree
- The dependence tree of this data setprotected java.util.Vector clusterInitation(double[] Px)
Initializes the set of clusters using information of the data set
Px
- The second order probablity estimationprotected java.util.Vector refineClusters(java.util.Vector Clusters)
This method refines the initial clusters obtained by clusterInitiation()
Clusters
- The set of clusters to be refinedprotected double NS(Instance inst, int numCluster, int sizeCluster, double[] R, FreqListPair[] acj_xk, java.util.Vector[] Ekc, double NS_denom)
inst
- instance given.numCluster
- number of cluster considered.sizeCluster
- size of the cluster.R
- Double array R parameter.acj_xk
- Frequencies list pair.Ekc
- Ekc matrix.NS_denom
- NS fraction to divide the NS value.protected double D(Instance x, java.util.Vector S)
x
- instance given.S
- instances set given.public void process()
Process the training and test files provided in the parameters file to the constructor.