spark-DEMD-discretizer
A Distributed Evolutionary Multivariate Discretizer (DEMD)
Distributed multivariate discretizer for Apache Spark based on a evolutionary points selection scheme, called Distributed Evolutionary Multivariate Discretizer (DEMD). It has been inspired by EMD evolutionary discretizer, so both algorithm have things in common as a binary representation and a fitness function, however many improvements have been introduced in DEMD to suit a distributed environment, for example that partial solutions are generated locally and fused eventually to produce the final discretization scheme.
Status
Use
To Include this package in your Spark application via spark-shell or pySpark, you must use it like:
$SPARK_HOME/bin/spark-shell\ --packages sramirez:spark-DEMD-discretizer:1.0
where $SPARK_HOME is your Spark path.
Release
The latest version is : 1.0.0 Date: 2016-02-04 / Scala version: 2.10
Reference
S. Ramírez-Gallego, S. García, J.M. Benítez, F. Herrera. A distributed evolutionary multivariate discretizer for Big Data processing on Apache Spark. Swarm and Evolutionary Computation (2017), in press. doi: 10.1016/j.swevo.2017.08.005