spark-stemming
Spark stemming
Spark MLlib wrapper around Snowball stemming ,which is a small string processing language designed for creating stemming algorithms for use in Information Retrieval. This package allows to use it as a part of Spark ML Pipeline API.
Status
Use
To Include this package in your Spark application via spark-shell or pySpark, you must use it like:
$SPARK_HOME/bin/spark-shell --packages com.github.master:spark-stemming_2.10:0.2.0
where $SPARK_HOME is your Spark path.
You can also link it using SBT:
libraryDependencies += "com.github.master" %% "spark-stemming" % "0.2.0"
Or using Maven:
<dependency>
<groupId>com.github.master</groupId>
<artifactId>spark-stemming_2.10</artifactId>
<version>0.2.0</version>
</dependency>
Release
The latest version is : 0.2.0 Date: 2017-12-06 / Scala version: 2.10