spark-stemming

Spark stemming

A Spark MLlib wrapper around Snowball, a small string processing language designed for creating stemming algorithms for use in information retrieval. This package lets you use Snowball stemmers as part of the Spark ML Pipeline API.

Use

To include this package in your Spark application via spark-shell or PySpark, pass its coordinates with the --packages option:

$SPARK_HOME/bin/spark-shell --packages com.github.master:spark-stemming_2.10:0.2.0

where $SPARK_HOME is your Spark installation directory.
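The same coordinate should also work with the other Spark launchers that accept --packages. The sketch below only assembles the coordinate and prints the resulting commands, so it runs without a Spark installation. The variable names and your-app.jar are placeholders chosen for illustration, not part of this package.

```shell
# Maven coordinate in group:artifact_scalaVersion:version form.
# Variable names here are illustrative placeholders.
SCALA_BINARY_VERSION="2.10"   # assumed to match the Scala version of your Spark build
PACKAGE_VERSION="0.2.0"
COORDINATE="com.github.master:spark-stemming_${SCALA_BINARY_VERSION}:${PACKAGE_VERSION}"

# Print the launcher commands instead of executing them.
echo "\$SPARK_HOME/bin/pyspark --packages ${COORDINATE}"
echo "\$SPARK_HOME/bin/spark-submit --packages ${COORDINATE} your-app.jar"
```

Keeping the Scala suffix in a single variable makes it harder to mix an artifact built for one Scala version with a Spark build that uses another.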

You can also link it using SBT:

libraryDependencies += "com.github.master" %% "spark-stemming" % "0.2.0"

Or using Maven:

<dependency>
    <groupId>com.github.master</groupId>
    <artifactId>spark-stemming_2.10</artifactId>
    <version>0.2.0</version>
</dependency>

Release

Latest version: 0.2.0 / Date: 2017-12-06 / Scala version: 2.10