NLP Text processing library built on top of Apache Spark
John Snow Labs Spark-NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.
SparkNLP is built on top of Apache Spark 2.4.0 and works with any user provided Spark 2.x.x it is advised to have basic knowledge of the framework and a working environment before using Spark-NLP.
Spark-NLP for Python
wheel python module
Build python package with
python3 setup.py sdist bdist_wheel
python3 -m pip install --force-reinstall --user dist/spark_nlp-1.8.3-py2.py3-none-any.whl
Release history Release notifications | RSS feed
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size spark_nlp-1.8.3-py2.py3-none-any.whl (101.7 MB)||File type Wheel||Python version py2.py3||Upload date||Hashes View|
|Filename, size spark-nlp-1.8.3.tar.gz (101.6 MB)||File type Source||Python version None||Upload date||Hashes View|
Hashes for spark_nlp-1.8.3-py2.py3-none-any.whl