NLP Text processing library built on top of Apache Spark
Project description
Spark-NLP
John Snow Labs Spark-NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant & accurate NLP annotations for machine learning pipelines, that scale easily in a distributed environment.
Requirements
SparkNLP is built on top of Apache Spark 2.4.0 and works with any user provided Spark 2.x.x it is advised to have basic knowledge of the framework and a working environment before using Spark-NLP.
Spark-NLP for Python
Dependencies on python3-devel
and wheel
python module
Build python package with python3 setup.py sdist bdist_wheel
Install with python3 -m pip install --force-reinstall --user dist/spark_nlp-1.8.3-py2.py3-none-any.whl
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for spark_nlp-2.0.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 420ad245a4076a2de367a221c17d5724b8e55fa863efd57f80a68de338666235 |
|
MD5 | 9057da69fc70f9efb53af8fbaec46055 |
|
BLAKE2b-256 | 8a831d973a2d7edbbb0c2ca6b1503638a04f44cda9e3d69a603d979e2b112122 |