
NLP Text processing library built on top of Apache Spark

Project description

Spark-NLP

John Snow Labs Spark-NLP is a natural language processing library built on top of Apache Spark ML. It provides simple, performant, and accurate NLP annotations for machine learning pipelines that scale easily in a distributed environment.

Requirements

Spark-NLP is built on top of Apache Spark 2.4.0 and works with any user-provided Spark 2.x.x. It is advised to have basic knowledge of the framework and a working environment before using Spark-NLP.
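The version requirement above can be checked before going further. The following is a minimal sketch, not part of Spark-NLP: the helper name is_supported_spark is hypothetical, and it only assumes that "Spark 2.x.x" means a version string whose major component is 2.

```python
def is_supported_spark(version: str) -> bool:
    """Return True when the version string has major version 2 (Spark 2.x.x)."""
    # Hypothetical helper: only the leading component of the version is inspected.
    major = version.strip().split(".")[0]
    return major == "2"


if __name__ == "__main__":
    try:
        import pyspark  # present only when Spark's Python bindings are installed
    except ImportError:
        print("pyspark is not installed")
    else:
        print(pyspark.__version__, is_supported_spark(pyspark.__version__))
```

Run as a script, it prints the installed pyspark version together with the result of the check, or a short message when pyspark is missing.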

Spark-NLP for Python

Depends on the python3-devel system package and the wheel Python module.

Build the Python package with: python3 setup.py sdist bdist_wheel

Install with: python3 -m pip install --force-reinstall --user dist/spark_nlp-1.8.3-py2.py3-none-any.whl
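After installing the wheel, it can help to confirm that the required modules are visible to the interpreter. The sketch below uses only the standard library. The import name sparknlp is an assumption (the page itself only gives the distribution name spark-nlp), and the helper names are hypothetical.

```python
import importlib.util


def is_importable(module_name: str) -> bool:
    """Return True if a top-level module can be located without importing it."""
    return importlib.util.find_spec(module_name) is not None


def check_environment(modules=("pyspark", "sparknlp")):
    """Map each module name to whether it was found.

    "sparknlp" is assumed to be the import name of the spark-nlp package.
    """
    return {name: is_importable(name) for name in modules}


if __name__ == "__main__":
    for name, found in check_environment().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

A "missing" entry suggests the install step did not target the interpreter that is currently running.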

Download files

Download the file for your platform.

Source Distribution

spark-nlp-1.8.4.tar.gz (16.6 kB)

Uploaded Source

Built Distribution

spark_nlp-1.8.4-py2.py3-none-any.whl (18.8 kB)

Uploaded Python 2, Python 3

File details

Details for the file spark-nlp-1.8.4.tar.gz.

File metadata

  • Download URL: spark-nlp-1.8.4.tar.gz
  • Upload date:
  • Size: 16.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.2 pkginfo/1.4.2 requests/2.20.0 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.7.2

File hashes

Hashes for spark-nlp-1.8.4.tar.gz
Algorithm Hash digest
SHA256 5f8f1a28af4a27290276a6438e274126625f99323c2a0eabaffd5839b87c58e7
MD5 bc2d3d0c8f32c5bf7306d7b4ba2eb20b
BLAKE2b-256 4aef233e5d2502aae0f235524efa86a65cbabc55894a149d8627c1e3aae20f5e


File details

Details for the file spark_nlp-1.8.4-py2.py3-none-any.whl.

File metadata

  • Download URL: spark_nlp-1.8.4-py2.py3-none-any.whl
  • Upload date:
  • Size: 18.8 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.2 pkginfo/1.4.2 requests/2.20.0 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.7.2

File hashes

Hashes for spark_nlp-1.8.4-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 6738732b20af843dd515db86cbfcb0c9ded58f131e97177badd7a32ee609492a
MD5 b324632015733c0207a44c501ed2bf8d
BLAKE2b-256 2fa1adfdc0c00b21cc90fb504eb8638ea2edb4512e9514ddc4fe0570ae59620e
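The SHA256 digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's hashlib. The digests are copied from the file details on this page, while the function names and the local paths passed to them are hypothetical.

```python
import hashlib

# SHA256 digests copied from the file details above.
EXPECTED_SHA256 = {
    "spark-nlp-1.8.4.tar.gz":
        "5f8f1a28af4a27290276a6438e274126625f99323c2a0eabaffd5839b87c58e7",
    "spark_nlp-1.8.4-py2.py3-none-any.whl":
        "6738732b20af843dd515db86cbfcb0c9ded58f131e97177badd7a32ee609492a",
}


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, filename):
    """Compare a local file's digest with the one published for `filename`."""
    return sha256_of(path) == EXPECTED_SHA256[filename]
```

For example, matches_published_hash("dist/spark-nlp-1.8.4.tar.gz", "spark-nlp-1.8.4.tar.gz") returns True only when the local file (the dist/ path is illustrative) is byte-for-byte identical to the published archive.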

