Skip to main content

Industrial-strength Natural Language Processing (NLP) with Python and Cython

Project description

spaCy: Industrial-strength NLP

spaCy is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. spaCy comes with pre-trained statistical models and word vectors, and currently supports tokenization for 50+ languages. It features state-of-the-art speed, convolutional neural network models for tagging, parsing and named entity recognition and easy deep learning integration. It's commercial open-source software, released under the MIT license.

💫 Version 2.1 out now! Check out the release notes here.

Azure Pipelines Travis Build Status Current Release Version pypi Version conda Version Python wheels Code style: black spaCy on Twitter

📖 Documentation

Documentation
spaCy 101 New to spaCy? Here's everything you need to know!
Usage Guides How to use spaCy and its features.
New in v2.1 New features, backwards incompatibilities and migration guide.
API Reference The detailed reference for spaCy's API.
Models Download statistical language models for spaCy.
Universe Libraries, extensions, demos, books and courses.
Changelog Changes and version history.
Contribute How to contribute to the spaCy project and code base.

💬 Where to ask questions

The spaCy project is maintained by @honnibal and @ines. Please understand that we won't be able to provide individual support via email. We also believe that help is much more valuable if it's shared publicly, so that more people can benefit from it.

Type Platforms
🚨 Bug Reports GitHub Issue Tracker
🎁 Feature Requests GitHub Issue Tracker
👩‍💻 Usage Questions Stack Overflow · Gitter Chat · Reddit User Group
🗯 General Discussion Gitter Chat · Reddit User Group

Features

  • Non-destructive tokenization
  • Named entity recognition
  • Support for 50+ languages
  • Pre-trained statistical models and word vectors
  • State-of-the-art speed
  • Easy deep learning integration
  • Part-of-speech tagging
  • Labelled dependency parsing
  • Syntax-driven sentence segmentation
  • Built in visualizers for syntax and NER
  • Convenient string-to-hash mapping
  • Export to numpy data arrays
  • Efficient binary serialization
  • Easy model packaging and deployment
  • Robust, rigorously evaluated accuracy

📖 For more details, see the facts, figures and benchmarks.

Install spaCy

For detailed installation instructions, see the documentation.

  • Operating system: macOS / OS X · Linux · Windows (Cygwin, MinGW, Visual Studio)
  • Python version: Python 2.7, 3.5+ (only 64 bit)
  • Package managers: pip · conda (via conda-forge)

pip

Using pip, spaCy releases are available as source packages and binary wheels (as of v2.0.13).

pip install spacy

When using pip it is generally recommended to install packages in a virtual environment to avoid modifying system state:

python -m venv .env
source .env/bin/activate
pip install spacy

conda

Thanks to our great community, we've finally re-added conda support. You can now install spaCy via conda-forge:

conda config --add channels conda-forge
conda install spacy

For the feedstock including the build recipe and configuration, check out this repository. Improvements and pull requests to the recipe and setup are always appreciated.

Updating spaCy

Some updates to spaCy may require downloading new statistical models. If you're running spaCy v2.0 or higher, you can use the validate command to check if your installed models are compatible and if not, print details on how to update them:

pip install -U spacy
python -m spacy validate

If you've trained your own models, keep in mind that your training and runtime inputs must match. After updating spaCy, we recommend retraining your models with the new version.

📖 For details on upgrading from spaCy 1.x to spaCy 2.x, see the migration guide.

Download models

As of v1.7.0, models for spaCy can be installed as Python packages. This means that they're a component of your application, just like any other module. Models can be installed using spaCy's download command, or manually by pointing pip to a path or URL.

Documentation
Available Models Detailed model descriptions, accuracy figures and benchmarks.
Models Documentation Detailed usage instructions.
# download best-matching version of specific model for your spaCy installation
python -m spacy download en_core_web_sm

# out-of-the-box: download best-matching default model
python -m spacy download en

# pip install .tar.gz archive from path or URL
pip install /Users/you/en_core_web_sm-2.1.0.tar.gz
pip install https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-2.1.0/en_core_web_sm-2.1.0.tar.gz

Loading and using models

To load a model, use spacy.load() with the model name, a shortcut link or a path to the model data directory.

import spacy
nlp = spacy.load("en_core_web_sm")
doc = nlp(u"This is a sentence.")

You can also import a model directly via its full name and then call its load() method with no arguments.

import spacy
import en_core_web_sm

nlp = en_core_web_sm.load()
doc = nlp(u"This is a sentence.")

📖 For more info and examples, check out the models documentation.

Support for older versions

If you're using an older version (v1.6.0 or below), you can still download and install the old models from within spaCy using python -m spacy.en.download all or python -m spacy.de.download all. The .tar.gz archives are also attached to the v1.6.0 release. To download and install the models manually, unpack the archive, drop the contained directory into spacy/data and load the model via spacy.load('en') or spacy.load('de').

Compile from source

The other way to install spaCy is to clone its GitHub repository and build it from source. That is the common way if you want to make changes to the code base. You'll need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, virtualenv and git installed. The compiler part is the trickiest. How to do that depends on your system. See notes on Ubuntu, OS X and Windows for details.

# make sure you are using the latest pip
python -m pip install -U pip
git clone https://github.com/explosion/spaCy
cd spaCy

python -m venv .env
source .env/bin/activate
export PYTHONPATH=`pwd`
pip install -r requirements.txt
python setup.py build_ext --inplace

Compared to regular install via pip, requirements.txt additionally installs developer dependencies such as Cython. For more details and instructions, see the documentation on compiling spaCy from source and the quickstart widget to get the right commands for your platform and Python version.

Ubuntu

Install system-level dependencies via apt-get:

sudo apt-get install build-essential python-dev git

macOS / OS X

Install a recent version of XCode, including the so-called "Command Line Tools". macOS and OS X ship with Python and git preinstalled.

Windows

Install a version of the Visual C++ Build Tools or Visual Studio Express that matches the version that was used to compile your Python interpreter. For official distributions these are VS 2008 (Python 2.7), VS 2010 (Python 3.4) and VS 2015 (Python 3.5).

Run tests

spaCy comes with an extensive test suite. In order to run the tests, you'll usually want to clone the repository and build spaCy from source. This will also install the required development dependencies and test utilities defined in the requirements.txt.

Alternatively, you can find out where spaCy is installed and run pytest on that directory. Don't forget to also install the test utilities via spaCy's requirements.txt:

python -c "import os; import spacy; print(os.path.dirname(spacy.__file__))"
pip install -r path/to/requirements.txt
python -m pytest <spacy-directory>

See the documentation for more details and examples.

Project details


Release history Release notifications | RSS feed

This version

2.1.8

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spacy-2.1.8.tar.gz (30.7 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

spacy-2.1.8-cp37-cp37m-win_amd64.whl (30.0 MB view details)

Uploaded CPython 3.7mWindows x86-64

spacy-2.1.8-cp37-cp37m-manylinux1_x86_64.whl (30.8 MB view details)

Uploaded CPython 3.7m

spacy-2.1.8-cp37-cp37m-macosx_10_6_intel.macosx_10_9_intel.macosx_10_9_x86_64.macosx_10_10_intel.macosx_10_10_x86_64.whl (34.4 MB view details)

Uploaded CPython 3.7mmacOS 10.10+ Intel (x86-64, i386)macOS 10.10+ x86-64macOS 10.6+ Intel (x86-64, i386)macOS 10.9+ Intel (x86-64, i386)macOS 10.9+ x86-64

spacy-2.1.8-cp36-cp36m-win_amd64.whl (30.0 MB view details)

Uploaded CPython 3.6mWindows x86-64

spacy-2.1.8-cp36-cp36m-manylinux1_x86_64.whl (30.8 MB view details)

Uploaded CPython 3.6m

spacy-2.1.8-cp36-cp36m-macosx_10_6_intel.macosx_10_9_intel.macosx_10_9_x86_64.macosx_10_10_intel.macosx_10_10_x86_64.whl (34.7 MB view details)

Uploaded CPython 3.6mmacOS 10.10+ Intel (x86-64, i386)macOS 10.10+ x86-64macOS 10.6+ Intel (x86-64, i386)macOS 10.9+ Intel (x86-64, i386)macOS 10.9+ x86-64

spacy-2.1.8-cp35-cp35m-win_amd64.whl (29.9 MB view details)

Uploaded CPython 3.5mWindows x86-64

spacy-2.1.8-cp35-cp35m-manylinux1_x86_64.whl (30.8 MB view details)

Uploaded CPython 3.5m

spacy-2.1.8-cp27-cp27mu-manylinux1_x86_64.whl (30.8 MB view details)

Uploaded CPython 2.7mu

File details

Details for the file spacy-2.1.8.tar.gz.

File metadata

  • Download URL: spacy-2.1.8.tar.gz
  • Upload date:
  • Size: 30.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/39.0.1 requests-toolbelt/0.9.1 tqdm/4.32.2 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8.tar.gz
Algorithm Hash digest
SHA256 9c510459a66703739d6ba6c958fcff2627399dd813829a020d5644b532034ab6
MD5 7d9b0b496ea7867cecd351cb313654c9
BLAKE2b-256 58f25a23bb7251988da474eec844b692760cb0a317912291afc77b516f399cff

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 30.0 MB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 1dab40ef0ba0beebdd98a05598930d0091d5ad7db88f615cc6446d86335d0054
MD5 93d4ef9d5609b656c9321cb296e775df
BLAKE2b-256 03a1c7a25240f92aae3f89d97c8a7584212d19b145881f9fca97196e9a100406

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 30.8 MB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 2a1cf2eb8851f27341bde6a597482cc408ca5b54a80fa5ed163d31efacc6ab24
MD5 4f4597b70c5f082b070ce0bf51f9859a
BLAKE2b-256 e8753c000560b15248530694b4bf6222357549accf24b9ee5b27a3f0acc8323e

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp37-cp37m-macosx_10_6_intel.macosx_10_9_intel.macosx_10_9_x86_64.macosx_10_10_intel.macosx_10_10_x86_64.whl.

File metadata

File hashes

Hashes for spacy-2.1.8-cp37-cp37m-macosx_10_6_intel.macosx_10_9_intel.macosx_10_9_x86_64.macosx_10_10_intel.macosx_10_10_x86_64.whl
Algorithm Hash digest
SHA256 a8b7d63a71a60d8512ff18f181d3c0f0fedba8522b6cf5f47df20dc2db2b166c
MD5 5979a168e52d114d70607eef1c19c382
BLAKE2b-256 9a2a057bca697905031a6179227278a49fe9e7841a6d444bc2e13f1266b9f7dc

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 30.0 MB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 d7b29b6689360540fa209529a855428c7742390b20b1dc7a4c7cf6c0e5bc7ff9
MD5 12acaaaf477013265399de283d249bce
BLAKE2b-256 dcc6786987e38465b63ef492b4b7b05ccceaf0413dc1f49127313da243779823

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 30.8 MB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 c026295614a487c7d0340eca94f9dae1beb8ca6f7eabef57e62baf011c304bfc
MD5 a97b545595ea9ff86c009d7726f9b152
BLAKE2b-256 959cafd55bb35cc03e4b3dadc41dd48bc26e0678b08d59f32411735c35bda550

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp36-cp36m-macosx_10_6_intel.macosx_10_9_intel.macosx_10_9_x86_64.macosx_10_10_intel.macosx_10_10_x86_64.whl.

File metadata

File hashes

Hashes for spacy-2.1.8-cp36-cp36m-macosx_10_6_intel.macosx_10_9_intel.macosx_10_9_x86_64.macosx_10_10_intel.macosx_10_10_x86_64.whl
Algorithm Hash digest
SHA256 e66698cd7f6737d1ac1af55428c91d5f7af86bd2295cd43b7bc62ccd3646d216
MD5 bd4ed5618af15c6c6135c11d65623f59
BLAKE2b-256 5c38fc37ad63427e9781e4bf5f350f9a1b9e472b3e48bc856ada9ace7fcf1b7d

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 29.9 MB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 4d9b22447e2671ac95d7e88d65f8edf719235ff26a9f76f35c95cb5bb9970be2
MD5 2dfe2283812969e7cef4218ba8107bc6
BLAKE2b-256 2a1424abf4e9e0ec01ff12f78e2914ade6e384d116d3c29ce9dba832195558f4

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 30.8 MB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 4a744768cdd10a5d3ab2383fd7ce13b9051885e819dd7dfdb4cf9dc111117358
MD5 15a85b9dbcef04dfe0a640ed1461a643
BLAKE2b-256 44e28e0feebd995ea9c0969d0504cfe1664a1899bf57d820080116354bbda096

See more details on using hashes here.

File details

Details for the file spacy-2.1.8-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

  • Download URL: spacy-2.1.8-cp27-cp27mu-manylinux1_x86_64.whl
  • Upload date:
  • Size: 30.8 MB
  • Tags: CPython 2.7mu
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for spacy-2.1.8-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 3986e74403e2ec5a1fc21bf57db4f144ba98a67c61f16e3bd41807af8aee9a32
MD5 347a562c791baf2942324402fda594a8
BLAKE2b-256 ee8388473c86493ba44a35c23c39cff77bb708dbb42f40f2959143e12db70a6d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page