Skip to main content

A package to ease processing (not only) TEI documents with spaCy

Project description

acdh-spacytei is a python package providing utility classes and functions to processing XML (TEI, TCF) encoded documents with/for spaCy


pip install acdh-spacytei


All code unless otherwise noted is licensed under the terms of the MIT License (MIT). Please refer to the file LICENSE in the root directory of this repository.


0.0.9 (2019-02-27)

  • added functions to process prodigy output files

  • pipline processes access model dir

0.0.6 (2019-02-27)

  • added a tokenize method to TeiReader

  • NE information written as rs-tags into TEI

  • minor things

0.0.6 (2019-02-27)

  • added langid to install_requires

  • new function recogito.recogito_dump_to_spacy_ner added

  • minor things

0.0.3 (2019-02-25)

  • minor things

0.0.1 (2019-02-25)

  • First version

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

acdh-spacytei-0.1.2.tar.gz (107.8 kB view hashes)

Uploaded Source

Built Distribution

acdh_spacytei-0.1.2-py3-none-any.whl (118.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page