
English Tokenizer from ELIT


ELIT Tokenizer

ELIT (Emory Information and Language Technology) provides several tokenizers that split text into a sequence of tokens and segment those tokens into sentences. The project is led by the Emory NLP Research Laboratory and is released under the Apache 2.0 license.
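To make the two steps concrete, here is a minimal, self-contained sketch of tokenization followed by sentence segmentation. It is not the ELIT implementation and does not use the ELIT API: the names `tokenize`, `segment`, `TOKEN_PATTERN`, and `SENTENCE_END` are hypothetical, and the rules are deliberately naive (a regular expression plus a split on `.`, `!`, and `?`).

```python
import re

# Illustrative only: hypothetical helpers, NOT the ELIT tokenizer or its API.
# A token is a run of word characters (optionally with one apostrophe part,
# as in "don't") or a single punctuation character.
TOKEN_PATTERN = re.compile(r"\w+(?:'\w+)?|[^\w\s]")
SENTENCE_END = {".", "!", "?"}


def tokenize(text):
    """Return (tokens, offsets); offsets are (begin, end) character spans."""
    tokens, offsets = [], []
    for match in TOKEN_PATTERN.finditer(text):
        tokens.append(match.group())
        offsets.append((match.start(), match.end()))
    return tokens, offsets


def segment(tokens):
    """Group tokens into sentences, closing one after '.', '!' or '?'."""
    sentences, current = [], []
    for token in tokens:
        current.append(token)
        if token in SENTENCE_END:
            sentences.append(current)
            current = []
    if current:  # keep trailing tokens that lack final punctuation
        sentences.append(current)
    return sentences


text = "ELIT splits text. It also finds sentences!"
tokens, offsets = tokenize(text)
print(tokens)
print(offsets)
print(segment(tokens))
```

A rule this simple breaks on abbreviations such as "Dr." or "GA.", where a period does not end a sentence. Handling such cases is the kind of problem a dedicated tokenizer is meant to address.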

  • Latest release: 1.0 (10/15/2021)


Python 3.7 or higher is recommended:

pip install elit_tokenizer
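After installing, a quick check like the following can confirm that the interpreter meets the recommended version and that the package is importable. This is a sketch under one assumption: that the import name matches the distribution name `elit_tokenizer`. The helper `environment_ready` is a hypothetical name introduced here.

```python
import importlib.util
import sys


def environment_ready(package="elit_tokenizer", min_version=(3, 7)):
    """Return True if Python is recent enough and the package can be found.

    Assumption: the import name equals the pip distribution name.
    """
    python_ok = sys.version_info >= min_version
    # find_spec returns None when no top-level module of that name exists.
    package_ok = importlib.util.find_spec(package) is not None
    return python_ok and package_ok


print(environment_ready())
```

If this prints `False`, check that `pip` installed into the same interpreter that runs the script, for example by running `python -m pip install elit_tokenizer`.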




Download files


Source Distribution

elit_tokenizer-1.0.tar.gz (10.8 kB)

Built Distribution

elit_tokenizer-1.0-py3-none-any.whl (17.2 kB)
