English Tokenizer from ELIT

ELIT Tokenizer

ELIT (Emory Information and Language Technology) provides several tokenizers that split text into a sequence of tokens and segment those tokens into sentences. The project is led by the Emory NLP Research Laboratory and is released under the Apache 2.0 license.

  • Latest release: 1.0 (10/15/2021)
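
To make the two steps named in the description concrete (splitting text into tokens, then grouping tokens into sentences), here is a minimal, self-contained sketch. It does not use or reproduce the ELIT tokenizer: the names TOKEN_RE, SENTENCE_END, tokenize, and segment are hypothetical, and the regular expression is a deliberately naive rule that mishandles abbreviations, URLs, and many other cases a real tokenizer covers.

```python
import re
from typing import List

# Hypothetical, naive rule: a token is either a run of word characters
# (optionally with internal apostrophes, e.g. "It's") or a single
# non-space punctuation character.
TOKEN_RE = re.compile(r"\w+(?:'\w+)*|[^\w\s]")

# Hypothetical set of tokens treated as sentence terminators.
SENTENCE_END = {".", "!", "?"}


def tokenize(text: str) -> List[str]:
    """Split raw text into a flat list of tokens."""
    return TOKEN_RE.findall(text)


def segment(tokens: List[str]) -> List[List[str]]:
    """Group tokens into sentences, closing one at each terminator."""
    sentences: List[List[str]] = []
    current: List[str] = []
    for token in tokens:
        current.append(token)
        if token in SENTENCE_END:
            sentences.append(current)
            current = []
    if current:  # keep trailing tokens that lack a terminator
        sentences.append(current)
    return sentences


if __name__ == "__main__":
    for sentence in segment(tokenize("Hello, world! It's fine.")):
        print(sentence)
```

Separating tokenization from segmentation mirrors the order given in the description; whether ELIT implements the two steps separately is not stated here.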

Installation

Python 3.7 or higher is recommended:

pip install elit_tokenizer
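
After installing, one way to confirm that the package is visible to the current interpreter is to query its installed version through the standard library. This is a sketch under two assumptions: the distribution name matches the pip name above, and the interpreter is Python 3.8 or newer (importlib.metadata is not in the standard library on 3.7). The helper name installed_version is hypothetical.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Assumes the distribution is registered under the same name used with pip.
    found = installed_version("elit_tokenizer")
    print(found if found is not None else "elit_tokenizer is not installed")
```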

Documentation

Contact
