English Tokenizer from ELIT
Project description
ELIT Tokenizer
ELIT (Emory Information and Language Technology) features several tokenizers to split text into a sequence of tokens and segment them into sentences. This project is led by the Emory NLP Research Laboratory and under the Apache 2.0 license.
- Latest release: 1.0 (10/15/2021)
Installation
Python 3.7 or higher is recommended:
pip install elit_tokenizer
Documentations
Contact
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
elit_tokenizer-1.0.tar.gz
(10.8 kB
view hashes)
Built Distribution
Close
Hashes for elit_tokenizer-1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 084e6dc53892366814e68c3715a5b2b95b519a7a86fe60d74370d4adc65bdfe7 |
|
MD5 | 25087d4571b862f8d68c430649234a0b |
|
BLAKE2b-256 | 947548263a682d8772e32a06a46a1479d0240cf52c6de90919b6476a3530582b |