English Tokenizer from ELIT
ELIT Tokenizer
ELIT (Emory Information and Language Technology) features several tokenizers that split text into a sequence of tokens and segment those tokens into sentences. The project is led by the Emory NLP Research Laboratory and is released under the Apache 2.0 license.
- Latest release: 1.0 (10/15/2021)
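To make the two steps named above concrete (splitting text into tokens, then grouping tokens into sentences), here is a deliberately naive sketch using only the standard library. It is not ELIT's implementation or API: the names `TOKEN_PATTERN`, `SENTENCE_FINAL`, `tokenize`, and `segment` are hypothetical, and the rules are assumptions chosen for illustration.

```python
import re
from typing import List

# Hypothetical rule: a token is a run of word characters or one non-space symbol.
TOKEN_PATTERN = re.compile(r"\w+|[^\w\s]")

# Hypothetical rule: these tokens always close a sentence.
SENTENCE_FINAL = {".", "!", "?"}


def tokenize(text: str) -> List[str]:
    """Split raw text into a flat list of tokens."""
    return TOKEN_PATTERN.findall(text)


def segment(tokens: List[str]) -> List[List[str]]:
    """Group tokens into sentences, closing one at each sentence-final token."""
    sentences: List[List[str]] = []
    current: List[str] = []
    for token in tokens:
        current.append(token)
        if token in SENTENCE_FINAL:
            sentences.append(current)
            current = []
    if current:
        # Keep trailing tokens that lack closing punctuation.
        sentences.append(current)
    return sentences


if __name__ == "__main__":
    for sentence in segment(tokenize("Hello world! How are you?")):
        print(sentence)
```

A rule this simple breaks on abbreviations such as "Dr." and on decimal numbers, which is the kind of case a dedicated tokenizer is meant to handle.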
Installation
Python 3.7 or higher is recommended:
pip install elit_tokenizer
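After installing, a quick sanity check can confirm that the interpreter meets the recommended version and that the package can be found. This is a minimal sketch: the helper names are hypothetical, and the importable module name `elit_tokenizer` is assumed to match the distribution name.

```python
import importlib.util
import sys

# Minimum version recommended in the installation note above.
MIN_PYTHON = (3, 7)


def python_is_supported(version_info=sys.version_info) -> bool:
    """Return True if the (major, minor) version is at least MIN_PYTHON."""
    return tuple(version_info[:2]) >= MIN_PYTHON


def module_is_installed(module_name: str = "elit_tokenizer") -> bool:
    """Return True if a top-level module can be located without importing it.

    The default module name is an assumption based on the distribution name.
    """
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    print("Python supported:", python_is_supported())
    print("elit_tokenizer found:", module_is_installed())
```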
Documentation
Contact
Download files
Source Distribution
elit_tokenizer-1.0.tar.gz (10.8 kB)
Built Distribution
elit_tokenizer-1.0-py3-none-any.whl (17.2 kB)
File details
Details for the file elit_tokenizer-1.0.tar.gz.
File metadata
- Download URL: elit_tokenizer-1.0.tar.gz
- Upload date:
- Size: 10.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.1
File hashes
Algorithm | Hash digest
---|---
SHA256 | f1b80940fbf69e39050ce9ca4f1640dede9e5f395dc35db3d4007f309175ca16
MD5 | 84ed6e187670f2e81f2b638f8d5a7389
BLAKE2b-256 | 878fad54fd98a7399aa0fc3d49f329a9ce02ecc098d11c574f933bfcb0b5b08b
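A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using the standard `hashlib` module. The helper names and the local file path are hypothetical, and the expected digest is copied from the table for the source distribution.

```python
import hashlib

# SHA256 digest listed above for elit_tokenizer-1.0.tar.gz.
EXPECTED_SHA256 = "f1b80940fbf69e39050ce9ca4f1640dede9e5f395dc35db3d4007f309175ca16"


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Compare a file's digest with the expected value, ignoring hex case."""
    return sha256_of_file(path) == expected.lower()


if __name__ == "__main__":
    import sys

    # Hypothetical usage: pass the path of the downloaded archive.
    if len(sys.argv) > 1:
        print("digest matches:", matches_expected(sys.argv[1]))
```

The same approach works for the wheel by substituting its SHA256 digest.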
File details
Details for the file elit_tokenizer-1.0-py3-none-any.whl.
File metadata
- Download URL: elit_tokenizer-1.0-py3-none-any.whl
- Upload date:
- Size: 17.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.1
File hashes
Algorithm | Hash digest
---|---
SHA256 | 084e6dc53892366814e68c3715a5b2b95b519a7a86fe60d74370d4adc65bdfe7
MD5 | 25087d4571b862f8d68c430649234a0b
BLAKE2b-256 | 947548263a682d8772e32a06a46a1479d0240cf52c6de90919b6476a3530582b