English Tokenizer from ELIT
ELIT Tokenizer
ELIT (Emory Information and Language Technology) features several tokenizers that split text into a sequence of tokens and segment the tokens into sentences. The project is led by the Emory NLP Research Laboratory and is released under the Apache 2.0 license.
- Latest release: 0.9 (10/15/2021)
Installation
Python 3.7 or higher is recommended:
pip install elit_tokenizer
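After installing, a quick sanity check can confirm that the interpreter meets the recommended version and that the package is visible to it. The sketch below uses only the standard library. It assumes the importable module name is `elit_tokenizer` (taken from the pip package name, not confirmed by this page), and `check_environment` is a hypothetical helper name.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 7)  # recommended minimum stated in the installation note
MODULE_NAME = "elit_tokenizer"  # assumption: import name matches the pip package name


def check_environment(module_name=MODULE_NAME, min_python=MIN_PYTHON):
    """Return (python_ok, module_found) without importing the package itself."""
    # Compare only (major, minor) against the recommended minimum.
    python_ok = sys.version_info[:2] >= min_python
    # find_spec returns None when a top-level module cannot be located.
    module_found = importlib.util.find_spec(module_name) is not None
    return python_ok, module_found


if __name__ == "__main__":
    python_ok, module_found = check_environment()
    print(f"Python >= {MIN_PYTHON[0]}.{MIN_PYTHON[1]}: {python_ok}")
    print(f"{MODULE_NAME} importable: {module_found}")
```

If the second value is `False`, the package was probably installed into a different environment than the one running the script.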
Documentation
Contact
Source Distributions
No source distribution files are available for this release.
Built Distribution
File details
Details for the file elit_tokenizer-0.9-py3-none-any.whl.
File metadata
- Download URL: elit_tokenizer-0.9-py3-none-any.whl
- Upload date:
- Size: 17.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.1
File hashes
Algorithm | Hash digest
---|---
SHA256 | 81d6d6d4f4e48d378abab73758d7e3c6c5d73bf088fc53f4e36c5aa7cd2a5a67
MD5 | 2c4073e680305b7c2d8004fbd90d186a
BLAKE2b-256 | ba736923427c9d736eb03cfdd4755b8d41b47d466bd486e820d3a8f9675fc3b8
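The published digests can be used to check a manually downloaded wheel before installing it. The following is a minimal sketch using Python's standard `hashlib`. The function names `sha256_of_file` and `matches_expected` are hypothetical, and the wheel path in the usage note is an assumption about where the file was saved.

```python
import hashlib

# SHA256 digest listed for elit_tokenizer-0.9-py3-none-any.whl
EXPECTED_SHA256 = "81d6d6d4f4e48d378abab73758d7e3c6c5d73bf088fc53f4e36c5aa7cd2a5a67"


def sha256_of_file(path, chunk_size=65536):
    """Compute the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected("elit_tokenizer-0.9-py3-none-any.whl")` should return `True` for an intact copy of the wheel saved in the current directory. pip can perform a comparable check itself through its hash-checking mode with a requirements file.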