English Tokenizer from ELIT
Project description
ELIT Tokenizer
ELIT (Emory Information and Language Technology) features several tokenizers to split text into a sequence of tokens and segment them into sentences. This project is led by the Emory NLP Research Laboratory and under the Apache 2.0 license.
- Latest release: 0.9 (10/15/2021)
Installation
Python 3.7 or higher is recommended:
pip install elit_tokenizer
Documentations
Contact
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for elit_tokenizer-0.9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 81d6d6d4f4e48d378abab73758d7e3c6c5d73bf088fc53f4e36c5aa7cd2a5a67 |
|
MD5 | 2c4073e680305b7c2d8004fbd90d186a |
|
BLAKE2b-256 | ba736923427c9d736eb03cfdd4755b8d41b47d466bd486e820d3a8f9675fc3b8 |