Skip to main content

English Tokenizer from ELIT

Project description

ELIT Tokenizer

ELIT (Emory Information and Language Technology) features several tokenizers to split text into a sequence of tokens and segment them into sentences. This project is led by the Emory NLP Research Laboratory and under the Apache 2.0 license.

  • Latest release: 0.9 (10/15/2021)

Installation

Python 3.7 or higher is recommended:

pip install elit_tokenizer

Documentations

Contact

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

elit_tokenizer-0.9-py3-none-any.whl (17.2 kB view details)

Uploaded Python 3

File details

Details for the file elit_tokenizer-0.9-py3-none-any.whl.

File metadata

  • Download URL: elit_tokenizer-0.9-py3-none-any.whl
  • Upload date:
  • Size: 17.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.1

File hashes

Hashes for elit_tokenizer-0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 81d6d6d4f4e48d378abab73758d7e3c6c5d73bf088fc53f4e36c5aa7cd2a5a67
MD5 2c4073e680305b7c2d8004fbd90d186a
BLAKE2b-256 ba736923427c9d736eb03cfdd4755b8d41b47d466bd486e820d3a8f9675fc3b8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page