Skip to main content

Light-weight sentence tokenizer for Korean.

Project description

A light-weight sentence tokenizer for Korean.

Half-width punctuation is generally used in Korean, but this tokenizer also supports full-width punctuation. (For details about full-width punctuation in Korean, please see https://www.w3.org/TR/klreq/).

Sample Code:

from kr_sentence.tokenizer import tokenize

paragraph_str = "저는 미국인이에요. 만나서 반갑습니다."

sentence_list = tokenize(paragraph_str)

for sentence in sentence_list: print(sentence)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kr_sentence-0.0.3.tar.gz (2.7 kB view details)

Uploaded Source

Built Distribution

kr_sentence-0.0.3-py3-none-any.whl (3.5 kB view details)

Uploaded Python 3

File details

Details for the file kr_sentence-0.0.3.tar.gz.

File metadata

  • Download URL: kr_sentence-0.0.3.tar.gz
  • Upload date:
  • Size: 2.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.9

File hashes

Hashes for kr_sentence-0.0.3.tar.gz
Algorithm Hash digest
SHA256 2194025896bf6903b299dc908579325a40ba5d7897d2f36119de8129cc735856
MD5 c916c54e2c7a9d036a066e86d9ffb31f
BLAKE2b-256 c40eb16df331142d8874ecda6993edcca45b7d85c558f10397f056b59cb476fb

See more details on using hashes here.

File details

Details for the file kr_sentence-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: kr_sentence-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.9

File hashes

Hashes for kr_sentence-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 64d9073f5389f8d0a48afca2804ecad02ac609834e7c523ee0df5023a988f762
MD5 e11f294f4310b564b44c742cc03a7328
BLAKE2b-256 e0430496f75daed088f1ddd46f4b3dcde1bca9774871aa420101faee69945747

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page