Skip to main content

Unsupervised Korean Natural Language Processing Toolkits

Project description

It contains unsupervised word extraction, tokenizers and noun extractors.
These algorithms are not depending training corpus but extract patterns from data by theirselves.

Current version has follows
- Word extraction
- Cohesion score
- Branching Entropy
- Accessor Variety
- Tokenizers
- RegexTokenizer
- LTokenizer
- MaxScoreTokenizer
- Noun extractor
- LRNounExtractor


Following packages are helpful
- krwordrank: Unsupervised Korean word/keyword extractor
- https://github.com/lovit/KR-WordRank
- pip install krwordrank
- soyspacing: Korean spacing error corrector
- https://github.com/lovit/soyspacing
- pip install soyspacing


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soynlp-0.0.24.tar.gz (26.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

soynlp-0.0.24-py3-none-any.whl (30.7 kB view details)

Uploaded Python 3

File details

Details for the file soynlp-0.0.24.tar.gz.

File metadata

  • Download URL: soynlp-0.0.24.tar.gz
  • Upload date:
  • Size: 26.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for soynlp-0.0.24.tar.gz
Algorithm Hash digest
SHA256 e3c50b92a00c42caee8553cc61ce9cbcc75c68d32ff7b84ac21e712c6f98c3ad
MD5 ca59f85d5b20d7c387cdff8bc825391a
BLAKE2b-256 31805993ce70d0db11a206079307301ac660b1c5a43e01da04988c3f526bea38

See more details on using hashes here.

File details

Details for the file soynlp-0.0.24-py3-none-any.whl.

File metadata

File hashes

Hashes for soynlp-0.0.24-py3-none-any.whl
Algorithm Hash digest
SHA256 fff9643a8f73d2e0560b10e81c2c81030580624bf4848bf1a462374714fdf623
MD5 4a86b3df6ade6e7458900439a36d5717
BLAKE2b-256 e232d60ef05d14e64b2dfe7b9d5dc29cfca7ccf3409ec41f75e0a7db1ca91697

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page