Skip to main content

Unsupervised Korean Natural Language Processing Toolkits

Project description

It contains unsupervised word extraction, tokenizers and noun extractors.
These algorithms are not depending training corpus but extract patterns from data by theirselves.

Current version has follows
- Word extraction
- Cohesion score
- Branching Entropy
- Accessor Variety
- Tokenizers
- RegexTokenizer
- LTokenizer
- MaxScoreTokenizer
- Noun extractor
- LRNounExtractor


Following packages are helpful
- krwordrank: Unsupervised Korean word/keyword extractor
- https://github.com/lovit/KR-WordRank
- pip install krwordrank
- soyspacing: Korean spacing error corrector
- https://github.com/lovit/soyspacing
- pip install soyspacing


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soynlp-0.0.22.tar.gz (26.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

soynlp-0.0.22-py3-none-any.whl (30.5 kB view details)

Uploaded Python 3

File details

Details for the file soynlp-0.0.22.tar.gz.

File metadata

  • Download URL: soynlp-0.0.22.tar.gz
  • Upload date:
  • Size: 26.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for soynlp-0.0.22.tar.gz
Algorithm Hash digest
SHA256 443aef267d03e6246d997c377b926e09439a4c9737458fd979f956fad8f4d093
MD5 080693892fdbf7e28f306353e2a6123f
BLAKE2b-256 2d56bc9fd576b3b35e2914f3705f5db6634911846159d5a54c927641540817d8

See more details on using hashes here.

File details

Details for the file soynlp-0.0.22-py3-none-any.whl.

File metadata

File hashes

Hashes for soynlp-0.0.22-py3-none-any.whl
Algorithm Hash digest
SHA256 b5b9beb624f0e1431599b48ec60576bd4234a6be5fe6af147ebefa49a338f132
MD5 0a26ab0d521f67b2fcece2da1625e8ba
BLAKE2b-256 5ac10c80fb6a4205235c03c04ff13c28ee3c657bf942d85f1b5e490772a0cde4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page