Skip to main content

Unsupervised Korean Natural Language Processing Toolkits

Project description

It contains unsupervised word extraction, tokenizers and noun extractors.
These algorithms are not depending training corpus but extract patterns from data by theirselves.

Current version has follows
- Word extraction
- Cohesion score
- Branching Entropy
- Accessor Variety
- Tokenizers
- RegexTokenizer
- LTokenizer
- MaxScoreTokenizer
- Noun extractor
- LRNounExtractor


Following packages are helpful
- krwordrank: Unsupervised Korean word/keyword extractor
- https://github.com/lovit/KR-WordRank
- pip install krwordrank
- soyspacing: Korean spacing error corrector
- https://github.com/lovit/soyspacing
- pip install soyspacing


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soynlp-0.0.23.tar.gz (26.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

soynlp-0.0.23-py3-none-any.whl (30.6 kB view details)

Uploaded Python 3

File details

Details for the file soynlp-0.0.23.tar.gz.

File metadata

  • Download URL: soynlp-0.0.23.tar.gz
  • Upload date:
  • Size: 26.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for soynlp-0.0.23.tar.gz
Algorithm Hash digest
SHA256 0723cd07b5b0815d2f7bfe2aa0182f602cabb91ace167d20b71e0c7f9ed7f798
MD5 8fe9a2e6bde76267ac1dbfa0e712756a
BLAKE2b-256 ada5a8752b7bf84dabd4fde2ecdf37691ac394be51acf07258fac0890b4ec4ac

See more details on using hashes here.

File details

Details for the file soynlp-0.0.23-py3-none-any.whl.

File metadata

File hashes

Hashes for soynlp-0.0.23-py3-none-any.whl
Algorithm Hash digest
SHA256 e6131d8ce9bef98ffb75f69305ee97fb4249397ca9353a1b008ebca508ae3a06
MD5 d3b0125dec1dd0060894a3143689f6cd
BLAKE2b-256 51997d6af9c669116113a6330284209c986869fe67ab36f83bbc2fe1faf49d2b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page