soynlp

Unsupervised Korean Natural Language Processing Toolkits

Project description

It contains unsupervised word extraction, tokenizers and noun extractors.
These algorithms are not depending training corpus but extract patterns from data by theirselves.

Current version has follows
- Word extraction
- Cohesion score
- Branching Entropy
- Accessor Variety
- Tokenizers
- RegexTokenizer
- LTokenizer
- MaxScoreTokenizer
- Noun extractor
- LRNounExtractor

Following packages are helpful
- krwordrank: Unsupervised Korean word/keyword extractor
- https://github.com/lovit/KR-WordRank
- pip install krwordrank
- soyspacing: Korean spacing error corrector
- https://github.com/lovit/soyspacing
- pip install soyspacing

Project details

Release history Release notifications | RSS feed

0.0.493

Aug 25, 2019

0.0.492

Mar 8, 2019

0.0.491

Feb 17, 2019

0.0.49

Oct 16, 2018

0.0.48

Oct 11, 2018

0.0.47

Oct 8, 2018

0.0.46

Jun 5, 2018

0.0.45

May 1, 2018

0.0.43

Apr 25, 2018

0.0.42

Apr 20, 2018

0.0.41

Feb 22, 2018

0.0.28

Sep 12, 2017

0.0.27

Sep 1, 2017

0.0.26

Sep 1, 2017

0.0.25

Aug 12, 2017

0.0.24

Jun 23, 2017

0.0.23

Jun 16, 2017

0.0.22

Jun 16, 2017

0.0.21

Jun 16, 2017

0.0.18

May 25, 2017

0.0.17

May 22, 2017

0.0.16

May 20, 2017

0.0.15

May 19, 2017

0.0.14

May 19, 2017

0.0.13

May 19, 2017

0.0.12

May 19, 2017

0.0.11

May 19, 2017

0.0.4

Feb 12, 2018

0.0.3

Sep 13, 2017

0.0.2

May 25, 2017

This version

0.0.1

May 18, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soynlp-0.0.1.tar.gz (7.8 kB view hashes)

Uploaded May 18, 2017 Source

Built Distribution

soynlp-0.0.1-py3-none-any.whl (9.8 kB view hashes)

Uploaded May 18, 2017 Python 3

Hashes for soynlp-0.0.1.tar.gz

Hashes for soynlp-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`b0fca30d65aa8c768291a9493282765ca214e2ac08a34291d2147d8f0ce7e4cd`
MD5	`7d16a68ebf717f1177ee377d4e2ca677`
BLAKE2b-256	`8bff7be07e954db3af8fea6a12c5a16470c04edc2171c6af3a77e33dd364f9bd`

Hashes for soynlp-0.0.1-py3-none-any.whl

Hashes for soynlp-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`49bd4086e8cbcc83f9da09ade11491608f0e358e804b8c960c14e754277e1726`
MD5	`7d78f5e777908118d7b38df23b09ade1`
BLAKE2b-256	`8f21f731a83480828b26bad0f71e8911bfa53ea45e2337e43e9cff6abe461dfc`