Korean lexical complexity analyzer.
Project description
klca
klca is a Korean lexical complexity analyzer.
Usage
Show help:
python3 -m klca --help
Analyze one file:
python3 -m klca file --input-file path/to/text.txt --output output.json
Analyze a folder:
python3 -m klca folder --input-dir path/to/texts --output results.csv
- Use
--recursiveto include text files in subfolders. Without it, only files directly inside--input-dirare processed.
Included Resources
This package includes bundled resources used by the analyzer, including:
- Reference databases for frequency, range, and association, released as an open-source dataset (korean-fineweb-edu)
- Vocabulary grade database (National Institute of Korean Language resources), released under Korea Open Government License Type 1
Morpheme Parsing and Tagging
- By default,
klcauses the KoreanstanzaGSD model for tokenization, POS tagging, and lemmatization. - The model is downloaded by
stanzaat runtime and is not bundled in this package. - If you want to use a different Korean
stanzamodel or a custom local model, you can modify the Stanza pipeline settings in the setting (both core.py and batch.py).
License
- This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
klca-0.1.3.tar.gz
(20.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
klca-0.1.3-py3-none-any.whl
(20.7 kB
view details)
File details
Details for the file klca-0.1.3.tar.gz.
File metadata
- Download URL: klca-0.1.3.tar.gz
- Upload date:
- Size: 20.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
19af2d659c0f80cca6dc5d8cb3bf74d8e826cfc709f349a0af4704b0852e7d9e
|
|
| MD5 |
01e87dc349443ae453d4147d4ece9611
|
|
| BLAKE2b-256 |
4cf93700dae8cad8090990c4971ee321a2efe8555296a44980f039d4c0654896
|
File details
Details for the file klca-0.1.3-py3-none-any.whl.
File metadata
- Download URL: klca-0.1.3-py3-none-any.whl
- Upload date:
- Size: 20.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
26bee47b1bf5bda45224da1123dd9c265a1a57af32e81ab20e814fa96a6df0f2
|
|
| MD5 |
b57e26a706af0297ecc9ab7552bf7248
|
|
| BLAKE2b-256 |
12ef43943c0715c74035d412dab4be596f78ac647a6f7aa49509cef04fd0c404
|