Skip to main content

Korean lexical complexity analyzer.

Project description

klca

klca is a Korean lexical complexity analyzer.

Usage

Show help:

python3 -m klca --help

Analyze one file:

python3 -m klca file --input-file path/to/text.txt --output output.json

Analyze a folder:

python3 -m klca folder --input-dir path/to/texts --output results.csv
  • Use --recursive to include text files in subfolders. Without it, only files directly inside --input-dir are processed.

Included Resources

This package includes bundled resources used by the analyzer, including:

  • Reference databases for frequency, range, and association, released as an open-source dataset (korean-fineweb-edu)
  • Vocabulary grade database (National Institute of Korean Language resources), released under Korea Open Government License Type 1

Morpheme Parsing and Tagging

  • By default, klca uses the Korean stanza GSD model for tokenization, POS tagging, and lemmatization.
  • The model is downloaded by stanza at runtime and is not bundled in this package.
  • If you want to use a different Korean stanza model or a custom local model, you can modify the Stanza pipeline settings in the setting (both core.py and batch.py).

License

  • This project is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

klca-0.1.3.tar.gz (20.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

klca-0.1.3-py3-none-any.whl (20.7 kB view details)

Uploaded Python 3

File details

Details for the file klca-0.1.3.tar.gz.

File metadata

  • Download URL: klca-0.1.3.tar.gz
  • Upload date:
  • Size: 20.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for klca-0.1.3.tar.gz
Algorithm Hash digest
SHA256 19af2d659c0f80cca6dc5d8cb3bf74d8e826cfc709f349a0af4704b0852e7d9e
MD5 01e87dc349443ae453d4147d4ece9611
BLAKE2b-256 4cf93700dae8cad8090990c4971ee321a2efe8555296a44980f039d4c0654896

See more details on using hashes here.

File details

Details for the file klca-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: klca-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 20.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for klca-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 26bee47b1bf5bda45224da1123dd9c265a1a57af32e81ab20e814fa96a6df0f2
MD5 b57e26a706af0297ecc9ab7552bf7248
BLAKE2b-256 12ef43943c0715c74035d412dab4be596f78ac647a6f7aa49509cef04fd0c404

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page