The Classical Language Toolkit
The Classical Language Toolkit (CLTK) is a Python library offering natural language processing (NLP) for pre-modern languages.
Installation
To install the latest version of the CLTK:
$ pip install cltk
For more information, see the Installation docs; to install from source, see Development.
Pre-1.0 software remains available on the v0.1.x branch, with docs at https://legacy.cltk.org. Install it with pip install "cltk<1.0".
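Because the 1.x and pre-1.0 lines are installed under the same package name, it can help to check which one an environment actually has. The following is a minimal sketch using the standard library's importlib.metadata; the function name cltk_release_line is hypothetical, and the parsing assumes a plain dotted version string such as "1.4.0".

```python
from importlib.metadata import PackageNotFoundError, version


def cltk_release_line(installed=None):
    """Classify a CLTK version string as 'legacy' (pre-1.0) or 'current' (1.0+).

    If no version string is passed, look up the installed ``cltk`` package.
    Returns None when the package is not installed.
    """
    if installed is None:
        try:
            installed = version("cltk")
        except PackageNotFoundError:
            return None
    # Assumption: a simple dotted version, so the major number is the first field.
    major = int(installed.split(".")[0])
    return "legacy" if major < 1 else "current"
```

Calling cltk_release_line() with no argument inspects the current environment; passing a string classifies that version without touching the environment.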
Documentation
Documentation is available at https://docs.cltk.org.
Citation
When using the CLTK, please cite the following publication, including the DOI:
Johnson, Kyle P., Patrick J. Burns, John Stewart, Todd Cook, Clément Besnier, and William J. B. Mattingly. “The Classical Language Toolkit: An NLP Framework for Pre-Modern Languages.” In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations, pp. 20-29. 2021. 10.18653/v1/2021.acl-demo.3
The complete BibTeX entry:
@inproceedings{johnson-etal-2021-classical,
    title = "The {C}lassical {L}anguage {T}oolkit: {A}n {NLP} Framework for Pre-Modern Languages",
    author = "Johnson, Kyle P. and
      Burns, Patrick J. and
      Stewart, John and
      Cook, Todd and
      Besnier, Cl{\'e}ment and
      Mattingly, William J. B.",
    booktitle = "Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: System Demonstrations",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.acl-demo.3",
    doi = "10.18653/v1/2021.acl-demo.3",
    pages = "20--29",
    abstract = "This paper announces version 1.0 of the Classical Language Toolkit (CLTK), an NLP framework for pre-modern languages. The vast majority of NLP, its algorithms and software, is created with assumptions particular to living languages, thus neglecting certain important characteristics of largely non-spoken historical languages. Further, scholars of pre-modern languages often have different goals than those of living-language researchers. To fill this void, the CLTK adapts ideas from several leading NLP frameworks to create a novel software architecture that satisfies the unique needs of pre-modern languages and their researchers. Its centerpiece is a modular processing pipeline that balances the competing demands of algorithmic diversity with pre-configured defaults. The CLTK currently provides pipelines, including models, for almost 20 languages.",
}
License
Copyright (c) 2014-2024 Kyle P. Johnson under the MIT License.
File details
Details for the file cltk-1.4.0.tar.gz.
File metadata
- Download URL: cltk-1.4.0.tar.gz
- Upload date:
- Size: 626.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.12.6 Darwin/23.3.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | 90c7078dc64d93f39e3067c080b577e1371bbf8595d716d01fd0755079ce10c6
MD5 | 82bca682ecbef2b0c7caf5f4fd681de2
BLAKE2b-256 | f9dd3b29d531585c4523450928a92d6cc0ed96b2bdb30ddb6d032f13727ec10a
File details
Details for the file cltk-1.4.0-py3-none-any.whl.
File metadata
- Download URL: cltk-1.4.0-py3-none-any.whl
- Upload date:
- Size: 697.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.12.6 Darwin/23.3.0
File hashes
Algorithm | Hash digest
---|---
SHA256 | 7f14822db7b676b457c3f3e133e5f7c2f96b4d13f66974b878bf854c0f7c5726
MD5 | 8180b86e1300881203eefa77ac8fdaa1
BLAKE2b-256 | ecaf578ced03b6e8d72e0e1b6e79b93af07ec8389d9a98a8f9cca531449ec2dc
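The digests listed for each file can be used to confirm that a downloaded archive is intact. The sketch below computes a SHA256 digest with Python's standard hashlib module and compares it with a published value. The helper names sha256_of and matches_published_digest are hypothetical, and the local file path is whatever location the file was saved to.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        # Read until fh.read() returns an empty bytes object (end of file).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, assuming the source distribution was saved in the working directory, matches_published_digest("cltk-1.4.0.tar.gz", "90c7078dc64d93f39e3067c080b577e1371bbf8595d716d01fd0755079ce10c6") should return True for an unmodified download.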