NLP for the ancient world
# The Classical Language Toolkit
[![PyPi downloads](http://img.shields.io/pypi/v/cltk.svg?style=flat)](https://pypi.python.org/pypi/cltk/) [![Documentation Status](https://readthedocs.org/projects/cltk/badge/?version=latest)](http://docs.cltk.org/en/latest/?badge=latest) [![DOI](https://zenodo.org/badge/doi/10.5281/zenodo.60021.svg)](http://dx.doi.org/10.5281/zenodo.60021)
[![Build Status](https://travis-ci.org/cltk/cltk.svg?branch=master)](https://travis-ci.org/cltk/cltk) [![codecov.io](http://codecov.io/github/cltk/cltk/coverage.svg?branch=master)](http://codecov.io/github/cltk/cltk?branch=master)
[![Join the chat at https://gitter.im/cltk/cltk](https://badges.gitter.im/Join%20Chat.svg)](https://gitter.im/cltk/cltk?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge) <a href="https://zenhub.io"><img src="https://raw.githubusercontent.com/ZenHubIO/support/master/zenhub-badge.png"></a>
## About
The Classical Language Toolkit (CLTK) offers natural language processing support for Classical languages. In some areas, it extends the NLTK. The goals of the CLTK are to:
- compile analysis-friendly corpora in a variety of Classical languages (currently available for Chinese, Coptic, Greek, Hebrew, Latin, Pali, Punjabi, Sanskrit, Telugu, and Tibetan);
- gather, improve, and generate linguistic data required for NLP (Greek and Latin are in progress, with [more in the pipeline](https://github.com/cltk/cltk/wiki/List-of-Classical-languages));
- develop a free and open platform for generating reproducible, scientific research that advances the study of the languages and literatures of the ancient world.
## Documentation
The docs are at [docs.cltk.org](http://docs.cltk.org).
### Installation
CLTK supports Python version 3.5. The software only runs on POSIX-compliant operating systems (Linux, Mac OS X, FreeBSD, etc.).
```bash
$ pip install cltk
```
See docs for [complete installation instructions](http://docs.cltk.org/en/latest/installation.html).
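The two requirements stated above (Python 3.5 and a POSIX-compliant operating system) can be checked before installing. The following is a minimal sketch using only the standard library. The helper name `check_cltk_environment` is hypothetical and not part of the CLTK, and treating 3.5 as a minimum version rather than an exact requirement is an assumption.

```python
import os
import sys


def check_cltk_environment(version_info=None, os_name=None):
    """Return a list of problems found; an empty list means none.

    Hypothetical helper, not part of the CLTK. The arguments default to
    the running interpreter and OS, and exist only to make testing easy.
    """
    version_info = sys.version_info if version_info is None else version_info
    os_name = os.name if os_name is None else os_name

    problems = []
    # Assumption: 3.5 is treated as the minimum supported version.
    if tuple(version_info[:2]) < (3, 5):
        problems.append("Python 3.5 is required, found {}.{}".format(
            version_info[0], version_info[1]))
    # os.name is "posix" on Linux, Mac OS X, and FreeBSD.
    if os_name != "posix":
        problems.append("A POSIX-compliant OS is required, found os.name={!r}".format(
            os_name))
    return problems


if __name__ == "__main__":
    for problem in check_cltk_environment():
        print("Warning:", problem)
```

Running the script prints one warning per unmet requirement and prints nothing when both are satisfied.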
The [CLTK organization curates corpora](https://github.com/cltk) which can be downloaded directly or, better, [imported by the toolkit](http://docs.cltk.org/en/latest/importing_corpora.html).
## Contributing
See the [Quickstart for contributors](https://github.com/cltk/cltk/wiki/Quickstart-for-contributors) for an overview of the process. If you’re looking to start with a small contribution, see the [Issue tracker for “easy” jobs](https://github.com/cltk/cltk/issues?q=is%3Aopen+is%3Aissue+label%3Aeasy) needing to be done. Bigger projects may be found on the [Project ideas](https://github.com/cltk/cltk/wiki/Project-ideas) page. Of course, new ideas are always welcome.
## Citation
Each major release of the CLTK is given a [DOI](http://en.wikipedia.org/wiki/Digital_object_identifier), a type of unique identifier for digital documents. This DOI ought to be included in your citation, as it will allow researchers to reproduce your results should the CLTK’s API or codebase change. To find the CLTK’s current DOI, look at the blue DOI button on the repository’s home page on GitHub. To the end of your bibliographic entry, append `DOI` plus the current identifier. You may also add the version/release number, shown in the `pypi` button on the same page.
Thus, please cite core software as something like:
```
Kyle P. Johnson et al. (2014-2016). CLTK: The Classical Language Toolkit. DOI 10.5281/zenodo.<current_release_id>
```
A style-neutral BibTeX entry would look like this:
```
@Misc{johnson2014,
author = {Kyle P. Johnson et al.},
title = {CLTK: The Classical Language Toolkit},
howpublished = {\url{https://github.com/cltk/cltk}},
note = {{DOI} 10.5281/zenodo.<current_release_id>},
year = {2014--2016},
}
```
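If you generate bibliographies programmatically, the plain-text citation can be assembled from the release identifier. This is only a sketch: `format_cltk_citation` is a hypothetical helper, not something the CLTK provides, and the identifier `60021` used in the example is the one shown in the DOI badge at the top of this README, which may not be the current release.

```python
def format_cltk_citation(release_id, years="2014-2016"):
    """Build the plain-text CLTK citation for a given Zenodo release id.

    Hypothetical helper, not part of the CLTK. `release_id` is the
    trailing number of the DOI shown on the project's DOI button.
    """
    release_id = str(release_id).strip()
    if not release_id:
        raise ValueError("release_id must not be empty")
    # str.format is used instead of f-strings so the code runs on Python 3.5.
    return (
        "Kyle P. Johnson et al. ({years}). "
        "CLTK: The Classical Language Toolkit. "
        "DOI 10.5281/zenodo.{release_id}"
    ).format(years=years, release_id=release_id)


if __name__ == "__main__":
    # Example identifier taken from this README's DOI badge.
    print(format_cltk_citation("60021"))
```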
[Many contributors](https://github.com/cltk/cltk/blob/master/contributors.md) have made substantial contributions to the CLTK. For scholarship about particular code, it might be proper to cite these individuals as authors of the work under discussion.
## Gratitude
We are thankful for the following organizations that have offered support:
- Google Summer of Code (sponsoring two students, 2016)
- JetBrains (licenses for PyCharm and WebStorm)
- Google Cloud Platform (with credits for the Classical Language Archive and API)
## License
The CLTK is Copyright (c) 2016 Kyle P. Johnson, under the MIT License. See [LICENSE](https://github.com/cltk/cltk/blob/master/LICENSE) for details.
Source Distribution
File details
Details for the file cltk-0.1.45.tar.gz.
File metadata
- Download URL: cltk-0.1.45.tar.gz
- Upload date:
- Size: 421.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest
---|---
SHA256 | 84419f5f1eb35a57f88bb25d3a36aab69bf7baf2ef9c75e766389789ebec122f
MD5 | 62c851db366d3e1fa04051e56bd8a944
BLAKE2b-256 | 80f6fe26eb1749bc41ce141128b3edae7ae6279027ec3388529721d73484c1a2
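
The published digests can be used to confirm that a downloaded archive is intact. Below is a minimal sketch using Python's standard `hashlib` module. The function names are hypothetical, and the file path in the usage note assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_digest("cltk-0.1.45.tar.gz", "84419f5f1eb35a57f88bb25d3a36aab69bf7baf2ef9c75e766389789ebec122f")` should return `True` for an unmodified copy of the source distribution listed above.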