corpy

Tools for processing language data.

These details have not been verified by PyPI

Project links

Project description

What is CorPy?

A fancy plural for corpus ;) Also, a collection of handy but not especially mutually integrated tools for dealing with linguistic data. It abstracts away functionality which is often needed in practice in day to day work at the Czech National Corpus, without aspiring to be a fully featured or consistent NLP framework.

Currently available sub-packages are:

morphodita: tokenizing and tagging raw textual data using MorphoDiTa
vertical: parsing corpora in the vertical format devised originally for CWB, used also by (No)SketchEngine
phonetics: rule-based phonetic transcription of Czech

Installation

$ pip3 install corpy

Requirements

Only recent versions of Python 3 are supported by design.

License

Distributed under the GNU General Public License v3.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.1

Apr 5, 2023

0.6

Mar 14, 2023

0.5.2

Mar 13, 2023

0.5.1

Mar 13, 2023

0.5

Jan 17, 2023

0.4.1

Jan 3, 2022

0.4.0

Sep 8, 2021

0.3.1

May 1, 2021

0.3.0

Feb 6, 2021

0.2.4

Jan 26, 2021

0.2.3

Aug 20, 2019

0.2.2

Jun 19, 2019

0.2.1

Jun 14, 2019

0.2.0

May 27, 2019

This version

0.1.2

May 23, 2019

0.1.1

May 23, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corpy-0.1.2.tar.gz (27.6 kB view hashes)

Uploaded May 23, 2019 Source

Built Distribution

corpy-0.1.2-py3-none-any.whl (75.5 kB view hashes)

Uploaded May 23, 2019 Python 3

Hashes for corpy-0.1.2.tar.gz

Hashes for corpy-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`96ec8943fff81fc277ad99f4929a2df41a69ca027693154b6b648f0ebc9611f2`
MD5	`e3919e5a8d7281be19db65b15fce430f`
BLAKE2b-256	`a20eecf253af6e3fadb16c5d1af36e545a66110653d642c388d0a364ffff0468`

Hashes for corpy-0.1.2-py3-none-any.whl

Hashes for corpy-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1438a42e281a7aeb05e0903155f394db36dd35169f1a4a4988de4131cd437818`
MD5	`5993fefa3cae6e9b33befa61b117e418`
BLAKE2b-256	`b935a183ca7f214cd2293deb1ee5711dac38de548faa8f317c3cfeb9a0b99da4`