Python wrapper for the CETEMPublico corpus
Project description
cetem-publico is a Python wrapper for the CETEMPublico corpus. It takes care of downloading, storing and importing the corpus into NLTK.
THIS IS STILL A WORK IN PROGRESS, API MIGHT BREAK WITHOUT WARNING.
Installing
Install and update using pip:
pip install [--user] cetem-publico
A Simple Example
import CETEMPublico
cp = CETEMPublico.load() # loads a small 10KB sample
# or
cp = CETEMPublico.load(full=True) # loads the full 12GB
print(cp.tagged_sents())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
cetem-publico-0.0.15.tar.gz
(4.0 kB
view hashes)
Built Distribution
Close
Hashes for cetem_publico-0.0.15-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0b17f3fa2c61d42739fa0878ed698ddbbd312e408da044e30ec1784f566e82bc |
|
MD5 | 8c9fb9670103274644ecfc91cad3099c |
|
BLAKE2b-256 | f759589d2eb5088707fcc5ad6383be506229a8935885f0974388c3ce73d7165e |