Python wrapper for the CETEMPublico corpus
Project description
cetem_publico is a Python wrapper for the CETEMPublico corpus. It takes care of downloading, storing and importing the corpus into NLTK.
Installing
Install and update using pip:
pip install [--user] cetem_publico
A Simple Example
import cetem_publico
cetem_publico.download() # downloads small 10KB file
# or
cetem_publico.download(full=True) # downloads full 12GB file
cp = cetem_publico.load()
print(cp.tagged_sents())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
cetem_publico-0.0.10.tar.gz
(3.6 kB
view hashes)
Built Distribution
Close
Hashes for cetem_publico-0.0.10-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9444baa8faf8b04c218167fa0cbd7d2c931b0c6ae40e1f1d005af9c8d8939157 |
|
MD5 | 34574046b3761a94a692e45bb6733947 |
|
BLAKE2b-256 | 4263f44b3376e5c1977c5c53239174ff4350c4e383d77c9682dfc878f1f0b650 |