creating vrt corpora
Project description
vrt_generator
Python class for creating vrt-annotated corpora. Still in very early testing stage.
Install by typing:
pip install vrt_generator
Usage Example:
from vrt import Corpus, S, Text
with Corpus("~","meinkorpus",4,"text_name") as c:
with Text(c, text_name="Text2") as t:
with S(c) as s:
s.writep("Test","TAG","TAG","Lemma")
Features:
- Represent Corpus, Text, P and S Attributes
- Integration of spacy for automatic generation of a vrt-representation of texts
- Using Context Manager for xml-hierarchy representation
- Reduces to utf8mb3 and checks formatting compatibility
- If you want to add texts that are automatically POS-Tagged with Spacy, you might look at vrt_spacy
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
vrt_generator-0.0.6.tar.gz
(3.1 kB
view details)
Built Distribution
File details
Details for the file vrt_generator-0.0.6.tar.gz
.
File metadata
- Download URL: vrt_generator-0.0.6.tar.gz
- Upload date:
- Size: 3.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a7df328fad85ad72ae84f53ba1e59fd1dc6173ab24f8f05ba218283eae562fa0 |
|
MD5 | d766ed70761adbe060c8864d2bef8c22 |
|
BLAKE2b-256 | afe0d366fb86534bca6003da38a0a983affb7d4c759add4cd193b8c2222f480e |
File details
Details for the file vrt_generator-0.0.6-py3-none-any.whl
.
File metadata
- Download URL: vrt_generator-0.0.6-py3-none-any.whl
- Upload date:
- Size: 3.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3981967b8e8a269e08aa923bbcec3d93a4903fcd3c6a2a7e07197e43c4b2de17 |
|
MD5 | 0cefd17198f9a4c1c84b2ec63356ebe3 |
|
BLAKE2b-256 | 0ea796f0fda49bb2b3c01ce130097336163d8b625d3082cf29cd2512fc907b32 |