Creating VRT corpora

vrt_generator

A Python class for creating VRT-annotated corpora. The package is still in a very early testing stage.

Install by typing:

pip install vrt_generator

Usage example:

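from vrt import Corpus, S, Text

# Nested context managers mirror the XML hierarchy: corpus > text > s.
with Corpus("~", "meinkorpus", 4, "text_name") as c:
    with Text(c, text_name="Text2") as t:
        with S(c) as s:
            # Write a single token line with its annotation columns.
            s.writep("Test", "TAG", "TAG", "Lemma")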
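
To make the nesting in the usage example more concrete, here is a minimal, standalone sketch of how context managers can emit a VRT-like structure. It does not use vrt_generator and is not its implementation: the helpers `element` and `write_token`, the `<corpus>`, `<text>` and `<s>` tag names, and the tab-separated token layout are all assumptions made for illustration.

```python
import io
from contextlib import contextmanager
from xml.sax.saxutils import quoteattr


@contextmanager
def element(out, tag, **attrs):
    """Hypothetical helper: write <tag ...> on entry and </tag> on exit."""
    attr_text = "".join(
        f" {name}={quoteattr(value)}" for name, value in attrs.items()
    )
    out.write(f"<{tag}{attr_text}>\n")
    try:
        yield out
    finally:
        # The closing tag is written even if the body raises.
        out.write(f"</{tag}>\n")


def write_token(out, *columns):
    """Hypothetical helper: one token per line, columns separated by tabs."""
    out.write("\t".join(columns) + "\n")


buffer = io.StringIO()
with element(buffer, "corpus", name="meinkorpus"):
    with element(buffer, "text", text_name="Text2"):
        with element(buffer, "s"):
            write_token(buffer, "Test", "TAG", "TAG", "Lemma")

vrt_output = buffer.getvalue()
print(vrt_output)
```

Because each closing tag is written when its `with` block exits, the output stays well nested without tracking open elements by hand.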

Features:

  • Represents Corpus, Text, P and S attributes
  • Integrates spaCy to generate a VRT representation of texts automatically
  • Uses context managers to represent the XML hierarchy
  • Reduces text to utf8mb3 (characters that need at most three bytes in UTF-8) and checks formatting compatibility
  • To add texts that are automatically POS-tagged with spaCy, have a look at vrt_spacy
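
As an illustration of the utf8mb3 point in the list above, the following sketch shows one way such a reduction could work: it drops or replaces every character that would need four bytes in UTF-8. The function names `reduce_to_utf8mb3` and `is_safe_token` are hypothetical and not part of vrt_generator, and the tab/newline check is only an assumption about what a formatting check for token fields might cover.

```python
def reduce_to_utf8mb3(text, replacement=""):
    """Hypothetical helper: keep only characters in the Basic Multilingual
    Plane (code points up to U+FFFF), which need at most three UTF-8 bytes."""
    return "".join(ch if ord(ch) <= 0xFFFF else replacement for ch in text)


def is_safe_token(token):
    """Assumed check: a token field must be non-empty and must not contain
    the tab or newline characters that separate columns and lines."""
    return bool(token) and not any(ch in token for ch in "\t\r\n")


cleaned = reduce_to_utf8mb3("Käse 😀 ok")
print(repr(cleaned))
print(is_safe_token("Käse"), is_safe_token("two\twords"))
```

Passing a `replacement` such as `"?"` keeps the token length stable, which can matter if character offsets are stored alongside the corpus.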


