Skip to main content

creating vrt corpora

Project description

vrt_generator

Python class for creating vrt-annotated corpora. Still in very early testing stage.

Install by typing:

pip install vrt_generator

Usage Example:

from vrt import Corpus,  S, Text
with Corpus("~","meinkorpus",4,"text_name") as c:
    with Text(c, text_name="Text2") as t:
        with S(c) as s:
            s.writep("Test","TAG","TAG","Lemma")  

Features:

  • Represent Corpus, Text, P and S Attributes
  • Integration of spacy for automatic generation of a vrt-representation of texts
  • Using Context Manager for xml-hierarchy representation
  • Reduces to utf8mb3 and checks formatting compatibility
  • If you want to add texts that are automatically POS-Tagged with Spacy, you might look at vrt_spacy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for vrt-generator, version 0.0.6
Filename, size File type Python version Upload date Hashes
Filename, size vrt_generator-0.0.6-py3-none-any.whl (3.2 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size vrt_generator-0.0.6.tar.gz (3.1 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page