Creating VRT corpora

vrt_spacy

A Python class for creating VRT-annotated corpora. The package is still at a very early testing stage.

Install by typing:

pip install vrt_spacy

Usage Example:

from vrt import Corpus, S, Text
from vrt_spacy import Annotate

with Corpus("~", "meinkorpus", 4, "text_name") as c:
    # Annotate a raw string with spaCy and write it to the corpus as "Text1".
    annotate = Annotate(c, spacymodel="de_core_news_md")
    annotate("Das hier ist mein Text", text_name="Text1")

    # Build a second text by hand: one sentence containing one token line.
    with Text(c, text_name="Text2") as t:
        with S(c) as s:
            s.writep("Test", "TAG", "TAG", "Lemma")
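The nested with-blocks in the example mirror the XML-like hierarchy of a VRT file, where structural tags enclose tab-separated token lines. The following is a rough standalone sketch of that idea, not the vrt_spacy implementation: the helpers `element`, `token_line` and `build_example` are hypothetical names, and the exact tags and attributes written by the package may differ.

```python
import io
from contextlib import contextmanager


@contextmanager
def element(out, tag, **attrs):
    """Write <tag ...> on entry and </tag> on exit (hypothetical helper)."""
    attr_text = "".join(f' {key}="{value}"' for key, value in attrs.items())
    out.write(f"<{tag}{attr_text}>\n")
    try:
        yield out
    finally:
        # The closing tag is written even if the body raises,
        # so the hierarchy stays balanced.
        out.write(f"</{tag}>\n")


def token_line(*columns):
    """Return one token line: positional attributes separated by tabs."""
    return "\t".join(columns) + "\n"


def build_example():
    """Build a tiny VRT-style fragment in memory and return it as a string."""
    out = io.StringIO()
    with element(out, "text", text_name="Text2"):
        with element(out, "s"):
            out.write(token_line("Test", "TAG", "TAG", "Lemma"))
    return out.getvalue()


print(build_example())
```

Because each closing tag is emitted by the context manager itself, a block cannot be left open by accident, which is presumably the motivation for the context-manager design.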

Features:

  • Represents the corpus, its texts, and positional (P) and structural (S) attributes
  • Integrates spaCy to generate the VRT representation of a text automatically
  • Uses context managers to represent the XML hierarchy
  • Reduces text to utf8mb3 (characters that need at most three bytes in UTF-8) and checks formatting compatibility
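
The utf8mb3 reduction mentioned above can be illustrated with a short sketch. It assumes that "reducing" means dropping or replacing every character outside the Basic Multilingual Plane, since those are the ones that need four bytes in UTF-8; the function names `reduce_to_utf8mb3` and `is_utf8mb3` are hypothetical and are not taken from the package.

```python
def is_utf8mb3(text):
    """Return True if every character fits in at most three UTF-8 bytes."""
    return all(ord(ch) <= 0xFFFF for ch in text)


def reduce_to_utf8mb3(text, replacement=""):
    """Replace characters above U+FFFF (four-byte UTF-8) with `replacement`."""
    return "".join(ch if ord(ch) <= 0xFFFF else replacement for ch in text)


sample = "Hallo \U0001F600 Welt"
print(is_utf8mb3(sample))
print(reduce_to_utf8mb3(sample, replacement="?"))
```

Umlauts and other characters inside the Basic Multilingual Plane pass through unchanged; only characters such as emoji are affected.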

Download files

Source Distribution

vrt_spacy-0.0.1.tar.gz (2.5 kB)

Built Distribution

vrt_spacy-0.0.1-py3-none-any.whl (14.8 kB)