Creating VRT corpora

Project description

vrt_spacy

A Python class for creating VRT-annotated corpora (VRT: verticalized text, one token per line). The package is still in a very early testing stage.

Install by typing:

pip install vrt_spacy

Usage Example:

from vrt import Corpus, S, Text
from vrt_spacy import Annotate

with Corpus("~", "meinkorpus", 4, "text_name") as c:
    # Annotate raw text automatically with a spaCy model (here: German).
    annotate = Annotate(c, spacymodel="de_core_news_md")
    annotate("Das hier ist mein Text", text_name="Text1")
    # Alternatively, build a text by hand: one text, one sentence, one token.
    with Text(c, text_name="Text2") as t:
        with S(c) as s:
            s.writep("Test", "TAG", "TAG", "Lemma")
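
For orientation, a VRT file holds one token per line, with the token's attributes separated by tabs and the structural units written as XML-style tags around them. The sketch below shows how nested context managers can produce that layout. It is not the vrt or vrt_spacy implementation: it uses only the standard library, and the helper names `element` and `write_token` are hypothetical.

```python
import io
from contextlib import contextmanager
from xml.sax.saxutils import quoteattr


@contextmanager
def element(out, tag, **attrs):
    """Write <tag ...> on entry and </tag> on exit (hypothetical helper)."""
    rendered = "".join(f" {key}={quoteattr(value)}" for key, value in attrs.items())
    out.write(f"<{tag}{rendered}>\n")
    try:
        yield out
    finally:
        # The closing tag is written even if the body raises.
        out.write(f"</{tag}>\n")


def write_token(out, *columns):
    """Write one token line: its attributes separated by tabs."""
    out.write("\t".join(columns) + "\n")


buffer = io.StringIO()
with element(buffer, "text", text_name="Text2"):
    with element(buffer, "s"):
        write_token(buffer, "Test", "TAG", "TAG", "Lemma")

vrt_output = buffer.getvalue()
print(vrt_output)
```

Because each `with` block closes its own tag on exit, the nesting of the Python code mirrors the nesting of the resulting markup.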

Features:

  • Represents Corpus, Text, P and S attributes
  • Integrates spaCy to generate a VRT representation of texts automatically
  • Uses context managers to represent the XML hierarchy
  • Reduces text to utf8mb3 (UTF-8 limited to three bytes per character) and checks formatting compatibility
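
The page does not say how the reduction to utf8mb3 is carried out. As a rough illustration only, the sketch below assumes it means dropping or replacing every character that needs four UTF-8 bytes, that is, every code point above U+FFFF. The function names `reduce_to_utf8mb3` and `is_utf8mb3` are hypothetical and are not part of vrt_spacy.

```python
def reduce_to_utf8mb3(text, replacement=""):
    """Replace characters that need more than three UTF-8 bytes (hypothetical helper)."""
    # Code points above U+FFFF are exactly those encoded with four bytes.
    return "".join(ch if ord(ch) <= 0xFFFF else replacement for ch in text)


def is_utf8mb3(text):
    """Return True if every character fits into at most three UTF-8 bytes."""
    return all(len(ch.encode("utf-8")) <= 3 for ch in text)


sample = "Größe 😀 ok"
cleaned = reduce_to_utf8mb3(sample)
print(cleaned, is_utf8mb3(sample), is_utf8mb3(cleaned))
```

Umlauts and other characters from the Basic Multilingual Plane pass through unchanged; only supplementary-plane characters such as most emoji are affected.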

Download files

Source Distribution

vrt_spacy-0.0.1.tar.gz (2.5 kB)

Built Distribution

vrt_spacy-0.0.1-py3-none-any.whl (14.8 kB, Python 3)
