creating vrt corpora
Project description
vrt_spacy
Python class for creating VRT-annotated corpora. Still in a very early testing stage.
Install by typing:
pip install vrt_spacy
Usage Example:
from vrt import Corpus, S, Text
from vrt_spacy import Annotate

with Corpus("~", "meinkorpus", 4, "text_name") as c:
    annotate = Annotate(c, spacymodel="de_core_news_md")
    annotate("Das hier ist mein Text", text_name="Text1")
    with Text(c, text_name="Text2") as t:
        with S(c) as s:
            s.writep("Test", "TAG", "TAG", "Lemma")
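The nested with-blocks in the usage example mirror the XML nesting of a VRT file: structural elements open and close around token lines, and each token line holds its positional attributes separated by tabs. The following is a minimal sketch of that idea using only the standard library. It is not the implementation used by vrt or vrt_spacy; the helper names element and write_token are hypothetical, and the exact attribute layout the package writes may differ.

```python
import io
from contextlib import contextmanager
from xml.sax.saxutils import quoteattr


@contextmanager
def element(out, tag, **attrs):
    """Hypothetical helper: write <tag ...> on entry and </tag> on exit."""
    attr_text = "".join(
        f" {name}={quoteattr(str(value))}" for name, value in attrs.items()
    )
    out.write(f"<{tag}{attr_text}>\n")
    try:
        yield out
    finally:
        # The closing tag is written even if the body raises,
        # so the hierarchy stays well-formed.
        out.write(f"</{tag}>\n")


def write_token(out, *columns):
    """Hypothetical helper: one token per line, columns separated by tabs."""
    out.write("\t".join(columns) + "\n")


buffer = io.StringIO()
with element(buffer, "text", text_name="Text2"):
    with element(buffer, "s"):
        write_token(buffer, "Test", "TAG", "TAG", "Lemma")

vrt_output = buffer.getvalue()
print(vrt_output)
```

Because the closing tag is emitted in the finally clause, leaving a with-block always closes the element it opened, which is the main reason to model the hierarchy with context managers.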
Features:
- Represents corpus, text, P (positional) and S (structural) attributes
- Integrates spaCy to generate the VRT representation of a text automatically
- Uses context managers to represent the XML hierarchy
- Reduces text to utf8mb3 (characters that fit in at most three UTF-8 bytes) and checks formatting compatibility
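To illustrate the utf8mb3 point: utf8mb3 is the MySQL name for UTF-8 limited to three bytes per character, which covers code points up to U+FFFF and excludes, for example, most emoji. The sketch below shows one way such a reduction could work. The function names and the choice to drop (rather than substitute) unsupported characters are assumptions for illustration, not the package's actual behaviour.

```python
def reduce_to_utf8mb3(text, replacement=""):
    """Hypothetical helper: replace every character above U+FFFF.

    Code points above U+FFFF need four bytes in UTF-8 and therefore
    do not fit into utf8mb3.
    """
    return "".join(ch if ord(ch) <= 0xFFFF else replacement for ch in text)


def is_utf8mb3(text):
    """Hypothetical helper: True if no character lies above U+FFFF."""
    return all(ord(ch) <= 0xFFFF for ch in text)


sample = "Gr\u00fc\u00dfe \U0001F600!"  # German text followed by an emoji
cleaned = reduce_to_utf8mb3(sample)
print(is_utf8mb3(sample), is_utf8mb3(cleaned))
```

Passing a replacement such as "?" keeps the token length visible instead of silently dropping characters.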
Download files
Source Distribution
vrt_spacy-0.0.1.tar.gz (2.5 kB)
Built Distribution
vrt_spacy-0.0.1-py3-none-any.whl (14.8 kB)
Hashes for vrt_spacy-0.0.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | b84de5d2486af58a1965614d63a1bbebbde3e4e53739e53941469f87c1211b4b
MD5 | c6883ab55d84be37b69b24226bb005bc
BLAKE2b-256 | a482a4989292d77e7c9428dafba4b03c49be7152d8f08d82217a2a19e677731a
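A downloaded wheel can be checked against the digests listed above with Python's hashlib module. This is a minimal sketch; the function name file_digests is hypothetical, and BLAKE2b-256 is computed here as BLAKE2b with a 32-byte digest.

```python
import hashlib


def file_digests(path, chunk_size=65536):
    """Hypothetical helper: return SHA256, MD5 and BLAKE2b-256 hex digests."""
    hashers = {
        "SHA256": hashlib.sha256(),
        "MD5": hashlib.md5(),
        "BLAKE2b-256": hashlib.blake2b(digest_size=32),
    }
    with open(path, "rb") as handle:
        # Read in chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            for hasher in hashers.values():
                hasher.update(chunk)
    return {name: hasher.hexdigest() for name, hasher in hashers.items()}
```

Calling file_digests("vrt_spacy-0.0.1-py3-none-any.whl") on the downloaded file and comparing the "SHA256" entry with the value in the table confirms that the file arrived unmodified.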