creating vrt corpora
Project description
vrt_spacy
Python class for creating vrt-annotated corpora. Still in very early testing stage.
Install by typing:
pip install vrt_spacy
Usage Example:
from vrt import Corpus, S, Text
from vrt_spacy import Annotate
with Corpus("~","meinkorpus",4,"text_name") as c:
annotate = Annotate(c, spacymodel="de_core_news_md")
annotate("Das hier ist mein Text", text_name="Text1")
with Text(c, text_name="Text2") as t:
with S(c) as s:
s.writep("Test","TAG","TAG","Lemma")
Features:
- Represent Corpus, Text, P and S Attributes
- Integration of spacy for automatic generation of a vrt-representation of texts
- Using Context Manager for xml-hierarchy representation
- Reduces to utf8mb3 and checks formatting compatibility
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
vrt_spacy-0.0.1.tar.gz
(2.5 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
vrt_spacy-0.0.1-py3-none-any.whl
(14.8 kB
view details)
File details
Details for the file vrt_spacy-0.0.1.tar.gz.
File metadata
- Download URL: vrt_spacy-0.0.1.tar.gz
- Upload date:
- Size: 2.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.8.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d99b386c430de86dd20d7767595da8f4124694dad468b47efff9d96ed6654deb
|
|
| MD5 |
ae98f1ff49832609e36fc8ea00f7251e
|
|
| BLAKE2b-256 |
40de0cb9e200e5cf02274f4772be42af42cef537f27bfa85e68f93766eb09c19
|
File details
Details for the file vrt_spacy-0.0.1-py3-none-any.whl.
File metadata
- Download URL: vrt_spacy-0.0.1-py3-none-any.whl
- Upload date:
- Size: 14.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.8.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b84de5d2486af58a1965614d63a1bbebbde3e4e53739e53941469f87c1211b4b
|
|
| MD5 |
c6883ab55d84be37b69b24226bb005bc
|
|
| BLAKE2b-256 |
a482a4989292d77e7c9428dafba4b03c49be7152d8f08d82217a2a19e677731a
|