Utilty functions to work with TEI Documents
Project description
acdh-tei-pyutils
Utilty functions to work with TEI Documents
install
run pip install acdh-tei-pyutils
usage
some examples on how to use this package
parse an XML/TEI Document from and URL, string or file:
from acdh_tei_pyutils.tei import TeiReader
doc = TeiReader("https://raw.githubusercontent.com/acdh-oeaw/acdh-tei-pyutils/main/acdh_tei_pyutils/files/tei.xml")
print(doc.tree)
>>> <Element {http://www.tei-c.org/ns/1.0}TEI at 0x7ffb926f9c40>
doc = TeiReader("./acdh_tei_pyutils/files/tei.xml")
doc.tree
>>> <Element {http://www.tei-c.org/ns/1.0}TEI at 0x7ffb926f9c40>
write the current XML/TEI tree object to file
doc.tree_to_file("out.xml")
>>> 'out.xml'
see acdh_tei_pyutils/cli.py for further examples
command line scripts
Batch process a collection of XML/Documents by adding xml:id, xml:base next and prev attributes to the documents root element run:
add-attributes -g "/path/to/your/xmls/*.xml" -b "https://value/of-your/base.com"
add-attributes -g "../../xml/grundbuecher/gb-data/data/editions/*.xml" -b "https://id.acdh.oeaw.ac.at/grundbuecher"
Write mentions as listEvents into index-files:
mentions-to-indices -t "erwähnt in " -i "/path/to/your/xmls/indices/*.xml" -f "/path/to/your/xmls/editions/*.xml"
Write mentions as listEvents of index-files and copy enriched index entries into files
denormalize-indices -f "../../xml/schnitzler/schnitzler-tagebuch-data-public/editions/*.xml" -i "../../xml/schnitzler/schnitzler-tagebuch-data-public/indices/*.xml"
denormalize-indices -f "./data/*/*.xml" -i "./data/indices/*.xml" -m ".//*[@key]/@key" -x ".//tei:title[@level='a']/text()"
denormalize-indices -f "./data/*/*.xml" -i "./data/indices/*.xml" -m ".//*[@key]/@key" -x ".//tei:title[@level='a']/text()" -b pmb2121 -b pmb10815 -b pmb50
Register handle-ids and add them as tei:idno elements:
add-handles -g "../../xml/grundbuecher/gb-data/data/editions/*.xml" -user "user12.3456-01" -pw "verysecret" -hixpath ".//tei:publicationStmt"
develop
- install dev-dependencies:
pip install -r requirements_dev.txt
- install local package
pip install -e .
- before commiting run
flake8
to check linting andcoverage run -m pytest -v
to run the tests
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file acdh_tei_pyutils-1.6.tar.gz
.
File metadata
- Download URL: acdh_tei_pyutils-1.6.tar.gz
- Upload date:
- Size: 19.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
387f99bf5248bf0716d6a3e7ce049937a1e8f2754e454bbc84fc9b6cd876b5f0
|
|
MD5 |
f2c4f4a2af2454ce380232048657c994
|
|
BLAKE2b-256 |
90b57fdadb24c2e5bb410aa56ff8db32156b5c431624062ba233964f82e684b0
|
File details
Details for the file acdh_tei_pyutils-1.6-py3-none-any.whl
.
File metadata
- Download URL: acdh_tei_pyutils-1.6-py3-none-any.whl
- Upload date:
- Size: 19.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
94b586bb6c83dc583b7da51fb141fd56afee58f4f842950c7ac85fe01c8ca059
|
|
MD5 |
a318865d7437455e1949009ace64f3e3
|
|
BLAKE2b-256 |
00980fadf37f2ce6d41b19bb1bb121a90af115469e61b71b1adc0343dfee4bef
|