Python package containing various utilities relevant in the field of digital humanities.
Project description
Digital Humanities Utilities
Python 3.6+ package containing various utilities relevant in the field of digital humanities.
$ pip install dh-utils
Some examples
Tag languages in a given string based on its script:
>>> from dh_utils import tei as t
>>> t.tag('Hebr', 'A line contaning the hebrew אגוז מלך inline')
'A line contaning the hebrew <foreign xml:lang="he-Hebr">אגוז מלך</foreign> inline'
It is also possible to tag a given language based on its script in a TEI XML document (NB: file will be overwritten!):
>>> t.tag_xml('path/to/file.xml', 'Arab')
Decompose any unicode string:
>>> from dh_utils import unicode as u
>>> u.decompose('λόγος')
λ U+03bb GREEK SMALL LETTER LAMDA
ο U+03bf GREEK SMALL LETTER OMICRON
́ U+0301 COMBINING ACUTE ACCENT
γ U+03b3 GREEK SMALL LETTER GAMMA
ο U+03bf GREEK SMALL LETTER OMICRON
ς U+03c2 GREEK SMALL LETTER FINAL SIGMA
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
dh-utils-0.1.4.tar.gz
(3.3 kB
view hashes)