Encoding tools for DDHI
A collection of command-line utilities to assist in the creation of TEI-encoded oral history interviews. Part of the Dartmouth Digital History Initiative.
The ddhi-encoder package is being developed to assist encoders in the DDHI project in encoding oral history interview transcripts in TEI. At present, it contains two command-line utilities:
- ddhi_convert: convert a Dartmouth DVP transcript from docx to tei.xml.
- ddhi_tag: perform named-entity tagging on a DDHI TEI transcription.
You can use pip to install this package:
pip install ddhi-encoder
To peform named-entity tagging with ddhi_tag, you will need a Spacy model. Before running ddhi_tag, install Spacy’s small English model:
python -m spacy download en_core_web_sm
See the Spacy documentation for more information.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, size||File type||Python version||Upload date||Hashes|
|Filename, size ddhi_encoder-1.0.5-py2.py3-none-any.whl (14.7 kB)||File type Wheel||Python version py2.py3||Upload date||Hashes View|
Hashes for ddhi_encoder-1.0.5-py2.py3-none-any.whl