
Encoding tools for DDHI

A collection of command-line utilities to assist in the creation of TEI-encoded oral history interviews. Part of the Dartmouth Digital History Initiative.

DDHI Encoder

The ddhi-encoder package is being developed to assist encoders in the DDHI project in encoding oral history interview transcripts in TEI. At present, it contains two command-line utilities:

  1. ddhi_convert: convert a Dartmouth DVP transcript from docx to tei.xml.
  2. ddhi_tag: perform named-entity tagging on a DDHI TEI transcription.


You can use pip to install this package:

pip install ddhi-encoder

To perform named-entity tagging with ddhi_tag, you will need a spaCy model. Before running ddhi_tag, install spaCy's small English model:

python -m spacy download en_core_web_sm

See the spaCy documentation for more information on models.
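
If you are unsure whether the model is already present, a quick check like the following may help. This is a minimal sketch and not part of ddhi-encoder: the helper name spacy_model_installed is hypothetical, and it relies on the fact that models fetched with `python -m spacy download` are installed as ordinary importable Python packages.

```python
import importlib.util


def spacy_model_installed(model_name: str = "en_core_web_sm") -> bool:
    """Return True if a package with this name can be imported here.

    Hypothetical helper for illustration; it is not provided by ddhi-encoder.
    """
    # spaCy models are installed as regular Python packages, so looking up
    # the import spec is enough to tell whether one is available.
    return importlib.util.find_spec(model_name) is not None


if __name__ == "__main__":
    if spacy_model_installed():
        print("en_core_web_sm is installed")
    else:
        print("en_core_web_sm is missing; run: python -m spacy download en_core_web_sm")
```

Run this in the same Python environment where you installed ddhi-encoder, since the check only sees packages available to the interpreter that executes it.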

Download files

Files for ddhi-encoder, version 1.0.5
Filename: ddhi_encoder-1.0.5-py2.py3-none-any.whl (14.7 kB)
File type: Wheel
Python version: py2.py3
