Skip to main content

Dramatist is a project for parsing plays from DraCor (https://dracor.org/) into a data structure which allows to access the text of the play "character" perfect, that is, character start and end positions for various parts, for example, act or scenes, can be retrieved. This also allows for functionality like getting the act and scene number for a character position.

Project description

Readme

Dramatist is a project for parsing plays from DraCor into a data structure which allows to access the text of the play "character" perfect, that is, character start and end positions for various parts, for example, act or scenes, can be retrieved. This also allows for functionality like getting the act and scene number for a character position.

Installation

pip install Dramatist

Usage

There are to ways to use Dramatist. The recommended way is to use Dramatist in code in other projects. The second option, from the command line, is only for illustration purposes, and rather limited.

In code

from dramatist.core.Dramatist import Dramatist

dramatist = Dramatist()

# load from local file
drama = dramatist.from_file(path_to_play_xml_file)

# load from DraCor API using corpus name and play name
drama = await dramatist.from_api_by_name('ger', 'goethe-iphigenie-auf-tauris')

# load from DraCor API using play id
drama = await dramatist.from_api_by_id(play_id)

# get the text of the drama
drama_test = drama.get_text()

# get the number of acts
act_count = drama.act_count

# get the number of total scenes for all acts
scene_count = drama.scene_count

# get the start and end position of a scene
scene_start, scene_end = drama.get_range_for_scene(act_nr, scene_nr)

Command line

The command line interface is very limited. Dramatist is meant to be used in code in other projects. If the --output-folder-path the resulting the data structure is saved to a json file. Otherwise the drama text is printed to the console.

Local file

dramatist --drama-path path-to-xml-drama-file

DraCor API with corpus name and play name

dramatist --corpus-name ger --play-name goethe-iphigenie-auf-tauris

DraCor API with play id

dramatist --play-id ger000001

Functionality (Not complete)

  • get_text()
    • Get the text of the drama.
  • get_text_with_markup()
    • Get the text of the drama a list of markups.
  • get_text_for_scene_in_blocks(act_nr, scene_nr)
    • Get a list of text blocks for the given scene in the given act. Blocks that are longer than the given maximum length (in words) are split into multiple blocks.
  • get_text_for_scene_by_speaker(act_nr, scene_nr)
    • Get a list of text blocks by speaker for the given scene in the given act. This means that for every speaker change a new text block is created. Blocks that are longer than the given maximum length (in words) are split into multiple blocks.
  • get_scene_act_for_position(char_pos)
    • Return a tuple of act and scene number for the given position.
  • get_range_for_scene(act_nr, scene_nr)
    • Return a tuple of character start and end position for the given act and scene number.
  • get_structure()
    • Get the structure of the drama, for example:
      1. Act (0:500)
      1. Scene (10:50)
      where the numbers in parentheses represent the character start and end positions.
      

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dramatist-0.1.0.tar.gz (17.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dramatist-0.1.0-py3-none-any.whl (23.2 kB view details)

Uploaded Python 3

File details

Details for the file dramatist-0.1.0.tar.gz.

File metadata

  • Download URL: dramatist-0.1.0.tar.gz
  • Upload date:
  • Size: 17.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for dramatist-0.1.0.tar.gz
Algorithm Hash digest
SHA256 0ca6d272b7f1aeab9176b4e70d184e440756e57bc484fa932e1f966a67c54bdf
MD5 167a0fc1aeb9bb9887002f19656b6809
BLAKE2b-256 7ef3ee6dba85105a8c9c0f783b497154dac88120cafdcf96a012070eeecfb3e2

See more details on using hashes here.

File details

Details for the file dramatist-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: dramatist-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 23.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for dramatist-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 62c3a7755b01d58d4ff4efbf9421f1324de50a38bf14a48fc6970956a5b1c440
MD5 f1e014ed41f4ba97901986e10ab08570
BLAKE2b-256 f5bf50b43b5bca84e2bc6969f7a6d93550823f38008c2b3c720e2ddb9a2f23d1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page