Decision letter and author response document parser.
Project description
decision-letter-parser
Parse docx file containing decision letter and author response content and produce output in other formats.
The contents of the .docx
must follow a specific formatting scheme for it to be separated out into multiple JATS XML <sub-article>
tags, and for figure and table data to be recognised.
Requirements
Parsing .docx
files requires pandoc
, and there are two options to make it available.
- Install pandoc so it can be executed locally, or
- Install docker and
pandoc
can be called using thedocker_image
specified in theletterparser.cfg
configuration file
Example usage
This library is meant to be integrated into another operational system, however the following are examples using interactive Python:
Example 1 - Convert a test fixture zip containing a .docx
and asset files
>>> from letterparser import generate
>>> jats_xml = generate.generate_xml_from_file("tests/test_data/elife-00666.zip")
Example 2 - Convert just a .docx
file only
>>> from letterparser import generate
>>> jats_xml = generate.generate_xml_from_file("tests/test_data/elife-68041.docx")
License
Licensed under MIT.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for letterparser-0.2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 72870a5389881103cce3b7a2507da881f7d1ed42c2f03996b7dd3c49157900f6 |
|
MD5 | a949db1f1bd8432eb7cf01b6c3b53a54 |
|
BLAKE2b-256 | 04924dac9aff294c9df29d4dbe6799548895f2260a22d3e99857bb43e649a750 |