Decision letter and author response document parser.
Project description
decision-letter-parser
Parse docx file containing decision letter and author response content and produce output in other formats.
The contents of the .docx
must follow a specific formatting scheme for it to be separated out into multiple JATS XML <sub-article>
tags, and for figure and table data to be recognised.
Requirements
Parsing .docx
files requires pandoc
, and there are two options to make it available.
- Install pandoc so it can be executed locally, or
- Install docker and
pandoc
can be called using thedocker_image
specified in theletterparser.cfg
configuration file
Example usage
This library is meant to be integrated into another operational system, however the following are examples using interactive Python:
Example 1 - Convert a test fixture zip containing a .docx
and asset files
>>> from letterparser import generate
>>> jats_xml = generate.generate_xml_from_file("tests/test_data/elife-00666.zip")
Example 2 - Convert just a .docx
file only
>>> from letterparser import generate
>>> jats_xml = generate.generate_xml_from_file("tests/test_data/elife-68041.docx")
License
Licensed under MIT.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for letterparser-0.3.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d7e982a7166976d3a168a95df67a7536f104f6984fb0529a81a72249b2da21d |
|
MD5 | 1c487a903141eba3cad01e464eb08adb |
|
BLAKE2b-256 | f313422bb1ee537df1daa4c6663c0122df4c86aceac27987d617aa9ff5d8a03e |