Decision letter and author response document parser.
Project description
decision-letter-parser
Parse docx file containing decision letter and author response content and produce output in other formats.
The contents of the .docx
must follow a specific formatting scheme for it to be separated out into multiple JATS XML <sub-article>
tags, and for figure and table data to be recognised.
Requirements
Parsing .docx
files requires pandoc
, and there are two options to make it available.
- Install pandoc so it can be executed locally, or
- Install docker and
pandoc
can be called using thedocker_image
specified in theletterparser.cfg
configuration file
Example usage
This library is meant to be integrated into another operational system, however the following are examples using interactive Python:
Example 1 - Convert a test fixture zip containing a .docx
and asset files
>>> from letterparser import generate
>>> jats_xml = generate.generate_xml_from_file("tests/test_data/elife-00666.zip")
Example 2 - Convert just a .docx
file only
>>> from letterparser import generate
>>> jats_xml = generate.generate_xml_from_file("tests/test_data/elife-68041.docx")
License
Licensed under MIT.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for letterparser-0.14.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3a5463dc04b003aa82506d325a67f185c686f9c6ce34ab3512fd0a0c998d836b |
|
MD5 | 69883d00279c7a3474e3af2b8044796f |
|
BLAKE2b-256 | b107efbdbafcf0d135d094199f69d368565985f9fd536ded94dfa650d3abc68a |