Command line tool to extract review changes from a docx file as plain text
Project description
docxreviews2txt
Command line tool to extract review changes from a docx file as plain text. It is useful when reviewing a PDF file as docx, and you need to share the changes as plain text.
How to install?
pip install docxreviews2txt
How to use it?
usage: docxreviews2txt [-h] [--version] docx
Command line tool to extract review changes from a docx file as plain text using HTML tags <ins> and <del>.
positional arguments:
docx input docx
options:
-h, --help show this help message and exit
--version show version
Example:
$ docxreviews2txt tests/lorem_ipsum.docx
txt reviews at file:///home/alan/src/docxreviews2txt/tests/lorem_ipsum_review.txt
$ cat /home/alan/src/docxreviews2txt/tests/lorem_ipsum_review.txt
Typos suggestions using HTML tags <ins> and <del>:
- dolor sit amet, consectetur <ins>Lorem ipsum</ins><del>adipiscing</del>
- sit amet, consectetur adipiscing<ins>s</ins> elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim <ins>do</ins>
- Ut enim ad minim <ins>Lorem</ins>veniam<ins>ipsum</ins>
- dolor sit amet, consectetur <del>adipiscing</del>
Known issues
The tool fails to capture changes in Docx files with text organized in tables (e.g., pdf2docx converts columns to tables).
References
This project takes inspiration from:
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file docxreviews2txt-0.4.6-py3-none-any.whl
.
File metadata
- Download URL: docxreviews2txt-0.4.6-py3-none-any.whl
- Upload date:
- Size: 5.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4466405c55d53a63846d98466f523ddb4d9a3ce7d2b325b1fdbd69698516962b |
|
MD5 | 10b32ec1ad38025db04cdf134bf50905 |
|
BLAKE2b-256 | c05fd7113add5a4d0a0a58c6b7cce8710f5f1274f67c0381cf5a99fcc9bf72a9 |