Extracts tables from .docx files and saves them as csv or xlsx
Project description
```
Usage: docx2csv [OPTIONS] FILENAME
docx to csv convertor (http://github.com/ivbeg/docx2csv)
Extracts tables from DOCX files as CSV or XLSX.
Use command: "docx2csv convert <filename>" to run extraction. It will
create files like filename_1.csv, filename_2.csv for each table found.
Options:
--format TEXT Output format: CSV, XLSX
--singlefile TEXT Outputs single XLS file with multiple sheets: True or False
--sizefilter INTEGER Filters table by size number of rows
--help Show this message and exit.
```
## Examples
docx2csv --format csv --sizefilter 3 CP_CONTRACT_160166.docx
Extracts tables from file CP_CONTRACT_160166.docx with number of rows > 3 and
saves results as CSV files.
##Requirements
* click https://github.com/pallets/click
* xlwt https://github.com/python-excel/xlwt
* python-docx https://github.com/python-openxml/python-docx
* openpyxl https://bitbucket.org/openpyxl/openpyxl/src
## Acknowledgements
Thanks to Vsevolod Oparin (https://www.facebook.com/vsevolod.oparin) for optimized "extract_table" code
.. :changelog:
History
=======
0.1.0 (2018-01-14)
------------------
* First public release on PyPI and updated github code
Usage: docx2csv [OPTIONS] FILENAME
docx to csv convertor (http://github.com/ivbeg/docx2csv)
Extracts tables from DOCX files as CSV or XLSX.
Use command: "docx2csv convert <filename>" to run extraction. It will
create files like filename_1.csv, filename_2.csv for each table found.
Options:
--format TEXT Output format: CSV, XLSX
--singlefile TEXT Outputs single XLS file with multiple sheets: True or False
--sizefilter INTEGER Filters table by size number of rows
--help Show this message and exit.
```
## Examples
docx2csv --format csv --sizefilter 3 CP_CONTRACT_160166.docx
Extracts tables from file CP_CONTRACT_160166.docx with number of rows > 3 and
saves results as CSV files.
##Requirements
* click https://github.com/pallets/click
* xlwt https://github.com/python-excel/xlwt
* python-docx https://github.com/python-openxml/python-docx
* openpyxl https://bitbucket.org/openpyxl/openpyxl/src
## Acknowledgements
Thanks to Vsevolod Oparin (https://www.facebook.com/vsevolod.oparin) for optimized "extract_table" code
.. :changelog:
History
=======
0.1.0 (2018-01-14)
------------------
* First public release on PyPI and updated github code
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distributions
docx2csv-0.1.0-py3.6.egg
(6.0 kB
view hashes)
Close
Hashes for docx2csv-0.1.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 05a5968442fc5adcce846c2e9de0179d7c6f3b975be3d168f21b0596f0a672c3 |
|
MD5 | 72acc6b17b3568914bb30e406b438957 |
|
BLAKE2b-256 | b5080c4227c79ba88d6f1a2af4d8847bef1b0935fcac9f8bb51200b326f6c974 |