Extractify
Extractify is a command-line tool for converting documents in various formats (.pdf, .doc, .docx, .xlsx, .txt) to plain text. The tool creates a 'txt' subdirectory within the specified input directory and saves the plain text files with the same filenames but with a .txt extension.
Installation
Install Extractify using pip:
pip install extractify
Usage
To use Extractify, run the following command:
extractify <directory_with_non_text_files>
Replace <directory_with_non_text_files> with the path to the directory containing the documents you want to convert.
Extractify will create a 'txt' subdirectory within the input directory and save the plain text files there.
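The naming rule described above can be sketched in a few lines of Python. This is an illustration only, not Extractify's actual source: the function name `plan_outputs`, the `SUPPORTED` set, and the case-insensitive extension match are assumptions made for the example.

```python
from pathlib import Path

# Extensions named in this README. Matching them case-insensitively
# is an assumption of this sketch, not documented behavior.
SUPPORTED = {".pdf", ".doc", ".docx", ".xlsx", ".txt"}


def plan_outputs(input_dir):
    """Map each supported file in input_dir to its target under input_dir/txt.

    Hypothetical helper: it only computes paths and writes nothing.
    """
    input_dir = Path(input_dir)
    out_dir = input_dir / "txt"
    plan = {}
    for path in sorted(input_dir.iterdir()):
        # Skip subdirectories (including an existing 'txt' folder)
        # and files with other extensions.
        if path.is_file() and path.suffix.lower() in SUPPORTED:
            # Same filename, with the extension swapped for .txt.
            plan[path] = out_dir / (path.stem + ".txt")
    return plan
```

Under this rule, report.pdf and report.docx in the same directory would both map to txt/report.txt. This README does not say how Extractify handles such a clash, so check the output when a directory holds same-named files in different formats.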
Supported Formats
Extractify currently supports the following document formats:
- .pdf
- .doc
- .docx
- .xlsx
- .txt
Dependencies
Extractify requires the following Python libraries:
- tika
- openpyxl
- argparse
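As a rough idea of how these libraries could divide the work, the sketch below sends .xlsx files to openpyxl and everything else to Apache Tika through the tika package. How Extractify actually routes formats is not documented here, so the function names (`extract_text`, `xlsx_to_text`, `tika_to_text`), the tab-separated spreadsheet output, and the direct read of .txt files are all assumptions. The tika package also needs a Java runtime to start its Tika server.

```python
from pathlib import Path


def xlsx_to_text(path):
    """Flatten every worksheet into tab-separated lines (assumed layout)."""
    from openpyxl import load_workbook  # third-party, imported on demand

    workbook = load_workbook(path, read_only=True, data_only=True)
    lines = []
    for sheet in workbook.worksheets:
        for row in sheet.iter_rows(values_only=True):
            # Empty cells come back as None; render them as empty strings.
            lines.append("\t".join("" if v is None else str(v) for v in row))
    workbook.close()
    return "\n".join(lines)


def tika_to_text(path):
    """Ask Apache Tika for the plain-text content of a document."""
    from tika import parser  # third-party, requires Java at run time

    parsed = parser.from_file(str(path))
    # 'content' can be None when Tika finds no text in the file.
    return parsed.get("content") or ""


def extract_text(path):
    """Pick an extractor by file extension (hypothetical dispatch)."""
    path = Path(path)
    suffix = path.suffix.lower()
    if suffix == ".txt":
        # Assumption: plain text is read as-is instead of going through Tika.
        return path.read_text(encoding="utf-8", errors="replace")
    if suffix == ".xlsx":
        return xlsx_to_text(path)
    return tika_to_text(path)
```

Importing the third-party packages inside the functions keeps the sketch loadable when only one of them is installed. argparse ships with the Python standard library, so it normally needs no separate installation.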
Hashes for extractify-0.0.2.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | dd2692778e3ae54a78d4286762d0e765cab76892f14c85a2644dafea14c2ccaf
MD5 | fe819f6685496fa4ea7812c0ea72b1f7
BLAKE2b-256 | 4e9295d8589b718e76743c224ccabf401e568a81d94120a5ab9ae4c24a71f77d