For converting pdf documents to txt files
Project description
PDF TO TEXT CONVERTER
convert PDF to TXT.
Instructions
- Install:
pip install aradf
-
Install Tesseract, include arabic training data in the installation from: https://github.com/UB-Mannheim/tesseract/wiki
-
convert PDF to TXT:
from aradf import convertor
# get the text, it also saves txt file to the same directory of the pdf
txt = convertor.pdf_to_txt('path/to/pdf_file')
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
aradf-0.0.2.tar.gz
(3.5 kB
view hashes)
Built Distribution
aradf-0.0.2-py3-none-any.whl
(3.8 kB
view hashes)