A package that enables extraction of text, images, and tables.
Project description
#LexStruct_PDF
This is an efficient python library build to extract the contents(text,images,tables) from a pdf.
It accepts one argument i.e. path of your pdf.
How to use:-
from LexStruct_PDF import ContentExtractor
obj = ContentExtractor("pdf to your path") extracted_text = obj.extract_content()
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
lexstruct_pdf-0.0.1.tar.gz
(4.2 kB
view hashes)
Built Distribution
Close
Hashes for lexstruct_pdf-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 664a3c8fbfa32a50241dfb3c85cf6835d4ae93f9e6160eca1a443737066289f1 |
|
MD5 | e80898b27fbe24f5dcc522d09a9bf75d |
|
BLAKE2b-256 | 294e7a8159990afa357e4fdb6f67ee9a75080c1238617380f1209bdc125590e3 |