Extract text content from many filetypes.
Project description
Usage:
>>> from textractor import TExtractor >>> extractor = TExtractor() >>> extractor.index('test.docx') ['workflow_history', 'portal_workflow', 'review_history', 'implementation', 'organizations', 'Illustrations', ...] >>> extractor.index('test.pdf') ['workflow_history', 'portal_workflow', 'review_history', 'implementation', 'organizations', 'Illustrations', ...] >>>
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
TExtractor-0.1.tar.gz
(10.1 kB
view hashes)