Python interface to Apache Tika, text extraction from PDF pages
Project description
python-apachetika
A Python wrapper for Apache Tika, a Java toolkit that detects and extracts metadata and text from over a thousand different file types.
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
python-apachetika-2.6.1.tar.gz (46.7 MB)
Built Distribution
Hashes for python_apachetika-2.6.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 273cfaf96bb717e1f5d46ba1036bf6c8a1b5b871032e5ba9e0cb142edcf1fa8b
MD5 | d433e06f005d4d4d276c102640fac3b8
BLAKE2b-256 | 735268c64c5ffff3005af86cc7bc93d957b8e594b953f80fa96e621251c3e732
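The digests listed above can be used to check that a downloaded wheel was not corrupted or altered in transit. The following is a minimal sketch using only Python's standard `hashlib` module; the helper names `file_digest` and `matches_published_digest` and the local file path are assumptions made for illustration and are not part of python-apachetika.

```python
import hashlib
import os


def file_digest(path, algorithm="sha256", chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in chunks to limit memory use."""
    if algorithm == "blake2b-256":
        # BLAKE2b-256 is BLAKE2b with a 32-byte (256-bit) digest.
        hasher = hashlib.blake2b(digest_size=32)
    else:
        hasher = hashlib.new(algorithm)
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest()


def matches_published_digest(path, expected_hex, algorithm="sha256"):
    """Compare a file's digest with a published hex digest (case-insensitive)."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Hypothetical local path: assumes the wheel was downloaded to the current directory.
    wheel = "python_apachetika-2.6.1-py3-none-any.whl"
    published_sha256 = "273cfaf96bb717e1f5d46ba1036bf6c8a1b5b871032e5ba9e0cb142edcf1fa8b"
    if os.path.exists(wheel):
        ok = matches_published_digest(wheel, published_sha256, "sha256")
        print("SHA256 matches" if ok else "SHA256 MISMATCH")
    else:
        print(f"{wheel} not found; download it first")
```

SHA256 or BLAKE2b-256 is the better choice for integrity checks; MD5 is listed for completeness but is not collision-resistant.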