Python interface to Apache Tika, text extraction from PDF pages
Project description
python-apachetika
A python wrapper for apache tika, a Java toolkit that detects and extracts metadata and text from over a thousand different file types
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
python-apachetika-2.6.1.tar.gz
(46.7 MB
view details)
Built Distribution
File details
Details for the file python-apachetika-2.6.1.tar.gz
.
File metadata
- Download URL: python-apachetika-2.6.1.tar.gz
- Upload date:
- Size: 46.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3cfd84cfbfaea67c4b2b0e48c48680b5000c5d676b81583bd7db8736120bfc9c |
|
MD5 | ca73f4c8c967748d416d1d5df8567703 |
|
BLAKE2b-256 | 71586bfa334b40d15608f2a727dc7842c5f11861a3f821e188109a4240d18a2b |
File details
Details for the file python_apachetika-2.6.1-py3-none-any.whl
.
File metadata
- Download URL: python_apachetika-2.6.1-py3-none-any.whl
- Upload date:
- Size: 46.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 273cfaf96bb717e1f5d46ba1036bf6c8a1b5b871032e5ba9e0cb142edcf1fa8b |
|
MD5 | d433e06f005d4d4d276c102640fac3b8 |
|
BLAKE2b-256 | 735268c64c5ffff3005af86cc7bc93d957b8e594b953f80fa96e621251c3e732 |