Python interface to Apache Tika, text extraction from PDF pages
Project description
python-apachetika
A Python wrapper for Apache Tika, a Java toolkit that detects and extracts metadata and text from over a thousand different file types.
Download files
Download the file for your platform.
Source Distribution
python-apachetika-2.6.1.tar.gz (46.7 MB)
Built Distribution
python_apachetika-2.6.1-py3-none-any.whl (46.8 MB)
File details
Details for the file python-apachetika-2.6.1.tar.gz.
File metadata
- Download URL: python-apachetika-2.6.1.tar.gz
- Upload date:
- Size: 46.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3cfd84cfbfaea67c4b2b0e48c48680b5000c5d676b81583bd7db8736120bfc9c |
| MD5 | ca73f4c8c967748d416d1d5df8567703 |
| BLAKE2b-256 | 71586bfa334b40d15608f2a727dc7842c5f11861a3f821e188109a4240d18a2b |
File details
Details for the file python_apachetika-2.6.1-py3-none-any.whl.
File metadata
- Download URL: python_apachetika-2.6.1-py3-none-any.whl
- Upload date:
- Size: 46.8 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 273cfaf96bb717e1f5d46ba1036bf6c8a1b5b871032e5ba9e0cb142edcf1fa8b |
| MD5 | d433e06f005d4d4d276c102640fac3b8 |
| BLAKE2b-256 | 735268c64c5ffff3005af86cc7bc93d957b8e594b953f80fa96e621251c3e732 |
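
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `matches_published_hash` are hypothetical and not part of python-apachetika, and the sketch assumes the file was saved under its original name.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digests copied from the "File hashes" tables on this page.
EXPECTED_SHA256 = {
    "python-apachetika-2.6.1.tar.gz":
        "3cfd84cfbfaea67c4b2b0e48c48680b5000c5d676b81583bd7db8736120bfc9c",
    "python_apachetika-2.6.1-py3-none-any.whl":
        "273cfaf96bb717e1f5d46ba1036bf6c8a1b5b871032e5ba9e0cb142edcf1fa8b",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so a ~47 MB archive is never held in memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path):
    """Hypothetical helper: compare a local file against the digest listed above."""
    name = Path(path).name
    expected = EXPECTED_SHA256.get(name)
    if expected is None:
        raise KeyError(f"no published digest recorded for {name!r}")
    return hmac.compare_digest(sha256_of(path), expected)
```

Calling `matches_published_hash("python-apachetika-2.6.1.tar.gz")` returns `True` only when the local file's SHA256 digest equals the value in the table; a `False` result suggests a corrupted or altered download.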