Skip to main content

Python interface to Apache Tika, text extraction from PDF pages

Project description

python-apachetika

A python wrapper for apache tika, a Java toolkit that detects and extracts metadata and text from over a thousand different file types

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

python-apachetika-2.6.1.tar.gz (46.7 MB view details)

Uploaded Source

Built Distribution

python_apachetika-2.6.1-py3-none-any.whl (46.8 MB view details)

Uploaded Python 3

File details

Details for the file python-apachetika-2.6.1.tar.gz.

File metadata

  • Download URL: python-apachetika-2.6.1.tar.gz
  • Upload date:
  • Size: 46.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.8.0

File hashes

Hashes for python-apachetika-2.6.1.tar.gz
Algorithm Hash digest
SHA256 3cfd84cfbfaea67c4b2b0e48c48680b5000c5d676b81583bd7db8736120bfc9c
MD5 ca73f4c8c967748d416d1d5df8567703
BLAKE2b-256 71586bfa334b40d15608f2a727dc7842c5f11861a3f821e188109a4240d18a2b

See more details on using hashes here.

File details

Details for the file python_apachetika-2.6.1-py3-none-any.whl.

File metadata

File hashes

Hashes for python_apachetika-2.6.1-py3-none-any.whl
Algorithm Hash digest
SHA256 273cfaf96bb717e1f5d46ba1036bf6c8a1b5b871032e5ba9e0cb142edcf1fa8b
MD5 d433e06f005d4d4d276c102640fac3b8
BLAKE2b-256 735268c64c5ffff3005af86cc7bc93d957b8e594b953f80fa96e621251c3e732

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page