Skip to main content

Extractous Python Binding

Project description

Extractous Python Bindings

This project provides Python bindings for the Extractous library, allowing you to use extractous functionality in your Python applications.

Installation

To install the extractous Python bindings, you can use pip:

pip install extractous

Usage

Extracting a file to string:

from extractous import Extractor

extractor = Extractor()
extractor.set_extract_string_max_length(1000)
result = extractor.extract_file_to_string("README.md")

print(result)

Extracting a file to a buffered stream:

from extractous import Extractor

extractor = Extractor()
reader = extractor.extract_file("tests/quarkus.pdf")

result = ""
buffer = reader.read(4096)
while len(buffer) > 0:
    result += buffer.decode("utf-8")
    buffer = reader.read(4096)

print(result)

Extracting a file with OCR:

from extractous import Extractor, TesseractOcrConfig

extractor = Extractor().set_ocr_config(TesseractOcrConfig().set_language("deu"))
result = extractor.extract_file_to_string("../../test_files/documents/eng-ocr.pdf")

print(result)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extractous-0.1.7.tar.gz (150.3 kB view details)

Uploaded Source

Built Distributions

extractous-0.1.7-cp38-abi3-win_amd64.whl (39.0 MB view details)

Uploaded CPython 3.8+ Windows x86-64

extractous-0.1.7-cp38-abi3-manylinux_2_28_x86_64.whl (40.2 MB view details)

Uploaded CPython 3.8+ manylinux: glibc 2.28+ x86-64

extractous-0.1.7-cp38-abi3-macosx_11_0_arm64.whl (46.7 MB view details)

Uploaded CPython 3.8+ macOS 11.0+ ARM64

extractous-0.1.7-cp38-abi3-macosx_10_12_x86_64.whl (47.5 MB view details)

Uploaded CPython 3.8+ macOS 10.12+ x86-64

File details

Details for the file extractous-0.1.7.tar.gz.

File metadata

  • Download URL: extractous-0.1.7.tar.gz
  • Upload date:
  • Size: 150.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for extractous-0.1.7.tar.gz
Algorithm Hash digest
SHA256 7c70780b04f23b30753e296d5c01dbf4ec652a81688da9237476b17ee43851cb
MD5 ef273785adf115223197520b1b0c30a4
BLAKE2b-256 15038e0424feb9860bfdd81cb7bbabf19fa581a6a0e8d32b28674190efe2f4c8

See more details on using hashes here.

Provenance

The following attestation bundles were made for extractous-0.1.7.tar.gz:

Publisher: release_python.yml on yobix-ai/extractous

Attestations:

File details

Details for the file extractous-0.1.7-cp38-abi3-win_amd64.whl.

File metadata

  • Download URL: extractous-0.1.7-cp38-abi3-win_amd64.whl
  • Upload date:
  • Size: 39.0 MB
  • Tags: CPython 3.8+, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for extractous-0.1.7-cp38-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 3d1de35dae574a8bddc14d506eab3389ea928bcc390a804f7029d232c0cd0014
MD5 7d159cf52ab64eac88d38b81bc9eae99
BLAKE2b-256 4aa2be1d8253647f0b95e3265c3c4de2540ed365391f04c47d512f8652de1964

See more details on using hashes here.

Provenance

The following attestation bundles were made for extractous-0.1.7-cp38-abi3-win_amd64.whl:

Publisher: release_python.yml on yobix-ai/extractous

Attestations:

File details

Details for the file extractous-0.1.7-cp38-abi3-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for extractous-0.1.7-cp38-abi3-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0f3d061be6e12dd3a130b04c8e26a1869687f9699f413f71025a400e417b9f11
MD5 9f623cdcc4735f4dbb34fe103b6868b0
BLAKE2b-256 5af30f07324a2939b814b023ccec11eb90e90cd50d10da9dd293545e663ea0e5

See more details on using hashes here.

Provenance

The following attestation bundles were made for extractous-0.1.7-cp38-abi3-manylinux_2_28_x86_64.whl:

Publisher: release_python.yml on yobix-ai/extractous

Attestations:

File details

Details for the file extractous-0.1.7-cp38-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for extractous-0.1.7-cp38-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 92f3589160129cff1b7211cff6d0bb9814779d7c50f898fb25bb6249d99b96f9
MD5 56cdbcf1937faab8500e4972a38818ad
BLAKE2b-256 f55b2149736751be2ffaf4e4add49d271687a601b8a6c0917b15b530dc4cc75a

See more details on using hashes here.

Provenance

The following attestation bundles were made for extractous-0.1.7-cp38-abi3-macosx_11_0_arm64.whl:

Publisher: release_python.yml on yobix-ai/extractous

Attestations:

File details

Details for the file extractous-0.1.7-cp38-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for extractous-0.1.7-cp38-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 75325b8ab9eda8e49bc6d1a6ce6c7d2fafa940fc6b72478d713305f9b3760bf8
MD5 834fb57a4763184b812987e7c383dceb
BLAKE2b-256 f272fc8a361f0511f9f4947c23c861ccf3f52fdcbe865aa24a6541fd5767c509

See more details on using hashes here.

Provenance

The following attestation bundles were made for extractous-0.1.7-cp38-abi3-macosx_10_12_x86_64.whl:

Publisher: release_python.yml on yobix-ai/extractous

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page