A simple package for working with PDFs

Project description

pdfbyte

pdfbyte is a Python package for extracting text and other data from PDF files, whether they were created digitally or produced by scanning.

Features

  • Extract text from digital PDFs: Works with PDFs that contain machine-readable text.
  • Extract text from scanned PDFs: Uses OCR to extract text from scanned (image-based) PDFs.
  • Easy to use: Simple API for extracting text from PDFs with minimal setup.
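
The two extraction modes above are commonly combined as "try the text layer first, fall back to OCR when a page has little or no machine-readable text". The sketch below illustrates only that control flow. It is not pdfbyte's API, which is not documented here: `read_text_layer`, `run_ocr`, and the `min_chars` threshold are hypothetical stand-ins supplied by the caller.

```python
from typing import Callable, Iterable, List


def extract_text_with_ocr_fallback(
    pages: Iterable[object],
    read_text_layer: Callable[[object], str],  # hypothetical: reads embedded text
    run_ocr: Callable[[object], str],          # hypothetical: OCRs a page image
    min_chars: int = 20,                       # assumed threshold, tune per corpus
) -> List[str]:
    """Return one text string per page, using OCR only where needed."""
    results: List[str] = []
    for page in pages:
        text = read_text_layer(page) or ""
        # A near-empty text layer suggests a scanned (image-based) page.
        if len(text.strip()) < min_chars:
            text = run_ocr(page)
        results.append(text)
    return results


if __name__ == "__main__":
    # Stand-in pages and extractors, purely for demonstration.
    demo_pages = ["digital", "scanned"]
    texts = extract_text_with_ocr_fallback(
        demo_pages,
        read_text_layer=lambda p: "machine-readable text " * 3 if p == "digital" else "",
        run_ocr=lambda p: f"ocr result for {p} page",
    )
    print(texts)
```

Passing the extractors in as callables keeps the sketch independent of any particular PDF or OCR library.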

Installation

You can install pdfbyte using pip:

pip install pdfbyte
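
After installing, one way to confirm which version is present is to query the installed distribution metadata with the Python standard library. This is a minimal sketch that does not touch pdfbyte's own API; the helper name `installed_version` is illustrative.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return version(distribution)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("pdfbyte")
    print(found if found is not None else "pdfbyte is not installed")
```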

Download files

Source Distribution

pdfbyte-0.1.1.tar.gz (2.5 kB view details)

Uploaded Source

Built Distribution

pdfbyte-0.1.1-py3-none-any.whl (2.9 kB view details)

Uploaded Python 3

File details

Details for the file pdfbyte-0.1.1.tar.gz.

File metadata

  • Download URL: pdfbyte-0.1.1.tar.gz
  • Size: 2.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pdfbyte-0.1.1.tar.gz
Algorithm Hash digest
SHA256 599ec8e607f5e21d9a60b9e8b4d33b6ea1ebdf40e092d8f234be275713fdd5e6
MD5 1b392e72b5fa49e0e29f8ec6a076bd7b
BLAKE2b-256 b00160fd9cad8d8ea2c2cf7f138d35ece974d8af34c6ed09815ce4c863280296
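
The SHA256 digest listed above can be compared against a locally downloaded copy of the archive. The following is a minimal sketch using only the standard library; the helper names and the assumption that the archive sits in the current directory are illustrative and not part of pdfbyte.

```python
import hashlib
import hmac

# SHA256 digest published for pdfbyte-0.1.1.tar.gz (copied from the listing above).
EXPECTED_SHA256 = "599ec8e607f5e21d9a60b9e8b4d33b6ea1ebdf40e092d8f234be275713fdd5e6"


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Compute the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected_hex: str) -> bool:
    """Return True if the file's SHA256 digest equals the expected hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


if __name__ == "__main__":
    # Assumes the archive was downloaded into the current directory.
    archive = "pdfbyte-0.1.1.tar.gz"
    try:
        print("digest matches" if matches_expected(archive, EXPECTED_SHA256) else "digest MISMATCH")
    except FileNotFoundError:
        print(f"{archive} not found; download it first")
```

The same helper works for the wheel by passing its file name and its own published SHA256 digest.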

File details

Details for the file pdfbyte-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: pdfbyte-0.1.1-py3-none-any.whl
  • Size: 2.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pdfbyte-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ece617f02bd60ed251cf31330f818d5d84601749730f9f58002f42347397ea1f
MD5 320eb2a3bb87783a51bf1eee2403c907
BLAKE2b-256 8c7f772a3138720f799d23c421d2f51212591b57715423a9a822b1d86e11c2e7
