Skip to main content

Any2Text - any format to text using Unstructured.io

Project description

py-any2text-parser

install

pip install py-any2text-parser

usage


developer only below:

initialize

python3 -m venv ./venv
source venv/bin/activate
pip install -r requirements.txt

test

python3 test_pdf2text.py

usage

from pdf2text.pdf2text import extract_pdf_file_to_text

file_path = "/Users/user/Downloads/AUDIT_MATERIALS/budget_materials/personal/2021/2021 03 remarks 2.pdf"
  
with open(file_path, "rb") as file:
  text_data, text = extract_pdf_file_to_text(
    filename="abc.pdf",
    file=file,
    meta_data_mapping = {
        "document_category": "DEF",
    }
  )
  
  print(text_data, text)

develop - upload to pypi

pip install twine build
python3 -m build
twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_any2text_parser-1.0.1.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

py_any2text_parser-1.0.1-py3-none-any.whl (8.7 kB view details)

Uploaded Python 3

File details

Details for the file py_any2text_parser-1.0.1.tar.gz.

File metadata

  • Download URL: py_any2text_parser-1.0.1.tar.gz
  • Upload date:
  • Size: 8.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for py_any2text_parser-1.0.1.tar.gz
Algorithm Hash digest
SHA256 370331c3ecfd99580b3c7417042197142f3e26e10e740b7ce55e41e09468787b
MD5 0a24b904ac1fc408691413bf09a2e580
BLAKE2b-256 1d4626b65b3133bab1a8d0ef63669f6bb8ef7ee01cb9803e878385080fd85620

See more details on using hashes here.

File details

Details for the file py_any2text_parser-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for py_any2text_parser-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 64dc5044a8ec0020f5bb32b8305467b9567bcc2bbf80f2ef54955560fd4ffb5c
MD5 e82d9b34b667d7797898e629cf1d32bc
BLAKE2b-256 44c953e7abfec90f2ac51cf919c7419895816a3c89fd4f1a6f0c32a42a6be4e0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page