Skip to main content

Any2Text - any format to text using Unstructured.io

Project description

py-any2text-parser

install

pip install py-any2text-parser

usage


developer only below:

initialize

python3 -m venv ./venv
source venv/bin/activate
pip install -r requirements.txt

test

python3 test_pdf2text.py

usage

from pdf2text.pdf2text import extract_pdf_file_to_text

file_path = "/Users/user/Downloads/AUDIT_MATERIALS/budget_materials/personal/2021/2021 03 remarks 2.pdf"
  
with open(file_path, "rb") as file:
  text_data, text = extract_pdf_file_to_text(
    filename="abc.pdf",
    file=file,
    meta_data_mapping = {
        "document_category": "DEF",
    }
  )
  
  print(text_data, text)

develop - upload to pypi

pip install twine build
pip install setuppy_generator
pip install setuptools
pip install wheel
python -m setuppy_generator > setup.py
python3 setup.py sdist bdist_wheel
python3 -m build
twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_any2text_parser-1.0.2.tar.gz (8.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

py_any2text_parser-1.0.2-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file py_any2text_parser-1.0.2.tar.gz.

File metadata

  • Download URL: py_any2text_parser-1.0.2.tar.gz
  • Upload date:
  • Size: 8.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for py_any2text_parser-1.0.2.tar.gz
Algorithm Hash digest
SHA256 22c442e78091eff5c7464bced02c601c9b09f4fdc7a00d6d332a989194bc9c7b
MD5 4f9498cbf5f4328b414e9bdbb6ae9771
BLAKE2b-256 dc50794140c273e1f148cdeb8e0ec429def3ba1490cc523868dc90433b42f06a

See more details on using hashes here.

File details

Details for the file py_any2text_parser-1.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for py_any2text_parser-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 5aa9fc234741cd0d67c66ed21b61705bc79e3aa427d620f645c1e5490592f854
MD5 81d5561defde68f926c6e29c8ec72336
BLAKE2b-256 254b76e709d77fd1380d483e065f2332a9616e87ac510deedfa738765db647b9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page