Convert PDF to Markdown using LlamaParse

ak_pdf2md

Convert PDFs to Markdown using LlamaParse.


Getting Started

Prerequisites

Python 3.11 or above
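
The version requirement can be checked before installing. The following is a minimal sketch that uses only the standard library; the helper name `python_is_supported` is hypothetical and is not part of ak_pdf2md.

```python
import sys

# Minimum interpreter version stated in the prerequisites.
MIN_PYTHON = (3, 11)


def python_is_supported(version=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version meets the minimum.

    When no version is given, the running interpreter is checked.
    """
    current = sys.version_info[:2] if version is None else version[:2]
    return tuple(current) >= tuple(minimum)


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    if python_is_supported():
        print(f"Python {found} meets the 3.11 requirement.")
    else:
        print(f"Python {found} is too old; 3.11 or above is required.")
```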

Installation

Production

  pip install ak_pdf2md

Usage

Python

from ak_pdf2md import test_config, pdfparser

test_config() # Test if config.toml has been correctly set up

# Convert to markdown in same dir
pdfparser.convert(filepath="/path/to/pdf")

# Explicit Conversion
pdfparser.convert(filepath="/path/to/pdf", dest_dir="/dest", extension=".md")

# Additional parsing options can be passed as keyword arguments; see
# https://docs.cloud.llamaindex.ai/llamaparse/features/parsing_options
pdfparser.convert(
    filepath="/path/to/pdf",
    parsing_instruction="You are parsing a receipt from a restaurant."
)
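
To convert a whole folder, the single-file call above can be wrapped in a loop. The sketch below is illustrative only: `find_pdfs` and `convert_all` are hypothetical helpers, not part of ak_pdf2md, and the converter is passed in as an argument so the loop does not depend on the package being installed. It assumes only that the converter accepts `filepath` and `dest_dir` keyword arguments, as in the examples above; what it returns or raises is not documented here, so any exception is treated as a failed file.

```python
from pathlib import Path


def find_pdfs(root):
    """Return every PDF under root (searched recursively), in sorted order."""
    return sorted(
        path
        for path in Path(root).rglob("*")
        if path.is_file() and path.suffix.lower() == ".pdf"
    )


def convert_all(pdfs, dest_dir, convert):
    """Call convert(filepath=..., dest_dir=...) once per PDF.

    `convert` is any callable with those two keyword arguments
    (assumed to match pdfparser.convert). Returns the list of PDFs
    whose conversion raised an exception.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    failed = []
    for pdf in pdfs:
        try:
            convert(filepath=str(pdf), dest_dir=str(dest))
        except Exception:
            # The converter's error types are unknown, so catch broadly
            # and keep going with the remaining files.
            failed.append(pdf)
    return failed
```

With the package installed and configured, the call would presumably look like `convert_all(find_pdfs("/path/to/folder"), "/dest", pdfparser.convert)`.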

Terminal

app --help    # Show help for the command-line interface

test          # Test whether the config is correctly set up

License

See LICENSE for more information.

Contact

Arun Kishore - @rpakishore

Project Link: https://github.com/rpakishore/ak_pdf2md
