Skip to main content

A dead-simple tool to extract high-quality Markdown from any URL or PDF for LLMs

Project description

📦 contextpack-md

Release Status Documentation PyPI version License

A dead-simple tool to extract high-quality Markdown from any URL or PDF, optimized for LLMs.

🚀 Quick Start

Extract clean, LLM-ready Markdown from any URL in seconds.

# Using uv (recommended)
uvx contextpack-md web https://docs.python.org/3/

# Download and convert PDF
uvx contextpack-md pdf https://arxiv.org/pdf/1706.03762.pdf

✨ Key Features

  • 🎯 LLM-Ready Output: Clean, readable Markdown with links, but no junk.
  • 📄 PDF Support: High-fidelity PDF-to-Markdown conversion (via marker-pdf).
  • 📂 Local Caching: Optional timestamped local storage in ~/.contextpack-md/.
  • ⚡ Fast & Lean: Built on top of trafilatura for superior extraction speed and quality.

🛠️ Installation

pip install contextpack-md

# For PDF support (requires PyTorch)
pip install "contextpack-md[pdf]"

📖 Documentation

Full documentation is available at https://theguega.github.io/contextpack-md/.

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for more details.

⚖️ License

MIT License. See LICENSE for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contextpack_md-0.1.1.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

contextpack_md-0.1.1-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file contextpack_md-0.1.1.tar.gz.

File metadata

  • Download URL: contextpack_md-0.1.1.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Linux Mint","version":"22.3","id":"zena","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for contextpack_md-0.1.1.tar.gz
Algorithm Hash digest
SHA256 a191ff515f0fcdb29fdff2d73b49a7eb283ea1111bc8290a5c3208baaf34d82b
MD5 95c23d7e788caabd4c78f12fc5400c89
BLAKE2b-256 9c031e53d2e302d419841f192678fe73a0eed578ab72ec8b2d9b532c03720f09

See more details on using hashes here.

File details

Details for the file contextpack_md-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: contextpack_md-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Linux Mint","version":"22.3","id":"zena","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for contextpack_md-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 aa1d1facb8669e288dba370a9ce628972ebe78f90fc88c775a0ca36f4e9eeb36
MD5 6edd14291762e87ad50c5b29f272fd8a
BLAKE2b-256 6b7f88aa833338a80701deadcbc9e987ee83ba3fbbeccf3f71f2ba44c55e1888

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page