Skip to main content

A dead-simple tool to extract high-quality Markdown from any URL or PDF for LLMs

Project description

📦 contextpack-md

Release Status Documentation PyPI version License

A dead-simple tool to extract high-quality Markdown from any URL or PDF, optimized for LLMs.

🚀 Quick Start

Extract clean, LLM-ready Markdown from any URL in seconds.

# Using uv (recommended)
uvx contextpack-md web https://docs.python.org/3/

# Download and convert PDF
uvx contextpack-md pdf https://arxiv.org/pdf/1706.03762.pdf

✨ Key Features

  • 🎯 LLM-Ready Output: Clean, readable Markdown with links, but no junk.
  • 📄 PDF Support: High-fidelity PDF-to-Markdown conversion (via marker-pdf).
  • 📂 Local Caching: Optional timestamped local storage in ~/.contextpack-md/.
  • ⚡ Fast & Lean: Built on top of trafilatura for superior extraction speed and quality.

🛠️ Installation

pip install contextpack-md

# For PDF support (requires PyTorch)
pip install "contextpack-md[pdf]"

📖 Documentation

Full documentation is available at https://theguega.github.io/contextpack-md/.

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for more details.

⚖️ License

MIT License. See LICENSE for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contextpack_md-0.2.0.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

contextpack_md-0.2.0-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file contextpack_md-0.2.0.tar.gz.

File metadata

  • Download URL: contextpack_md-0.2.0.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Linux Mint","version":"22.3","id":"zena","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for contextpack_md-0.2.0.tar.gz
Algorithm Hash digest
SHA256 0a6cb74167a1430c187b04e8a1cee96e7d790bc355b19718a562df8b2c312960
MD5 41bf65057de357602a0c7d0dabc223ec
BLAKE2b-256 ddb28d4988d66d11a3e1678e9cdd588610b61ee417d895ff4211e7b7818986f7

See more details on using hashes here.

File details

Details for the file contextpack_md-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: contextpack_md-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Linux Mint","version":"22.3","id":"zena","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for contextpack_md-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e81ab940d96376bf307e8f85b6b6559d5328b7c25056b4c2f73595b3ab9d4069
MD5 cc0728af53996e4702152889b47f9293
BLAKE2b-256 c1f330e36fde92779a3a2ad4f23a1d535c76eb209e87fd18c0d71688d1216af8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page