Skip to main content

A dead-simple tool to extract high-quality Markdown from any URL or PDF for LLMs

Project description

📦 contextpack-md

Release Status Documentation PyPI version License

A dead-simple tool to extract high-quality Markdown from any URL or PDF, optimized for LLMs.

🚀 Quick Start

Extract clean, LLM-ready Markdown from any URL in seconds.

# Using uv (recommended)
uvx contextpack-md web https://docs.python.org/3/

# Download and convert PDF
uvx contextpack-md pdf https://arxiv.org/pdf/1706.03762.pdf

✨ Key Features

  • 🎯 LLM-Ready Output: Clean, readable Markdown with links, but no junk.
  • 📄 PDF Support: High-fidelity PDF-to-Markdown conversion (via marker-pdf).
  • 📂 Local Caching: Optional timestamped local storage in ~/.contextpack-md/.
  • ⚡ Fast & Lean: Built on top of trafilatura for superior extraction speed and quality.

🛠️ Installation

pip install contextpack-md

# For PDF support (requires PyTorch)
pip install "contextpack-md[pdf]"

📖 Documentation

Full documentation is available at https://theguega.github.io/contextpack-md/.

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for more details.

⚖️ License

MIT License. See LICENSE for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contextpack_md-0.2.2.tar.gz (4.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

contextpack_md-0.2.2-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file contextpack_md-0.2.2.tar.gz.

File metadata

  • Download URL: contextpack_md-0.2.2.tar.gz
  • Upload date:
  • Size: 4.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for contextpack_md-0.2.2.tar.gz
Algorithm Hash digest
SHA256 2d05585748bd08d6e8257ecc83d8076f534fdd52be0da3e00eaee5cb9a786e0e
MD5 0aaf5f9352fdb0e8c8c973dad0d0fec5
BLAKE2b-256 ed80e8aa635bf801857a65f7985a8e206b5790f0ad39cc48c3f3156274cc2ae5

See more details on using hashes here.

File details

Details for the file contextpack_md-0.2.2-py3-none-any.whl.

File metadata

  • Download URL: contextpack_md-0.2.2-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for contextpack_md-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 951b7bd1593304f88ede5692bae80edd23d857cf4e2706d01134625f7b784a7c
MD5 905ebd246651f1b63957d729beb301b7
BLAKE2b-256 47f22ef4b6525b565934a3374e043efc8a53aaba2cec4e711ab5c84b32405d29

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page