📦 contextpack-md

A dead-simple tool to extract high-quality Markdown from any URL or PDF, optimized for LLMs.

🚀 Quick Start

Extract clean, LLM-ready Markdown from any URL in seconds.

# Using uv (recommended)
uvx contextpack-md web https://docs.python.org/3/

# Download and convert a PDF
uvx contextpack-md pdf https://arxiv.org/pdf/1706.03762.pdf
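
For scripting, the same two commands can be driven from Python. The sketch below is a minimal wrapper, assuming `uvx` is on the PATH and that the tool prints its result to standard output, which this page does not state. `build_command` and `extract` are hypothetical helper names, not part of contextpack-md.

```python
import subprocess

# The two subcommands shown in the Quick Start.
SUBCOMMANDS = ("web", "pdf")


def build_command(kind: str, url: str) -> list:
    """Return the argv list for `uvx contextpack-md <kind> <url>`."""
    if kind not in SUBCOMMANDS:
        raise ValueError(f"unknown subcommand: {kind!r}")
    return ["uvx", "contextpack-md", kind, url]


def extract(kind: str, url: str) -> str:
    """Run the CLI and return whatever it writes to stdout.

    Assumption: the Markdown is printed to stdout. If the tool writes
    to a file instead, the returned string will not contain it.
    """
    result = subprocess.run(
        build_command(kind, url),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

Calling `extract("web", "https://docs.python.org/3/")` would run the first command above. Passing the arguments as a list avoids shell quoting problems with URLs that contain `&` or `?`.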

✨ Key Features

  • 🎯 LLM-Ready Output: Clean, readable Markdown that keeps links and drops page boilerplate.
  • 📄 PDF Support: High-fidelity PDF-to-Markdown conversion (via marker-pdf).
  • 📂 Local Caching: Optional timestamped local storage in ~/.contextpack-md/.
  • ⚡ Fast & Lean: Built on trafilatura for fast, high-quality extraction.
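
The caching feature above stores timestamped copies under ~/.contextpack-md/. How the files are actually named is not documented on this page, so the following is only an illustrative sketch of one possible scheme. `cache_path`, the slug rule, and the timestamp format are all assumptions.

```python
import re
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional

# Cache directory named in the feature list; everything else is hypothetical.
CACHE_DIR = Path.home() / ".contextpack-md"


def cache_path(url: str, now: Optional[datetime] = None) -> Path:
    """Build a timestamped Markdown path for `url` (assumed naming scheme)."""
    now = now or datetime.now(timezone.utc)
    # Drop the scheme, then collapse every run of non-alphanumerics into "-".
    without_scheme = url.split("://", 1)[-1]
    slug = re.sub(r"[^A-Za-z0-9]+", "-", without_scheme).strip("-")
    stamp = now.strftime("%Y%m%dT%H%M%S")
    return CACHE_DIR / f"{slug}_{stamp}.md"
```

Putting the timestamp in the file name means repeated fetches of the same URL never overwrite each other, which is one plausible reading of "timestamped local storage".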

🛠️ Installation

pip install contextpack-md

# For PDF support (requires PyTorch)
pip install "contextpack-md[pdf]"
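
Because the PDF extra pulls in PyTorch, it can be useful to check whether it is present before attempting a conversion. The snippet below is a small sketch using only the standard library. `is_installed` and `pdf_support_available` are hypothetical names, and whether contextpack-md performs a similar check itself is not stated here.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found in this environment."""
    return importlib.util.find_spec(module_name) is not None


def pdf_support_available() -> bool:
    """Check for PyTorch, which the [pdf] extra is said to require.

    "torch" is PyTorch's import name. This does not verify that the
    rest of the PDF toolchain is installed.
    """
    return is_installed("torch")
```

If `pdf_support_available()` returns False, installing the `contextpack-md[pdf]` extra shown above is the documented way to add it.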

📖 Documentation

Full documentation is available at https://theguega.github.io/contextpack-md/.

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for more details.

⚖️ License

MIT License. See LICENSE for more information.

Download files

Source distribution: contextpack_md-0.2.1.tar.gz (4.6 kB)

  • SHA256: 1514148de122700c38e94a78d4c8e2770b521318470f730b21ec5e950b562a8e
  • MD5: 640681324fa64e6a42170f473cebfe19
  • BLAKE2b-256: 949271443a64a6cd92297ee0f3bfe8000e37e5cf7af58418900fb89b819a7c88

Built distribution: contextpack_md-0.2.1-py3-none-any.whl (6.0 kB, Python 3)

  • SHA256: 5cf88c4b2c682fe7fa2b68646e32ba7362c2aeac136b93c42e3b5108329a1a77
  • MD5: ec6f33fae1344fb4c3d2c2f67698bb53
  • BLAKE2b-256: 31e6db7e7c61ed165b9e056152f69d567aaea9813d8491d7c41fcde68570f19a

Both files were uploaded with uv 0.10.9 (`uv publish`) from Linux Mint 22.3, without Trusted Publishing.
