A dead-simple tool to extract high-quality Markdown from any URL or PDF for LLMs
Project description
📦 contextpack-md
A dead-simple tool to extract high-quality Markdown from any URL or PDF, optimized for LLMs.
🚀 Quick Start
Extract clean, LLM-ready Markdown from any URL in seconds.
# Using uv (recommended)
uvx contextpack-md web https://docs.python.org/3/
# Download and convert PDF
uvx contextpack-md pdf https://arxiv.org/pdf/1706.03762.pdf
✨ Key Features
- 🎯 LLM-Ready Output: Clean, readable Markdown with links, but no junk.
- 📄 PDF Support: High-fidelity PDF-to-Markdown conversion (via
marker-pdf). - 📂 Local Caching: Optional timestamped local storage in
~/.contextpack-md/. - ⚡ Fast & Lean: Built on top of
trafilaturafor superior extraction speed and quality.
🛠️ Installation
pip install contextpack-md
# For PDF support (requires PyTorch)
pip install "contextpack-md[pdf]"
📖 Documentation
Full documentation is available at https://theguega.github.io/contextpack-md/.
🤝 Contributing
Contributions are welcome! Please see CONTRIBUTING.md for more details.
⚖️ License
MIT License. See LICENSE for more information.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file contextpack_md-0.1.1.tar.gz.
File metadata
- Download URL: contextpack_md-0.1.1.tar.gz
- Upload date:
- Size: 4.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Linux Mint","version":"22.3","id":"zena","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a191ff515f0fcdb29fdff2d73b49a7eb283ea1111bc8290a5c3208baaf34d82b
|
|
| MD5 |
95c23d7e788caabd4c78f12fc5400c89
|
|
| BLAKE2b-256 |
9c031e53d2e302d419841f192678fe73a0eed578ab72ec8b2d9b532c03720f09
|
File details
Details for the file contextpack_md-0.1.1-py3-none-any.whl.
File metadata
- Download URL: contextpack_md-0.1.1-py3-none-any.whl
- Upload date:
- Size: 6.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Linux Mint","version":"22.3","id":"zena","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aa1d1facb8669e288dba370a9ce628972ebe78f90fc88c775a0ca36f4e9eeb36
|
|
| MD5 |
6edd14291762e87ad50c5b29f272fd8a
|
|
| BLAKE2b-256 |
6b7f88aa833338a80701deadcbc9e987ee83ba3fbbeccf3f71f2ba44c55e1888
|