A tool for converting PDFs to text files using Ollama.
Project description
llama-scan
A tool for converting PDFs to text files using Ollama.
Features
- Convert PDFs to text files locally, no token costs.
- Use the latest multimodal models supported by Ollama.
- Turn images and diagrams into detailed text descriptions.
Requirements
- Python 3.10+
- Ollama installed and running locally
Installing Ollama and the Default Model
- Install Ollama
- Pull the default model:
ollama run qwen2.5vl:latest
Installation
Install using pip:
pip install llama-scan
or uv:
uv tool install llama-scan
Usage
Basic usage:
llama-scan path/to/your/file.pdf
Options
--output,-o: Output directory (default: "output")--model,-m: Ollama model to use (default: "qwen2.5vl:latest")--keep-images,-k: Keep the intermediate image files (default: False)--width,-w: Width of the resized images (0 to skip resizing; default: 0)--start,-s: Start page number (default: 0)--end,-e: End page number (default: 0)--stdout,-s: Write merged output to stdout (default: False)
Examples
Process specific pages with custom width:
llama-scan document.pdf --start 1 --end 5 --width 1000
Use a different Ollama model:
llama-scan document.pdf --model qwen2.5vl:3b
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
llama_scan-0.1.5.tar.gz
(7.3 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file llama_scan-0.1.5.tar.gz.
File metadata
- Download URL: llama_scan-0.1.5.tar.gz
- Upload date:
- Size: 7.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d8828b960569d1c9b72e6af6478199c79f771185127ccfd3fa2d7d872a257e30
|
|
| MD5 |
c2d85cdc1c5a33d5778eda23ab26e96a
|
|
| BLAKE2b-256 |
0fe5175f20fca9b27d81f8c30ab9eaa7ec4aff7a89c833c0685f4434bbb2b129
|
File details
Details for the file llama_scan-0.1.5-py3-none-any.whl.
File metadata
- Download URL: llama_scan-0.1.5-py3-none-any.whl
- Upload date:
- Size: 7.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e17b552cfbd953d966bfca04464171ae6e861b7b88a664b50023fe513f483f0f
|
|
| MD5 |
8fb22d06148b452c04123ff8ef5435e1
|
|
| BLAKE2b-256 |
883e7ecd3376a00404ae8746ac5b22888e091170131fa793b26f25206d4174a8
|