Docling → Chroma → Ollama: Simple RAG pipeline

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Typing
- Typed

Project description

📄 DocRAG LLM

Docling → Chroma → Ollama: Simple Local RAG Pipeline

🔎 What is DocRAG LLM?

DocRAG LLM is a local-first Retrieval-Augmented Generation (RAG) pipeline.
It connects Docling for parsing → ChromaDB for vector storage → Ollama for local LLM inference.

No cloud lock-in. No API costs. Just local docs → local vectors → local LLMs.

✨ Features

🔍 Parse documents with Docling (PDF, DOCX, PPTX, HTML, etc.)
📑 Intelligent chunking for retrieval
🧠 Store embeddings in ChromaDB
🤖 Answer questions using Ollama (default: llama3.2:1b)
🛡️ Privacy-first → all local execution
🖥️ Use as a CLI tool or Python library

📦 Installation

pip install docrag-llm

Requirements:

Python 3.10+
Ollama installed & running

Local models:

ollama pull llama3.2:1b -- or any other model
ollama pull nomic-embed-text

🚀 Quickstart

CLI – Ingest and Ask

# Ingest a document (default collection: demo)
python -m docrag.cli ingest https://arxiv.org/pdf/2508.20755

# Ask a question (default LLM: llama3.2:1b) you can always add the tag `-llm gpt-oss:20b` for better response assuming you have the computing power for it. 
python -m docrag.cli ask "Summarize in 1 paragraph with 5 bullet points" 
python -m docrag.cli ask "Summarize in 1 paragraph with 5 bullet points" -llm gpt-oss:20b

Python API - llama3.2:1b is just for testing, reccomend if you have the computing power to ues gpt-oss:20b you will get better results. change as needed, pull from ollama first.

from docrag import DocragSettings, RAGPipeline

cfg = DocragSettings(
    persist_path="./.chroma",
    collection="demo",
    embed_model="nomic-embed-text",
    llm_model="llama3.2:1b",
)

pipeline = RAGPipeline(cfg)

# Ingest
n_chunks = pipeline.ingest("https://arxiv.org/pdf/2508.20755")
print(f"Ingested {n_chunks} chunks")

# Ask
answer = pipeline.ask("Give a concise bullet summary of the paper's contributions.")
print(answer)

⚙️ Configuration

Both CLI & Python API let you customize:

persist_path → where ChromaDB stores vectors
collection → logical collection name
embed_model → embedding model (Ollama tag)
llm_model → LLM model (default: llama3.2:1b)
chunk_chars / chunk_overlap → chunking granularity

📊 Roadmap

model-check CLI → list installed Ollama models
Support multiple backends (Weaviate, Milvus)
Streaming output for long answers
Expanded test suite (large document regression cases)
Example notebooks & Hugging Face demo

🤝 Contributing

PRs and issues welcome!

pip install "docrag-llm[dev]"
ruff check .
pytest

📜 License

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Typing
- Typed

Release history Release notifications | RSS feed

This version

0.1.27

Sep 2, 2025

0.1.26

Sep 2, 2025

0.1.25

Aug 31, 2025

0.1.24

Aug 31, 2025

0.1.23

Aug 31, 2025

0.1.22

Aug 31, 2025

0.1.21

Aug 31, 2025

0.1.20

Aug 31, 2025

0.1.19

Aug 31, 2025

0.1.18

Aug 31, 2025

0.1.17

Aug 30, 2025

0.1.16

Aug 30, 2025

0.1.15

Aug 30, 2025

0.1.14

Aug 30, 2025

0.1.13

Aug 30, 2025

0.1.12

Aug 30, 2025

0.1.11

Aug 30, 2025

0.1.10

Aug 30, 2025

0.1.9

Aug 30, 2025

0.1.5

Aug 30, 2025

0.1.3

Aug 30, 2025

0.1.2

Aug 30, 2025

0.1.1

Aug 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docrag_llm-0.1.27.tar.gz (7.5 kB view details)

Uploaded Sep 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docrag_llm-0.1.27-py3-none-any.whl (10.2 kB view details)

Uploaded Sep 2, 2025 Python 3

File details

Details for the file docrag_llm-0.1.27.tar.gz.

File metadata

Download URL: docrag_llm-0.1.27.tar.gz
Upload date: Sep 2, 2025
Size: 7.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for docrag_llm-0.1.27.tar.gz
Algorithm	Hash digest
SHA256	`527f0b5a770053f8d790f1984265dbcf25fd1929d58f1134357bb917cd1ca683`
MD5	`18e65dd7246b79f6c7181e28342d507c`
BLAKE2b-256	`f5766ee854c433c116bdbdd54fb6ecffec918f04cd137e7d49b7ee5a1b601a76`

See more details on using hashes here.

File details

Details for the file docrag_llm-0.1.27-py3-none-any.whl.

File metadata

Download URL: docrag_llm-0.1.27-py3-none-any.whl
Upload date: Sep 2, 2025
Size: 10.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for docrag_llm-0.1.27-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b3412fb3baee2e6dae2d14aa30d87b55cfd84df58a5218e650c02b2b1bdec61f`
MD5	`f1ec9c7330771a2ca43a01b7d9ebf436`
BLAKE2b-256	`beffd8339966cb9b030a6ca334997c7213fd80b7554eba5f5ae93bf49dba39d7`

See more details on using hashes here.

docrag-llm 0.1.27

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

📄 DocRAG LLM

🔎 What is DocRAG LLM?

✨ Features

📦 Installation

🚀 Quickstart

CLI – Ingest and Ask

Python API - llama3.2:1b is just for testing, reccomend if you have the computing power to ues gpt-oss:20b you will get better results. change as needed, pull from ollama first.

⚙️ Configuration

📊 Roadmap

🤝 Contributing

📜 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes