DocSeer: RAG over research papers — FastAPI + Celery + ChromaDB + Ollama.

Project description

DocSeer

DocSeer is a self-hosted RAG (Retrieval-Augmented Generation) application for research papers. Ingest PDFs, query them in natural language, and explore your library — all running locally via Docker, with no data leaving your machine.

Seer: One who perceives hidden knowledge — interpreting and revealing insights beyond the surface.

version

Architecture

Layer	Technology
API	FastAPI (async, SSE streaming)
Task queue	Celery + Redis
Vector store	ChromaDB
Document store	Local file store (parent chunks)
Relational DB	PostgreSQL
LLM / embeddings	Ollama (fully local)
PDF processing	Docling (content) + GROBID (metadata)
Metadata import	Zotero Translation Server + BibTeX parser
TUI	Textual
Monitoring	Flower (Celery dashboard)

All services run as Docker containers. The only external network call is the initial Ollama model pull, which happens automatically at startup.

Prerequisites

Docker with the Compose plugin (docker compose version)
~10 GB free disk space (models + database volumes)

That's it. No Python, Ollama, or Postgres installation needed on the host — unless you want to use native Ollama for GPU acceleration (see below).

Quick start

Using the CLI (recommended)

# 1. Clone and install
git clone https://github.com/fellajimed/docseer.git
cd docseer
uv pip install -e .

# 2. Start everything and launch the TUI
docseer

One command starts all Docker services (Postgres, Redis, ChromaDB, Ollama, GROBID, Zotero, API, worker, Flower), waits for healthchecks, then opens the Textual TUI. Press Ctrl+C or Ctrl+Q to quit — services stop automatically.

Using `make`

# 1. Clone the repo
git clone https://github.com/fellajimed/docseer.git
cd docseer

# 2. Create your .env (safe to use defaults for local dev)
make .env        # copies .env.example → .env

# 3. Build, start everything, and open the TUI — all in one shot
make run

make run does the full sequence:

Builds the API and worker images
Starts all 9 backend services (Postgres, Redis, ChromaDB, Ollama, GROBID, Zotero, API, worker, Flower) and waits for every healthcheck to pass
Pulls the configured LLM and embedding models from Ollama if not already present (first boot may take a few minutes)
Launches the Textual TUI — chat, manage documents, and tail live container logs

To start only the backend without the TUI:

make up

Running with `make`

DocSeer has two operating modes depending on how Ollama is run. Choose the one that fits your setup.

Mode 1 — Fully Dockerized (default)

Ollama runs as a Docker container. Works on any OS, no extra setup needed. Inference is CPU-only.

make up          # build + start all backend services, wait until healthy
make run         # same as above, then launch the TUI
make down        # stop and remove containers (volumes are kept)

Mode 2 — Native macOS Ollama (recommended on Apple Silicon)

Ollama runs on the host using Apple Metal (GPU), giving 10–50× faster inference than the Docker variant. The rest of the stack (Postgres, Redis, ChromaDB, etc.) still runs in Docker.

One-time setup:

# 1. Install and start Ollama — must bind to 0.0.0.0 so Docker containers
#    can reach it via host.docker.internal:11434
brew install ollama
OLLAMA_HOST=0.0.0.0 ollama serve

# 2. Pull the required models into the native Ollama
make pull-models-native

Day-to-day usage:

make up-native   # start backend services (skips the Docker Ollama container)
make run-native  # same as above, then launch the TUI

If Ollama.app is already running, make sure it is configured to listen on 0.0.0.0. You can set OLLAMA_HOST=0.0.0.0 in your shell or in the Ollama.app environment before launching it.

Service URLs

Service	URL
REST API	http://localhost:8000
API docs (Swagger)	http://localhost:8000/docs
Flower (Celery monitor)	http://localhost:5555
Ollama	http://localhost:11434
GROBID	http://localhost:8070

Configuration

All settings are environment variables prefixed with DOCSEER_. Copy .env.example to .env and adjust as needed.

cp .env.example .env

You can also pass a YAML config file at runtime with the -c / --config flag. Short names (without the DOCSEER_ prefix) are automatically expanded:

docseer -c docseer.example.yaml

# docseer.example.yaml
llm_model: qwen3.5:4b
embedding_model: nomic-embed-text
retriever_topk: 10
chat_num_ctx: 32000

Key settings

Variable	Default	Description
`DOCSEER_LLM_MODEL`	`qwen3.5:4b`	Ollama model used for chat
`DOCSEER_EMBEDDING_MODEL`	`nomic-embed-text`	Ollama model used for embeddings
`DOCSEER_OLLAMA_PULL_ON_STARTUP`	`true`	Pull models at startup if not present locally
`DOCSEER_RETRIEVER_TOPK`	`5`	Number of chunks retrieved per query
`DOCSEER_RERANKER_MODEL`	`ms-marco-MultiBERT-L-12`	FlashRank reranker model
`DOCSEER_CHAT_NUM_CTX`	`20000`	KV-cache context window (tokens)
`DOCSEER_CHAT_NUM_PREDICT`	`4096`	Max tokens per response

To use a different LLM:

# in your .env
DOCSEER_LLM_MODEL=llama3.2

Important: do not mix embedding models across an existing database. nomic-embed-text produces 768-dimensional vectors. Switching models requires wiping and re-ingesting all papers (make clean-db).

Set DOCSEER_OLLAMA_PULL_ON_STARTUP=false if you pre-pull models yourself or work in an air-gapped environment.

All `make` commands

Backend

Command	Description
`make up`	Build + start all backend services (Dockerized Ollama), wait until healthy
`make up-native`	Same, but skips Docker Ollama — uses native host Ollama instead
`make down`	Stop and remove containers (volumes are kept)
`make clean`	Full teardown including all volumes — destructive
`make clean-db`	Wipe paper data (Postgres + ChromaDB + docstore) while keeping Ollama models
`make logs`	Tail logs for all backend services
`make status`	Show container status (`docker compose ps`)
`make build`	Build images without starting

TUI

Command	Description
`make run`	Start backend (Dockerized Ollama) then launch the TUI
`make run-native`	Start backend (native Ollama) then launch the TUI

Models

Command	Description
`make pull-models`	Pull LLM + embedding models into the Docker Ollama container
`make pull-models-native`	Pull LLM + embedding models into the native host Ollama

Development

Command	Description
`make migrate`	Apply Alembic migrations to HEAD
`make shell`	Open a bash shell inside the API container
`make test`	Run the pytest suite inside the API container

CLI reference

Once installed (uv pip install -e . or pip install docseer), the docseer command manages the full stack.

Command	Description
`docseer`	Start services, launch TUI, then stop on exit (default)
`docseer run`	Same as above
`docseer run --keep`	Keep services running after TUI exits
`docseer run --native`	Use native macOS Ollama (Metal GPU)
`docseer run --no-wait`	Don't wait for healthchecks (faster startup)
`docseer run --rebuild`	Force rebuild of Docker images
`docseer run -c config.yaml`	Start with YAML config overrides
`docseer start`	Start all Docker services in background
`docseer stop`	Stop all Docker services
`docseer clean`	Stop services and wipe all volumes
`docseer tui`	Launch TUI only (services must already be running)
`docseer ingest <src> [<src> ...]`	Ingest papers — URLs, PDF paths, or `.bib` files
`docseer ingest --no-trigger <url>`	Save URL metadata only, skip PDF ingestion
`docseer --version`	Show version

TUI keyboard shortcuts

Key	Action
`Ctrl+C` / `Ctrl+Q`	Quit
`Ctrl+T`	Chat tab
`Ctrl+F`	Papers tab
`Ctrl+L`	Logs tab
`Ctrl+S`	DocSeer Settings (LLM model, embedding model, theme)
`Ctrl+P`	Textual Command Palette
`Alt+P`	Filter Papers (open paper picker)
`Alt+M`	Open Macro Selector

Chat tab:

Key	Action
`Ctrl+J` / `Ctrl+M` / `Ctrl+Enter`	Send message
`Tab`	Auto-complete `/macro` name
`<char>` after `/`	Opens Macro Selector modal

Available macros:

Macro	Action
`/papers`	Open paper filter picker
`/summarize`	Structured summary of selected papers
`/extract`	Extract contributions, methodology, results
`/synthesize`	Cross-paper synthesis and insights
`/compare`	Side-by-side comparison of papers
`/critique`	Critical analysis of papers

Type /<char> in the chat input to open the Macro Selector modal, or type the full macro name and press Enter.

Papers tab:

Key	Action
Type a path or URL	Add a paper (PDF, `.bib`, or any URL)
`Tab` → `Enter`	Select/deselect papers for the chat filter

REST API overview

The full interactive documentation is available at http://localhost:8000/docs once the stack is running.

Papers

Method	Path	Description
`GET`	`/papers/`	List all papers
`POST`	`/papers/`	Add a paper and queue ingestion
`GET`	`/papers/{id}`	Get a paper by ID
`PUT`	`/papers/{id}`	Update paper metadata
`DELETE`	`/papers/{id}`	Delete paper and its embeddings
`POST`	`/papers/import-bibtex`	Import papers from a BibTeX string
`POST`	`/papers/import-url`	Import metadata via Zotero Translation Server
`POST`	`/papers/{id}/ingest`	(Re-)trigger PDF ingestion

Chat

Method	Path	Description
`POST`	`/chat/stream`	SSE stream — yields `thinking`, `response`, `done`, `error` events
`POST`	`/chat/invoke`	Blocking single-turn response
`GET`	`/chat/history`	Return conversation history
`DELETE`	`/chat/history`	Clear conversation history

Tasks

Method	Path	Description
`GET`	`/tasks/{task_id}`	Poll a Celery task (PENDING / STARTED / SUCCESS / FAILURE)

Pipeline

Ingestion

  PDF / URL
      │
      ▼
  ┌─────────────┐
  │ get_file_bytes()│
  └──────┬──────┘
         │  doc_bytes
         ▼
  ┌──────────────┐     ┌──────────────────┐
  │   GROBID     │     │     Docling      │
  │  (metadata)  │     │ (PDF → Markdown) │
  └──────┬───────┘     └────────┬─────────┘
         │                      │
         ▼                      ▼
     metadata             page_content
         │                      │
         └──────────┬───────────┘
                    ▼
          ┌─────────────────┐
          │ MarkdownHeader  │
          │ TextSplitter    │  parent chunks (by heading)
          └────────┬────────┘
                   │
          ┌─────────────────┐
          │RecursiveCharText│  child chunks (~800 chars, 80 overlap)
          │ TextSplitter    │
          └────────┬────────┘
                   │
         ┌─────────┴──────────┐
         ▼                    ▼
  ┌──────────────┐   ┌────────────────┐
  │   Ollama     │   │LocalFileStore  │
  │  nomic-embed │   │(parent chunks) │
  └──────┬───────┘   └────────────────┘
         │
         ▼
  ┌──────────────┐
  │   ChromaDB   │   child chunk vectors + metadata
  └──────────────┘

Retrieval

  User query
      │
      ▼
  ┌──────────────┐
  │ Ollama embed │   embed query → vector
  └──────┬───────┘
         │
         ▼
  ┌──────────────┐
  │   ChromaDB   │   cosine similarity search (optionally filtered by paper_ids)
  └──────┬───────┘
         │  top-k child chunks (contain parent_id references)
         ▼
  ┌──────────────┐
  │LocalFileStore│   resolve child → parent chunk (full section context)
  └──────┬───────┘
         │  parent chunk text
         ▼
  ┌──────────────┐
  │  Ollama LLM  │   qwen3.5:4b + retrieved context → answer
  └──────┬───────┘
         │  SSE stream (thinking + response tokens)
         ▼
       TUI chat

For the retrieval step, paper_ids can optionally be passed to restrict the search to specific papers. This is how the paper filter in the Chat tab works.

Chunking strategy

  Parent chunk  ───→  Child chunk   ───→  Embedding in ChromaDB
  (heading section)     (800 chars overlap)
       │
       └── stored in LocalFileStore
       │
       └── 120 char overlap carried from previous parent for continuity

During retrieval, child chunks are matched by similarity, then resolved to their parent for richer context.

Ingestion pipeline

A paper is created via POST /papers/ (with source_path) or POST /papers/import-url.
Celery picks up the ingest task on the ingest queue.
The worker converts the PDF to Markdown using Docling, extracts metadata via GROBID, chunks the content with a parent-child chunker, and stores vectors in ChromaDB + parent chunks in the local docstore.
Ingestion is idempotent — re-ingesting a paper first purges its existing vectors and chunks before rebuilding them.
Poll GET /tasks/{task_id} or watch Flower at http://localhost:5555 to track progress.

Development

# Install all dependencies (including dev) locally with uv
uv sync

# Run tests
uv run pytest tests/ -v

# Run the API locally (requires running infra services)
uv run uvicorn backend.app.main:app --reload

# Run a Celery worker locally
uv run celery -A backend.app.celery_app.celery_app worker --loglevel=info --queues=ingest

License

MIT License

Project details

Release history Release notifications | RSS feed

This version

0.4.0

May 16, 2026

0.3.1

Dec 31, 2025

0.3.0

Dec 18, 2025

0.2.8

Oct 22, 2025

0.2.7

Oct 22, 2025

0.2.6

Sep 2, 2025

0.2.6a0 pre-release

Sep 17, 2025

0.2.5

Aug 26, 2025

0.2.4

Aug 25, 2025

0.2.3

Aug 24, 2025

0.2.2

Aug 24, 2025

0.2.1

Aug 22, 2025

0.2.0

Aug 22, 2025

0.1.0

May 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docseer-0.4.0.tar.gz (27.0 kB view details)

Uploaded May 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docseer-0.4.0-py3-none-any.whl (27.2 kB view details)

Uploaded May 16, 2026 Python 3

File details

Details for the file docseer-0.4.0.tar.gz.

File metadata

Download URL: docseer-0.4.0.tar.gz
Upload date: May 16, 2026
Size: 27.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for docseer-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`63a5e690a867614f4611c9aa9eea2c3b9057f6e27f9d8eac5ec4f6e66da97601`
MD5	`9536d2ae96748b05549fe30f72137629`
BLAKE2b-256	`94181cb0ff4a83695d82cd81a00c60a49b5ef7c1d9e24186e076db22d0fc88d3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for docseer-0.4.0.tar.gz:

Publisher: python-publish.yml on fellajimed/DocSeer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: docseer-0.4.0.tar.gz
- Subject digest: 63a5e690a867614f4611c9aa9eea2c3b9057f6e27f9d8eac5ec4f6e66da97601
- Sigstore transparency entry: 1554608335
- Sigstore integration time: May 16, 2026
Source repository:
- Permalink: fellajimed/DocSeer@d7709c343a1997d6e328645a7bd7d71cd7864fda
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/fellajimed
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@d7709c343a1997d6e328645a7bd7d71cd7864fda
- Trigger Event: push

File details

Details for the file docseer-0.4.0-py3-none-any.whl.

File metadata

Download URL: docseer-0.4.0-py3-none-any.whl
Upload date: May 16, 2026
Size: 27.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for docseer-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`94f2cb96fbfad82a734f56b67bafb5c9e531dd4bc55b47fbe5c23e0bc00a3bdd`
MD5	`0c1cf3f355eeff2054736ce856a7c685`
BLAKE2b-256	`68c295d120059b75dbafeb5e33bac2d45d730339a6a6a3feeaa0d1a061d30b93`

See more details on using hashes here.

Provenance

The following attestation bundles were made for docseer-0.4.0-py3-none-any.whl:

Publisher: python-publish.yml on fellajimed/DocSeer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: docseer-0.4.0-py3-none-any.whl
- Subject digest: 94f2cb96fbfad82a734f56b67bafb5c9e531dd4bc55b47fbe5c23e0bc00a3bdd
- Sigstore transparency entry: 1554608354
- Sigstore integration time: May 16, 2026
Source repository:
- Permalink: fellajimed/DocSeer@d7709c343a1997d6e328645a7bd7d71cd7864fda
- Branch / Tag: refs/tags/v0.4.0
- Owner: https://github.com/fellajimed
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish.yml@d7709c343a1997d6e328645a7bd7d71cd7864fda
- Trigger Event: push

docseer 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

DocSeer

Architecture

Prerequisites

Quick start

Using the CLI (recommended)

Using make

Running with make

Mode 1 — Fully Dockerized (default)

Mode 2 — Native macOS Ollama (recommended on Apple Silicon)

Service URLs

Configuration

Key settings

All make commands

Backend

TUI

Models

Development

CLI reference

TUI keyboard shortcuts

REST API overview

Papers

Chat

Tasks

Pipeline

Ingestion

Retrieval

Chunking strategy

Ingestion pipeline

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Using `make`

Running with `make`

All `make` commands