deepresearch-flow

Workflow tools for paper extraction, review, and research automation.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

nerdneils

These details have not been verified by PyPI

Project description

ai-deepresearch-flow logo

ai-deepresearch-flow

From documents to deep research insight — automatically.

English | 中文

The Core Pain Points

OCR Chaos: Raw markdown from OCR tools is often broken -- tables drift, formulas break, and references are non-clickable.
Translation Nightmares: Translating technical papers often destroys code blocks, LaTeX formulas, and table structures.
Information Overload: Extracting structured insights (authors, venues, summaries) from hundreds of PDFs manually is impossible.
Context Switching: Managing PDFs, summaries, and translations in different windows kills focus.

The Solution

DeepResearch Flow provides a unified pipeline to Repair, Translate, Extract, and Serve your research library.

Key Features

Smart Extraction: Turn unstructured Markdown into schema-enforced JSON (summaries, metadata, Q&A) using LLMs (OpenAI, Claude, Gemini, etc.).
Precision Translation: Translate OCR Markdown to Chinese/Japanese (.zh.md, .ja.md) while freezing formulas, code, tables, and references. No more broken layout.
Local Knowledge DB: A high-performance local Web UI to browse papers with Split View (Source vs. Translated vs. Summary), full-text search, and multi-dimensional filtering.
Coverage Compare: Compare JSON/PDF/Markdown/Translated datasets to find missing artifacts and export CSV reports.
OCR Post-Processing: Automatically fix broken references ([1] -> [^1]), merge split paragraphs, and standardize layouts.

Quick Start

1) Installation

# Recommended: using uv for speed
uv pip install deepresearch-flow

# Or standard pip
pip install deepresearch-flow

2) Configuration

Set up your LLM providers. We support OpenAI, Claude, Gemini, Ollama, and more.

cp config.example.toml config.toml
# Edit config.toml to add your API keys (e.g., env:OPENAI_API_KEY)

3) The "Zero to Hero" Workflow

Step 1: Extract Insights

Scan a folder of markdown files and extract structured summaries.

uv run deepresearch-flow paper extract \
  --input ./docs \
  --model openai/gpt-4o-mini \
  --prompt-template deep_read

Step 2: Translate Safely

Translate papers to Chinese, protecting LaTeX and tables.

uv run deepresearch-flow translator translate \
  --input ./docs \
  --target-lang zh \
  --model openai/gpt-4o-mini \
  --fix-level moderate

Step 3: Repair OCR Outputs (Recommended)

Recommended sequence to stabilize markdown before serving:

# 1) Fix OCR markdown (auto-detects JSON if inputs are .json)
uv run deepresearch-flow recognize fix \
  --input ./docs \
  --in-place

# 2) Fix LaTeX formulas
uv run deepresearch-flow recognize fix-math \
  --input ./docs \
  --model openai/gpt-4o-mini \
  --in-place

# 3) Fix Mermaid diagrams
uv run deepresearch-flow recognize fix-mermaid \
  --input ./paper_outputs \
  --json \
  --model openai/gpt-4o-mini \
  --in-place

# 4) Fix again to normalize formatting
uv run deepresearch-flow recognize fix \
  --input ./docs \
  --in-place

Step 4: Serve Your Database

Launch a local UI to read and manage your papers.

uv run deepresearch-flow paper db serve \
  --input paper_infos.json \
  --md-root ./docs \
  --md-translated-root ./docs \
  --host 127.0.0.1

Deployment (Static CDN)

Use a separate static server (CDN) for PDFs/Markdown/images and keep the API/UI on another host.

1) Export static assets

uv run deepresearch-flow paper db serve \
  --input paper_infos.json \
  --md-root ./docs \
  --md-translated-root ./docs \
  --pdf-root ./pdfs \
  --static-mode prod \
  --static-base-url https://static.example.com \
  --static-export-dir /data/paper-static

Notes:

The API host must be able to read the original PDF/Markdown roots to build the index and hashes.
The CDN host only needs the exported directory (e.g. /data/paper-static).

2) Serve the export directory with CORS + cache headers (Caddy example)

:8002 {
  root * /data/paper-static
  encode zstd gzip

  @static path /pdf/* /md/* /md_translate/* /images/*
  header @static {
    Access-Control-Allow-Origin *
    Access-Control-Allow-Methods GET,HEAD,OPTIONS
    Access-Control-Allow-Headers *
    Cache-Control "public, max-age=31536000, immutable"
  }

  @options method OPTIONS
  respond @options 204

  file_server
}

3) Start the API/UI with static base

export PAPER_DB_STATIC_BASE_URL="https://static.example.com"
export PAPER_DB_STATIC_MODE="prod"
export PAPER_DB_STATIC_EXPORT_DIR="/data/paper-static"
export PAPER_DB_PDFJS_CDN_BASE_URL="https://cdnjs.cloudflare.com/ajax/libs/pdf.js/4.0.379"

uv run deepresearch-flow paper db serve \
  --input paper_infos.json \
  --md-root ./docs \
  --md-translated-root ./docs \
  --pdf-root ./pdfs

Comprehensive Guide

1. Translator: OCR-Safe Translation

The translator module is built for scientific documents. It uses a node-based architecture to ensure stability.

Structure Protection: automatically detects and "freezes" code blocks, LaTeX ($$...$$), HTML tables, and images before sending text to the LLM.
OCR Repair: use --fix-level to merge broken paragraphs and convert text references ([1]) to clickable Markdown footnotes ([^1]).
Context-Aware: supports retries for failed chunks and falls back gracefully.

# Translate with structure protection and OCR repairs
uv run deepresearch-flow translator translate \
  --input ./paper.md \
  --target-lang ja \
  --fix-level aggressive \
  --model claude/claude-3-5-sonnet-20240620

2. Paper Extract: Structured Knowledge

Turn loose markdown files into a queryable database.

Templates: built-in prompts like simple, eight_questions, and deep_read guide the LLM to extract specific insights.
Async and throttled: precise control over concurrency (--max-concurrency) and rate limits (--sleep-every).
Incremental: skips already processed files; resumes from where you left off.

uv run deepresearch-flow paper extract \
  --input ./library \
  --output paper_data.json \
  --template-dir ./my-custom-prompts \
  --max-concurrency 10

3. Database and UI: Your Personal ArXiv

The db serve command creates a local research station.

Split View: read the original PDF/Markdown on the left and the Summary/Translation on the right.
Full Text Search: search by title, author, year, or content tags (tag:fpga year:2023..2024).
Stats: visualize publication trends and keyword frequencies.
PDF Viewer: built-in PDF.js viewer prevents cross-origin issues with local files.

uv run deepresearch-flow paper db serve \
  --input paper_infos.json \
  --pdf-root ./pdfs \
  --cache-dir .cache/db

4. Paper DB Compare: Coverage Audit

Compare two datasets (A/B) to find missing PDFs, markdowns, translations, or JSON items, with match metadata.

uv run deepresearch-flow paper db compare \
  --input-a ./a.json \
  --md-root-b ./md_root \
  --output-csv ./compare.csv

# Compare translated markdowns by language
uv run deepresearch-flow paper db compare \
  --md-translated-root-a ./translated_a \
  --md-translated-root-b ./translated_b \
  --lang zh

5. Recognize: OCR Post-Processing

Tools to clean up raw outputs from OCR engines like MinerU.

Embed Images: convert local image links to Base64 for a portable single-file Markdown.
Unpack Images: extract Base64 images back to files.
Organize: flatten nested OCR output directories.
Fix: apply OCR fixes and rumdl formatting during organize, or as a standalone step.
Fix JSON: apply the same fixes to markdown fields inside paper JSON outputs.
Fix Math: validate and repair LaTeX formulas with optional LLM assistance.
Fix Mermaid: validate and repair Mermaid diagrams (requires mmdc from mermaid-cli).
Recommended order: fix -> fix-math -> fix-mermaid -> fix.

uv run deepresearch-flow recognize md embed --input ./raw_ocr --output ./clean_md

# Organize MinerU output and apply OCR fixes
uv run deepresearch-flow recognize organize \
  --input ./mineru_outputs \
  --output-simple ./ocr_md \
  --fix

# Fix and format existing markdown outputs
uv run deepresearch-flow recognize fix \
  --input ./ocr_md \
  --output ./ocr_md_fixed

# Fix in place
uv run deepresearch-flow recognize fix \
  --input ./ocr_md \
  --in-place

# Fix JSON outputs in place
uv run deepresearch-flow recognize fix \
  --json \
  --input ./paper_outputs \
  --in-place

# Fix LaTeX formulas in markdown
uv run deepresearch-flow recognize fix-math \
  --input ./docs \
  --model openai/gpt-4o-mini \
  --in-place

# Fix Mermaid diagrams in JSON outputs
uv run deepresearch-flow recognize fix-mermaid \
  --json \
  --input ./paper_outputs \
  --model openai/gpt-4o-mini \
  --in-place

Docker Support

Don't want to manage Python environments?

docker run --rm -v $(pwd):/app -it ghcr.io/nerdneilsfield/deepresearch-flow --help

Configuration

The config.toml is your control center. It supports:

Multiple Providers: mix and match OpenAI, DeepSeek (DashScope), Gemini, Claude, and Ollama.
Model Routing: explicit routing to specific models (--model provider/model_name).
Environment Variables: keep secrets safe using env:VAR_NAME syntax.

See config.example.toml for a full reference.

Built with love for the Open Science community.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

nerdneils

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.9.7

Apr 19, 2026

0.9.6

Apr 18, 2026

0.9.5

Apr 18, 2026

0.9.4

Apr 18, 2026

0.9.3

Apr 18, 2026

0.9.2

Apr 17, 2026

0.9.1

Apr 16, 2026

0.9.0

Apr 15, 2026

0.8.5

Apr 10, 2026

0.8.4

Apr 1, 2026

0.8.3

Apr 1, 2026

0.8.2

Mar 30, 2026

0.8.1

Mar 28, 2026

0.7.10

Feb 11, 2026

0.7.9

Feb 11, 2026

0.7.8

Feb 11, 2026

0.7.7

Feb 11, 2026

0.7.6

Feb 11, 2026

0.7.5

Feb 11, 2026

0.7.4

Feb 10, 2026

0.7.3

Feb 5, 2026

0.7.2

Feb 5, 2026

0.7.1

Feb 5, 2026

0.7.0

Feb 5, 2026

0.6.1

Jan 30, 2026

0.6.0

Jan 30, 2026

This version

0.5.1

Jan 19, 2026

0.5.0

Jan 18, 2026

0.4.1

Jan 16, 2026

0.4.0

Jan 15, 2026

0.3.0

Jan 12, 2026

0.2.1

Jan 12, 2026

0.2.0

Jan 12, 2026

0.1.2

Jan 11, 2026

0.1.1

Jan 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepresearch_flow-0.5.1.tar.gz (5.6 MB view details)

Uploaded Jan 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

deepresearch_flow-0.5.1-py3-none-any.whl (6.0 MB view details)

Uploaded Jan 19, 2026 Python 3

File details

Details for the file deepresearch_flow-0.5.1.tar.gz.

File metadata

Download URL: deepresearch_flow-0.5.1.tar.gz
Upload date: Jan 19, 2026
Size: 5.6 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepresearch_flow-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`774cac54ec7e0869f9ad1243130c1f5c9e3b8e57a25f2e48a7f02a2ef6f7d5eb`
MD5	`4e0fda0dea7e2af91a10414ba6389ad6`
BLAKE2b-256	`ceaabe35b6e03d08c0084cbbc70b801cbe621d52b84ff4478e4a3e3fe9a5535f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepresearch_flow-0.5.1.tar.gz:

Publisher: push-to-pypi.yml on nerdneilsfield/ai-deepresearch-flow

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: deepresearch_flow-0.5.1.tar.gz
- Subject digest: 774cac54ec7e0869f9ad1243130c1f5c9e3b8e57a25f2e48a7f02a2ef6f7d5eb
- Sigstore transparency entry: 834389865
- Sigstore integration time: Jan 19, 2026
Source repository:
- Permalink: nerdneilsfield/ai-deepresearch-flow@a680aaacb6cc96070df5eaf7a0ceb7bcf1260a18
- Branch / Tag: refs/tags/v0.5.1
- Owner: https://github.com/nerdneilsfield
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: push-to-pypi.yml@a680aaacb6cc96070df5eaf7a0ceb7bcf1260a18
- Trigger Event: push

File details

Details for the file deepresearch_flow-0.5.1-py3-none-any.whl.

File metadata

Download URL: deepresearch_flow-0.5.1-py3-none-any.whl
Upload date: Jan 19, 2026
Size: 6.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for deepresearch_flow-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cb1230bf2dffea92583679caa68781851d65335a92a08ecd5dec85317187e5bf`
MD5	`c5701bc18b17613ba68e6dd1a8ce0e51`
BLAKE2b-256	`96f5b13b61155cef06b06cb42bbdbec4b87b239c0a1cb543c01076879427035a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for deepresearch_flow-0.5.1-py3-none-any.whl:

Publisher: push-to-pypi.yml on nerdneilsfield/ai-deepresearch-flow

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: deepresearch_flow-0.5.1-py3-none-any.whl
- Subject digest: cb1230bf2dffea92583679caa68781851d65335a92a08ecd5dec85317187e5bf
- Sigstore transparency entry: 834389866
- Sigstore integration time: Jan 19, 2026
Source repository:
- Permalink: nerdneilsfield/ai-deepresearch-flow@a680aaacb6cc96070df5eaf7a0ceb7bcf1260a18
- Branch / Tag: refs/tags/v0.5.1
- Owner: https://github.com/nerdneilsfield
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: push-to-pypi.yml@a680aaacb6cc96070df5eaf7a0ceb7bcf1260a18
- Trigger Event: push

deepresearch-flow 0.5.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

ai-deepresearch-flow

The Core Pain Points

The Solution

Key Features

Quick Start

1) Installation

2) Configuration

3) The "Zero to Hero" Workflow

Step 1: Extract Insights

Step 2: Translate Safely

Step 3: Repair OCR Outputs (Recommended)

Step 4: Serve Your Database

Deployment (Static CDN)

1) Export static assets

2) Serve the export directory with CORS + cache headers (Caddy example)

3) Start the API/UI with static base

Comprehensive Guide

Docker Support

Configuration

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance