Skip to main content

Read arXiv papers for real: structured analysis, grounded RAG Q&A, and reproduction tutoring from the command line.

Project description

PaperMind

English · 简体中文

Read any paper — from skim to reproduction — with every key claim traced back to the source. Paste an arXiv link, DOI, paper title, or any paper page / PDF URL (or upload a PDF): structured analysis, grounded & citation‑verified Q&A, a whole‑method framework diagram (downloadable SVG), and a runnable reproduction guide.

CI PyPI Python License: MIT

A paper-style teaching diagram PaperMind generates for a technical point

↑ A paper-style teaching diagram PaperMind generates for a technical point (real output) · full sample report · gallery →

Why it isn't just another chat‑with‑PDF

  • 🔬 We trained our own retrieval reranker — +14pt Recall@5 over a strong dense baseline (QASPER, held‑out test). Most "chat with a paper" tools wrap an API; PaperMind's evidence retrieval is a cross‑encoder we fine‑tuned and measured (see Benchmarks).
  • 🔎 Key claims are labeled and citation‑verified. Answers are split into fact / inference (with confidence) / out‑of‑scope, and each cited quote is checked against the actual paper text — if it can't be found, it's flagged ⚠️ rather than silently trusted.
  • 📐 Whole‑method framework diagram. Beyond per‑point figures, PaperMind reconstructs the paper's end‑to‑end method as a single Figure‑1‑style architecture diagram (steps the paper only implies are marked inferred) — view it on the web /framework tab and download it as SVG.
  • 🛠️ Reproduction grounded in the real code repo. PaperMind locates the paper's code repository and builds setup.sh from the repo's actual dependency file and README run‑commands — not a hallucinated guess.
  • 🆓 Runs fully local & free. papermind demo works offline; --local routes everything through Ollama; with no key it auto‑falls back to local.
PaperMind Typical chat‑with‑PDF / SaaS readers
Per‑claim citation with verification (flags unverifiable ⚠️)
Reproduction → runnable setup.sh from the real repo
Self‑trained RAG reranker, measured on QASPER
Runs fully local / offline, zero cost
Open‑source & self‑hostable

“—” = not a documented capability we're aware of; open an issue to correct us. Open‑source/local rows are clear‑cut for closed SaaS tools.

Three ways to use it

How Best for
🌐 Online open papermind.try2026.cn — zero install trying it instantly
🖥️ GUI pip install "papermind-ai[web]"papermind ui full features, in your browser
⌨️ CLI / API pip install papermind-aipapermind analyze https://arxiv.org/abs/2307.08691 scripting, batch, integration

🆓 No spend: papermind demo shows the output offline (no key); add --local to any command to run fully on local Ollama at zero cost.

What you get

╭──────────────── Attention Is All You Need · 2017 · arXiv:1706.03762 ───────────────╮
🎯 Contributions   A fully attention-based model (Transformer): no recurrence/convolution…
🔬 Technical       1. Scaled Dot-Product Attention [high]  softmax(QKᵀ/√d_k)·V; scaling avoids
                      saturated gradients   💡 analogy + 📊 paper-style teaching diagram (SVG)
💬 Grounded Q&A    "Why divide by √d_k?" → [FACT] large d_k inflates dot-product variance…
                      📌 source Section 3.2.1 (p.4) ✓ verified
🛠️ Reproduction    code repo (★24k) github.com/… → setup.sh (real deps + README run commands)

Full rendered reports (open on GitHub — paper‑style diagrams, jump‑to‑source citations, reproduction tables): Transformer · ViT · PPO · the full gallery →

Gallery — 7 papers across 5 areas

PaperMind isn't NLP‑only. Real reports generated by the pipeline, grouped by area:

Area Papers
NLP / Attention Transformer · FlashAttention‑2 · Llama 2
Computer Vision ViT
Generative / Diffusion Latent Diffusion (Stable Diffusion)
Reinforcement Learning PPO
Efficient fine‑tuning LoRA

Benchmarks — the reranker is trained and measured, not a black box

We fine‑tuned a cross‑encoder reranker (bge‑reranker‑base) on QASPER. Over all‑paragraph candidates, versus a strong dense baseline:

Recall@5 MRR nDCG@10
Dense (bge‑small‑en‑v1.5) 0.519 0.463 0.469
+ our reranker 0.660 0.612 0.609

Consistent across dev (888 q) and a held‑out test (1309 q) — no overfitting. Reproduce it: docs/RESEARCH_PLAN.md (trainer/ to train · evaluation/ to evaluate).

Self‑host the online service

One Docker command. Public exposure is safe by default (demo mode is read‑only cache, never spends a key):

git clone https://github.com/Wenhao-Hua/papermind && cd papermind
docker build -t papermind . && docker run -p 8080:8080 papermind            # demo mode

docker run -p 8080:8080 -e OPENAI_API_KEY=sk-... -e PAPERMIND_TRUST_PROXY=1 papermind \
       papermind serve --host 0.0.0.0 --port 8080 --live                    # live (your key, billed)

--live is rate‑limited by default (8/IP/day, 300/day global; tune with --rate-per-ip / --rate-global, 0 = unlimited) so a public deployment can't drain your key. Free ops (search / cached / offline demo) aren't limited.

Behind Cloudflare/a reverse proxy, set PAPERMIND_TRUST_PROXY=1 — otherwise every visitor shares one per‑IP quota (the proxy's IP) and per‑IP limiting silently breaks. Only set it when you're actually behind a trusted proxy that sets cf-connecting-ip.

Faster figures: diagrams default to the main model (cleanest layout, slower). Point figures at a fast non‑thinking model to speed them up: papermind config set figure-model deepseek/deepseek-chat.

What it can do

analyze (4‑module report) · summary (TL;DR) · ask/chat (grounded Q&A) · tutor/debug (reproduction help) · compare (multi‑paper) · reproduce (export setup.sh / notebook) · search/batch/list · cite.

Models via litellm: OpenAI / Anthropic / DeepSeek / Gemini / Qwen (DashScope) / local Ollama. Results are cached locally — re‑runs are instant. More: papermind --help.

Contributing / License

PRs and issues welcome (CONTRIBUTING.md) · MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

papermind_ai-0.1.0.tar.gz (181.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

papermind_ai-0.1.0-py3-none-any.whl (147.8 kB view details)

Uploaded Python 3

File details

Details for the file papermind_ai-0.1.0.tar.gz.

File metadata

  • Download URL: papermind_ai-0.1.0.tar.gz
  • Upload date:
  • Size: 181.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for papermind_ai-0.1.0.tar.gz
Algorithm Hash digest
SHA256 940e1f69c8500c11688c86d607eb491e1c3ff9097c9af745c814557f8d884a42
MD5 827823a15469a73c5cc6f98c9cb8738c
BLAKE2b-256 7538cd425b68acb3d86ab3ef925cc6da86211d763442dcedf6bd01cc8559c928

See more details on using hashes here.

File details

Details for the file papermind_ai-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: papermind_ai-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 147.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for papermind_ai-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a526da8bad357fbac16600d19d8276d7de5d19cca24e846e671b1195be198285
MD5 3146c9b9fd1f7d5f7887f5195a1f22e6
BLAKE2b-256 906a7bf4dafc6b9311ff45d98dbda1928ccb2499e1fc18f56e247942b9f24295

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page