End-to-End Researcher — automated research pipeline for economics, finance, and IS


E2ER — turn a research question into a paper

Hand E2ER a research question; get back a LaTeX paper with citations, an internal peer-review pass, and a runnable replication package — typically in ~25 minutes.

pip install e2er
e2er install-skills
export LLM_BACKEND=claude_code
e2er run "Does X affect Y?" --methodology empirical --max-cost 5

That's everything you need to run your first paper. See First run below for what happens next.

Install
First run
Pick a backend
What you get
Methodologies
Costs
Resume a paused paper
Data sources
Literature: bring your own BibTeX
Going deeper
Examples
Troubleshooting
Development (contributing)
Citing · Related projects · Contact

Install

Prerequisites: Python 3.11 or 3.12. That's it — SQLite is auto-created at ~/.e2er/papers.db, so no database setup is needed for the default flow.

pip install e2er
e2er install-skills      # bundles the skill files used by the specialists

To verify your install without spending any tokens:

e2er run --help          # CLI is wired

That's all you need to run a paper. The rest of this section covers optional setup.

Optional — Postgres + pgvector (for production, multi-user, or the literature KB):

export DATABASE_URL=postgresql://user:pass@host:5432/e2er
e2er migrate              # runs the schema migrations

Optional — GitHub integration (push each paper's LaTeX + replication package to its own repo):

export GITHUB_TOKEN=ghp_...     # token with `repo` scope
export GITHUB_OWNER=your-user-or-org

First run

export LLM_BACKEND=claude_code   # see "Pick a backend" below
e2er run "Does liquidity concentration in Uniswap v3 affect price discovery?" \
   --methodology empirical \
   --max-cost 5

What happens:

e2er run starts a local API server (uvicorn on :8280) if one isn't already running.
It submits the paper to POST /api/papers and gets back a paper_id + workspace path.
It tails the run to your terminal. Press ^C at any time — the run keeps going in the background; re-attach via the dashboard.
When the pipeline finishes, you'll see a summary line with the paper's terminal status (completed / rejected / paused).

Open the dashboard at http://127.0.0.1:8280 to see all papers, drill into per-specialist artifacts, watch the live cost meter, and download the audit bundle.

Files for a paper land in two places:

workspaces/<paper_id>/ on your filesystem — every artifact, every reviewer report, the replication package.
A dedicated GitHub repo per paper (if you've set GITHUB_TOKEN + GITHUB_OWNER), structured for direct Overleaf import.

Pick a backend

E2ER is "bring your own LLM" — choose whichever you already have access to. The CLI backends use your existing subscription, so the marginal cost per paper is $0.

Backend	Setting	Cost per paper	Install
Claude Code CLI (Anthropic Max)	`LLM_BACKEND=claude_code`	$0/token	`npm i -g @anthropic-ai/claude-code`
Codex CLI (ChatGPT Plus/Pro)	`LLM_BACKEND=codex_cli`	$0/token	`npm i -g @openai/codex`
Gemini CLI (Google AI Pro/Ultra)	`LLM_BACKEND=gemini_cli`	$0/token	`npm i -g @google/gemini-cli`
Anthropic SDK	`LLM_BACKEND=anthropic`	per-token	`export ANTHROPIC_API_KEY=...`
OpenRouter	`LLM_BACKEND=openrouter`	per-token	`export OPENROUTER_API_KEY=...` (200+ models)

First-run guardrail: the first paper at any (model, methodology, mode) combination is capped at $1.00 until one paper at that combination has completed successfully. This protects against a runaway tool-use loop on a model that hasn't been validated yet. Pass --acknowledge-unproven to lift the cap and use the full --max-cost you provided.
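
The guardrail boils down to a small rule for the budget that actually applies to a run. The sketch below is one way to express that rule, not E2ER's own code: `effective_cap`, `proven_combinations`, and the example model name are hypothetical names chosen for illustration. Only the $1.00 cap, the (model, methodology, mode) key, and the `--acknowledge-unproven` flag come from the text above.

```python
FIRST_RUN_CAP_USD = 1.00  # cap quoted above for unproven combinations


def effective_cap(max_cost, combo, proven_combinations, acknowledge_unproven=False):
    """Return the budget (USD) that applies to a run.

    combo is a (model, methodology, mode) tuple. proven_combinations is a
    hypothetical set of combos that have completed successfully at least once.
    """
    if acknowledge_unproven or combo in proven_combinations:
        return max_cost
    # Unproven combination: never allow more than the first-run cap.
    return min(max_cost, FIRST_RUN_CAP_USD)


combo = ("sonnet-4.6", "empirical", "single_pass")  # illustrative values only
print(effective_cap(5.0, combo, proven_combinations=set()))
print(effective_cap(5.0, combo, set(), acknowledge_unproven=True))
print(effective_cap(5.0, combo, proven_combinations={combo}))
```

Using `min` means a `--max-cost` below $1.00 is still respected on a first run.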

What you get

Every paper produces this artifact set in workspaces/<paper_id>/:

File	Description
`paper_plan.md`	Research design, propositions, identification strategy
`literature_review.md`	Related-work synthesis with citations
`identification_strategy.md`	Causal identification argument and threats
`econometric_spec.md`	Econometric specification with equations
`data_dictionary.json`	Pre-specified data footprint (fields, time filter, granularity)
`data_summary.md`	Data acquisition narrative
`summary_statistics.json`	Machine-readable descriptive stats — consumed by `verify_numbers` and the drafter
`estimation_results.json`	Machine-readable point estimates, SEs, t-stats, p-values
`figure_spec.json`	Numeric values for every figure
`paper_draft.tex`	Full LaTeX manuscript
`abstract.tex`	Standalone abstract
`self_attack_report.json`	Adversarial flaw-finding report with severity scores
`review_*.md`	Structured reviews from 6 specialist reviewers
`review_aggregation.json`	Mechanical aggregation verdict (`MECHANISM_FAIL` / `ACCEPT` / `MINOR_REVISION` / `MAJOR_REVISION` / `HARD_REJECT`)
`number_verification.json`	Anti-hallucination gate report — every table number checked against the JSON sidecars
`replication/estimation.py`	Main econometric estimation code
`replication/data_queries.sql`	All data queries used in the paper
`replication/audit_log.csv`	Complete data-access audit trail

If GITHUB_TOKEN is set, all of the above are also pushed to a dedicated paper repo with an Overleaf-compatible layout.
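
To make the `number_verification.json` row more concrete: the gate compares numbers printed in the draft's tables with the values stored in the JSON sidecars. The sketch below illustrates that idea under simplifying assumptions and is not the pipeline's implementation. The function names, the rounding tolerance, and the sidecar keys (`beta`, `se`, `n_obs`) are all hypothetical; the real sidecar schema is not described here.

```python
import json
import re

NUMBER = re.compile(r"-?\d+(?:\.\d+)?")


def sidecar_numbers(obj):
    """Recursively collect every numeric leaf from parsed JSON."""
    if isinstance(obj, bool):  # bool is a subclass of int, so exclude it first
        return []
    if isinstance(obj, (int, float)):
        return [float(obj)]
    if isinstance(obj, dict):
        return [n for value in obj.values() for n in sidecar_numbers(value)]
    if isinstance(obj, list):
        return [n for value in obj for n in sidecar_numbers(value)]
    return []


def mismatches(table_tex, sidecar, decimals=3):
    """Return table numbers that match no sidecar value after rounding."""
    known = {round(n, decimals) for n in sidecar_numbers(sidecar)}
    found = [float(token) for token in NUMBER.findall(table_tex)]
    return [n for n in found if round(n, decimals) not in known]


# Hypothetical sidecar content and table rows, for illustration only.
sidecar = json.loads('{"beta": 0.04217, "se": 0.0113, "n_obs": 1520}')
good_table = r"Liquidity concentration & 0.042 & (0.011) \\ Observations & 1520 \\"
bad_table = r"Liquidity concentration & 0.052 & (0.011) \\ Observations & 1520 \\"

print(mismatches(good_table, sidecar))
print(mismatches(bad_table, sidecar))
```

This version assumes the table prints values at the same precision as `decimals`; a table rounded to two decimals would need `decimals=2`.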

Methodologies

Pick one per paper via --methodology:

empirical (default) — data-driven; runs identification, data, and econometrics specialists.
theoretical — formal model + propositions; skips data and replication phases (and the data reviewer).
mixed — formal model AND empirical test.

Most users want empirical. theoretical is for pure-model papers (no data, just propositions and proofs); the pipeline costs ~30% less because the data specialists and replication packager are skipped.

Costs

Mode	Model	Typical cost	Notes
`single_pass`	Haiku 4.5	~$0.50	Fast draft. What `make smoke-paid` uses.
`single_pass`	Sonnet 4.6	$3 – $8	Better depth, one pass through the pipeline.
`iterative`	Sonnet 4.6	$15 – $25	Full loop: ceiling check → self-attack → polish → review → revision. Hard-capped at `--max-cost` (default $25).
any	Claude Code / Codex / Gemini CLI	$0	Flat-rate subscription absorbs the cost. The dollar meter is a synthetic estimate at Sonnet rates and still drives the budget gate.

Budget safety. Every paper has a hard cap (--max-cost, default $25). The pipeline checks cumulative cost at every phase boundary; when the cap is reached the run transitions to paused (resumable — see below) rather than crashing.
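
The phase-boundary check can be pictured as a loop that looks at cumulative spend before starting each phase. This is a minimal sketch under stated assumptions, not the runner's actual code: `run_with_budget` and the phase names and costs are made up for illustration.

```python
def run_with_budget(phases, max_cost):
    """Run phases in order, checking cumulative cost at each phase boundary.

    phases is a list of (name, cost_usd) pairs standing in for real specialists.
    Returns (status, completed_phase_names, spent_usd).
    """
    spent = 0.0
    completed = []
    for name, cost in phases:
        # Boundary check: stop before starting a phase once the cap is reached.
        if spent >= max_cost:
            return "paused", completed, spent
        spent += cost
        completed.append(name)
    return "completed", completed, spent


# Hypothetical per-phase costs in USD.
phases = [("study_design", 2.0), ("data", 2.5), ("estimation", 1.0), ("writing", 3.0)]
print(run_with_budget(phases, max_cost=4.0))
print(run_with_budget(phases, max_cost=25.0))
```

Because the check in this sketch runs only between phases, a phase already in flight can push spend past the cap before the pause takes effect. Whether E2ER also interrupts a phase mid-run is not stated here.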

Resume a paused paper

Papers pause for two reasons, both recoverable:

Budget exhausted — the per-paper cap was reached.
Circuit breaker — a non-tolerant specialist failed _MAX_SPECIALIST_ATTEMPTS times in a row (typically a data-layer outage).

For budget pauses, raise the cap atomically with the resume:

curl -X POST http://127.0.0.1:8280/api/papers/<paper_id>/resume \
  -H "Content-Type: application/json" \
  -d '{"max_cost_usd": 15}'

For circuit-breaker pauses, fix the underlying problem (e.g. restore data-source access) first, then POST with no body to retry. The runner's resume-from-disk logic skips any phase that already produced its canonical artifact, so you don't re-pay for completed work.

The dashboard's "Resume" button does the same thing through the UI.
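
For scripting, the same resume call can be built from Python with only the standard library. The sketch below mirrors the curl command above. The helper name `build_resume_request` is hypothetical, and the `Authorization` header is only relevant if you have set API_AUTH_TOKEN (see Troubleshooting). The request is built but not sent, so it can be inspected without a running server.

```python
import json
import urllib.request


def build_resume_request(paper_id, max_cost_usd=None, api_token=None,
                         base_url="http://127.0.0.1:8280"):
    """Build (but do not send) the POST request for the resume endpoint."""
    headers = {}
    body = None
    if max_cost_usd is not None:
        # Budget pause: raise the cap in the same call as the resume.
        body = json.dumps({"max_cost_usd": max_cost_usd}).encode("utf-8")
        headers["Content-Type"] = "application/json"
    if api_token:
        headers["Authorization"] = f"Bearer {api_token}"
    return urllib.request.Request(
        f"{base_url}/api/papers/{paper_id}/resume",
        data=body,
        headers=headers,
        method="POST",
    )


req = build_resume_request("PAPER_ID", max_cost_usd=15)  # placeholder paper id
print(req.get_method(), req.full_url)
# To send it against a running server: urllib.request.urlopen(req)
```

Calling `build_resume_request("PAPER_ID")` with no cap produces the body-less POST used for circuit-breaker pauses.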

Data sources

The data module is optional. Set DATA_MODULE_ENABLED=false to run literature-only papers, or supply your own data files in the workspace's data/ directory.

Currently wired in:

Source	Coverage	Setup
yfinance	Equities, ETFs, crypto, FX, indices	No key required
FRED	US + international macro time series	Free key (~30s registration at https://fred.stlouisfed.org)
Allium	On-chain blockchain data	Bring your own key (`ALLIUM_API_KEY`)

Allium guardrails (when enabled)

Every Allium query passes through 5 guardrails before execution:

No SELECT * — all fields must be listed explicitly.
All requested fields must be declared in the paper's data_dictionary.json.
A time-bound WHERE clause is required on every query.
Transaction-level granularity requires written justification.
Production queries require a prior approved feasibility run on the same table.

Two-phase workflow: feasibility queries (1000-row sample) are auto-approved; production queries are queued for researcher approval at GET /api/papers/{id}/pending-queries.
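
The first three guardrails are static checks on the query text, so they can be approximated with a few lines of Python. This is a rough sketch to show the idea, not the checker E2ER ships: `check_query`, the `block_timestamp` column, and the `dex.trades` table are hypothetical, and the regex parsing only handles plain column lists (no functions or subqueries).

```python
import re


def check_query(sql, declared_fields, time_columns=("block_timestamp",)):
    """Return a list of guardrail violations for one SQL query (empty = pass)."""
    match = re.search(r"select\s+(.*?)\s+from\s", sql, re.IGNORECASE | re.DOTALL)
    if match is None:
        return ["could not find a SELECT ... FROM clause"]

    # Strip table qualifiers and aliases: "t.amount AS amt" -> "amount".
    fields = [part.strip().split()[0].split(".")[-1]
              for part in match.group(1).split(",") if part.strip()]

    violations = []
    if "*" in fields:
        violations.append("SELECT * is not allowed; list fields explicitly")
    undeclared = [f for f in fields if f != "*" and f not in declared_fields]
    if undeclared:
        violations.append(f"fields not in data dictionary: {undeclared}")

    where = re.search(r"\bwhere\b(.*)", sql, re.IGNORECASE | re.DOTALL)
    where_text = where.group(1).lower() if where else ""
    if not any(col.lower() in where_text for col in time_columns):
        violations.append("missing time-bound WHERE clause")
    return violations


declared = {"block_timestamp", "amount_usd"}  # stand-in for data_dictionary.json
ok = ("SELECT block_timestamp, amount_usd FROM dex.trades "
      "WHERE block_timestamp >= '2024-01-01'")
print(check_query(ok, declared))
print(check_query("SELECT * FROM dex.trades", declared))
```

The last two guardrails (written justification and a prior approved feasibility run) depend on state outside the query, so they are left out of this sketch.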

We gratefully acknowledge Allium for supporting this research through data access and technical collaboration.

Literature: bring your own BibTeX

E2ER does not automatically retrieve papers from the internet. Supply a .bib file of your own curated references:

export LITERATURE_BIBTEX_FILE=/path/to/refs.bib

When set, the pipeline:

Parses all entries at startup (requires bibtexparser — included in pip install e2er).
Injects a compact reference list into the prompts of literature_scanner, paper_drafter, section_writer, abstract_writer, and revisor.
Copies the .bib file into the workspace so LaTeX can compile with \bibliography{refs}.

A typical workflow: export your references from Zotero / Mendeley as refs.bib, set the env var, and the drafter uses \cite{} commands aligned with your BibTeX keys.
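
Since the draft's `\cite{}` keys have to line up with your BibTeX keys, a quick check for missing keys can be useful after a run. The sketch below uses plain regular expressions from the standard library rather than bibtexparser, so it is a simplified stand-in for real BibTeX parsing. The function names and the sample entries are hypothetical placeholders, not real references.

```python
import re

BIB_ENTRY = re.compile(r"@(\w+)\s*\{\s*([^,\s]+)\s*,")
CITE_CMD = re.compile(r"\\cite[a-zA-Z]*\*?(?:\[[^\]]*\])*\{([^}]*)\}")


def bib_keys(bib_text):
    """Collect citation keys from BibTeX source, skipping non-entry blocks."""
    skip = {"comment", "string", "preamble"}
    return {key for kind, key in BIB_ENTRY.findall(bib_text)
            if kind.lower() not in skip}


def missing_citations(tex_text, bib_text):
    """Return keys cited in the LaTeX text that have no entry in the .bib file."""
    cited = set()
    for group in CITE_CMD.findall(tex_text):
        cited.update(key.strip() for key in group.split(",") if key.strip())
    return sorted(cited - bib_keys(bib_text))


# Placeholder entries for illustration only.
bib = """
@article{doe2020, title={Placeholder title A}, year={2020}}
@book{roe2021, title={Placeholder title B}, year={2021}}
"""
tex = r"As shown by \citet{doe2020} and \citep[p.~3]{roe2021, poe2022}."

print(sorted(bib_keys(bib)))
print(missing_citations(tex, bib))
```

Run against `paper_draft.tex` and your refs.bib, an empty result means every cited key has a matching entry.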

Planned: open-access paper fetching via OpenAlex, Semantic Scholar, and arXiv is implemented in src/modules/literature/ but not yet wired into the pipeline. Contributions welcome.

Going deeper

For a high-level mental model before diving into the code:

Pipeline overview — full flow from idea to completion (mermaid diagram).
Specialist DAG — execution dependencies and parallel groups.
Review aggregation — the 3 mechanical rules that turn 6 reviewer scores into a verdict.
Interactive architecture diagram — open in a browser.

Pipeline phases

[Researcher input: RQ + optional BibTeX + optional data]
          |
          v
    1. Study Design      idea_developer, literature_scanner, identification_strategist
    2. Data              data_architect → data_analyst → summary_statistics.json
    3. Estimation        econometrics_specialist → estimation_results.json
    4. Writing           paper_drafter, abstract_writer, latex_formatter
          |
          v  (iterative mode only)
    5. Ceiling Check     Strategist assesses whether further iteration adds value
    6. Self-Attack       Adversarial specialist finds critical flaws (severity 1-10)
    7. Polish            5 parallel specialists: formula, numerics, institutions, bibliography, equilibria
          |
          v
    8. verify_numbers    Programmatic gate: every table number must match a JSON sidecar
    9. Review            6 parallel reviewers (5 for theoretical): mechanism, technical,
                         identification, literature, data, writing
   10. Aggregation       3-rule mechanical verdict
   11. Revision          Revisor specialist addresses feedback (if MAJOR_REVISION)
   12. Replication       Packages all queries, code, and audit trail
   13. GitHub Push       LaTeX + replication package committed to paper repo
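
One way to read the diagram is as a dependency graph in which specialists with no unmet dependencies can run in parallel. The sketch below uses the standard library's `graphlib` to derive such parallel groups. The edges in `DEPENDS_ON` are assumptions inferred loosely from the diagram for illustration; the authoritative dependencies are in the Specialist DAG document, and `parallel_groups` is a hypothetical helper.

```python
from graphlib import TopologicalSorter

# Maps each specialist to the specialists it waits for (assumed edges).
DEPENDS_ON = {
    "literature_scanner": {"idea_developer"},
    "identification_strategist": {"idea_developer"},
    "data_architect": {"identification_strategist"},
    "data_analyst": {"data_architect"},
    "econometrics_specialist": {"data_analyst"},
    "paper_drafter": {"literature_scanner", "econometrics_specialist"},
}


def parallel_groups(depends_on):
    """Return lists of nodes; each list can run once the previous lists are done."""
    sorter = TopologicalSorter(depends_on)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    groups = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        groups.append(ready)
        sorter.done(*ready)
    return groups


for step, group in enumerate(parallel_groups(DEPENDS_ON), start=1):
    print(step, group)
```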

Review aggregation rules

Applied in order; first match wins:

Rule	Condition	Verdict
1	Mechanism reviewer score < 5	`MECHANISM_FAIL` — fundamental revision required
2	Any reviewer score < 4	`HARD_REJECT` — floor violation
3	Weighted average (technical ×1.5, identification ×1.5, data ×1.25)	`ACCEPT` / `MINOR_REVISION` / `MAJOR_REVISION` / `HARD_REJECT`
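
The "first match wins" ordering can be sketched as a short function. This is an illustration under stated assumptions rather than the shipped aggregator: reviewers not listed in rule 3 are assumed to have weight 1.0, and because the cutoffs that map the weighted average to a verdict are not given here, the function returns the average itself for rule 3 instead of guessing a verdict.

```python
WEIGHTS = {"technical": 1.5, "identification": 1.5, "data": 1.25}  # from rule 3
DEFAULT_WEIGHT = 1.0  # assumption: unlisted reviewers count once


def aggregate(scores):
    """Apply the three rules in order to a {reviewer: score} mapping.

    Returns (verdict, weighted_average). Rules 1 and 2 return a verdict and
    None; rule 3 returns None and the weighted average.
    """
    if scores["mechanism"] < 5:
        return "MECHANISM_FAIL", None
    if any(score < 4 for score in scores.values()):
        return "HARD_REJECT", None
    total_weight = sum(WEIGHTS.get(name, DEFAULT_WEIGHT) for name in scores)
    weighted_sum = sum(WEIGHTS.get(name, DEFAULT_WEIGHT) * score
                       for name, score in scores.items())
    return None, weighted_sum / total_weight


# Hypothetical scores for the six reviewers.
scores = {"mechanism": 7, "technical": 6, "identification": 8,
          "literature": 7, "data": 6, "writing": 8}
print(aggregate(scores))
print(aggregate({**scores, "mechanism": 4}))
print(aggregate({**scores, "writing": 3}))
```

Passing a five-reviewer mapping without `data` covers the theoretical case, since the weights are normalised over whichever reviewers are present.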

Examples

The repo ships with worked examples — real artifacts from real runs:

examples/e2er_v3_haiku_smoke/ — single-pass v3 run on Haiku 4.5 (~$1.50, ~11 min), data module disabled. Pipeline plumbing only — not findings.
examples/starter_theoretical/ — minimal theoretical paper template you can copy as a starting point.
examples/e2er_v1_nft_seasonality/ — full v1 paper (PDF + LaTeX + replication) testing whether the Halloween effect extends to NFT markets. Null result; 35.8M Ethereum NFT trades.
examples/e2er_v1_bitcoin_institutionalization/ — full v1 paper on Bitcoin volatility convergence around the January 2024 ETF approval. GARCH + Markov-switching + DiD + Rambachan-Roth.

These results have not been submitted to a journal and should not be cited as peer-reviewed findings.

[Figure: Monthly NFT returns. Monthly return distribution by platform; pipeline-generated, from the NFT seasonality example.]

Troubleshooting

e2er: command not found — pip install e2er succeeded but the script directory isn't on your PATH. Try python -m e2er run "..." instead, or add your ~/.local/bin (or venv bin/) to PATH.

pip install e2er errors with ImportError: cannot import name 'UTC' from 'datetime' — your local Python is < 3.11. E2ER requires 3.11+. Use pyenv install 3.11 or brew install python@3.12.

Paper stuck in in_progress forever: check workspaces/<paper_id>/.pipeline_state.json for the last completed phase and ~/.e2er/uvicorn.log for errors. Restart uvicorn and hit /resume; the runner reads .pipeline_state.json and skips completed phases.

Paper paused with BudgetExceededError — raise the cap and resume: curl -X POST http://127.0.0.1:8280/api/papers/<id>/resume -d '{"max_cost_usd": 15}' -H "Content-Type: application/json".

Paper rejected with verify_numbers: N critical mismatches — the drafter cited table numbers that don't match the JSON sidecars. Open number_verification.json for the specific mismatches. Either revise the source artifacts (summary_statistics.json etc.) to match the draft, or revise the draft to match the sources, then resume.

Allium API key error / data module crashes — set DATA_MODULE_ENABLED=false in your environment. The pipeline runs literature-only (or with manually uploaded data files) without Allium.

OpenRouter 402 Payment Required — your OpenRouter balance is zero. Top up at https://openrouter.ai/credits. The pipeline correctly bails rather than looping.

Authorization header missing on JSON POSTs — you set API_AUTH_TOKEN but didn't include -H "Authorization: Bearer <token>" on the request. The HTML dashboard form is exempt.

Development (contributing)

For local development on the repo itself (rather than pip install e2er):

git clone https://github.com/bhanneke/E2ER-project.git
cd E2ER-project
pip install -e ".[dev]"
make smoke          # full mocked test suite — ~15s, no API key needed

If make smoke reports 420+ passed, your install is good and the orchestration works end-to-end. Then:

make lint           # ruff check + format check
make typecheck      # mypy
make smoke-paid     # ~$0.50 Haiku run end-to-end (requires ANTHROPIC_API_KEY)

Docker path (postgres + dashboard in one command):

./scripts/quickstart.sh    # prompts for ANTHROPIC_API_KEY, runs `docker compose up --build`

See AGENTS.md for the branch model, lane structure, and contribution conventions. See CONTRIBUTING.md for the PR process, and skills/CONTRIBUTING_SKILLS.md for the skill-file pattern (the lowest-friction way to contribute — markdown only, no code changes).

Related projects

The automated research space is developing quickly. Two projects most relevant to E2ER:

Project APE (Social Catalyst Lab, University of Zurich) — AI agents identifying policy questions with credible causal identification strategies, running econometric analysis, and producing complete papers. ~1,000 papers generated; now in systematic evaluation against peer-reviewed journals. Closest in spirit to E2ER.
ZeroPaper (Institute for Automated Research) — ~30 specialised agents across 10 stages, focused on theory-first finance and macroeconomics. E2ER adopts four quality-control ideas from ZeroPaper (ceiling detection, self-attack, parallel polish, mechanical aggregation).

Roadmap highlights

More data sources: WRDS, OpenBB, Census, BLS, ECB, World Bank, Dune, Flipside — the data module is designed to be extended. See docs/iv_database.md for the natural-experiments catalogue.
Evaluation framework: docs/evaluation_framework.md — six scored dimensions (identification, execution, writing, literature, replication, novelty) plus automated metrics.
Testers wanted: if you're working on an empirical question in IS, economics, finance, or adjacent fields and want to run the pipeline on your own data, contact hanneke@wiwi.uni-frankfurt.de.

Citing

@software{hanneke2026e2er,
  author       = {Hanneke, Bj{\"o}rn},
  title        = {{E2ER: End-to-End Researcher, An Open-Source Pipeline
                   for Automated Empirical Research}},
  year         = {2026},
  version      = {0.5.0},
  url          = {https://github.com/bhanneke/E2ER-project},
  doi          = {10.5281/zenodo.20187238},
  license      = {MIT},
  institution  = {Goethe University Frankfurt},
}

Cite the concept DOI 10.5281/zenodo.20187238 to credit any version (resolves to the latest release), or browse all versions on Zenodo to pin a specific snapshot. A companion paper describing the system architecture is in preparation.

Contact

Björn Hanneke · bjornhanneke.com · hanneke@wiwi.uni-frankfurt.de

PhD Candidate, Goethe University Frankfurt — Chair of Information Systems and Information Management (Prof. Dr. Oliver Hinz).

ORCID · Google Scholar · LinkedIn

MIT License: see LICENSE.

Release history

Version	Release date
0.5.0 (this version)	May 21, 2026
0.4.5	May 19, 2026
0.4.4	May 19, 2026
0.4.3	May 19, 2026
0.4.2	May 18, 2026
0.4.1	May 18, 2026
0.4.0	May 18, 2026
0.3.0	May 16, 2026
