Skip to main content

Bulge-tier Excel financial model factory. Every cell live-formulated, every number traceable. MCP-native.

Project description

ModelForge

Version Tests Trust MCP Templates SBOM

Bulge-tier Excel financial model factory for credit & structured finance. Every cell live-formulated. Every number traceable back to the source document page it came from.

A developer tool for analysts and engineers who build credit and corporate-finance models programmatically. Covers unitranche, sponsor-backed LBO, project finance, real estate credit, NPL, structured credit, restructuring, M&A, DCF and IPO templates. Extensible to any asset class.

The moat: builds are byte-identical deterministic (same spec → same workbook bytes, every run) and ship with a verifiable manifest + certificate — formula integrity, accounting/conservation invariants (balance sheet balances, cash ties out), and SHA-256 hashes of spec + sources + workbook. Run certify --strict / build --trust-strict and it's fail-closed: non-zero exit on any integrity violation, so a broken model never ships. That's model generation with a portable audit trail — not just generation. For an AI agent or an app that emits financial models, it's the layer that turns "the LLM produced a spreadsheet" into "here is a certificate that the spreadsheet is internally correct and reproducible."


🚀 Using ModelForge in production — or want managed features, priority support, or a specific template/connector? Tell me about your use case → — I read every one.


What this solves

  • Your agent needs to produce an Excel model from a structured spec — without an LLM hallucinating numbers directly into cells. ModelForge keeps the model deterministic: the LLM writes a typed YAML spec with source IDs, and a Python builder emits the live-formula workbook.
  • You need every output number to be auditable back to where it came from — without manually maintaining a sources sheet. Each hardcoded input carries a source ID, and the model's linkage graph is persisted to SQLite so a cell can be traced to its driver, source, and document page.
  • You want a model that recalculates instead of being a static dump — without writing formula strings by hand. Every cell is a real Excel formula, with named ranges, sign conventions, and WORST/BASE/BEST scenario toggles wired across sheets.
  • You need to gate a workbook for review — without eyeballing it. The QC tool runs an automated structural check suite (QC sheet present, named ranges populated, source references resolve, print areas set, no orphan sheets) and returns a per-check pass/fail report.
  • You need to triage many candidate deals fast — without building a workbook for each one. The screening tool filters and ranks a directory of spec YAMLs by quantitative criteria (margins, leverage, IRR) on their screening: block alone.
  • You want the whole pipeline available to an AI assistant — without bespoke glue code. ModelForge ships an MCP server (modelforge-mcp) so agents in Claude Code, Cursor, Cline, or ChatGPT Enterprise can list templates, build, QC, trace lineage, ingest a data room, and export deliverables.

Use it inside Claude Code, Cursor, ChatGPT Enterprise (MCP-native)

PyPI name: modelforge-finance (the unscoped modelforge was taken by source{d}'s ML library). Import name stays modelforge.

pip install "modelforge-finance[mcp,export]"

# wire into your MCP client config:
{
  "mcpServers": {
    "modelforge": { "command": "modelforge-mcp" }
  }
}

Then in your AI assistant:

"Build me a unitranche LBO model from this YAML spec, export the committee deck."

Tools available: list_templates · build_model · qc_workbook · list_sources · lineage_walk · ingest_dataroom · screen_deals · compute_tax · export_pptx · export_docx · plus 7 unified-feed tools (data_providers_status · quote · history · fundamentals · search_filings · entity_lookup · search_securities) across a 14-provider data stack.

The architectural principle

LLMs produce specs + sources + narrative. Deterministic Python produces the workbook.

The LLM never writes a number into a cell. It writes a typed YAML spec with source IDs. A deterministic builder emits the Excel via openpyxl. A QC gate validates before export. Excel is a render of a linkage graph; the graph is persisted to SQLite and is the canonical artifact.

Quality standards (bulge-tier, non-negotiable)

Formatting

  • Blue = hardcoded input. Black = formula. Green = cross-sheet link. Red = warning.
  • No mixed formulas (no magic numbers embedded). Named ranges for every driver.
  • Costs NEGATIVE (sign convention enforced and checked).
  • EN primary labels, multi-language secondary (DE / ES / IT shipped; SV / NO / DA / NL on the v0.10 roadmap as design-partner asks).
  • Historical vs Projected column separator, obvious.
  • Check row at top of every sheet (BS balance, CFS tie, covenant headroom — TRUE or 0).

Sourcing

  • Every hardcoded cell has a comment with source ID (S-001, S-002, ...).
  • Sources sheet lists each source: doc, page, publisher, date, URL, verified-flag.
  • Assumptions (not sourced) tagged A-001 with rationale + confidence H/M/L.

Scenarios

  • WORST / BASE / BEST toggle on Assumptions. Drives every sheet via CHOOSE.
  • Every sheet respects the toggle — no orphan assumptions.

Audit

  • QC sheet with 8 automated checks, all must pass.
  • Revision log on Cover.
  • Named ranges mandatory.
  • Print areas set. Print-ready on every sheet.

Quick start

pip install "modelforge-finance[mcp,export]"

# Scaffold a ready-to-build spec — no repo checkout needed (works for any of the 19
# templates; run `modelforge list-templates` to see them all)
modelforge scaffold dcf -o demo_dcf.yaml

# Build it: live-formula workbook + linkage graph + manifest sidecar
modelforge build demo_dcf.yaml            # -> output/demo_dcf.xlsx

# Certify the delivered artifact: zero formula errors, byte-identical, manifest-valid
modelforge certify output/demo_dcf.xlsx

Trust Layer v1 (new in v0.9.7)

Why should a buyer trust the number in cell B42?

The Trust Layer is a semantic gate (separate from the structural QC gate). It answers the question every IC asks in the first five minutes: is this number plausible? It catches issues like a DCF EV that's 8× the company's real market cap before the model ever leaves QA.

25+ built-in rules cover all shipped templates:

  • DCF: WACC band (3-25%), terminal growth ≤ GDP + 1%, EV vs market-cap deviation, terminal-value share, sensitivity-table monotonicity
  • Three-statement: balance-sheet integrity, cash reconciliation, retained-earnings link
  • NPL: cumulative recovery ≤ 100%, vintage staircase monotone
  • Project finance: DSCR floor, wire degradation > 0, P90 < P50
  • Sponsor LBO: XIRR plausibility, multiple expansion vs entry
  • M&A / fairness / structured credit / unitranche / credit memo: per-template plausibility

Each violation produces a RedFlags worksheet inside the built workbook with severity (info / warn / fail), the rule that fired, expected-vs-actual, and the recommended remediation.

modelforge audit-all examples/   # every shipped example, 0 FAIL violations in current ship

See AUDIT_REPORT.md for the current ship's audit.

Data-room ingestion (v0.3.1)

Turn a directory of PDFs, XLSXs and CSVs into a validated ModelForge YAML spec using Claude Opus. Every extracted number traces back to a doc page via the auto-built Sources registry.

pip install -e .[ingest]                # installs anthropic, pdfplumber, pypdf
export ANTHROPIC_API_KEY=sk-ant-...      # required

modelforge ingest path/to/dataroom/ \
    --template project_finance \
    -o output/my_deal.yaml --verbose

# Review output/my_deal.yaml + output/my_deal.ingestion.md
# (INGESTION_REPORT.md lists every extracted field, S-id, confidence)

modelforge build output/my_deal.yaml     # produces the workbook
modelforge qc output/my_deal.xlsx        # 8/8 quality gate

Supported template: project_finance (MVP). Templates 1, 3, 5-8 queued for v0.3.2.

Package layout

modelforge/
├── graph/            # First-class linkage graph (nodes, edges, SQLite persistence)
├── spec/             # Pydantic schemas per template
│   ├── base.py       # Source, Assumption, Scenario, Target (shared types)
│   └── unitranche.py # Template 1: Unitranche LBO
├── builder/          # Deterministic openpyxl writer
│   ├── styles.py     # Bulge-tier formatting library
│   ├── formulas.py   # Formula string builders
│   ├── i18n.py       # EN/IT label dictionary
│   ├── workbook.py   # Top-level builder
│   └── sheets/       # One module per sheet (cover, sources, assumptions, ...)
├── qc/               # Quality gate (8 structural checks + PDF report)
├── data/             # Market data loaders (Damodaran, ECB, Borsa minibond)
└── cli.py            # build | certify | qc | scaffold | validate | screen | ingest | ...

Templates (19: 17 shipped + 2 preview)

  1. Unitranche LBO — Mid-market direct lending (Cash sweep + IFRS 9 EIR + covenant package)
  2. Minibond / Private Placement Bond — Direct private debt instrument (Gross YTM + Net YTM + jurisdiction-specific WHT)
  3. Credit Memo — Extends Unitranche with recovery waterfall + PD×LGD×EAD
  4. Project Finance — Construction + operating phases, DSCR-driven
  5. Real Estate — NOI build, exit cap, LP/GP promote waterfall
  6. NPL Portfolio — Collection curves, servicing fees, senior/mezz capital structure
  7. Structured Credit — Tranche waterfall with attachment/detachment points
  8. 3-Statement — P&L + BS + CFS with BS balance integrity check
  9. DCF — WACC build, fade, terminal normalization, 2D sensitivity (Trust Layer protected)
  10. Merger — Accretion/dilution, breakeven, contribution, collar, PPA
  11. Fairness Opinion — Selected comps, regression, premium analysis
  12. Sponsor LBO — Returns waterfall, debt schedule, 14-story block
  13. IPO — Float build, lock-up, stabilization, fee schedule
  14. Restructuring — Going-concern recovery, plan-feasibility, creditor classes
  15. Development (RE) — Ground-up development: phased capex, lease-up S-curve, forward-NOI exit, LTC debt, promote
  16. Bank / FIG — NII, RWA, CET1 & leverage ratios, MDA-gated dividends & buybacks (Basel III/IV)
  17. Loan-Tape Securitization — CLO/RMBS: stratified tape, pool cashflow (CPR/CDR/recovery), sequential-pay turbo waterfall (OC/IC + reserve), note WAL/IRR/rating
  18. 🔬 HGB Carveout (preview) — German HGB carve-out financials
  19. 🔬 Portfolio Review (preview) — Multi-asset portfolio performance review

Run modelforge list-templates to see them all (preview templates are flagged). Each shipped template has an anonymized example YAML in examples/.

Tax jurisdictions (7)

US  · Federal CIT + state + NOL + R&D credit + GILTI + BEAT + ASC 740
UK  · FRS 102 + main rate + marginal relief + RDEC + AIA + WDA + group relief
DE  · KSt + SolZ + GewSt (Hebesatz + § 8 add-backs + min-tax loss CF) — HGB roadmap v0.10
FR  · IS + small-profits + social surcharge + CVAE + CIR + 88% participation
ES  · IS + SME 23% + newly-created 15% + 95% participation + R&D + min-tax 15%
JP  · NCT + LCT + Enterprise Tax + Special Local Corp Tax + R&D credit
IT  · IRES / IRAP / SIIQ / PEX

Data providers (14, unified Provider Protocol)

Tier-0 (free, live today): EDGAR · OpenFIGI · GLEIF · Yahoo Finance · FRED Tier-1 (low-cost paid): Polygon ($29/mo) · FMP ($19/mo) · Finnhub · Tiingo · Alpha Vantage Tier-2 (institutional): Bloomberg · Refinitiv · FactSet · S&P Capital IQ

Tier-1 and Tier-2 are interface-complete — paid keys activate them via env vars. Local TTL cache prevents rate-limit blow-ups.

Security & SBOM

  • CycloneDX 1.5 SBOM auto-generated by CI on every push and attached to every GitHub release (scripts/generate_sbom.py)
  • CI gates: pytest across Python 3.11 + 3.12, ruff lint, SBOM structure validation (.github/workflows/ci.yml)
  • Audit log with append-only SQLite (modelforge/audit_log.py)
  • Trust Layer semantic gates auto-injected into every built workbook
  • Security policy: see SECURITY.md

Procurement-grade controls (SOC 2 Type II, ISO 27001, pen-test, multi-tenant SaaS with SSO/SCIM) are Phase-B work.

The pitch

Bulge-tier Excel models, every cell live-formulated, every number traceable back to the data room page it came from.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

modelforge_finance-0.12.0.tar.gz (795.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

modelforge_finance-0.12.0-py3-none-any.whl (892.1 kB view details)

Uploaded Python 3

File details

Details for the file modelforge_finance-0.12.0.tar.gz.

File metadata

  • Download URL: modelforge_finance-0.12.0.tar.gz
  • Upload date:
  • Size: 795.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for modelforge_finance-0.12.0.tar.gz
Algorithm Hash digest
SHA256 261e6d5ff8ac3118d266bb03d192fa1da2ef166fe34e71e256f09ee41be9a0f5
MD5 1942b58dd0812c101f952fac74d3b282
BLAKE2b-256 9ce9b043fb0b60d79e449f3465d9e36050e270c49aeb2e25c9d8c63d42f3bfd8

See more details on using hashes here.

File details

Details for the file modelforge_finance-0.12.0-py3-none-any.whl.

File metadata

File hashes

Hashes for modelforge_finance-0.12.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dd9c32e4e36cc87dc70cba25545566f0e6679938082f191b5f5bbc06a429d995
MD5 da65509700d12d93c390bc6cf39e9620
BLAKE2b-256 df4a1812220e60e76d96098bef19b29d12dbc6799e3ca476ce66ae58ce5d170a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page